Abstract: The vast information landscape of the Internet constitute a significant challenge for extracting valuable content. The lack of standardized data models and structures necessitates ad hoc ...