A comprehensive data quality methodology for web and structured data