The paper explores the possibility to employ the source code of corporate websites as an information source for research in innovation studies. Research in this area is generally based on studies that collect data on patents or official data sources. Our paper links the standard economic information of the firm with web-based data and joins the ongoing debate with a threefold contribution. First, whereas the majority of the literature focused on the linguistic content of web-pages, we mostly use HTML tags. Second, we propose a method to assess the quality of the linkage of Web data to firm-level information. Third, we show that the data retrieved from corporate websites can aid to identify 'innovative SMEs'.

Bottai, C., Crosato, L., Domenech, J., Guerzoni, M., Liberati, C. (2022). Unconventional data for policy: Using Big Data for detecting Italian innovative SMEs. In 2nd ACM Conference on Information Technology for Social Good, GoodIT 2022 (pp.338-344). Association for Computing Machinery [10.1145/3524458.3547246].

Unconventional data for policy: Using Big Data for detecting Italian innovative SMEs

Bottai, Carlo;Guerzoni, Marco;Liberati, Caterina
2022

Abstract

The paper explores the possibility to employ the source code of corporate websites as an information source for research in innovation studies. Research in this area is generally based on studies that collect data on patents or official data sources. Our paper links the standard economic information of the firm with web-based data and joins the ongoing debate with a threefold contribution. First, whereas the majority of the literature focused on the linguistic content of web-pages, we mostly use HTML tags. Second, we propose a method to assess the quality of the linkage of Web data to firm-level information. Third, we show that the data retrieved from corporate websites can aid to identify 'innovative SMEs'.
paper
HTML tags; Innovation; SMEs; Website data;
English
2nd ACM Conference on Information Technology for Social Good, GoodIT 2022 - 7 September 2022 through 9 September 2022
2022
2nd ACM Conference on Information Technology for Social Good, GoodIT 2022
9781450392846
2022
338
344
https://dl.acm.org/doi/proceedings/10.1145/3524458
reserved
Bottai, C., Crosato, L., Domenech, J., Guerzoni, M., Liberati, C. (2022). Unconventional data for policy: Using Big Data for detecting Italian innovative SMEs. In 2nd ACM Conference on Information Technology for Social Good, GoodIT 2022 (pp.338-344). Association for Computing Machinery [10.1145/3524458.3547246].
File in questo prodotto:
File Dimensione Formato  
Bottai-2022-GoodIT '22-AAM.pdf

Solo gestori archivio

Descrizione: Research Article
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 655.58 kB
Formato Adobe PDF
655.58 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/391635
Citazioni
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
Social impact