Can websites reveal a firm’s innovativeness? Empirical evidence on Italian manufacturing SMEs

Crosato, L; Bottai, C; Guerzoni, M; Liberati, C
2023

Abstract

Innovation research usually builds on conventional data such as balance sheets, surveys, patents, or product catalogs. This paper explores unconventional data, specifically web-scraped data, as an information source for innovation studies, and proposes a careful procedure to verify the linkage between web-based data and firm-level information retrieved from conventional sources. The study examines a sample of Italian manufacturing small and medium enterprises (SMEs) active in 2016, comprising both innovative and non-innovative firms. It is based on HTML tags, whereas most of the previous literature worked on the text of web pages and its semantics. Our paper provides evidence that the way HTML is used to build a corporate website reveals the capabilities of the owner firm, helping to distinguish innovative from non-innovative SMEs.
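
As a rough illustration of what "based on HTML tags" can mean in practice, the sketch below counts how often each tag appears in a page, using only Python's standard library. It is a minimal example under stated assumptions, not the procedure used in the paper: the `TagCounter` class, the `tag_frequencies` function, and the sample page are all hypothetical names and data introduced here for illustration.

```python
from collections import Counter
from html.parser import HTMLParser


class TagCounter(HTMLParser):
    """Counts opening tags seen while parsing an HTML document."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        # html.parser passes tag names already converted to lower case.
        self.counts[tag] += 1


def tag_frequencies(html):
    """Return a Counter mapping each tag name to its number of occurrences."""
    parser = TagCounter()
    parser.feed(html)
    parser.close()
    return parser.counts


# Hypothetical page, invented for illustration only.
sample_page = """
<html><head><title>Acme S.r.l.</title>
<script src="app.js"></script></head>
<body><div><p>Products</p><p>Contacts</p></div></body></html>
"""

features = tag_frequencies(sample_page)
```

Tag counts of this kind could then be arranged as one row per firm and passed to a classifier. Which tags are informative, and how they are weighted, is an empirical question that this sketch does not address.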
Type: paper
Keywords: innovation, SMEs, unconventional data, HTML code, webscraping
Language: English
Conference: 5th International Conference on Advanced Research Methods and Analytics (CARMA2023)
Conference year: 2023
Editors: Martinez-Torres, M; Toral, SL
Book title: CARMA 2023 Proceedings of 5th International Conference on Advanced Research Methods and Analytics
ISBN: 9788413960869
Publication year: 2023
Pages: 19-26
URL: http://ocs.editorial.upv.es/index.php/CARMA/CARMA2023/paper/view/16466
Access: open
Crosato, L., Bottai, C., Domenech, J., Guerzoni, M., Liberati, C. (2023). Can websites reveal a firm’s innovativeness? Empirical evidence on Italian manufacturing SMEs. In CARMA 2023 Proceedings of 5th International Conference on Advanced Research Methods and Analytics (pp. 19-26). Sevilla: Editorial Universitat Politècnica de València [10.4995/CARMA2023.2023.16466].
Files in this item:
File: Crosato-2023-CARMA-VoR.pdf
Access: open access
Description: Conference contribution
Attachment type: Publisher’s Version (Version of Record, VoR)
License: Creative Commons
Size: 383.17 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/440238