Skills in demand for ICT and statistical occupations: Evidence from web-based job vacancies

Lovaglio, PG; Cesarini, M; Mercorio, F; Mezzanzanica, M
2018

Abstract

Online job portals collecting web vacancies have become important media for job demand and supply matching. They also represent a growing research area for the application of analytical methods to study the labour market using innovative data sources. This paper analyses Italian web job vacancies scraped from several types of Italian web job portals between June and September 2015. After describing how the occupations associated with each web vacancy (classification up to level 4) were identified and the related skills retrieved in texts using mixed supervised and unsupervised text mining approaches, we focused on job vacancies related to ICT and statistical positions. The principal aim of this paper is to describe these jobs in terms of the required skills that have emerged in the labour market from a demand perspective and to identify those skills that best distinguish statisticians from other ICT occupations. Hence, several machine learning techniques were used to assess those skills that best distinguish occupation codes from other job groups. After quality control and removal of duplications, the scraping collected more than 110,000 job advertisements: nearly 6,200 were classified as ICT or statistical positions (largely dominated by software developers). The data indicate that high-level statisticians have superior and heterogeneous professional backgrounds, linked to theoretical statistics, where analytic skills are more relevant than computing skills. Many soft and management-oriented skills were also called for, which are missing among lower level statisticians, who are restricted to more technical jobs oriented towards general computing and informatics.
Journal article - Scientific article
Keywords: labour market data; machine learning; text mining; Web data
Language: English
Year: 2018
Volume: 11
Issue: 2
Pages: 78-91
Access: partially open
Lovaglio, P., Cesarini, M., Mercorio, F., Mezzanzanica, M. (2018). Skills in demand for ICT and statistical occupations: Evidence from web-based job vacancies. STATISTICAL ANALYSIS AND DATA MINING, 11(2), 78-91 [10.1002/sam.11372].
Files in this record:

SAM11372.pdf
Access: archive managers only
Attachment type: Publisher’s Version (Version of Record, VoR)
Size: 639.25 kB
Format: Adobe PDF

SADM.pdf
Access: open access
Attachment type: Submitted Version (Pre-print)
Size: 575.38 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/189703
Citations
  • Scopus 46
  • ISI (Web of Science) 30