We use data derived from the World Bank and UNESCO Institute for Statis- tics, which are referred to 217 countries followed for 18 years, from 2000 to 2017, to study the persistence of economic conditions at country level over time. We consider several variables, such as GDP per capita, educational levels, life expectancy at birth, infant mortality rate, school enrollment, government expenditure in education, and the value of the Gini index that measures the inequality level of the distribution of income through household survey data (Schultz, 1998). These variables are related to the human development index proposed by the United Nations Development Programme (1990) for measuring the well-being at country level. The proposed approach allow us to charac- terize disparities among countries in a dynamic fashion so as to evaluate the changes in inequality over time by estimating a multivariate hidden Markov (HM) model (Bartolucci, Farcomeni, and Pennoni, 2013; Zucchini, MacDonald, and Langrock, 2016), also known as latent Markov model. In particular we pro- vide a dynamic clustering of the countries on the basis of the observed economic variables and, in turn, this permits to analyze inequalities in the development level. The resulting clustering is obtained by relying on a model tailored for longitudinal data rather than by predetermined threshold values as currently adopted by many international organizations to classify countries. The HM model assumes the existence of an unobservable process, which follows a Markov chain with a discrete number of latent states, affecting the distribution of the observed variables. In particular, we consider multivariate continuous re- sponses that, for the same time occasion, are assumed to be correlated accord- ing to a specific variance-covariance matrix, even conditionally on the latent states. In such a context, maximum likelihood estimation of model parameters is straightforward by means of the Expectation-Maximization (EM) algorithm (Dempster, Laird, and Rubin, 1977). Missing observations represent a relevant problem in this analysis, where values for some countries may be not available. Therefore, in this work, we also propose an approach for inference with missing data based on modifying the standard steps of the EM algorithm in a suitable way. In particular, we assume a structure of missing at random (MAR) re- sponses (Little and Rubin, 2002) where the missing patterns are independent of the missing responses given all the observed data. The resulting EM algorithm provides exact maximum likelihood estimates of the model parameters. Since the resulting latent states may be ordered according to the average values of the economic variables, inequalities among countries are detected for each year on the basis of the estimated posterior probabilities of the countries of be- longing to each latent state. In this way, we also explore patterns of disparities among countries and track their evolution from a dynamic perspective.

Pandolfi, S., Bartolucci, F., Pennoni, F., Serafini, A. (2020). Diversity in socio-economic growth at country level: a multivariate hidden Markov model. In Book of Abstracts: Distributions and Inequality Measures in Economics (pp.74-75).

Diversity in socio-economic growth at country level: a multivariate hidden Markov model

Pennoni, F;
2020

Abstract

We use data derived from the World Bank and UNESCO Institute for Statis- tics, which are referred to 217 countries followed for 18 years, from 2000 to 2017, to study the persistence of economic conditions at country level over time. We consider several variables, such as GDP per capita, educational levels, life expectancy at birth, infant mortality rate, school enrollment, government expenditure in education, and the value of the Gini index that measures the inequality level of the distribution of income through household survey data (Schultz, 1998). These variables are related to the human development index proposed by the United Nations Development Programme (1990) for measuring the well-being at country level. The proposed approach allow us to charac- terize disparities among countries in a dynamic fashion so as to evaluate the changes in inequality over time by estimating a multivariate hidden Markov (HM) model (Bartolucci, Farcomeni, and Pennoni, 2013; Zucchini, MacDonald, and Langrock, 2016), also known as latent Markov model. In particular we pro- vide a dynamic clustering of the countries on the basis of the observed economic variables and, in turn, this permits to analyze inequalities in the development level. The resulting clustering is obtained by relying on a model tailored for longitudinal data rather than by predetermined threshold values as currently adopted by many international organizations to classify countries. The HM model assumes the existence of an unobservable process, which follows a Markov chain with a discrete number of latent states, affecting the distribution of the observed variables. In particular, we consider multivariate continuous re- sponses that, for the same time occasion, are assumed to be correlated accord- ing to a specific variance-covariance matrix, even conditionally on the latent states. In such a context, maximum likelihood estimation of model parameters is straightforward by means of the Expectation-Maximization (EM) algorithm (Dempster, Laird, and Rubin, 1977). Missing observations represent a relevant problem in this analysis, where values for some countries may be not available. Therefore, in this work, we also propose an approach for inference with missing data based on modifying the standard steps of the EM algorithm in a suitable way. In particular, we assume a structure of missing at random (MAR) re- sponses (Little and Rubin, 2002) where the missing patterns are independent of the missing responses given all the observed data. The resulting EM algorithm provides exact maximum likelihood estimates of the model parameters. Since the resulting latent states may be ordered according to the average values of the economic variables, inequalities among countries are detected for each year on the basis of the estimated posterior probabilities of the countries of be- longing to each latent state. In this way, we also explore patterns of disparities among countries and track their evolution from a dynamic perspective.
abstract + slide
continuous outcomes, countries inequalities, dynamic clustering, missing data
English
Distributions and Inequality Measures in Economics
2020
Book of Abstracts: Distributions and Inequality Measures in Economics
2020
74
75
https://dime2020.dismeq.unimib.it/
open
Pandolfi, S., Bartolucci, F., Pennoni, F., Serafini, A. (2020). Diversity in socio-economic growth at country level: a multivariate hidden Markov model. In Book of Abstracts: Distributions and Inequality Measures in Economics (pp.74-75).
File in questo prodotto:
File Dimensione Formato  
Abs_Pennoni_Inequalty_Bicocca_2020.pdf

accesso aperto

Descrizione: Abstract
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 415.62 kB
Formato Adobe PDF
415.62 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/263858
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact