In this paper we introduce %CEM, a macro package allowing researchers to automatically perform coarsened exact matching (CEM) in SAS environment. CEM is a non-parametric matching method widely used by researchers to avoid the confounding influence of pre-treatment control variables to improve causal inference in quasi-experimental studies. %CEM introduces a completely automated process which allows SAS users to efficiently perform CEM in fields in which large data sets are common and where SAS is the most popular statistical tool. In addition, such a macro may be used to test several coarsening combinations of numeric variables. This option also provides a visual representation of the matching frontier, thus enabling researchers to select the optimal setting which takes into account both the L1 imbalance and the percentage of matched units. The paper concludes with an empirical application comparing computational performance and results obtained using alternative available software (SAS, R and STATA) using multiple administrative data sets from a large regional database.
Berta, P., Bossi, M., Verzillo, S. (2017). %CEM: a SAS macro to perform coarsened exact matching. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 87(2), 227-238 [10.1080/00949655.2016.1203433].
%CEM: a SAS macro to perform coarsened exact matching
BERTA, PAOLOPrimo
;
2017
Abstract
In this paper we introduce %CEM, a macro package allowing researchers to automatically perform coarsened exact matching (CEM) in SAS environment. CEM is a non-parametric matching method widely used by researchers to avoid the confounding influence of pre-treatment control variables to improve causal inference in quasi-experimental studies. %CEM introduces a completely automated process which allows SAS users to efficiently perform CEM in fields in which large data sets are common and where SAS is the most popular statistical tool. In addition, such a macro may be used to test several coarsening combinations of numeric variables. This option also provides a visual representation of the matching frontier, thus enabling researchers to select the optimal setting which takes into account both the L1 imbalance and the percentage of matched units. The paper concludes with an empirical application comparing computational performance and results obtained using alternative available software (SAS, R and STATA) using multiple administrative data sets from a large regional database.File | Dimensione | Formato | |
---|---|---|---|
170498.pdf
Solo gestori archivio
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Dimensione
1.99 MB
Formato
Adobe PDF
|
1.99 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.