The Library
Model-based clustering of non-Gaussian panel data based on skew-t distributions
Tools
Juárez, Miguel A. and Steel, Mark F. J.. (2010) Model-based clustering of non-Gaussian panel data based on skew-t distributions. Journal of Business and Economic Statistics, Vol.28 (No.1). pp. 52-66. ISSN 0735-0015
|
PDF
WRAP_Steel_090611-clustfinal.pdf - Accepted Version - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader Download (1345Kb) |
Official URL: http://dx.doi.org/10.1198/jbes.2009.07145
Abstract
We propose a model-based method to cluster units within a panel. The underlying model is autoregressive and non-Gaussian, allowing for both skewness and fat tails, and the units are clustered according to their dynamic behavior. equilibrium level, and the effect of covariates. Inference is addressed from a Bayesian perspective, and model comparison is conducted using Bayes factors. Particular attention is paid to prior elicitation and posterior propriety. We suggest priors that require little subjective input and have hierarchical structures that enhance inference robustness. We apply our methodology to GDP growth of European regions and to employment growth of Spanish firms.
| Item Type: | Journal Article |
|---|---|
| Subjects: | H Social Sciences > HB Economic Theory Q Science > QA Mathematics |
| Divisions: | Faculty of Science > Statistics |
| Library of Congress Subject Headings (LCSH): | Cluster analysis, Gross domestic product -- Europe -- Econometric models, Labor market -- Spain -- Econometric models |
| Journal or Publication Title: | Journal of Business and Economic Statistics |
| Publisher: | Americal Statistical Association |
| ISSN: | 0735-0015 |
| Date: | January 2010 |
| Volume: | Vol.28 |
| Number: | No.1 |
| Number of Pages: | 15 |
| Page Range: | pp. 52-66 |
| Identification Number: | 10.1198/jbes.2009.07145 |
| Status: | Peer Reviewed |
| Publication Status: | Published |
| Access rights to Published version: | Restricted or Subscription Access |
| Funder: | Engineering and Physical Sciences Research Council (EPSRC) |
| Grant number: | GR/T17908/01 (EPSRC) |
| References: | Alonso-Borrego, C. and Arellano, M. (1999). Symmetrically normalised intrumental variable estimation using panel data. Journal of Business & Economic Statistics, 17, 36–49. Arellano, M. (2003). Panel Data Econometrics. Oxford: University Press. Azzalini, A. and Capitanio, A. (2003). Distributions generated by perturbations of symmetry with emphasis on a multivariate skew-t distribution. Journal of the Royal Statistical Society B, 65, 367–389. Baltagi, B. (2001). Econometric Analysis of Panel Data. Chichester: Wiley, 2nd ed. Banfield, J.D. and Raftery, A.E. (1993). Model-based Gaussian and non-Gaussian clustering. Biometrics, 49, 803–821. Bauwens, L. and Rombouts, J.V.K. (2007). Bayesian clustering of many GARCH models. Econometric Reviews, 26, 365–386. Bensmail, H., Celeux, G., Raftery, A.E. and Robert, C.P. (1997). Inference in model-based cluster analysis. Statistics and Computing, 7, 1–10. Berger, J.O. and Bernardo, J.M. (1992). Ordered group reference priors with application to the multinomial problem. Biometrika, 79, 25–37. Canova, F. (2004). Testing for convergence clubs in income per capita: A predictive density approac. International Economic Review, 45, 49–77. Casella, G., Mengersen, K.L., Robert, C.P. and Titterington, D.M. (2002). Perfect samplers for mixtures of distributions. Journal of the Royal Statistical Society B, 64, 777–790. Casella, G., Robert, C.P. and Wells, M.T. (2004). Mixture models, latent variables and partitioned important sampling. Statistical Methodology, 1, 1–18. Celeux, G., Hurn, M. and Robert, C.P. (2000). Computational and inferential difficulties with mixture posterior distributions. Journal of the American Statistical Association, 95, 957–970. Chib, S. (1995). Marginal likelihood from the Gibbs output. Journal of the American Statistical Association, 90, 1313–1321. Deschamps, P.J. (2006). A flexible prior distribution for Markov switching autoregressions with Student-t errors. Journal of Econometrics, 133, 153–190. DiCiccio, J., Kass, R.E., Raftery, A.E. and Wasserman, L. (1997). Computing Bayes factors by combining simulations and asymptotic approximations. Journal of the American Statistical Association, 92, 903–915. Diebolt, J. and Robert, C.P. (1994). Estimation of finite mixture distributions through Bayesian sampling. Journal of the Royal Statistical Society B, 56, 363–375. Diggle, P.J., Heagerty, P., Liand, K.Y. and Zeger, S.L. (2002). Analysis of longitudinal data. Oxford: University Press, 2nd ed. Durlauf, S.N. and Johnson, P.A. (1995). Multiple regimes and cross-country growth behaviour. Journal of Applied Econometrics, 10, 365–384. Durlauf, S.N. and Quah, D.T. (1999). The new empirics of economic growth. J.B. Taylor and M. Woodford, eds., Handbook of Macroeconomics, vol. 1. Amsterdam: Elsevier, vol. 1, pp. 235–308. Fernández, C. and Steel, M.F.J. (1998). On Bayesian modeling of fat tails and skewness. Journal of the American Statistical Association, 93, 359–371. Fonseca, T., Ferreira, M. and Migon, H. (2006). Objective Bayesian analysis for the Student-t regression model. Tech. Report 187, Statistics Department, Federal University of Rio de Janeiro. Forni, M., Hallin, M., Lippi, M. and Reichlin, L. (2005). The generalized dynamic factor model: one-sided estimation and forecasting. Journal of the American Statistical Association, 100, 830–840. Fraley, C. and Raftery, A.E. (2002). Model-based clustering, discriminant analysis, and density estimation. Journal of the American Statistical Association, 97, 611–631. Frühwirth-Schnatter, S. (2001). Markov chain Monte Carlo estimation of classical and dynamic switching and mixture models. Journal of the American Statistical Association, 96, 194–209. Frühwirth-Schnatter, S. (2004). Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques. Econometrics Journal, 7, 143–167. Frühwirth-Schnatter, S. (2006). Finite Mixtures and Markov Switching Models. New York: Springer. Frühwirth-Schnatter, S. and Kaufmann, S. (2006). How do changes in monetary policy affect bank lending? An analysis of Austrian bank data. Journal of Applied Econometrics, 21, 275–305. Frühwirth-Schnatter, S. and Kaufmann, S. (2008). Model-based clustering of multiple time series. Journal of Business and Economic Statistics, 26, 78–89. Gelman, A. (2006). Prior distributions for variance parameters in hierarchical models. Bayesian Analysis, 1, 1–19. Geweke, J. (1993). Bayesian treatment of the independent Student-t linear model. Journal of Applied Econometrics, 8, S19–S40. Geweke, J. (2007). Interpretation and inference in mixture models: Simple MCMC works. Computational Statistics & Data Analysis, 51, 3529–3550. Green, P.J. (1995). Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika, 82, 711–732. Hirano, K. (2002). Semiparametric Bayesian inference in autoregressive panel data models. Econometrica, 70, 781–799. Hoogstrate, A.J., Palm, F.C. and Pfann, G.A. (2000). Pooling in dynamic panel-data models: An application to forecasting GDP growth rates. Journal of Business & Economic Statistics, 18, 274–283. Hsiao, C. (2003). Analysis of Panel Data. Cambridge: University Press, 2nd ed. Ishwaran, H., James, L.F. and Sun, J. (2001). Bayesian model selection in finite mixtures by marginal density decompositions. Journal of the American Statistical Association, 96, 1316–1322. Jasra, A., Holmes, C.C. and Stephens, D.A. (2005). Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modelling. Statistical Science, 20, 50–67. Jones, M.C. and Faddy, M.J. (2003). A skew extension of the t-distribution, with applications. Journal of the Royal Statistical Society B, 65, 159–174. Juárez, M.A. and Steel, M.F.J. (2006). Non-Gaussian dynamic Bayesian modelling for panel data. Working Paper 06-05, CRiSM, University of Warwick. Lin, C.C. and Ng, S. (2007). Estimation of panel data models with parameter heterogeneity when group membership is unknown. Tech. report, Department of Economics, Columbia University. Liu, M.C. and Tiao, G.C. (1980). Random coefficient first-order autoregressive models. Journal of Econometrics, 13, 305–325. Marin, J.M., Mengersen, K. and Robert, C.P. (2005). Bayesian modelling and inference on mixtures of distributions. D. Dey and C.R. Rao, eds., Handbook of Statistics, vol. 25, Amsterdam: North-Holland. pp. 459–207. McLachlan, G.J. and Peel, D. (2000). Finite Mixture Models. New York: Wiley. Meng, X.L. andWong,W.H. (1996). Simulating ratios of normalizing constants via a simple identity: A theoretical exploration. Statistica Sinica, 6, 831–860. Nandram, B. and Petruccelli, J.D. (1997). A Bayesian analysis of autoregressive time series panel data. Journal of Business and Economic Statistics, 15, 328–334. Newton, M.A. and Raftery, A.E. (1994). Approximate Bayesian inference with the weighted likelihood bootstrap. Journal of the Royal Statistical Society B, 56, 3–48. Pesaran, M.H. (2007). A pair-wise approach to testing for output and growth convergence. Journal of Econometrics, 138, 312–355. Phillips, D.B. and Smith, A.F.M. (1996). Bayesian model comparison via jump diffusions. W.R. Gilks, S. Richardson and S.J. Spiegelhalter, eds., Markov chain Monte Carlo in practice. Boca Raton: Chapman & Hall, pp. 215–240. Quah, D.T. (1997). Empirics for growth distribution: stratification, polarization and convergence clubs. Journal of Economic Growth, 2, 27–59. Raftery, A.E. (1996). Hypothesis testing and model selection. W.R. Gilks, S. Richardson and S.J. Spiegelhalter, eds., Markov chain Monte Carlo in practice. Boca Raton: Chapman & Hall, pp. 163–188. Richardson, S. and Green, P.J. (1997). On Bayesian analysis of mixtures with an unknown number of components. Journal of the Royal Statistical Society B, 59, 731–792. (with discussion). Steele, R.J., Raftery, A.E. and Emond, M.J. (2006). Computing normalizing constants for finite mixture models via incremental mixture important sampling. Journal of Computational and Graphical Statistics, 15, 712–734. Stephens, M. (2000a). Bayesian analysis of mixture models with an unknown number of components– an alternative to reversible jump methods. The Annals of Statistics, 28, 40–74. Stephens, M. (2000b). Dealing with label switching in mixture models. Journal of the Royal Statistical Society B, 62, 795–809. Stock, J. and Watson, M. (2002). Forecasting using principal components from a large number of predictors. Journal of the American Statistical Association, 97, 1167–1179. Temple, J. (1999). The new growth evidence. Journal of Economic Literature, 37, 112–156. Titterington, D.M., Smith, A.F.M. and Makov, U.E. (1985). Statistical Analysis of Finite Mixture Distributions. Chichester: Wiley. Verdinelli, I. and Wasserman, L. (1995). Computing Bayes factors using a generalization of the Savage-Dickey density ratio. Journal of the American Statistical Association, 90, 614–618. Weiss, R.E. (2005). Modeling Longitudinal Data. New York: Springer. |
| URI: | http://wrap.warwick.ac.uk/id/eprint/16591 |
Data sourced from Thomson Reuters' Web of Knowledge
Actions (login required)
![]() |
View Item |
Tools
Tools

