Skip to content Skip to navigation
University of Warwick
  • Study
  • |
  • Research
  • |
  • Business
  • |
  • Alumni
  • |
  • News
  • |
  • About

University of Warwick
Publications service & WRAP

Highlight your research

  • WRAP
    • Home
    • Search WRAP
    • Browse by Warwick Author
    • Browse WRAP by Year
    • Browse WRAP by Subject
    • Browse WRAP by Department
    • Browse WRAP by Funder
    • Browse Theses by Department
  • Publications Service
    • Home
    • Search Publications Service
    • Browse by Warwick Author
    • Browse Publications service by Year
    • Browse Publications service by Subject
    • Browse Publications service by Department
    • Browse Publications service by Funder
  • Statistics
  • Help & Advice
University of Warwick

The Library

  • Login

Model-based clustering of non-Gaussian panel data based on skew-t distributions

Tools
- Tools
+ Tools

Juárez, Miguel A. and Steel, Mark F. J.. (2010) Model-based clustering of non-Gaussian panel data based on skew-t distributions. Journal of Business and Economic Statistics, Vol.28 (No.1). pp. 52-66. ISSN 0735-0015

[img]
Preview
PDF
WRAP_Steel_090611-clustfinal.pdf - Accepted Version - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Download (1345Kb)
Official URL: http://dx.doi.org/10.1198/jbes.2009.07145

Abstract

We propose a model-based method to cluster units within a panel. The underlying model is autoregressive and non-Gaussian, allowing for both skewness and fat tails, and the units are clustered according to their dynamic behavior. equilibrium level, and the effect of covariates. Inference is addressed from a Bayesian perspective, and model comparison is conducted using Bayes factors. Particular attention is paid to prior elicitation and posterior propriety. We suggest priors that require little subjective input and have hierarchical structures that enhance inference robustness. We apply our methodology to GDP growth of European regions and to employment growth of Spanish firms.

Item Type: Journal Article
Subjects: H Social Sciences > HB Economic Theory
Q Science > QA Mathematics
Divisions: Faculty of Science > Statistics
Library of Congress Subject Headings (LCSH): Cluster analysis, Gross domestic product -- Europe -- Econometric models, Labor market -- Spain -- Econometric models
Journal or Publication Title: Journal of Business and Economic Statistics
Publisher: Americal Statistical Association
ISSN: 0735-0015
Date: January 2010
Volume: Vol.28
Number: No.1
Number of Pages: 15
Page Range: pp. 52-66
Identification Number: 10.1198/jbes.2009.07145
Status: Peer Reviewed
Publication Status: Published
Access rights to Published version: Restricted or Subscription Access
Funder: Engineering and Physical Sciences Research Council (EPSRC)
Grant number: GR/T17908/01 (EPSRC)
References: Alonso-Borrego, C. and Arellano, M. (1999). Symmetrically normalised intrumental variable estimation using panel data. Journal of Business & Economic Statistics, 17, 36–49. Arellano, M. (2003). Panel Data Econometrics. Oxford: University Press. Azzalini, A. and Capitanio, A. (2003). Distributions generated by perturbations of symmetry with emphasis on a multivariate skew-t distribution. Journal of the Royal Statistical Society B, 65, 367–389. Baltagi, B. (2001). Econometric Analysis of Panel Data. Chichester: Wiley, 2nd ed. Banfield, J.D. and Raftery, A.E. (1993). Model-based Gaussian and non-Gaussian clustering. Biometrics, 49, 803–821. Bauwens, L. and Rombouts, J.V.K. (2007). Bayesian clustering of many GARCH models. Econometric Reviews, 26, 365–386. Bensmail, H., Celeux, G., Raftery, A.E. and Robert, C.P. (1997). Inference in model-based cluster analysis. Statistics and Computing, 7, 1–10. Berger, J.O. and Bernardo, J.M. (1992). Ordered group reference priors with application to the multinomial problem. Biometrika, 79, 25–37. Canova, F. (2004). Testing for convergence clubs in income per capita: A predictive density approac. International Economic Review, 45, 49–77. Casella, G., Mengersen, K.L., Robert, C.P. and Titterington, D.M. (2002). Perfect samplers for mixtures of distributions. Journal of the Royal Statistical Society B, 64, 777–790. Casella, G., Robert, C.P. and Wells, M.T. (2004). Mixture models, latent variables and partitioned important sampling. Statistical Methodology, 1, 1–18. Celeux, G., Hurn, M. and Robert, C.P. (2000). Computational and inferential difficulties with mixture posterior distributions. Journal of the American Statistical Association, 95, 957–970. Chib, S. (1995). Marginal likelihood from the Gibbs output. Journal of the American Statistical Association, 90, 1313–1321. Deschamps, P.J. (2006). A flexible prior distribution for Markov switching autoregressions with Student-t errors. Journal of Econometrics, 133, 153–190. DiCiccio, J., Kass, R.E., Raftery, A.E. and Wasserman, L. (1997). Computing Bayes factors by combining simulations and asymptotic approximations. Journal of the American Statistical Association, 92, 903–915. Diebolt, J. and Robert, C.P. (1994). Estimation of finite mixture distributions through Bayesian sampling. Journal of the Royal Statistical Society B, 56, 363–375. Diggle, P.J., Heagerty, P., Liand, K.Y. and Zeger, S.L. (2002). Analysis of longitudinal data. Oxford: University Press, 2nd ed. Durlauf, S.N. and Johnson, P.A. (1995). Multiple regimes and cross-country growth behaviour. Journal of Applied Econometrics, 10, 365–384. Durlauf, S.N. and Quah, D.T. (1999). The new empirics of economic growth. J.B. Taylor and M. Woodford, eds., Handbook of Macroeconomics, vol. 1. Amsterdam: Elsevier, vol. 1, pp. 235–308. Fernández, C. and Steel, M.F.J. (1998). On Bayesian modeling of fat tails and skewness. Journal of the American Statistical Association, 93, 359–371. Fonseca, T., Ferreira, M. and Migon, H. (2006). Objective Bayesian analysis for the Student-t regression model. Tech. Report 187, Statistics Department, Federal University of Rio de Janeiro. Forni, M., Hallin, M., Lippi, M. and Reichlin, L. (2005). The generalized dynamic factor model: one-sided estimation and forecasting. Journal of the American Statistical Association, 100, 830–840. Fraley, C. and Raftery, A.E. (2002). Model-based clustering, discriminant analysis, and density estimation. Journal of the American Statistical Association, 97, 611–631. Frühwirth-Schnatter, S. (2001). Markov chain Monte Carlo estimation of classical and dynamic switching and mixture models. Journal of the American Statistical Association, 96, 194–209. Frühwirth-Schnatter, S. (2004). Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques. Econometrics Journal, 7, 143–167. Frühwirth-Schnatter, S. (2006). Finite Mixtures and Markov Switching Models. New York: Springer. Frühwirth-Schnatter, S. and Kaufmann, S. (2006). How do changes in monetary policy affect bank lending? An analysis of Austrian bank data. Journal of Applied Econometrics, 21, 275–305. Frühwirth-Schnatter, S. and Kaufmann, S. (2008). Model-based clustering of multiple time series. Journal of Business and Economic Statistics, 26, 78–89. Gelman, A. (2006). Prior distributions for variance parameters in hierarchical models. Bayesian Analysis, 1, 1–19. Geweke, J. (1993). Bayesian treatment of the independent Student-t linear model. Journal of Applied Econometrics, 8, S19–S40. Geweke, J. (2007). Interpretation and inference in mixture models: Simple MCMC works. Computational Statistics & Data Analysis, 51, 3529–3550. Green, P.J. (1995). Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika, 82, 711–732. Hirano, K. (2002). Semiparametric Bayesian inference in autoregressive panel data models. Econometrica, 70, 781–799. Hoogstrate, A.J., Palm, F.C. and Pfann, G.A. (2000). Pooling in dynamic panel-data models: An application to forecasting GDP growth rates. Journal of Business & Economic Statistics, 18, 274–283. Hsiao, C. (2003). Analysis of Panel Data. Cambridge: University Press, 2nd ed. Ishwaran, H., James, L.F. and Sun, J. (2001). Bayesian model selection in finite mixtures by marginal density decompositions. Journal of the American Statistical Association, 96, 1316–1322. Jasra, A., Holmes, C.C. and Stephens, D.A. (2005). Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modelling. Statistical Science, 20, 50–67. Jones, M.C. and Faddy, M.J. (2003). A skew extension of the t-distribution, with applications. Journal of the Royal Statistical Society B, 65, 159–174. Juárez, M.A. and Steel, M.F.J. (2006). Non-Gaussian dynamic Bayesian modelling for panel data. Working Paper 06-05, CRiSM, University of Warwick. Lin, C.C. and Ng, S. (2007). Estimation of panel data models with parameter heterogeneity when group membership is unknown. Tech. report, Department of Economics, Columbia University. Liu, M.C. and Tiao, G.C. (1980). Random coefficient first-order autoregressive models. Journal of Econometrics, 13, 305–325. Marin, J.M., Mengersen, K. and Robert, C.P. (2005). Bayesian modelling and inference on mixtures of distributions. D. Dey and C.R. Rao, eds., Handbook of Statistics, vol. 25, Amsterdam: North-Holland. pp. 459–207. McLachlan, G.J. and Peel, D. (2000). Finite Mixture Models. New York: Wiley. Meng, X.L. andWong,W.H. (1996). Simulating ratios of normalizing constants via a simple identity: A theoretical exploration. Statistica Sinica, 6, 831–860. Nandram, B. and Petruccelli, J.D. (1997). A Bayesian analysis of autoregressive time series panel data. Journal of Business and Economic Statistics, 15, 328–334. Newton, M.A. and Raftery, A.E. (1994). Approximate Bayesian inference with the weighted likelihood bootstrap. Journal of the Royal Statistical Society B, 56, 3–48. Pesaran, M.H. (2007). A pair-wise approach to testing for output and growth convergence. Journal of Econometrics, 138, 312–355. Phillips, D.B. and Smith, A.F.M. (1996). Bayesian model comparison via jump diffusions. W.R. Gilks, S. Richardson and S.J. Spiegelhalter, eds., Markov chain Monte Carlo in practice. Boca Raton: Chapman & Hall, pp. 215–240. Quah, D.T. (1997). Empirics for growth distribution: stratification, polarization and convergence clubs. Journal of Economic Growth, 2, 27–59. Raftery, A.E. (1996). Hypothesis testing and model selection. W.R. Gilks, S. Richardson and S.J. Spiegelhalter, eds., Markov chain Monte Carlo in practice. Boca Raton: Chapman & Hall, pp. 163–188. Richardson, S. and Green, P.J. (1997). On Bayesian analysis of mixtures with an unknown number of components. Journal of the Royal Statistical Society B, 59, 731–792. (with discussion). Steele, R.J., Raftery, A.E. and Emond, M.J. (2006). Computing normalizing constants for finite mixture models via incremental mixture important sampling. Journal of Computational and Graphical Statistics, 15, 712–734. Stephens, M. (2000a). Bayesian analysis of mixture models with an unknown number of components– an alternative to reversible jump methods. The Annals of Statistics, 28, 40–74. Stephens, M. (2000b). Dealing with label switching in mixture models. Journal of the Royal Statistical Society B, 62, 795–809. Stock, J. and Watson, M. (2002). Forecasting using principal components from a large number of predictors. Journal of the American Statistical Association, 97, 1167–1179. Temple, J. (1999). The new growth evidence. Journal of Economic Literature, 37, 112–156. Titterington, D.M., Smith, A.F.M. and Makov, U.E. (1985). Statistical Analysis of Finite Mixture Distributions. Chichester: Wiley. Verdinelli, I. and Wasserman, L. (1995). Computing Bayes factors using a generalization of the Savage-Dickey density ratio. Journal of the American Statistical Association, 90, 614–618. Weiss, R.E. (2005). Modeling Longitudinal Data. New York: Springer.
URI: http://wrap.warwick.ac.uk/id/eprint/16591

Data sourced from Thomson Reuters' Web of Knowledge

Request changes to a record

Actions (login required)

View Item View Item

Document Downloads

More statistics for this item...
twitter

Email us: publications@warwick.ac.uk
Contact Details
About Us