A multivariate extension of a vector of two-parameter Poisson-Dirichlet processes Articles
Overview
published in
publication date
- January 2015
start page
- 89
end page
- 105
issue
- 1
volume
- 27
Digital Object Identifier (DOI)
International Standard Serial Number (ISSN)
- 1048-5252
Electronic International Standard Serial Number (EISSN)
- 1029-0311
abstract
- In the big data era there is a growing need to model the main features of large and non-trivial data sets. This paper proposes a Bayesian nonparametric prior for modelling situations where data are divided into different units with different densities, allowing information pooling across the groups. Leisen and Lijoi [(2011), 'Vectors of Poisson-Dirichlet processes', J. Multivariate Anal., 102, 482-495] introduced a bivariate vector of random probability measures with Poisson-Dirichlet marginals where the dependence is induced through a Levy's Copula. In this paper the same approach is used for generalising such a vector to the multivariate setting. A first important contribution is the derivation of the Laplace functional transform which is non-trivial in the multivariate setting. The Laplace transform is the basis to derive the exchangeable partition probability function (EPPF) and, as a second contribution, we provide an expression of the EPPF for the multivariate setting. Finally, a novel Markov Chain Monte Carlo algorithm for evaluating the EPPF is introduced and tested. In particular, numerical illustrations of the clustering behaviour of the new prior are provided.
Classification
keywords
- bayesian nonparametrics; pitman–yor process; lauricella function of fourth kind; multivariate lévy measure; partial exchangeability; exchangeable partition probability function