Separating populations with wide data: a spectral analysis
Blum, Avrim, Coja-Oghlan, Amin, Frieze, Alan and Zhou, Shuheng (2009) Separating populations with wide data: a spectral analysis. Electronic Journal of Statistics, Vol. 3, pp. 76-113. doi:10.1214/08-EJS289 ISSN 1935-7524.
Official URL: http://dx.doi.org/10.1214/08-EJS289
Abstract
In this paper, we consider the problem of partitioning a small data sample drawn from a mixture of k product distributions. We are interested in the case that individual features are of low average quality γ, and we want to use as few of them as possible to correctly partition the sample. We analyze a spectral technique that is able to approximately optimize the total data size—the product of number of data points n and the number of features K—needed to correctly perform this partitioning as a function of 1/γ for K>n. Our goal is motivated by an application in clustering individuals according to their population of origin using markers, when the divergence between any two of the populations is small.
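The general idea of spectrally partitioning a sample with many weak features can be illustrated with a toy sketch. This is not the algorithm analyzed in the paper, whose details the abstract does not give. It is a generic example under stated assumptions: two equally sized populations, binary features whose frequencies differ by a small amount `delta` between populations, and a split by the sign of the leading eigenvector of the centered Gram matrix. All function names and parameters (`sample_mixture`, `spectral_split`, `agreement`, `delta`) are hypothetical.

```python
import random


def sample_mixture(n_per_pop, num_features, delta, rng):
    """Draw binary rows from two product distributions (assumed toy model).

    Feature j has frequency 0.5 + delta*s_j in population 0 and
    0.5 - delta*s_j in population 1, where s_j is a random sign.
    """
    signs = [rng.choice((-1, 1)) for _ in range(num_features)]
    rows, labels = [], []
    for pop, direction in ((0, 1), (1, -1)):
        for _ in range(n_per_pop):
            rows.append([1.0 if rng.random() < 0.5 + direction * delta * s else 0.0
                         for s in signs])
            labels.append(pop)
    return rows, labels


def spectral_split(rows, rng, iterations=100):
    """Split samples by the sign of the top eigenvector of the centered Gram matrix."""
    n, k = len(rows), len(rows[0])
    # Center each feature so the leading direction reflects population structure.
    col_means = [sum(r[j] for r in rows) / n for j in range(k)]
    centered = [[r[j] - col_means[j] for j in range(k)] for r in rows]
    # n x n Gram matrix; cheap when there are far fewer samples than features.
    gram = [[0.0] * n for _ in range(n)]
    for a in range(n):
        for b in range(a, n):
            dot = sum(x * y for x, y in zip(centered[a], centered[b]))
            gram[a][b] = gram[b][a] = dot
    # Power iteration from a random start (the all-ones vector lies in the
    # null space of the centered Gram matrix, so it must be avoided).
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]
    for _ in range(iterations):
        w = [sum(g * x for g, x in zip(row, v)) for row in gram]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return [0 if x >= 0 else 1 for x in v]


def agreement(pred, truth):
    """Fraction of matching labels, up to swapping the two cluster names."""
    same = sum(p == t for p, t in zip(pred, truth)) / len(truth)
    return max(same, 1.0 - same)


if __name__ == "__main__":
    rng = random.Random(0)
    rows, labels = sample_mixture(n_per_pop=20, num_features=1000, delta=0.1, rng=rng)
    print(agreement(spectral_split(rows, rng), labels))
```

Working with the n x n Gram matrix rather than the K x K covariance keeps the computation small in the regime K>n that the abstract considers. In this toy model, shrinking `delta` (lower feature quality) requires more features or more samples before the split becomes reliable.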
Item Type: | Journal Article
---|---
Subjects: | Q Science > QA Mathematics
Divisions: | Faculty of Science, Engineering and Medicine > Science > Mathematics
Journal or Publication Title: | Electronic Journal of Statistics
Publisher: | Institute of Mathematical Statistics
ISSN: | 1935-7524
Official Date: | 2009
Volume: | Vol.3
Page Range: | pp. 76-113
DOI: | 10.1214/08-EJS289
Status: | Peer Reviewed
Publication Status: | Published
Access rights to Published version: | Restricted or Subscription Access
Date of first compliant deposit: | 1 August 2016