CLUSTERING CRITERIA FOR DISCRETE DATA AND LATENT CLASS MODELS |
|||||||
文献名称 | CLUSTERING CRITERIA FOR DISCRETE DATA AND LATENT CLASS MODELS | ||||||
文献作者 | Gilles CELEUX;Gerard GOVAERT | ||||||
作者所在单位 | INRIA Domaine de Voluceau Rocquencourt B.P. 105 78153 Le Chesnay Cedex;Universite de Metz et INRIA-Lorraine Ile du Saulcy 57045 Metz Cedex 1 | ||||||
文献分类 | 已发表文献 | ||||||
学科一级分类 | 统计 | ||||||
学科二级分类 | 统计学 | ||||||
文献摘要 |
We show that some well known clustering criteria for discrete data, the information criterion and the x2 criterion, are closely related with the classification maximum likelihood criterion for the latent class model. Emphasis is placed on binary clustering criteria which are analyzed under the maximum likelihood approach for different multivariate Bernoulli mixtures. This alternative form of criteria reveals non-apparent aspects of clustering techniques. All the discussed criteria can be optimized with the alternating optimization algorithm. |
||||||
参考文献 |
Aitchison, J. and Aitken, C.G.G. (1976), Multivariate binary discrimination by the kernel method. Biometrika 63, 413-420. Benzecri, J.P. (1973), Thdorie de 1'information et classification d'apres un tableau de contingence. L'Analyse des donnees, tome 1, Dunod. Bezdek, J.C., Hathaway, R.J., Howard, R.E., Wilson, C.A. and Windham, M.P. (1987), Local convergence analysis of a grouped variable version of coordinate descent. JOTA 54 n03, 471-477. Bozdogan, H. (1987), Selecting loglinear models and subset selection of variables in multiway contingency tables using Akaike's information criterion. Classification and related methods of Data Analysis, North Holland, 609-616. Bryant, P. (1988), On characterizing optimisation-based clustering methods. Journal of Classification 5, 81-84. Bryant, P. and Williamson, J.A. (1978), Asymptotic behaviour of classification maximum likelihood estimates. Biometrika 65, 273-281.. Celeux, G. (1988), Classification et modeles. R.S.A. 36 n°4, 43-58. Diday, E. and Simon, J.C. (1976), Clustering analysis. Digital Pattern Recognition. Springer-Verlag, 47-94. Everitt, B. (1984), An introduction to latent variable models. Chapman and Hall. Goodman, L.A. (1974), Exploratory latent structure models using both identifiable and unidentifiable models. Biometrika 61, 215-231. Govaert, G. (1983), Classification croisee. Thesis Universite Paris 6. Govaert, G. (1989), Classification binaire et modeles, R.S.A. 37 (to appear). Marriott, F.M.C. (1982), Separating mixtures of normal distributions. Biometrics 31, 767-769. Scott, A.J. and Symons, M.J. (1971), Clustering methods based on likelihood ratio criteria. Biometrics 27, 387-397. Windham, M.P. (1987), Parameter modification for clustering. Journal of Classification 4, 191-214. |
||||||
关键字 | Binary clustering, Lt distance, mixture, latent class models | ||||||
发表所在刊物(或来源) | apports de Recherche No 1122 Programme 5 Automatique, Productique, Traitement du Signal et des Donnees | ||||||
发表时间 | Novembre 1989 | ||||||
适用研究领域 | |||||||
评论 | |||||||
上传时间 | 2011-1-20 13:53 | ||||||
下载文献 |
RR-1122.pdf[563.68 KB]
注:下载文献会消耗您一个“当日剩余下载次数” |
||||||
会员评论 |
|||||||
liyinguocumt |
liyinguocumt发表于:2011-9-16 00:08 正在学习中,谢谢分享! |
京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明 免责及隐私声明