ISSN 0899-7667
E-ISSN 1530-888X
September 2000, Vol. 12, No. 9, Pages 2109-2128
Posted Online March 13, 2006.
(doi:10.1162/089976600300015088)
© 2000 Massachusetts Institute of Technology
SMEM Algorithm for Mixture Models

Naonori Ueda, NTT Communication Science Laboratories, Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0237 Japan
Ryohei Nakano, NTT Communication Science Laboratories, Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0237 Japan
Zoubin Ghahramani, Gatsby Computational Neuroscience Unit, University College London, London WC1N 3AR, U.K.
Geoffrey E. Hinton, Gatsby Computational Neuroscience Unit, University College London, London WC1N 3AR, U.K.

* Present address: Nagoya Institute of Technology, Gokiso-cho, Showa-Ku, Nagoya 466–8555 Japan

We present a split-and-merge expectation-maximization (SMEM) algorithm to overcome the local maxima problem in parameter estimation of finite mixture models. In the case of mixture models, local maxima often involve having too many components of a mixture model in one part of the space and too few in another, widely separated part of the space. To escape from such configurations, we repeatedly perform simultaneous split-and-merge operations using a new criterion for efficiently selecting the split-and-merge candidates. We apply the proposed algorithm to the training of gaussian mixtures and mixtures of factor analyzers using synthetic and real data and show the effectiveness of using the split-and-merge operations to improve the likelihood of both the training data and of held-out test data. We also show the practical usefulness of the proposed algorithm by applying it to image compression and pattern recognition problems.
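
The split-and-merge idea described in the abstract can be illustrated with a minimal sketch for one-dimensional gaussian mixtures. It is not the authors' implementation: the abstract does not state the candidate-selection criterion, so this sketch simply tries every merge pair and split target in turn. The function names (`em`, `merge`, `split`, `candidates`, `smem`), the moment-matched merge, and the one-standard-deviation split are all assumptions made for illustration.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a 1-D gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def log_likelihood(data, params):
    """Log-likelihood of data under a mixture of (weight, mean, var) triples."""
    return sum(
        math.log(sum(w * normal_pdf(x, m, v) for w, m, v in params) + 1e-300)
        for x in data
    )


def em(data, params, iterations=50, min_var=1e-3):
    """Plain EM for a 1-D gaussian mixture; returns updated triples."""
    params = list(params)
    n = len(data)
    for _ in range(iterations):
        # E-step: posterior responsibility of each component for each point.
        resp = []
        for x in data:
            joint = [w * normal_pdf(x, m, v) for w, m, v in params]
            total = sum(joint) + 1e-300  # guard against underflow to zero
            resp.append([j / total for j in joint])
        # M-step: re-estimate each component from its weighted points.
        updated = []
        for k in range(len(params)):
            nk = max(sum(r[k] for r in resp), 1e-12)
            mean = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var = sum(r[k] * (x - mean) ** 2 for r, x in zip(resp, data)) / nk
            updated.append((nk / n, mean, max(var, min_var)))
        params = updated
    return params


def merge(a, b):
    """Assumed merge rule: moment-match two components into one."""
    (wa, ma, va), (wb, mb, vb) = a, b
    w = wa + wb
    m = (wa * ma + wb * mb) / w
    v = (wa * (va + (ma - m) ** 2) + wb * (vb + (mb - m) ** 2)) / w
    return (w, m, v)


def split(c):
    """Assumed split rule: two halves displaced by one standard deviation."""
    w, m, v = c
    s = math.sqrt(v)
    return (w / 2, m - s, v), (w / 2, m + s, v)


def candidates(params):
    """Yield every start point that merges (i, j) and splits a third component k."""
    count = len(params)
    for i in range(count):
        for j in range(i + 1, count):
            for k in range(count):
                if k in (i, j):
                    continue
                rest = [p for idx, p in enumerate(params) if idx not in (i, j, k)]
                yield rest + [merge(params[i], params[j]), *split(params[k])]


def smem(data, params, max_rounds=5):
    """Brute-force split-and-merge loop around EM (illustrative only)."""
    params = em(data, params)
    best = log_likelihood(data, params)
    for _ in range(max_rounds):
        for start in candidates(params):
            trial = em(data, start)
            score = log_likelihood(data, trial)
            if score > best + 1e-6:
                # Accept the first move that improves the likelihood.
                params, best = trial, score
                break
        else:
            break  # no candidate improved the likelihood; stop
    return params, best
```

Because a move is accepted only when it raises the training log-likelihood, the value returned by `smem` is never lower than that of plain `em` run from the same start. The number of components stays fixed, since each move removes one component by merging and adds one by splitting.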