Activate Activate Activate
contact  
Hello. Sign in to personalize your visit. New user? Register now.  

In
By author

Monthly
288 pp. per issue, 6 x 9,
illustrated
Founded: 1989
ISSN 0899-7667
E-ISSN 1530-888X
2008 ISI Impact Factor: 2.378

Neural Computation

September 2000, Vol. 12, No. 9, Pages 2109-2128
Posted Online March 13, 2006.
(doi:10.1162/089976600300015088)
© 2000 Massachusetts Institute of Technology
SMEM Algorithm for Mixture Models

Naonori Ueda

NTT Communication Science Laboratories, Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0237 Japan

Ryohei Nakano

NTT Communication Science Laboratories, Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0237 Japan

Zoubin Ghahramani

Gatsby Computational Neuroscience Unit, University College London, London WC1N 3AR, U.K.

Geoffrey E. Hinton

Gatsby Computational Neuroscience Unit, University College London, London WC1N 3AR, U.K.

* Present address: Nagoya Institute of Technology, Gokiso-cho, Showa-Ku, Nagoya 466–8555 Japan

PDF (3,139.524 KB) PDF Plus (443.803 KB)

We present a split-and-merge expectation-maximization (SMEM) algorithm to overcome the local maxima problem in parameter estimation of finite mixture models. In the case of mixture models, local maxima often involve having too many components of a mixture model in one part of the space and too few in another, widely separated part of the space. To escape from such configurations, we repeatedly perform simultaneous split-and-merge operations using a new criterion for efficiently selecting the split-and-merge candidates. We apply the proposed algorithm to the training of gaussian mixtures and mixtures of factor analyzers using synthetic and real data and show the effectiveness of using the split- and-merge operations to improve the likelihood of both the training data and of held-out test data. We also show the practical usefulness of the proposed algorithm by applying it to image compression and pattern recognition problems.

Cited by

Christian Hennig. (2010) Methods for merging Gaussian mixture components. Advances in Data Analysis and Classification
Online publication date: 22-Jan-2010.
CrossRef
Zhong Li, Jianping Fan. (2010) Exploit camera metadata for enhancing interesting region detection and photo retrieval. Multimedia Tools and Applications 46:2-3, 207-233
Online publication date: 1-Jan-2010.
CrossRef
Antonio Penalver Benavent, Francisco Escolano Ruiz, Juan Manuel Saez. (2009) Learning Gaussian Mixture Models With Entropy-Based Criteria. IEEE Transactions on Neural Networks 20:11, 1756-1771
Online publication date: 1-Nov-2009.
CrossRef
Ezequiel López-Rubio, Juan Miguel Ortiz-de-Lazcano-Lobato. (2009) Automatic Model Selection by Cross-Validation for Probabilistic PCA. Neural Processing Letters 30:2, 113-132
Online publication date: 1-Oct-2009.
CrossRef
L.J. Latecki, M. Sobel, R. Lakaemper. (2009) Piecewise Linear Models with Guaranteed Closeness to the Data. IEEE Transactions on Pattern Analysis and Machine Intelligence 31:8, 1525-1531
Online publication date: 1-Aug-2009.
CrossRef
Hangzai Luo, Jianping Fan, Xiaodong Lin, Aoying Zhou, Elisa Bertino. (2009) A distributed approach to enabling privacy-preserving model-based classifier training. Knowledge and Information Systems 20:2, 157-185
Online publication date: 1-Aug-2009.
CrossRef
Shih-Sian Cheng, Hsin-Chia Fu, Hsin-Min Wang. (2009) Model-Based Clustering by Probabilistic Self-Organizing Maps. IEEE Transactions on Neural Networks 20:5, 805-826
Online publication date: 1-May-2009.
CrossRef
Kenichi Kurihara, Max Welling. (2009) Bayesian k-Means as a “Maximization-Expectation” Algorithm. Neural Computation 21:4, 1145-1172
Online publication date: 1-Apr-2009.
Abstract | Full Text | PDF (391 KB) | PDF Plus (360 KB) 
Kuang Lin, Dirk Husmeier. (2009) Modelling Transcriptional Regulation with a Mixture of Factor Analyzers and Variational Bayesian Expectation Maximization. EURASIP Journal on Bioinformatics and Systems Biology 2009, 1-27
Online publication date: 1-Jan-2009.
CrossRef
Zhong Li, Jianping Fan. (2009) Stochastic contour approach for automatic image segmentation. Journal of Electronic Imaging 18:4, 043004
Online publication date: 1-Jan-2009.
CrossRef
Jian-Hua Zhao, P.L.H. Yu. (2008) Fast ML Estimation for the Mixture of Factor Analyzers via an ECM Algorithm. IEEE Transactions on Neural Networks 19:11, 1956-1961
Online publication date: 1-Nov-2008.
CrossRef
Wenbin Guo, Shuguang Cui. (2008) A $q$-Parameterized Deterministic Annealing EM Algorithm Based on Nonextensive Statistical Mechanics. IEEE Transactions on Signal Processing 56:7, 3069-3080
Online publication date: 1-Jul-2008.
CrossRef
C.K. Reddy, Hsiao-Dong Chiang, B. Rajaratnam. (2008) TRUST-TECH-Based Expectation Maximization for Learning Finite Mixture Models. IEEE Transactions on Pattern Analysis and Machine Intelligence 30:7, 1146-1157
Online publication date: 1-Jul-2008.
CrossRef
Yi Ma, Harm Derksen, Wei Hong, John Wright. (2007) Segmentation of Multivariate Mixed Data via Lossy Data Coding and Compression. IEEE Transactions on Pattern Analysis and Machine Intelligence 29:9, 1546-1562
Online publication date: 1-Sep-2007.
CrossRef
Kazumi Saito, Ryohei Nakano. (2007) Bidirectional clustering of weights for neural networks with common weights. Systems and Computers in Japan 38:10, 46-57
Online publication date: 1-Sep-2007.
CrossRef
Jussi Tohka, Evgeny Krestyannikov, Ivo D. Dinov, Allan MacKenzie Graham, David W. Shattuck, Ulla Ruotsalainen, Arthur W. Toga. (2007) Genetic Algorithms for Finite Mixture Model Based Voxel Classification in Neuroimaging. IEEE Transactions on Medical Imaging 26:5, 696-711
Online publication date: 1-May-2007.
CrossRef
Constantinos Constantinopoulos, Aristidis Likas. (2007) Unsupervised Learning of Gaussian Mixtures Based on Variational Component Splitting. IEEE Transactions on Neural Networks 18:3, 745-755
Online publication date: 1-May-2007.
CrossRef
Zhenyue Zhang, Yiu-ming Cheung. (2006) On Weight Design of Maximum Weighted Likelihood and an Extended EM Algorithm. IEEE Transactions on Knowledge and Data Engineering 18:10, 1429-1434
Online publication date: 1-Oct-2006.
CrossRef
Olivier Juan, Renaud Keriven, Gheorghe Postelnicu. (2006) Stochastic Motion and the Level Set Method in Computer Vision: Stochastic Active Contours. International Journal of Computer Vision 69:1, 7-25
Online publication date: 1-Aug-2006.
CrossRef
Zhe Chen, Suzanna Becker, Jeff Bondy, Ian C. Bruce, Simon Haykin. (2005) A Novel Model-Based Hearing Compensation Design Using a Gradient-Free Optimization Method. Neural Computation 17:12, 2648-2671
Online publication date: 1-Dec-2005.
Abstract | PDF (657 KB) | PDF Plus (669 KB) 
Sethu Vijayakumar, Aaron D'Souza, Stefan Schaal. (2005) Incremental Online Learning in High Dimensions. Neural Computation 17:12, 2602-2634
Online publication date: 1-Dec-2005.
Abstract | PDF (871 KB) | PDF Plus (439 KB) 
D. Endres, P. Foldiak. (2005) Bayesian Bin Distribution Inference and Mutual Information. IEEE Transactions on Information Theory 51:11, 3766-3779
Online publication date: 1-Nov-2005.
CrossRef
F. Pernkopf, D. Bouchaffra. (2005) Genetic-based EM algorithm for learning Gaussian mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence 27:8, 1344-1348
Online publication date: 1-Aug-2005.
CrossRef
Carlos Ordonez, Edward Omiecinski. (2005) Accelerating EM clustering to find high-quality solutions. Knowledge and Information Systems 7:2, 135-157
Online publication date: 1-Feb-2005.
CrossRef
Z. Zivkovic, F. van der Heijden. (2004) Recursive unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence 26:5, 651-656
Online publication date: 1-May-2004.
CrossRef
J. J. Verbeek, N. Vlassis, B. Kröse. (2003) Efficient Greedy Learning of Gaussian Mixture Models. Neural Computation 15:2, 469-485
Online publication date: 1-Feb-2003.
Abstract | PDF (597 KB) | PDF Plus (297 KB) 
Akihiro Minagawa, Yukihiko Kobayashi, Norio Tagawa, Toshiyuki Tanaka. (2003) Detection of vanishing point sequence with temporal fluctuation. Systems and Computers in Japan 34:2, 1-12
Online publication date: 1-Feb-2003.
CrossRef
A. Kehagias, V. Petridis. (2002) Predictive modular neural networks for unsupervised segmentation of switching time series: the data allocation problem. IEEE Transactions on Neural Networks 13:6, 1432-1449
Online publication date: 1-Nov-2002.
CrossRef
Michalis K. Titsias, Aristidis Likas. (2002) Mixture of Experts Classification Using a Hierarchical Mixture Model. Neural Computation 14:9, 2221-2244
Online publication date: 1-Sep-2002.
Abstract | PDF (304 KB) | PDF Plus (310 KB) 
Akihiro Minagawa, Norio Tagawa, Toshiyuki Tanaka. (2002) SMEM Algorithm Is Not Fully Compatible with Maximum-Likelihood Framework. Neural Computation 14:6, 1261-1266
Online publication date: 1-Jun-2002.
Abstract | PDF (118 KB) | PDF Plus (121 KB) 
M.A.F. Figueiredo, A.K. Jain. (2002) Unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence 24:3, 381-396
Online publication date: 1-Mar-2002.
CrossRef
Naonori Ueda. (2001) Transactions of the Japanese Society for Artificial Intelligence 16, 299-308
Online publication date: 1-Jan-2001.
CrossRef

Technology Partner - Atypon Systems, Inc.
  CrossRef member COUNTER member