Founded: 1989
ISSN 0899-7667
E-ISSN 1530-888X
May 2004, Vol. 16, No. 5, Pages 1039-1062
Posted Online March 13, 2006.
(doi:10.1162/089976604773135096)
© 2004 Massachusetts Institute of Technology
Greedy Learning of Multiple Objects in Images Using Robust Statistics and Factorial Learning
Christopher K.I. Williams
School of Informatics, University of Edinburgh, Edinburgh EH1 2QL, U.K., c.k.i.williams@ed.ac.uk
Michalis K. Titsias
School of Informatics, University of Edinburgh, Edinburgh EH1 2QL, U.K., M.Titsias@sms.ed.ac.uk
We consider data that are images containing views of multiple objects. Our task is to learn about each of the objects present in the images. This task can be approached as a factorial learning problem, where each image must be explained by instantiating a model for each of the objects present with the correct instantiation parameters. A major problem with learning a factorial model is that as the number of objects increases, there is a combinatorial explosion of the number of configurations that need to be considered. We develop a method to extract object models sequentially from the data by making use of a robust statistical method, thus avoiding the combinatorial explosion, and present results showing successful extraction of objects from real images.
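The combinatorial explosion mentioned in the abstract can be illustrated with a small counting sketch. It assumes, purely for illustration, that each object independently takes one of a fixed number of instantiation settings (for example, discretized positions); the abstract does not state the parameterization, and the function names and the figure of 100 settings are hypothetical. It is not the authors' algorithm, only a comparison of how many configurations a joint search and a one-object-at-a-time search would visit under that assumption.

```python
def joint_configurations(num_objects: int, settings_per_object: int) -> int:
    """Configurations a joint (factorial) search considers.

    Assumption: every object independently takes one of
    `settings_per_object` instantiation settings, so the joint
    space is the product over objects.
    """
    return settings_per_object ** num_objects


def sequential_configurations(num_objects: int, settings_per_object: int) -> int:
    """Configurations considered when objects are handled one at a time.

    Assumption: each extraction step searches only the settings
    of a single object, so the costs add instead of multiplying.
    """
    return num_objects * settings_per_object


if __name__ == "__main__":
    settings = 100  # hypothetical number of settings per object
    for n in (1, 2, 3, 4):
        print(n, joint_configurations(n, settings),
              sequential_configurations(n, settings))
```

Under these assumptions the joint count grows exponentially in the number of objects while the sequential count grows linearly, which is the motivation the abstract gives for extracting object models sequentially.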