Monthly
288 pp. per issue, 6 x 9,
illustrated
Founded: 1989
ISSN 0899-7667
E-ISSN 1530-888X
2008 ISI Impact Factor: 2.378
|
August 15, 1998, Vol. 10, No. 6, Pages 1455-1480
Posted Online March 13, 2006.
(doi:10.1162/089976698300017269)
© 1998 Massachusetts Institute of Technology
An Equivalence Between Sparse Approximation and Support Vector Machines Federico GirosiCenter for Biological and Computational Learning and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.
This article shows a relationship between two different approximation techniques: the support vector machines (SVM), proposed by V. Vapnik (1995) and a sparse approximation scheme that resembles the basis pursuit denoising algorithm (Chen, 1995; Chen, Donoho, & Saunders, 1995). SVM is a technique that can be derived from the structural risk minimization principle (Vapnik, 1982) and can be used to estimate the parameters of several different approximation schemes, including radial basis functions, algebraic and trigonometric polynomials, B-splines, and some forms of multilayer perceptrons. Basis pursuit denoising is a sparse approximation technique in which a function is reconstructed by using a small number of basis functions chosen from a large set (the dictionary). We show that if the data are noiseless, the modified version of basis pursuit denoising proposed in this article is equivalent to SVM in the following sense: if applied to the same data set, the two techniques give the same solution, which is obtained by solving the same quadratic programming problem. In the appendix, we present a derivation of the SVM technique in the framework of regularization theory, rather than statistical learning theory, establishing a connection between SVM, sparse approximation, and regularization theory. Cited byGiorgio Gnecco, Marcello Sanguineti. (2010) Regularization Techniques and Suboptimal Solutions to Optimization Problems in Learning from Data. Neural Computation 22:3, 793-829 Online publication date: 1-Mar-2010. Abstract
| Full Text
| PDF (290 KB)
| PDF Plus (304 KB) Snehamoy Chatterjee, Sukumar Bandopadhyay, David Machuca. (2010) Ore Grade Prediction Using a Genetic Algorithm and Clustering Based Ensemble Neural Network Model. Mathematical Geosciences Online publication date: 2-Feb-2010. CrossRef Gao Guo, Jiang-She Zhang, Gai-Ying Zhang. (2010) A method to sparsify the solution of support vector regression. Neural Computing and Applications 19:1, 115-122 Online publication date: 1-Feb-2010. CrossRef Gao Guo, Jiang-She Zhang, Gai-Ying Zhang. (2009) A method to sparsify the solution of support vector regression. Neural Computing and Applications 18:8, 919-926 Online publication date: 1-Nov-2009. CrossRef G. Gnecco, M. Sanguineti. (2009) Estimates of Variation with Respect to a Set and Applications to Optimization Problems. Journal of Optimization Theory and Applications Online publication date: 30-Oct-2009. CrossRef Julie Caron, Alain Mangé, Bernard Guillot, Jérôme Solassol. (2009) Highly sensitive detection of melanoma based on serum proteomic profiling. Journal of Cancer Research and Clinical Oncology 135:9, 1257-1264 Online publication date: 1-Sep-2009. CrossRef Giorgio Gnecco, Marcello Sanguineti. (2009) Accuracy of suboptimal solutions to kernel principal component analysis. Computational Optimization and Applications 42:2, 265-287 Online publication date: 1-Mar-2009. CrossRef Giorgio Gnecco, Marcello Sanguineti. (2009) The weight-decay technique in learning from data: an optimization point of view. Computational Management Science 6:1, 53-79 Online publication date: 1-Feb-2009. CrossRef Grzegorz Zadora. (2009) Classification of Glass Fragments Based on Elemental Composition and Refractive Index*. Journal of Forensic Sciences 54:1, 49-59 Online publication date: 1-Jan-2009. CrossRef Yi-Chung Lin, Raphael T. Haftka, Nestor V. Queipo, Benjamin J. Fregly. (2009) Two-Dimensional Surrogate Contact Modeling for Computationally Efficient Dynamic Simulation of Total Knee Replacements. Journal of Biomechanical Engineering 131:4, 041010 Online publication date: 1-Jan-2009. CrossRef Egar Sanchez, Salvador Pintos, Nestor V. Queipo. (2008) Toward an optimal ensemble of kernel-based approximations with engineering applications. Structural and Multidisciplinary Optimization 36:3, 247-261 Online publication date: 1-Sep-2008. CrossRef Liefeng Bo, Ling Wang, Licheng Jiao. (2008) Training Hard-Margin Support Vector Machines Using Greedy Stagewise Algorithm. IEEE Transactions on Neural Networks 19:8, 1446-1455 Online publication date: 1-Aug-2008. CrossRef Wang Liejun, Jia Zhenhong, Lu Zhaogan. (2008) Multi-Resolution Signal Decomposition and Approximation Based on SVMS. Information Technology Journal 7:2, 320-325 Online publication date: 1-Feb-2008. CrossRef A. D'Addabbo, A. Latiano, O. Palmieri, R. Maglietta, V. Annese, N. Ancona. (2007) Regularized Least Squares Classifiers may Predict Crohn's Disease from Profiles of Single Nucleotide Polymorphisms. Annals of Human Genetics 71:4, 537-549 Online publication date: 1-Jul-2007. CrossRef Waqar Saleem, Oliver Schall, Giuseppe Patanè, Alexander Belyaev, Hans-Peter Seidel. (2007) On stochastic methods for surface reconstruction. The Visual Computer 23:6, 381-395 Online publication date: 3-May-2007. CrossRef Peter Williams, Sheng Li, Jianfeng Feng, Si Wu. (2007) A Geometrical Method to Improve Performance of the Support Vector Machine. IEEE Transactions on Neural Networks 18:3, 942-947 Online publication date: 1-May-2007. CrossRef Grzegorz Zadora. (2007) Glass analysis for forensic purposes—a comparison of classification methods. Journal of Chemometrics 21:5-6, 174-186 Online publication date: 1-May-2007. CrossRef Licheng Jiao, Liefeng Bo, Ling Wang. (2007) Fast Sparse Approximation for Least Squares Support Vector Machine. IEEE Transactions on Neural Networks 18:3, 685-697 Online publication date: 1-May-2007. CrossRef Gabriele Steidl, Stephan Didas, Julia Neumann. (2006) Splines in Higher Order TV Regularization. International Journal of Computer Vision 70:3, 241-255 Online publication date: 1-Dec-2006. CrossRef Yubing Tong, Dongkai Yang, Qishan Zhang. (2006) Wavelet kernel Support Vector Machines for sparse approximation. Journal of Electronics (China) 23:4, 539-542 Online publication date: 1-Jul-2006. CrossRef Anoop A. Mullur, Achille Messac. (2006) Metamodeling using extended radial basis functions: a comparative approach. Engineering with Computers 21:3, 203-217 Online publication date: 1-May-2006. CrossRef Nicola Ancona, Sebastiano Stramaglia. (2006) An Invariance Property of Predictors in Kernel-Induced Hypothesis Spaces. Neural Computation 18:4, 749-759 Online publication date: 1-Apr-2006. Abstract
| PDF (97 KB)
| PDF Plus (100 KB) Marcelo Espinoza, Johan A. K. Suykens, Bart De Moor. (2006) Fixed-size Least Squares Support Vector Machines: A Large Scale Application in Electrical Load Forecasting. Computational Management Science 3:2, 113-129 Online publication date: 1-Apr-2006. CrossRef T.V. Pham, A.W.M. Smeulders. (2006) Sparse representation for coarse and fine object recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 28:4, 555-567 Online publication date: 1-Apr-2006. CrossRef J.A. Tropp. (2006) Just relax: convex programming methods for identifying sparse signals in noise. IEEE Transactions on Information Theory 52:3, 1030-1051 Online publication date: 1-Mar-2006. CrossRef G. Steidl. (2006) A Note on the Dual Treatment of Higher-Order Regularization Functionals. Computing 76:1-2, 135-148 Online publication date: 1-Jan-2006. CrossRef A.F. Atiya, M.A. Aly, A.G. Parlos. (2005) Sparse Basis Selection: New Results and Application to Adaptive Prediction of Video Source Traffic. IEEE Transactions on Neural Networks 16:5, 1136-1146 Online publication date: 1-Sep-2005. CrossRef Kwa, M.O. Franz, B. Scholkopf. (2005) Iterative kernel principal component analysis for image modeling. IEEE Transactions on Pattern Analysis and Machine Intelligence 27:9, 1351-1366 Online publication date: 1-Sep-2005. CrossRef Henri Baillères, Olivier Vitrac, Tahiana Ramananantoandro. (2005) Assessment of continuous distribution of wood properties from a low number of samples: Application to the variability of modulus of elasticity between trees and within a tree. Holzforschung 59:5, 524-530 Online publication date: 1-Sep-2005. CrossRef Tomasz Czekaj, Wen Wu, Beata Walczak. (2005) About kernel latent variable approaches and SVM. Journal of Chemometrics 19:5-7, 341-354 Online publication date: 1-May-2005. CrossRef Tirusew Asefa, Mariush Kemblowski, Upmanu Lall, Gilberto Urroz. (2005) Support vector machines for nonlinear state space reconstruction: Application to the Great Salt Lake time series. Water Resources Research 41:12, Online publication date: 1-Jan-2005. CrossRef Liam Paninski, Jonathan W. Pillow, Eero P. Simoncelli. (2004) Maximum Likelihood Estimation of a Stochastic Integrate-and-Fire Neural Encoding Model. Neural Computation 16:12, 2533-2561 Online publication date: 1-Dec-2004. Abstract
| PDF (548 KB)
| PDF Plus (592 KB) Masashi Sugiyama, Motoaki Kawanabe, Klaus-Robert Müller. (2004) Trading Variance Reduction with Unbiasedness: The Regularized Subspace Information Criterion for Robust Model Selection in Kernel Regression. Neural Computation 16:5, 1077-1104 Online publication date: 1-May-2004. Abstract
| PDF (185 KB)
| PDF Plus (201 KB) Tirusew Asefa. (2004) Support vectors–based groundwater head observation networks design. Water Resources Research 40:11, Online publication date: 1-Jan-2004. CrossRef M.K. Omar, M. Hasegawa-Johnson. (2003) Approximately independent factors of speech using nonlinear symplectic transformation. IEEE Transactions on Speech and Audio Processing 11:6, 660-671 Online publication date: 1-Nov-2003. CrossRef Zhao Kun, Liu Guoqing, Ge Wenzhong, Dang Renqing, Takao Takeda. (2003) Retrieval of single-Doppler radar wind field by nonlinear approximation. Advances in Atmospheric Sciences 20:2, 195-204 Online publication date: 1-Jun-2003. CrossRef Tong Zhang. (2002) Approximation Bounds for Some Sparse Kernel Regression Algorithms. Neural Computation 14:12, 3013-3042 Online publication date: 1-Dec-2002. Abstract
| PDF (202 KB)
| PDF Plus (197 KB) Zhe Chen, Simon Haykin. (2002) On Different Facets of Regularization Theory. Neural Computation 14:12, 2791-2846 Online publication date: 1-Dec-2002. Abstract
| PDF (1217 KB)
| PDF Plus (1268 KB) (2002) Book Reviews. IEEE Transactions on Neural Networks 13:3, 785-787 Online publication date: 1-May-2002. CrossRef Enrico Capobianco. (2002) Multiresolution approximation for volatility processes. Quantitative Finance 2:2, 91-110 Online publication date: 1-Apr-2002. CrossRef Jiann-Ming Wu. (2002) Natural Discriminant Analysis Using Interactive Potts Models. Neural Computation 14:3, 689-713 Online publication date: 1-Mar-2002. Abstract
| PDF (379 KB)
| PDF Plus (359 KB) Ying Guo, P.L. Bartlett, J. Shawe-Taylor, R.C. Williamson. (2002) Covering numbers for support vector machines. IEEE Transactions on Information Theory 48:1, 239 CrossRef J. B. Gao, C. J. Harris, S. R. Gunn. (2001) On a Class of Support Vector Kernels Based on Frames in Function Hilbert Spaces. Neural Computation 13:9, 1975-1994 Online publication date: 1-Sep-2001. Abstract
| PDF (323 KB)
| PDF Plus (350 KB) Bernhard Schölkopf, John C. Platt, John Shawe-Taylor, Alex J. Smola, Robert C. Williamson. (2001) Estimating the Support of a High-Dimensional Distribution. Neural Computation 13:7, 1443-1471 Online publication date: 1-Jul-2001. Abstract
| PDF (1463 KB)
| PDF Plus (1185 KB) T. Van Gestel, J.A.K. Suykens, D.-E. Baestaens, A. Lambrechts, G. Lanckriet, B. Vandaele, B. De Moor, J. Vandewalle. (2001) Financial time series prediction using least squares support vector machines within the evidence framework. IEEE Transactions on Neural Networks 12:4, 809-821 Online publication date: 1-Jul-2001. CrossRef G. De Nicolao, G. Ferrari-Trecate. (2001) Regularization networks: fast weight calculation via Kalman filtering. IEEE Transactions on Neural Networks 12:2, 228-235 Online publication date: 1-Mar-2001. CrossRef A. Ruiz, P.E. Lopez-de-Teruel. (2001) Nonlinear kernel-based statistical pattern analysis. IEEE Transactions on Neural Networks 12:1, 16 CrossRef Volker Tresp. (2000) A Bayesian Committee Machine. Neural Computation 12:11, 2719-2741 Online publication date: 1-Nov-2000. Abstract
| PDF (144 KB)
| PDF Plus (170 KB) C. Liu, H. Wechsler. (2000) Evolutionary pursuit and its application to face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 22:6, 570-582 Online publication date: 1-Jun-2000. CrossRef Bernhard Schölkopf, Alex J. Smola, Robert C. Williamson, Peter L. Bartlett. (2000) New Support Vector Algorithms. Neural Computation 12:5, 1207-1245 Online publication date: 1-May-2000. Abstract
| PDF (1169 KB)
| PDF Plus (392 KB) Azriel Rosenfeld, Harry Wechsler. (2000) Pattern recognition: Historical perspective and future directions. International Journal of Imaging Systems and Technology 11:2, 101-116 Online publication date: 1-Jan-2000. CrossRef Shun-ichi Amari. (1999) Natural Gradient Learning for Over- and Under-Complete Bases in ICA. Neural Computation 11:8, 1875-1883 Online publication date: 1-Nov-1999. Abstract
| PDF (134 KB)
| PDF Plus (170 KB) D. Mattera, F. Palmieri, S. Haykin. (1999) Simple and robust methods for support vector expansions. IEEE Transactions on Neural Networks 10:5, 1038 CrossRef B. Scholkopf, S. Mika, C.J.C. Burges, P. Knirsch, K.-R. Muller, G. Ratsch, A.J. Smola. (1999) Input space versus feature space in kernel-based methods. IEEE Transactions on Neural Networks 10:5, 1000 CrossRef V.N. Vapnik. (1999) An overview of statistical learning theory. IEEE Transactions on Neural Networks 10:5, 988 CrossRef Tomaso Poggio, Federico Girosi. (1998) A Sparse Representation for Function Approximation. Neural Computation 10:6, 1445-1454 Online publication date: 1-Aug-1998. Abstract
| PDF (244 KB)
| PDF Plus (154 KB)
|