|
|
|
|
|
Quarterly (Spring, Summer, Fall, Winter)
141 pp. per issue
7 x 10
Founded: 1993
ISSN 1063-6560
E-ISSN 1530-9304
2008 ISI Impact Factor: 3.000
|
Winter 2004, Vol. 12, No. 4, Pages 495-515
Posted Online March 13, 2006.
(doi:10.1162/1063656043138923)
© 2004 Massachusetts Institute of Technology
Measures of Diversity for Populations and Distances Between Individuals with Highly Reorganizable Genomes Claudio MattiussiInstitute of Systems Engineering Swiss Federal Institute of Technology of Lausanne (EPFL) 1015 Lausanne, Switzerland, claudio.mattiussi@epfl.ch Markus WaibelInstitute of Systems Engineering Swiss Federal Institute of Technology of Lausanne (EPFL) 1015 Lausanne, Switzerland, markus.waibel@epfl.ch Dario FloreanoInstitute of Systems Engineering Swiss Federal Institute of Technology of Lausanne (EPFL) 1015 Lausanne, Switzerland, dario.floreano@epfl.ch
In this paper we address the problem of defining a measure of diversity for a population of individuals whose genome can be subjected to major reorganizations during the evolutionary process. To this end, we introduce a measure of diversity for populations of strings of variable length defined on a finite alphabet, and from this measure we derive a semi-metric distance between pairs of strings. The definitions are based on counting the number of substrings of the strings, considered first separately and then collectively. This approach is related to the concept of linguistic complexity, whose definition we generalize from single strings to populations. Using the substring count approach we also define a new kind of Tanimoto distance between strings. We show how to extend the approach to representations that are not based on strings and, in particular, to the tree-based representations used in the field of genetic programming. We describe how suffix trees can allow these measures and distances to be implemented with a computational cost that is linear in both space and time relative to the length of the strings and the size of the population. The definitions were devised to assess the diversity of populations having genomes of variable length and variable structure during evolutionary computation runs, but applications in quantitative genomics, proteomics, and pattern recognition can be also envisaged.
|
|
|
|
MIT Press Journals |
Subscribe |
Contact Us |
Search |
Privacy Statement |
Terms and Conditions
© 2009 The MIT Press
|
| Technology Partner - Atypon Systems, Inc. |
|