A clustering method for repeat analysis in DNA sequences.

Published on Aug 1, 2001 in Genome Biology
DOI: 10.1186/gb-2001-2-8-research0027
Natalia Volfovsky
Brian J. Haas
Steven L. Salzberg
Background: A computational system for analysis of the repetitive structure of genomic sequences is described. The method uses suffix trees to organize and search the input sequences; this data structure has been used previously for efficient computation of exact and degenerate repeats.
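
To make the idea of exact-repeat detection concrete, here is a minimal sketch in Python. It is not the system described in the paper: it substitutes a naively built suffix array (sorted suffix positions) for a suffix tree, which exposes the same shared-prefix information but is far less efficient on genomic-scale input. The function name `longest_exact_repeat` is hypothetical and chosen for illustration only.

```python
def longest_exact_repeat(seq):
    """Return (repeat, positions) for a longest substring occurring at least twice.

    Illustrative stand-in for a suffix tree: suffixes are sorted naively, so
    this is only suitable for short sequences.
    """
    n = len(seq)
    # Naive suffix array: suffix start positions in lexicographic order.
    suffix_order = sorted(range(n), key=lambda i: seq[i:])

    best_len, best_start = 0, 0
    # Any repeated substring is a common prefix of two suffixes, and the
    # longest such prefix is always found between neighbours in sorted order.
    for a, b in zip(suffix_order, suffix_order[1:]):
        k = 0
        while a + k < n and b + k < n and seq[a + k] == seq[b + k]:
            k += 1
        if k > best_len:
            best_len, best_start = k, a

    repeat = seq[best_start:best_start + best_len]
    if not repeat:
        return "", []
    # Report every (possibly overlapping) occurrence of the repeat.
    positions = [i for i in range(n - best_len + 1) if seq.startswith(repeat, i)]
    return repeat, positions
```

For example, `longest_exact_repeat("ACGTACGTTT")` reports the repeat `ACGT` at positions 0 and 4. Degenerate (inexact) repeats, which the abstract also mentions, are outside the scope of this sketch.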
