Clustering genomic words in human DNA using peaks and trends of distributions

Volume: 14, Issue: 1, Pages: 57 - 76
Published: May 31, 2019
Abstract
In this work we seek clusters of genomic words in human DNA by studying their inter-word lag distributions. Due to the particularly spiked nature of these histograms, a clustering procedure is proposed that first decomposes each distribution into a baseline and a peak distribution. An outlier-robust fitting method is used to estimate the baseline distribution (the `trend'), and a sparse vector of detrended data captures the peak structure. A...
Paper Details
Title
Clustering genomic words in human DNA using peaks and trends of distributions
Published Date
May 31, 2019
Volume
14
Issue
1
Pages
57 - 76
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.