< Back to previous page
Publication
Large-scale analysis of tandem repeat variability in the human genome
Journal Contribution - Journal Article
Tandem repeats are short DNA sequences that are repeated head-to-tail with a propensity to be variable. They constitute a significant proportion of the human genome, also occurring within coding and regulatory regions. Variation in these repeats can alter the function and/or expression of genes allowing organisms to swiftly adapt to novel environments. Importantly, some repeat expansions have also been linked to certain neurodegenerative diseases. Therefore, accurate sequencing of tandem repeats could contribute to our understanding of common phenotypic variability and might uncover missing genetic factors in idiopathic clinical conditions. However, despite long-standing evidence for the functional role of repeats, they are largely ignored because of technical limitations in sequencing, mapping and typing. Here, we report on a novel capture technique and data filtering protocol that allowed simultaneous sequencing of thousands of tandem repeats in the human genomes of a three generation family using GS-FLX-plus Titanium technology. Our results demonstrated that up to 7.6% of tandem repeats in this family (4% in coding sequences) differ from the reference sequence, and identified a de novo variation in the family tree. The method opens new routes to look at this underappreciated type of genetic variability, including the identification of novel disease-related repeats.
Journal: Nucleic Acids Research
ISSN: 0305-1048
Issue: 9
Volume: 42
Pages: 5728 - 5741
Publication year:2014
BOF-keylabel:yes
IOF-keylabel:yes
BOF-publication weight:10
CSS-citation score:2
Authors:International
Authors from:Government, Higher Education
Accessibility:Open