GLORIA

GEOMAR Library Ocean Research Information Access

feed icon rss

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
  • 1
    Publication Date: 2022-05-25
    Description: © The Author(s), 2015. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Genome Biology and Evolution 7 (2015): 3207-3225, doi:10.1093/gbe/evv210.
    Description: High-throughput sequencing of reduced representation libraries obtained through digestion with restriction enzymes—generically known as restriction site associated DNA sequencing (RAD-seq)—is a common strategy to generate genome-wide genotypic and sequence data from eukaryotes. A critical design element of any RAD-seq study is knowledge of the approximate number of genetic markers that can be obtained for a taxon using different restriction enzymes, as this number determines the scope of a project, and ultimately defines its success. This number can only be directly determined if a reference genome sequence is available, or it can be estimated if the genome size and restriction recognition sequence probabilities are known. However, both scenarios are uncommon for nonmodel species. Here, we performed systematic in silico surveys of recognition sequences, for diverse and commonly used type II restriction enzymes across the eukaryotic tree of life. Our observations reveal that recognition sequence frequencies for a given restriction enzyme are strikingly variable among broad eukaryotic taxonomic groups, being largely determined by phylogenetic relatedness. We demonstrate that genome sizes can be predicted from cleavage frequency data obtained with restriction enzymes targeting “neutral” elements. Models based on genomic compositions are also effective tools to accurately calculate probabilities of recognition sequences across taxa, and can be applied to species for which reduced representation data are available (including transcriptomes and neutral RAD-seq data sets). The analytical pipeline developed in this study, PredRAD (https://github.com/phrh/PredRAD), and the resulting databases constitute valuable resources that will help guide the design of any study using RAD-seq or related methods.
    Description: This research was supported by the Office of Ocean Exploration and Research of the National Oceanic and Atmospheric Administration (NA09OAR4320129 to T.S.); the Division of Ocean Sciences of the National Science Foundation (OCE-1131620 to T.S.); the Astrobiology Science and Technology for Exploring Planets program of the National Aeronautics and Space Administration (NNX09AB76G to T.S.); and the Academic Programs Office (Ocean Ventures Fund to S.H.), the Ocean Exploration Institute (Fellowship support to T.M.S.), and the Ocean Life Institute of the Woods Hole Oceanographic Institution (internal grant to T.M.S. and S.H.).
    Keywords: RAD-seq ; Reduced representation sequencing ; PredRAD ; Experimental design ; Genome size prediction ; Restriction recognition sequence probability
    Repository Name: Woods Hole Open Access Server
    Type: Article
    Format: application/vnd.ms-excel
    Format: application/pdf
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...