GLORIA

GEOMAR Library Ocean Research Information Access

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
Filter
  • American Association for the Advancement of Science (AAAS)  (2)
  • 1
    In: Science, American Association for the Advancement of Science (AAAS), Vol. 380, No. 6643 ( 2023-04-28)
    Abstract: Comparative genomics provides valuable insights into gene function, phylogeny, molecular evolution, and associations between phenotypic and genomic differences. Such analyses require knowledge about which genes originated from a speciation event (orthologs) or from a duplication event (paralogs). Existing methods to detect orthologs in turn require knowledge of the location of genes in the genome (gene annotation), which is itself a challenging problem, resulting in a growing gap between sequenced and annotated genomes. RATIONALE We developed TOGA (Tool to infer Orthologs from Genome Alignments), a genomics method that integrates orthology inference and gene annotation. TOGA takes as input a gene annotation of a reference species (e.g., human, mouse, or chicken) and a whole-genome alignment between the reference and a query genome (e.g., other mammals or birds). It infers orthologous gene loci in the query genome, annotates and classifies orthologous genes, detects gene losses and duplications, and generates protein and codon alignments. Orthology detection relies on the principle that orthologous sequences are generally more similar to each other than to paralogous sequences. Whereas existing methods work with annotated protein-coding sequences, TOGA extends this similarity principle to non-exonic regions (introns and intergenic regions) and uses machine learning to detect orthologous gene loci based on alignments of intronic and intergenic regions. RESULTS We demonstrate that TOGA’s machine learning classifier detects orthologous gene loci with a very high accuracy, and also works for orthologous genes that underwent translocations or inversions. TOGA improves ortholog detection and comprehensively annotates conserved genes, even if transcriptomics data are available. Although homology-based methods such as TOGA cannot annotate orthologs of genes that are not present in the reference, we show that reference bias can be effectively counteracted by integrating annotations generated with multiple reference species. TOGA can also be applied to highly fragmented genome assemblies, where genes are often split across scaffolds. By accurately identifying and joining orthologous gene fragments, TOGA annotates entire genes and thus increases the utility of fragmented genomes for comparative analyses. TOGA’s gene classification explicitly distinguishes between genes with missing sequences (indicative of assembly incompleteness) and genes with inactivating mutations (potentially indicative of base errors). We show that this classification provides a superior benchmark for assembly completeness and quality. As genomes are generated at an increasing rate, annotation and orthology inference methods that can handle hundreds or thousands of genomes are needed. TOGA’s reference species methodology scales linearly with the number of query species. By applying TOGA with human and mouse as references to 488 placental mammal assemblies and using chicken as a reference for 501 bird assemblies, we created large comparative resources for mammals and birds that comprise gene annotations, ortholog sets, lists of inactivated genes, and multiple codon alignments. CONCLUSION TOGA provides a general strategy to cope with the annotation and orthology inference bottleneck. We envision three major uses. First, TOGA enables phylogenomic analyses of orthologous genes and screens for gene changes (e.g., selection, loss, and duplication) that are associated with phenotypic differences. Second, TOGA provides annotations of genes that are conserved in newly sequenced genomes, which can be supplemented with transcriptomics data to detect lineage-specific genes or exons. Finally, TOGA’s gene classification provides a powerful genome assembly quality benchmark. A different paradigm for orthology inference. Orthologous, but not paralogous, genes have partially aligning intronic and intergenic regions. TOGA uses this principle to infer orthologous gene loci and integrates orthology inference with gene annotation. Using a reference species, TOGA can be applied to hundreds of aligned query genomes to provide rich comparative genomics resources.
    Type of Medium: Online Resource
    ISSN: 0036-8075 , 1095-9203
    RVK:
    RVK:
    Language: English
    Publisher: American Association for the Advancement of Science (AAAS)
    Publication Date: 2023
    detail.hit.zdb_id: 128410-1
    detail.hit.zdb_id: 2066996-3
    detail.hit.zdb_id: 2060783-0
    SSG: 11
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 2
    In: Science, American Association for the Advancement of Science (AAAS), Vol. 379, No. 6628 ( 2023-01-13), p. 185-190
    Abstract: FBP2 loss in hummingbirds coincided with the evolution of true hovering flight and likely contributed to muscle adaptations.
    Type of Medium: Online Resource
    ISSN: 0036-8075 , 1095-9203
    RVK:
    RVK:
    Language: English
    Publisher: American Association for the Advancement of Science (AAAS)
    Publication Date: 2023
    detail.hit.zdb_id: 128410-1
    detail.hit.zdb_id: 2066996-3
    detail.hit.zdb_id: 2060783-0
    SSG: 11
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...