Publication Date:
2022-05-26
Description:
© The Author(s), 2015. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Bioinformatics 31 (2015): 1872-1874, doi:10.1093/bioinformatics/btv045.
Description:
The association of organisms to their environments is a key issue in exploring biodiversity patterns. This knowledge has traditionally been scattered, but textual descriptions of taxa and their habitats are now being consolidated in centralized resources. However, structured annotations are needed to facilitate large-scale analyses. Therefore, we developed ENVIRONMENTS, a fast dictionary-based tagger capable of identifying Environment Ontology (ENVO) terms in text. We evaluate the accuracy of the tagger on a new manually curated corpus of 600 Encyclopedia of Life (EOL) species pages. We use the tagger to associate taxa with environments by tagging EOL text content monthly, and integrate the results into the EOL to disseminate them to a broad audience of users.
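As a rough illustration of what dictionary-based tagging means in general, the sketch below matches text against a small term list using greedy longest-match lookup. It is not the ENVIRONMENTS implementation: the function name `tag_text`, the `max_words` parameter, the toy dictionary and its placeholder identifiers are all assumptions made for this example, and the identifiers are not real ENVO terms.

```python
import re

# Hypothetical toy dictionary: lowercase term -> placeholder identifier.
# These are NOT real ENVO identifiers; a real dictionary would be built from the ontology.
TOY_DICTIONARY = {
    "coral reef": "ENVO:placeholder-1",
    "reef": "ENVO:placeholder-2",
    "freshwater lake": "ENVO:placeholder-3",
    "lake": "ENVO:placeholder-4",
}


def tag_text(text, dictionary, max_words=3):
    """Return (start, end, matched_text, identifier) tuples via greedy longest match."""
    # Character offsets of every word token in the input.
    tokens = [(m.start(), m.end()) for m in re.finditer(r"\w+", text)]
    matches = []
    i = 0
    while i < len(tokens):
        hit = None
        # Try the longest window of words first, then shrink it.
        for n in range(min(max_words, len(tokens) - i), 0, -1):
            start, end = tokens[i][0], tokens[i + n - 1][1]
            candidate = " ".join(text[s:e].lower() for s, e in tokens[i:i + n])
            if candidate in dictionary:
                hit = (start, end, text[start:end], dictionary[candidate])
                i += n  # skip past the matched words so matches do not overlap
                break
        if hit:
            matches.append(hit)
        else:
            i += 1
    return matches


if __name__ == "__main__":
    sample = "Found on a coral reef and in a freshwater lake."
    for start, end, surface, identifier in tag_text(sample, TOY_DICTIONARY):
        print(start, end, surface, identifier)
```

Preferring the longest match means "coral reef" is tagged as one term rather than as the shorter entry "reef"; a production tagger would also need to handle plurals, hyphenation and ambiguous names, which this sketch ignores.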
Description:
This work was supported by the Encyclopedia of Life Rubenstein Fellows Program [CRDF EOL-33066-13/E33066], the LifeWatchGreece Research Infrastructure [384676-94/GSRT/NSRF(C&E)] and the Novo Nordisk Foundation Center for Protein Research [NNF14CC0001].
Repository Name:
Woods Hole Open Access Server
Type:
Article
Format:
application/pdf