GLORIA

GEOMAR Library Ocean Research Information Access

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
  • 1
    In: Research Ideas and Outcomes, Pensoft Publishers, Vol. 6 ( 2020-02-03)
    Abstract: Plants, fungi and algae are important components of global biodiversity and are fundamental to all ecosystems. They are the basis for human well-being, providing food, materials and medicines. Specimens of all three groups of organisms are accommodated in herbaria, where they are commonly referred to as botanical specimens. The large number of specimens in herbaria provides an ample, permanent and continuously improving knowledge base on these organisms and an indispensable source for the analysis of the distribution of species in space and time critical for current and future research relating to global biodiversity. In order to make full use of this resource, a research infrastructure has to be built that grants comprehensive and free access to the information in herbaria and botanical collections in general. This can be achieved through digitization of the botanical objects and associated data. The botanical research community can count on a long-standing tradition of collaboration among institutions and individuals. It agreed on data standards and standard services even before the advent of computerization and information networking, an example being the Index Herbariorum as a global registry of herbaria helping towards the unique identification of specimens cited in the literature. In the spirit of this collaborative history, 51 representatives from 30 institutions advocate to start the digitization of botanical collections with the overall wall-to-wall digitization of the flat objects stored in German herbaria. Germany has 70 herbaria holding almost 23 million specimens according to a national survey carried out in 2019. 87% of these specimens are not yet digitized. Experiences from other countries like France, the Netherlands, Finland, the US and Australia show that herbaria can be comprehensively and cost-efficiently digitized in a relatively short time due to established workflows and protocols for the high-throughput digitization of flat objects. Most of the herbaria are part of a university (34), fewer belong to municipal museums (10) or state museums (8), six herbaria belong to institutions also supported by federal funds such as Leibniz institutes, and four belong to non-governmental organizations. A common data infrastructure must therefore integrate different kinds of institutions. Making full use of the data gained by digitization requires the set-up of a digital infrastructure for storage, archiving, content indexing and networking as well as standardized access for the scientific use of digital objects. A standards-based portfolio of technical components has already been developed and successfully tested by the Biodiversity Informatics Community over the last two decades, comprising among others access protocols, collection databases, portals, tools for semantic enrichment and annotation, international networking, storage and archiving in accordance with international standards. This was achieved through the funding by national and international programs and initiatives, which also paved the road for the German contribution to the Global Biodiversity Information Facility (GBIF). Herbaria constitute a large part of the German botanical collections that also comprise living collections in botanical gardens and seed banks, DNA- and tissue samples, specimens preserved in fluids or on microscope slides and more. Once the herbaria are digitized, these resources can be integrated, adding to the value of the overall research infrastructure. The community has agreed on tasks that are shared between the herbaria, as the German GBIF model already successfully demonstrates. We have compiled nine scientific use cases of immediate societal relevance for an integrated infrastructure of botanical collections. They address accelerated biodiversity discovery and research, biomonitoring and conservation planning, biodiversity modelling, the generation of trait information, automated image recognition by artificial intelligence, automated pathogen detection, contextualization by interlinking objects, enabling provenance research, as well as education, outreach and citizen science. We propose to start this initiative now in order to valorize German botanical collections as a vital part of a worldwide biodiversity data pool.
    Type of Medium: Online Resource
    ISSN: 2367-7163
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2020
    detail.hit.zdb_id: 2833254-4
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 2
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 3 ( 2019-06-18)
    Abstract: Herbarium specimens have always played a central role in the classical disciplines of plant sciences and the global digitisation efforts now open new horizons. To make full use of the inherent possibilities of specimen based taxonomic descriptions corresponding workflows are needed. A crucial step in the comparative analyses of organisms is the preparation of a character matrix to record and compare the morphological variation of taxa on the basis of individual specimens. This project focuses on the optimisation of the taxonomic research process with respect to delimitation and characterisation (“descriptions”) of taxa (Henning et al. 2018). The angiosperm order Caryophyllales provides exemplar use cases through cooperation with the Global Caryophyllales Initiative (Borsch et al. 2015). The workflow for sample data handling (Kilian et al. 2015), implemented on the EDIT Platform for Cybertaxonomy (http://www.cybertaxonomy.org, Ciardelli et al. 2009), has been extended to support additive characterisation of taxa via specimen character data. The Common Data Model (CDM), already supporting persistent inter-linking of specimens and their metadata (Plitzner et al. 2017), has been adapted to facilitate specimen descriptions with characters constructed from the combination of structure and property terms and their corresponding states. Semantic web technology is used to establish and continuously elaborate expert community-coordinated exemplar vocabularies with term ontologies and explanations for characters and states (GFBio Terminology Service, Karam et al. 2016). Character data are recorded and stored in structured form in character state matrices for individual specimens instead of taxa, which allows generation of taxon characterisations by aggregating the data sets for the individual specimens included. Separating characters in structures and properties, which are based on concepts in public ontologies, guarantees a high visibility and instant re-usability of these character data. Taking into account that taxon concepts evolve during the iterative knowledge generation process in systematic biology, additivity of character data from specimen to taxon level therefore greatly facilitates the construction and reproducibility of taxon characterisations from changing specimen and character data sets.
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2019
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 3
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 3 ( 2019-06-21)
    Abstract: The International Code of Nomenclature (ICN) for algae, fungi, and plants provides for nomenclatural indexing through nomenclatural repositories (Turland et al. 2018, Art. 42). Registering nomenclatural novelties and nomenclatural acts means that repositories will keep track of names (species names and names at all ranks, replacement names, names proposed for conservation or rejection, validated names) and of nomenclatural types, including lectotypes and epitypes. Thus, PhycoBank has been advocated by different players such as the International Society for Diatom Research (ISDR), the Global Biodiversity Information Facility (GBIF), and the Special Committee on Registration of Algal and Plant Names (including fossils). Aided by a grant from the German Research Foundation (DFG, JA 874/8-1), PhycoBank has been established at the BGBM Berlin as the repository for nomenclatural acts of algae. As added value, PhycoBank deals with orthographical variants in linking the published spelling of a name to the corrected one with reference to the respective article of the ICN (Turland et al. 2018, Art. 60). Almost all nomenclatural acts are the result of taxonomic issues but also have implications for the taxonomic work of specialists worldwide. The challenge for implementing a registration system like PhycoBank is to inform individual scientists as well as to feed data into data networks, to strengthen their underlying names backbone linking scientific names to occurrences. Since June 2018, PhycoBank staff are operating the registration system using a user-friendly data entry web application. This interface for data entry by volunteers has been available since March 2019. All data entered into the system undergoes a curatorial process to assure a high level of data quality. The data entry web application is complemented by a public data access portal which is available under https://www.phycobank.org (Fig. 1). PhycoBank can be searched for scientific names (including unregistered formal or informal higher rank names), for categories of types and PhycoBank identifiers. PhycoBank assigns resolvable and globally unique HTTP-based identifiers for nomenclatural acts, e.g. for the genus Iconella, https://phycobank.org/100040. Via these PhycoBank identifiers, the corresponding data and metadata can be retrieved in human- and machine-readable formats. More than ten journals have published PhycoBank identifiers so far, allowing cross-linking between their PDF and the PhycoBank system. The Pensoft journals are pioneering an automatic registration workflow modeled and specified by the PhycoBank team. Classifications are frequently subject to changes. Currently, the algal classification is under discussion because of results from phylogenetic research. PhycoBank aims to be neutral with respect to higher classification, but tracks classification information of each name that is registered into a directed graph of available higher rank names to record fragments of higher classification information and to facilitate search functionalities. All scientists, editors, and publishers involved in the publication of nomenclatural novelties are invited to contact PhycoBank (curation@phycobank.org) to influence the prototypical registration process and to improve PhycoBank’s functionality.
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2019
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 4
    Online Resource
    Online Resource
    Pensoft Publishers ; 2018
    In:  Biodiversity Information Science and Standards Vol. 2 ( 2018-05-21), p. e25415-
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 2 ( 2018-05-21), p. e25415-
    Abstract: On herbarium sheets, data elements such as plant name, collection site, collector, barcode and accession number are found mostly on labels glued to the sheet. The data are thus visible on specimen images. With continuously improving technologies for collection mass-digitisation it has become easier and easier to produce high quality images of herbarium sheets and in the last few years herbarium collections worldwide have started to digitize specimens on an industrial scale (Tegelberg et al. 2014). To use the label data contained in these massive numbers of images, they have to be captured and databased. Currently, manual data entry prevails and forms the principal cost and time limitation in the digitization process. The StanDAP-Herb Project has developed a standard process for (semi-) automatic detection of data on herbarium sheets. This is a formal extensible workflow integrating a wide range of automated specimen image analysis services, used to replace time-consuming manual data input as far as possible. We have created web-services for OCR (Optical Character Recognition); for identifying regions of interest in specimen images and for the context-sensitive extraction of information from text recognized by OCR. We implemented the workflow as an extension of the OpenRefine platform (Verborgh and De Wilde 2013).
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2018
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 5
    Online Resource
    Online Resource
    Pensoft Publishers ; 2017
    In:  Proceedings of TDWG Vol. 1 ( 2017-08-15), p. e20317-
    In: Proceedings of TDWG, Pensoft Publishers, Vol. 1 ( 2017-08-15), p. e20317-
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2017
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 6
    Online Resource
    Online Resource
    Pensoft Publishers ; 2019
    In:  Biodiversity Information Science and Standards Vol. 3 ( 2019-06-13)
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 3 ( 2019-06-13)
    Abstract: The World Flora Online initiative (www.worldfloraonline.org) is a global consortium of many of the world’s leading botanical institutions with the aim to offer a worldwide information resource for plant information (Miller 2019). It aggregates information provided by the botanical community, either through specialized information systems or published taxonomic treatments and floras. WFO distinguishes contributions to the Taxonomic Backbone (i.e. the community-curated consensus system of scientific names, taxa, synonyms and their classification) from Content contributions (i.e. descriptive data, images, distribution, etc.). In the course of writing the guidelines for contributors, a format for the electronic submission of these data had to be developed. The expectation was that this would be a comparatively simple task, drawing on existing TDWG standards and using established formats and tools, i.e. Darwin Core Archive, the Integrated Publishing Toolkit and the DwC-A Validator tool. Actually, it was not that simple, as several problems had to be solved. First of all it was somewhat difficult to find authoritative sources on the web for existing data definitions. That solved, the actual definitions were, in some cases not really adequate for use by the botanical community, or a narrower description had to be given, or our portal software (based on the eMonocot portal system developed by the Royal Botanic Gardens, Kew) required a different controlled vocabulary. A decision was taken to follow the DwC naming conventions for data elements, although in some cases the designations - or at least the applications in a checklist context - were patently wrong (e.g. “taxonID” as the identifier for names, including synonyms). For Content contributions, the DwC-A standard star schema was useful, but it was not appropriate for backbone contributions with their multiple relationships e.g., to literature references. This experience underlines the necessity for a coherent documentation of standards (see Blum et al. (2019)), including user-friendly access to definitions, data validation tools and clear guidelines for extensions/subtyping also at the element-level.
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2019
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 7
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 3 ( 2019-06-18)
    Abstract: The Platform for Cybertaxonomy (http://www.cybertaxonomy.org) is a standards-based open-source software framework covering the breadth of the taxonomic workflow, from fieldwork to publication (Ciardelli et al. 2009). It provides coupled tools for full, customized access to taxonomic data, editing and management, and collaborative team work. At the core of the platform is the Common Data Model (CDM, Müller et al. 2017), offering a comprehensive information model covering all relevant data domains: names and classifications, descriptive data (morphological and molecular), media, geographic information, literature, specimens, types, persons, and external resources. Platform compliant software interacts via services and includes the following components: CDM Server Taxonomic Editor Rich Client Web-based editors Drupal-based and highly configurable portal software Map services and map viewer Xper2 descriptive data editor Specimen search tool Import and export modules CDM Server Taxonomic Editor Rich Client Web-based editors Drupal-based and highly configurable portal software Map services and map viewer Xper2 descriptive data editor Specimen search tool Import and export modules Recent platform-based developments include software components for deriving formal species-level descriptions from measurements on individual specimens (Henning et al. 2018) as well as a registration system for nomenclatural acts of algae (Phycobank, https://www.phycobank.org/). Currently, about 30 portals with regional and taxonomic foci are using the Platform for Cybertaxonomy as their technical basis for capturing, managing, and publishing biodiversity data over the World Wide Web. Prominent examples are the Euro+Med Plantbase, the International Caryophyllales Network, and the Flora of Greece.
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2019
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 8
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 2 ( 2018-05-21), p. e26658-
    Abstract: A simple, permanent and reliable specimen identifier system is needed to take the informatics of collections into a new era of interoperability. A system of identifiers based on HTTP URI (Uniform Resource Identifiers), endorsed by the Consortium of European Taxonomic Facilities (CETAF), has now been rolled out to 14 member organisations (Güntsch et al. 2017). CETAF-Identifiers have a Linked Open Data redirection mechanism for both human- and machine-readable access and, if fully implemented, provide Resource Description Framework (RDF) -encoded specimen data following best practices continuously improved by members of the initiative. To date, more than 20 million physical collection objects have been equipped with CETAF Identifiers (Groom et al. 2017). To facilitate the implementation of stable identifiers, simple redirection scripts and guidelines for deciding on the local identifier syntax have been compiled (http://cetafidentifiers.biowikifarm.net/wiki/Main_Page). Furthermore, a capable "CETAF Specimen URI Tester" (http://herbal.rbge.info/) provides an easy-to-use service for testing whether the existing identifiers are operational. For the usability and potential of any identifier system associated with evolving data objects, active links to the source information are critically important. This is particularly true for natural history collections facing the next wave of industrialised mass digitisation, where specimens come online with only basic, but rapidly evolving label data. Specimen identifier systems must therefore have components for monitoring the availability and correct implementation of individual data objects. Our next implementation steps will involve the development of a "Semantic Specimen Catalogue", which has a list of all existing specimen identifiers together with the latest RDF metadata snapshot. The catalogue will be used for semantic inference across collections as well as the basis for periodic testing of identifiers.
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2018
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 9
    Online Resource
    Online Resource
    Pensoft Publishers ; 2019
    In:  Biodiversity Information Science and Standards Vol. 3 ( 2019-06-19)
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 3 ( 2019-06-19)
    Abstract: Herbarium specimens are central to botanical science and of rising importance thanks to increasing accessibility and broadened usability. Alongside the many new uses of specimen data, sit a range of traditional uses supporting the collection of morphological data and their application to taxonomy and systematics. (Henning et al. 2018). Technical workflows are needed to support the sustainable collection of this traditional information and maintain the high quality of the morphological data. Data exchange and re-usability requires the use of accepted controlled vocabularies (community approved) that are accessible (web-based ontologies and term vocabularies) and reliable (long-term availability/unique identifiers). The same applies to datasets that must be stored accessibly and sustainably by maintaining all data relationships that would facilitate convenient re-use. This project aims to construct a comprehensive workflow to optimise the delimitation and characterisation (“descriptions”) of taxa (see complementary talk by Plitzner et al.). It is implemented on the open-source software framework of the EDIT Platform for Cybertaxonomy (http://www.cybertaxonomy.org, Ciardelli et al. 2009) extending the workflow for sample data processing developed in a preceding project (Kilian et al. 2015). The principal goals of this new software component are: specimen-level recording and storage of character data in structured character matrices generating taxon characterisations by aggregating the individual specimen-based datasets using and developing community-coordinated, ontology-based exemplar vocabularies persistently linking character datasets with source specimens for high visibility and re-usability specimen-level recording and storage of character data in structured character matrices generating taxon characterisations by aggregating the individual specimen-based datasets using and developing community-coordinated, ontology-based exemplar vocabularies persistently linking character datasets with source specimens for high visibility and re-usability The angiosperm order, Caryophyllales, provides an exemplar use case through cooperation with the Global Caryophyllales Initiative (Borsch et al. 2015). A basic set of morphological terms and vocabularies has been obtained from various online sources (ontologies, glossaries) and can be used, searched and expanded in the EDIT platform. The terms are categorised into: structures, properties and states. Different editors have been developed to combine structure and property terms to characters and assign a customised state vocabulary (categorical) or suitable values and units (numerical) to them. The workflow is built around a data set defining the taxonomic environment of individual use cases. A data set is specified by the characters and a taxonomic group, which can be filtered by area or rank. The dataset can be opened in a tabular representation (character matrix) to enter preselected state terms or values for the individual specimen. The matrix provides several features for basic comparison and analysis and allows the entry of alternative datasets (e.g. literature). Finally, the aggregation of data subsets to potential taxonomic units by adding up the values and summarising character states, allows the convenient test of taxonomic hypotheses. The term additivity is used here to describe this set of workflows and processes adding value to herbarium specimens and accumulating the specimen data for a taxon description.
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2019
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
  • 10
    In: Biodiversity Information Science and Standards, Pensoft Publishers, Vol. 3 ( 2019-07-02)
    Abstract: For the last 15 years, Biodiversity Information Standards (TDWG) has recognized two competing standards for organism occurrence data, ABCD (Access to Biological Collections Data; Holetschek et al. 2012) and DarwinCore (Wieczorek et al. 2012). These two representations emerged from contrasting strategies for mobilizing information about organism occurrences (also commonly called species occurrence data). ABCD was capable of representing details of more kinds of information, but was necessarily more complicated. DarwinCore, on the other hand, was simpler but more limited in its ability to represent data of different kinds and formats. TDWG endorsed both standards because the different projects and communities that generated them remained dedicated to their different strategies and tool sets, and the Global Biodiversity Information Facility (GBIF) developed the ability to integrate data published in either standard. Since their inceptions, DarwinCore and ABCD have become more similar. DarwinCore has gotten more complicated through the addition of terms and has begun to assign terms to classes. ABCD is now expressed in RDF (Resource Description Framework), potentially enabling re-use of terms with alternative structures among classes. At the same time, methodologies for conceptual modeling and representing complex scientific data have continued to evolve. In particular, a suite of modeling and data representation methods related to linked data and the semantic web, i.e., RDF, SKOS (Simple Knowledge Organization System), and OWL (web Ontology Language), promise to make it easier for us to reconcile shared concepts among different representations or schemas. A mapping between ABCD 2.1 and DarwinCore has existed since before 2005.*1 ABCD 3.0 and DarwinCore are both now represented in RDF. In addition, the BioCollections Ontology (BCO) covers many of the shared concepts and is derived from the Basic Formal Ontology (BFO), an upper level ontology that has oriented many other biomedical ontologies. Reconciling ABCD and DarwinCore through alignment with BCO (in the OBO Foundry; Smith et al. 2007) would better connect TDWG standards to other domains in biology. We appreciate that many working scientists and data managers perceive ontologies as overly complicated. To mitigate the steep learning curve associated with ontologies, we expect to create simpler application profiles or schemas to guide and serve narrower communities of practice within the wider biodiversity domain. We also plan to integrate the current work of the Taxonomic Names and Concepts Interest Group and thereby eliminate the redundancy between DarwinCore and Taxonomic Concepts Transfer Schema (TCS; Kennedy et al. 2006). At the time of this writing, we have only agreements from the authors (i.e., conveners of relevant TDWG Interest Groups and other key stakeholders) to collaborate in pursuit of these common goals. In this presentation we will give a more detailed description of our objectives and products, the methods we are using to achieve them, and our progress to date.
    Type of Medium: Online Resource
    ISSN: 2535-0897
    Language: Unknown
    Publisher: Pensoft Publishers
    Publication Date: 2019
    detail.hit.zdb_id: 3028709-1
    Location Call Number Limitation Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...