Onderzoeksportaal

Engels

Towards "Biodiversity PMC"

Onderzoeksoutput: Bijdrage aan tijdschriftAbstract van een congres in een tijdschrift

  • Emilie Pasche
  • Donat Agosti
  • Lyubomir Penev
  • Quentin Groom
  • Alexandre Flament
  • Julien Gobeill
  • Patrick Ruch
The Swiss Institute of Bioinformatics (SIB) Literature services (SIBiLS, Gobeill et al. 2020) provides powerful search capabilities to explore the life and health sciences literature by mirroring the United States National Institute of Health's National Library of Medicine (NIH/NLM) (MEDLINE) and National Center for Biotechnology Information (NCBI) PubMed Central® contents.In the course of the BiCIKL project, SIBiLS started indexing a larger set of biodiversity-related contents in the broad sense including environmental sciences and ecology, to build a new literature database called "Biodiversity PMC". In addition to MEDLINE and PubMed Central, SIBiLS is now providing a unique entry point to half a million taxonomic treatments extracted by Plazi, as well as to a growing set of full-text article XMLs from Pensoft, which were not included into the original PubMed Central. The services can be accessed via a new Graphic User Inteface and an OpenAPI. In addition to usual search operators (using the Apache Lucene syntax), the contents are normalized using a large collection of life sciences terminologies and ontologies. Each instance of a term (or its synonym) is normalized with a unique accession number to support a semantically richer search experience. Of particular interest for the biodiversity communities, SIBiLS contents are normalized using ENVO (Environmental Ontology). Further, taxonomic names are normalized using both the NCBI Taxonomy and the Open Tree of Life, which include names from the Catalogue of Life. The resulting data graph contains 12 billion normalized descriptors and supports access via keyword search, as well as via an original question answering interface, which can help provide new perspectives when navigating the life and health sciences. The data (Journal Publishing Tag Set, JATS, and BioC) are fully available under CC-BY 4.0 licences.
Originele taal-2Engels
TijdschriftBiodiversity Information Science and Standards
Volume7
Pagina's (van-tot)e111647
DOI's
StatusGepubliceerd - 28-aug.-2023
Inloggen in Pure