  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
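The operators listed above can be combined into a single query string. The following is a minimal sketch that only composes such strings; the helper names (`prefix`, `phrase`, `fuzzy`, `sloppy`, `all_of`, `any_of`, `none_of`) are hypothetical and not part of any site API, and how the search backend parses the result is an assumption based solely on the list above.

```python
# Hypothetical helpers that build query strings from the operators listed above.
# They perform plain string composition; nothing here contacts a search service.

def prefix(word: str) -> str:
    """Wildcard search: * at the end of a keyword."""
    return f"{word}*"

def phrase(text: str) -> str:
    """Phrase search: wrap the text in double quotes."""
    return f'"{text}"'

def fuzzy(word: str, n: int) -> str:
    """Fuzzy search: ~N after a word sets the edit distance."""
    return f"{word}~{n}"

def sloppy(text: str, n: int) -> str:
    """Sloppy phrase search: ~N after a phrase sets the slop amount."""
    return f'"{text}"~{n}'

def all_of(*terms: str) -> str:
    """AND search: join terms with +."""
    return " + ".join(terms)

def any_of(*terms: str) -> str:
    """OR search: join terms with | and group them with parentheses."""
    return "(" + " | ".join(terms) + ")"

def none_of(term: str) -> str:
    """NOT operation: prefix the term with -."""
    return f"-{term}"

# Example: records matching "genom*" AND either the phrase "plant names"
# or a word within one edit of "taxonomy".
query = all_of(prefix("genom"), any_of(phrase("plant names"), fuzzy("taxonomy", 1)))
print(query)
```

Keeping each operator in its own small function makes it easier to see which part of a long query carries which meaning, and the parentheses added by `any_of` keep the OR group from being split by the surrounding AND.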
Found 97 result(s)
CiteSeerx is an evolving scientific literature digital library and search engine that focuses primarily on the literature in computer and information science. CiteSeerx aims to improve the dissemination of scientific literature and to provide improvements in functionality, usability, availability, cost, comprehensiveness, efficiency, and timeliness in the access of scientific and scholarly knowledge. Rather than creating just another digital library, CiteSeerx attempts to provide resources such as algorithms, data, metadata, services, techniques, and software that can be used to promote other digital libraries. CiteSeerx has developed new methods and algorithms to index PostScript and PDF research articles on the Web.
The database contains all the variants published as pathogenic mutations in the international literature up to November 2007. In addition, unpublished Usher mutations and non-pathogenic variants from the laboratory of Montpellier have been included.
Plants of TAIWAN includes digitized plant specimens and historical botanical literature of Taiwan, and a database of plant names and information for about 5000 species in Taiwan.
dictyBase is an integrated genetic and literature database that contains published Dictyostelium discoideum literature, genes, expressed sequence tags (ESTs), as well as the chromosomal and mitochondrial genome sequences. It provides direct access to the genome browser, a BLAST search tool, the Dictyostelium Stock Center, research tools, and colleague databases. dictyBase is a genome portal for the Amoebozoa and is funded by a grant from the National Institute of General Medical Sciences.
The repository is no longer available. TOXNET has moved. Most content will continue to be collected and reviewed; selected information is accessible through PubChem, PubMed, and Bookshelf. If you have questions, please contact NLM Customer Support.
BioModels is a repository of mathematical models of biological and biomedical systems. It hosts a vast selection of existing literature-based physiologically and pharmaceutically relevant mechanistic models in standard formats. Our mission is to provide the systems modelling community with reproducible, high-quality, freely-accessible models published in the scientific literature.
The Répertoire International des Sources Musicales (RISM) is an international, non-profit organization with the aim of comprehensively documenting extant musical sources anywhere in the world. Cataloging musical sources is financed and carried out by various national and international institutions. Independent national working groups at libraries and archives in many countries worldwide catalog historical musical sources: music prints, music manuscripts, libretti, and theoretical writings about music. The results are edited and published by RISM. RISM documents what exists and where it is kept. RISM's database offers the most comprehensive documentation available for music manuscripts and printed music for the time between 1600 and 1800. It continues to grow through monthly updates and averages around 30,000 new records annually. This online publication is made possible through a partnership between the Bavarian State Library (Munich), the State Library of Berlin, and RISM. The RISM Zentralredaktion is a project of the Academy of Science and Literature, Mainz. More information can be found on the RISM website.
The Autism Chromosome Rearrangement Database is a collection of hand-curated breakpoints and other genomic features related to autism, taken from publicly available literature, databases and unpublished data. The database is continuously updated with information from in-house experimental data as well as data from published research studies.
NONCODE is an integrated knowledge database dedicated to non-coding RNAs (excluding tRNAs and rRNAs). There are now 16 species in NONCODE (human, mouse, cow, rat, chicken, fruit fly, zebrafish, C. elegans, yeast, Arabidopsis, chimpanzee, gorilla, orangutan, rhesus macaque, opossum and platypus). The sources of NONCODE include literature and other public databases. We searched PubMed using the keywords ‘ncrna’, ‘noncoding’, ‘non-coding’, ‘no code’, ‘non-code’, ‘lncrna’ or ‘lincrna’. We retrieved the newly identified lncRNAs and their annotations from the Supplementary Material or websites of these articles. These, together with the newest data from Ensembl, RefSeq, lncRNAdb and GENCODE, were processed through a standard pipeline for each species.
The Blue Obelisk Data Repository lists many important chemoinformatics data such as element and isotope properties, atomic radii, etc. including references to original literature. Developers can use this repository to make their software interoperable.
2019-05-15: LogKOW is offline. A databank of experimental data about partition coefficients, retrieved from the literature, on over 20,000 organic compounds (carbon numbers 1 - 130; gases, liquids and solids).
BsubCyc is a model-organism database for the bacterium Bacillus subtilis and is based on the updated B. subtilis 168 genome sequence and annotation published by Barbe et al. in 2009. Gene function annotations are updated as new literature becomes available. Subscriptions are now required to access BsubCyc.
PalDat provides a large amount of data from a variety of plant families. Each data entry ideally includes a detailed description of the pollen grain, images of each pollen grain (LM, SEM and TEM), images of the plant/inflorescence/flower and relevant literature.
The BioProject database is a searchable collection of complete and incomplete (in-progress) large-scale molecular projects including genome sequencing and assembly, transcriptome, metagenomic, annotation, expression and mapping projects. BioProject provides a central point to link to all data associated with a project in the NCBI molecular and literature databases.
Citrination is the premier open database and analytics platform for the world's material and chemical information. Here you can find tabulated materials property data that users have contributed or that Citrine has automatically extracted from the literature.
Fondo Antiguo is part of UVaDOC Repositorio Documental de la Universidad de Valladolid. It contains ancient printed documents.
The repository of the Donders Institute for Brain, Cognition and Behaviour at the Radboud University is used to manage, share and publish neuroscience and neuroimaging data, including MRI, EEG, MEG and other types of research data.
The goal of the NeuroElectro Project is to extract information about the electrophysiological properties (e.g. resting membrane potentials and membrane time constants) of diverse neuron types from the existing literature and place it into a centralized database.
HADb provides a complete and up-to-date list of human genes and proteins involved directly or indirectly in autophagy, as described in the literature.
ConsensusPathDB integrates interaction networks in humans (and in the model organisms - yeast and mouse) including binary and complex protein-protein, genetic, metabolic, signaling, gene regulatory and drug-target interactions, as well as biochemical pathways. Data originate from public resources for interactions and interactions curated from the literature. The interaction data are integrated in a complementary manner to avoid redundancies.
Collection of maps showing reconstructions of routes and paths through Rome described in Renaissance guidebooks and antiquarian literature.