  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
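As a rough illustration of how the operators above can be combined, the following Python sketch assembles a query string. The helper names are hypothetical and not part of any site API, and the search terms are made up. It also assumes that a NOT clause can simply be appended after a space, which the help text does not state.

```python
# Hypothetical helpers that only build query strings from the operators listed above.
# They do not contact any search service.

def phrase(text: str) -> str:
    """Wrap text in double quotes for a phrase search."""
    return f'"{text}"'

def prefix(stem: str) -> str:
    """Append * to a keyword for a wildcard search."""
    return f"{stem}*"

def fuzzy(word: str, distance: int) -> str:
    """Append ~N to a word to set the edit distance (fuzziness)."""
    return f"{word}~{distance}"

def sloppy(text: str, slop: int) -> str:
    """Append ~N to a quoted phrase to set the slop amount."""
    return f"{phrase(text)}~{slop}"

def all_of(*terms: str) -> str:
    """Join terms with + (AND)."""
    return " + ".join(terms)

def any_of(*terms: str) -> str:
    """Join terms with | (OR)."""
    return " | ".join(terms)

def exclude(term: str) -> str:
    """Prefix a term with - (NOT)."""
    return f"-{term}"

def group(expression: str) -> str:
    """Wrap an expression in parentheses to give it priority."""
    return f"({expression})"

# (genom* OR "protein sequence") AND enzyme within edit distance 1, NOT plant
positive = all_of(
    group(any_of(prefix("genom"), phrase("protein sequence"))),
    fuzzy("enzyme", 1),
)
query = f"{positive} {exclude('plant')}"
print(query)  # (genom* | "protein sequence") + enzyme~1 -plant
```

Keeping each operator in its own small function makes it easy to see which part of the final string comes from which rule in the list.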
Found 2773 result(s)
BRENDA is the main collection of enzyme functional data available to the scientific community worldwide. The enzymes are classified according to the Enzyme Commission list of enzymes. It is available free of charge via the internet and as an in-house database for commercial users (requests to our distributor Biobase). Some 5000 "different" enzymes are covered. Frequently, enzymes with very different properties are included under the same EC number. BRENDA includes biochemical and molecular information on classification, nomenclature, reaction, specificity, functional parameters, occurrence, enzyme structure, application, engineering, stability, disease, isolation, and preparation. The database also provides additional information on ligands, which function as natural or in vitro substrates/products, inhibitors, activating compounds, cofactors, bound metals, and other attributes.
CBS offers comprehensive public databases of DNA and protein sequences, macromolecular structure, gene and protein expression levels, pathway organization, and cell signalling, established to optimise scientific exploitation of the explosion of data within biology. Unlike many other groups in the field of biomolecular informatics, the Center for Biological Sequence Analysis directs its research primarily towards topics related to the elucidation of the functional aspects of complex biological mechanisms. Among contemporary bioinformatics concerns are reliable computational interpretation of a wide range of experimental data, and the detailed understanding of the molecular apparatus behind cellular mechanisms of sequence information. By exploiting available experimental data and evidence in the design of algorithms, sequence correlations and other features of biological significance can be inferred. In addition to the computational research, the center also has experimental efforts in gene expression analysis using DNA chips and data generation in relation to the physical and structural properties of DNA. In the last decade, the Center for Biological Sequence Analysis has produced a large number of computational methods, which are offered to others via WWW servers.
GovData, the data portal for Germany, offers consistent and central access to administrative data at the federal, state, and local level. The objective is to make data more available and easier to use at a single location. As set out in the concept of "open data", we attempt to facilitate the use of open licenses and to increase the supply of machine-readable raw data.
The National Deep Submergence Facility (NDSF) operates the Human Occupied Vehicle (HOV) Alvin, the Remotely Operated Vehicle (ROV) Jason 2, and the Autonomous Underwater Vehicle (AUV) Sentry. Data acquired with these platforms is provided both to the science party on each expedition and to the Woods Hole Oceanographic Institution (WHOI) Data Library.
dbEST is a division of GenBank that contains sequence data and other information on "single-pass" cDNA sequences, or "Expressed Sequence Tags", from a number of organisms. Expressed Sequence Tags (ESTs) are short (usually about 300-500 bp), single-pass sequence reads from mRNA (cDNA). Typically they are produced in large batches. They represent a snapshot of genes expressed in a given tissue and/or at a given developmental stage. They are tags (some coding, others not) of expression for a given cDNA library. Most EST projects develop large numbers of sequences. These are commonly submitted to GenBank and dbEST as batches of dozens to thousands of entries, with a great deal of redundancy in the citation, submitter and library information. To improve the efficiency of the submission process for this type of data, we have designed a special streamlined submission process and data format. dbEST also includes sequences that are longer than the traditional ESTs, or are produced as single sequences or in small batches. Among these sequences are products of differential display experiments and RACE experiments. The thing that these sequences have in common with traditional ESTs, regardless of length, quality, or quantity, is that there is little information that can be annotated in the record. If a sequence is later characterized and annotated with biological features such as a coding region, 5'UTR, or 3'UTR, it should be submitted through the regular GenBank submissions procedure (via BankIt or Sequin), even if part of the sequence is already in dbEST. dbEST is reserved for single-pass reads. Assembled sequences should not be submitted to dbEST. GenBank will accept assembled EST submissions for the forthcoming TSA (Transcriptome Shotgun Assembly) division. The individual reads which make up the assembly should be submitted to dbEST, the Trace archive or the Short Read Archive (SRA) prior to the submission of the assemblies.
UniProtKB/Swiss-Prot is the manually annotated and reviewed section of the UniProt Knowledgebase (UniProtKB). It is a high-quality, annotated, and non-redundant protein sequence database, which brings together experimental results, computed features, and scientific conclusions. Since 2002, it has been maintained by the UniProt consortium and is accessible via the UniProt website.
The Khazar University Institutional Repository (KUIR), a suite of services offered by the Library Information Center, is an institutional repository maintained to support the university's researchers, collaborators, and students. Repository content consists of collections of research materials in digital format produced and selected by Khazar University faculty and their collaborators.
The NCAA Student-Athlete Experiences Data Archive provides access to data about student-athletes and will grow to include a handful of user-friendly data collections related to graduation rates; team-level Academic Progress Rates in Division I; and individual-level data on the experiences of current and former student-athletes from the NCAA's Growth, Opportunities, Aspirations and Learning of Students in college study (GOALS), and the Study of College Outcomes and Recent Experiences (SCORE). In the long run, the NCAA expects to follow this initial release with the publication of as much data as possible from its archives. The data is used by college presidents, athletic personnel, faculty, student-athlete groups, media members, and researchers in looking at issues related to intercollegiate athletics and higher education.
The India Water Portal is a web-based platform for sharing water management knowledge in India amongst practitioners and the general public. The included datasets can be browsed by data type, location, time, and other metadata. Data include rainfall, watersheds, groundwater, water quality, and irrigation.
The Scholarly Database (SDB) at Indiana University aims to serve researchers and practitioners interested in the analysis, modeling, and visualization of large-scale scholarly datasets. The online interface provides access to six datasets: MEDLINE papers, registered Clinical Trials, U.S. Patent and Trademark Office patents (USPTO), National Science Foundation (NSF) funding, National Institutes of Health (NIH) funding, and National Endowment for the Humanities funding – over 26 million records in total.
Xanthobase provides information on Xanthomonas oryzae pv. oryzae (Xoo), the rice (Oryza sativa) pathogenic bacterium in which genome sequencing has revealed very extensive race differentiation. The whole genome sequence of its native host has also been completed, and analysis of the host-parasite interaction on the basis of the two genomes can be expected to be useful.
The NADP monitors precipitation chemistry from numerous sites around the United States. The NADP consists of 5 networks: National Trends Network, Mercury Deposition Network, Atmospheric Integrated Research Monitoring Network, Atmospheric Mercury Network, and Ammonia Monitoring Network. Data is provided by each network.
Gramene is a platform for comparative genomic analysis of agriculturally important grasses, including maize, rice, sorghum, wheat and barley. Relationships between cereals are queried and displayed using controlled vocabularies (Gene, Plant, Trait, Environment, and Gramene Taxonomy) and web-based displays, including the Genes and Quantitative Trait Loci (QTL) modules.
Reactome is a manually curated, peer-reviewed pathway database, annotated by expert biologists and cross-referenced to bioinformatics databases. Its aim is to share information in the visual representations of biological pathways in a computationally accessible format. Pathway annotations are authored by expert biologists, in collaboration with Reactome editorial staff and cross-referenced to many bioinformatics databases. These include NCBI Gene, Ensembl and UniProt databases, the UCSC and HapMap Genome Browsers, the KEGG Compound and ChEBI small molecule databases, PubMed, and Gene Ontology.
The Lunar Orbiter Photographic Atlas of the Moon by Bowker and Hughes (NASA SP-206) is considered the definitive reference manual to the global photographic coverage of the Moon. The images contained within the atlas are excellent for studying lunar morphology because they were obtained at low to moderate Sun angles. The digital Lunar Orbiter Atlas of the Moon is a reproduction of the 675 plates contained in Bowker and Hughes. The digital archive, however, offers many improvements upon its original hardbound predecessor. Multiple search capabilities were added to the database to expedite locating images and features of interest. For accuracy and usability, surface feature information has been updated and improved. Lastly, to aid in feature identification, a companion image containing feature annotation has been included. The symbols on the annotated overlays, however, should only be used as locators and not for precise measurements. More detailed information about the digital archive process can be read in abstracts presented at the 30th and 31st Lunar and Planetary Science Conferences.
The EVIA Digital Archive Project is a repository of ethnographic video recordings and an infrastructure of tools and systems supporting scholars in the ethnographic disciplines. The project focuses on the fields of ethnomusicology, folklore, anthropology, and dance ethnology.
The HUGO Gene Nomenclature Committee (HGNC) has assigned unique gene symbols and names to over 35,000 human loci, of which around 19,000 are protein coding. This curated online repository of HGNC-approved gene nomenclature and associated resources includes links to genomic, proteomic, and phenotypic information, as well as dedicated gene family pages.
Note: This database no longer exists (as of 2017-09-05). BeetleBase is a comprehensive sequence database and important community resource for Tribolium genetics, genomics, and developmental biology. It provides genetic data on Tribolium castaneum, the red flour beetle, such as gene maps, the official gene set, reference sequences, predicted models, and whole-genome tiling arrays representing developmental stages.
The Organelle Genome Megasequencing Program (OGMP) provides mitochondrial, chloroplast, and mitochondrial plasmid genome data. OGMP tools allow direct comparison of OGMP and NCBI validated records. Includes GOBASE, a taxonomically broad organelle genome database that organizes and integrates diverse data related to mitochondria and chloroplasts.
Chempound is a new-generation repository architecture based on RDF, semantic dictionaries, and linked data. It has been developed to hold any type of chemical object expressible in CML and is exemplified by crystallographic experiments and computational chemistry calculations. In both examples, the repository can hold >50k entries, which can be searched via SPARQL endpoints and pre-indexing of key fields. The Chempound architecture is general and adaptable to other fields of data-rich science. The Chempound software is available under the Apache License, Version 2.0.
The project analyzes educational processes in Germany from early childhood to late adulthood. The National Educational Panel Study (NEPS) has been set up to find out more about the acquisition of education in Germany, to plot the consequences of education for individual biographies, and to describe central educational processes and trajectories across the entire life span. An interdisciplinary consortium of research institutes, research groups, and individual researchers has been assembled in Bamberg. In addition, the competencies and experiences with longitudinal research available at numerous other locations have been networked to form a cluster of excellence.
The Animal Quantitative Trait Loci (QTL) database (Animal QTLdb) is designed to house all publicly available QTL and trait mapping data (i.e. trait and genome location association data; collectively called "QTL data" on this site) on livestock animal species, so that the data can be easily located and compared within and between species. New database tools are continually added to align the QTL and association data to other types of genome information, such as annotated genes, RH / SNP markers, and human genome maps. Besides the QTL data from species listed below, the QTLdb is open to house QTL/association data from other animal species where feasible. Note that the JAS, along with other journals, now requires that new QTL/association data be entered into a QTL database as part of its publication requirements.
The Plant Metabolic Network (PMN) provides a broad network of plant metabolic pathway databases that contain curated information from the literature and computational analyses about the genes, enzymes, compounds, reactions, and pathways involved in primary and secondary metabolism in plants. The PMN currently houses one multi-species reference database called PlantCyc and 22 species/taxon-specific databases.