  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
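The operators listed above can be combined in a single query. The following is a minimal sketch that only assembles query strings; the helper names (prefix, phrase, fuzzy, all_of, any_of, exclude) are hypothetical, it does not contact any search service, and how a given backend interprets a combined query is an assumption to verify against the live search box.

```python
from typing import Optional


def prefix(word: str) -> str:
    """Wildcard search: '*' at the end of a keyword."""
    return f"{word}*"


def phrase(text: str, slop: Optional[int] = None) -> str:
    """Phrase search in double quotes, optionally followed by ~N (slop)."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"


def fuzzy(word: str, distance: int) -> str:
    """Fuzzy search: ~N after a word sets the edit distance."""
    return f"{word}~{distance}"


def exclude(term: str) -> str:
    """NOT operation: '-' in front of a term."""
    return f"-{term}"


def all_of(*terms: str) -> str:
    """AND search, grouped in parentheses to make the priority explicit."""
    return "(" + " + ".join(terms) + ")"


def any_of(*terms: str) -> str:
    """OR search, grouped in parentheses to make the priority explicit."""
    return "(" + " | ".join(terms) + ")"


# Illustrative query: (a wildcard term OR a phrase with slop 2) AND NOT "plant".
query = all_of(
    any_of(prefix("metabol"), phrase("organelle genome", slop=2)),
    exclude("plant"),
)
print(query)  # ((metabol* | "organelle genome"~2) + -plant)
```

The resulting string can be pasted into the search field as-is. Grouping every AND/OR combination in parentheses is a deliberate choice here, so the priority never depends on the backend's default operator precedence.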
Found 27 result(s)
The RAMEDIS system is a platform-independent, web-based information system for rare metabolic diseases based on filed case reports. It was developed in close cooperation with clinical partners to allow them to collect information on rare metabolic diseases in extensive detail, e.g. about occurring symptoms, laboratory findings, therapy and molecular data.
The Plant Metabolic Network (PMN) provides a broad network of plant metabolic pathway databases that contain curated information from the literature and computational analyses about the genes, enzymes, compounds, reactions, and pathways involved in primary and secondary metabolism in plants. The PMN currently houses one multi-species reference database called PlantCyc and 22 species/taxon-specific databases.
The Organelle Genome Megasequencing Program (OGMP) provides mitochondrial, chloroplast, and mitochondrial plasmid genome data. OGMP tools allow direct comparison of OGMP and NCBI validated records. It includes GOBASE, a taxonomically broad organelle genome database that organizes and integrates diverse data related to mitochondria and chloroplasts.
The Database of Genomic Variants archive (DGVa) provides curated archiving and distribution of publicly available genomic structural variants. Direct submissions are accepted as well as published data. The DGVa is the primary supplier of data to the Database of Genomic Variants (DGV), hosted by The Centre for Applied Genomics in Toronto, Canada.
The Benchmark Energy & Geometry Database (BEGDB) collects results of highly accurate QM calculations of molecular structures, energies and properties. These data can serve as benchmarks for testing and parameterization of other computational methods.
The JCB DataViewer is an image hosting and presentation platform for original image datasets associated with articles published in The Journal of Cell Biology, a peer-reviewed journal from the Rockefeller University Press.
GOBASE is a taxonomically broad organelle genome database that organizes and integrates diverse data related to mitochondria and chloroplasts. GOBASE is currently expanding to include information on representative bacteria that are thought to be specifically related to the bacterial ancestors of mitochondria and chloroplasts.
This library is a public and easily accessible resource database of images, videos, and animations of cells, capturing a wide diversity of organisms, cell types, and cellular processes. The Cell Image Library was merged with the "Cell Centered Database" in 2017. The purpose of the database is to advance research on cellular activity, with the ultimate goal of improving human health.
BiGG is a knowledgebase of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different organisms. BiGG can be used to browse model content, visualize metabolic pathway maps, and export SBML files of the models for further analysis by external software packages. Users may follow links from BiGG to several external databases to obtain additional information on genes, proteins, reactions, metabolites and citations of interest.
The Database contains all publicly available HMS LINCS datasets and information for each dataset about experimental reagents (small molecule perturbagens, cells, antibodies, and proteins) and experimental and data analysis protocols.
The Virtual Liver Network (VLN) represents a major research investment by the German Government focusing on work at the “bleeding edge” of Systems Biology and Systems Medicine. This Flagship Programme is tackling one of the major challenges in the life sciences: that is, how to integrate the wealth of data we have acquired post-genome, not just in a mathematical model, but more importantly in a series of models that are linked across scales to represent organ function. As the project is prototyping how to achieve true multi-scale modelling within a single organ and linking this to human physiology, it will be developing tools and protocols that can be applied to other systems, helping to drive forward the application of modelling and simulation to modern medical practice. It is the only programme of its type to our knowledge that bridges investigations from the sub-cellular through to ethically cleared patient and volunteer studies in an integrated workflow. As such, this programme is contributing significantly to the development of a new paradigm in biology and medicine.
Genome resource samples of wild animals, particularly those of endangered mammalian and avian species, are very difficult to collect. In Korea, many of these animals, such as tigers, leopards, bears, wolves, foxes, gorals, and river otters, either became extinct before Korean biologists had the opportunity to study them or are near extinction. Therefore, a proposal for the systematic collection and preservation of genetic samples of these precious animals was adopted by the Korea Science & Engineering Foundation (KOSEF). As an outcome, the Conservation Genome Resource Bank for Korean Wildlife (CGRB) was established in 2002 at the College of Veterinary Medicine, Seoul National University, as one of the Special Research Materials Banks supported by the Scientific and Research Infrastructure Building Program of KOSEF. CGRB operates in collaboration with Seoul Grand Park Zoo, managed by the Seoul Metropolitan Government, and has offices and laboratories at both Seoul National University and Seoul Grand Park, where duplicate samples are maintained, thereby assuring long-term, safe preservation of the samples. Thus, CGRB is the first example of a collaborative scientific infrastructure program between a university and a zoo in Korea.
Recode2 is a database of genes that utilize non-standard translation for gene expression purposes. Recoding events described in the database include programmed ribosomal frameshifting, translational bypassing (aka hopping) and mRNA specific codon redefinition. Frameshifting at a particular site often yields two protein products from one coding sequence and sometimes serves a regulatory purpose by acting as a sensor of the level of product protein or of some external ligand. Bypassing (hopping) allows the coupling of two ORFs separated on an mRNA by a coding gap. Codon redefinition occurs when a stop codon is decoded as a standard amino acid (often glutamine or tryptophan), or the 21st amino acid selenocysteine. These recoding events are in competition with standard decoding and are site specific. The efficiency of recoding is often modulated by cis-stimulators and sometimes by trans-factors. The sequences of the genes that use recoding for their expression are in the database. The recoding sites and the known stimulatory signals are annotated in the database together with notes on factors that are known to affect recoding efficiencies.
Complete Genomics provides free public access to a variety of whole human genome data sets generated from Complete Genomics’ sequencing service. The research community can explore and familiarize themselves with the quality of these data sets, review the data formats provided from our sequencing service, and augment their own research with additional summaries of genomic variation across a panel of diverse individuals. The quality of these data sets is representative of what a customer can expect to receive for their own samples. This public genome repository comprises genome results from both our Standard Sequencing Service (69 standard, non-diseased samples) and the Cancer Sequencing Service (two matched tumor and normal sample pairs). In March 2013 Complete Genomics was acquired by BGI-Shenzhen, the world’s largest genomics services company. BGI is a company headquartered in Shenzhen, China, that provides comprehensive sequencing and bioinformatics services for commercial science, medical, agricultural and environmental applications. Complete Genomics is now focused on building a new generation of high-throughput sequencing technology and developing new and exciting research, clinical and consumer applications.
MassBank of North America (MoNA) is a metadata-centric, auto-curating repository designed for efficient storage and querying of mass spectral records. It is intended to serve as the framework for a centralized, collaborative database of metabolite mass spectra, metadata and associated compounds. MoNA currently contains over 200,000 mass spectral records from experimental and in-silico libraries as well as from user contributions.
The Toxin and Toxin Target Database (T3DB) is a bioinformatics resource that combines detailed toxin data with comprehensive toxin target information. The focus of the T3DB is on providing mechanisms of toxicity and target proteins for each toxin. This dual nature of the T3DB, in which toxin and toxin target records are interactively linked in both directions, distinguishes it from existing databases.
EcoGene is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations.
PDBe is the European resource for the collection, organisation and dissemination of data on biological macromolecular structures. In collaboration with the other worldwide Protein Data Bank (wwPDB) partners - the Research Collaboratory for Structural Bioinformatics (RCSB) and BioMagResBank (BMRB) in the USA and the Protein Data Bank of Japan (PDBj) - we work to collate, maintain and provide access to the global repository of macromolecular structure data. We develop tools, services and resources to make structure-related data more accessible to the biomedical community.
The 1000 Genomes Project is an international collaboration to produce an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts. This resource will support genome-wide association studies and other medical research studies. The genomes of about 2500 unidentified people from about 25 populations around the world will be sequenced using next-generation sequencing technologies. The results of the study will be freely and publicly accessible to researchers worldwide. The International Genome Sample Resource (IGSR) has been established at EMBL-EBI to continue supporting data generated by the 1000 Genomes Project, supplemented with new data and new analysis.
NetSlim is a resource of high-confidence signaling pathway maps derived from NetPath pathway reactions. 40-60% of the molecules and their reactions in NetPath pathways are available in NetSlim.
Pathway Commons is a convenient point of access to biological pathway information collected from public pathway databases. Information is sourced from public pathway databases and is readily searched, visualized, and downloaded. The data is freely available under the license terms of each contributing database.
The Yeast Resource Center provides access to data about mass spectrometry, yeast two-hybrid arrays, deconvolution fluorescence microscopy, protein structure prediction and computational biology. These services are provided to further the goal of a complete understanding of the chemical interactions required for the maintenance and faithful reproduction of a living cell. The observation that the fundamental biological processes of yeast are conserved among all eukaryotes ensures that this knowledge will shape and advance our understanding of living systems.
KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies.