Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 401 result(s)
jPOSTrepo (Japan ProteOme STandard Repository) is a repository of sharing MS raw/processed data. It consists of a high-speed file upload process, flexible file management system and easy-to-use interfaces. Users can release their "raw/processed" data via this site with a unique identifier number for the paper publication. Users also can suspend (or "embargo") their data until their paper is published. The file transfer from users’ computer to our repository server is very fast (roughly ten times faster than usual file transfer) and uses only web browsers – it does not require installing any additional software.
CODEX is a database of NGS mouse and human experiments. Although, the main focus of CODEX is Haematopoiesis and Embryonic systems, the database includes a large variety of cell types. In addition to the publically available data, CODEX also includes a private site hosting non-published data. CODEX provides access to processed and curated NGS experiments. To use CODEX: (i) select a specialized repository (HAEMCODE or ESCODE) or choose the whole compendium (CODEX), then (ii) filter by organism and (iii) choose how to explore the database.
BioPortal is an open repository of biomedical ontologies, a service that provides access to those ontologies, and a set of tools for working with them. BioPortal provides a wide range of such tools, either directly via the BioPortal web site, or using the BioPortal web service REST API. BioPortal also includes community features for adding notes, reviews, and even mappings to specific ontologies. BioPortal has four major product components: the web application; the API services; widgets, or applets, that can be installed on your own site; and a Virtual Appliance version that is available for download or through Amazon Web Services machine instance (AMI). There is also a beta release SPARQL endpoint.
Clinical Genomic Database (CGD) is a manually curated database of conditions with known genetic causes, focusing on medically significant genetic data with available interventions.
Content type(s)
The IDR makes datasets that have never previously been accessible publicly available, allowing the community to search, view, mine and even process and analyze large, complex, multidimensional life sciences image data. Sharing data promotes the validation of experimental methods and scientific conclusions, the comparison with new data obtained by the global scientific community, and enables data reuse by developers of new analysis and processing tools.
This site provides access to complete, annotated genomes from bacteria and archaea (present in the European Nucleotide Archive) through the Ensembl graphical user interface (genome browser). Ensembl Bacteria contains genomes from annotated INSDC records that are loaded into Ensembl multi-species databases, using the INSDC annotation import pipeline.
The Yeast Resource Center Public Image Repository is a database of fluorescent microscopy images and their associated metadata/experimental parameters. The images depict the localization, co-localization and FRET (fluorescence energy transfer) of proteins in cells, particularly in the budding yeast Saccharomyces cerevisiae as a model organism. Users may download the entire datasets to improve their research.
ACEpepDB is a database ran by the Central Food Technological Research Institute. It contains records of about 865 peptides. Each record provides information on the food source, preparation, purification and any other additional information. Each record includes the reference(s). The database provides a search and browsing option for a more personalized research experience.
Human Protein Reference Database (HPRD) has been established by a team of biologists, bioinformaticists and software engineers. This is a joint project between the PandeyLab at Johns Hopkins University, and Institute of Bioinformatics, Bangalore. HPRD is a definitive repository of human proteins. This database should serve as a ready reckoner for researchers in their quest for drug discovery, identification of disease markers and promote biomedical research in general. Human Proteinpedia ( is its associated data portal.
Content type(s)
A small genotype data repository containing data used in recent papers from the Estonian Biocentre. Most of the data pertains to human population genetics. PDF files of the papers are also freely available.
The Protein Circular Dichroism Data Bank (PCDDB) provides and accepts a circular dichroism spectra data. The PCDDB and it's parent organization, the Institute of Structural and Molecular Biology (ISMB), investigate molecular structure using techniques such as biomolecular nuclear magnetic resonance, X-ray crystallography and computational structure prediction, as methods for protein production and biological characterization.
The HomoloGene database provides a system for the automated detection of homologs among annotated genes of genomes across multiple species. These homologs are fully documented and organized by homology group. HomoloGene processing uses proteins from input organisms to compare and sequence homologs, mapping back to corresponding DNA sequences.
BioModels Database is a repository of peer-reviewed, published, computational models that allows biologists to store, search and retrieve published mathematical models from the field of systems biology, but also more generally those of biological interest. Models in the database can be used to generate sub-models, can be simulated online, and can be converted between different representational formats.
The Ligand-Gated Ion Channel database provides access to information about transmembrane proteins that exist under different conformations, with three primary subfamilies: the cys-loop superfamily, the ATP gated channels superfamily, and the glutamate activated cationic channels superfamily.**The development of the Ligand-Gated Ion Channel database was started in 1994, as part of Le Novère's work on the phylogeny of those receptors' subunits. It grew into a serious data resource, that served the community at large. However, it is not actively maintained anymore. In addition, bioinformatics technology evolved a lot over the last two decades, so that scientists can now generate quickly customised databases from trustworthy primary data resources. Therefore, we decided to officialy freeze the data resource. The resource will not disappear, and all the information and links will stay there. But people should not consider it as an up-to-date trustable resource.**
PhytoPath is a new bioinformatics resource that integrates genome-scale data from important plant pathogen species with literature-curated information about the phenotypes of host infection. Using the Ensembl Genomes browser, it provides access to complete genome assembly and gene models of priority crop and model-fungal, oomycete and bacterial phytopathogens. PhytoPath also links genes to disease progression using data from the curated PHI-base resource. PhytoPath portal is a joint project bringing together Ensembl Genomes with PHI-base, a community-curated resource describing the role of genes in pathogenic infection. PhytoPath provides access to genomic and phentoypic data from fungal and oomycete plant pathogens, and has enabled a considerable increase in the coverage of phytopathogen genomes in Ensembl Fungi and Ensembl Protists. PhytoPath also provides enhanced searching of the PHI-base resource as well as the fungi and protists in Ensembl Genomes.
The modENCODE Project, Model Organism ENCyclopedia Of DNA Elements, was initiated by the funding of applications received in response to Requests for Applications (RFAs) HG-06-006, entitled Identification of All Functional Elements in Selected Model Organism Genomes and HG-06-007, entitled A Data Coordination Center for the Model Organism ENCODE Project (modENCODE). The modENCODE Project is being run as an open consortium and welcomes any investigator willing to abide by the criteria for participation that have been established for the project. Both computational and experimental approaches are being applied by modENCODE investigators to study the genomes of D. melanogaster and C. elegans. An added benefit of studying functional elements in model organisms is the ability to biologically validate the elements discovered using methods that cannot be applied in humans. The comprehensive dataset that is expected to result from the modENCODE Project will provide important insights into the biology of D. melanogaster and C. elegans as well as other organisms, including humans.
Synapse is an open source software platform that clinical and biological data scientists can use to carry out, track, and communicate their research in real time. Synapse enables co-location of scientific content (data, code, results) and narrative descriptions of that work.
PANDIT is a collection of multiple sequence alignments and phylogenetic trees. It contains corresponding amino acid and nucleotide sequence alignments, with trees inferred from each alignment. PANDIT is based on the Pfam database (Protein families database of alignments and HMMs), and includes the seed amino acid alignments of most families in the Pfam-A database. DNA sequences for as many members of each family as possible are extracted from the EMBL Nucleotide Sequence Database and aligned according to the amino acid alignment. PANDIT also contains a further copy of the amino acid alignments, restricted to the sequences for which DNA sequences were found.
BCSDB database is aimed at provision of structural, bibliographic, taxonomic and related information on bacterial carbohydrate structures. Two key points of this service are: covering - is above 90% in the scope of bacterial carbohydrates. This means the negative search answer remains valuable scientific information. And consistence - we manually check the data, and aim at hight quality error-free content. The main source of data is a retrospective literature analysis. About 25% of data were imported from CCSD (Carbbank, ceased in 1997, University of Georgia, Athens; structures published before 1995) with subsequent manual curation and approval. Current coverage is displayed in red on the top of the left menu. The time lag between publication of new data and their deposition ~ 1 year. The scope is "bacterial carbohydrates" and covers nearly all structures of this class published up to 2013. Bacterial means that a structure has been found in bacteria or obtained by modification of those found in bacteria. Carohydrate means a structure composed of any residues linked by glycosidic, ester, amidic, ketal, phospho- or sulpho-diester bonds, in which at least one residue is a sugar or its derivative.
Content type(s)
!! see caMOD Retirement Announcement !! Query the Cancer Models database for models submitted by fellow researchers. Retrieve information about the making of models, their genetic description, histopathology, derived cell lines, associated images, carcinogenic agents, and therapeutic trials. Links to associated publications and other resources are provided.
CORUM is a manually curated dataset of mammalian protein complexes. Annotation of protein complexes includes protein complex composition and other valuable information such as method of purification, cellular function of complexes or involvement in diseases.
The Human Metabolome Database (HMDB) is a freely available electronic database containing detailed information about small molecule metabolites found in the human body. It is intended to be used for applications in metabolomics, clinical chemistry, biomarker discovery and general education.
NURSA began in 2002 with the objective to accrue, develop and communicate information about the nuclear receptor superfamily. Over the last ten years, NURSA has developed a website that has developed into a comprehensive source of information about nuclear receptors, and their co-regulators, ligands, and downstream targets. Through a series of integrated 'omics-scale and informatic approaches projects, NURSA has fostered a systems biology understanding of nuclear receptor function, physiology and regulation of target gene networks in vivo.