Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 41 result(s)
Content type(s)
The EuMMCR (European Mouse Mutant cell Repository) is the mouse ES cell distribution unit in Europe. The EuMMCR unit distributes targeting vectors and mutant ES cell lines produced in the EUCOMM and EUCOMMTOOLS consortia.
Peptidome was a public repository that archived tandem mass spectrometry peptide and protein identification data generated by the scientific community. This repository is now offline and is in archival mode. All data may be obtained from the Peptidome FTP site. Due to budgetary constraints NCBI has discontinued the Peptidome Repository. All existing data and metadata files will continue to be made available from our ftp server a indefinitely. Those files are named according to their Peptidome accession number, allowing cited data to be identified and downloaded. All of the Peptidome studies have been made publicly available at the PRoteomics IDEntifications (PRIDE) database. A map of Peptidome to Pride accessions may be found at If you have any specific questions, please feel free to contact us at
Content type(s)
The CEBS database houses data of interest to environmental health scientists. CEBS is a public resource, and has received depositions of data from academic, industrial and governmental laboratories. CEBS is designed to display data in the context of biology and study design, and to permit data integration across studies for novel meta analysis.
The Tree Atlas Project is focused on the trees and shrubs of Namibia. The project gathered and recorded information about the distribution, abundance and general biology of woody plants, which are an important resource in Namibia. The information was collected over 6 years (October 1997-December 2003) and entered into a database housed at the National Botanical Research Institute.
>>>>!!!!<<<< The Cancer Genomics Hub mission is now completed. The Cancer Genomics Hub was established in August 2011 to provide a repository to The Cancer Genome Atlas, the childhood cancer initiative Therapeutically Applicable Research to Generate Effective Treatments and the Cancer Genome Characterization Initiative. CGHub rapidly grew to be the largest database of cancer genomes in the world, storing more than 2.5 petabytes of data and serving downloads of nearly 3 petabytes per month. As the central repository for the foundational genome files, CGHub streamlined team science efforts as data became as easy to obtain as downloading from a hard drive. The convenient access to Big Data, and the collaborations that CGHub made possible, are now essential to cancer research. That work continues at the NCI's Genomic Data Commons. All files previously stored at CGHub can be found there. The Website for the Genomic Data Commons is here: >>>>!!!!<<<< The Cancer Genomics Hub (CGHub) is a secure repository for storing, cataloging, and accessing cancer genome sequences, alignments, and mutation information from the Cancer Genome Atlas (TCGA) consortium and related projects. Access to CGHub Data: All researchers using CGHub must meet the access and use criteria established by the National Institutes of Health (NIH) to ensure the privacy, security, and integrity of participant data. CGHub also hosts some publicly available data, in particular data from the Cancer Cell Line Encyclopedia. All metadata is publicly available and the catalog of metadata and associated BAMs can be explored using the CGHub Data Browser.
!!!! Retirement of UniProt Metagenomic and Environmental Sequences (UniMES): UniProt has retired UniMES as there is now a resource at the EBI that is dedicated to serving metagenomic researchers. Henceforth, we recommend using the EBI Metagenomics portal instead. In addition to providing a repository of metagenomics sequence data, EBI Metagenomics allows you to view functional and taxonomic analyses and to submit your own samples for analysis. !!! The UniProt Metagenomic and Environmental Sequences (UniMES) database is a repository specifically developed for metagenomic and environmental data. We provide UniMES clusters in order to obtain complete coverage of sequence space at different resolutions.
The Bremen Core Repository - BCR, for International Ocean Discovery Program (IODP), Integrated Ocean Discovery Program (IODP), Ocean Drilling Program (ODP), and Deep Sea Drilling Project (DSDP) cores from the Atlantic Ocean, Mediterranean and Black Seas and Arctic Ocean is operated at University of Bremen within the framework of the German participation in IODP. It is one of three IODP repositories (beside Gulf Coast Repository (GCR) in College Station, TX, and Kochi Core Center (KCC), Japan). One of the scientific goals of IODP is to research the deep biosphere and the subseafloor ocean. IODP has deep-frozen microbiological samples from the subseafloor available for interested researchers and will continue to collect and preserve geomicrobiology samples for future research.
IMGT/mAb-DB provides a unique expertised resource on monoclonal antibodies (mAbs) with diagnostic or therapeutic indications, fusion proteins for immune applications (FPIA), composite proteins for clinical applications (CPCA) and relative proteins of the immune system (RPI) with clinical indications.
This site offers an enormous collection of photographs of wild species and natural history objects. It covers most groups of organisms with the exception of birds and other vertebrates. The photographs are presented to illustrate biodiversity and as an aid to identification. The criterion for inclusion of a species is that it must have been, or might be expected to be, found in Britain or Ireland. BioImages follows the biological classification. Biota is a hierarchical system with species grouped in genera, genera in families, families in orders and so on up to kingdoms and superkingdoms. The datasets are linked to bioinfo: food webs and species interactions in the Biodiversity of UK and Ireland.
MTD is focused on mammalian transcriptomes with a current version that contains data from humans, mice, rats and pigs. Regarding the core features, the MTD browses genes based on their neighboring genomic coordinates or joint KEGG pathway and provides expression information on exons, transcripts, and genes by integrating them into a genome browser. We developed a novel nomenclature for each transcript that considers its genomic position and transcriptional features.
Virtual Fly Brain (VFB) - an interactive tool for neurobiologists to explore the detailed neuroanatomy, neuron connectivity and gene expression of the adult Drosophila melanogaster brain.
It captures and catalogues ancient human genome and microbiome data, including raw sequence and processed data, along with metadata about its provenance and production. Included datasets are generated from ancient samples studied at the Australian Centre for Ancient DNA, University of Adelaide in collaboration with other research groups. Datasets and collections in OAGR are open data resources made freely available in a reusable form, using open file formats and licensed with minimal restrictions for reuse. Digital object identifiers (DOIs) are minted for included datasets and collections to facilitate persistent identification and citation.
A data repository for the storage and sharing of Adaptive Immune Receptor Repertoire data. Primary public repository for the iReceptor Platform and Scientific Gateway.
DLESE is the Digital Library for Earth System Education, a geoscience community resource that supports teaching and learning about the Earth system. It is funded by the National Science Foundation and is being built by a community of educators, students, and scientists to support Earth system education at all levels and in both formal and informal settings. Resources in DLESE include lesson plans, scientific data, visualizations, interactive computer models, and virtual field trips - in short, any web-accessible teaching or learning material. Many of these resources are organized in collections, or groups of related resources that reflect a coherent, focused theme. In many ways, digital collections are analogous to collections in traditional bricks-and-mortar libraries.
IMGT/GENE-DB is the IMGT genome database for IG and TR genes from human, mouse and other vertebrates. IMGT/GENE-DB provides a full characterization of the genes and of their alleles: IMGT gene name and definition, chromosomal localization, number of alleles, and for each allele, the IMGT allele functionality, and the IMGT reference sequences and other sequences from the literature. IMGT/GENE-DB allele reference sequences are available in FASTA format (nucleotide and amino acid sequences with IMGT gaps according to the IMGT unique numbering, or without gaps).
STRING is a database of known and predicted protein interactions. The interactions include direct (physical) and indirect (functional) associations; they are derived from four sources: - Genomic Context - High-throughput Experiments - (Conserved) Coexpression - Previous Knowledge STRING quantitatively integrates interaction data from these sources for a large number of organisms, and transfers information between these organisms where applicable.
Content type(s)
A small genotype data repository containing data used in recent papers from the Estonian Biocentre. Most of the data pertains to human population genetics. PDF files of the papers are also freely available.
dbSTS is an NCBI resource that contains sequence data for short genomic landmark sequences or Sequence Tagged Sites. STS sequences are incorporated into the STS Division of GenBank.
arrayMap is a repository of cancer genome profiling data. Original) from primary repositories (e.g. NCBI GEO, EBI ArrayExpress, TCGA) is re-processed and annotated for metadata. Unique visualisation of the processed data allows critical evaluation of data quality and genome information. Structured metadata provides easy access to summary statistics, with a focus on copy number aberrations in cancer entities.