Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 106 result(s)
PDBe is the European resource for the collection, organisation and dissemination of data on biological macromolecular structures. In collaboration with the other worldwide Protein Data Bank (wwPDB) partners - the Research Collaboratory for Structural Bioinformatics (RCSB) and BioMagResBank (BMRB) in the USA and the Protein Data Bank of Japan (PDBj) - we work to collate, maintain and provide access to the global repository of macromolecular structure data. We develop tools, services and resources to make structure-related data more accessible to the biomedical community.
iRefWeb is an interface to a relational database containing the latest build of the interaction Reference Index (iRefIndex) which integrates protein interaction data from ten different interaction databases: BioGRID, BIND, CORUM, DIP, HPRD, INTACT, MINT, MPPI, MPACT and OPHID.
The repository facilitates computation of a wide range of biosystem data. It also connects biosystem data with associated literature throughout the Entrez system.
>>>!!! <<< The Epigenomics database was retired on June 1, 2016. All epigenomics data are available in our GEO resource >>> !!! <<< The Epigenomics database provides genomics maps of stable and reprogrammable nuclear changes that control gene expression and influence health. Users can browse current epigenomic experiments as well as search, compare and browse samples from multiple biological sources in gene-specific contexts. Many epigenomes contain modifications with histone marks, DNA methylation and chromatin structure activity. NCBI Epigenomics database contains datasets from the NIH Roadmap Epigenomics Project.
FlowRepository is a web-based application accessible from a web browser that serves as an online database of flow cytometry experiments where users can query and download data collected and annotated according to the MIFlowCyt standard. It is primarily used as a data deposition place for experimental findings published in peer-reviewed journals in the flow cytometry field. FlowRepository is funded by the International Society for Advancement of Cytometry (ISAC) and powered by the Cytobank engine specifically extended for the purposes of this repository. FlowRepository has been developed by forking and extending Cytobank in 2011.
The Comparative RNA Web (CRW) Site disseminates information about RNA structure and evolution that has been determined using comparative sequence analysis. We present both raw (sequences, structure models, metadata) and processed (analyses, evolution, accuracy) data, organized into four main sections.
The miRBase database is a searchable database of published miRNA sequences and annotation. Each entry in the miRBase Sequence database represents a predicted hairpin portion of a miRNA transcript (termed mir in the database), with information on the location and sequence of the mature miRNA sequence (termed miR). Both hairpin and mature sequences are available for searching and browsing, and entries can also be retrieved by name, keyword, references and annotation. All sequence and annotation data are also available for download. The miRBase Registry provides miRNA gene hunters with unique names for novel miRNA genes prior to publication of results.
>>>!!!<<<As stated 2017-05-23 Cancer GEnome Mine is no longer available >>>!!!<<< Cancer GEnome Mine is a public database for storing clinical information about tumor samples and microarray data, with emphasis on array comparative genomic hybridization (aCGH) and data mining of gene copy number changes.
The Pig Expression Data Explorer (PEDE) database system stores full-length cDNA libraries of swine data accesible via keyword and ID searches. Data is publically available, and may specifically interest genetic researchers interested in disease sucsceptibly, and major and minor porcine specific antigens.
SWATHAtlas is a repository of mass spectrometry data of the human proteome. The repository provides open access to libraries of SWATH-MS (Sequential Windowed Acquisition of All Theoretical Fragment Ion Mass Spectra) datasets. SWATH-MS is a method which combines both data-independent acquisition (DIA) and targeted data analysis techniques for the collection and storage of fragmentation spectra of peptides. Compared to techniques of selected reaction monitoring (SRM), SWATH-MS allows for a more extensive throughput of proteins in a sample to be targeted. The spectra collected in SWATHAtlas can be interpreted with the help of software such as OpenSWATH or Peakview.
The goal of the Autophagy Database is to provide up-to-date relevant information including protein structure data to researchers of autophagy, and to disseminate important findings to a wider audience so that their ramifications can be appreciated. For this purpose, we strive to make the database to contain as much pertinent information as possible and to make the contents freely available in a user-friendly format.
Pathway Commons is a convenient point of access to biological pathway information collected from public pathway databases. Information is sourced from public pathway databases and is readily searched, visualized, and downloaded. The data is freely available under the license terms of each contributing database.
The MEROPS database is an information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them.
The Yeast Resource Center provides access to data about mass spectrometry, yeast two-hybrid arrays, deconvolution florescence microscopy, protein structure prediction and computational biology. These services are provided to further the goal of a complete understanding of the chemical interactions required for the maintenance and faithful reproduction of a living cell. The observation that the fundamental biological processes of yeast are conserved among all eukaryotes ensures that this knowledge will shape and advance our understanding of living systems.
Content type(s)
Country is a database integrating physical (protein-protein) and functional interactions within the context of an E. coli knowledgebase.
This is CSDB version 1 merged from Bacterial (BCSDB) and Plant&Fungal (PFCSDB) databases. This database aims at provision of structural, bibliographic, taxonomic, NMR spectroscopic and other information on glycan and glycoconjugate structures of prokaryotic, plant and fungal origin. It has been merged from the Bacterial and Plant&Fungal Carbohydrate Structure Databases (BCSDB+PFCSDB). The key points of this service are: High coverage. The coverage for bacteria (up to 2016) and archaea (up to 2016) is above 80%. Similar coverage for plants and fungi is expected in the future. The database is close to complete up to 1998 for plants, and up to 2006 for fungi. Data quality. High data quality is achieved by manual curation using original publications which is assisted by multiple automatic procedures for error control. Errors present in publications are reported and corrected, when possible. Data from other databases are verified on import. Detailed annotations. Structural data are supplied with extended bibliography, assigned NMR spectra, taxon identification including strains and serogroups, and other information if available in the original publication. Services. CSDB serves as a platform for a number of computational services tuned for glycobiology, such as NMR simulation, automated structure elucidation, taxon clustering, 3D molecular modeling, statistical processing of data etc. Integration. CSDB is cross-linked to other glycoinformatics projects and NCBI databases. The data are exportable in various formats, including most widespread encoding schemes and records using GlycoRDF ontology. Free web access. Users can access the database for free via its web interface (see Help). The main source of data is retrospective literature analysis. About 20% of data were imported from CCSD (Carbbank, University of Georgia, Athens; structures published before 1996) with subsequent manual curation and approval. The current coverage is displayed in red on the top of the left menu. The time lag between the publication of new data and their deposition into CSDB is ca. 1 year. In the scope of bacterial carbohydrates, CSDB covers nearly all structures of this origin published up to 2016. Prokaryotic, plant and fungal means that a glycan was found in the organism(s) belonging to these taxonomic domains or was obtained by modification of those found in them. Carbohydrate means a structure composed of any residues linked by glycosidic, ester, amidic, ketal, phospho- or sulpho-diester bonds in which at least one residue is a sugar or its derivative.
NCBI Virus Variation is a specialized database which collects tools to provide searchable resources in the fields of Influenza virus, Dengue virus, and West Nile virus. Specific BLAST databases are listed. Their new publications are also available in their site. Rotavirus database will be added in their site soon.
dbSTS is an NCBI resource that contains sequence data for short genomic landmark sequences or Sequence Tagged Sites. STS sequences are incorporated into the STS Division of GenBank.
Content type(s)
TrichDB integrated genomic resources for the eukaryotic protist pathogens Trichomonas vaginalis.
British Antarctic Survey (BAS) has, for over 60 years, undertaken the majority of Britain's scientific research on and around the Antarctic continent. Atmospheric, biosphere, cryosphere, geosphere, hydrosphere, and Sun-Earth interactions metadata and data are available. Geographic information and collections are highlighted as well. Information and mapping services include a Discovery Metadata System, Data Access System, the Antarctic Digital Database (ADD), Geophysics Data Portal (BAS-GDP), ICEMAR, a fossil database, and the Antarctic Plant Database.
The NCBI Trace Archive is a permanent repository of DNA sequence chromatograms (traces), base calls, and quality estimates for single-pass reads from various large-scale sequencing projects. The Trace Archive serves as the repository of sequencing data from gel/capillary platforms such as Applied Biosystems ABI 3730®. The Sequence Read Archive (SRA) stores sequencing data from the next generation of sequencing platforms including Roche 454 GS System®, Illumina Genome Analyzer®, Applied Biosystems SOLiD® System, Helicos Heliscope®, and others. The Trace Assembly Archive stores pairwise alignment and multiple alignment of sequencing reads, linking basic trace data with finished genomic sequence.
InterPro collects information about protein sequence analysis and classification, providing access to a database of predictive protein signatures used for the classification and automatic annotation of proteins and genomes. Sequences in InterPro are classified at superfamily, family, and subfamily. InterPro predicts the occurrence of functional domains, repeats, and important sites, and adds in-depth annotation such as GO terms to the protein signatures.
GigaDB primarily serves as a repository to host data and tools associated with articles in GigaScience (GigaScience is an online, open-access journal). GigaDB defines a dataset as a group of files (e.g., sequencing data, analyses, imaging files, software programs) that are related to and support an article or study. GigaDB allows the integration of manuscript publication with supporting data and tools.