Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 39 result(s)
The EBiSC Catalogue is a collection of human iPS cells being made available to academic and commercial researchers for use in disease modelling and other forms of preclinical research. The initial collection has been generated from a wide range of donors representing specific disease backgrounds and healthy controls. As the collection grows, more isogenic control lines will become available which will add further to the collection’s appeal.
dbEST is a division of GenBank that contains sequence data and other information on "single-pass" cDNA sequences, or "Expressed Sequence Tags", from a number of organisms. Expressed Sequence Tags (ESTs) are short (usually about 300-500 bp), single-pass sequence reads from mRNA (cDNA). Typically they are produced in large batches. They represent a snapshot of genes expressed in a given tissue and/or at a given developmental stage. They are tags (some coding, others not) of expression for a given cDNA library. Most EST projects develop large numbers of sequences. These are commonly submitted to GenBank and dbEST as batches of dozens to thousands of entries, with a great deal of redundancy in the citation, submitter and library information. To improve the efficiency of the submission process for this type of data, we have designed a special streamlined submission process and data format. dbEST also includes sequences that are longer than the traditional ESTs, or are produced as single sequences or in small batches. Among these sequences are products of differential display experiments and RACE experiments. The thing that these sequences have in common with traditional ESTs, regardless of length, quality, or quantity, is that there is little information that can be annotated in the record. If a sequence is later characterized and annotated with biological features such as a coding region, 5'UTR, or 3'UTR, it should be submitted through the regular GenBank submissions procedure (via BankIt or Sequin), even if part of the sequence is already in dbEST. dbEST is reserved for single-pass reads. Assembled sequences should not be submitted to dbEST. GenBank will accept assembled EST submissions for the forthcoming TSA (Transcriptome Shotgun Assembly) division. The individual reads which make up the assembly should be submitted to dbEST, the Trace archive or the Short Read Archive (SRA) prior to the submission of the assemblies.
STRING is a database of known and predicted protein interactions. The interactions include direct (physical) and indirect (functional) associations; they are derived from four sources: - Genomic Context - High-throughput Experiments - (Conserved) Coexpression - Previous Knowledge STRING quantitatively integrates interaction data from these sources for a large number of organisms, and transfers information between these organisms where applicable.
The Structure database provides three-dimensional structures of macromolecules for a variety of research purposes and allows the user to retrieve structures for specific molecule types as well as structures for genes and proteins of interest. Three main databases comprise Structure-The Molecular Modeling Database; Conserved Domains and Protein Classification; and the BioSystems Database. Structure also links to the PubChem databases to connect biological activity data to the macromolecular structures. Users can locate structural templates for proteins and interactively view structures and sequence data to closely examine sequence-structure relationships.
EDINA delivers online services and tools to benefit students, teachers and researchers in UK Higher and Further Education and beyond.
A place where researchers can publicly store and share unthresholded statistical maps, parcellations, and atlases produced by MRI and PET studies.
The Wellcome Trust Sanger Institute is a charitably funded genomic research centre located in Hinxton, nine miles south of Cambridge in the UK. We study diseases that have an impact on health globally by investigating genomes. Building on our past achievements and based on priorities that exploit the unique expertise of our Faculty of researchers, we will lead global efforts to understand the biology of genomes. We are convinced of the importance of making this research available and accessible for all audiences. reduce global health burdens.
DEIMS-SDR (Dynamic Ecological Information Management System - Site and dataset registry) is an information management system that allows you to discover long-term ecosystem research sites around the globe, along with the data gathered at those sites and the people and networks associated with them. DEIMS-SDR describes a wide range of sites, providing a wealth of information, including each site’s location, ecosystems, facilities, parameters measured and research themes. It is also possible to access a growing number of datasets and data products associated with the sites. All sites and dataset records can be referenced using unique identifiers that are generated by DEIMS-SDR. It is possible to search for sites via keyword, predefined filters or a map search. By including accurate, up to date information in DEIMS, site managers benefit from greater visibility for their LTER site, LTSER platform and datasets, which can help attract funding to support site investments. The aim of DEIMS-SDR is to be the globally most comprehensive catalogue of environmental research and monitoring facilities, featuring foremost but not exclusively information about all LTER sites on the globe and providing that information to science, politics and the public in general.
>>>>!!!!<<<< The Cancer Genomics Hub mission is now completed. The Cancer Genomics Hub was established in August 2011 to provide a repository to The Cancer Genome Atlas, the childhood cancer initiative Therapeutically Applicable Research to Generate Effective Treatments and the Cancer Genome Characterization Initiative. CGHub rapidly grew to be the largest database of cancer genomes in the world, storing more than 2.5 petabytes of data and serving downloads of nearly 3 petabytes per month. As the central repository for the foundational genome files, CGHub streamlined team science efforts as data became as easy to obtain as downloading from a hard drive. The convenient access to Big Data, and the collaborations that CGHub made possible, are now essential to cancer research. That work continues at the NCI's Genomic Data Commons. All files previously stored at CGHub can be found there. The Website for the Genomic Data Commons is here: >>>>!!!!<<<< The Cancer Genomics Hub (CGHub) is a secure repository for storing, cataloging, and accessing cancer genome sequences, alignments, and mutation information from the Cancer Genome Atlas (TCGA) consortium and related projects. Access to CGHub Data: All researchers using CGHub must meet the access and use criteria established by the National Institutes of Health (NIH) to ensure the privacy, security, and integrity of participant data. CGHub also hosts some publicly available data, in particular data from the Cancer Cell Line Encyclopedia. All metadata is publicly available and the catalog of metadata and associated BAMs can be explored using the CGHub Data Browser.
The NIDDK Information Network (dkNET) serves the needs of basic and clinical investigators by providing seamless access to large pools of data and research resources relevant to the mission of The National Institute of Diabetes Digestive and Kidney Diseases (NIDDK).
The public MorpheusML model repository collects, curates, documents and tests computational models for multi-scale and multicellular biological systems. Model must be encoded in the model description language MorpheusML. Subsections of the repository distinguish published models from contributed non-published and example models. New models are simulated in Morpheus or Artistoo independently from the authors and results are compared to published results. Successful reproduction is documented on the model's webpage. Models in this repository are included into the CI and test pipelines for each release of the model simulator Morpheus to check and guarantee reproducibility of results across future simulator updates. The model’s webpage provides a History-link to all past model versions and edits that are automatically tracked via Git. Each model is registered with a unique and persistent ID of the format M..... The model description page (incl. the biological context and key results of that model), the model’s XML file, the associated paper, and all further files (often simulation result videos) connected with that model can be retrieved via a persistent URL of the format - for technical details on the citable ModelID please see - for the model definition standard MorpheusML please see - for the model simulator Morpheus please see - for the model simulator Artistoo please see
From 2005 to 2008, with the support of the Ministry of Science and Technology (MOST), the construction of parasite germplasm repositories has spread to 20 conservation institutions in 15 provinces (cities) nationwide, with 3 physical exhibition halls; 3 live parasite conservation centers. A total of 1115 species/117814 pieces of parasitic germplasm resources of 23 orders in 11 phyla have been integrated into the physical library and database, including human parasites and vectors, animal parasites, plant nematodes, medical insects, trematodes, and parasitic snails, and the resources are combined with moderate distribution, medium- and long-term support, and off-site duplicates. The number of resources accounts for 39.27% of the national total. Through 10 years of accumulation, we have built the largest and only parasite species resource database in the field of parasites in China, and created a sharing platform of parasite germplasm resource center.
BRENDA is the main collection of enzyme functional data available to the scientific community worldwide. The enzymes are classified according to the Enzyme Commission list of enzymes. It is available free of charge for via the internet ( and as an in-house database for commercial users (requests to our distributor Biobase). The enzymes are classified according to the Enzyme Commission list of enzymes. Some 5000 "different" enzymes are covered. Frequently enzymes with very different properties are included under the same EC number. BRENDA includes biochemical and molecular information on classification, nomenclature, reaction, specificity, functional parameters, occurrence, enzyme structure, application, engineering, stability, disease, isolation, and preparation. The database also provides additional information on ligands, which function as natural or in vitro substrates/products, inhibitors, activating compounds, cofactors, bound metals, and other attributes.
More than 25 years ago FIZ Karlsruhe started depositing crystal structure data linked to publications in German journals. At that time it was irrelevant whether the deposited structures were organic or inorganic. Today FIZ Karlsruhe is responsible for storing the structure data of inorganic compounds. Organic structure data are stored by the Cambridge Crystallographic Data Center. Nowadays many publishers inform their authors that in parallel to a publication in a scientific journal, crystal structure data should also be stored in the Crystal Structure Depot at FIZ Karlsruhe. A CSD number will be assigned to the data for later reference in the publication. The data can then be ordered from the Crystal Structure Depot at FIZ Karlsruhe.
CMO is a long-term project for the critical edition of Near Eastern music manuscripts. The project focusing on manuscripts of Ottoman music written in Hampartsum and staff notations during the nineteenth century, is funded by the German Research Foundation (DFG). This platform provides access to the online versions of both music and text editions, as well as the source catalogue, which is a comprehensive database of printed, manuscript and online sources.
ROHub is a holistic solution for the storage, lifecycle management and preservation of scientific investigations, campaigns and operational processes via research objects. It makes these resources available to others, allows to publish and release them through a DOI, and allows to discover and reuse pre-existing scientific knowledge. Built entirely around the research object concept and inspired by sustainable software management principles, ROHub is the reference platform implementing natively the full research object model and paradigm, which provides the backbone to a wealth of RO-centric applications and interfaces across different scientific communities.
Numerical database of atomic and molecular processes and particle-surface interactions. ALADDIN has formatted data on atomic structure and spectra (energy levels,wave lengths, and transition probabilities); electron and heavy particle collisions with atoms, ions, and molecules (cross sections and/or rate coefficients, including, in most cases, analytic fit to the data); sputtering of surfaces by impact of main plasma constituents and self sputtering; particle reflection from surfaces; thermophysical and thermomechanical properties of beryllium and pyrolytic graphites.
Content type(s)
The Antibody Registry supports the RRID Initiative and exists to give researchers a way to universally identify antibodies used in publications. The registry lists many commercial antibodies from over 200 vendors, which have been assigned a unique identifier and over 2000 individual laboratories. If the antibody that you are using does not appear in the list, an entry can be made by filling in as little as 2 pieces of information: the catalog number and the url of the vendor where our curators can find information and material data sheets.
LINCS Data Portal provides access to LINCS data from various sources. The program has six Data and Signature Generation Centers: Drug Toxicity Signature Generation Center, HMS LINCS Center, LINCS Center for Transcriptomics, LINCS Proteomic Characterization Center for Signaling and Epigenetics, MEP LINCS Center, and NeuroLINCS Center.
The World Register of Marine Species (WoRMS) integrates approximately 100 marine datbases to provide an authoritative and comprehensive list of marine organisms. WoRMS has an editorial system where taxonomic groups are managed by experts responsible for the quality of the information. WorMS register of marine species emerged from the European Register of Marine Species (ERMS) and the Flanders Marine Institute (VLIZ). WoRMS is a contribution to Lifewatch, Catalogue of Life, Encyclopedia of Life, Global Biodiversity Information Facility and the Census of Marine Life.
OpenKIM is an online suite of open source tools for molecular simulation of materials. These tools help to make molecular simulation more accessible and more reliable. Within OpenKIM, you will find an online resource for standardized testing and long-term warehousing of interatomic models and data, and an application programming interface (API) standard for coupling atomistic simulation codes and interatomic potential subroutines.