Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 401 result(s)
The 1000 Genomes Project is an international collaboration to produce an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts. This resource will support genome-wide association studies and other medical research studies. The genomes of about 2500 unidentified people from about 25 populations around the world will be sequenced using next-generation sequencing technologies. The results of the study will be freely and publicly accessible to researchers worldwide.
4TU.ResearchData, previously known as 3TU.Datacentrum, is an archive for research data. It offers the knowledge, experience and the tools to share and safely store scientific research data in a standardized, secure and well-documented manner. 4TU.Centre for Research Data provides the research community with: Advice and support on data management; A long-term archive for scientific research data; Support for current research projects; Tools for reusing research data.
ACEpepDB is a database ran by the Central Food Technological Research Institute. It contains records of about 865 peptides. Each record provides information on the food source, preparation, purification and any other additional information. Each record includes the reference(s). The database provides a search and browsing option for a more personalized research experience.
AceView provides a curated, comprehensive and non-redundant sequence representation of all public mRNA sequences (mRNAs from GenBank or RefSeq, and single pass cDNA sequences from dbEST and Trace). These experimental cDNA sequences are first co-aligned on the genome then clustered into a minimal number of alternative transcript variants and grouped into genes. Using exhaustively and with high quality standards the available cDNA sequences evidences the beauty and complexity of mammals’ transcriptome, and the relative simplicity of the nematode and plant transcriptomes. Genes are classified according to their inferred coding potential; many presumably non-coding genes are discovered. Genes are named by Entrez Gene names when available, else by AceView gene names, stable from release to release. Alternative features (promoters, introns and exons, polyadenylation signals) and coding potential, including motifs, domains, and homologies are annotated in depth; tissues where expression has been observed are listed in order of representation; diseases, phenotypes, pathways, functions, localization or interactions are annotated by mining selected sources, in particular PubMed, GAD and Entrez Gene, and also by performing manual annotation, especially in the worm. In this way, both the anatomy and physiology of the experimentally cDNA supported human, mouse and nematode genes are thoroughly annotated.
TERN's AEKOS data portal is the original gateway to Australian ecology data. It is a ‘data and research methods’ data portal for Australia’s land-dwelling plants, animals and their environments. The primary focus of data content is raw co-located ‘species and environment’ ecological survey data that has been collected at the ‘plot’ level to describe biodiversity, its patterns and ecological processes. It is openly accessible with standard discovery metadata and user-oriented, contextual metadata critical for data reuse. Our services support the ecosystem science community, land managers and governments seeking to publish under COPE publishing ethics and the FAIR data publishing principles. AEKOS is registered with Thomson & Reuters Data Citation Index and is a recommended repository of Nature Publishing’s Scientific Data. There are currently 97,037 sites covering mostly plant biodiversity and co-located environmental data of Australia. The AEKOS initiative is supported by TERN (, hosted by The University of Adelaide and funded by the Australian Government’s National Research Infrastructure for Australia.
ALEXA is a microarray design platform for 'alternative expression analysis'. This platform facilitates the design of expression arrays for analysis of mRNA isoforms generated from a single locus by the use of alternative transcription initiation, splicing and polyadenylation sites. We use the term 'ALEXA' to describe a collection of novel genomic methods for 'alternative expression' analysis. 'Alternative expression' refers to the identification and quantification of alternative mRNA transcripts produced by alternative transcript initiation, alternative splicing and alternative polyadenylation. This website provides supplementary materials, source code and other downloads for recent publications describing our studies of alternative expression (AE). Most recently we have developed a method, 'ALEXA-Seq' and associated resources for alternative expression analysis by massively parallel RNA sequencing.
The Allele Frequency Net Database (AFND) is a public database which contains frequency information of several immune genes such as Human Leukocyte Antigens (HLA), Killer-cell Immunoglobulin-like Receptors (KIR), Major histocompatibility complex class I chain-related (MIC) genes, and a number of cytokine gene polymorphisms. The Allele Frequency Net Database (AFND) provides a central source, freely available to all, for the storage of allele frequencies from different polymorphic areas in the Human Genome. Users can contribute the results of their work into one common database and can perform database searches on information already available. We have currently collected data in allele, haplotype and genotype format. However, the success of this website will depend on you to contribute your data.
The Allen Brain Atlas provides a unique online public resource integrating extensive gene expression data, connectivity data and neuroanatomical information with powerful search and viewing tools for the adult and developing brain in mouse, human and non-human primate
The Alzheimer Disease & Frontotemporal Dementia Mutation Database (AD&FTDMDB) aims at collecting all known mutations in the genes related to Alzheimer disease (AD) and fromtotemporal dementias (FTD). Mutations are collected from the literature and from presentations at scientific meetings. In addition, mutations can be submitted to AD&FTDMDB at this web site.
Content type(s)
While focused on supporting the scientific community, ATCC activities range widely, from repository-related operations to providing specialized services, conducting in-house R&D and intellectual property management. ATCC serves U.S. and international researchers by characterizing cell lines, bacteria, viruses, fungi and protozoa, as well as developing and evaluating assays and techniques for validating research resources and preserving and distributing biological materials to the public and private sector research communities. Our management philosophy emphasizes customer satisfaction, value addition, cost-effective operations and competitive benchmarking for all areas of our enterprise.
Androgen Receptor Gene Mutations Database is for all who are interested in mutations of the Androgen Receptor Gene. In light of the difficulty in getting new AR mutations published the curator will now accept new mutations that have not been published, provided that it is from a reputable research or clinical laboratory. The curator also strongly suggests that where possible, particularly in the case of new unique mutations that an attempt be made to at least confirm the pathogenicity of the putatative mutation, by showing that the mutation when transfected into a suitable expression system produces a mutant androgen receptor protein.
Genome track alignments using GBrowse on this site are featured with: (1) Annotated and predicted genes and transcripts; (2) QTL / SNP Association tracks; (3) OMIA genes; (4) Various SNP Chip tracks; (5) Other mapping fetures or elements that are available.
APID Interactomes is a database that provides a comprehensive collection of protein interactomes for more than 400 organisms based in the integration of known experimentally validated protein-protein physical interactions (PPIs). Construction of the interactomes is done with a methodological approach to report quality levels and coverage over the proteomes for each organism included. In this way, APID provides interactomes from specific organisms that in 25 cases have more than 500 proteins. As a whole APID includes a comprehensive compendium of 90,379 distinct proteins and 678,441 singular interactions. The analytical and integrative effort done in APID unifies PPIs from primary databases of molecular interactions (BIND, BioGRID, DIP, HPRD, IntAct, MINT) and also from experimentally resolved 3D structures (PDB) where more than two distinct proteins have been identified. In this way, 8,388 structures have been analyzed to find specific protein-protein interactions reported with details of their molecular interfaces. APID also includes a new data visualization web-tool that allows the construction of sub-interactomes using query lists of proteins of interest and the visual exploration of the corresponding networks, including an interactive selection of the properties of the interactions (i.e. the reliability of the "edges" in the network) and an interactive mapping of the functional environment of the proteins (i.e. the functional annotations of the "nodes" in the network).
ArrayExpress is one of the major international repositories for high-throughput functional genomics data from both microarray and high-throughput sequencing studies, many of which are supported by peer-reviewed publications. Data sets are either submitted directly to ArrayExpress and curated by a team of specialist biological curators, or are imported systematically from the NCBI Gene Expression Omnibus database on a weekly basis. Data is collected to MIAME and MINSEQE standards.
ASAP (a systematic annotation package for community analysis of genomes) is a relational database and web interface developed to store, update and distribute genome sequence data and gene expression data collected by or in collaboration with researchers at the University of Wisconsin - Madison. ASAP was designed to facilitate ongoing community annotation of genomes and to grow with genome projects as they move from the preliminary data stage through post-sequencing functional analysis. The ASAP database includes multiple genome sequences at various stages of analysis, and gene expression data from preliminary experiments.
AspGD is an organized collection of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus. Among its many species, the genus contains an excellent model organism (A. nidulans, or its teleomorph Emericella nidulans), an important pathogen of the immunocompromised (A. fumigatus), an agriculturally important toxin producer (A. flavus), and two species used in industrial processes (A. niger and A. oryzae). AspGD contains information about genes and proteins of multiple Aspergillus species; descriptions and classifications of their biological roles, molecular functions, and subcellular localizations; gene, protein, and chromosome sequence information; tools for analysis and comparison of sequences; and links to literature information; as well as a multispecies comparative genomics browser tool (Sybil) for exploration of orthology and synteny across multiple sequenced Aspergillus species.
ASTM International, formerly known as the American Society for Testing and Materials (ASTM), is a globally recognized leader in the development and delivery of international voluntary consensus standards. Today, some 12,000 ASTM standards are used around the world to improve product quality, enhance safety, facilitate market access and trade, and build consumer confidence.
The Australian Drosophila Ecology and Evolution Resource (ADEER) from the Hoffmann lab and other contributors is a nationally significant life science collection. The Drosophila Clinal Data Collection contains data on populations along the eastern coast of Australia. It remains an excellent resource for understanding past and future evolutionary responses to climate change. The Drosophila Genomic Data Collection hosts Drosophila genomes sequenced as part of the Genomic Basis for Adaptation to Climate Change Project. 23 genomes have been sequenced as part of this project. Currently assemblies and annotations are available for Drosophila birchii, D. bunnanda, D. hydei, and D. repleta. The Drosophila Species Distribution Data Collection contains distribution data of nine drosophilid species that have been collected in Australia by the Hoffmann lab and other research groups between 1924 and 2005. More than 300 drosophilid species have been identified in the tropical and temperate forests located on the east coast of Australia. Many species are restricted to the tropics, a few are temperate specialists, and some have broad distributions across climatic regions. Their varied distribution along the tropical - temperate cline provide a powerful tool for studying climate adaptation and species distribution limits.
The Autism Chromosome Rearrangement Database is a collection of hand curated breakpoints and other genomic features, related to autism, taken from publicly available literature: databases and unpublished data. The database is continuously updated with information from in-house experimental data as well as data from published research studies.
BCSDB database is aimed at provision of structural, bibliographic, taxonomic and related information on bacterial carbohydrate structures. Two key points of this service are: covering - is above 90% in the scope of bacterial carbohydrates. This means the negative search answer remains valuable scientific information. And consistence - we manually check the data, and aim at hight quality error-free content. The main source of data is a retrospective literature analysis. About 25% of data were imported from CCSD (Carbbank, ceased in 1997, University of Georgia, Athens; structures published before 1995) with subsequent manual curation and approval. Current coverage is displayed in red on the top of the left menu. The time lag between publication of new data and their deposition ~ 1 year. The scope is "bacterial carbohydrates" and covers nearly all structures of this class published up to 2013. Bacterial means that a structure has been found in bacteria or obtained by modification of those found in bacteria. Carohydrate means a structure composed of any residues linked by glycosidic, ester, amidic, ketal, phospho- or sulpho-diester bonds, in which at least one residue is a sugar or its derivative.
The Barcode of Life Data Systems (BOLD) provides DNA barcode data. BOLD's online workbench supports data validation, annotation, and publication for specimen, distributional, and molecular data. The platform consists of four main modules: a data portal, a database of barcode clusters, an educational portal, and a data collection workbench. BOLD is the go-to site for DNA-based identification. As the central informatics platform for DNA barcoding, BOLD plays a crucial role in assimilating and organizing data gathered by the international barcode research community. Two iBOL (International Barcode of Life) Working Groups are supporting the ongoing development of BOLD.
BeeBase provides gene sequences and genomes of Bombus terrestris, B. impatiens, Apis mellifera and three of its pathogens. BeeBase data is discoverable and analyzed via genome browsers, blast search, and apollo annotation tool.
!!!!! This database doesn't exist anymore. 2017-09-05 !!!!!BeetleBase is a comprehensive sequence database and important community resource for Tribolium genetics, genomics and developmental biology. It provides genetic data on the Tribolium Castaneum, Red Flour Beetle, as gene maps, official gene set, reference sequences, predicted models, and whole-genome tiling array representing developmental stages.
The Benchmark Energy & Geometry Database (BEGDB) collects results of highly accurate QM calculations of molecular structures, energies and properties. These data can serve as benchmarks for testing and parameterization of other computational methods.