Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 181 result(s)
The GSS database collects unannotated, short, single-read, primary genomic sequences from GenBank and contains nucleic acid sequences. These sequences include random survey sequences, clone-end sequences, and exon-trapped sequences.
InterPro collects information about protein sequence analysis and classification, providing access to a database of predictive protein signatures used for the classification and automatic annotation of proteins and genomes. Sequences in InterPro are classified at superfamily, family, and subfamily. InterPro predicts the occurrence of functional domains, repeats, and important sites, and adds in-depth annotation such as GO terms to the protein signatures.
INTEGRALL is a web-based platform dedicated to compile information on integrons and designed to organize all the data available for these genetic structures. INTEGRALL provides a public genetic repository for sequence data and nomenclature and offers to scientists an easy and interactive access to integron's DNA sequences, their molecular arrangements as well as their genetic contexts.
This site provides users with access to up-to-date information about mutations at the phenylalanine hydroxylase locus. Here you will have access to the content of the database in the form of electronic reports. The database is updated manually off-line by the curators to assure that no erroneous information is appended. The curators now also accept data electronically via the submission form.
MalaCards is an integrated database of human maladies and their annotations, modeled on the architecture and richness of the popular GeneCards database of human genes. MalaCards mines and merges varied web data sources to generate a computerized web card for each human disease. Each MalaCard contains disease specific prioritized annotative information, as well as links between associated diseases, leveraging the GeneCards relational database, search engine, and GeneDecks set-distillation tool. As proofs of concept of the search/distill/infer pipeline we find expected elucidations, as well as potentially novel ones.
ArrayExpress is one of the major international repositories for high-throughput functional genomics data from both microarray and high-throughput sequencing studies, many of which are supported by peer-reviewed publications. Data sets are either submitted directly to ArrayExpress and curated by a team of specialist biological curators, or are imported systematically from the NCBI Gene Expression Omnibus database on a weekly basis. Data is collected to MIAME and MINSEQE standards.
dbSTS is an NCBI resource that contains sequence data for short genomic landmark sequences or Sequence Tagged Sites. STS sequences are incorporated into the STS Division of GenBank.
The Pseudomonas Genome Database collaborates with an international panel of expert Pseudomonas researchers to provide high quality updates to the PAO1 genome annotation and make cutting edge genome analysis data available.
The miRBase database is a searchable database of published miRNA sequences and annotation. Each entry in the miRBase Sequence database represents a predicted hairpin portion of a miRNA transcript (termed mir in the database), with information on the location and sequence of the mature miRNA sequence (termed miR). Both hairpin and mature sequences are available for searching and browsing, and entries can also be retrieved by name, keyword, references and annotation. All sequence and annotation data are also available for download. The miRBase Registry provides miRNA gene hunters with unique names for novel miRNA genes prior to publication of results.
>>>!!!<<<As stated 2017-05-23 Cancer GEnome Mine is no longer available >>>!!!<<< Cancer GEnome Mine is a public database for storing clinical information about tumor samples and microarray data, with emphasis on array comparative genomic hybridization (aCGH) and data mining of gene copy number changes.
>>>!!!<<< caArray Retirement Announcement >>>!!!<<< The National Cancer Institute (NCI) Center for Biomedical Informatics and Information Technology (CBIIT) instance of the caArray database was retired on March 31st, 2015. All publicly-accessible caArray data and annotations will be archived and will remain available via FTP download and is also available at GEO . >>>!!!<<< While NCI will not be able to provide technical support for the caArray software after the retirement, the source code is available on GitHub , and we encourage continued community development. Molecular Analysis of Brain Neoplasia (Rembrandt fine-00037) gene expression data has been loaded into ArrayExpress: >>>!!!<<< caArray is an open-source, web and programmatically accessible microarray data management system that supports the annotation of microarray data using MAGE-TAB and web-based forms. Data and annotations may be kept private to the owner, shared with user-defined collaboration groups, or made public. The NCI instance of caArray hosts many cancer-related public datasets available for download.
ALEXA is a microarray design platform for 'alternative expression analysis'. This platform facilitates the design of expression arrays for analysis of mRNA isoforms generated from a single locus by the use of alternative transcription initiation, splicing and polyadenylation sites. We use the term 'ALEXA' to describe a collection of novel genomic methods for 'alternative expression' analysis. 'Alternative expression' refers to the identification and quantification of alternative mRNA transcripts produced by alternative transcript initiation, alternative splicing and alternative polyadenylation. This website provides supplementary materials, source code and other downloads for recent publications describing our studies of alternative expression (AE). Most recently we have developed a method, 'ALEXA-Seq' and associated resources for alternative expression analysis by massively parallel RNA sequencing.
A community-based search platform. OMICtools bridges the gap between researchers and tool developers. OMICtools brings together an interactive worldwide user community, linking expert curators who submit, review and categorize tools, to users who strengthen the interface by bringing feedback and reviews.
The NCBI Trace Archive is a permanent repository of DNA sequence chromatograms (traces), base calls, and quality estimates for single-pass reads from various large-scale sequencing projects. The Trace Archive serves as the repository of sequencing data from gel/capillary platforms such as Applied Biosystems ABI 3730®. The Sequence Read Archive (SRA) stores sequencing data from the next generation of sequencing platforms including Roche 454 GS System®, Illumina Genome Analyzer®, Applied Biosystems SOLiD® System, Helicos Heliscope®, and others. The Trace Assembly Archive stores pairwise alignment and multiple alignment of sequencing reads, linking basic trace data with finished genomic sequence.
BeeBase provides gene sequences and genomes of Bombus terrestris, B. impatiens, Apis mellifera and three of its pathogens. BeeBase data is discoverable and analyzed via genome browsers, blast search, and apollo annotation tool.
Androgen Receptor Gene Mutations Database is for all who are interested in mutations of the Androgen Receptor Gene. In light of the difficulty in getting new AR mutations published the curator will now accept new mutations that have not been published, provided that it is from a reputable research or clinical laboratory. The curator also strongly suggests that where possible, particularly in the case of new unique mutations that an attempt be made to at least confirm the pathogenicity of the putatative mutation, by showing that the mutation when transfected into a suitable expression system produces a mutant androgen receptor protein.
Gene Expression Omnibus: a public functional genomics data repository supporting MIAME-compliant data submissions. Array- and sequence-based data are accepted. Tools are provided to help users query and download experiments and curated gene expression profiles.
The Pain Genes Database is an interactive web-based data browser of pain-related transgenic knockout studies. It is designed to allow easy access to and analysis of the published pain-related phenotypes of mutant mice (over 200 different mutants at the date of submission). The database features two levels of exploration, one allowing the identification of genes by name, acronym, genomic position or "summary" phenotype, and the other allowing in-depth browsing, paper-by-paper, of specific phenotypes and test parameters. Hosted by the Department of Psychology and Centre for Research on Pain at McGill University.
The objective of this project is to generate the most comprehensive description of human chromosome 7 to facilitate biological discovery, disease gene research and medical genetic applications. In our vision, the DNA sequence of chromosome 7 should be made available in a user-friendly manner having every biological and medically relevant feature annotated along its length. We have established this website and database as one step towards this goal. In addition to being a primary data source we foresee this site serving as a "weighing station" for testing community ideas and information to produce highly curated data to be submitted to other databases such as NCBI, Ensembl, and UCSC. Therefore, any useful data submitted to us will be curated and shown in this database.
APID Interactomes is a database that provides a comprehensive collection of protein interactomes for more than 400 organisms based in the integration of known experimentally validated protein-protein physical interactions (PPIs). Construction of the interactomes is done with a methodological approach to report quality levels and coverage over the proteomes for each organism included. In this way, APID provides interactomes from specific organisms that in 25 cases have more than 500 proteins. As a whole APID includes a comprehensive compendium of 90,379 distinct proteins and 678,441 singular interactions. The analytical and integrative effort done in APID unifies PPIs from primary databases of molecular interactions (BIND, BioGRID, DIP, HPRD, IntAct, MINT) and also from experimentally resolved 3D structures (PDB) where more than two distinct proteins have been identified. In this way, 8,388 structures have been analyzed to find specific protein-protein interactions reported with details of their molecular interfaces. APID also includes a new data visualization web-tool that allows the construction of sub-interactomes using query lists of proteins of interest and the visual exploration of the corresponding networks, including an interactive selection of the properties of the interactions (i.e. the reliability of the "edges" in the network) and an interactive mapping of the functional environment of the proteins (i.e. the functional annotations of the "nodes" in the network).
NCBI Virus Variation is a specialized database which collects tools to provide searchable resources in the fields of Influenza virus, Dengue virus, and West Nile virus. Specific BLAST databases are listed. Their new publications are also available in their site. Rotavirus database will be added in their site soon.
The sequencing of several bird genomes and the anticipated sequencing of many more provided the impetus to develop a model organism database devoted to the taxonomic class: Aves. Birds provide model organisms important to the study of neurobiology, immunology, genetics, development, oncology, virology, cardiovascular biology, evolution and a variety of other life sciences. Many bird species are also important to agriculture, providing an enormous worldwide food source worldwide. Genomic approaches are proving invaluable to studying traits that affect meat yield, disease resistance, behavior, and bone development along with many other factors affecting productivity. In this context, BirdBase will serve both biomedical and agricultural researchers.
The Yeast Resource Center provides access to data about mass spectrometry, yeast two-hybrid arrays, deconvolution florescence microscopy, protein structure prediction and computational biology. These services are provided to further the goal of a complete understanding of the chemical interactions required for the maintenance and faithful reproduction of a living cell. The observation that the fundamental biological processes of yeast are conserved among all eukaryotes ensures that this knowledge will shape and advance our understanding of living systems.