Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 127 result(s)
The main objective of our work is to understand the pathomechanisms of late onset neurodegenerative disorders such as Huntington's, Parkinson's, Alzheimer's and Machado Joseph disease and to develop causal therapies for them. The disease causing proteins of these illnesses have been identified, but their functions in the unaffected organism are mostly unknown. Here, we have developed a strategy combining library and matrix yeast two-hybrid screens to generate a highly connected PPI network for Huntington's disease (HD).
CorrDB has data of cattle, relating to meat production, milk production, growth, health, and others. This database is designed to collect all published livestock genetic/phenotypic trait correlation data, aimed at facilitating genetic network analysis or systems biology studies.
The European Variation Archive is an open-access database of all types of genetic variation data from all species. The EVA provides access to highly detailed, granular, raw variant data from human, with other species to follow.
The Mouse Tumor Biology (MTB) Database supports the use of the mouse as a model system of hereditary cancer by providing electronic access to: Information on endogenous spontaneous and induced tumors in mice, including tumor frequency & latency data, Information on genetically defined mice (inbred, hybrid, mutant, and genetically engineered strains of mice) in which tumors arise, Information on genetic factors associated with tumor susceptibility in mice and somatic genetic-mutations observed in the tumors, Tumor pathology reports and images, References, supporting MTB data and Links to other online resources for cancer.
GeneCards is a searchable, integrative database that provides comprehensive, user-friendly information on all annotated and predicted human genes. It automatically integrates gene-centric data from ~125 web sources, including genomic, transcriptomic, proteomic, genetic, clinical and functional information.
The Erythron Database is a resource dedicated to facilitating better understanding of the cellular and molecular underpinnings of mammalian erythropoiesis. The resource is built upon a searchable database of gene expression in murine primitive and definitive erythroid cells at progressive stages of maturation.
The PRIDE PRoteomics IDEntifications database is a centralized, standards compliant, public data repository for proteomics data, including protein and peptide identifications, post-translational modifications and supporting spectral evidence. PRIDE encourages and welcomes direct user submissions of mass spectrometry data to be published in peer-reviewed publications.
jPOSTrepo (Japan ProteOme STandard Repository) is a repository of sharing MS raw/processed data. It consists of a high-speed file upload process, flexible file management system and easy-to-use interfaces. Users can release their "raw/processed" data via this site with a unique identifier number for the paper publication. Users also can suspend (or "embargo") their data until their paper is published. The file transfer from users’ computer to our repository server is very fast (roughly ten times faster than usual file transfer) and uses only web browsers – it does not require installing any additional software.
This database serves forest tree scientists by providing online access to hardwood tree genomic and genetic data, including assembled reference genomes, transcriptomes, and genetic mapping information. The web site also provides access to tools for mining and visualization of these data sets, including BLAST for comparing sequences, Jbrowse for browsing genomes, Apollo for community annotation and Expression Analysis to build gene expression heatmaps.
Launched in 2000, WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of C. elegans and some related nematodes. In addition to their curation work, all sites have ongoing programs in bioinformatics research to develop the next generations of WormBase structure, content and accessibility
Content type(s)
Since the first discovery of RNA pseudoknots more and many more pseudoknots have been found. However, not all of those pseudoknot data are easy to trace. Sometimes the information is hidden in a publication where the title gives no hint that pseudoknot information is there. This was the first reason that we thought that a general accessible information source for pseudoknots would be handy.
ViPR, the Virus Pathogen Resource, is a publicly available, NIAID-sponsored one-stop database and analysis resource that supports the research of viral pathogens in the NIAID Category A-C Priority Pathogen lists and those causing (re)emerging infectious diseases. ViPR integrates data from external sources (GenBank, UniProt, Immune Epitope Database, Protein Data Bank, etc.), direct submissions, and internal curation and analysis pipelines, and provides a suite of bioinformatics analysis and visualization tools to expedite virology research.
Project Achilles is a systematic effort aimed at identifying and cataloging genetic vulnerabilities across hundreds of genomically characterized cancer cell lines. The project uses genome-wide genetic perturbation reagents (shRNAs or Cas9/sgRNAs) to silence or knock-out individual genes and identify those genes that affect cell survival. Large-scale functional screening of cancer cell lines provides a complementary approach to those studies that aim to characterize the molecular alterations (e.g. mutations, copy number alterations) of primary tumors, such as The Cancer Genome Atlas (TCGA). The overall goal of the project is to identify cancer genetic dependencies and link them to molecular characteristics in order to prioritize targets for therapeutic development and identify the patient population that might benefit from such targets.
The cisRED database holds conserved sequence motifs identified by genome scale motif discovery, similarity, clustering, co-occurrence and coexpression calculations. Sequence inputs include low-coverage genome sequence data and ENCODE data. A Nucleic Acids Research article describes the system architecture
Xenbase's mission is to provide the international research community with a comprehensive, integrated and easy to use web based resource that gives access the diverse and rich genomic, expression and functional data available from Xenopus research. Xenbase also provides a critical data sharing infrastructure for many other NIH-funded projects, and is a focal point for the Xenopus community. In addition to our primary goal of supporting Xenopus researchers, Xenbase enhances the availability and visibility of Xenopus data to the broader biomedical research community.
The Database of Protein Disorder (DisProt) is a curated database that provides information about proteins that lack fixed 3D structure in their putatively native states, either in their entirety or in part. DisProt is a community resource annotating protein sequences for intrinsically disorder regions from the literature. It classifies intrinsic disorder based on experimental methods and three ontologies for molecular function, transition and binding partner.
IntEnz contains the recommendation of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology on the nomenclature and classification of enzyme-catalyzed reactions. Users can browse by enzyme classification or use advanced search options to search enzymes by class, subclass and sub-subclass information.
EBI's CSA contains data documenting enzyme active sites and catalytic residues in enzymes of 3D structure. Entries in CSA may be original hand-annotated entries from primary literature or homologous entries found by PSI-BLAST alignment.
BioPortal is an open repository of biomedical ontologies, a service that provides access to those ontologies, and a set of tools for working with them. BioPortal provides a wide range of such tools, either directly via the BioPortal web site, or using the BioPortal web service REST API. BioPortal also includes community features for adding notes, reviews, and even mappings to specific ontologies. BioPortal has four major product components: the web application; the API services; widgets, or applets, that can be installed on your own site; and a Virtual Appliance version that is available for download or through Amazon Web Services machine instance (AMI). There is also a beta release SPARQL endpoint.
Human Protein Reference Database (HPRD) has been established by a team of biologists, bioinformaticists and software engineers. This is a joint project between the PandeyLab at Johns Hopkins University, and Institute of Bioinformatics, Bangalore. HPRD is a definitive repository of human proteins. This database should serve as a ready reckoner for researchers in their quest for drug discovery, identification of disease markers and promote biomedical research in general. Human Proteinpedia ( is its associated data portal.
BioModels Database is a repository of peer-reviewed, published, computational models that allows biologists to store, search and retrieve published mathematical models from the field of systems biology, but also more generally those of biological interest. Models in the database can be used to generate sub-models, can be simulated online, and can be converted between different representational formats.
The Ligand-Gated Ion Channel database provides access to information about transmembrane proteins that exist under different conformations, with three primary subfamilies: the cys-loop superfamily, the ATP gated channels superfamily, and the glutamate activated cationic channels superfamily.**The development of the Ligand-Gated Ion Channel database was started in 1994, as part of Le Novère's work on the phylogeny of those receptors' subunits. It grew into a serious data resource, that served the community at large. However, it is not actively maintained anymore. In addition, bioinformatics technology evolved a lot over the last two decades, so that scientists can now generate quickly customised databases from trustworthy primary data resources. Therefore, we decided to officialy freeze the data resource. The resource will not disappear, and all the information and links will stay there. But people should not consider it as an up-to-date trustable resource.**
PANDIT is a collection of multiple sequence alignments and phylogenetic trees. It contains corresponding amino acid and nucleotide sequence alignments, with trees inferred from each alignment. PANDIT is based on the Pfam database (Protein families database of alignments and HMMs), and includes the seed amino acid alignments of most families in the Pfam-A database. DNA sequences for as many members of each family as possible are extracted from the EMBL Nucleotide Sequence Database and aligned according to the amino acid alignment. PANDIT also contains a further copy of the amino acid alignments, restricted to the sequences for which DNA sequences were found.