Search | re3data.org

Ensembl Bacteria

e!EnsemblBacteria

Subject(s)

Content type(s)

Country

This site provides access to complete, annotated genomes from bacteria and archaea (present in the European Nucleotide Archive) through the Ensembl graphical user interface (genome browser). Ensembl Bacteria contains genomes from annotated INSDC records that are loaded into Ensembl multi-species databases, using the INSDC annotation import pipeline.

Ensembl Protists

e!EnsemblProtists

Subject(s)

Content type(s)

Country

EnsemblProtists is a genome-centric portal for protists species.

Genome-Scale Metabolic Network DataBase

GSMNDB

Subject(s)

Content type(s)

Country

China

<<<!!!<<< 2019-12-23: the repository is offline >>>!!!>>> Introduction of genome-scale metabolic network: The completion of genome sequencing and subsequent functional annotation for a great number of species enables the reconstruction of genome-scale metabolic networks. These networks, together with in silico network analysis methods such as the constraint based methods (CBM) and graph theory methods, can provide us systems level understanding of cellular metabolism. Further more, they can be applied to many predictions of real biological application such as: gene essentiality analysis, drug target discovery and metabolic engineering

The Comprehensive Resource of Mammalian protein complexes

CORUM

Subject(s)

Content type(s)

Country

Germany

CORUM is a manually curated dataset of mammalian protein complexes. Annotation of protein complexes includes protein complex composition and other valuable information such as method of purification, cellular function of complexes or involvement in diseases.

GenomeRNAi

Subject(s)

Content type(s)

Country

GenomeRNAi is a database containing phenotypes from RNA interference (RNAi) screens in Drosophila and Homo sapiens. In addition, the database provides an updated resource of RNAi reagents and their predicted quality.

Cancer Genome Anatomy Project

CGAP

Subject(s)

Content type(s)

Country

United States

<<<!!!<<<Noticed 26.08.2020: The NCI CBIIT instance of the CGAP no longer exist on this website. The Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer has a new home at the NCI-funded Institute for Systems Biology Cancer Genomics Cloud available at the following location: https://mitelmandatabase.isb-cgc.org>>>!!!>>>

J. Craig Venter Institute

JCVI

Subject(s)

Content type(s)

Country

United States

JCVI is a world leader in genomic research. The Institute studies the societal implications of genomics in addition to genomics itself. The Institute's research involves genomic medicine; environmental genomic analysis; clean energy; synthetic biology; and ethics, law, and economics.

Combined QTL Map of Dairy Cattle Traits

Subject(s)

Content type(s)

Country

Australia

>>>!!! <<< 2021-09-01: repository is offline >>>!!!<<< Background: Many studies have been conducted to detect quantitative trait loci (QTL) in dairy cattle. However, these studies are diverse in terms of their differing resource populations, marker maps, phenotypes, etc, and one of the challenges is to be able to synthesise this diverse information. This web page has been constructed to provide an accessible database of studies, providing a summary of each study, facilitating an easier comparison across studies. However, it also highlights the need for uniform reporting of results of studies, to facilitate more direct comparisons being made. Description: Studies recorded in this database include complete and partial genome scans, single chromosome scans, as well as fine mapping studies, and contain all known reports that were published in peer-reviewed journals and readily available conference proceedings, initially up to April 2005. However, this data base is being added to, as indicated by the last web update. Note that some duplication of results will occur, in that there may be a number of reports on the same resource population, but utilising different marker densities or different statistical methodologies. The traits recorded in this map are milk yield, milk composition (protein yield, protein %, fat yield, fat %), and somatic cell score (SCS).

HAGR

Human Ageing Genomic Resources

Subject(s)

Content type(s)

Country

The Human Ageing Genomic Resources (HAGR) is a collection of databases and tools designed to help researchers study the genetics of human ageing using modern approaches such as functional genomics, network analyses, systems biology and evolutionary analyses.

4DGenome

Chromatin Interaction Database

Subject(s)

Content type(s)

Country

United States

4DGenome is a public database that archives and disseminates chromatin interaction data. Currently, 4DGenome contains over 8,038,247 interactions curated from both experimental studies (high throughput and individual studies) and computational predictions. It covers five organisms, Homo sapiens, Mus musculus, Drosophila melanogaster, Plasmodium falciparum, and Saccharomyces cerevisiae.

LabKey Open Research Portal

formerly: Zika Open-Research Portal

Subject(s)

Content type(s)

Country

United States

In response to emerging pathogens, LabKey launched the Open Research Portal in 2016 to help facilitate collaborative research. It was initially created as a platform for investigators to make Zika research data, commentary and results publicly available in real-time. It now includes other viruses like SARS-CoV-2 where there is a compelling need for real-time data sharing. Projects are freely available to researchers. If you are interested in sharing real-time data through the portal, please contact LabKey to get started.

Clinical Proteomic Tumor Analysis Consortium Data Portal

CPTAC Data Portal

Subject(s)

Content type(s)

Country

United States

The CPTAC Data Portal is the centralized repository for the dissemination of proteomic data collected by the Proteome Characterization Centers (PCCs) for the CPTAC program. The portal also hosts analyses of the mass spectrometry data (mapping of spectra to peptide sequences and protein identification) from the PCCs and from a CPTAC-sponsored common data analysis pipeline (CDAP).

SISSA Open Data

Scuola Internazionale Superiore di Studi Avanzati Open Data

Subject(s)

Content type(s)

Country

Italy

SISSA Open Data is the Sissa repository for the research data managment. It is an institutional repository that captures, stores, preserves, and redistributes the data of the SISSA scientific community in digital form. SISSA Open Data is managed by the SISSA Library as a service to the SISSA scientific community.

The Lafora Progressive Myoclonus Epilepsy Mutation and Polymorphism Database

The Lafora Database

Subject(s)

Content type(s)

Country

A curated database of mutations and polymorphisms associated with Lafora Progressive Myoclonus Epilepsy. The Lafora progressive myoclonus epilepsy mutation and polymorphism database is a collection of hand curated mutation and polymorphism data for the EPM2A and EPM2B (NHLRC1) from publicly available literature: databases and unpublished data. The database is continuously updated with information from in-house experimental data as well as data from published research studies.

IMGT/GENE-DB

ImMunoGeneTics / Gene-DB

Subject(s)

Content type(s)

Country

IMGT/GENE-DB is the IMGT genome database for IG and TR genes from human, mouse and other vertebrates. IMGT/GENE-DB provides a full characterization of the genes and of their alleles: IMGT gene name and definition, chromosomal localization, number of alleles, and for each allele, the IMGT allele functionality, and the IMGT reference sequences and other sequences from the literature. IMGT/GENE-DB allele reference sequences are available in FASTA format (nucleotide and amino acid sequences with IMGT gaps according to the IMGT unique numbering, or without gaps).

UCLanData

Subject(s)

Content type(s)

Country

United Kingdom

Repository of research data sets created under the auspices of the University of Central Lancashire

megx.net

Marine Ecological GenomiX

Subject(s)

Content type(s)

Country

Germany

<<<!!!<<< This repository is no longer open to the public !!! >>>!!!>>>

The Global Proteome Machine

GPM

Subject(s)

Content type(s)

Country

Canada

The Global Proteome Machine (GPM) is a protein identification database. This data repository allows users to post and compare results. GPM's data is provided by contributors like The Informatics Factory, University of Michigan, and Pacific Northwestern National Laboratories. The GPM searchable databases are: GPMDB, pSYT, SNAP, MRM, PEPTIDE and HOT.

DRYAD

Subject(s)

Content type(s)

Country

Dryad is an open data publishing platform and a community committed to the open availability and routine re-use of all research data. We publish data in any format and any discipline. All Dryad data undergoes a curation process and is published under a CC0 waiver to promote reuse.

Ensembl Fungi

e!EnsemblFungi

Subject(s)

Content type(s)

Country

EnsemblFungi is a genome-centric portal for fungal species. It is a project to maintain annotation on selected genomes.

EchoBase

an integrated post-genomic database for E.coli

Subject(s)

Content type(s)

Country

United Kingdom

EchoBase is a database that curates new experimental and bioinformatic information about the genes and gene products of the model bacterium Escherichia coli K-12 strain MG1655.

BioSample

BioSample at the NCBI

Subject(s)

Content type(s)

Country

United States

The BioSample database contains descriptions of biological source materials used in experimental assays.

Drosophila Genetic Reference Panel 2

DGRP2

Subject(s)

Content type(s)

Country

United States

The Drosophila Genetic Reference Panel (DGRP) is a population consisting of more than 200 inbred lines derived from the Raleigh, USA population. The DGRP is a living library of common polymorphisms affecting complex traits, and a community resource for whole genome association mapping of quantitative trait loci.

UniRef

UniProt Reference Clusters

Subject(s)

Content type(s)

Country

The UniProt Reference Clusters (UniRef) provide clustered sets of sequences from the UniProt Knowledgebase (including isoforms) and selected UniParc records in order to obtain complete coverage of the sequence space at several resolutions while hiding redundant sequences (but not their descriptions) from view.

DisGeNET

A knowledge base for disease genomics

Subject(s)

Content type(s)

Country

Spain

DisGeNET is a discovery platform containing one of the largest publicly available collections of genes and variants associated to human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype–phenotype relationships.

Subjects

Content Types

Countries

AID systems

API

Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning