Search | re3data.org

Filter

Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

restricted (27)

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

non-profit (27)

Keywords

Metadata standards

PID systems

Provider types

Quality management

yes (27)

Repository languages

Software

Syndications

Repository types

Versioning

Toogle short help

* at the end of a keyword allows wildcard searches
" quotes can be used for searching phrases
+ represents an AND search (default)
| represents an OR search
- represents a NOT operation
( and ) implies priority
~N after a word specifies the desired edit distance (fuzziness)
~N after a phrase specifies the desired slop amount

← Previous
1 (current)
2
Next →

Found 27 result(s)

Array Express

functional genomics data

Subject(s)

Content type(s)

Country

ArrayExpress is one of the major international repositories for high-throughput functional genomics data from both microarray and high-throughput sequencing studies, many of which are supported by peer-reviewed publications. Data sets are submitted directly to ArrayExpress and curated by a team of specialist biological curators. In the past (until 2018) datasets from the NCBI Gene Expression Omnibus database were imported on a weekly basis. Data is collected to MIAME and MINSEQE standards.

BioProject

Subject(s)

Content type(s)

Country

United States

The BioProject database is a searcheable collection of complete and incomplete (in-progress) large-scale molecular projects including genome sequencing and assembly, transcriptome, metagenomic, annotation, expression and mapping projects. BioProject provides a central point to link to all data associated with a project in the NCBI molecular and literature databases.

Cancer Cell Line Encyclopedia

CCLE

Subject(s)

Content type(s)

Country

United States

The Cancer Cell Line Encyclopedia project is a collaboration between the Broad Institute, and the Novartis Institutes for Biomedical Research and its Genomics Institute of the Novartis Research Foundation to conduct a detailed genetic and pharmacologic characterization of a large panel of human cancer models, to develop integrated computational analyses that link distinct pharmacologic vulnerabilities to genomic patterns and to translate cell line integrative genomics into cancer patient stratification. The CCLE provides public access to genomic data, analysis and visualization for about 1000 cell lines.

Database of Interacting Proteins

DIP

Subject(s)

Content type(s)

Country

The DIP database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. The data stored within the DIP database were curated, both, manually by expert curators and also automatically using computational approaches that utilize the the knowledge about the protein-protein interaction networks extracted from the most reliable, core subset of the DIP data. Please, check the reference page to find articles describing the DIP database in greater detail. The Database of Ligand-Receptor Partners (DLRP) is a subset of DIP (Database of Interacting Proteins). The DLRP is a database of protein ligand and protein receptor pairs that are known to interact with each other. By interact we mean that the ligand and receptor are members of a ligand-receptor complex and, unless otherwise noted, transduce a signal. In some instances the ligand and/or receptor may form a heterocomplex with other ligands/receptors in order to be functional. We have entered the majority of interactions in DLRP as full DIP entries, with links to references and additional information

DRYAD

Subject(s)

Content type(s)

Country

Dryad is an open data publishing platform and a community committed to the open availability and routine re-use of all research data. We publish data in any format and any discipline. All Dryad data undergoes a curation process and is published under a CC0 waiver to promote reuse.

Ensembl Bacteria

e!EnsemblBacteria

Subject(s)

Content type(s)

Country

This site provides access to complete, annotated genomes from bacteria and archaea (present in the European Nucleotide Archive) through the Ensembl graphical user interface (genome browser). Ensembl Bacteria contains genomes from annotated INSDC records that are loaded into Ensembl multi-species databases, using the INSDC annotation import pipeline.

Ensembl Fungi

e!EnsemblFungi

Subject(s)

Content type(s)

Country

EnsemblFungi is a genome-centric portal for fungal species. It is a project to maintain annotation on selected genomes.

Ensembl Genomes

e!EnsemblGenomes

Subject(s)

Content type(s)

Country

The Ensembl genome annotation system, developed jointly by the EBI and the Wellcome Trust Sanger Institute, has been used for the annotation, analysis and display of vertebrate genomes since 2000. Since 2009, the Ensembl site has been complemented by the creation of five new sites, for bacteria, protists, fungi, plants and invertebrate metazoa, enabling users to use a single collection of (interactive and programatic) interfaces for accessing and comparing genome-scale data from species of scientific interest from across the taxonomy. In each domain, we aim to bring the integrative power of Ensembl tools for comparative analysis, data mining and visualisation across genomes of scientific interest, working in collaboration with scientific communities to improve and deepen genome annotation and interpretation.

Ensembl Metazoa

e!EnsemblMetazoa

Subject(s)

Content type(s)

Country

Ensembl Metazoa is a genome-centric portal for metazoan species of scientific interest.

Ensembl Protists

e!EnsemblProtists

Subject(s)

Content type(s)

Country

EnsemblProtists is a genome-centric portal for protists species.

GenBank

Subject(s)

Content type(s)

Country

GenBank® is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.

Gene Expression Omnibus

GEO

Subject(s)

Content type(s)

Country

United States

Gene Expression Omnibus: a public functional genomics data repository supporting MIAME-compliant data submissions. Array- and sequence-based data are accepted. Tools are provided to help users query and download experiments and curated gene expression profiles.

Genomic Observatories MetaDatabase

GEOME

Subject(s)

Content type(s)

Country

The Genomic Observatories Meta-Database (GEOME) is a web-based database that captures the who, what, where, and when of biological samples and associated genetic sequences. GEOME helps users with the following goals: ensure the metadata from your biological samples is findable, accessible, interoperable, and reusable; improve the quality of your data and comply with global data standards; and integrate with R, ease publication to NCBI's sequence read archive, and work with an associated LIMS. The initial use case for GEOME came from the Diversity of the Indo-Pacific Network (DIPnet) resource.

HUGO Gene Nomenclature Committee

HGNC

Subject(s)

Content type(s)

Country

The HUGO Gene Nomenclature Committee (HGNC) assigned unique gene symbols and names to over 35,000 human loci, of which around 19,000 are protein coding. This curated online repository of HGNC-approved gene nomenclature and associated resources includes links to genomic, proteomic and phenotypic information, as well as dedicated gene family pages.

J. Craig Venter Institute

JCVI

Subject(s)

Content type(s)

Country

United States

JCVI is a world leader in genomic research. The Institute studies the societal implications of genomics in addition to genomics itself. The Institute's research involves genomic medicine; environmental genomic analysis; clean energy; synthetic biology; and ethics, law, and economics.

NCBI

National Center for Biotechnology Information

Subject(s)

Content type(s)

Country

United States

The National Center for Biotechnology Information advances science and health by providing access to biomedical and genomic information

NCBI Gene

Gene

Subject(s)

Content type(s)

Country

United States

The Gene database provides detailed information for known and predicted genes defined by nucleotide sequence or map position. Gene supplies gene-specific connections in the nexus of map, sequence, expression, structure, function, citation, and homology data. Unique identifiers are assigned to genes with defining sequences, genes with known map positions, and genes inferred from phenotypic information. These gene identifiers are used throughout NCBI's databases and tracked through updates of annotation. Gene includes genomes represented by NCBI Reference Sequences (or RefSeqs) and is integrated for indexing and query and retrieval from NCBI's Entrez and E-Utilities systems.

NCBI Nucleotide

Subject(s)

Content type(s)

Country

United States

The NCBI Nucleotide database collects sequences from such sources as GenBank, RefSeq, TPA, and PDB. Sequences collected relate to genome, gene, and transcript sequence data, and provide a foundation for research related to the biomedical field.

NCBI PopSet

Subject(s)

Content type(s)

Country

United States

NCBI PopSet collects DNA sequences to analyze the ways that populations are related by evolution. Such sequences indicate if populations originate from different members of the same species or from organisms of different species entirely.

NCBI Protein

Subject(s)

Content type(s)

Country

United States

The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and function.

NCBI Virus

Subject(s)

Content type(s)

Country

United States

NCBI Virus is a community portal for viral sequence data from RefSeq, GenBank and other NCBI repositories. To find, retrieve and analyze data, choose one of the offered options.

PhenoGen Informatics

PhenoGen

Subject(s)

Content type(s)

Country

United States

The PhenoGen website shares experimental data with a worldwide community of investigators and provides a flexible, integrated, multi-resolution repository of neuroscience transcriptomic genetic data for collaborative research on genomic disorders. The main development focus is on providing Hybrid Rat Diversity Panel transcriptomic data (sequencing, genome coverage, reconstructed totalRNA/smallRNA transcriptomes, quanification of the transcriptome, eQTLs, and WGCNA) and integrating additional tools to provide platform for visualization and analysis of HRDP transcriptome data.

TB Database

Tuberculosis Database

Subject(s)

Content type(s)

Country

United States

The repository is no longer available. >>>!!!<<< 2018-08-29: no more access to TB Database >>>!!!<<<

The Universal Protein Resource

UniProt

Subject(s)

Content type(s)

Country

The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and annotation data. The UniProt databases are the UniProt Knowledgebase (UniProtKB), the UniProt Reference Clusters (UniRef), and the UniProt Archive (UniParc).

UniGene

Subject(s)

Content type(s)

Country

United States

<<<!!!<<< This repository is no longer available>>>!!!>>>. Although the web pages are no longer available, you will still be able to download the final UniGene builds as static content from the FTP site https://ftp.ncbi.nlm.nih.gov/repository/UniGene/. You will also be able to match UniGene cluster numbers to Gene records by searching Gene with UniGene cluster numbers. For best results, restrict to the “UniGene Cluster Number” field rather than all fields in Gene. For example, a search with Mm.2108[UniGene Cluster Number] finds the mouse transthyretin Gene record (Ttr). You can use the advanced search page https://www.ncbi.nlm.nih.gov/gene/advanced to help construct these searches. Keep in mind that the Gene record contains selected Reference Sequences and GenBank mRNA sequences rather than the larger set of expressed sequences in the UniGene cluster.

← Previous
1 (current)
2
Next →

Current projects
EOSC FAIR-IMPACT

re3data COREF

To the extent possible under law, re3data.org has waived all copyright and related or neighboring rights to the database entries of re3data.org.
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 4.0 International License .
Cite this service: re3data.org - Registry of Research Data Repositories. https://doi.org/10.17616/R3D last accessed: 2024-05-14