Search | re3data.org

NCBI Influenza Virus Resource

NCBI Flu

Subject(s)

Content type(s)

Country

United States

This resource allows users to search for and compare influenza virus genomes and gene sequences taken from GenBank. It also provides a virus sequence annotation tool and links to other influenza resources: NIAID project, JCVI Flu, Influenza research database, CDC Flu, Vaccine Selection and WHO Flu. NOTE: The redirects that are planned for completion by May 2024 will NOT impact the Influenza Virus Resource in any way. The Influenza Virus Resource will continue to be available, serving up data to support our Flu-research community.

NCBI Protein Clusters

Subject(s)

Content type(s)

Country

United States

The Entrez Protein Clusters database contains annotation information, publications, structures and analysis tools for related protein sequences encoded by complete genomes. The data available in the Protein Clusters Database is generated from prokaryotic genomic studies and is intended to assist researchers studying micro-organism evolution as well as other biological sciences. Available genomes include plants and viruses as well as organelles and microbial genomes.

INDEPTH Data Repository

International Network for the Demographic Evaluation of Populations and Their Health Data Repository

Subject(s)

Content type(s)

Country

INDEPTH is a global network of research centres that conduct longitudinal health and demographic evaluation of populations in low- and middle-income countries (LMICs). INDEPTH aims to strengthen global capacity for Health and Demographic Surveillance Systems (HDSSs), and to mount multi-site research to guide health priorities and policies in LMICs, based on up-to-date scientific evidence. The data collected by the INDEPTH Network members constitute a valuable resource of population and health data for LMIC countries. This repository aims to make well documented anonymised longitudinal microdata from these Centres available to data users.

datastore by Universität Münster

Subject(s)

Content type(s)

Country

Germany

datastore is the cross-domain research data repository of the University Münster (Germany). In datastore, scientific members of the University Münster can publish their research data following the FAIR principles, including the assignment of a DOI for each dataset as a persistent identifier.

NCBI Clone DB

NCBI clone registry

Subject(s)

Content type(s)

Country

United States

<<<!!!<<< NCBI announced plans to retire the Clone DB web interface. Pursuant to this retirement, starting on May 27, 2019, all web pages associated with Clone DB and CloneFinder will redirect to this blog post https://ncbiinsights.ncbi.nlm.nih.gov/?s=clone+db. Links to Clone DB from the NCBI home page will also be going away. >>>!!!>>>

CancerData.org

Sharing data for cancer research

Subject(s)

Content type(s)

Country

Netherlands

The CancerData site is an effort of the Medical Informatics and Knowledge Engineering team (MIKE for short) of Maastro Clinic, Maastricht, The Netherlands. Our activities in the field of medical image analysis and data modelling are visible in a number of projects we are running. CancerData is offering several datasets. They are grouped in collections and can be public or private. You can search for public datasets in the NBIA (National Biomedical Imaging Archive) image archives without logging in.

Electron Microscopy Public Image Archive

EMPIAR

Subject(s)

Content type(s)

Country

EMPIAR, the Electron Microscopy Public Image Archive, is a public resource for raw, 2D electron microscopy images. Here, you can browse, upload, download and reprocess the thousands of raw, 2D images used to build a 3D structure. The purpose of EMPIAR is to provide an easy access to the state-of-the-art raw data to facilitate methods development and validation, which will lead to better 3D structures. It complements the Electron Microscopy Data Bank (EMDB), where 3D images are stored, and uses the fault-tolerant Aspera platform for data transfers

International Mouse Phenotyping Consortium

IMPC

Subject(s)

Content type(s)

Country

The IMPC is a confederation of international mouse phenotyping projects working towards the agreed goals of the consortium: To undertake the phenotyping of 20,000 mouse mutants over a ten year period, providing the first functional annotation of a mammalian genome. Maintain and expand a world-wide consortium of institutions with capacity and expertise to produce germ line transmission of targeted knockout mutations in embryonic stem cells for 20,000 known and predicted mouse genes. Test each mutant mouse line through a broad based primary phenotyping pipeline in all the major adult organ systems and most areas of major human disease. Through this activity and employing data annotation tools, systematically aim to discover and ascribe biological function to each gene, driving new ideas and underpinning future research into biological systems; Maintain and expand collaborative “networks” with specialist phenotyping consortia or laboratories, providing standardized secondary level phenotyping that enriches the primary dataset, and end-user, project specific tertiary level phenotyping that adds value to the mammalian gene functional annotation and fosters hypothesis driven research; and Provide a centralized data centre and portal for free, unrestricted access to primary and secondary data by the scientific community, promoting sharing of data, genotype-phenotype annotation, standard operating protocols, and the development of open source data analysis tools. Members of the IMPC may include research centers, funding organizations and corporations.

Ensembl Metazoa

e!EnsemblMetazoa

Subject(s)

Content type(s)

Country

Ensembl Metazoa is a genome-centric portal for metazoan species of scientific interest.

IntAct

IntAct Molecular Interaction Database

Subject(s)

Content type(s)

Country

IntAct provides a freely available, open source database system and analysis tools for molecular interaction data. All interactions are derived from literature curation or direct user submissions and are freely available.

Mouse Genome Informatics

MGI

Subject(s)

Content type(s)

Country

United States

MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease. The projects contributing to this resource are: Mouse Genome Database (MGD) Project, Gene Expression Database (GXD) Project, Mouse Tumor Biology (MTB) Database Project, Gene Ontology (GO) Project at MGI, MouseMine Project, MouseCyc Project at MGI

LSHTM Data Compass

Subject(s)

Content type(s)

Scientific and statistical data formats

Country

United Kingdom

LSHTM Data Compass is a curated digital repository of research outputs that have been produced by staff and students at the London School of Hygiene & Tropical Medicine and their collaborators. It is used to share outputs intended for reuse, including: qualitative and quantitative data, software code and scripts, search strategies, and data collection tools.

UniProtKB

UniProtKnowledgebase

Subject(s)

Content type(s)

Country

The UniProt Knowledgebase (UniProtKB) is the central hub for the collection of functional information on proteins, with accurate, consistent and rich annotation. In addition to capturing the core data mandatory for each UniProtKB entry (mainly, the amino acid sequence, protein name or description, taxonomic data and citation information), as much annotation information as possible is added. This includes widely accepted biological ontologies, classifications and cross-references, and clear indications of the quality of annotation in the form of evidence attribution of experimental and computational data. The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and annotation data. The UniProt databases are the UniProt Knowledgebase (UniProtKB), the UniProt Reference Clusters (UniRef), and the UniProt Archive (UniParc). The UniProt Metagenomic and Environmental Sequences (UniMES) database is a repository specifically developed for metagenomic and environmental data. The UniProt Knowledgebase,is an expertly and richly curated protein database, consisting of two sections called UniProtKB/Swiss-Prot and UniProtKB/TrEMBL.

CSIRO Data Access Portal

CSIRO DAP

Subject(s)

Content type(s)

Country

Australia

The CSIRO Data Access Portal provides access to data published by CSIRO across a range of disciplines to facilitate sharing and reuse of data held by CSIRO.

Broad GDAC Firehose

Genome Data Analysis Center

Subject(s)

Content type(s)

Country

United States

Born of the desire to systematize analyses from The Cancer Genome Atlas pilot and scale their execution to the dozens of remaining diseases to be studied, GDAC Firehose now sits atop terabytes of analysis-ready TCGA data and reliably executes thousands of pipelines per month. More information: https://broadinstitute.atlassian.net/wiki/spaces/GDAC/

Integrated Relational Enzyme database

IntEnz

Subject(s)

Content type(s)

Country

IntEnz contains the recommendation of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology on the nomenclature and classification of enzyme-catalyzed reactions. Users can browse by enzyme classification or use advanced search options to search enzymes by class, subclass and sub-subclass information.

TB Database

Tuberculosis Database

Subject(s)

Content type(s)

Country

United States

The repository is no longer available. >>>!!!<<< 2018-08-29: no more access to TB Database >>>!!!<<<

Marine Microbial Database of India

bioSearch

Subject(s)

Content type(s)

Plain text

Country

India

>>>!!!<<< This repository is no longer available >>>!!!<<< Marine Microbial Database of India is an initiative of CSIR National Institute of Oceanography (NIO). It is supported by Council of Scientific and Industrial Research (CSIR) and managed by Biodiversity Informatics Group (BIG), Bioinformatics Centre of the NIO. It contains records about 1,814 marine microbes. Each record provides information on microbe’s location, habitat, importance (of the organism), threats (to the organism). The database also provides a Taxonomic Hierarchy and Scientific Name Index.

Human Proteinpedia

Subject(s)

Content type(s)

Country

Human Proteinpedia is a community portal for sharing and integration of human protein data. This is a joint project between Pandey at Johns Hopkins University, and Institute of Bioinformatics, Bangalore. This portal allows research laboratories around the world to contribute and maintain protein annotations. Human Protein Reference Database (HPRD) integrates data, that is deposited in Human Proteinpedia along with the existing literature curated information in the context of an individual protein. All the public data contributed to Human Proteinpedia can be queried, viewed and downloaded. Data pertaining to post-translational modifications, protein interactions, tissue expression, expression in cell lines, subcellular localization and enzyme substrate relationships may be deposited.

Greengenes

The Greengenes Database

Subject(s)

Content type(s)

Country

Greengenes is an Earth Sciences website that assists clinical and environmental microbiologists from around the globe in classifying microorganisms from their local environments. A 16S rRNA gene database addresses limitations of public repositories by providing chimera screening, standard alignment, and taxonomic classification using multiple published taxonomies.

MyTardis

Tardis

Subject(s)

Content type(s)

Country

Australia

MyTardis began at Monash University to solve the problem of users needing to store large datasets and share them with collaborators online. Its particular focus is on integration with scientific instruments, instrument facilities and research lab file storage. Our belief is that the less effort a researcher has to expend safely storing data, the more likely they are to do so. This approach has flourished with MyTardis capturing data from areas such as protein crystallography, electron microscopy, medical imaging and proteomics and with deployments at Australian institutions such as University of Queensland, RMIT, University of Sydney and the Australian Synchrotron. Data access via https://www.massive.org.au/ and https://store.erc.monash.edu.au/experiment/view/104/ and see 'remarks'.

Pseudobase

A Pseudoknot Database

Subject(s)

Content type(s)

Country

Netherlands

Since the first discovery of RNA pseudoknots more and many more pseudoknots have been found. However, not all of those pseudoknot data are easy to trace. Sometimes the information is hidden in a publication where the title gives no hint that pseudoknot information is there. This was the first reason that we thought that a general accessible information source for pseudoknots would be handy.

PeptideAtlas

Subject(s)

Content type(s)

Country

The PeptideAtlas validates expressed proteins to provide eukaryotic genome data. Peptide Atlas provides data to advance biological discoveries in humans. The PeptideAtlas accepts proteomic data from high-throughput processes and encourages data submission.

ChIP-Seq Transcription Factor Data

ChIP-Seq

Subject(s)

Content type(s)

Country

Canada

We developed a method, ChIP-sequencing (ChIP-seq), combining chromatin immunoprecipitation (ChIP) and massively parallel sequencing to identify mammalian DNA sequences bound by transcription factors in vivo. We used ChIP-seq to map STAT1 targets in interferon-gamma (IFN-gamma)-stimulated and unstimulated human HeLa S3 cells, and compared the method's performance to ChIP-PCR and to ChIP-chip for four chromosomes.For both Chromatin- immunoprecipation Transcription Factors and Histone modifications. Sequence files and the associated probability files are also provided.

AceView

AceView genes

Subject(s)

Content type(s)

Country

United States

AceView provides a curated, comprehensive and non-redundant sequence representation of all public mRNA sequences (mRNAs from GenBank or RefSeq, and single pass cDNA sequences from dbEST and Trace). These experimental cDNA sequences are first co-aligned on the genome then clustered into a minimal number of alternative transcript variants and grouped into genes. Using exhaustively and with high quality standards the available cDNA sequences evidences the beauty and complexity of mammals’ transcriptome, and the relative simplicity of the nematode and plant transcriptomes. Genes are classified according to their inferred coding potential; many presumably non-coding genes are discovered. Genes are named by Entrez Gene names when available, else by AceView gene names, stable from release to release. Alternative features (promoters, introns and exons, polyadenylation signals) and coding potential, including motifs, domains, and homologies are annotated in depth; tissues where expression has been observed are listed in order of representation; diseases, phenotypes, pathways, functions, localization or interactions are annotated by mining selected sources, in particular PubMed, GAD and Entrez Gene, and also by performing manual annotation, especially in the worm. In this way, both the anatomy and physiology of the experimentally cDNA supported human, mouse and nematode genes are thoroughly annotated.

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning