Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 142 result(s)
Ag Data Commons provides access to a wide variety of open data relevant to agricultural research. We are a centralized repository for data already on the web, as well as for new data being published for the first time. While compliance with the U.S. Federal public access and open data directives is important, we aim to surpass them. Our goal is to foster innovative data re-use, integration, and visualization to support bigger, better science and policy.
The Cancer Genome Atlas (TCGA) Data Portal provides a platform for researchers to search, download, and analyze data sets generated by TCGA. It contains clinical information, genomic characterization data, and high level sequence analysis of the tumor genomes. The Data Coordinating Center (DCC) is the central provider of TCGA data. The DCC standardizes data formats and validates submitted data.
<<<!!!<<<This is an archive of the old NEBC site from nebc.nerc.ac.uk and is no longer updated. For new information regarding NERC Environmental Omics and the Bio-Linux system please see the EOS site at http://environmentalomics.org. >>>!!!>>> Ongoing NEBC activities, including the development of Bio-Linux, are being moved into the new EOS programme http://environmentalomics.org/portfolio/big-data-infrastructure/ . Once the current material from this website has been moved into EOS, this NEBC site will remain on-line as an archive. EnvBase is the searchable index to the data deposited through the NEBC, as well as related NERC experimental data. At present this is chiefly from the grants funded by the NERC Environmental Genomics Science Programme and the subsequent Post-genomics and Proteomics Science Programme, but more data from ongoing projects continues to be added
Country
The National Population Health Data Center (NPHDC) is one of the 20 national science data center approved by the Ministry of Science and Technology and the Ministry of Finance. The Population Health Data Archive (PHDA) is developed by NPHDC relying on the Institute of Medical Information, Chinese Academy of Medical Sciences. PHDA mainly receives scientific data from science and technology projects supported by the national budget, and also collects data from other multiple sources such as medical and health institutions, research institutions and social individuals, which is oriented to the national big data strategy and the healthy China strategy. The data resources cover basic medicine, clinical medicine, public health, traditional Chinese medicine and pharmacy, pharmacy, population and reproduction. PHDA supports data collection, archiving, processing, storage, curation, verification, certification and release in the field of population health. Provide multiple types of data sharing and application services for different hierarchy users and help them find, access, interoperate and reuse the data in a safe and controlled environment. PHDA provides important support for promoting the open sharing of scientific data of population health and domestic and foreign cooperation.
The Registry of Open Data on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge to their users. Anyone can access these data sets from their Amazon Elastic Compute Cloud (Amazon EC2) instances and start computing on the data within minutes. Users can also leverage the entire AWS ecosystem and easily collaborate with other AWS users.
Database and knowledgebase of authenticated microbial genomics data with full data provenance to physical materials held within American Type Culture Collection's (ATCC) biorepository and culture collections. Data includes whole genome sequencing data for bacterial, viral and fungal strains at ATCC, their genome assemblies, metadata, drug susceptibility data, and more. All data is freely available for non-commercial research use only (RUO) applications via the web portal interface or via a REST-API. The goal is to provide the research community with provenance information and authentication between the biological source materials and reference genome assemblies derived from them.
Country
This repository accepts data from life science researchers and service units in Sweden. The repository is operated by SciLifeLab, which is the national infrastructure for life science and environmental research in Sweden. This repository replaces NBIS DOI repository: https://doi.org/10.17616/R3CW52
>>>>!!!!<<<< The Cancer Genomics Hub mission is now completed. The Cancer Genomics Hub was established in August 2011 to provide a repository to The Cancer Genome Atlas, the childhood cancer initiative Therapeutically Applicable Research to Generate Effective Treatments and the Cancer Genome Characterization Initiative. CGHub rapidly grew to be the largest database of cancer genomes in the world, storing more than 2.5 petabytes of data and serving downloads of nearly 3 petabytes per month. As the central repository for the foundational genome files, CGHub streamlined team science efforts as data became as easy to obtain as downloading from a hard drive. The convenient access to Big Data, and the collaborations that CGHub made possible, are now essential to cancer research. That work continues at the NCI's Genomic Data Commons. All files previously stored at CGHub can be found there. The Website for the Genomic Data Commons is here: https://gdc.nci.nih.gov/ >>>>!!!!<<<< The Cancer Genomics Hub (CGHub) is a secure repository for storing, cataloging, and accessing cancer genome sequences, alignments, and mutation information from the Cancer Genome Atlas (TCGA) consortium and related projects. Access to CGHub Data: All researchers using CGHub must meet the access and use criteria established by the National Institutes of Health (NIH) to ensure the privacy, security, and integrity of participant data. CGHub also hosts some publicly available data, in particular data from the Cancer Cell Line Encyclopedia. All metadata is publicly available and the catalog of metadata and associated BAMs can be explored using the CGHub Data Browser.
DDBJ; DNA Data Bank of Japan is the sole nucleotide sequence data bank in Asia, which is officially certified to collect nucleotide sequences from researchers and to issue the internationally recognized accession number to data submitters.Since we exchange the collected data with EMBL-Bank/EBI; European Bioinformatics Institute and GenBank/NCBI; National Center for Biotechnology Information on a daily basis, the three data banks share virtually the same data at any given time. The virtually unified database is called "INSD; International Nucleotide Sequence Database DDBJ collects sequence data mainly from Japanese researchers, but of course accepts data and issue the accession number to researchers in any other countries.
Content type(s)
Country
A small genotype data repository containing data used in recent papers from the Estonian Biocentre. Most of the data pertains to human population genetics. PDF files of the papers are also freely available.
Country
The Open Archive for Miscellaneous Data (OMIX) database is a data repository developed and maintained by the National Genomics Data Center (NGDC). The database specializes in descriptions of biological studies, including genomic, proteomic, and metabolomic, as well as data that do not fit in the structured archives at other databases in NGDC. It can accept various types of studies described via a simple format and enables researchers to upload supplementary information and link to it from the publication.
The European Genome-phenome Archive (EGA) is designed to be a repository for all types of sequence and genotype experiments, including case-control, population, and family studies. We will include SNP and CNV genotypes from array based methods and genotyping done with re-sequencing methods. The EGA will serve as a permanent archive that will archive several levels of data including the raw data (which could, for example, be re-analysed in the future by other algorithms) as well as the genotype calls provided by the submitters. We are developing data mining and access tools for the database. For controlled access data, the EGA will provide the necessary security required to control access, and maintain patient confidentiality, while providing access to those researchers and clinicians authorised to view the data. In all cases, data access decisions will be made by the appropriate data access-granting organisation (DAO) and not by the EGA. The DAO will normally be the same organisation that approved and monitored the initial study protocol or a designate of this approving organisation. The European Genome-phenome Archive (EGA) allows you to explore datasets from genomic studies, provided by a range of data providers. Access to datasets must be approved by the specified Data Access Committee (DAC).
ALSPAC is a longitudinal birth cohort study which enrolled pregnant women who were resident in one of three Bristol-based health districts in the former County of Avon with an expected delivery date between 1st April 1991 and 31st December 1992. Around 14,000 pregnant women were initially recruited. Detailed information has been collected on these women, their partners and subsequent children using self-completion questionnaires, data extraction from medical notes, linkage to routine information systems and from hands-on research clinics. Additional cohorts of participants have since been enrolled in their own right including fathers, siblings, children of the children and grandparents of the children. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee (IRB00003312) and Local Research Ethics.
GigaDB primarily serves as a repository to host data and tools associated with articles published by GigaScience Press; GigaScience and GigaByte (both are online, open-access journals). GigaDB defines a dataset as a group of files (e.g., sequencing data, analyses, imaging files, software programs) that are related to and support a unit-of-work (article or study). GigaDB allows the integration of manuscript publication with supporting data and tools.
GOLD is currently the largest repository for genome project information world-wide. The accurate and efficient genome project tracking is a vital criterion for launching new genome sequencing projects, and for avoiding significant overlap between various sequencing efforts and centers.
The Cancer Cell Line Encyclopedia project is a collaboration between the Broad Institute, and the Novartis Institutes for Biomedical Research and its Genomics Institute of the Novartis Research Foundation to conduct a detailed genetic and pharmacologic characterization of a large panel of human cancer models, to develop integrated computational analyses that link distinct pharmacologic vulnerabilities to genomic patterns and to translate cell line integrative genomics into cancer patient stratification. The CCLE provides public access to genomic data, analysis and visualization for about 1000 cell lines.
This Web resource provides data and information relevant to SARS coronavirus. It includes links to the most recent sequence data and publications, to other SARS related resources, and a pre-computed alignment of genome sequences from various isolates. In order to provide free and easy access to genome and protein sequences and associated metadata from the SARS-CoV-2, we created a dedicated Severe acute respiratory syndrome coronavirus 2 data hub. You can access the Results Table on SARS-CoV-2 data hub, by pressing "RefSeq genomes", "nucleotide" or "protein" links on announcement banner located on NCBI home page, in "Find data" navigation menu or using "Up-to-date SARS-CoV-2" shortcut button in "Search by virus" form. SARS-CoV-2 sequences is part of NCBI Virus https://www.re3data.org/repository/r3d100014322
Country
The National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), advances life & health sciences by providing open access to a suite of resources, with the aim to translate big data into big discoveries and support worldwide activities in both academia and industry.
The European Variation Archive is an open-access database of all types of genetic variation data from all species. The EVA provides access to highly detailed, granular, raw variant data from human, with other species to follow. As of September 2017, EMBL-EBI will maintain reliable accessions for non-human genetic variation data through the European Variation Archive (EVA). NCBI's dbSNP database will continue to maintain stable identifiers for human genetic variation data only. This change will enable a more rapid turnaround for data sharing in this burgeoning field.
ASAP (a systematic annotation package for community analysis of genomes) is a relational database and web interface developed to store, update and distribute genome sequence data and gene expression data collected by or in collaboration with researchers at the University of Wisconsin - Madison. ASAP was designed to facilitate ongoing community annotation of genomes and to grow with genome projects as they move from the preliminary data stage through post-sequencing functional analysis. The ASAP database includes multiple genome sequences at various stages of analysis, and gene expression data from preliminary experiments.
AmoebaDB belongs to the EuPathDB family of databases and is an integrated genomic and functional genomic database for Entamoeba and Acanthamoeba parasites. In its first iteration (released in early 2010), AmoebaDB contains the genomes of three Entamoeba species (see below). AmoebaDB integrates whole genome sequence and annotation and will rapidly expand to include experimental data and environmental isolate sequences provided by community researchers . The database includes supplemental bioinformatics analyses and a web interface for data-mining.
The Hymenoptera Genome Database is a genome informatics resource that supports the research of insects of the order Hymenoptera (e.g. bees, wasps, ants). HGD provides tools for data mining (HymenopteraMine), sequence searching (BLAST), genome browsing (JBrowse), genome annotation (Apollo) and data download. Available through the navigation bar on the HGD Home page are the archives Ant Genomes Portal, BeeBase, and NasoniaBase which will not be updated.