Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 173 result(s)
The European Nucleotide Archive (ENA) captures and presents information relating to experimental workflows that are based around nucleotide sequencing. A typical workflow includes the isolation and preparation of material for sequencing, a run of a sequencing machine in which sequencing data are produced and a subsequent bioinformatic analysis pipeline. ENA records this information in a data model that covers input information (sample, experimental setup, machine configuration), output machine data (sequence traces, reads and quality scores) and interpreted information (assembly, mapping, functional annotation). Data arrive at ENA from a variety of sources. These include submissions of raw data, assembled sequences and annotation from small-scale sequencing efforts, data provision from the major European sequencing centres and routine and comprehensive exchange with our partners in the International Nucleotide Sequence Database Collaboration (INSDC). Provision of nucleotide sequence data to ENA or its INSDC partners has become a central and mandatory step in the dissemination of research findings to the scientific community. ENA works with publishers of scientific literature and funding bodies to ensure compliance with these principles and to provide optimal submission systems and data access tools that work seamlessly with the published literature.
The dbVar is a database of genomic structural variation containing data from multiple gene studies. Users can browse data containing the number of variant cells from each study, and filter studies by organism, study type, method and genomic variant. Organisms include human, mouse, cattle and several additional animals. ***NCBI will phase out support for non-human organism data in dbSNP and dbVar beginning on September 1, 2017 ***
The Exome Aggregation Consortium (ExAC) is a coalition of investigators seeking to aggregate and harmonize exome sequencing data from a wide variety of large-scale sequencing projects, and to make summary data available for the wider scientific community. The data set provided on this website spans 60,706 unrelated individuals sequenced as part of various disease-specific and population genetic studies.
BioMagResBank (BMRB) is the publicly-accessible depository for NMR results from peptides, proteins, and nucleic acids recognized by the International Society of Magnetic Resonance and by the IUPAC-IUBMB-IUPAB Inter-Union Task Group on the Standardization of Data Bases of Protein and Nucleic Acid Structures Determined by NMR Spectroscopy. In addition, BMRB provides reference information and maintains a collection of NMR pulse sequences and computer software for biomolecular NMR
Satellite-tracked drifting buoys ("drifters") collect measurements of upper ocean currents and sea surface temperatures (SST) around the world as part of the Global Drifter Program. Drifter locations are estimated from 16-20 satellite fixes per day, per drifter. The Drifter Data Assembly Center (DAC) at NOAA's Atlantic Oceanographic and Meteorological Laboratory (AOML) assembles these raw data, applies quality control procedures, and interpolates them via kriging to regular six-hour intervals. The raw observations and processed data are archived at AOML and at the Marine Environmental Data Services (MEDS) in Canada. Two types of data are available: "metadata" contains deployment location and time, time of drogue (sea anchor) loss, date of final transmission, etc. for each drifter. "Interpolated data" contains the quality-controlled, interpolated drifter observations.
The Global Proteome Machine (GPM) is a protein identification database. This data repository allows users to post and compare results. GPM's data is provided by contributors like The Informatics Factory, University of Michigan, and Pacific Northwestern National Laboratories. The GPM searchable databases are: GPMDB, pSYT, SNAP, MRM, PEPTIDE and HOT.
Summit Station is a US National Science Foundation-funded research station on the Greenland ice cap. The website offers near-real time weather summaries and a webcam. Other data associated with the Summit Station can be found through the International Arctic Systems for Observing the Atmosphere (IASOA) website: or from Greenland Environmental Observatory at
The NCEP/NCAR Reanalysis Project is a joint project between the National Centers for Environmental Prediction (NCEP, formerly "NMC") and the National Center for Atmospheric Research (NCAR). The goal of this joint effort is to produce new atmospheric analyses using historical data (1948 onwards) and as well to produce analyses of the current atmospheric state (Climate Data Assimilation System, CDAS).
SkyView is a Virtual Observatory on the Net generating images of any part of the sky at wavelengths in all regimes from Radio to Gamma-Ray.
SMOKA provides public science data obtained at Subaru Telescope, 188cm telescope at Okayama Astrophysical Observatory, 105cm Schmidt telescope at Kiso Observatory (University of Tokyo), MITSuME, and KANATA Telescope at Higashi-Hiroshima Observatory. It is intended mainly for astronomical researchers.
On June 1, 1990 the German X-ray observatory ROSAT started its mission to open a new era in X-ray astronomy. Doubtless, this is the most ambitious project realized up to now in the short history of this young astronomical discipline. Equipped with the largest imaging X-ray telescope ever inserted into an earth orbit ROSAT has provided a tremendous amount of new scientific data and insights.
Measurements Of Pollution In The Troposphere (MOPITT) was launched into sun-synchronous polar orbit on December 18, 1999, aboard TERRA, a NASA satellite orbiting 705 km above the Earth. MOPITT monitors changes in pollution patterns and the effects on Earth’s troposphere. MOPITT uses near-infrared radiation at 2.3 µm and thermal-infrared radiation at 4.7 µm to calculate atmospheric profiles of CO.
OceanSITES is a worldwide system of long-term, deepwater reference stations measuring dozens of variables and monitoring the full depth of the ocean from air-sea interactions down to 5,000 meters. Since 1999, the international OceanSITES science team has shared both data and costs in order to capitalize on the enormous potential of these moorings. The growing network now consists of about 30 surface and 30 subsurface arrays. Satellite telemetry enables near real-time access to OceanSITES data by scientists and the public. OceanSITES moorings are an integral part of the Global Ocean Observing System. They complement satellite imagery and ARGO float data by adding the dimensions of time and depth.
The Solar Dynamics Observatory (SDO) studies the solar atmosphere on small scales of space and time, in multiple wavelengths. This is a searchable database of all SDO data, including citizen scientist images, space weather and near real time data, and helioseismology data.
The Conserved Domain Database is a resource for the annotation of functional units in proteins. Its collection of domain models includes a set curated by NCBI, which utilizes 3D structure to provide insights into sequence/structure/function relationships
The Solar Data Analysis Center serves data from recent and current space-based solar-physics missions, funds and hosts much of the SolarSoft library, and leads the Virtual Solar Observatory (VSO) effort. SDAC is the active archive, providing network access to data from such missions as SOHO, Yohkoh, and TRACE.
Earth System Research Laboratory (ESRL) Global Monitoring Division (GMD) provides data relating to climate change forces and models, ozone depletion and rehabilitation, and baseline air quality. Data are freely available so the public, policy makers, and scientists stay current with long-term atmospheric trends.
The Coastal Data Information Program (CDIP) is an extensive network for monitoring waves and beaches along the coastlines of the United States. Since its inception in 1975, the program has produced a vast database of publicly-accessible environmental data for use by coastal engineers and planners, scientists, mariners, and marine enthusiasts. The program has also remained at the forefront of coastal monitoring, developing numerous innovations in instrumentation, system control and management, computer hardware and software, field equipment, and installation techniques.
GenBank® is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.
GermOnline 4.0 is a cross-species database gateway focusing on high-throughput expression data relevant for germline development, the meiotic cell cycle and mitosis in healthy versus malignant cells. The portal provides access to the Saccharomyces Genomics Viewer (SGV) which facilitates online interpretation of complex data from experiments with high-density oligonucleotide tiling microarrays that cover the entire yeast genome.
The GeneDB project is a core part of the Sanger Institute's Pathogen Genomics activities. Its primary goals are: to provide reliable access to the latest sequence data and annotation/curation for the whole range of organisms sequenced by the Pathogen group. to develop the website and other tools to aid the community in accessing and obtaining the maximum value from these data.
HYdrological cycle in the Mediterranean EXperiemnt. Considering the science and societal issues motivating HyMeX, the programme aims to : improve our understanding of the water cycle, with emphasis on extreme events, by monitoring and modelling the Mediterranean atmosphere-land-ocean coupled system, its variability from the event to the seasonal and interannual scales, and its characteristics over one decade (2010-2020) in the context of global change, assess the social and economic vulnerability to extreme events and adaptation capacity.The multidisciplinary research and the database developed within HyMeX should contribute to: improve observational and modelling systems, especially for coupled systems, better predict extreme events, simulate the long-term water-cycle more accurately, provide guidelines for adaptation measures, especially in the context of global change.