Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 150 result(s)
GermOnline 4.0 is a cross-species database gateway focusing on high-throughput expression data relevant for germline development, the meiotic cell cycle and mitosis in healthy versus malignant cells. The portal provides access to the Saccharomyces Genomics Viewer (SGV) which facilitates online interpretation of complex data from experiments with high-density oligonucleotide tiling microarrays that cover the entire yeast genome.
The Bavarian Archive for Speech Signals (BAS) is a public institution hosted by the University of Munich. This institution was founded with the aim of making corpora of current spoken German available to both the basic research and the speech technology communities via a maximally comprehensive digital speech-signal database. The speech material will be structured in a manner allowing flexible and precise access, with acoustic-phonetic and linguistic-phonetic evaluation forming an integral part of it.
Earth System Research Laboratory (ESRL) Global Monitoring Division (GMD) provides data relating to climate change forces and models, ozone depletion and rehabilitation, and baseline air quality. Data are freely available so the public, policy makers, and scientists stay current with long-term atmospheric trends.
TPA is a database that contains sequences built from the existing primary sequence data in GenBank. TPA records are retrieved through the Nucleotide Database and feature information on the sequence, how it was cataloged, and proper way to cite the sequence information.
The repository facilitates computation of a wide range of biosystem data. It also connects biosystem data with associated literature throughout the Entrez system.
The RESID Database of Protein Modifications is a comprehensive collection of annotations and structures for protein modifications including amino-terminal, carboxyl-terminal and peptide chain cross-link post-translational modifications.
The objective of this database is to stimulate the exchange of information and the collaboration between researchers within the ChArMEx community. However, this community is not exclusive and researchers not directly involved in ChArMEx, but who wish to contribute to the achievements of ChArMEx scientific and/or educational goals are welcome to join-in. The database is a depository for all the data collected during the various projects that contribute to ChArMEx coordinated program. It aims at documenting, storing and distributing the data produced or used by the project community. However, it is also intended to host datasets that were produced outside the ChArMEx program but which are meaningful to ChArMEx scientific and/or educational goals. Any data owner who wishes to add or link his dataset to ChArMEx database is welcome to contact the database manager in order to get help and support. The ChArMEx database includes past and recent geophysical in situ observations, satellite products and model outputs. The database organizes the data management and provides data services to end-users of ChArMEx data. The database system provides a detailed description of the products and uses standardized formats whenever it is possible. It defines the access rules to the data and details the mutual rights and obligations of data providers and users (see ChArMEx data and publication policy). The database is being developed jointly by : SEDOO, OMP Toulouse , ICARE, Lille and ESPRI, IPSL Paris
HYdrological cycle in the Mediterranean EXperiemnt. Considering the science and societal issues motivating HyMeX, the programme aims to : improve our understanding of the water cycle, with emphasis on extreme events, by monitoring and modelling the Mediterranean atmosphere-land-ocean coupled system, its variability from the event to the seasonal and interannual scales, and its characteristics over one decade (2010-2020) in the context of global change, assess the social and economic vulnerability to extreme events and adaptation capacity.The multidisciplinary research and the database developed within HyMeX should contribute to: improve observational and modelling systems, especially for coupled systems, better predict extreme events, simulate the long-term water-cycle more accurately, provide guidelines for adaptation measures, especially in the context of global change.
The BBS is a cooperative effort between the U.S. Geological Survey's Patuxent Wildlife Research Center and Environment Canada's Canadian Wildlife Service to monitor the status and trends of North American bird populations. Following a rigorous protocol, BBS data are collected by thousands of dedicated participants along thousands of randomly established roadside routes throughout the continent. Professional BBS coordinators and data managers work closely with researchers and statisticians to compile and deliver these population data and population trend analyses on more than 400 bird species, for use by conservation managers, scientists, and the general public.
OceanSITES is a worldwide system of long-term, deepwater reference stations measuring dozens of variables and monitoring the full depth of the ocean from air-sea interactions down to 5,000 meters. Since 1999, the international OceanSITES science team has shared both data and costs in order to capitalize on the enormous potential of these moorings. The growing network now consists of about 30 surface and 30 subsurface arrays. Satellite telemetry enables near real-time access to OceanSITES data by scientists and the public. OceanSITES moorings are an integral part of the Global Ocean Observing System. They complement satellite imagery and ARGO float data by adding the dimensions of time and depth.
With the creation of the Metabolomics Data Repository managed by Data Repository and Coordination Center (DRCC), the NIH acknowledges the importance of data sharing for metabolomics. Metabolomics represents the systematic study of low molecular weight molecules found in a biological sample, providing a "snapshot" of the current and actual state of the cell or organism at a specific point in time. Thus, the metabolome represents the functional activity of biological systems. As with other ‘omics’, metabolites are conserved across animals, plants and microbial species, facilitating the extrapolation of research findings in laboratory animals to humans. Common technologies for measuring the metabolome include mass spectrometry (MS) and nuclear magnetic resonance spectroscopy (NMR), which can measure hundreds to thousands of unique chemical entities. Data sharing in metabolomics will include primary raw data and the biological and analytical meta-data necessary to interpret these data. Through cooperation between investigators, metabolomics laboratories and data coordinating centers, these data sets should provide a rich resource for the research community to enhance preclinical, clinical and translational research.
Originally named the Radiation Belt Storm Probes (RBSP), the mission was re-named the Van Allen Probes, following successful launch and commissioning. For simplicity and continuity, the RBSP short-form has been retained for existing documentation, file naming, and data product identification purposes. The RBSPICE investigation including the RBSPICE Instrument SOC maintains compliance with requirements levied in all applicable mission control documents.
The NCAR is a federally funded research and development center committed to research and education in atmospheric science and related scientific fields. NCAR seeks to support and enhance the scientific community nationally and globally by monitoring and researching the atmosphere and related physical and biological systems. Users can access climate and earth models created to better understand the atmosphere, the Earth and the Sun; as well as data from various NCAR research programs and projects. NCAR is sponsored by the National Science Foundation in addition to various other U.S. agencies.
Modern signal processing and machine learning methods have exciting potential to generate new knowledge that will impact both physiological understanding and clinical care. Access to data - particularly detailed clinical data - is often a bottleneck to progress. The overarching goal of PhysioNet is to accelerate research progress by freely providing rich archives of clinical and physiological data for analysis.
The Global Proteome Machine (GPM) is a protein identification database. This data repository allows users to post and compare results. GPM's data is provided by contributors like The Informatics Factory, University of Michigan, and Pacific Northwestern National Laboratories. The GPM searchable databases are: GPMDB, pSYT, SNAP, MRM, PEPTIDE and HOT.
SILVA is a comprehensive, quality-controlled web resource for up-to-date aligned ribosomal RNA (rRNA) gene sequences from the Bacteria, Archaea and Eukaryota domains alongside supplementary online services. In addition to data products, SILVA provides various online tools such as alignment and classification, phylogenetic tree calculation and viewer, probe/primer matching, and an amplicon analysis pipeline. With every full release a curated guide tree is provided that contains the latest taxonomy and nomenclature based on multiple references. SILVA is an ELIXIR Core Data Resource.
The 1000 Genomes Project is an international collaboration to produce an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts. This resource will support genome-wide association studies and other medical research studies. The genomes of about 2500 unidentified people from about 25 populations around the world will be sequenced using next-generation sequencing technologies. The results of the study will be freely and publicly accessible to researchers worldwide. The International Genome Sample Resource (IGSR) has been established at EMBL-EBI to continue supporting data generated by the 1000 Genomes Project, supplemented with new data and new analysis.
GenBank® is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.
>>>!!!<<< American FactFinder has been decommissioned and is no longer available. Data are now available at: >>>!!!<<< American FactFinder, maintained by the U.S. Census Bureau, is a source for United States population, housing, economic, and geographic data. The Census Bureau conducts nearly one hundred surveys and censuses every year. Note that by law, no one is permitted to reveal information from these censuses and surveys that could identify any person, household, or business.
The WDC has a FTP-server to distribute the PCN index derived from the geomagnetic observatory Qaanaaq (THL) and the Kp-index data products derived at the geomagnetic observatory Niemegk (NGK). The WDC is also holding extensive archives of magnetograms and other geomagnetic observatory data products that predate the introduction of digital data recording. The material is in analogue form such as film or microfiche. The Polar Cap index (abbreviation PC index) consists of the Polar Cap North (PCN) and the Polar Cap South (PCS) index, which are derived from magnetic measurements taken at the geomagnetic observatories Qaanaaq (THL, Greenland, +85o magnetic latitude) and Vostok (VOS, Antarctica, -83o magnetic latitude), respectively. The idea behind these indices is to estimate the intensity of anti-sunward plasma convection in the polar caps. This convection is associated with electric Hall currents and consequent magnetic field variations perpendicular to the antisunward plasma flow (and related Hall current) which can be monitored at the Qaanaaq and Vostok magnetic observatories. PC aims at monitoring the energy input from solar wind to the magnetosphere (loading activity). The index is constructed in such a way that it has a linear relationship with the merging Electric Field at the magnetopause; consequently PC is given in units of mV/m as for the electric field. In August 2013, the International Association of Geomagnetism and Aeronomy (IAGA) endorsed the PC index. The endorsed PC index is accessible at or through WDC Copenhagen.
FANTOM stands for 'Functional Annotation of the Mammalian Genome' and is the name of an international research consortium organized by the RIKEN Omics Science Center. The FANTOM5 project aims to build a full understanding of transcriptional regulation in a human system by generating transcriptional regulatory networks that define every human cell type.
ALEXA is a microarray design platform for 'alternative expression analysis'. This platform facilitates the design of expression arrays for analysis of mRNA isoforms generated from a single locus by the use of alternative transcription initiation, splicing and polyadenylation sites. We use the term 'ALEXA' to describe a collection of novel genomic methods for 'alternative expression' analysis. 'Alternative expression' refers to the identification and quantification of alternative mRNA transcripts produced by alternative transcript initiation, alternative splicing and alternative polyadenylation. This website provides supplementary materials, source code and other downloads for recent publications describing our studies of alternative expression (AE). Most recently we have developed a method, 'ALEXA-Seq' and associated resources for alternative expression analysis by massively parallel RNA sequencing.
dbSTS is an NCBI resource that contains sequence data for short genomic landmark sequences or Sequence Tagged Sites. STS sequences are incorporated into the STS Division of GenBank.