  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
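The operators listed above can be combined into a single query string. The following is a minimal sketch, assuming the search box accepts the operators exactly as listed; the helper names (`prefix`, `phrase`, `fuzzy`, `all_of`, `any_of`, `exclude`) are hypothetical and only assemble text, they do not call any search API.

```python
# Hypothetical helpers that assemble a query string from the operators
# listed above. They only build text; nothing here contacts a search service.

def prefix(word):
    """Wildcard search: '*' at the end of a keyword."""
    return f"{word}*"

def phrase(text, slop=None):
    """Phrase search in quotes, optionally with '~N' slop."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"

def fuzzy(word, distance):
    """'~N' after a word sets the edit distance (fuzziness)."""
    return f"{word}~{distance}"

def all_of(*terms):
    """AND search ('+'), grouped with parentheses for priority."""
    return "(" + " + ".join(terms) + ")"

def any_of(*terms):
    """OR search ('|'), grouped with parentheses for priority."""
    return "(" + " | ".join(terms) + ")"

def exclude(term):
    """NOT operation ('-')."""
    return f"-{term}"

# Example: (a wildcard OR a sloppy phrase) AND a fuzzy word, NOT "retired".
query = " ".join([
    all_of(
        any_of(prefix("genom"), phrase("ocean profile", slop=2)),
        fuzzy("magnetic", 1),
    ),
    exclude("retired"),
])
print(query)  # ((genom* | "ocean profile"~2) + magnetic~1) -retired
```

The parentheses are added on every group so that priority stays explicit even when groups are nested; whether the search box tolerates the spaces around `+` and `|` is an assumption of this sketch.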
Found 333 result(s)
Peptidome was a public repository that archived tandem mass spectrometry peptide and protein identification data generated by the scientific community. This repository is now offline and is in archival mode. All data may be obtained from the Peptidome FTP site. Due to budgetary constraints NCBI has discontinued the Peptidome Repository. All existing data and metadata files will continue to be made available from our FTP server indefinitely. Those files are named according to their Peptidome accession number, allowing cited data to be identified and downloaded. All of the Peptidome studies have been made publicly available at the PRoteomics IDEntifications (PRIDE) database. A map of Peptidome to PRIDE accessions is also available.
The Netlib repository contains freely available software, documents, and databases of interest to the numerical, scientific computing, and other communities.
The Department of Energy (DOE) Joint Genome Institute (JGI) is a national user facility with massive-scale DNA sequencing and analysis capabilities dedicated to advancing genomics for bioenergy and environmental applications. Beyond generating tens of trillions of DNA bases annually, the Institute develops and maintains data management systems and specialized analytical capabilities to manage and interpret complex genomic data sets, and to enable an expanding community of users around the world to analyze these data in different contexts over the web. The JGI Genome Portal provides a unified access point to all JGI genomic databases and analytical tools. A user can find all DOE JGI sequencing projects and their status, search for and download assemblies and annotations of sequenced genomes, and interactively explore those genomes and compare them with other sequenced microbes, fungi, plants or metagenomes using specialized systems tailored to each particular class of organisms. Databases: Genome Online Database (GOLD), Integrated Microbial Genomes (IMG), MycoCosm, Phytozome
The HEASARC is a multi-mission astronomy archive for the EUV, X-ray, and Gamma ray wave bands. Because EUV, X and Gamma rays cannot reach the Earth's surface it is necessary to place the telescopes and sensors on spacecraft. The HEASARC now holds the data from 25 observatories covering over 30 years of X-ray, extreme-ultraviolet and gamma-ray astronomy. Data and software from many of the older missions were restored by the HEASARC staff. Examples of these archived missions include ASCA, BeppoSAX, Chandra, Compton GRO, HEAO 1, Einstein Observatory (HEAO 2), EUVE, EXOSAT, HETE-2, INTEGRAL, ROSAT, Rossi XTE, Suzaku, Swift, and XMM-Newton.
Welcome to INTERMAGNET - the global network of observatories, monitoring the Earth's magnetic field. At this site you can find data and information from geomagnetic observatories around the world. The INTERMAGNET programme exists to establish a global network of cooperating digital magnetic observatories, adopting modern standard specifications for measuring and recording equipment, in order to facilitate data exchanges and the production of geomagnetic products in close to real time.
The nuclear reaction database ENDF contains evaluated (recommended) cross sections, spectra, angular distributions, fission product yields, photo-atomic and thermal scattering law data, with emphasis on neutron-induced reactions. The evaluated nuclear reaction databases, ENDF, store nuclear data from the major evaluated libraries: ENDF (Evaluated Nuclear Data File, U.S. and Canada); JEFF (Joint Evaluated Fission and Fusion File, Nuclear Energy Agency); JENDL (Japanese Evaluated Nuclear Data Library, Japan); CENDL (China Evaluated Nuclear Data Library, China); and BROND (Library of Recommended Evaluated Neutron Data, Russia), as well as various special-purpose evaluated libraries.
The Community Data Portal (CDP) has been retired after nearly 15 years of service and is no longer available. Data can now be found through DASH Search. The Community Data Portal (CDP) was a collection of earth science datasets from NCAR, UCAR, UOP, and participating organizations.
The World Ocean Database (WOD) is a collection of scientifically quality-controlled ocean profile and plankton data that includes measurements of temperature, salinity, oxygen, phosphate, nitrate, silicate, chlorophyll, alkalinity, pH, pCO2, TCO2, Tritium, Δ13Carbon, Δ14Carbon, Δ18Oxygen, Freon, Helium, Δ3Helium, Neon, and plankton. WOD contains all data of "World Data Service Oceanography" (WDS-Oceanography).
The European Monitoring and Evaluation Programme (EMEP) is a scientifically based and policy driven programme under the Convention on Long-range Transboundary Air Pollution (CLRTAP) for international co-operation to solve transboundary air pollution problems.
The Northern California Earthquake Data Center (NCEDC) is a permanent archive and distribution center primarily for multiple types of digital data relating to earthquakes in central and northern California. The NCEDC is located at the Berkeley Seismological Laboratory, and has been accessible to users via the Internet since mid-1992. The NCEDC was formed as a joint project of the Berkeley Seismological Laboratory (BSL) and the U.S. Geological Survey (USGS) at Menlo Park in 1991, and current USGS funding is provided under a cooperative agreement for seismic network operations.
Welcome to the largest bibliographic database dedicated to Economics and available freely on the Internet. This site is part of a large volunteer effort to enhance the free dissemination of research in Economics, RePEc, which includes bibliographic metadata from over 1,800 participating archives, including all the major publishers and research outlets. IDEAS is just one of several services that use RePEc data. Authors are invited to register with RePEc to create an online profile. Then, anyone finding some of your research here can find your latest contact details and a listing of your other research. You will also receive a monthly mailing about the popularity of your works, your ranking and newly found citations. In addition, IDEAS provides software and publicly accessible data from the Federal Reserve Bank.
The UniProtKB Sequence/Annotation Version Archive (UniSave) has the mission of providing freely to the scientific community a repository containing every version of every Swiss-Prot/TrEMBL entry in the UniProt Knowledge Base (UniProtKB). This is achieved by archiving, every release, the entry versions within the current release. The primary usage of this service is to provide open access to all entry versions of all entries. In addition to viewing their content, one can also filter, download and compare versions.
The species included in PlantTFDB 4.0 cover the main lineages of green plants. PlantTFDB therefore provides genomic TF repertoires across Viridiplantae. To provide comprehensive information for each TF family, a brief introduction and key references are presented for each family. Comprehensive annotations are made for each identified TF, including functional domains, 3D structures, gene ontology (GO), plant ontology (PO), expression information, expert-curated functional description, regulation information, interaction, conserved elements, references, and annotations in various databases such as UniProt, RefSeq, TransFac, STRING, and VISTA. Evolutionary relationships among identified TFs were inferred from orthologous groups and phylogenetic trees. In addition, PlantTFDB has a simple and user-friendly interface that allows users to query based on combined conditions or run sequence similarity searches using BLAST.
The Census Bureau releases TIGER/Line shapefiles and metadata each year to the public. TIGER/Line shapefiles are spatial extracts from the Census Bureau’s MAF/TIGER database. They contain features such as roads, railroads, hydrographic features and legal and statistical boundaries.
The Satellite Application Facility on Climate Monitoring (CM SAF) develops, produces, archives and disseminates satellite-data-based products in support of climate monitoring. The product suite mainly covers parameters related to the energy & water cycle and addresses many of the Essential Climate Variables as defined by GCOS (GCOS 138). The CM SAF produces both Environmental Data Records and Climate Data Records.
The mission of NCHS is to provide statistical information that will guide actions and policies to improve the health of the American people. As the Nation's principal health statistics agency, NCHS is responsible for collecting accurate, relevant, and timely data. NCHS' mission, and those of its counterparts in the Federal statistics system, focuses on the collection, analysis, and dissemination of information that is of use to a broad range of us.
TES is the first satellite instrument to provide simultaneous concentrations of carbon monoxide, ozone, water vapor and methane throughout Earth’s lower atmosphere. This lower atmosphere (the troposphere) is situated between the surface and the height at which aircraft fly, and is an important part of the atmosphere that we often impact with our activities.
The Expression Atlas provides information on gene expression patterns under different biological conditions such as a gene knock out, a plant treated with a compound, or in a particular organism part or cell. It includes both microarray and RNA-seq data. The data is re-analysed in-house to detect interesting expression patterns under the conditions of the original experiment. There are two components to the Expression Atlas, the Baseline Atlas and the Differential Atlas. The Baseline Atlas displays information about which gene products are present (and at what abundance) in "normal" conditions (e.g. tissue, cell type). It aims to answer questions such as "which genes are specifically expressed in human kidney?". This component of the Expression Atlas consists of highly-curated and quality-checked RNA-seq experiments from ArrayExpress. It has data for many different animal and plant species. New experiments are added as they become available. The Differential Atlas allows users to identify genes that are up- or down-regulated in a wide variety of different experimental conditions such as yeast mutants, cadmium treated plants, cystic fibrosis or the effect on gene expression of mind-body practice. Both microarray and RNA-seq experiments are included in the Differential Atlas. Experiments are selected from ArrayExpress and groups of samples are manually identified for comparison e.g. those with wild type genotype compared to those with a gene knock out. Each experiment is processed through our in-house differential expression statistical analysis pipeline to identify genes with a high probability of differential expression.
NASA’s Precipitation Measurement Missions – TRMM and GPM – provide advanced information on rain and snow characteristics and detailed three-dimensional knowledge of precipitation structure within the atmosphere, which help scientists study and understand Earth's water cycle, weather and climate.
GESDB is a platform for sharing simulation data and discussing simulation techniques for human genetic studies. The database contains simulation scripts, simulated data, and documentation from published manuscripts. The forum provides a platform for Q&A about the simulated data and for exchanging simulation ideas. GESDB aims to promote transparency and efficiency in simulation studies for human genetic studies.
The CiardRING is a global directory of web-based information services and datasets for agricultural research for development (ARD). It is the principal tool created through the CIARD initiative to allow information providers to register their services and datasets in various categories and so facilitate the discovery of sources of agriculture-related information across the world. The RING aims to provide an infrastructure to improve the accessibility of the outputs of agricultural research and of information relevant to agriculture.
This site provides information about the NIH MRI Study of Normal Brain Development (Pediatric MRI Study) and resulting Pediatric MRI Data Repository. This website serves as the portal through which data can be obtained by qualified researchers. The overarching goal of the Pediatric MRI Study is to foster a better understanding of normal brain maturation as a basis for understanding atypical brain development associated with a variety of disorders and diseases.
A database providing information on the structure of assembled genomes, assembly names and other meta-data, statistical reports, and links to genomic sequence data.