Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 356 result(s)
The CPTAC Data Portal is the centralized repository for the dissemination of proteomic data collected by the Proteome Characterization Centers (PCCs) for the CPTAC program. The portal also hosts analyses of the mass spectrometry data (mapping of spectra to peptide sequences and protein identification) from the PCCs and from a CPTAC-sponsored common data analysis pipeline (CDAP).
The Comparative Study of Electoral Systems (CSES) is a collaborative, cross-national program of comparative electoral behavior among over 60 election study teams from around the world. The CSES allows examination into how societal, political, economic and structural contexts shape citizen behavior and condition democratic choice; the nature of political and social divisions; and how citizens in different political systems evaluate democratic institutions and processes. Participating countries include a common module of survey questions in their post-election studies. The resulting data are deposited along with voting, demographic, district and macro variables. The studies are then merged into a single, free, public dataset for use in comparative study and cross-level analysis. The research agenda, questionnaires, and study design are developed by an international committee of leading scholars of electoral politics and political science. The design is implemented in each country by their foremost social scientists.
Measurements Of Pollution In The Troposphere (MOPITT) was launched into sun-synchronous polar orbit on December 18, 1999, aboard TERRA, a NASA satellite orbiting 705 km above the Earth. MOPITT monitors changes in pollution patterns and the effects on Earth’s troposphere. MOPITT uses near-infrared radiation at 2.3 µm and thermal-infrared radiation at 4.7 µm to calculate atmospheric profiles of CO.
GWAS Central (previously the Human Genome Variation database of Genotype-to-Phenotype information) is a database of summary level findings from genetic association studies, both large and small. We actively gather datasets from public domain projects, and encourage direct data submission from the community.
PeanutBase is a peanut community resource providing genetic, genomic, gene function, and germplasm data to support peanut breeding and molecular research. This includes molecular markers, genetic maps, QTL data, genome assemblies, germplasm records, and traits. Data is curated from literature and submitted directly by researchers. Funding for PeanutBase is provided by the Peanut Foundation with in-kind contributions from the USDA-ARS.
CPDB reports analyses of animal cancer tests used in support of cancer risk assessments for human. It was developed by the Carcinogenic Potency Project at the University of California, Berkeley and the Lawrence Berkeley National Laboratory. It includes 6,540 chronic, long-term animal cancer tests from the published literature as well as from the National Cancer Institute and the National Toxicology Program (NTP). Years of coverage: CPDB covers 1980 - 2011. It is no longer updated.
Content type(s)
IRIS contains data in support of human health risk assessment, including hazard identification and dose-response assessments. It is compiled by the U.S. EPA and contains descriptive and quantitative information related to human cancer and non-cancer health effects that may result from exposure to substances in the environment. IRIS data is reviewed by EPA scientists and represents EPA consensus.
The Sequence Read Archive stores the raw sequencing data from such sequencing platforms as the Roche 454 GS System, the Illumina Genome Analyzer, the Applied Biosystems SOLiD System, the Helicos Heliscope, and the Complete Genomics. It archives the sequencing data associated with RNA-Seq, ChIP-Seq, Genomic and Transcriptomic assemblies, and 16S ribosomal RNA data.
The Solar Dynamics Observatory (SDO) studies the solar atmosphere on small scales of space and time, in multiple wavelengths. This is a searchable database of all SDO data, including citizen scientist images, space weather and near real time data, and helioseismology data.
Kenya Open Data offers visualizations tools, data downloads, and easy access for software developers. Kenya Open Data provides core government development, demographic, statistical and expenditure data available for researchers, policymakers, developers and the general public. Kenya is the first developing country to have an open government data portal, the first in sub-Saharan Africa and second on the continent after Morocco. The initiative has been widely acclaimed globally as one of the most significant steps Kenya has made to improve governance and implement the new Constitution’s provisions on access to information.
Stemformatics is a collaboration between the stem cell and bioinformatics community. We were motivated by the plethora of exciting cell models in the public and private domains, and the realisation that for many biologists these were mostly inaccessible. We wanted a fast way to find and visualise interesting genes in these exemplar stem cell datasets. We'd like you to explore. You'll find data from leading stem cell laboratories in a format that is easy to search, easy to visualise and easy to export.
The Conserved Domain Database is a resource for the annotation of functional units in proteins. Its collection of domain models includes a set curated by NCBI, which utilizes 3D structure to provide insights into sequence/structure/function relationships
The BBS is a cooperative effort between the U.S. Geological Survey's Patuxent Wildlife Research Center and Environment Canada's Canadian Wildlife Service to monitor the status and trends of North American bird populations. Following a rigorous protocol, BBS data are collected by thousands of dedicated participants along thousands of randomly established roadside routes throughout the continent. Professional BBS coordinators and data managers work closely with researchers and statisticians to compile and deliver these population data and population trend analyses on more than 400 bird species, for use by conservation managers, scientists, and the general public.
The Mexican Health and Aging Study (MHAS) started as a prospective panel study of health and aging in Mexico. MHAS is nationally representative of the 13 million Mexicans born prior to 1951. The survey has national and urban/rural representation. The baseline survey, in 2001, included a nationally representative sample of Mexicans aged 50 and over and their spouse/partners regardless of their age. A direct interview was sought with each individual and proxy interviews were obtained when poor health or temporary absence precluded a direct interview. The sample was distributed in all 32 states of the country in urban and rural areas. Households in the six states which account for 40% of all migrants to the U.S. were over-sampled. A sub-sample was selected to obtain anthropometric measures.
The National Park Service Gaseous Pollutant Monitoring Program Database provides gaseous air pollutant and meteorological data as *.csv files. Queries allow filtering by location of ozone, wind speed, wind direction, ambient temperature, relative humidity, solar radiation, wetness data.
Earth System Research Laboratory (ESRL) Global Monitoring Division (GMD) provides data relating to climate change forces and models, ozone depletion and rehabilitation, and baseline air quality. Data are freely available so the public, policy makers, and scientists stay current with long-term atmospheric trends.
This website is a portal that enables access to multi-Terabyte turbulence databases. The data reside on several nodes and disks on our database cluster computer and are stored in small 3D subcubes. Positions are indexed using a Z-curve for efficient access.
CCRIS contains over 9,000 chemical records with carcinogenicity, mutagenicity, tumor promotion, and tumor inhibition test results. Data are derived from studies cited in primary journals, current awareness tools, NCI reports, and other special sources. Test results have been reviewed by experts in carcinogenesis and mutagenesis. >CCRIS provides historical information from the years 1985 - 2011. It is no longer updated.< CCRIS is accessible, free of charge, via TOXNET at: https://toxnet/
The Behavioral Risk Factor Surveillance System (BRFSS) is the world's largest, on-going telephone health survey system. As a result, surveys were developed and conducted to monitor state-level prevalence of the major behavioral risks among adults associated with premature morbidity and mortality. The basic philosophy was to collect data on actual behaviors, rather than on attitudes or knowledge, that would be especially useful for planning, initiating, supporting, and evaluating health promotion and disease prevention programs. Currently data are collected monthly in all 50 states.
The USGS Science Data Catalog includes records describing individual datasets, data collections, and observational or remotely-sensed data contained in national systems (rather than records about individual observations). The Catalog does not include USGS data for which there are currently no online access points.
ArkDB is a generic, species-independent database built to capture the state of published information on genome mapping in a given species. It stores details of references, markers and loci and genetic linkage and cytogenetic maps which can be drawn using the online map-drawing application. Data from linkage maps held within the ArkDB system can be drawn alongside their corresponding genome sequence maps (extracted from ENSEMBL).
The OFA databases are core to the organization’s objective of establishing control programs to lower the incidence of inherited disease. Responsible breeders have an inherent responsibility to breed healthy dogs. The OFA databases serve all breeds of dogs and cats, and provide breeders a means to respond to the challenge of improving the genetic health of their breed through better breeding practices. The testing methodology and the criteria for evaluating the test results for each database were independently established by veterinary scientists from their respective specialty areas, and the standards used are generally accepted throughout the world.
TPA is a database that contains sequences built from the existing primary sequence data in GenBank. TPA records are retrieved through the Nucleotide Database and feature information on the sequence, how it was cataloged, and proper way to cite the sequence information.