Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 3052 result(s)
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer). Software for searching the transcription files is currently being written.
DEG hosts records of currently available essential genomic elements, such as protein-coding genes and non-coding RNAs, among bacteria, archaea and eukaryotes. Essential genes in a bacterium constitute a minimal genome, forming a set of functional modules, which play key roles in the emerging field, synthetic biology.
Species included in PlantTFDB 4.0 covers the main lineages of green plants. Therefore, PlantTFDB provides genomic TF repertoires across Viridiplantae. To provide comprehensive information for the TF family, a brief introduction and key references are presented for each family. Comprehensive annotations are made for each identified TF, including functional domains, 3D structures, gene ontology (GO), plant ontology (PO), expression information, expert-curated functional description, regulation information, interaction, conserved elements, references, and annotations in various databases such as UniProt, RefSeq, TransFac, STRING, and VISTA. By inferring orthologous groups and constructing phylogenetic trees, evolutionary relationships among identified TFs were inferred. In addition, PlantTFDB has a simple and user-friendly interface to allow users to query based on combined conditions or make sequence similarity search using BLAST.
Dataverse to host followup observations of galaxy clusters identified in South Pole Telescope SZ Surveys. This includes: 1) GMOS spectroscopy of low to moderate redshift galaxy clusters taken as a part of NOAO Large Survey Program 11A-0034 (PI: Christopher Stubbs).
The Diabetes Study of Northern California (DISTANCE) conducts epidemiological and health services research in diabetes among a large, multiethnic cohort of patients in a large, integrated health care delivery system.
IATI is a voluntary, multi-stakeholder initiative that seeks to improve the transparency of aid, development, and humanitarian resources in order to increase their effectiveness in tackling poverty. IATI brings together donor and recipient countries, civil society organisations, and other experts in aid information who are committed to working together to increase the transparency and openness of aid. - See more at:
The Ontology Lookup Service (OLS) is a repository for biomedical ontologies that aims to provide a single point of access to the latest ontology versions. The user can browse the ontologies through the website as well as programmatically via the OLS API. The OLS provides a web service interface to query multiple ontologies from a single location with a unified output format.The OLS can integrate any ontology available in the Open Biomedical Ontology (OBO) format. The OLS is an open source project hosted on Google Code.
The repository contains the complete model of the Bern campaign; only the upper part of the vault could not be measured due to renovation works carried out on the dome at the time of the campaign.
The Spiral Digital Repository is the Imperial College London institutional open access repository. This system allows you, as an author, to make your research documents open access without incurring additional publication costs. When you self-archive a research document in Spiral it becomes free for anyone to read. You can upload copies of your publications to Spiral using Symplectic Elements. All deposited content becomes searchable online.
The Human Genetic Variation Database (HGVD) aims to provide a central resource to archive and display Japanese genetic variation and association between the variation and transcription level of genes. The database currently contains genetic variations determined by exome sequencing of 1,208 individuals and genotyping data of common variations obtained from a cohort of 3,248 individuals.
The German Neuroinformatics Node's data infrastructure (GIN) services provide a platform for comprehensive and reproducible management and sharing of neuroscience data. Building on well established versioning technology, GIN offers the power of a web based repository management service combined with a distributed file storage. The service addresses the range of research data workflows starting from data analysis on the local workstation to remote collaboration and data publication.
Online materials database (known as PAULING FILE project) with nearly 2 million entries: physical properties, crystal structures, phase diagrams, available via API, ready for modern data-intensive applications. The source of these entries are about 300,000 peer-reviewed publications in materials science, processed during the last 16 years by an international team of PhD editors. The results are presented online with a quick search interface. The basic access is provided for free.
----<<<<< This repository is no longer available. This record is out dated >>>>>----- The aim of FlyReactome, based in the Department of Genetics, University of Cambridge, is to develop a curated repository for Drosophila melanogaster pathways and reactions. The information in this database is authored by biological researchers with expertise in their fields, maintained by the FlyReactome staff.
Global Change Research Data Publishing and Repository (GCdataPR) is an open data infrastructure on earth science, particular on the global environmental changes. The GCdataPR’ management policies following the international common understanding to the data sharing principles and guidelines is the key to make the qualified data publishing and sharing smoothly and successfully. The data management policies including dataset submission for publishing policy, peer review policy data quality control policy data long-term preservation policy, data sharing policy, 10% rule for identify original dataset policy, claim discovery with both data and paper policy, and data service statistics policy.
Content type(s)
!!!This site has been decomissioned!!!! The Geographic Information Support Team (GIST) Repository at the University of Georgia is a USAID-funded global archive of spatial data collected and distributed for the greater humanitarian community. If you want to search for data, you will need a valid email address to create an account.
The data in the U of M’s Clinical Data Repository comes from the electronic health records (EHRs) of more than 2 million patients seen at 8 hospitals and more than 40 clinics. For each patient, data is available regarding the patient's demographics (age, gender, language, etc.), medical history, problem list, allergies, immunizations, outpatient vitals, diagnoses, procedures, medications, lab tests, visit locations, providers, provider specialties, and more.
The Durham High Energy Physics Database (HEPData), formerly: the Durham HEPData Project, has been built up over the past four decades as a unique open-access repository for scattering data from experimental particle physics. It currently comprises the data points from plots and tables related to several thousand publications including those from the Large Hadron Collider (LHC). The Durham HepData Project has for more than 25 years compiled the Reactions Database containing what can be loosly described as cross sections from HEP scattering experiments. The data comprise total and differential cross sections, structure functions, fragmentation functions, distributions of jet measures, polarisations, etc... from a wide range of interactions. In the new HEPData site (, you can explore new functionalities for data providers and data consumers, as well as the submission interface. HEPData is operated by CERN and IPPP at Durham University and is based on the digital library framework Invenio.
Search and access 201 data sets covering the Atmosphere, Ocean, Land and more. Explore climate indices, reanalyses and satellite data and understand their application to climate model metrics. This is the only data portal that combines data discovery, metadata, figures and world-class expertise on the strengths, limitations and applications of climate data.
GBIF is an international organisation that is working to make the world's biodiversity data accessible everywhere in the world. GBIF and its many partners work to mobilize the data, and to improve search mechanisms, data and metadata standards, web services, and the other components of an Internet-based information infrastructure for biodiversity. GBIF makes available data that are shared by hundreds of data publishers from around the world. These data are shared according to the GBIF Data Use Agreement, which includes the provision that users of any data accessed through or retrieved via the GBIF Portal will always give credit to the original data publishers.
ResearchWorks Archive is the University of Washington’s digital repository (also known as “institutional repository”) for disseminating and preserving scholarly work. ResearchWorks Archive can accept any digital file format or content (examples include numerical datasets, photographs and diagrams, working papers, technical reports, pre-prints and post-prints of published articles).
The Canadian Opinion Research Archive at Queen's University makes available commercial and independent surveys to the academic, research and journalistic communities. Founded in 1992, CORA contains hundreds of surveys including thousands of discrete items collected by major commercial Canadian firms dating back to the 1970s. CORA is continually adding new surveys and is always soliciting new data from commercial research firms, independent think tanks, research institutes, NGOs, and academic researchers. This website also includes readily accessible results from these surveys, tracking Canadian opinion over time on frequently asked survey questions, as well as tabular results from recent Canadian surveys, and more general information on polling. This material is made available as a public service by CORA and its partners.
The repository is no longer available >>>!!!<<< 2020-02-21: no more access to "Environment Climate Data Sweden" >>>!!!<<< The transfer of records from the Environment Climate Data Sweden (ECDS) database to the Swedish National Dataservice (SND) was completed in 2019. SND is a national research infrastructure with a primary function to support the accessibility, preservation, and re-use of research data and related materials. You can search the SND research data portal specifically for Natural Science or Agricultural and Veterinary Sciences datasets. Data descriptions with associated datasets, or a direct reference/URL to data, have been migrated from the ECDS portal to the SND research data portal. Previous links to these data are now automatically directed to an SND catalogue entry. Records in the ECDS catalogue that only contained metadata (ie information that data could be accessed through another portal, e.g. Pangea), now link directly to the portal in question. If you want to make one of those data descriptions searchable in SND’s catalogue, please contact SND on A small number of records were neither migrated to SND nor redirected to external providers, and they redirect. Contact SND on if you want more information about the closing of the ECDS portal and the migration of data descriptions to SND’s research data catalogue.
LEPR is a database of results of published experimental studies involving liquid-solid phase equilibria relevant to natural magmatic systems. TraceDs is a database of experimental studies involving trace element distribution between liquid, solid and fluid phases.