  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 415 result(s)
<<<!!!<<< The ZACAT server is end-of-life (EOL) and has been taken offline. The software driving the portal has been unmaintained for several years and could no longer be reasonably sustained. We have expanded to include information on the studies' variable level where available, which is a superset of the studies in ZACAT. Please use the variable search on to identify and download datasets. >>>!!!>>>
The India Water Portal is a web-based platform for sharing water management knowledge in India amongst practitioners and the general public. The included datasets can be browsed by data type, location, time, and other metadata. Data include rainfall, watersheds, groundwater, water quality, and irrigation.
The Swiss Institute of Bioinformatics (SIB) coordinates research and education in bioinformatics throughout Switzerland and provides bioinformatics services to the national and international research community. ExPASy gives access to numerous repositories and databases of SIB. Examples include arrayMap, MetaNetX, SWISS-MODEL, World-2DPAGE, and many others.
GovData, the data portal for Germany, offers consistent and central access to administrative data at the federal, state, and local level. The objective is to make data more available and easier to use at a single location. As set out in the concept of "open data", we attempt to facilitate the use of open licenses and to increase the supply of machine-readable raw data.
>>>!!!<<< the repository is offline >>>!!!<<< NARSTO is dedicated to improving management of air quality in North America. Additionally, NARSTO is working to improve collaboration between the air-quality and health-sciences research communities, to advance understanding of the scientific issues involved in effecting a multi-pollutant/multi-media approach to air quality management, and to increase understanding of the linkages between air quality and climate change. NARSTO is represented by private and public organizations in Canada, Mexico, and the United States. NARSTO was terminated as of December 31, 2010. While data remain available via the original NARSTO Data Archive, the permanent data archive is maintained by the NASA Langley Research Center Atmospheric Science Data Center
>>>!!!<<< The repository is no longer available. >>>!!!<<< The eagle-i National Network and eagle-i resource search were retired on November 4, 2021. Groundbreaking biomedical research requires access to cutting-edge scientific resources; however, such resources are often invisible beyond the laboratories or universities where they were developed. eagle-i is a discovery platform that helps biomedical scientists find previously invisible, but highly valuable, resources.
!!! >>> expired <<< !!!! Intrepid Bioinformatics serves as a community for genetic researchers and scientific programmers who need to achieve meaningful use of their genetic research data – but can’t spend tremendous amounts of time or money in the process. The Intrepid Bioinformatics system automates time consuming manual processes, shortens workflow, and eliminates the threat of lost data in a faster, cheaper, and better environment than existing solutions. The system also provides the functionality and community features needed to analyze the large volumes of Next Generation Sequencing and Single Nucleotide Polymorphism data, which is generated for a wide range of purposes from disease tracking and animal breeding to medical diagnosis and treatment.
BSRN is a project of the Radiation Panel (now the Data and Assessment Panel) from the Global Energy and Water Cycle Experiment (GEWEX) under the umbrella of the World Climate Research Programme (WCRP). It is the global baseline network for surface radiation for the Global Climate Observing System (GCOS), contributing to the Global Atmosphere Watch (GAW), and forming a cooperative network with the Network for the Detection of Atmospheric Composition Change (NDACC).
The Northern California Earthquake Data Center (NCEDC) is a permanent archive and distribution center primarily for multiple types of digital data relating to earthquakes in central and northern California. The NCEDC is located at the Berkeley Seismological Laboratory, and has been accessible to users via the Internet since mid-1992. The NCEDC was formed as a joint project of the Berkeley Seismological Laboratory (BSL) and the U.S. Geological Survey (USGS) at Menlo Park in 1991, and current USGS funding is provided under a cooperative agreement for seismic network operations.
The tree of life links all biodiversity through a shared evolutionary history. This project will produce the first online, comprehensive first-draft tree of all 1.8 million named species, accessible to both the public and scientific communities. Assembly of the tree will incorporate previously-published results, with strong collaborations between computational and empirical biologists to develop, test and improve methods of data synthesis. This initial tree of life will not be static; instead, we will develop tools for scientists to update and revise the tree as new data come in. Early release of the tree and tools will motivate data sharing and facilitate ongoing synthesis of knowledge.
The nationally recognized National Cancer Database (NCDB)—jointly sponsored by the American College of Surgeons and the American Cancer Society—is a clinical oncology database sourced from hospital registry data that are collected in more than 1,500 Commission on Cancer (CoC)-accredited facilities. NCDB data are used to analyze and track patients with malignant neoplastic diseases, their treatments, and outcomes. Data represent more than 70 percent of newly diagnosed cancer cases nationwide and more than 34 million historical records.
The Supreme Court Database is the definitive source for researchers, students, journalists, and citizens interested in the U.S. Supreme Court. The Database contains over two hundred pieces of information about each case decided by the Court between the 1791 and 2015 terms. Examples include the identity of the court whose decision the Supreme Court reviewed, the parties to the suit, the legal provisions considered in the case, and the votes of the Justices. The project started with Spaeth's original database. The analysis tools allow you to select and summarize cases from the Modern or Legacy Database based on your needs.
The OpenMadrigal project seeks to develop and support an on-line database for geospace data. The project has been led by MIT Haystack Observatory since 1980, but now has active support from Jicamarca Observatory and other community members. Madrigal is a robust, World Wide Web based system capable of managing and serving archival and real-time data, in a variety of formats, from a wide range of ground-based instruments. Madrigal is installed at a number of sites around the world. Data at each Madrigal site is locally controlled and can be updated at any time, but shared metadata between Madrigal sites allow searching of all Madrigal sites at once from any Madrigal site. Data is local; metadata is shared.
The Edition Topoi research platform is an innovative, reliable information infrastructure. It serves the publication of citable research data such as 3D models, high-resolution pictures, data and databases. The content and its metadata are subject to peer review and made available on an Open Access basis. The published or publishable combination of citable research content and its technical and contextually relevant metadata is defined as Citable. The public data are generated via a cloud and can be directly connected with the individual computing environment.
NASA Life Sciences Portal is the next generation of the Life Sciences Data Archive for Human, Animal and Plant Research. NASA's Human Research Program (HRP) conducts research and develops technologies that allow humans to travel safely and productively in space. The Program uses evidence from data collected on astronauts, as well as other supporting studies. These data are stored in the research data repository, Life Sciences Data Archive (LSDA).
This portal is Luxembourg's central and official platform for data from the public sector, research institutes, and the private sector.
The data in the U of M’s Clinical Data Repository comes from the electronic health records (EHRs) of more than 2 million patients seen at 8 hospitals and more than 40 clinics. For each patient, data is available regarding the patient's demographics (age, gender, language, etc.), medical history, problem list, allergies, immunizations, outpatient vitals, diagnoses, procedures, medications, lab tests, visit locations, providers, provider specialties, and more.
NONCODE is an integrated knowledge database dedicated to non-coding RNAs (excluding tRNAs and rRNAs). There are now 16 species in NONCODE (human, mouse, cow, rat, chicken, fruit fly, zebrafish, C. elegans, yeast, Arabidopsis, chimpanzee, gorilla, orangutan, rhesus macaque, opossum and platypus). The sources of NONCODE include literature and other public databases. We searched PubMed using the keywords ‘ncrna’, ‘noncoding’, ‘non-coding’, ‘no code’, ‘non-code’, ‘lncrna’ or ‘lincrna’. We retrieved the newly identified lncRNAs and their annotation from the Supplementary Material or web site of these articles. These, together with the newest data from Ensembl, RefSeq, lncRNAdb and GENCODE, were processed through a standard pipeline for each species.
The GeoPortal.rlp allows the central search and visualization of geodata. Within the geodata infrastructure of Rhineland-Palatinate, the GeoPortal.rlp has the central task of acting as a service-oriented exchange between users and providers of geodata. The GeoPortal.rlp establishes access to geodata over the electronic network. The GeoPortal.rlp first went online on January 8, 2007, and the site was relaunched on February 2, 2011.
The JenAge Ageing Factor Database AgeFactDB is aimed at the collection and integration of ageing phenotype and lifespan data. Ageing factors are genes, chemical compounds or other factors such as dietary restriction, for example. In a first step ageing-related data are primarily taken from existing databases. In addition, new ageing-related information is included both by manual and automatic information extraction from the scientific literature. Based on a homology analysis, AgeFactDB also includes genes that are homologous to known ageing-related genes. These homologs are considered as candidate or putative ageing-related genes.
>>>!!!<<< 2019-01: Global Land Cover Facility goes offline see ; no more access to >>>!!!<<< The Global Land Cover Facility (GLCF) provides earth science data and products to help everyone to better understand global environmental systems. In particular, the GLCF develops and distributes remotely sensed satellite data and products that explain land cover from the local to global scales.
<<<!!!<<< The website closed in January 2015. >>>!!!>>> All GeoBase products are available on the Open Government of Canada portal. The GeoBase initiative provides geospatial data of the entire Canadian landmass for government, business, and/or personal assessments of sustainable resource development, public safety, sanitation, and environmental protection. Data is available for download as ESRI Shapefile, FGDB, KML, and GML.
The TRR228DB is the project database of the Collaborative Research Centre 228 "Future Rural Africa: Future-making and social-ecological transformation" (CRC/Transregio 228, funded by the German Research Foundation (DFG), project number 328966760). The project database is a new implementation of the TR32DB and has been online since 2018. It handles all data, including metadata, created by the involved project participants from several institutions (e.g. Universities of Cologne and Bonn) and research fields (e.g. anthropology, agroeconomics, ecology, ethnology, geography, politics and soil sciences). The data result from several field campaigns, interviews, surveys, remote sensing, laboratory studies and modelling approaches. Furthermore, outputs of the scientists such as publications, conference contributions, PhD reports and corresponding images are collected.
GEOFON seeks to facilitate cooperation in seismological research and earthquake and tsunami hazard mitigation by providing rapid transnational access to seismological data and source parameters of large earthquakes, and keeping these data accessible in the long term. It pursues these aims by operating and maintaining a global network of permanent broadband stations in cooperation with local partners, facilitating real-time access to data from this network and those of many partner networks and plate boundary observatories, and providing a permanent and secure archive for seismological data. It also archives and makes accessible data from temporary experiments carried out by scientists at German universities and institutions, thereby fostering cooperation, encouraging the full exploitation of all acquired data, and serving as the permanent archive for the Geophysical Instrument Pool at Potsdam (GIPP). It also organises the exchange of real-time and archived data with partner institutions and international centres.
The Geoscience Data Repository (GDR) is a collection of Earth Sciences Sector geoscience databases that is managed and accessed by a series of Information Services (GDRIS). This site allows you to discover, view and download information using these services. About 27 data resources are listed and many are also listed in the GeoConnections Discovery Portal.