Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 121 result(s)
ArrayExpress is one of the major international repositories for high-throughput functional genomics data from both microarray and high-throughput sequencing studies, many of which are supported by peer-reviewed publications. Data sets are submitted directly to ArrayExpress and curated by a team of specialist biological curators. In the past (until 2018) datasets from the NCBI Gene Expression Omnibus database were imported on a weekly basis. Data is collected to MIAME and MINSEQE standards.
The Australian Data Archive (ADA) provides a national service for the collection and preservation of digital research data and to make these data available for secondary analysis by academic researchers and other users. Data are stored in seven sub-archives: Social Science, Historical, Indigenous, Longitudinal, Qualitative, Crime & Justice and International. Along with Australian data, ADA International is also a repository for studies by Australian researchers conducted in other countries, particularly throughout the Asia-Pacific region. The ADA International data catalogue includes links to studies from countries including New Zealand, Bangladesh, Cambodia, China, Indonesia, and several other countries. In 2017 the archive systems moved from the existing Nesstar platform to the new ADA Dataverse platform
The Avian Knowledge Network (AKN) is an international network of governmental and non-governmental institutions and individuals linking avian conservation, monitoring and science through efficient data management and coordinated development of useful solutions using best-science practices based on the data.
>>>!!!<<< Sorry.we are no longer in operation >>>!!!<<< The Beta Cell Biology Consortium (BCBC) was a team science initiative that was established by the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). It was initially funded in 2001 (RFA DK-01-014), and competitively continued both in 2005 (RFAs DK-01-17, DK-01-18) and in 2009 (RFA DK-09-011). Funding for the BCBC came to an end on August 1, 2015, and with it so did our ability to maintain active websites.!!! One of the many goals of the BCBC was to develop and maintain databases of useful research resources. A total of 813 different scientific resources were generated and submitted by BCBC investigators over the 14 years it existed. Information pertaining to 495 selected resources, judged to be the most scientifically-useful, has been converted into a static catalog, as shown below. In addition, the metadata for these 495 resources have been transferred to dkNET in the form of RDF descriptors, and all genomics data have been deposited to either ArrayExpress or GEO. Please direct questions or comments to the NIDDK Division of Diabetes, Endocrinology & Metabolic Diseases (DEM).
BIOS is a system designed to enable the management, visualization, and analysis of biogeographic data collected by the California Department of Fish and Wildlife and its partner organizations. BIOS integrates GIS, relational database management, and ESRI's ArcGIS Server technology to create a statewide, integrated information management tool that can be used on any computer with access to the Internet.
>>>!!!<<< caArray Retirement Announcement >>>!!!<<< The National Cancer Institute (NCI) Center for Biomedical Informatics and Information Technology (CBIIT) instance of the caArray database was retired on March 31st, 2015. All publicly-accessible caArray data and annotations will be archived and will remain available via FTP download and is also available at GEO . >>>!!!<<< While NCI will not be able to provide technical support for the caArray software after the retirement, the source code is available on GitHub , and we encourage continued community development. Molecular Analysis of Brain Neoplasia (Rembrandt fine-00037) gene expression data has been loaded into ArrayExpress: >>>!!!<<< caArray is an open-source, web and programmatically accessible microarray data management system that supports the annotation of microarray data using MAGE-TAB and web-based forms. Data and annotations may be kept private to the owner, shared with user-defined collaboration groups, or made public. The NCI instance of caArray hosts many cancer-related public datasets available for download.
The DMC is designed to provide registered users with access to non-confidential petroleum exploration and production data from offshore Nova Scotia, subject to certain conditions. The DMC is housed in the CNSOPB's Geoscience Research Centre located in Dartmouth, Nova Scotia. Initially, the DMC will manage and distribute the following digital petroleum data: well data (i.e. logs and reports), seismic image files (e.g. TIFF, PDF), and production data. In the future the DMC could be expanded to include operational, safety, environmental, fisheries data, etc.
The CSSDP project provides space scientists with access to a wide range of space data, observations, and investigative tools. It provides a seamless, single- point of access to these resources through a custom web portal. To date, more than 350 scientists are registered users of the CSSDP portal. The project integrates data from sources such as the Canadian Geospace Monitoring Program and anticipates serving data from the NASA THEMIS satellite probes, the Canadian High-Artic Ionospheric Network (CHAIN), and the Alberta- based Enhanced Polar Outflow Probe (ePOP) satellite mission. This collection and presentation of space data is used to study the influence of the sun on near- Earth space environment, including phenomena such as geomagnetic storms, which cause the northern and southern lights. Geomagnetic storms are also known for often causing power outages, disturbances in polar communications, and the failure of satellites. The effects of space weather can also cause transpolar flight paths to be diverted, adding significant fuel costs to airlines and disruptions for travellers.
-----<<<<< The repository is no longer available. This record is out-dated. The Matter lab provides the archived database version of 2012 and 2013 at Data linked from the World Community Grid - The Clean Energy Project see at and on fighshare >>>>>----- The Clean Energy Project Database (CEPDB) is a massive reference database for organic semiconductors with a particular emphasis on photovoltaic applications. It was created to store and provide access to data from computational as well as experimental studies, on both known and virtual compounds. It is a free and open resource designed to support researchers in the field of organic electronics in their scientific pursuits. The CEPDB was established as part of the Harvard Clean Energy Project (CEP), a virtual high-throughput screening initiative to identify promising new candidates for the next generation of carbon-based solar cell materials.
The CGIAR Research Program No. 6 (CRP6): Forests, Trees and Agroforestry: Livelihoods, Landscapes and Governance aims to enhance the management and use of forests, agroforestry and tree genetic resources across the landscape, from farms to forests.
The ChemBio Hub vision is to provide the tools that will make it easier for Oxford University scientists to connect with colleagues to improve their research, to satisfy funders that the data they have paid for is being managed according to their policies, and to make new alliances with pharma and biotech partners. Funding and development of the ChemBio Hub was ending on the 30th June 2016. Please be reassured that the ChemBio Hub system and all your data will continue to be secured on the SGC servers for the foreseeable future. You can continue to use the services as normal. More information see:
This interface provides access to several types of data related to the Chesapeake Bay. Bay Program databases can be queried based upon user-defined inputs such as geographic region and date range. Each query results in a downloadable, tab- or comma-delimited text file that can be imported to any program (e.g., SAS, Excel, Access) for further analysis. Comments regarding the interface are encouraged. Questions in reference to the data should be addressed to the contact provided on subsequent pages.
Child Care & Early Education Research Connections promotes high quality research in child care and early education and the use of that research in policy making. Our vision is that children are well cared for and have rich learning experiences, and their families are supported and able to work. Through this Web site, we offer research and data resources for researchers, policy makers, practitioners, and others.
The repository is part of the eScience infrastructure of the University of Tübingen, which is a core facility that strongly cooperates with the library and computing center of the university. Integration of the repository into the national CLARIN-D and international CLARIN infrastructures gives it wide exposure, increasing the likelihood that the resources will be used and further developed beyond the lifetime of the projects in which they were developed. Among the resources currently available in the Tübingen Center Repository, researchers can find widely used treebanks of German (e.g. TüBa-D/Z), the German wordnet (GermaNet), the first manually annotated digital treebank (Index Thomisticus), as well as descriptions of the tools used by the WebLicht ecosystem for natural language processing.
The CLARIN Centre at the University of Copenhagen, Denmark, hosts and manages a data repository (CLARIN-DK-UCPH Repository), which is part of a research infrastructure for humanities and social sciences financed by the University of Copenhagen, and a part of the national infrastructure collaboration DIGHUMLAB in Denmark. The CLARIN-DK-UCPH Repository provides easy and sustainable access for scholars in the humanities and social sciences to digital language data (in written, spoken, video or multimodal form) and provides advanced tools for discovering, exploring, exploiting, annotating, and analyzing data. CLARIN-DK also shares knowledge on Danish language technology and resources and is the Danish node in the European Research Infrastructure Consortium, CLARIN ERIC.
clarin:el is the Greek national network of language resources, a nation-wide Research Infrastructure devoted to the sustainable storage, sharing, dissemination and preservation of language resources. CLARIN EL infrastructure, which is a Greek nation-wide Research Infrastructure devoted to the sustainable storage, sharing, dissemination and preservation of language resources (LRs) and aims at increasing access to and augmentation of such resources at a national scale and beyond. It is an open, integrated, secure and interoperable storage, sharing and processing infrastructure for LRs (datasets, tools and processing services) for all domains domains and disciplines where language plays a critical role, notably. CLARIN EL is implemented in the framework of the CLARIN Attiki, national project in support of ESFRI/2006 Research Infrastructures.
CODEX is a database of NGS mouse and human experiments. Although, the main focus of CODEX is Haematopoiesis and Embryonic systems, the database includes a large variety of cell types. In addition to the publically available data, CODEX also includes a private site hosting non-published data. CODEX provides access to processed and curated NGS experiments. To use CODEX: (i) select a specialized repository (HAEMCODE or ESCODE) or choose the whole compendium (CODEX), then (ii) filter by organism and (iii) choose how to explore the database.
World Data System for Cold and Arid Regions(CARD) is a new scientific data sharing system which is established on the basis of the former World Data Center for Glaciology and Geocryology, Lanzhou and other data centers hosted by Cold and Arid Regions Environmental and Engineering Research Institute, Chinese Academy of Sciences. World Data System for Cold and Arid Regions is one of the constituents of World Data System. The data sharing system's main goals are to collect, manage and store the scientific data of Cold and Arid Regions area in China and provide the services for the scientific research of Cold and Arid Regions.
The NSF-supported Program serves the international scientific community through research, infrastructure, data, and models. We focus on how components of the Critical Zone interact, shape Earth's surface, and support life. ARCHIVED CONTENT: In December 2020, the CZO program was succeeded by the Critical Zone Collaborative Network (CZ Net)
Research on German and European financial markets suffers from a lack of pan-European data sets. Also, existing sets do not provide a standard identification of, for example, companies. Therefore, researchers often utilize data from the United States where the integration of different databases is more advanced. As a consequence, empirical analyses are mostly based on non-European data. Because of the institutional differences, political recommendations that result from these analyses cannot – or only in a limited scope – be transferred to Europe. Against this background, the SAFE Research Data Center not only draws on the usual international data sources but also creates new European data sets, brings existing data together and processes them. The aim is to place the central research areas of SAFE on a common European data footing. Data access is provided by 'SAFE data sources' and 'FiF - Repositorium für Forschungsdaten aus dem Finanzbereich (Preview version)'
INRAE is the world’s first organisation specialized on agricultural, food and environmental sciences. Data INRAE is offered by INRAE as part of its mission to open the results of its research. Data INRAE will share research data in relation with food, nutrition, agriculture and environment. It includes experimental, simulation and observation data, omic data, survey and text data. Only data produced by or in collaboration with INRAE will be hosted in the repository, but anyone can access the metadata and the open data.
The Government is releasing public data to help people understand how government works and how policies are made. Some of this data is already available, but brings it together in one searchable website. Making this data easily available means it will be easier for people to make decisions and suggestions about government policies based on detailed information.
A data repository and social network so that researchers can interact and collaborate, also offers tutorials and datasets for data science learning. " is designed for data and the people who work with data. From professional projects to open data, helps you host and share your data, collaborate with your team, and capture context and conclusions as you work."