Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 185 result(s)
The Alternative Fuels Data Center (AFDC) is a comprehensive clearinghouse of information about advanced transportation technologies. The AFDC offers transportation decision makers unbiased information, data, and tools related to the deployment of alternative fuels and advanced vehicles. The AFDC launched in 1991 in response to the Alternative Motor Fuels Act of 1988 and the Clean Air Act Amendments of 1990. It originally served as a repository for alternative fuel performance data. The AFDC has since evolved to offer a broad array of information resources that support efforts to reduce petroleum use in transportation. The AFDC serves Clean Cities stakeholders, fleets regulated by the Energy Policy Act, businesses, policymakers, government agencies, and the general public.
Public Data Sets on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge to their users. Anyone can access these data sets from their Amazon Elastic Compute Cloud (Amazon EC2) instances and start computing on the data within minutes. Users can also leverage the entire AWS ecosystem and easily collaborate with other AWS users.
US Department of Energy’s Atmospheric Radiation Measurement (ARM) Data Center is a long-term archive and distribution facility for various ground-based, aerial and model data products in support of atmospheric and climate research. ARM facility currently operates over 400 instruments at various observatories ( ARM Data Center (ADC) Archive currently holds over 11,000 data products with a total holding of over 1.5 petabytes of data that dates back to 1993, these include data from instruments, value added products, model outputs, field campaign and PI contributed data. The data center archive also includes data collected by ARM from related program (e.g., external data such as NASA satellite).
The Artstor Digital Library is a nonprofit resource that provides more than one million digital images in the arts, architecture, humanities, and sciences with an accessible suite of software tools for teaching and research. Its community-built collections comprise contributions from outstanding international museums, photographers, libraries, scholars, photo archives, and artists and artists' estates.
The Australian Data Archive (ADA) provides a national service for the collection and preservation of digital research data and to make these data available for secondary analysis by academic researchers and other users. Data are stored in seven sub-archives: Social Science, Historical, Indigenous, Longitudinal, Qualitative, Crime & Justice and International. Along with Australian data, ADA International is also a repository for studies by Australian researchers conducted in other countries, particularly throughout the Asia-Pacific region. The ADA International data catalogue includes links to studies from countries including New Zealand, Bangladesh, Cambodia, China, Indonesia, and several other countries. In 2017 the archive systems moved from the existing Nesstar platform to the new ADA Dataverse platform
The BioCyc database collection of Pathway/Genome Databases (PGDBs) provides a reference on the genomes and metabolic pathways of thousands of sequenced organisms. BioCyc PGDBs are generated by software that predict the metabolic pathways of completely sequenced organisms, predict which genes code for missing enzymes in metabolic pathways, and predict operons. BioCyc also integrates information from other bioinformatics databases, such as protein feature and Gene Ontology information from UniProt. The BioCyc website provides a suite of software tools for database searching and visualization, for omics data analysis, and for comparative genomics and comparative pathway questions. From 2016 on, access to the EcoCyc and MetaCyc databases will remain free. Subscriptions to the other 7,600 BioCyc databases will be available to institutions (e.g., libraries), and to individuals. Access to licensed databases via: .
BeiDare2 is currently at beta version. All new users should try the new service as we no longer provide training for the classic BioDare. - BioDare stands for Biological Data Repository, its main focus is data from circadian experiments. BioDare is an online facility to share, store, analyse and disseminate timeseries data, focussing on circadian clock data, with browser and web service interfaces. Toolbox features include an improved, speedier FFT-NLLs routine and ROBuST’s Spectrum Resampling tool that will analyse rhythmic time series data.
The BCDC serves the research data obtained, and the data syntheses assembled, by researchers within the Bjerknes Centre for Climate Research. Furthermore it is open for all interested scientists independent of institution. All data from the different disciplines (e.g. geology, oceanography, biology, model community) will be archived in a long-term repository, interconnected and made publicly available by the BCDC. BCDC has collaborations with many international data repositories and actively archives metadata and data at those ensuring quality and FAIRness. BCDC has it's main focus on services for data management for external and internal funded projects in the field of climate research, provides data management plans and ensures that data is archived accordingly according to the best practices in the field. The data management services rank from project work for small external funded project to top-of-the-art data management services for research infrastructures on the ESFRI roadmap (e.g. RI ICOS – Integrated Carbon Observation System) and for provides products and services for Copernicus Marine Environmental Monitoring Services. In addition BCDC is advising various communities on data management services e.g. IOC UNESCO, OECD, IAEA and various funding agencies. BCDC will become an Associated Data Unit (ADU) under IODE, International Oceanographic Data and Information Exchange, a worldwide network that operates under the auspices of the Intergovernmental Oceanographic Commission of UNESCO and aims at becoming a part of ICSU World Data System.
We are developing an open, online platform to provide a seamless access to cloud computing infrastructure and brain data and data derivatives. This platform is meant to reach out beyond neuroscience, allowing also computer scientists, statisticians and engineers interested in brain data to use the data to develop and publish their methods. Brain Life is a project under active development. We currently offer several cloud computing services – also called Brain Life Applications. Sixty-six collaborators from global scientific communities contribute to the project by providing data, applications, technology and products to advance understanding the human brain., launched in May 2005, is an interactive multiresolution next-generation brain atlas that is based on over 20 million megapixels of sub-micron resolution, annotated, scanned images of serial sections of both primate and non-primate brains and that is integrated with a high-speed database for querying and retrieving data about brain structure and function over the internet. Currently featured are complete brain atlas datasets for various species, including Macaca mulatta, Chlorocebus aethiops, Felis catus, Mus musculus, Rattus norvegicus, and Tyto alba.
The Cancer Cell Line Encyclopedia project is a collaboration between the Broad Institute, and the Novartis Institutes for Biomedical Research and its Genomics Institute of the Novartis Research Foundation to conduct a detailed genetic and pharmacologic characterization of a large panel of human cancer models, to develop integrated computational analyses that link distinct pharmacologic vulnerabilities to genomic patterns and to translate cell line integrative genomics into cancer patient stratification. The CCLE provides public access to genomic data, analysis and visualization for about 1000 cell lines.
BSRN is a project of the Radiation Panel (now the Data and Assessment Panel) from the Global Energy and Water Cycle Experiment (GEWEX) under the umbrella of the World Climate Research Programme (WCRP). It is the global baseline network for surface radiation for the Global limate Observing System (GCOS), contributing to the Global Atmospheric Watch (GAW), and forming a ooperative network with the Network for the Detection of Atmospheric Composition Change NDACC).
BsubCyc is a model-organism database for the bacterium Bacillus subtilis and is based on the updated B. subtilis 168 genome sequence and annotation published by Barbe et al. in 2009. Gene function annotations are being updated when new literature is available. Subscriptions are now required to access BsubCyc. For more information on obtaining a subscription, click here:
The Cooperative Association for Internet Data Analysis (CAIDA) is a collaborative undertaking among organizations in the commercial, government, and research sectors aimed at promoting greater cooperation in the engineering and maintenance of a robust, scalable global Internet infrastructure.It is an independent analysis and research group with particular focus on: Collection, curation, analysis, visualization, dissemination of sets of the best available Internet data, providing macroscopic insight into the behavior of Internet infrastructure worldwide, improving the integrity of the field of Internet science, improving the integrity of operational Internet measurement and management, informing science, technology, and communications public policies.
The Canadian Astronomy Data Centre (CADC) was established in 1986 by the National Research Council of Canada (NRC), through a grant provided by the Canadian Space Agency (CSA), as one of three world-wide distribution centres for astronomical data obtained with the Hubble Space Telescope (HST). Located at NRC Herzberg in Victoria, BC, the CADC staff consists of professional astronomers and software developers who have developed an abundance of other sophisticated tools to support and enhance the research efforts of Canadian (and international) astronomers. The CADC specializes in data mining, data processing, data distribution and data transferring of very large astronomical datasets. In 2012, the Canadian Astronomy Data Centre (CADC) delivered over 1.6 million individual files, comprising over 117TB of data and served data to roughly 2000 professional astronomers.
The Canadian Open Genetics Repository is a collaborative effort for the collection, storage, sharing and robust analysis of variants reported by medical diagnostics laboratories across Canada. As clinical laboratories adopt modern genomics technologies, the need for this type of collaborative framework is increasingly important. If you want to join COGR project and get data please send an email at and the introduction to the project will be arranged.
The NCI’s Cancer Genome Anatomy Project (CGAP) is an online resource designed to provide the scientific community with detailed characterization of gene expression in biological tissues. By characterizing normal, pre-cancer and cancer cells, CGAP aims to improve detection, diagnosis and treatment for the patient. Moreover, CGAP provides access to cDNA clones to the research community through a variety of distributors. CGAP provides a wide range of genomic data and resources
>>> !!!!! The Portal is no longer available. !!!! >>> The CARMEN pilot project seeks to create a virtual laboratory for experimental neurophysiology, enabling the sharing and collaborative exploitation of data, analysis code and expertise. This study by the DCC contributes to an understanding of the data curation requirements of the eScience community, through its extended observation of the CARMEN neurophysiology community’s specification and selection of solutions for the organisation, access and curation of digital research output.
The goal of the Center of Estonian Language Resources (CELR) is to create and manage an infrastructure to make the Estonian language digital resources (dictionaries, corpora – both text and speech –, various language databases) and language technology tools (software) available to everyone working with digital language materials. CELR coordinates and organises the documentation and archiving of the resources as well as develops language technology standards and draws up necessary legal contracts and licences for different types of users (public, academic, commercial, etc.). In addition to collecting language resources, a system will be launched for introducing the resources to, informing and educating the potential users. The main users of CELR are researchers from Estonian R&D institutions and Social Sciences and Humanities researchers all over the world via the CLARIN ERIC network of similar centers in Europe. Access to data is provided through different sites: Public Repository , Language resources, and MetaShare CELR
The ChemBio Hub vision is to provide the tools that will make it easier for Oxford University scientists to connect with colleagues to improve their research, to satisfy funders that the data they have paid for is being managed according to their policies, and to make new alliances with pharma and biotech partners. Funding and development of the ChemBio Hub was ending on the 30th June 2016. Please be reassured that the ChemBio Hub system and all your data will continue to be secured on the SGC servers for the foreseeable future. You can continue to use the services as normal. More information see:
The range of CIRAD's research has given rise to numerous datasets and databases associating various types of data: primary (collected), secondary (analysed, aggregated, used for scientific articles, etc), qualitative and quantitative. These "collections" of research data are used for comparisons, to study processes and analyse change. They include: genetics and genomics data, data generated by trials and measurements (using laboratory instruments), data generated by modelling (interpolations, predictive models), long-term observation data (remote sensing, observatories, etc), data from surveys, cohorts, interviews with players.
The data archive maintains a collection of social and economic datasets. It's a centralized source for numeric data files: their acquisition, storage, maintenance, and use. We support the research activities of social science faculty, students, and staff at Cornell University. The collection includes federal or state censuses, files based on administrative records, public opinion surveys, economic and social data from national and international organizations, and studies compiled by individual researchers. You can search our holdings or browse studies by subject area. Also see Locating and Using Archive Data.