Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 545 result(s)
GroupLens is a research lab in the Department of Computer Science and Engineering at the University of Minnesota, Twin Cities specializing in recommender systems, online communities, mobile and ubiquitous technologies, digital libraries, and local geographic information systems.
The Life Science Database Archive maintains and stores the datasets generated by life scientists in Japan in a long-term and stable state as national public goods. The Archive makes it easier for many people to search datasets by metadata (description of datasets) in a unified format, and to access and download the datasets with clear terms of use. In addition, the Archive provides datasets in forms friendly to different types of users in public and private institutions, and thereby supports further contribution of each research to life science.
Blackfynn Discover is a repository for Neurology and Neuroscience datasets. This repository, funded by DARPA, the NIH, and others, provides a user-friendly solution for publishing large, complex datasets is a scalable and sustainable way. The platform aims to make data available in a meaningful way and to drive adoption of cloud-based analysis over large datasets.
Since 2004, the GFZ German Research Centre for Geosciences assigns Digital Object Identifiers (DOI) to datasets. These datasets are archived by and published through GFZ Data Services and cover all geoscientific disciplines. They range from large dynamic datasets deriving from data intensive global monitoring networks with real-time data acquisition to the full suite of highly variable datasets collected by individual researchers or small teams. These highly variable data (‘long-tail data’) are small in size, but represent an important part of the total scientific output.
>>>!!!<<< The repository is offline >>>!!!<<< A collection of open content name datasets for Information Centric Networking. The "Content Name Collection" (CNC) lists and hosts open datasets of content names. These datasets are either derived from URL link databases or web traces. The names are typically used for research on Information Centric Networking (ICN), for example to measure cache hit/miss ratios in simulations.
The aim of CfA Library Datasets Dataverse is creating a better information system to respond to the changing needs of astronomers not only at the CfA, but worldwide as well. As part of this growing partnership with the ADS, the CfA Library is expanding its metadata and data curation services, and in the process, creating datasets that the astronomy community may find useful. The CfA Library Datasets Dataverse has been created to share these datasets with the greater community with the hope that some members may find it useful. Please remember to acknowledge the CfA Library and the ADS and cite the work using the "Data Citation" presented under each study's "Cataloging Information" section.
The Database contains all publicly available HMS LINCS datasets and information for each dataset about experimental reagents (small molecule perturbagens, cells, antibodies, and proteins) and experimental and data analysis protocols. increases the ability of the public to easily find, download, and use datasets that are generated and held by the Federal Government. provides descriptions of the Federal datasets (metadata), information about how to access the datasets, and tools that leverage government datasets
The CiardRING is a global directory of web-based information services and datasets for agricultural research for development (ARD). It is the principal tool created through the CIARD initiative to allow information providers to register their services and datasets in various categories and so facilitate the discovery of sources of agriculture-related information across the world. The RING aims to provide an infrastructure to improve the accessibility of the outputs of agricultural research and of information relevant to agriculture.
It captures and catalogues ancient human genome and microbiome data, including raw sequence and processed data, along with metadata about its provenance and production. Included datasets are generated from ancient samples studied at the Australian Centre for Ancient DNA, University of Adelaide in collaboration with other research groups. Datasets and collections in OAGR are open data resources made freely available in a reusable form, using open file formats and licensed with minimal restrictions for reuse. Digital object identifiers (DOIs) are minted for included datasets and collections to facilitate persistent identification and citation.
IEEE DataPort™ is an online data repository created and supported by both the IEEE Signal Processing Society and the IEEE Big Data Initiative, and it enables data owners to upload their data without cost. DataPort makes data available in three ways: standard, open access, and data competition. By default, all "standard" datasets that are uploaded are accessible to paid DataPort subscribers. Data owners can also pay a fee (APC) to make a dataset accessible through open access to non-subscribers. The third option is to to host a "data competition" and make a dataset accessible for free for a duration with instructions on the competition and how to participate. DataPort provides workflows for receiving entries and managing the competitions. All datasets are stored on Amazon AWS S3, and each dataset can be allotted up to 2TB each.
Virginia Tech’s Data Repository TechData is a platform for openly publishing datasets or other research products created by Virginia Tech faculty, staff, and students. VTechData highlights, preserves, and provides access to research products (e.g. datasets) of the Virginia Tech community, and in doing so help to disseminate the intellectual output of the university in its land-grant mission.
Network Repository is the first interactive data repository for graph and network data. It hosts graph and network datasets, containing hundreds of real-world networks and benchmark datasets. Unlike other data repositories, Network Repository provides interactive analysis and visualization capabilities to allow researchers to explore, compare, and investigate graph data in real-time on the web.
Research Data Finder is QUT’s discovery service for research data created or collected by QUT researchers. Designed to promote the visibility of QUT research datasets, Research Data Finder provides descriptions about shareable, reusable datasets available via open or mediated access.
The Information Marketplace for Policy and Analysis of Cyber-risk & Trust (IMPACT) program supports global cyber risk research & development by coordinating, enhancing and developing real world data, analytics and information sharing capabilities, tools, models, and methodologies. In order to accelerate solutions around cyber risk issues and infrastructure security, IMPACT makes these data sharing components broadly available as national and international resources to support the three-way partnership among cyber security researchers, technology developers and policymakers in academia, industry and the government.
WBG Finances is a World Bank Group digital platform that provides our clients and partners access to public financial data and portfolio information from across all Group entities in one place. WBG Finances simplifies the presentation of financial information in an ‘easy to consume’ and in the context of Country and Portfolio across WBG. Open Finances makes World Bank Group’s financials available for everybody to explore. All the data presented is available to everybody to analyze, visualize, and share with others. We invite you to explore the numerous tools, build your own visualizations or download the data in multiple formats. If you are a developer, connect to it through the APIs associated with all the datasets.
Open access to macromolecular X-ray diffraction and MicroED datasets. The repository complements the Worldwide Protein Data Bank. SBDG also hosts reference collection of biomedical datasets contributed by members of SBGrid, Harvard and pilot communities.
The National Archives is home to millions of historical documents, known as records, which were created and collected by UK central government departments and major courts of law. Data of the fomer National Digital Archive of Datasets (NDAD) collection, which was active from 1997 to 2010 and preserves and provides online access to archived digital datasets and documents from UK central government departments, is integrated.