  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
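The operators listed above can be combined into a single query string. The following is a minimal sketch only: the helper names (`phrase`, `prefix`, `fuzzy`, `any_of`, `exclude`, `all_of`) are hypothetical and not part of any search API, and the exact spacing the search box accepts is an assumption. Terms are joined with spaces because the list above states that AND is the default.

```python
from typing import Optional


def phrase(text: str, slop: Optional[int] = None) -> str:
    """Quote text for a phrase search; an optional ~N suffix sets the slop."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"


def prefix(stem: str) -> str:
    """Append * so the keyword matches as a wildcard prefix."""
    return f"{stem}*"


def fuzzy(word: str, distance: int) -> str:
    """Append ~N to request matches within the given edit distance."""
    return f"{word}~{distance}"


def any_of(*terms: str) -> str:
    """Combine terms with | (OR) and group them with parentheses."""
    return "(" + " | ".join(terms) + ")"


def exclude(term: str) -> str:
    """Prefix a term with - to apply a NOT operation."""
    return f"-{term}"


def all_of(*terms: str) -> str:
    """Join terms with spaces, relying on AND being the default operator."""
    return " ".join(terms)


# Hypothetical example query built from the helpers above.
query = all_of(
    phrase("open source"),
    any_of(prefix("geo"), fuzzy("climate", 1)),
    exclude("hardware"),
)
print(query)
```

The resulting string can be pasted into the search field; whether it returns the intended records depends on how the search backend tokenizes the terms.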
Found 146 result(s)
SourceForge is dedicated to making open source projects successful. We thrive on community collaboration to help us create the leading resource for open source software development and distribution. IT professionals come to SourceForge to develop, download, review, and publish open source software. SourceForge is the largest, most trusted destination for Open Source Software discovery and development on the web.
The long term goal of the Software Heritage initiative is to collect all publicly available software in source code form together with its development history, replicate it massively to ensure its preservation, and share it with everyone who needs it. The Software Heritage archive is growing over time as we crawl new source code from software projects and development forges.
FLOSSmole is a collaborative collection of free, libre, and open source software (FLOSS) data. FLOSSmole contains nearly 1 TB of data covering the period from 2004 until now, for more than 500,000 different open source projects.
Launchpad is a software collaboration platform that provides bug tracking, code hosting using Bazaar, code reviews, Ubuntu package building and hosting, translations, mailing lists, answer tracking and FAQs, and specification tracking. Launchpad can host your project’s source code using the Bazaar version control system.
Bioconductor provides tools for the analysis and comprehension of high-throughput genomic data. Bioconductor uses the R statistical programming language, and is open source and open development. It has two releases each year, and an active user community. Bioconductor is also available as an AMI (Amazon Machine Image) and a series of Docker images.
The Wolfram Data Repository is a public resource that hosts an expanding collection of computable datasets, curated and structured to be suitable for immediate use in computation, visualization, analysis and more. Building on the Wolfram Data Framework and the Wolfram Language, the Wolfram Data Repository provides a uniform system for storing data and making it immediately computable and useful. With datasets of many types and from many sources, the Wolfram Data Repository is built to be a global resource for public data and data-backed publication.
OSGeo's mission is to support the collaborative development of open source geospatial software, in part by providing resources for projects and promoting freely available geodata. The Public Geodata Repository is a distributed repository and registry of data sources free to access, reuse, and re-distribute.
As part of the Open Data Pilot that the COEMS EU H2020 project is participating in, the COEMS Open Data Portal is devoted to collecting software and hardware trace data from both academic and industry partners. Our focus lies on the detection and identification of non-deterministic software failures caused by race conditions and access to inconsistent data. We will therefore provide efficient real-time access and analysis for operating safe software systems.
GitHub is the best place to share code with friends, co-workers, classmates, and complete strangers. Over three million people use GitHub to build amazing things together. With the collaborative features of, our desktop and mobile apps, and GitHub Enterprise, it has never been easier for individuals and teams to write better code, faster. Originally founded by Tom Preston-Werner, Chris Wanstrath, and PJ Hyett to simplify sharing code, GitHub has grown into the largest code host in the world.
The Cognitive Interaction Toolkit provides a unified view on linked research artifacts of collaborating institutions in Bielefeld University’s strategic research area Interactive Intelligent Systems. It binds together a framework for software integration, software and hardware components, system descriptions, experiments, data sets, and publications. The research artifacts are hosted at a distributed service infrastructure that includes project-oriented collaboration platforms, open source and open data servers, continuous integration services, and publication data servers. These are accessible via this web catalog defining a central collaborative instance for integrated research efforts.
The NCAR Climate Data Gateway provides data discovery and access services for global and regional climate model data, knowledge, and software. The NCAR Climate Data Gateway supports community access to data products from many of NCAR's community modeling efforts, including the IPCC, PCM, AMPS, CESM, NARCCAP, and NMME activities. Data products are generally open and available; however, download access may require a login.
OpenWorm aims to build the first comprehensive computational model of Caenorhabditis elegans (C. elegans), a microscopic roundworm. With only a thousand cells, it solves basic problems such as feeding, mate-finding and predator avoidance. Despite being extremely well studied in biology, this organism still eludes a deep, principled understanding of its biology. We are using a bottom-up approach, aimed at observing the worm behaviour emerge from a simulation of data derived from scientific experiments carried out over the past decade. To do so we are incorporating the data available in the scientific community into software models. We are engineering Geppetto and Sibernetic, open-source simulation platforms, to be able to run these different models in concert. We are also forging new collaborations with universities and research institutes to collect data that fill in the gaps. All the code we produce in the OpenWorm project is Open Source and available on GitHub.
A community-based search platform. OMICtools bridges the gap between researchers and tool developers. OMICtools brings together an interactive worldwide user community, linking expert curators who submit, review and categorize tools, to users who strengthen the interface by bringing feedback and reviews.
The NDEx Project provides an open-source framework where scientists and organizations can share, store, manipulate, and publish biological network knowledge. The NDEx Project maintains a free, public website; alternatively, users can also decide to run their own copies of the NDEx Server software in cases where the stored networks must be kept in a highly secure environment (such as for HIPAA compliance) or where high application load is incompatible with a shared public resource.
The Blue Obelisk Data Repository lists many important chemoinformatics data such as element and isotope properties, atomic radii, etc. including references to original literature. Developers can use this repository to make their software interoperable.
Data products developed and distributed by the National Institute of Standards and Technology span multiple disciplines of research and are widely used in research and development programs by industry and academia. NIST's publicly available data sets showcase its commitment to providing accurate, well-curated measurements of physical properties, exemplified by the Standard Reference Data program, as well as its commitment to advancing basic research. In accordance with U.S. Government Open Data Policy and the NIST Plan for providing public access to the results of federally funded research data, NIST maintains a publicly accessible listing of available data, the NIST Public Dataset List (json). Additionally, these data are assigned a Digital Object Identifier (DOI) to increase the discovery and access to research output; these DOIs are registered with DataCite and provide globally unique persistent identifiers. The NIST Science Data Portal provides a user-friendly discovery and exploration tool for publicly available datasets at NIST. This portal is designed and developed with Project Open Data standards and principles. The portal software is hosted in the usnistgov GitHub repository.
The Open PHACTS project will develop an open source, open standards and open access innovation platform, Open Pharmacological Space (OPS), via a semantic web approach. OPS will comprise data, vocabularies and infrastructure needed to accelerate drug-oriented research. This semantic integration hub will address key bottlenecks in small molecule drug discovery: disparate information sources, lack of standards and shared concept identifiers, guided by well-defined research questions assembled from participating drug discovery teams. Open PHACTS draws together multiple sources of publicly available pharmacological and physicochemical data, accessible via the Open PHACTS Explorer, an intuitive interface, and the powerful Open PHACTS API.
e!DAL stands for electronic Data Archive Library. It is a lightweight open source software framework for publishing and sharing research data. e!DAL was developed based on experiences coming from decades of research data management and has grown towards being a general data archiving and publication infrastructure. The first research data repository is the "Plant Genomics and Phenomics Research Data Repository".
The Scientific Database of the Federal University of Paraná aims to gather the scientific data used in the research that was published by the UFPR community in theses, dissertations, journal articles, and other bibliographic materials. BDC joins RDI / UFPR as an innovative service that tracks the worldwide trend in research planning, management, production, organization, storage, dissemination and reuse. The availability of research data contributes to the transparency and optimization of scientific production through the reuse of data sets and the possibility of new analyses and approaches.
The Biological Collection Access Service for Europe, BioCASE, is a transnational network of biological collections of all kinds. BioCASE enables widespread unified access to distributed and heterogeneous European collection and observational databases using open-source, system-independent software and open data standards and protocols.
The Spiral Digital Repository is the Imperial College London institutional open access repository. This system allows you, as an author, to make your research documents open access without incurring additional publication costs. When you self-archive a research document in Spiral it becomes free for anyone to read. You can upload copies of your publications to Spiral using Symplectic Elements. All deposited content becomes searchable online.
CaltechDATA is an institutional data repository for Caltech. The Caltech Library runs the repository to preserve the accomplishments of Caltech researchers and share their results with the world. Caltech-associated researchers can upload data, link data with their publications, and assign a permanent DOI so that others can reference the data set. The repository also preserves software and has automatic GitHub integration. All files present in the repository are open access or embargoed, and all metadata is always available to the public.
EarthByte is an internationally leading eGeoscience collaboration between several Australian Universities, international centres of excellence and industry partners. One of the fundamental aims of the EarthByte Group is geodata synthesis through space and time, assimilating the wealth of disparate geological and geophysical data into a four-dimensional Earth model including tectonics, geodynamics and surface processes. The EarthByte Group is pursuing open innovation via collaborative software development, high performance and distributed computing, “big data” analysis and by making open access digital data collections available to the community.
DataFirst's open research data repository, based at the University of Cape Town, gives open access to disaggregated administrative and survey data from African governments and research entities. DataFirst also operates a secure centre at the university to give researchers access to highly-disaggregated South African data.
Rodare is the institutional research data repository at HZDR (Helmholtz-Zentrum Dresden-Rossendorf). Rodare allows HZDR researchers to upload their research software and data and enrich those with metadata to make them findable, accessible, interoperable and reusable (FAIR). By publishing all associated research software and data via Rodare, research reproducibility can be improved. Uploads receive a Digital Object Identifier (DOI) and can be harvested via an OAI-PMH interface.