Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 1429 result(s)
>>>>!!!!<<<< As of 2017-05-17 the data catalog is no longer available >>>>!!!!<<<< DataFed is a web services-based software that non-intrusively mediates between autonomous, distributed data providers and users. The main goals of DataFed are: Aid air quality management and science by effective use of relevant data - Facilitate the access and flow of atmospheric data from provider to users - Support the development of user-driven data processing value chains. DataFed Catalog links searchable Datafed applications worldwide.
Content type(s)
Database of ancient sources concerning Roman Water Law. Specific legal sources, e.g. from the Corpus Iuris Civilis or the Codex Theodosianus, and literary sources, for example from Cicero, Frontinus, Hyginus, Siculus Flaccus or Vitruvius, were collected to give an overview of water related legal problems in ancient Rome. Furthermore, the aim of the database is to classify these sources into different legal topics, in order to facilitate the research for sources concerning specific questions regarding Roman Water Law.
The MEROPS database is an information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them.
The University Information System RUSSIA (UIS RUSSIA) is a mutual project of Research Computing Center and Economic Faculty at Lomonosov Moscow State University. It was introduced in 2000 and has been designed as a digital library for research and educational purposes, primarily in the fields of economic and social sciences. Since then it was maintained to meet the growing interest and challenges of the Russian universities and educational community. Starting from 2003 our development team concentrated on statistical databases to build an infrastructure for educational courses, to assist broad Russian social and economic studies from regional to local and down to household level. Today profound knowledge of statistical data and ability to implement advanced modern methods of applied analysis are expected from successful university graduates and are in high demand among new specialists, particularly in economics, public administration and related areas.
Data products developed and distributed by the National Institute of Standards and Technology span multiple disciplines of research and are widely used in research and development programs by industry and academia. NIST's publicly available data sets showcase its committment to providing accurate, well-curated measurements of physical properties, exemplified by the Standard Reference Data program, as well as its committment to advancing basic research. In accordance with U.S. Government Open Data Policy and the NIST Plan for providing public access to the results of federally funded research data, NIST maintains a publicly accessible listing of available data, the NIST Public Dataset List (json). Additionally, these data are assigned a Digital Object Identifier (DOI) to increase the discovery and access to research output; these DOIs are registered with DataCite and provide globally unique persistent identifiers. The NIST Science Data Portal provides a user-friendly discovery and exploration tool for publically available datasets at NIST. This portal is designed and developed with Project Open Data standards and principles. The portal software is hosted in the usnistgov github repository.
The sources of the data sets include data sets donated by researchers, surveys carried out by SRDA, as well as by government department and other academic organizations. Prior to the release of data sets, the confidentiality and sensitivity of every survey data set are evaluated. Standard data management and cleaning procedures are applied to ensure data accuracy and completeness. In addition, metadata and relevant supplement files are also edited and attached.
The Colombian Biodiversity Information Facility (SiB Colombia) is a national initiative established in early 2000 and coordinated by Instituto Humboldt to facilitate free and open access to biodiversity data. It comprises a network of more than 100 organizations (including universities, biological collections, research institutes, environmental authorities and NGOs among others) that work together to ensure that biodiversity data is available to support further research, education, policy making and incentive measures for the conservation and sustainable use of biodiversity. SiB Colombia’s mission is to facilitate the management of biodiversity data by bringing together users, publishers and data producers to support research, education and decision making related to knowledge, conservation and sustainable use of biodiversity and ecosystem services. SiB Colombia aims to consolidate the collaborative platform that facilitates the generation, use and democratization of knowledge on the biodiversity of Colombia. Thus, SiB Colombia contributes to a vision of a society that knows and values the biodiversity in which it is immersed, and uses such knowledge for its development.
The DASH Repository provides persistent data archiving and distribution for small-scale data collections from UCAR/NCAR researchers and projects. This data repository specifically focuses on providing long-term preservation and stewardship of NCAR's small-scale data collections. Complementing other NCAR-managed data repositories, the DASH Repository helps NCAR researchers to enable long term access, interoperability, and reuse of NCAR datasets.
SAFER-Data is a web-based interface to the Environmental Data Archive maintained by the Environmental Research Centre (ERC) in the Environmental Protection Agency (EPA) of Ireland, who has responsibilities for a wide range of licensing, enforcement, monitoring and assessment activities associated with environmental protection.
The Numeric Data Services Dataverse provides access to the Cross National Time Series (Banks data), the ITERATE database, and selected survey data. The DataVerse of the Harvard's Numeric Data Services houses a curated collection of datasets to meet the research and instructional needs of the Harvard community, which are also openly accessible. Primarily social sciences.
The Neuroscience Information Framework is a dynamic inventory of Web-based neuroscience resources: data, materials, and tools accessible via any computer connected to the Internet. An initiative of the NIH Blueprint for Neuroscience Research, NIF advances neuroscience research by enabling discovery and access to public research data and tools worldwide through an open source, networked environment.
This facility permits selective searches of some atomic data files compiled by R. L. Kurucz (Harvard-Smithsonian Center for Astrophysics). The data provided are: - vacuum wavelength (in nm) [above 200 nm calculated using Edlen, Metrologia, Vol. 2, No. 2, 1966]- air wavelength (in nm) above 200 nm- log(gf), - E [in cm-1], j, parity, and configuration for the levels (lower, upper), - information regarding the source of the data. CD-ROM 18 contains the spectrum synthesis programs ATLAS7V, SYNTHE, SPECTRV, ROTATE, BROADEN, PLOTSYN, etc. and sample runs found in directory PROGRAMS; Atomic line data files BELLHEAVY.DAT, BELLLIGHT.DAT, GFIRONLAB.DAT, GULLIVER.DAT, NLTELINES.DAT, GFIRONQ.DAT, obsolete, merged into GFALL, found in directory LINELISTS: Molecular line data files C2AX.ASC, C2BA.ASC, C2DA.ASC, C2EA.ASC, CNAX.ASC, CNBX.ASC, COAX.ASC, COXX.ASC, H2.ASC, HYDRIDES.ASC, SIOAX.ASC, SIOEX.ASC, SIOXX.ASC, found in directory LINELISTS; and my solar flux atlas for test calculations SOLARFLUX.ASC.
INTEGRALL is a web-based platform dedicated to compile information on integrons and designed to organize all the data available for these genetic structures. INTEGRALL provides a public genetic repository for sequence data and nomenclature and offers to scientists an easy and interactive access to integron's DNA sequences, their molecular arrangements as well as their genetic contexts.
This database provides theoretical values of energy levels of hydrogen and deuterium for principle quantum numbers n = 1 to 200 and all allowed orbital angular momenta l and total angular momenta j. The values are based on current knowledge of the revelant theoretical contributions including relativistic, quantum electrodynamic, recoil, and nuclear size effects.
OpenML is an open ecosystem for machine learning. By organizing all resources and results online, research becomes more efficient, useful and fun. OpenML is a platform to share detailed experimental results with the community at large and organize them for future reuse. Moreover, it will be directly integrated in today’s most popular data mining tools (for now: R, KNIME, RapidMiner and WEKA). Such an easy and free exchange of experiments has tremendous potential to speed up machine learning research, to engender larger, more detailed studies and to offer accurate advice to practitioners. Finally, it will also be a valuable resource for education in machine learning and data mining.
The Animal Sound Archive at the Museum für Naturkunde in Berlin is one of the oldest and largest collections of animal sounds. Presently, the collection consists of about 120,000 bioacoustical recordings comprising almost all groups of animals: 1.800 bird species 580 mammalian species more then150 species of invertebrates; some fishes, amphibians and reptiles
PRISM is a digital archive of the University of Calgary's intellectual output. Established and maintained by Libraries and Cultural Resources to manage, preserve and make available the academic works of faculty, students and research groups. The collection includes faculty publications, masters and doctoral theses, and research output from across Southern Alberta. PRISM is updated regularly, with new works added daily.
e-cienciaDatos is a multidisciplinary data repository that houses the scientific datasets of researchers from the public universities of the Community of Madrid and the UNED, members of the Consorcio Madroño, in order to give visibility to these data, to ensure its preservation And facilitate their access and reuse. e-cienciaDatos is structured as a system constituted by different communities that collects datasets of each of the individual universities. e-cienciaDatos offers the deposit and publication of datasets, assigning a digital object identifier DOI to each of them. The association of a dataset with a DOI will facilitate data verification, dissemination, reuse, impact and long-term access. In addition, the repository provides a standardized citation for each dataset, which contains sufficient information so that it can be identified and located, including the DOI.
The Institute for Marine and Antarctic Studies (IMAS) pursues multidisciplinary and interdisciplinary work to advance understanding of temperate marine, Southern Ocean, and Antarctic environments. IMAS research is characterised as innovative, relevant, and globally distinctive. Education at IMAS delivers world class programs, resulting in highly trained graduates who serve the needs of academic institutions, industry, government, and the community. IMAS is naturally advantaged by its Southern Ocean location proximal to Antarctica, and hosts one of the world's largest critical masses of marine and Antarctic researchers. IMAS also operate facilities and host data sets of national and global interest and to the benefit of the community. The guiding framework of IMAS is that all data that are not commercial-in-confidence or restricted by legislation or agreement are owned by the University on behalf of the community or Commonwealth, are hosted by an organisation, and are shared with researchers for analysis and interpretation. IMAS is committed to the concept of Open Data. The IMAS Data Portal is an online interface showcasing the IMAS metadata catalogue and all available IMAS data. The portal aims to make IMAS data freely and openly available for the benefit of Australian marine and environmental science as a whole.
MetaboLights is a database for Metabolomics experiments and derived information. The database is cross-species, cross-technique and covers metabolite structures and their reference spectra as well as their biological roles, locations and concentrations, and experimental data from metabolic experiments.
>>>!!!!<<< Retirement of UniProt Metagenomic and Environmental Sequences (UniMES): UniProt has retired UniMES as there is now a resource at the EBI that is dedicated to serving metagenomic researchers. Henceforth, we recommend using the EBI Metagenomics portal instead . In addition to providing a repository of metagenomics sequence data, EBI Metagenomics allows you to view functional and taxonomic analyses and to submit your own samples for analysis. >>> !!!<<< The UniProt Metagenomic and Environmental Sequences (UniMES) database is a repository specifically developed for metagenomic and environmental data. We provide UniMES clusters in order to obtain complete coverage of sequence space at different resolutions.
This record is combined with 'NASA Socioeconomic Data and Applications Center' (see: ) The World Data Center for Human Interactions in the Environment has been superseded by the NASA Socioeconomic Data and Applications Center (SEDAC), which is a regular member of the World Data System (WDS). The International Council for Science (ICSU) replaced the World Data Centers (WDC) with the WDS, which supports the provision of trusted scientific data services by certifying its members to ensure that they maintain the organizational capabilities and infrastructure for managing the data products and services that they offer. SEDAC focuses on human interactions in the environment and is one of the Distributed Active Archive Centers (DAACs) in the NASA Earth Observing System Data and Information System (EOSDIS). The NASA Earth Science Data and Information System (ESDIS) Project, a WDS Network Member, manages the EOSDIS science systems.
The Ensembl genome annotation system, developed jointly by the EBI and the Wellcome Trust Sanger Institute, has been used for the annotation, analysis and display of vertebrate genomes since 2000. Since 2009, the Ensembl site has been complemented by the creation of five new sites, for bacteria, protists, fungi, plants and invertebrate metazoa, enabling users to use a single collection of (interactive and programatic) interfaces for accessing and comparing genome-scale data from species of scientific interest from across the taxonomy. In each domain, we aim to bring the integrative power of Ensembl tools for comparative analysis, data mining and visualisation across genomes of scientific interest, working in collaboration with scientific communities to improve and deepen genome annotation and interpretation.