Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 32 result(s)
Country is Luxembourg's central and official platform for data from the public sector, from research institutes and the private sector.
The Language Bank features text and speech corpora with different kinds of annotations in over 60 languages. There is also a selection of tools for working with them, from linguistic analyzers to programming environments. Corpora are also available via web interfaces, and users can be allowed to download some of them. The IP holders can monitor the use of their resources and view user statistics.
China National GeneBank DataBase (CNGBdb) is a unified platform built for biological big data sharing and application services to the research community. Based on the big data and cloud computing technologies, it provides data services such as archive, analysis, knowledge search, management authorization, and visualization. At present, CNGBdb has integrated large amounts of internal and external molecular data and other information from CNGB, NCBI, EBI, DDBJ, etc., indexed by search, covering 12 data structures. Moreover, CNGBdb correlates living sources, biological samples and bioinformatic data to realize the traceability of comprehensive data.
The ENCODE Encyclopedia organizes the most salient analysis products into annotations, and provides tools to search and visualize them. The Encyclopedia has two levels of annotations: Integrative-level annotations integrate multiple types of experimental data and ground level annotations. Ground-level annotations are derived directly from the experimental data, typically produced by uniform processing pipelines.
This Web resource provides data and information relevant to SARS coronavirus. It includes links to the most recent sequence data and publications, to other SARS related resources, and a pre-computed alignment of genome sequences from various isolates. The genome of SARS-CoV consists of a single, positive-strand RNA that is approximately 29,700 nucleotides long. The overall genome organization of SARS-CoV is similar to that of other coronaviruses. The reference genome includes 13 genes, which encode at least 14 proteins. Two large overlapping reading frames (ORFs) encompass 71% of the genome. The remainder has 12 potential ORFs, including genes for structural proteins S (spike), E (small envelope), M (membrane), and N (nucleocapsid). Other potential ORFs code for unique putative SARS-CoV-specific polypeptides that lack obvious sequence similarity to known proteins.
The AOML Environmental Data Server (ENVIDS) provides interactive, on-line access to various oceanographic and atmospheric datasets residing at AOML. The in-house datasets include Atlantic Expendable Bathythermograph (XBT), Global Lagrangian Drifting Buoy, Hurricane Flight Level, and Atlantic Hurricane Tracks (North Atlantic Best Track and Synoptic). Other available datasets include Pacific Conductivitiy/Temperature/Depth Recorder (CTD) and World Ocean Atlas 1998.
The Sloan Digital Sky Survey (SDSS) is one of the most ambitious and influential surveys in the history of astronomy. Over eight years of operations (SDSS-I, 2000-2005; SDSS-II, 2005-2008; SDSS-III 2008-2014; SDSS-IV 2013 ongoing), it obtained deep, multi-color images covering more than a quarter of the sky and created 3-dimensional maps containing more than 930,000 galaxies and more than 120,000 quasars. DSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Carnegie Institution for Science, Carnegie Mellon University, the Chilean Participation Group, Harvard-Smithsonian Center for Astrophysics, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU) / University of Tokyo, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), Max-Planck-Institut für Astronomie (MPIA Heidelberg), National Astronomical Observatory of China, New Mexico State University, New York University, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Portsmouth, University of Utah, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University.
US National Science Foundation (NSF) facility to support drilling and coring in continental locations worldwide. Drill core metadata and data, borehole survey data, geophysical site survey data, drilling metadata, software code. CSDCO offers several repositories with samples, data, publications and reference collections about drilling and coring: LacCore Core Repository, Open Core Data, Index to Marine and Lacustrine Geological Samples. For " Botanical Reference Collections" contact the LacCore Curator for details.
The Research Data Center (RDC) “International Survey Programs“ provides researchers with data, services, and consultation on a number of important international study series which are under intensive curation by GESIS. They all cover numerous countries and, quite often, substantial time spans. The RDC provides optimal data preparation and access to a wide scope of data and topics for comparative analysis.
Open access to macromolecular X-ray diffraction and MicroED datasets. The repository complements the Worldwide Protein Data Bank. SBDG also hosts reference collection of biomedical datasets contributed by members of SBGrid, Harvard and pilot communities.
Kaggle is a platform for predictive modelling and analytics competitions in which statisticians and data miners compete to produce the best models for predicting and describing the datasets uploaded by companies and users. This crowdsourcing approach relies on the fact that there are countless strategies that can be applied to any predictive modelling task and it is impossible to know beforehand which technique or analyst will be most effective.
The KNMI Data Centre (KDC) provides access to weather, climate and seismological datasets of KNMI. For each dataset descriptive information is available (metadata), including a point of contact. The KNMI Data Centre (KDC) provides access to KNMI data on weather, climate and seismology. You will find KNMI data on various topics such as: the most recent 10 minutes of observations, historical data, data on meteorological stations, modeling, earthquake data and satellite products.
Scans of plates obtained at Landessternwarte Heidelberg-Königstuhl and German-Spanish Astronomical Center (Calar Alto Observatory), Spain, 1900 through 1999.
GABI, acronym for "Genomanalyse im biologischen System Pflanze", is the name of a large collaborative network of different plant genomic research projects. Plant data from different ‘omics’ fronts representing more than 10 different model or crop species are integrated in GabiPD.
The CliSAP-Integrated Climate Data Center (ICDC) allows easy access to climate relevant data from in-situ measurements and satellite remote sensing. These data are important to determine the status and the changes in the climate system. Additionally some relevant re-analysis data are included, which are modeled on the basis of observational data.
This DOI repository provides permanent identifiers to data sets generated by Life Science researchers active in Sweden, and for which no other suitable public repository is available. BILS is a distributed national research infrastructure supported by the Swedish Research Council (Vetenskapsrådet) providing bioinformatics support to life science researchers in Sweden.
RWTH Publications Research Data offers all RWTH Aachen University affiliates the organizational and technical means to electronically document and publish research data at this institutional repository. Certainly, researchers are encouraged to prefer a subject specific repository whenever appropriate and available. RWTH Aachen University is the largest technical university in Germany and one of nine 'German Universities of Excellence' (elite university). The University library Aachen operates the repository as a member of the join community.
Content type(s)
The primary objective of the PhenoCam project is to use automated, near-surface remote sensing to provide continuous, real-time monitoring of vegetation phenology across a range of ecosystems and climate zones.
LacCore curates cores and samples from continental coring and drilling expeditions around the world, and also archives metadata and contact information for cores stored at other institutions.
NSIDC offers hundreds of scientific data sets for research, focusing on the cryosphere and its interactions. Data are from satellites and field observations. All data are free of charge.
Mulce (MUltimodal contextualized Learner Corpus Exchange) is a research project supported by the National Research Agency (ANR programme: "Corpus and Tools in the Humanities", ANR-06-CORP-006). A teaching corpus (LETEC - Learning and Teaching Corpora) combines a systematic and structured data set, particularly of interactional data, and traces left by a training course experimentation, conducted partially or completely online and completed by additional technical, human, pedagogical and scientific information to enable the data to be analysed in context.