Search | re3data.org

Pacific Northwest National Laboratory DataHub: Scientific Data Repository

PNNL2

Subject(s)

Content type(s)

Country

United States

Sharing and preserving data are central to protecting the integrity of science. DataHub, a Research Computing endeavor, provides tools and services to meet scientific data challenges at Pacific Northwest National Laboratory (PNNL). DataHub helps researchers address the full data life cycle for their institutional projects and provides a path to creating findable, accessible, interoperable, and reusable (FAIR) data products. Although open science data is a crucial focus of DataHub’s core services, we are interested in working with evidence-based data throughout the PNNL research community.

CLARIN-LV repository

CLARIN Latvia repository

Subject(s)

Content type(s)

Country

CLARIN-LV is a national node of Clarin ERIC (Common Language Resources and Technology Infrastructure). The mission of the repository is to ensure the availability and long term preservation of language resources. The data stored in the repository are being actively used and cited in scientific publications.

TRR170-DB

Late Accretion Onto Terrestrial Planets

Subject(s)

Content type(s)

Country

Germany

The TRR170-DB was set up to manage data products of the collaborative research center TRR 170 'Late Accretion onto Terrestrial Planets' (https://www.trr170-lateaccretion.de/). However, meanwhile the repository also stores data by other institutions and researchers. Data include laboratory and other instrumental data on planetary samples, remote sensing data, geological maps and model simulations.

Open Access Power-Grid Frequency Database

OSF repository Open Access Power-Grid Frequency Database

Subject(s)

Content type(s)

Country

This repository stores and links the openly available power-grid frequency recordings across the globe. This database is comprised of open data existent across three dimensions: - TSO data: Transmission System's Operator (TSO) recordings made public; - Research projects: Open-data database research projects; - Independent Gatherings: Industrial, private, or personal recordings that were made publicly available.

COEMS Open Data

Continuous Observation of Embedded Multicore Systems Data

Subject(s)

Content type(s)

Country

<<<!!!<<< This repository is no longer available. >>>!!!>>>

ELRA Catalogue of Language Resources

ELRA Catalogue

Subject(s)

Content type(s)

Country

European Union

An increasing number of Language Resources (LT) in the various fields of Human Language Technology (HLT) are distributed on behalf of ELRA via its operational body ELDA, thanks to the contribution of various players of the HLT community. Our aim is to provide Language Resources, by means of this repository, so as to prevent researchers and developers from investing efforts to rebuild resources which already exist as well as help them identify and access those resources.

UCL Research Data Repository

Subject(s)

Content type(s)

Country

United Kingdom

The UCL Research Data Repository is the institutional data repository of University College London. Based on Figshare, it accepts data deposits from UCL staff and doctoral students from all disciplines. Depositors are encouraged to use CC0 licences to make their data available to the widest possible range of users and the widest range of uses.

Gendered Innovation Open Data Repository

formerly: Gender Observatory Data Repository (GO-DaRe)

Subject(s)

Content type(s)

Country

The Data Repository of the H2020/TINNGO Project (https://www.tinngo.eu/) is used to store large volumes of gendered innovation related data, acquired from 10 national hubs of a pan-European Gender Observatory, Living Labs and other sources.

SUITS Data Repository

SUITS Open Data Repository

Subject(s)

Content type(s)

Plain text

Country

European Union

The Open Data Repository of the H2020/CIVITAS/SUITS Project (https://www.suits-project.eu/) is used to store large volumes of crowdsourced traffic data, useful for SUMP implementations by local authorities of small-medium cities.

SENSOR

SENSOR.awi.de

Subject(s)

Content type(s)

Country

Germany

<<<!!!<<< This repository is no longer available. >>>!!!>>>

IIASA DARE

Data Repository of the International Institute of Applied Systems Analysis

Subject(s)

Content type(s)

Country

Austria

<<<!!!<<< This repository is no longer available. >>>!!!>>>

Eurac Research CLARIN Centre

ERCC

Subject(s)

Content type(s)

Country

The Eurac Research CLARIN Centre (ERCC) is a dedicated repository for language data. It is hosted by the Institute for Applied Linguistics (IAL) at Eurac Research, a private research centre based in Bolzano, South Tyrol. The Centre is part of the Europe-wide CLARIN infrastructure, which means that it follows well-defined international standards for (meta)data and procedures and is well-embedded in the wider European Linguistics infrastructure. The repository hosts data collected at the IAL, but is also open for data deposits from external collaborators.

DataverseNO

Subject(s)

Content type(s)

Country

Norway

DataverseNO (https://dataverse.no) is a curated, FAIR-aligned national generic repository for open research data from all academic disciplines. DataverseNO commits to facilitate that published data remain accessible and (re)usable in a long-term perspective. The repository is owned and operated by UiT The Arctic University of Norway. DataverseNO accepts submissions from researchers primarily from Norwegian research institutions. Datasets in DataverseNO are grouped into institutional collections as well as special collections. The technical infrastructure of the repository is based on the open source application Dataverse (https://dataverse.org), which is developed by an international developer and user community led by Harvard University.

Open IFC Model Repository

Subject(s)

Content type(s)

Country

New Zealand

The aim of this repository is for it to be a location from which a wide variety of well analysed IFC-based data files can be sourced. It is planned that over time the number of data files will expand to provide significant coverage of the major aspects that would need to be tested for interoperability.

ILC-CNR for CLARIN-IT repository

ILC4CLARIN

Subject(s)

Content type(s)

Country

ILC-CNR for CLARIN-IT repository is a library for linguistic data and tools. Including: Text Processing and Computational Philology; Natural Language Processing and Knowledge Extraction; Resources, Standards and Infrastructures; Computational Models of Language Usage. The studies carried out within each area are highly interdisciplinary and involve different professional skills and expertises that extend across the disciplines of Linguistics, Computational Linguistics, Computer Science and Bio-Engineering.

Cornell Activity Datasets: CAD-60 & CAD-120

Subject(s)

Content type(s)

Country

United States

<<<!!!<<< The repository is no longer available. >>>!!!>>>

VRP-REP

Vehicle routing problem repository

Subject(s)

Content type(s)

Country

France

The vehicle routing problem repository (VRP-REP) is an open data platform for sharing instances and solutions for vehicle routing problems.

CLARIN.SI repository

Slovenian CLARIN repository

Subject(s)

Content type(s)

Country

CLARIN.SI is the Slovenian node of the European CLARIN (Common Language Resources and Technology Infrastructure) Centers. The CLARIN.SI repository is hosted at the Jožef Stefan Institute and offers long-term preservation of deposited linguistic resources, along with their descriptive metadata. The integration of the repository with the CLARIN infrastructure gives the deposited resources wide exposure, so that they can be known, used and further developed beyond the lifetime of the projects in which they were produced. Among the resources currently available in the CLARIN.SI repository are the multilingual MULTEXT-East resources, the CC version of Slovenian reference corpus Gigafida, the morphological lexicon Sloleks, the IMP corpora and lexicons of historical Slovenian, as well as many other resources for a variety of languages. Furthermore, several REST-based web services are provided for different corpus-linguistic and NLP tasks.

Scientific Data Repository

Data Repository. Interactive Analytics, Exploration & Visualization.

Subject(s)

Content type(s)

Plain text

Country

United States

A machine learning data repository with interactive visual analytic techniques. This project is the first to combine the notion of a data repository with real-time visual analytics for interactive data mining and exploratory analysis on the web. State-of-the-art statistical techniques are combined with real-time data visualization giving the ability for researchers to seamlessly find, explore, understand, and discover key insights in a large number of public donated data sets. This large comprehensive collection of data is useful for making significant research findings as well as benchmark data sets for a wide variety of applications and domains and includes relational, attributed, heterogeneous, streaming, spatial, and time series data as well as non-relational machine learning data. All data sets are easily downloaded into a standard consistent format. We also have built a multi-level interactive visual analytics engine that allows users to visualize and interactively explore the data in a free-flowing manner.

CITK

Cognitive Interaction Toolkit

Subject(s)

Content type(s)

Country

Germany

<<<!!!<<< This repository is no longer available. >>>!!!>>>

CLARIN-PL

Language Technology Centre

Subject(s)

Content type(s)

Country

Polish CLARIN node – CLARIN-PL Language Technology Centre – is being built at Wrocław University of Technology. The LTC is addressed to scholars in the humanities and social sciences. Registered users are granted free access to digital language resources and advanced tools to explore them. They can also archive and share their own language data (in written, spoken, video or multimodal form).

Gulf of Mexico Research Initiative Information and Data Cooperative

GRIIDC

Subject(s)

Content type(s)

Country

United States

The Gulf of Mexico Research Initiative Information and Data Cooperative (GRIIDC) is a team of researchers, data specialists and computer system developers who are supporting the development of a data management system to store scientific data generated by Gulf of Mexico researchers. The Master Research Agreement between BP and the Gulf of Mexico Alliance that established the Gulf of Mexico Research Initiative (GoMRI) included provisions that all data collected or generated through the agreement must be made available to the public. The Gulf of Mexico Research Initiative Information and Data Cooperative (GRIIDC) is the vehicle through which GoMRI is fulfilling this requirement. The mission of GRIIDC is to ensure a data and information legacy that promotes continual scientific discovery and public awareness of the Gulf of Mexico Ecosystem.

The Content Name Collection

CNC

Subject(s)

Content type(s)

Country

Switzerland

<<<!!!<<< The repository is offline >>>!!!>>> A collection of open content name datasets for Information Centric Networking. The "Content Name Collection" (CNC) lists and hosts open datasets of content names. These datasets are either derived from URL link databases or web traces. The names are typically used for research on Information Centric Networking (ICN), for example to measure cache hit/miss ratios in simulations.

Network Repository

Network Data Repository, Graph Data, Social Networks

Subject(s)

Content type(s)

Country

United States

Network Repository is the first interactive data repository for graph and network data. It hosts graph and network datasets, containing hundreds of real-world networks and benchmark datasets. Unlike other data repositories, Network Repository provides interactive analysis and visualization capabilities to allow researchers to explore, compare, and investigate graph data in real-time on the web.

Prognostics Center of Excellence Data Set Repository

PCoE Datasets

Subject(s)

Content type(s)

Country

United States

NASA's Prognostics Center of Excellence hosts the Prognostics Data Repository to provide data used in the development of prognostic algorithms, and time series of nominal to failed states. Data are donated from universities, agencies, or companies on an ongoing process.

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning