Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 222 result(s)
The repository of the Hamburg Centre for Speech Corpora is used for archiving, maintenance, distribution and development of spoken language corpora. These usually consist of audio and / or video recordings, transcriptions and other data and structured metadata. The corpora treat the focus on multilingualism and are generally freely available for research and teaching. Most of the measures maintained by the HZSK corpora were created in the years 2000-2011 in the framework of the SFB 538 "Multilingualism" at the University of Hamburg. The HZSK however also strives to take linguistic data from other projects or contexts, and to provide also the scientific community for research and teaching are available, provided that they are compatible with the current focus of HZSK, ie especially spoken language and multilingualism.
BEA produces economic accounts statistics that enable government and business decision-makers, researchers, and the American public to follow and understand the performance of the Nation's economy. To do this, BEA collects source data, conducts research and analysis, develops and implements estimation methodologies, and disseminates statistics to the public.
NAHDAP acquires, preserves and disseminates data relevant to drug addiction and HIV research. By preserving and making available an easily accessible library of electronic data on drug addiction and HIV infection in the United States, NAHDAP offers scholars the opportunity to conduct secondary analysis on major issues of social and behavioral sciences and public policy
The Substance Abuse and Mental Health Data Archive (SAMHDA) is an initiative funded under contract HHSS283201500001C with the Center for Behavioral Health Statistics and Quality (CBHSQ), Substance Abuse and Mental Health Services Administration (SAMHSA), U.S. Department of Health and Human Services (HHS). CBHSQ has primary responsibility for the collection, analysis, and dissemination of SAMHSA's behavioral health data. Public use files and restricted use files are provided. CBHSQ promotes the access and use of the nation's substance abuse and mental health data through SAMHDA. SAMHDA provides public-use data files, file documentation, and access to restricted-use data files to support a better understanding of this critical area of public health.
Time-sharing Experiments for the Social Sciences (TESS) offers researchers the opportunity to capture the internal validity of experiments while also realizing the benefits of working with a large, diverse population of research participants.
myExperiment is a collaborative environment where scientists can safely publish their workflows and in silico experiments, share them with groups and find those of others. Workflows, other digital objects and bundles (called Packs) can now be swapped, sorted and searched like photos and videos on the Web. Unlike Facebook or MySpace, myExperiment fully understands the needs of the researcher and makes it really easy for the next generation of scientists to contribute to a pool of scientific methods, build communities and form relationships — reducing time-to-experiment, sharing expertise and avoiding reinvention. myExperiment is now the largest public repository of scientific workflows.
The Measures of Effective Teaching(MET) project is the largest study of classroom teaching ever conducted in the United States. The University of Michigan compiled the MET data and video files into a rich research collection called the MET Longitudinal Database. Approved researchers can access the restricted MET quantitative and video data using secure online technical systems. The MET Longitudinal Database consists of a Web-based application for searching the collection and viewing the videos with accompanying metadata, and a Virtual Data Enclave that provides secure remote access to the quantitative data and documentation files.
The Behavioral Risk Factor Surveillance System (BRFSS) is the world's largest, on-going telephone health survey system. As a result, surveys were developed and conducted to monitor state-level prevalence of the major behavioral risks among adults associated with premature morbidity and mortality. The basic philosophy was to collect data on actual behaviors, rather than on attitudes or knowledge, that would be especially useful for planning, initiating, supporting, and evaluating health promotion and disease prevention programs. Currently data are collected monthly in all 50 states.
The Sheffield Hallam University Research Data Repository (SHURDA) is an institutional catalogue of digital and non-digital datasets that are produced by researchers at SHU and preserved at the University or elsewhere.
diversitydata.org is an online tool for exploring quality of life data across metropolitan areas for people of different racial/ethnic groups in the United States. It provides values and rankings for the largest U.S. metropolitan areas on different indicators in 8 areas of life (domains), including demographics, education, economic opportunity, housing, neighborhoods, and health. It also provides a simple mapping utility, showing the range of indicator values for metros across the U.S. Data from 1999 indicators is archives in the companion Diversity Data Archive (https://diversitydata-archive.org/). For a wider selection of data on child wellbeing, visit our partner site, diversitydatakids.org (https://www.diversitydatakids.org/). diversitydata.org has been named a Health Data All Star by the Health Data Consortium. The list was compiled in consultation with leading health researchers, government officials, entrepreneurs, advocates and others to identify the health data resources that matter most.
The goal of the Center of Estonian Language Resources (CELR) is to create and manage an infrastructure to make the Estonian language digital resources (dictionaries, corpora – both text and speech –, various language databases) and language technology tools (software) available to everyone working with digital language materials. CELR coordinates and organises the documentation and archiving of the resources as well as develops language technology standards and draws up necessary legal contracts and licences for different types of users (public, academic, commercial, etc.). In addition to collecting language resources, a system will be launched for introducing the resources to, informing and educating the potential users. The main users of CELR are researchers from Estonian R&D institutions and Social Sciences and Humanities researchers all over the world via the CLARIN ERIC network of similar centers in Europe. Access to data is provided through different sites: Public Repository https://entu.keeleressursid.ee/public-document , Language resources https://keeleressursid.ee/en/resources/corpora, and MetaShare CELR https://metashare.ut.ee/
The European Prospective Investigation into Cancer and Nutrition (EPIC) study is one of the largest cohort studies in the world, with more than half a million (521 000) participants recruited across 10 European countries and followed for almost 15 years. EPIC was designed to investigate the relationships between diet, nutritional status, lifestyle and environmental factors, and the incidence of cancer and other chronic diseases. EPIC investigators are active in all fields of epidemiology, and important contributions have been made in nutritional epidemiology using biomarker analysis and questionnaire information, as well as genetic and lifestyle investigations.
Country
clarin:el is the Greek national network of language resources, a nation-wide Research Infrastructure devoted to the sustainable storage, sharing, dissemination and preservation of language resources. CLARIN EL infrastructure, which is a Greek nation-wide Research Infrastructure devoted to the sustainable storage, sharing, dissemination and preservation of language resources (LRs) and aims at increasing access to and augmentation of such resources at a national scale and beyond. It is an open, integrated, secure and interoperable storage, sharing and processing infrastructure for LRs (datasets, tools and processing services) for all domains domains and disciplines where language plays a critical role, notably. CLARIN EL is implemented in the framework of the CLARIN Attiki, national project in support of ESFRI/2006 Research Infrastructures.
Country
RepOD is a general-purpose repository for open research data, offering all members of the academic community in Poland the possibility to deposit their work. It is intended for scientific data from all disciplines of knowledge and in all formats. The purpose of RepOD is to create a place where research data can be safely stored and openly shared with others.
BOARD (Bicocca Open Archive Research Data) is the institutional data repository of the University of Milano-Bicocca. BOARD is an open, free-to-use research data repository, which enables members of University of Milano-Bicocca to make their research data publicly available. By depositing their research data in BOARD researchers can: - Make their research data citable - Share their data privately or publicly - Ensure long-term storage for their data - Keep access to all versions - Link their article to their data
Country
PubData is Leuphana's institu­tional research data reposi­tory for the long-term preser­vation, documen­tation and publi­cation of research data from scienti­fic projects. PubData is main­tained by Leuphana's Media and Infor­mation Centre (MIZ) and is free of charge. The service is primarily aimed at Leuphana em­ployees and additionally at re­searchers from coope­ration partners con­tractually asso­ciated with Leuphana.
Content type(s)
ResearchDataGov is a web portal for discovering and requesting access to restricted microdata from US federal statistical agencies.
LINDAT/CLARIN is designed as a Czech “node” of Clarin ERIC (Common Language Resources and Technology Infrastructure). It also supports the goals of the META-NET language technology network. Both networks aim at collection, annotation, development and free sharing of language data and basic technologies between institutions and individuals both in science and in all types of research. The Clarin ERIC infrastructural project is more focused on humanities, while META-NET aims at the development of language technologies and applications. The data stored in the repository are already being used in scientific publications in the Czech Republic. In 2019 LINDAT/CLARIAH-CZ was established as a unification of two research infrastructures, LINDAT/CLARIN and DARIAH-CZ.
Country
RODBUK Cracow Open Research Data Repository is co-created by six Cracow universities: AGH University of Science and Technology, University of Physical Education in Krakow, Cracow University of Technology, Krakow University of Economics, Jagiellonian University in Kraków, Pedagogical University of Krakow. The purpose of RODBUK is to collect, develop, archive and make available in open access all types of research data created by researchers, PhD candidates and students in the course of scientific activity. RODBUK aims to implement the Open Science policy by creating a publicly available platform for depositing research datasets enabling: getting acquainted with the research conducted in Cracow's scientific centers, storage of various types of research data obtaining a permanent Digital Object Identifier (DOI) for each dataset, standardized data citation, choosing a data usage license agreement (Creative Commons or other. RODBUK allows to collect and share open research data from various disciplines and in all file formats. RODBUK applies the FAIR Principles, which means the data is findable, accessible, interoperable, reusable.
ILC-CNR for CLARIN-IT repository is a library for linguistic data and tools. Including: Text Processing and Computational Philology; Natural Language Processing and Knowledge Extraction; Resources, Standards and Infrastructures; Computational Models of Language Usage. The studies carried out within each area are highly interdisciplinary and involve different professional skills and expertises that extend across the disciplines of Linguistics, Computational Linguistics, Computer Science and Bio-Engineering.
Country
The Research Data Gouv platform is the French national federated platform for open and shared research data serving the national scientific community. This platform was an integral part of the Second National Plan for Open Science (PNSO) and offers a multidisciplinary data repository, a registry which reports data hosted in other repositories and a web portal. The multidisciplinary repository is a sovereign publishing solution for sharing and opening up data for communities which are yet to set up their own recognised thematic repository.
Country
The Research Data Centre (FDZ-RV) was set-up in 2004 as an integral part of the German Federal Pension Insurance (Deutsche Rentenversicherung). Since then, the Research Data Centre produced several cross-sectional and longitudinal datasets, also called Scientific Use Files (SUF), available to researchers interested in issues of retirement, disability and rehabilitation. The datasets are released on an annual basis. The Scientific Use Files are subsamples drawn from the pool of individuals who are insured in the Federal Pension Insurance. The information provided in the original datasets is necessary to administer the beneficiaries of the pension insurance.
Country
PsychArchives is a disciplinary repository for psychological science and neighboring disciplines. Accommodating 20 different digital research object (DRO) types, including articles, preprints, research data, code, supplements, preregistrations, tests and multimedia objects, PsychArchives provides a digital space that integrates all research-related content relevant to psychology. PsychArchives is committed to the FAIR principles, facilitating the findability, accessibility, interoperability and reusability of research and research data.
The Roper Center for Public Opinion Research is one of the world's leading archives of social science data, specializing in data from surveys of public opinion. The data held by the Roper Center range from the 1930s, when survey research was in its infancy, to the present. Most of the data are from the United States, but over 100 nations are represented.