Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 41 result(s)
The Scholarly Database (SDB) at Indiana University aims to serve researchers and practitioners interested in the analysis, modeling, and visualization of large-scale scholarly datasets. The online interface provides access to six datasets: MEDLINE papers, registered Clinical Trials, U.S. Patent and Trademark Office patents (USPTO), National Science Foundation (NSF) funding, National Institutes of Health (NIH) funding, and National Endowment for the Humanities funding – over 26 million records in total.
The National Archives and Records Administration (NARA) is the nation's record keeper. Of all documents and materials created in the course of business conducted by the United States Federal government, only 1%-3% are so important for legal or historical reasons that they are kept by us forever. Those valuable records are preserved and are available to you, whether you want to see if they contain clues about your family’s history, need to prove a veteran’s military service, or are researching an historical topic that interests you.
Country
The KiezDeutsch-Korpus (KiDKo) has been developed by project B6 (PI: Heike Wiese) of the collaborative research centre Information Structure (SFB 632) at the University of Potsdam from 2008 to 2015. KiDKo is a multi-modal digital corpus of spontaneous discourse data from informal, oral peer group situations in multi- and monoethnic speech communities. KiDKo contains audio data from self-recordings, with aligned transcriptions (i.e., at every point in a transcript, one can access the corresponding area in the audio file). The corpus provides parts-of-speech tags as well as an orthographically normalised layer (Rehbein & Schalowski 2013). Another annotation level provides information on syntactic chunks and topological fields. There are several complementary corpora: KiDKo/E (Einstellungen - "attitudes") captures spontaneous data from the public discussion on Kiezdeutsch: it assembles emails and readers' comments posted in reaction to media reports on Kiezdeutsch. By doing so, KiDKo/E provides data on language attitudes, language perceptions, and language ideologies, which became apparent in the context of the debate on Kiezdeutsch, but which frequently related to such broader domains as multilingualism, standard language, language prestige, and social class. KiDKo/LL ("Linguistic Landscape") assembles photos of written language productions in public space from the context of Kiezdeutsch, for instance love notes on walls, park benches, and playgrounds, graffiti in house entrances, and scribbled messages on toilet walls. Contains materials in following languages: Spanish, Italian, Greek, Kurdish, Swedish, French, Croatian, Arabic, Turkish. The corpus is available online via the Hamburger Zentrum für Sprachkorpora (HZSK) https://corpora.uni-hamburg.de/secure/annis-switch.php?instance=kidko .
The Health Data Research Innovation Gateway (the ‘Gateway’) provides a common entry point to discover and enquire about access to UK health datasets for research and innovation. It provides detailed information about the datasets, which are held by members of the UK Health Data Research Alliance, such as a description, size of the population, and the legal basis for access. The Gateway includes the ability to search for research projects, publications and health data tools, such as those related to COVID-19. New interactive features provide a community forum for researchers to collaborate and connect and the ability to add research projects. The Innovation Gateway does not hold or store any datasets or patient or health data but rather acts as a portal to allow discovery of datasets and to request access to them for health research. A dataset is a collection of related individual pieces of data but in the case of health data, identifiable information (e.g. name or NHS number) is removed and data is de-identified where possible. When you access the Gateway you will not be able to view or extract the data itself. Instead, you will be able to see information that describes what the different datasets are (e.g. where the dataset has come from, a description of the dataset, the time period and the geographical areas the dataset covers).
Country
Based on the needs of national scientific and technological innovation for laboratory animal resources, we use various methods such as foreign introduction, domestic collection, independent research and development, and protocol conservation to collect, integrate, and optimize laboratory animal resources. The resource library now preserves more than 200 varieties and strains in four categories, including mice, rats, guinea pigs, and rabbits, including routine laboratory animals, genetically modified animal models, and animal models for disease. The predecessor of the resource bank was the National Rodent Laboratory Animal Seed Center (Guoke Cai Zi [1998] No. 010), established in 1998 and based on the Laboratory Animal Resources Research Institute of the Chinese National Academy of Food and Drug Administration.
Country
The Alcohol and Gaming Commission of Ontario’s Data Inventory lists all of the agency’s data sets and identifies whether a data set is currently open, in the process of being opened or exempt from being released as open data due to legal, security, privacy, confidentiality or commercially-sensitive reasons.
Country
The center's main task is to introduce, collect and preserve dog laboratory animal varieties, strains, develop and maintain new technologies, cultivate new varieties and strains, and provide standard experimental seeds. In 2019, the Ministry of Science and Technology and the Ministry of Finance, for the purpose of improving the scientific and technological resources sharing service system, promoted the opening and sharing of scientific and technological resources to society, and carried out the optimization and adjustment of the national platform. The National Canine Laboratory Animal Seed Center was successfully approved as the only "National Canine Laboratory Animal Resource Center".
The Global Terrorism Database (GTD) is an open-source database including information on terrorist events around the world from 1970 through 2020 (with annual updates planned for the future). Unlike many other event databases, the GTD includes systematic data on domestic as well as international terrorist incidents that have occurred during this time period and now includes more than 200,000 cases.
Copernicus is a European system for monitoring the Earth. Copernicus consists of a complex set of systems which collect data from multiple sources: earth observation satellites and in situ sensors such as ground stations, airborne and sea-borne sensors. It processes these data and provides users with reliable and up-to-date information through a set of services related to environmental and security issues. The services address six thematic areas: land monitoring, marine monitoring, atmosphere monitoring, climate change, emergency management and security. The main users of Copernicus services are policymakers and public authorities who need the information to develop environmental legislation and policies or to take critical decisions in the event of an emergency, such as a natural disaster or a humanitarian crisis. Based on the Copernicus services and on the data collected through the Sentinels and the contributing missions , many value-added services can be tailored to specific public or commercial needs, resulting in new business opportunities. In fact, several economic studies have already demonstrated a huge potential for job creation, innovation and growth.
<<<!!!<<< The repository is no longer available. >>>!!!>>> Selected TOXMAP data can be accesse from the following sites: U.S. EPA Toxics Release Program (TRI) (https://www.epa.gov/toxics-release-inventory-tri-program) U.S. EPA Superfund Program (https://www.epa.gov/superfund) U.S. EPA Facilities Registry System (FRS) (https://www.epa.gov/frs) U.S. EPA Clean Air Markets Program (https://www.epa.gov/airmarkets) U.S. EPA Geospatial Applications (https://www.epa.gov/geospatial/epa-geospatial-applications) U.S. NIH NCI Surveillance, Epidemiology, and End Results Program (SEER) (https://seer.cancer.gov/) Government of Canada National Pollutant Release Inventory (NPRI) (https://www.canada.ca/en/services/environment/pollution-waste-management/national-pollutant-release-inventory.html) U.S. Census Bureau (https://www.census.gov/) U.S. Nuclear Regulatory Commission (NRC) (https://www.nrc.gov/) >>>!!!>>>
Country
The BC Oil and Gas Commission (Commission) is an independent, single-window regulatory agency with responsibilities for overseeing oil and gas operations in British Columbia, including exploration, development, pipeline transportation and reclamation. Spatial and non-spatial data is collected from various sources to support oil and gas operations in the province and is used widely within the Commission. As part of its commitment to improving citizen access and involvement, enhancing transparency and understanding, the Commission is pleased to provide interactive public access to this data. Users are encouraged to explore the site and select and download the datasets that are of interest to them.
Country
The CDPP is the French national data centre for natural plasmas of the solar system. The CDPP assures the long term preservation of data obtained primarily from instruments built using French resources, and renders them readily accessible and exploitable by the international community. The CDPP also provides services to enable on-line data analysis (AMDA), 3D data visualization in context (3DView), and a propagation tool which bridges solar perturbations to in-situ measurements. The CDPP is involved in the development of interoperability, participates in several Virtual Observatory projects, and supports data distribution for scientific missions (Solar Orbiter, JUICE).
Country
The SHIP study´s main aims include the investigation of health in all its aspects and complexity involving the collection and assessment of data relevant to the prevalence and incidence of common, population-relevant diseases and their risk factors.
>>>>!!!!<<<< The Cancer Genomics Hub mission is now completed. The Cancer Genomics Hub was established in August 2011 to provide a repository to The Cancer Genome Atlas, the childhood cancer initiative Therapeutically Applicable Research to Generate Effective Treatments and the Cancer Genome Characterization Initiative. CGHub rapidly grew to be the largest database of cancer genomes in the world, storing more than 2.5 petabytes of data and serving downloads of nearly 3 petabytes per month. As the central repository for the foundational genome files, CGHub streamlined team science efforts as data became as easy to obtain as downloading from a hard drive. The convenient access to Big Data, and the collaborations that CGHub made possible, are now essential to cancer research. That work continues at the NCI's Genomic Data Commons. All files previously stored at CGHub can be found there. The Website for the Genomic Data Commons is here: https://gdc.nci.nih.gov/ >>>>!!!!<<<< The Cancer Genomics Hub (CGHub) is a secure repository for storing, cataloging, and accessing cancer genome sequences, alignments, and mutation information from the Cancer Genome Atlas (TCGA) consortium and related projects. Access to CGHub Data: All researchers using CGHub must meet the access and use criteria established by the National Institutes of Health (NIH) to ensure the privacy, security, and integrity of participant data. CGHub also hosts some publicly available data, in particular data from the Cancer Cell Line Encyclopedia. All metadata is publicly available and the catalog of metadata and associated BAMs can be explored using the CGHub Data Browser.
Country
The ICES Data Repository consists of record-level, coded and linkable health data sets. It encompasses much of the publicly funded administrative health services records for the Ontario population eligible for universal health coverage since 1986 and is capable of integrating research-specific data, registries and surveys. Currently, the repository includes health service records for as many as 13 million people. Files in the ICES Data Repository are described in the Data Dictionary. This includes ICES General Use Data, as well as ICES Controlled Use Data. Datasets obtained by ICES for specific project(s) (project-specific data) are not described in the Data Dictionary. The ICES Data Dictionary is an essential resource for anyone doing research at ICES. The information in this Data Dictionary is almost entirely based on the metadata belonging to the datasets described.
Country
The National Microbial Resource Center (NMRC) is an important part of the national science and technology resources sharing service platform, responsible for the research, conservation, management and sharing of national microbial strain resources, ensuring the strategic security and sustainable use of microbial strain resources, and providing support for scientific and technological innovation, industrial development and social progress. The main tasks of the NMRC are: to collect, organize and preserve microbial strain resources around national needs and scientific research; to undertake the task of remitting, organizing and preserving strain resources resulting from the implementation of science and technology projects; to be responsible for the development and improvement of microbial strain resource standards, and to standardize and guide the development of microbial strain resources in various fields. The company is responsible for the development and improvement of microbial strain resource standards, standardizing and guiding the protection and utilization of microbial strain resources in various fields; building and maintaining the national strain resource online service system, and carrying out social sharing of physical and information resources of strains; developing key common technologies, creating new resources, and carrying out customized services according to innovative needs; carrying out scientific popularization for the society; carrying out international exchange and cooperation on strain resources, participating in relevant international academic organizations, and safeguarding national interests and Security
CPES provides access to information that relates to mental disorders among the general population. Its primary goal is to collect data about the prevalence of mental disorders and their treatments in adult populations in the United States. It also allows for research related to cultural and ethnic influences on mental health. CPES combines the data collected in three different nationally representative surveys (National Comorbidity Survey Replication, National Survey of American Life, National Latino and Asian American Study).
The IMLS conducts annual surveys of public and state libraries in the US that have response rates near 100%. Data is compiled for states, library systems, and individual library branches and includes statistics for circulation, visits, staff, expenditures, and more. Data is available in two formats: MS Access and flat file, plain text. Data for museums is now included.
The National Practitioner Data Bank (NPDB), or "the Data Bank," is a confidential information clearinghouse created by Congress with the primary goals of improving health care quality, protecting the public, and reducing health care fraud and abuse in the U.S.
Earthdata powered by EOSDIS (Earth Observing System Data and Information System) is a key core capability in NASA’s Earth Science Data Systems Program. It provides end-to-end capabilities for managing NASA’s Earth science data from various sources – satellites, aircraft, field measurements, and various other programs. EOSDIS uses the metadata and service discovery tool Earthdata Search https://search.earthdata.nasa.gov/search. The capabilities of EOSDIS constituting the EOSDIS Science Operations are managed by NASA's Earth Science Data and Information System (ESDIS) Project. The capabilities include: generation of higher level (Level 1-4) science data products for several satellite missions; archiving and distribution of data products from Earth observation satellite missions, as well as aircraft and field measurement campaigns. The EOSDIS science operations are performed within a distributed system of many interconnected nodes - Science Investigator-led Processing Systems (SIPS), and distributed, discipline-specific, Earth science Distributed Active Archive Centers (DAACs) with specific responsibilities for production, archiving, and distribution of Earth science data products. The DAACs serve a large and diverse user community by providing capabilities to search and access science data products and specialized services.
Funded by the National Science Foundation (NSF) and proudly operated by Battelle, the National Ecological Observatory Network (NEON) program provides open, continental-scale data across the United States that characterize and quantify complex, rapidly changing ecological processes. The Observatory’s comprehensive design supports greater understanding of ecological change and enables forecasting of future ecological conditions. NEON collects and processes data from field sites located across the continental U.S., Puerto Rico, and Hawaii over a 30-year timeframe. NEON provides free and open data that characterize plants, animals, soil, nutrients, freshwater, and the atmosphere. These data may be combined with external datasets or data collected by individual researchers to support the study of continental-scale ecological change.
MEASURE DHS is advancing global understanding of health and population trends in developing countries through nationally-representative household surveys that provide data for a wide range of monitoring and impact evaluation indicators in the areas of population, health, HIV, and nutrition. The database collects, analyzes, and disseminates data from more than 300 surveys in over 90 countries. MEASURE DHS distributes, at no cost, survey data files for legitimate academic research.
Country
CEEHRC represents a multi-stage funding commitment by the Canadian Institutes of Health Research (CIHR) and multiple Canadian and international partners. The overall aim is to position Canada at the forefront of international efforts to translate new discoveries in the field of epigenetics into improved human health. The two sites will focus on sequencing human reference epigenomes and developing new technologies and protocols; they will also serve as platforms for other CEEHRC funding initiatives, such as catalyst and team grants. The complementary reference epigenome mapping efforts of the two sites will focus on a range of common human diseases. The Vancouver group will focus on the role of epigenetics in the development of cancer, including lymphoma and cancers of the ovary, colon, breast, and thyroid. The Montreal team will focus on autoimmune / inflammatory, cardio-metabolic, and neuropsychiatric diseases, using studies of identical twins as well as animal models of human disease.