Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 29 result(s)
The Scholarly Database (SDB) at Indiana University aims to serve researchers and practitioners interested in the analysis, modeling, and visualization of large-scale scholarly datasets. The online interface provides access to six datasets: MEDLINE papers, registered Clinical Trials, U.S. Patent and Trademark Office patents (USPTO), National Science Foundation (NSF) funding, National Institutes of Health (NIH) funding, and National Endowment for the Humanities funding – over 26 million records in total.
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer). Software for searching the transcription files is currently being written.
LEPR is a database of results of published experimental studies involving liquid-solid phase equilibria relevant to natural magmatic systems. TraceDs is a database of experimental studies involving trace element distribution between liquid, solid and fluid phases.
The data in the U of M’s Clinical Data Repository comes from the electronic health records (EHRs) of more than 2 million patients seen at 8 hospitals and more than 40 clinics. For each patient, data is available regarding the patient's demographics (age, gender, language, etc.), medical history, problem list, allergies, immunizations, outpatient vitals, diagnoses, procedures, medications, lab tests, visit locations, providers, provider specialties, and more.
METLIN represents the largest MS/MS collection of data with the database generated at multiple collision energies and in positive and negative ionization modes. The data is generated on multiple instrument types including SCIEX, Agilent, Bruker and Waters QTOF mass spectrometers.
The National Science Foundation (NSF) Ultraviolet (UV) Monitoring Network provides data on ozone depletion and the associated effects on terrestrial and marine systems. Data are collected from 7 sites in Antarctica, Argentina, United States, and Greenland. The network is providing data to researchers studying the effects of ozone depletion on terrestrial and marine biological systems. Network data is also used for the validation of satellite observations and for the verification of models describing the transfer of radiation through the atmosphere.
Fox DEN provides investigators with a tool to explore, download and apply statistical models on aggregated data collected for the Fox Insight online clinical study. The Fox Insight study collects patient-reported outcomes and genetic data from people with Parkinson's disease and their loved ones.
Project Achilles is a systematic effort aimed at identifying and cataloging genetic vulnerabilities across hundreds of genomically characterized cancer cell lines. The project uses genome-wide genetic perturbation reagents (shRNAs or Cas9/sgRNAs) to silence or knock-out individual genes and identify those genes that affect cell survival. Large-scale functional screening of cancer cell lines provides a complementary approach to those studies that aim to characterize the molecular alterations (e.g. mutations, copy number alterations) of primary tumors, such as The Cancer Genome Atlas (TCGA). The overall goal of the project is to identify cancer genetic dependencies and link them to molecular characteristics in order to prioritize targets for therapeutic development and identify the patient population that might benefit from such targets. Project Achilles data is hosted on the Cancer Dependency Map Portal (DepMap) where it has been harmonized with our genomics and cellular models data. You can access the latest and all past datasets here: https://depmap.org/portal/download/all/
The Comparative Welfare Entitlements Dataset (CWED) contains information about the structure and generosity of social insurance benefits in 33 countries around the world. The data contained here are an updated and extended version of CWED 1, which has been available since 2004. This web site allows you to download customized portions of the CWED 2 data, browse the Working Paper Series or access documentary material. For recent updates to this dataset, please see https://www.cwep.us/
Content type(s)
The New York Brain Bank (NYBB) at Columbia University was established to collect postmortem human brains to meet the needs of neuroscientists investigating specific psychiatric and neurological disorders.
TerraSAR-X is a German satellite for Earth Observation, which was launched on July 14, 2007. The mission duration was foreseen to be 5 years. TerraSAR-X carries an innovative high resolution x-band sensor for imaging with resolution up to 1 m. TerraSAR-X carries as secondary payload an IGOR GPS receiver with GPS RO capability. GFZ provided the IGOR and is responsible for the related TOR experiment (Tracking, Occultation and Ranging). TerraSAR-X provides continuously atmospheric GPS data in near-real time. These data from GFZ are continuously assimilated in parallel with those from GRACE-A by the world-leading weather centers to improve their global forecasts. TerraSAR-X, together with TanDEM-X also forms a twin-satellite constellation for atmosphere sounding and generates an unique data set for the evaluation of the accuracy of the GPS-RO technique.
Junar provides a cloud-based open data platform that enables innovative organizations worldwide to quickly, easily and affordably make their data accessible to all. In just a few weeks, your initial datasets can be published, providing greater transparency, encouraging collaboration and citizen engagement, and freeing up precious staff resources.
In 2003, the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) at NIH established Data, Biosample, and Genetic Repositories to increase the impact of current and previously funded NIDDK studies by making their data and biospecimens available to the broader scientific community. These Repositories enable scientists not involved in the original study to test new hypotheses without any new data or biospecimen collection, and they provide the opportunity to pool data across several studies to increase the power of statistical analyses. In addition, most NIDDK-funded studies are collecting genetic biospecimens and carrying out high-throughput genotyping making it possible for other scientists to use Repository resources to match genotypes to phenotypes and to perform informative genetic analyses.
The main goal of the ECCAD project is to provide scientific and policy users with datasets of surface emissions of atmospheric compounds, and ancillary data, i.e. data required to estimate or quantify surface emissions. The supply of ancillary data - such as maps of population density, maps of fires spots, burnt areas, land cover - could help improve and encourage the development of new emissions datasets. ECCAD offers: Access to global and regional emission inventories and ancillary data, in a standardized format Quick visualization of emission and ancillary data Rationalization of the use of input data in algorithms or emission models Analysis and comparison of emissions datasets and ancillary data Tools for the evaluation of emissions and ancillary data ECCAD is a dynamical and interactive database, providing the most up to date datasets including data used within ongoing projects. Users are welcome to add their own datasets, or have their regional masks included in order to use ECCAD tools.
A consolidated feed from 35 million instruments provides sophisticated normalized data, streamlining analysis and decisions from front office to operations. And with flexible delivery options including cloud and API, timely accurate data enables the enterprise to capture opportunities, evaluate risk and ensure compliance in fast-moving markets.
INDI was formed as a next generation FCP effort. INDI aims to provide a model for the broader imaging community while simultaneously creating a public dataset capable of dwarfing those that most groups could obtain individually.
The Cancer Cell Line Encyclopedia project is a collaboration between the Broad Institute, and the Novartis Institutes for Biomedical Research and its Genomics Institute of the Novartis Research Foundation to conduct a detailed genetic and pharmacologic characterization of a large panel of human cancer models, to develop integrated computational analyses that link distinct pharmacologic vulnerabilities to genomic patterns and to translate cell line integrative genomics into cancer patient stratification. The CCLE provides public access to genomic data, analysis and visualization for about 1000 cell lines.
!!! >>> merged with https://www.re3data.org/repository/r3d100012653 <<< !!! RDoCdb is an informatics platform for the sharing of human subjects data generated by investigators as part of the NIMH's Research Domain Criteria initiative, and to support this initiative's aims. It also accepts and shares appropriate data related to mental health from other sources.
The USDA Agricultural Marketing Service (AMS) Cotton Program maintains a National Database (NDB) in Memphis, Tennessee for owner access to cotton classification data. The NDB is computerized telecommunications system which allows owners or authorized agents of owners to retrieve classing data from the current crop and/or the previous four crops. The NDB stores classing information from all 10 regional classing offices.
The World Religion Database (WRD) contains detailed statistics on religious affiliation for every country of the world. It provides source material, including censuses and surveys, as well as best estimates for every religion to offer a definitive picture of international religious demography. It offers best estimates at multiple dates for each of the world’s religions for the period 1900 to 2050.