Filter

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 40 result(s)
Harmonized, indexed, searchable large-scale human FG data collection with extensive metadata. Provides scalable, unified way to easily access massive functional genomics (FG) and annotation data collections curated from large-scale genomic studies. Direct integration (API) with custom / high-throughput genetic and genomic analysis workflows.
<<<!!!<<< As stated 2017-08-28 NEEShub is no longer available. The NEES published projects from the Project Warehouse can be found in the DesignSafe Data Depot https://www.designsafe-ci.org/data/browser/public/nees.public/. The NEES Databases https://datacenterhub.org/resources/395 are being transitioned to DataHub https://datacenterhub.org/ . Please visit DesignSafe https://www.designsafe-ci.org/ for all other inquiries. >>>!!!<<< NEES network features 14 geographically-distributed, shared-use laboratories that support several types of experimental work: geotechnical centrifuge research, shake table tests, large-scale structural testing, tsunami wave basin experiments, and field site research >>>!!!>>>
coastDat is a model based data bank developed mainly for the assessment of long-term changes in data sparse regions. A sequence of numerical models is employed to reconstruct all aspects of marine climate (such as storms, waves, surges etc.) over many decades of years relying only on large-scale information such as large-scale atmospheric conditions or bathymetry.
The Scholarly Database (SDB) at Indiana University aims to serve researchers and practitioners interested in the analysis, modeling, and visualization of large-scale scholarly datasets. The online interface provides access to six datasets: MEDLINE papers, registered Clinical Trials, U.S. Patent and Trademark Office patents (USPTO), National Science Foundation (NSF) funding, National Institutes of Health (NIH) funding, and National Endowment for the Humanities funding – over 26 million records in total.
Country
The objective of the PDRC is to construct and collect a well-designed and large-scale panel data set and provide rigorous empirical studies based on these data sets. The data will enable us (i) to provide international comparisons and fact-findings on the household income changes, social mobility, changes in employment status and the investment activities; (ii) to verify the hypotheses related to the dynamics of economic behavior derived from economic theory; and (iii) to evaluate important policy changes in the tax system and social security program, which might have lagged effects.
The Department of Energy Systems Biology Knowledgebase (KBase) is a software and data platform designed to meet the grand challenge of systems biology: predicting and designing biological function. KBase integrates data and tools in a unified graphical interface so users do not need to access them from numerous sources or learn multiple systems in order to create and run sophisticated systems biology workflows. Users can perform large-scale analyses and combine multiple lines of evidence to model plant and microbial physiology and community dynamics. KBase is the first large-scale bioinformatics system that enables users to upload their own data, analyze it (along with collaborator and public data), build increasingly realistic models, and share and publish their workflows and conclusions. KBase aims to provide a knowledgebase: an integrated environment where knowledge and insights are created and multiplied.
The Medical Expenditure Panel Survey (MEPS) is a set of large-scale surveys of families and individuals, their medical providers, and employers across the United States. MEPS is the most complete source of data on the cost and use of health care and health insurance coverage.
B2SAFE is a robust, safe and highly available service which allows community and departmental repositories to implement data management policies on their research data across multiple administrative domains in a trustworthy manner. A solution to: provide an abstraction layer which virtualizes large-scale data resources, guard against data loss in long-term archiving and preservation, optimize access for users from different regions, bring data closer to powerful computers for compute-intensive analysis
<<<!!!<<< 2020-08-28; the repository is no longer available >>>!!!>>> The South African Data Archive promotes and facilitates the sharing of research data and related documentation of computerised raw quantitative data of large scale regional, national and international research projects mainly in the humanities and social sciences. It makes these datasets available to the research community for further analysis, comparative studies, longitudinal studies, teaching and decision-making purposes.
The Natural Environment Research Council's Data Repository for Atmospheric Science and Earth Observation. The Centre for Environmental Data Analysis (CEDA) serves the environmental science community through three data centres, data analysis environments, and participation in a host of relevant research projects. We aim to support environmental science, further environmental data archival practices, and develop and deploy new technologies to enhance access to data. Additionally we provide services to aid large scale data analysis.
The BioProject database is a searcheable collection of complete and incomplete (in-progress) large-scale molecular projects including genome sequencing and assembly, transcriptome, metagenomic, annotation, expression and mapping projects. BioProject provides a central point to link to all data associated with a project in the NCBI molecular and literature databases.
Country
The Research Data Center (RDC) “International Survey Programs“ provides researchers with data, services, and consultation on a number of important international study series which are under intensive curation by GESIS. They all cover numerous countries and, quite often, substantial time spans. The RDC provides optimal data preparation and access to a wide scope of data and topics for comparative analysis.
Fossilworks is a web-based portal to the Paleobiology Database. Fossilworks is the original public interface to the PaleoDB and is housed at Macquarie. It is a non-governmental, non-profit public resource. Its purpose is to provide global, collection-based occurrence and taxonomic data for marine and terrestrial animals and plants of any geological age, as well as web-based software for statistical analysis of the data. The project's wider, long-term goal is to encourage collaborative efforts to answer large-scale paleobiological questions by developing a useful database infrastructure and bringing together large data sets.
Country
SAFER-Data is a web-based interface to the Environmental Data Archive maintained by the Environmental Research Centre (ERC) in the Environmental Protection Agency (EPA) of Ireland, who has responsibilities for a wide range of licensing, enforcement, monitoring and assessment activities associated with environmental protection.
The Paleobiology Database (PaleoBioDB) is a non-governmental, non-profit public resource for paleontological data. It has been organized and operated by a multi-disciplinary, multi-institutional, international group of paleobiological researchers. Its purpose is to provide global, collection-based occurrence and taxonomic data for organisms of all geological ages, as well data services to allow easy access to data for independent development of analytical tools, visualization software, and applications of all types. The Database’s broader goal is to encourage and enable data-driven collaborative efforts that address large-scale paleobiological questions.
The EUDAT project aims to contribute to the production of a Collaborative Data Infrastructure (CDI). The project´s target is to provide a pan-European solution to the challenge of data proliferation in Europe's scientific and research communities. The EUDAT vision is to support a Collaborative Data Infrastructure which will allow researchers to share data within and between communities and enable them to carry out their research effectively. EUDAT aims to provide a solution that will be affordable, trustworthy, robust, persistent and easy to use. EUDAT comprises 26 European partners, including data centres, technology providers, research communities and funding agencies from 13 countries. B2FIND is the EUDAT metadata service allowing users to discover what kind of data is stored through the B2SAFE and B2SHARE services which collect a large number of datasets from various disciplines. EUDAT will also harvest metadata from communities that have stable metadata providers to create a comprehensive joint catalogue to help researchers find interesting data objects and collections.
CORD is Cranfield University's research data repository, for secure preservation of institutional research data outputs. Cranfield is an exclusively postgraduate university that is a global leader for transformational research in technology and management. We are focused on the specialist themes of aerospace, defence and security, energy and power, environment and agrifood, manufacturing, transport systems, and water. The Cranfield School of Management is world leader in management education and research.
SAHFOS is an internationally funded independent research non-profit organisation responsible for the operation of the Continuous Plankton Recorder (CPR) Survey. As a large-scale global survey, it provides the scientific and policy communities with a basin-wide and long-term measure of the ecological health of marine plankton. Established in 1931, the CPR Survey is the longest running, most geographically extensive marine ecological survey in the world. It has a considerable database of marine plankton and associated metadata that is used by researchers and policy makers to examine strategically important science pillars such as climate change, human health, fisheries, biodiversity, pathogens, invasive species, ocean acidification and natural capital. The Continuous Plankton Recorder (CPR) Survey has merged with the Marine Biological Association. Today the Survey is operated by the Marine Biological Association, based in Plymouth, UK.
Country
Arquivo.pt is a research infrastructure that preserves millions of files collected from the web since 1996 and provides a public search service over this information. It contains information in several languages. Periodically it collects and stores information published on the web. Then, it processes the collect data to make it searchable, providing a “Google-like” service that enables searching the past web (English user interface available at https://arquivo.pt/?l=en). This preservation workflow is performed through a large-scale distributed information system and can also accessed through API (https://arquivo.pt/api).
The Project on Human Development in Chicago Neighborhoods (PHDCN) is a large-scale, interdisciplinary study of how families, schools, and neighborhoods affect child and adolescent development. It was designed to advance the understanding of the developmental pathways of both positive and negative human social behaviors. In particular, the project examined the causes and pathways of juvenile delinquency, adult crime, substance abuse, and violence. At the same time, the project also provided a detailed look at the environments in which these social behaviors take place by collecting substantial amounts of data about urban Chicago, including its people, institutions, and resources. Nearly all PHDCN data require an individual application with supporting materials to obtain the data. Applications are handled by the the National Archive of Criminal Justice Data (NACJD). Further instructions will appear on the study home page (linked from search results), where relevant.
The Centre for Environmental Data Analysis (CEDA) serves the environmental science community through managing data centres, data analysis environments, and participation in a host of relevant research projects. We aim to support environmental science, further environmental data archival practices, and develop and deploy new technologies to enhance access to data. Additionally we provide services to aid large scale data analysis. The CEDA Archive operates the atmospheric and earth observation data centre functions on behalf of NERC for the UK atmospheric science and earth observation communities. It covers climate, composition, observations and NWP data as well as various earth observation datasets, including airborne and satellite data and imagery. Prior to November 2016 these functions were operted by CEDA under the titles of the British Atmospheric Data Centre (BADC) and the NERC Earth Observation Data Centre (NEODC). CEDA also operates the UK Solar System Data Centre (UKSSDC), which curates and provides access to archives of data from the upper atmosphere, ionosphere and Earth's solar environment.
Project Achilles is a systematic effort aimed at identifying and cataloging genetic vulnerabilities across hundreds of genomically characterized cancer cell lines. The project uses genome-wide genetic perturbation reagents (shRNAs or Cas9/sgRNAs) to silence or knock-out individual genes and identify those genes that affect cell survival. Large-scale functional screening of cancer cell lines provides a complementary approach to those studies that aim to characterize the molecular alterations (e.g. mutations, copy number alterations) of primary tumors, such as The Cancer Genome Atlas (TCGA). The overall goal of the project is to identify cancer genetic dependencies and link them to molecular characteristics in order to prioritize targets for therapeutic development and identify the patient population that might benefit from such targets. Project Achilles data is hosted on the Cancer Dependency Map Portal (DepMap) where it has been harmonized with our genomics and cellular models data. You can access the latest and all past datasets here: https://depmap.org/portal/download/all/
FORS is the Swiss Centre of Expertise in the Social Sciences. FORS maintains a national digital archive for social science research data, implements large-scale national and international surveys, offers data and research information services to researchers and academic institutions, and conducts methodological and thematic research. FORS Data Service is FORS’ resource centre for research and teaching in the social sciences. It provides data management support and training, and it archives, disseminates and promotes quantitative and qualitative data. The Data Service maintains a comprehensive and up-to-date inventory of social science research projects in Switzerland, and makes available a wide range of datasets for secondary analysis. Databases at the FORS Data Service are: SWISSUbase and DeVisu (for variable level metadata for important surveys).