Reset all


Content Types


AID systems


Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
  • 1 (current)
Found 21 result(s)
The National Cancer Data Base (NCDB), a joint program of the Commission on Cancer (CoC) of the American College of Surgeons (ACoS) and the American Cancer Society (ACS), is a nationwide oncology outcomes database for more than 1,500 Commission-accredited cancer programs in the United States and Puerto Rico. Some 70 percent of all newly diagnosed cases of cancer in the United States are captured at the institutional level and reported to the NCDB. The NCDB, begun in 1989, now contains approximately 29 million records from hospital cancer registries across the United States. Data on all types of cancer are tracked and analyzed. These data are used to explore trends in cancer care, to create regional and state benchmarks for participating hospitals, and to serve as the basis for quality improvement.
The PAIN Repository is a recently funded NIH initiative, which has two components: an archive for already collected imaging data (Archived Repository), and a repository for structural and functional brain images and metadata acquired prospectively using standardized acquisition parameters (Standardized Repository) in healthy control subjects and patients with different types of chronic pain. The PAIN Repository provides the infrastructure for storage of standardized resting state functional, diffusion tensor imaging and structural brain imaging data and associated biological, physiological and behavioral metadata from multiple scanning sites, and provides tools to facilitate analysis of the resulting comprehensive data sets.
The project brings together national key players providing environmentally related biological data and services to develop the ‘German Federation for Biological Data' (GFBio). The overall goal is to provide a sustainable, service oriented, national data infrastructure facilitating data sharing and stimulating data intensive science in the fields of biological and environmental research.
The objective of this Research Coordination Network project is to develop an international network of researchers who use genetic methodologies to study the ecology and evolution of marine organisms in the Indo-Pacific to share data, ideas and methods. DIPnet was created to advance genetic diversity research in the Indo-Pacific by aggregating population genetic metadata into a searchable database (GeOME).
Strong-motion data of engineering and scientific importance from the United States and other seismically active countries are served through the Center for Engineering Strong Motion Data(CESMD). The CESMD now automatically posts strong-motion data from an increasing number of seismic stations in California within a few minutes following an earthquake as an InternetQuick Report(IQR). As appropriate,IQRs are updated by more comprehensive Internet Data Reports that include reviewed versions of the data and maps showing, for example, the finite fault rupture along with the distribution of recording stations. Automated processing of strong-motion data will be extended to post the strong-motion records of the regional seismic networks of the Advanced National Seismic System (ANSS) outside California.
GLOBE (Global Collaboration Engine) is an online collaborative environment that enables land change researchers to share, compare and integrate local and regional studies with global data to assess the global relevance of their work.
The GDR is the submission point for all data collected from researchers funded by the U.S. Department of Energy's Geothermal Technologies Office. It was established to receive, manage and make available all geothermal-relevant data generated from projects funded by the DOE Geothermal Technologies Office. This includes data from GTO-funded projects associated with any portion of the geothermal project life-cycle (exploration, development, operation), as well as data produced by GTO-funded research.
This library is a public and easily accessible resource database of images, videos, and animations of cells, capturing a wide diversity of organisms, cell types, and cellular processes. The Cell Image Library has been merged with "Cell Centered Database" in 2017. The purpose of the database is to advance research on cellular activity, with the ultimate goal of improving human health.
CORUM is a manually curated dataset of mammalian protein complexes. Annotation of protein complexes includes protein complex composition and other valuable information such as method of purification, cellular function of complexes or involvement in diseases.
This guide aims to provide a starting point to locating Geographic Information System (GIS) information both through the University of Sydney library catalogue and on the World Wide Web.
The Andrews Forest is a place of inquiry. Our mission is to support research on forests, streams, and watersheds, and to foster strong collaboration among ecosystem science, education, natural resource management, and the humanities. Our place and our work are administered cooperatively by the USDA Forest Service's Pacific Northwest Research Station, Oregon State University, and the Willamette National Forest. First established in 1948 as an US Forest Service Experimental Forest, the H.J. Andrews is a 16,000-acre ecological research site in Oregon's beautiful western Cascades Mountains. The landscape is home to iconic Pacific Northwest old-growth forests of Cedar and Hemlock, and moss-draped ancient Douglas Firs; steep terrain; and fast, cold-running streams. In 1980 the Andrews became a charter member of the National Science Foundation's Long-Term Ecological Research (LTER) Program.
The PhenoGen website shares experimental data with a worldwide community of investigators and provides a flexible, integrated, multi-resolution repository of neuroscience transcriptomic genetic data for collaborative research on genomic disorders.
Mulce (MUltimodal contextualized Learner Corpus Exchange) is a research project supported by the National Research Agency (ANR programme: "Corpus and Tools in the Humanities", ANR-06-CORP-006). A teaching corpus (LETEC - Learning and Teaching Corpora) combines a systematic and structured data set, particularly of interactional data, and traces left by a training course experimentation, conducted partially or completely online and completed by additional technical, human, pedagogical and scientific information to enable the data to be analysed in context.
More than a quarter of a million people — one in 10 NSW men and women aged over 45 — have been recruited to our 45 and Up Study, the largest ongoing study of healthy ageing in the Southern Hemisphere. The baseline information collected from all of our participants is available in the Study’s Data Book. This information, which researchers use as the basis for their analyses, contains information on key variables such as height, weight, smoking status, family history of disease and levels of physical activity. By following such a large group of people over the long term, we are developing a world-class research resource that can be used to boost our understanding of how Australians are ageing. This will answer important health and quality-of-life questions and help manage and prevent illness through improved knowledge of conditions such as cancer, heart disease, depression, obesity and diabetes.
The Allele Frequency Net Database (AFND) is a public database which contains frequency information of several immune genes such as Human Leukocyte Antigens (HLA), Killer-cell Immunoglobulin-like Receptors (KIR), Major histocompatibility complex class I chain-related (MIC) genes, and a number of cytokine gene polymorphisms. The Allele Frequency Net Database (AFND) provides a central source, freely available to all, for the storage of allele frequencies from different polymorphic areas in the Human Genome. Users can contribute the results of their work into one common database and can perform database searches on information already available. We have currently collected data in allele, haplotype and genotype format. However, the success of this website will depend on you to contribute your data.
FlowRepository is a web-based application accessible from a web browser that serves as an online database of flow cytometry experiments where users can query and download data collected and annotated according to the MIFlowCyt standard. It is primarily used as a data deposition place for experimental findings published in peer-reviewed journals in the flow cytometry field. FlowRepository is funded by the International Society for Advancement of Cytometry (ISAC) and powered by the Cytobank engine specifically extended for the purposes of this repository. FlowRepository has been developed by forking and extending Cytobank in 2011.
OpenKIM is an online suite of open source tools for molecular simulation of materials. These tools help to make molecular simulation more accessible and more reliable. Within OpenKIM, you will find an online resource for standardized testing and long-term warehousing of interatomic models and data, and an application programming interface (API) standard for coupling atomistic simulation codes and interatomic potential subroutines.
ORTOLANG is an EQUIPEX project accepted in February 2012 in the framework of investissements d’avenir. Its aim is to construct a network infrastructure including a repository of language data (corpora, lexicons, dictionaries etc.) and readily available, well-documented tools for its processing. Expected outcomes comprize: promoting research on analysis, modelling and automatic processing of our language to their highest international levels thanks to effective resource pooling; facilitating the use and transfer of resources and tools set up within public laboratories to industrial partners, notably SMEs which often cannot develop such resources and tools for language processing given the cost of investment; promoting French language and the regional languages of France by sharing expertise acquired by public laboratories. ORTOLANG is a service for the language, which is complementary to the service offered by Huma-Num (très grande infrastructure de recherche). Ortolang gives access to SLDR for speech, and CNRTL for text resources.
EOL’s platforms and instruments collect large and often unique data sets that must be validated, archived and made available to the research community. The goal of EOL data services is to advance science through delivering high-quality project data and metadata in ways that are as transparent, secure, and easily accessible as possible - today and into the future. By adhering to accepted standards in data formats and data services, EOL provides infrastructure to facilitate discovery and direct access to data and software from state-of-the-art commercial and locally-developed applications. EOL’s data services are committed to the highest standard of data stewardship from collection to validation to archival.
The MARGINS Data Portal was established in fall 2003 in response to a program call for a dedicated data system to facilitate open and timely exchange of data in support of the interdisciplinary science goals of the program. The Data Portal has been built with the primary goal of providing full cataloging, open access, and long-term preservation of data collected during MARGINS/GeoPRISMS programs. The backbone of the system is an expedition metadata catalog, which provides information on field programs (who, what, when and where), inventories of sensor data and samples, relevant metadata and the links to associated data files which reside either within the Data Portal or at distributed repositories. The system is designed to leverage all relevant existing data resources and provides a framework for a broader distributed data system.