Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 313 result(s)
Content type(s)
A small genotype data repository containing data used in recent papers from the Estonian Biocentre. Most of the data pertains to human population genetics. PDF files of the papers are also freely available.
The ROAR Isolate Database is a searchable collection of commensal and complimentary pathogen isolate datasets. ROAR allows investigators to identify datasets of interest, submit datasets, or download datasets. ROAR datasets include data depositors' contact information and links to their articles in ROAR Literature Database.
A premier source for United States cancer statistics, SEER gathers information related to incidence, prevalence, and survival from specific geographic areas that represent 28 percent of the population, as well as compiles related reports and reports on the national cancer mortality rates. Their aim is to provide information related to cancer statistics and decrease the burden of cancer in the national population. SEER has been collecting data from cancer cases since 1973.
The CDHA assists researchers to create, document, and distribute public use microdata on health and aging for secondary analysis. Major research themes include: midlife development and aging; economics of population aging; inequalities in health and aging; international comparative studies of health and aging; and the investigation of linkages between social-demographic and biomedical research in population aging. The CDHA is one of fourteen demography centers on aging sponsored by the National Institute on Aging.
Content type(s)
HSDB is a toxicology database that focuses on the toxicology of potentially hazardous chemicals. It provides information on human exposure, industrial hygiene, emergency handling procedures, environmental fate, regulatory requirements, nanomaterials, and related areas. The information in HSDB has been assessed by a Scientific Review Panel.
Content type(s)
!! see caMOD Retirement Announcement !! Query the Cancer Models database for models submitted by fellow researchers. Retrieve information about the making of models, their genetic description, histopathology, derived cell lines, associated images, carcinogenic agents, and therapeutic trials. Links to associated publications and other resources are provided.
NURSA began in 2002 with the objective to accrue, develop and communicate information about the nuclear receptor superfamily. Over the last ten years, NURSA has developed a website that has developed into a comprehensive source of information about nuclear receptors, and their co-regulators, ligands, and downstream targets. Through a series of integrated 'omics-scale and informatic approaches projects, NURSA has fostered a systems biology understanding of nuclear receptor function, physiology and regulation of target gene networks in vivo.
Recode2 is a database of genes that utilize non-standard translation for gene expression purposes. Recoding events described in the database include programmed ribosomal frameshifting, translational bypassing (aka hopping) and mRNA specific codon redefinition. Frameshifting at a particular site often yields two protein products from one coding sequence and sometimes serves a regulatory purpose by acting as a sensor of the level of product protein or of some external ligand. Bypassing (hopping) allows the coupling of two ORFs separated on an mRNA by a coding gap. Codon redefinition occurs when a stop codon is decoded as a standard amino acid (often glutamine or tryptophan), or the 21st amino acid selenocysteine. These recoding events are in competition with standard decoding and are site specific. The efficiency of recoding is often modulated by cis-stimulators and sometimes by trans-factors. The sequences of the genes that use recoding for their expression are in the database. The recoding sites and the known stimulatory signals are annotated in the database together with notes on factors that are known to affect recoding efficiencies.
PhysioNet is an on-line forum for the dissemination and exchange of recorded biomedical signals and open-source software for analyzing them. It provides facilities for the cooperative analysis of data and the evaluation of proposed new algorithms. In addition to providing free electronic access to PhysioBank data and PhysioToolkit software via the World Wide Web. PhysioNet offers services and training via on-line tutorials to assist users with varying levels of expertise. PhysioNet is a resource for biomedical research and development. It has three closely interdependent components: PhysioBank is a large and growing archive of well-characterized digital recordings of physiologic signals, time series, and related data for use by the biomedical research community. PhysioBank currently includes more than 60 collections of cardiopulmonary, neural, and other biomedical signals from healthy subjects and patients with a variety of conditions with major public health implications, including sudden cardiac death, congestive heart failure, epilepsy, gait disorders, sleep apnea, and aging. PhysioToolkit is a large and growing library of software for physiologic signal processing and analysis, detection of physiologically significant events using both classical techniques and novel methods based on statistical physics and nonlinear dynamics, interactive display and characterization of signals, creation of new databases, simulation of physiologic and other signals, quantitative evaluation and comparison of analysis methods, and analysis of nonequilibrium and nonstationary processes. PhysioNetWorks is a virtual laboratory where you can work together with us and with colleagues anywhere in the world to create, evaluate, improve, document, and prepare new data and software "works" for publication on PhysioNet. Unlike all other parts of the PhysioNet web site, access to PhysioNetWorks is password-protected. (Accounts are free and a password can be obtained in a minute or two.)
Born in Bradford is one of the biggest and most important medical research studies undertaken in the UK. The project started in 2007 and is looking to answer questions about our health by tracking the lives of 13,500 babies and their families and will provide information for studies across the UK and around the world. The aim of Born in Bradford is to find out more about the causes of childhood illness by studying children from all cultures and backgrounds as their lives unfold.
Content type(s)
CaPSURE™ is a longitudinal, observational study of approximately 15,000 men with all stages of biopsy-proven prostate cancer. Patients have enrolled at 43 community urology practices, academic medical centers, and VA hospitals throughout the United States since 1995. CEASAR stands for Comparative Effectiveness Analysis of Surgery and Radiation. The ongoing goal of CEASAR is to help learn more about what prostate cancer treatments work best, for which patients, in whose hands. There are currently about 3,600 men with a prostate cancer diagnosis participating in CEASAR. Three rounds of surveys have been completed, with the first carried out in the spring of 2010. We are currently in the process of conducting our fourth survey with the same group of men in our study. This survey, our Three Year Follow-up, will occur throughout the summer of 2014.
The Twenty-07 Study was set up in 1986 in order to investigate the reasons for differences in health by socio-economic circumstances, gender, area of residence, age, ethnic group, and family type. 4510 people are being followed for 20 years. The initial wave of data collection took place in 1987/8, when respondents were aged 15, 35 and 55. The final wave of data collection took place in 2007/08 when respondents were aged 35, 55 and 75. In this way the Twenty-07 Study provides us with unique opportunities to investigate both the changes in people's lives over 20 years and how they affect their health, and the differences in people's experiences at the same ages 20 years apart, and how these have different effects on their health.
Complete Genomics provides free public access to a variety of whole human genome data sets generated from Complete Genomics’ sequencing service. The research community can explore and familiarize themselves with the quality of these data sets, review the data formats provided from our sequencing service, and augment their own research with additional summaries of genomic variation across a panel of diverse individuals. The quality of these data sets is representative of what a customer can expect to receive for their own samples. This public genome repository comprises genome results from both our Standard Sequencing Service (69 standard, non-diseased samples) and the Cancer Sequencing Service (two matched tumor and normal sample pairs). In March 2013 Complete Genomics was acquired by BGI-Shenzhen , the world’s largest genomics services company. BGI is a company headquartered in Shenzhen, China that provides comprehensive sequencing and bioinformatics services for commercial science, medical, agricultural and environmental applications. Complete Genomics is now focused on building a new generation of high-throughput sequencing technology and developing new and exciting research, clinical and consumer applications.
The NCI’s Cancer Genome Anatomy Project (CGAP) is an online resource designed to provide the scientific community with detailed characterization of gene expression in biological tissues. By characterizing normal, pre-cancer and cancer cells, CGAP aims to improve detection, diagnosis and treatment for the patient. Moreover, CGAP provides access to cDNA clones to the research community through a variety of distributors. CGAP provides a wide range of genomic data and resources
PSI is a global health organization dedicated to improving the health of people in the developing world by focusing on serious challenges like a lack of family planning, HIV and AIDS, barriers to maternal health, and the greatest threats to children under five, including malaria, diarrhea, pneumonia and malnutrition. A hallmark of PSI is a commitment to the principle that health services and products are most effective when they are accompanied by robust communications and distribution efforts that help ensure wide acceptance and proper use. PSI works in partnership with local governments, ministries of health and local organizations to create health solutions that are built to last. We use original data to monitor and evaluate our programs, generate consumer insight, estimate the impact of our solutions, and evaluate the health of the markets we work to strengthen.
MassBank of North America (MoNA) is a metadata-centric, auto-curating repository designed for efficient storage and querying of mass spectral records. It intends to serve as a the framework for a centralized, collaborative database of metabolite mass spectra, metadata and associated compounds. MoNA currently contains over 200,000 mass spectral records from experimental and in-silico libraries as well as from user contributions. (Clinical trials) is a registry and results database of publicly and privately supported clinical studies of human participants conducted around the world.
DDBJ; DNA Data Bank of Japan is the sole nucleotide sequence data bank in Asia, which is officially certified to collect nucleotide sequences from researchers and to issue the internationally recognized accession number to data submitters.Since we exchange the collected data with EMBL-Bank/EBI; European Bioinformatics Institute and GenBank/NCBI; National Center for Biotechnology Information on a daily basis, the three data banks share virtually the same data at any given time. The virtually unified database is called "INSD; International Nucleotide Sequence Database DDBJ collects sequence data mainly from Japanese researchers, but of course accepts data and issue the accession number to researchers in any other countries.
The SICAS Medical Image Repository is a database for the management of medical images and subsequently generated models of the bony anatomy. The database will provide a framework for the integration of statistical shape models. This will contribute to less invasive procedures, e.g. by reduced radiation exposure, and it will enable patient specific implant design.
Content type(s)
MIRAGE is developing a warehouse of medical images to facilitate effective online retrieval tools in the institutional web site to complement the existing online e-leaning and teaching system OASISplus, also known as Blackboard Vista , that is currently in operation at Middlesex University (MU); Follow-up project MIRAGE 2011: