  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
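The operators listed above can be combined into a single query string. The following is a minimal sketch in Python of how such strings might be assembled. All function names are hypothetical, the spacing around `+` and `|` is an assumption, and the code only builds text; it does not contact any search service.

```python
# Hypothetical helpers that assemble query strings from the operators
# described in the search tips. Nothing here talks to a real search API.

def phrase(text, slop=None):
    """Quote a phrase; optionally append ~N as the slop amount."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"

def wildcard(prefix):
    """Append * to the end of a keyword for a wildcard search."""
    return f"{prefix}*"

def fuzzy(word, distance):
    """Append ~N to a word to set the desired edit distance."""
    return f"{word}~{distance}"

def any_of(*terms):
    """Join terms with | (OR) and wrap them in parentheses for priority."""
    return "(" + " | ".join(terms) + ")"

def all_of(*terms):
    """Join terms with + (AND, which the tips describe as the default)."""
    return " + ".join(terms)

def exclude(term):
    """Prefix a term with - for a NOT operation."""
    return f"-{term}"

# Example: records about genom* AND (microarray OR the phrase
# "structural variants"), but NOT containing "retired".
query = all_of(
    wildcard("genom"),
    any_of("microarray", phrase("structural variants")),
) + " " + exclude("retired")

print(query)
```

The resulting string could then be pasted into the search box; whether the service accepts exactly this spacing has not been verified here.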
Found 165 result(s)
The 1000 Genomes Project is an international collaboration to produce an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts. This resource will support genome-wide association studies and other medical research studies. The genomes of about 2500 unidentified people from about 25 populations around the world will be sequenced using next-generation sequencing technologies. The results of the study will be freely and publicly accessible to researchers worldwide. The International Genome Sample Resource (IGSR) has been established at EMBL-EBI to continue supporting data generated by the 1000 Genomes Project, supplemented with new data and new analysis.
Addgene archives and distributes plasmids for researchers around the globe. They are working with thousands of laboratories to assemble a high-quality library of published plasmids for use in research and discovery. By linking plasmids with articles, scientists can always find data related to the materials they request.
The African Development Bank Group (AfDB) is committed to supporting statistical development in Africa as a sound basis for designing and managing effective development policies for reducing poverty on the continent. Reliable and timely data is critical to setting goals and targets as well as evaluating project impact. Reliable data constitutes the single most convincing way of getting the people involved in what their leaders and institutions are doing. It also helps them to get involved in the development process, thus giving them a sense of ownership of the entire development process. The AfDB has a large team of researchers who focus on the production of statistical data on economic and social situations. The data produced by the institution’s statistics department constitutes the background information in the Bank’s flagship development publications. Besides its own publications, the AfDB also finances studies in collaboration with its partners. The Statistics Department aims to stand as the primary source of relevant, reliable and timely data on African development processes, starting with the data generated from its current management of the Africa component of the International Comparison Program (ICP-Africa). The Department discharges its responsibilities through two divisions: the Economic and Social Statistics Division (ESTA1) and the Statistical Capacity Building Division (ESTA2).
A collection of data at the Agency for Healthcare Research and Quality (AHRQ) supporting research that helps people make more informed decisions and improves the quality of health care services. The portal contains the U.S. Health Information Knowledgebase (USHIK), the Systematic Review Data Repository (SRDR), and other sources concerning cost, quality, accessibility and evaluation of healthcare and medical insurance.
Numerical database of atomic and molecular processes and particle-surface interactions. ALADDIN has formatted data on atomic structure and spectra (energy levels, wavelengths, and transition probabilities); electron and heavy particle collisions with atoms, ions, and molecules (cross sections and/or rate coefficients, including, in most cases, analytic fit to the data); sputtering of surfaces by impact of main plasma constituents and self sputtering; particle reflection from surfaces; thermophysical and thermomechanical properties of beryllium and pyrolytic graphites.
Alaska Ocean Observing System (AOOS) provides ocean and coastal observations data. The AOOS is governed by the Integrated Ocean Observing System (IOOS) which is a partnership among federal, regional, academic and private sector groups. The Ocean Data Explorer contains scientific and management information including real-time sensor feeds, operational oceanographic and atmospheric models, satellite observations and GIS data sets that describe the biological, chemical and physical characteristics of Alaska and its surrounding waters. This map offers many new updated features that build upon the existing data system.
The SAR Data Center has a large data archive of Synthetic Aperture Radar (SAR) from a variety of sensors available at no cost. Much of the SAR data in the ASF SDC archive is limited in distribution to the scientific research community and U.S. Government Agencies. In accordance with the Memoranda of Understanding (MOU) between the relevant flight agencies (CSA, ESA, JAXA) and the U.S. State Department, the ASF SDC does not distribute SAR data for commercial use. The research community can access the data (ERS-1, ERS-2, JERS-1, RADARSAT-1, and ALOS PALSAR) via a brief proposal process.
The German General Social Survey (ALLBUS) collects up-to-date data on attitudes, behavior, and social structure in Germany. Every two years since 1980, a representative cross section of the population has been surveyed using both constant and variable questions. The ALLBUS data become available to interested parties for research and teaching as soon as they are processed and documented.
AmericasBarometer surveys are multi-country, regularly conducted surveys of democratic values and behaviors in the Americas. The raw data are available for free at all LAPOP consortium member institutions and to all other users worldwide. In addition, permanent ownership of the data, by becoming a 'repository', is possible for a fee.
The Antarctic and Southern Ocean Data Portal, part of the US Antarctic Data Consortium, provides access to geoscience data, primarily marine, from the Antarctic region. The synthesis began in 2003 as the Antarctic Multibeam Bathymetry and Geophysical Data Synthesis (AMBS) with a focus on multibeam bathymetry field data and other geophysical data from the Southern Ocean collected with the R/V N. B. Palmer. In 2005, the effort was expanded to include all routine underway geophysical and oceanographic data collected with both the R/V N. B. Palmer and R/V L. Gould, the two primary research vessels serving the US Antarctic Program.
ArachnoServer is a manually curated database containing information on the sequence, three-dimensional structure, and biological activity of protein toxins derived from spider venom. Spiders are the largest group of venomous animals and they are predicted to contain by far the largest number of pharmacologically active peptide toxins (Escoubas et al., 2006). ArachnoServer has been custom-built so that a wide range of biological scientists, including neuroscientists, pharmacologists, and toxinologists, can readily access key data relevant to their discipline without being overwhelmed by extraneous information.
ArrayExpress is one of the major international repositories for high-throughput functional genomics data from both microarray and high-throughput sequencing studies, many of which are supported by peer-reviewed publications. Data sets are submitted directly to ArrayExpress and curated by a team of specialist biological curators. In the past (until 2018) datasets from the NCBI Gene Expression Omnibus database were imported on a weekly basis. Data is collected to MIAME and MINSEQE standards.
ASTM International, formerly known as the American Society for Testing and Materials (ASTM), is a globally recognized leader in the development and delivery of international voluntary consensus standards. Today, some 12,000 ASTM standards are used around the world to improve product quality, enhance safety, facilitate market access and trade, and build consumer confidence.
>>> --- !!!! Attention: The institute apparently no longer exists, and the links no longer work. !!!! --- <<< Our center is devoted to the collection, compilation, evaluation, and dissemination of scientific information required for fusion research, and to the investigation of problems arising in the course of development of fusion research. There are atomic and molecular (A & M) numerical databases and bibliographic databases on plasma physics and atomic physics.
The Australian Antarctic Data Centre (AADC) provides data collection and data management services in Australia's Antarctic Science Program. The AADC manages science data from Australia's Antarctic research, maps Australia's areas of interest in the Antarctic region, manages Australia's Antarctic state of the environment reporting, and provides advice and education and a range of other products.
The Australian National Corpus collates and provides access to assorted examples of Australian English text, transcriptions, audio and audio-visual materials. Text analysis tools are embedded in the interface allowing analysis and downloads in *.CSV format.
>>>!!!<<< Sorry, we are no longer in operation >>>!!!<<< The Beta Cell Biology Consortium (BCBC) was a team science initiative that was established by the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK). It was initially funded in 2001 (RFA DK-01-014), and competitively continued both in 2005 (RFAs DK-01-17, DK-01-18) and in 2009 (RFA DK-09-011). Funding for the BCBC came to an end on August 1, 2015, and with it so did our ability to maintain active websites. One of the many goals of the BCBC was to develop and maintain databases of useful research resources. A total of 813 different scientific resources were generated and submitted by BCBC investigators over the 14 years it existed. Information pertaining to 495 selected resources, judged to be the most scientifically useful, has been converted into a static catalog, as shown below. In addition, the metadata for these 495 resources have been transferred to dkNET in the form of RDF descriptors, and all genomics data have been deposited to either ArrayExpress or GEO. Please direct questions or comments to the NIDDK Division of Diabetes, Endocrinology & Metabolic Diseases (DEM).
BioMagResBank (BMRB) is the publicly accessible depository for NMR results from peptides, proteins, and nucleic acids recognized by the International Society of Magnetic Resonance and by the IUPAC-IUBMB-IUPAB Inter-Union Task Group on the Standardization of Data Bases of Protein and Nucleic Acid Structures Determined by NMR Spectroscopy. In addition, BMRB provides reference information and maintains a collection of NMR pulse sequences and computer software for biomolecular NMR.
BRENDA is the main collection of enzyme functional data available to the scientific community worldwide. The enzymes are classified according to the Enzyme Commission list of enzymes. It is available free of charge via the internet and as an in-house database for commercial users (requests to our distributor Biobase). Some 5000 "different" enzymes are covered. Frequently enzymes with very different properties are included under the same EC number. BRENDA includes biochemical and molecular information on classification, nomenclature, reaction, specificity, functional parameters, occurrence, enzyme structure, application, engineering, stability, disease, isolation, and preparation. The database also provides additional information on ligands, which function as natural or in vitro substrates/products, inhibitors, activating compounds, cofactors, bound metals, and other attributes.
This is the host website of the Center for Invasive Species and Ecosystem Health at the University of Georgia (formerly: Bugwood Network). The Center aims to develop, consolidate and disseminate information and programmes focused on invasive species, forest health, natural resources and agricultural management through technology development, programme implementation, training, applied research and public awareness at state, regional, national and international levels. The site gives details of its products (Bugwood Image Database; Early Detection and Distribution Mapping; and Bugwoodwiki). Details of its projects, services and personnel are provided. Users can also access image databases on Forestry, Insects, IPM, Invasive Species, Forest Pests, Weed and Bark Beetle.
>>>!!!<<< caArray Retirement Announcement >>>!!!<<< The National Cancer Institute (NCI) Center for Biomedical Informatics and Information Technology (CBIIT) instance of the caArray database was retired on March 31st, 2015. All publicly accessible caArray data and annotations will be archived and will remain available via FTP download, and are also available at GEO. >>>!!!<<< While NCI will not be able to provide technical support for the caArray software after the retirement, the source code is available on GitHub, and we encourage continued community development. Molecular Analysis of Brain Neoplasia (Rembrandt fine-00037) gene expression data has been loaded into ArrayExpress. >>>!!!<<< caArray is an open-source, web and programmatically accessible microarray data management system that supports the annotation of microarray data using MAGE-TAB and web-based forms. Data and annotations may be kept private to the owner, shared with user-defined collaboration groups, or made public. The NCI instance of caArray hosts many cancer-related public datasets available for download.
The California Coastal Atlas is an experiment in the creation of a new information resource for the description, analysis and understanding of natural and human processes affecting the coast of California.
>>>!!!<<< As stated 2017-05-23, Cancer GEnome Mine is no longer available >>>!!!<<< Cancer GEnome Mine is a public database for storing clinical information about tumor samples and microarray data, with emphasis on array comparative genomic hybridization (aCGH) and data mining of gene copy number changes.
The CDC Data Catalogue describes the Climate Data of the DWD and provides access to data, descriptions and access methods. Climate Data refers to observations, statistical indices and spatial analyses. CDC comprises Climate Data for Germany, but also global Climate Data, which were collected and processed in the framework of international co-operation. The CDC Data Catalogue is under construction and not yet complete. The purposes of the CDC Data Catalogue are: to provide uniform access to climate data centres and climate datasets of the DWD; to describe the climate data according to international metadata standards; to make the catalogue information available on the Internet; to support the search for climate data; and to facilitate the access to climate data and climate data descriptions.
>>> !!!!! The Cell Centered Database is no longer in service. It has been merged with "Cell image library". !!!!! <<<