  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
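The operators listed above can be combined into a single query. The following is a minimal sketch of how such query strings might be composed; the helper names (prefix, phrase, fuzzy, all_of, any_of, exclude) and the search terms are hypothetical and chosen only for illustration. The code builds text and does not contact any search service, so how a given search engine interprets the result is not shown here.

```python
from typing import Optional


def prefix(stem: str) -> str:
    """Wildcard search: '*' at the end of a keyword."""
    return f"{stem}*"


def phrase(text: str, slop: Optional[int] = None) -> str:
    """Phrase search in quotes, optionally followed by ~N (slop amount)."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"


def fuzzy(word: str, distance: int) -> str:
    """Fuzzy search: ~N after a word (edit distance)."""
    return f"{word}~{distance}"


def all_of(*terms: str) -> str:
    """AND search: terms joined with '+'."""
    return " + ".join(terms)


def any_of(*terms: str) -> str:
    """OR search: terms joined with '|', grouped in parentheses for priority."""
    return "(" + " | ".join(terms) + ")"


def exclude(term: str) -> str:
    """NOT operation: '-' in front of a term."""
    return f"-{term}"


# Illustrative query: a wildcard keyword AND (a sloppy phrase OR a fuzzy word),
# followed by an excluded term.
query = " ".join([
    all_of(
        prefix("genom"),
        any_of(phrase("protein complexes", slop=2), fuzzy("metagenome", 1)),
    ),
    exclude("influenza"),
])
print(query)  # genom* + ("protein complexes"~2 | metagenome~1) -influenza
```

Because AND is stated to be the default, the explicit + could presumably be dropped without changing the meaning of the query.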
Found 26 result(s)
The National Cancer Data Base (NCDB), a joint program of the Commission on Cancer (CoC) of the American College of Surgeons (ACoS) and the American Cancer Society (ACS), is a nationwide oncology outcomes database for more than 1,500 Commission-accredited cancer programs in the United States and Puerto Rico. Some 70 percent of all newly diagnosed cases of cancer in the United States are captured at the institutional level and reported to the NCDB. The NCDB, begun in 1989, now contains approximately 29 million records from hospital cancer registries across the United States. Data on all types of cancer are tracked and analyzed. These data are used to explore trends in cancer care, to create regional and state benchmarks for participating hospitals, and to serve as the basis for quality improvement.
MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease. The projects contributing to this resource are the Mouse Genome Database (MGD) Project, the Gene Expression Database (GXD) Project, the Mouse Tumor Biology (MTB) Database Project, the Gene Ontology (GO) Project at MGI, the MouseMine Project, and the MouseCyc Project at MGI.
The RRUFF Project is creating a complete set of high-quality spectral data from well-characterized minerals and is developing the technology to share this information with the world. The collected data provide a standard for mineralogists, geoscientists, gemologists and the general public for the identification of minerals, both on Earth and for planetary exploration. Electron microprobe analysis is used to determine the chemistry of each mineral.
The Host-Pathogen Interaction Database (HPIDB) is a genomics resource devoted to understanding molecular interactions between key organisms and the pathogens to which they are susceptible. It is a public resource that integrates experimental protein-protein interactions (PPIs) from various databases into a single database.
The edoc-Server, launched in 1998, is the institutional repository of the Humboldt-Universität zu Berlin and offers the possibility of publishing texts and data. Every item is published open access, with an optional embargo period of up to five years. Data publications have been accepted since 1 January 2018.
Research Data Finder is QUT’s discovery service for research data created or collected by QUT researchers. Designed to promote the visibility of QUT research datasets, Research Data Finder provides descriptions about shareable, reusable datasets available via open or mediated access.
The Reciprocal Net is a distributed database used by research crystallographers to store information about molecular structures; much of the data is available to the general public. The Reciprocal Net project is still under development. Currently, there are 18 participating crystallography laboratories online. The project is funded by the National Science Foundation (NSF) and part of the National Science Digital Library. The contents of this collection will come principally from structures contributed by participating crystallography laboratories, thus providing a means for teachers, students, and the general public to connect better with current chemistry research. The Reciprocal Net's emphasis is on obtaining structures of general interest and usefulness to those several classes of digital library users.
The Arctic Data Archive System (ADS) collects observation data and modeling products obtained by various Japanese research projects and gives researchers access to the results. By centrally managing a wide variety of Arctic observation data, we promote the use of data across multiple disciplines. Researchers use these integrated databases to clarify the mechanisms of environmental change in the atmosphere, ocean, land surface and cryosphere. ADS is also expected to provide an opportunity for collaboration between modelers and field scientists.
The Neuroscience Information Framework is a dynamic inventory of Web-based neuroscience resources: data, materials, and tools accessible via any computer connected to the Internet. An initiative of the NIH Blueprint for Neuroscience Research, NIF advances neuroscience research by enabling discovery and access to public research data and tools worldwide through an open source, networked environment.
"IndExs" is a database comprising information on exsiccatae (= exsiccatal series), including titles, abbreviations and bibliography. It provides a unique and persistent Exsiccata ID for each series. Exsiccatae are defined as "published, uniform, numbered sets of preserved specimens distributed with printed labels" (Pfister 1985). Please note that there are two similar Latin terms: "exsiccata, ae" is feminine and used for a set of dried specimens as defined above, whereas "exsiccatum, i" is neuter and used for dried specimens in general. If available, images of one or more exemplary labels are added to give layout information.
ALFRED is a free, web-accessible, curated compilation of allele frequency data on DNA sequence polymorphisms in anthropologically defined human populations. ALFRED is distinct from such databases as dbSNP, which catalogs sequence variation.
FlyBase is a database of genetic, genomic and functional data for Drosophila species, with a focus on the model organism Drosophila melanogaster. FlyBase contains a complete annotation of the Drosophila melanogaster genome that is updated several times per year. It also includes a searchable bibliography of research on Drosophila genetics in the last century. The site also provides a large database of images illustrating the full genome, and several movies detailing embryogenesis.
The MG-RAST server is an open source system for annotation and comparative analysis of metagenomes. Users can upload raw sequence data in FASTA format; the sequences are normalized and processed, and summaries are generated automatically. The server provides several methods to access the different data types, including phylogenetic and metabolic reconstructions, and the ability to compare the metabolism and annotations of one or more metagenomes and genomes. In addition, the server offers a comprehensive search capability. Access to the data is password protected, and all data generated by the automated pipeline is available for download in a variety of common formats. MG-RAST has become an unofficial repository for metagenomic data, providing a means to make your data public so that it is available for download and viewing of the analysis without registration, as well as a static link that you can use in publications. It also requires that you include experimental metadata about your sample when it is made public, to increase its usefulness to the community.
RELMIN collects, studies and publishes legal texts defining the status of religious minorities in medieval Europe. The corpus of texts is rich and varied, spanning ten centuries over a broad geographical area; these texts, in Latin, Arabic, Greek, Hebrew and Aramaic (and also in Medieval Spanish, Portuguese, and other European vernaculars), are dispersed in libraries and archives across Europe. The texts are now gathered in the RELMIN Database in their original language, with translations and commentaries. They are made available to scholars, students and citizens at large. Access is unlimited, free and perennial. Users may also contribute to the work of compilation. RELMIN is building a digital database of legal, judicial and normative sources defining the status of religious minorities from the 5th to the 15th century.
The GISAID Initiative promotes the international sharing of all influenza virus sequences, related clinical and epidemiological data associated with human viruses, and geographical as well as species-specific data associated with avian and other animal viruses, to help researchers understand how the viruses evolve, spread and potentially become pandemics. GISAID does so by overcoming disincentives, hurdles and restrictions that discourage or prevent the sharing of influenza data prior to formal publication. The Initiative ensures that open access to data in GISAID is provided free of charge to everyone, provided individuals identify themselves and agree to uphold the GISAID sharing mechanism governed through its Database Access Agreement. GISAID calls on all users to uphold scientific etiquette by acknowledging the originating laboratories that provide the specimens and the submitting laboratories that generate the sequence data, and by ensuring fair exploitation of results derived from the data. All users also agree that no restrictions shall be attached to data submitted to GISAID, so as to promote collaboration among researchers on the basis of open sharing of data and respect for all rights and interests.
CORUM is a manually curated dataset of mammalian protein complexes. Annotation of protein complexes includes protein complex composition and other valuable information such as method of purification, cellular function of complexes or involvement in diseases.
The goal of the psychology data archive PsychData is the documentation and long-term archiving of research data from all areas of psychology and the social sciences, using specially created metadata, and the provision of these data for scientific purposes such as secondary analysis and reanalysis. PsychData covers all areas of psychology, in particular data sets from clinical, developmental, educational, gero-, and work and organizational psychology stemming from longitudinal studies, major surveys, and test development.
More than a quarter of a million people (one in 10 NSW men and women aged over 45) have been recruited to our 45 and Up Study, the largest ongoing study of healthy ageing in the Southern Hemisphere. The baseline information collected from all of our participants is available in the Study’s Data Book. This information, which researchers use as the basis for their analyses, covers key variables such as height, weight, smoking status, family history of disease and levels of physical activity. By following such a large group of people over the long term, we are developing a world-class research resource that can be used to boost our understanding of how Australians are ageing. This will answer important health and quality-of-life questions and help manage and prevent illness through improved knowledge of conditions such as cancer, heart disease, depression, obesity and diabetes.
The Atmospheric Science Data Center (ASDC) at NASA Langley Research Center is responsible for processing, archiving, and distributing NASA Earth science data in the areas of radiation budget, clouds, aerosols, and tropospheric chemistry. The ASDC specializes in atmospheric data important to understanding the causes and processes of global climate change and the consequences of human activities on the climate.
The Landcare Research DataStore ('the DataStore') is the general data catalogue and repository for Environmental Research Data from Landcare Research. Much of Landcare Research’s research data is available through specific web pages, but many datasets sit outside these areas. This new data repository provides a mechanism for our staff to deposit and document this wider range of datasets so that they may be discovered and potentially re-used.
The Phonogrammarchiv is a multi-disciplinary research sound and video archive, covering holdings from all continents. Since its foundation in 1899 the Phonogrammarchiv has been building up its holdings by cooperating with Austrian scholars and archiving their collected material, or by fieldwork conducted by staff members on special topics exploring new fields of methods and contents. The main tasks comprise the production, annotation, cataloguing and long-term preservation of audio-visual field recordings, making the cultural heritage available for future generations and enabling the dissemination of the recordings as well as technical developments in the field of AV recording and storage. Thus the Phonogrammarchiv adds to infrastructural performance valuable to both the scholarly community and the public at large.
The Antimicrobial Peptide Database (APD) was originally created by a graduate student, Zhe Wang, as his master's thesis in the laboratory of Dr. Guangshun Wang. The project was initiated in 2002 and the first version of the database was opened to the public in August 2003. It contained 525 peptide entries, which could be searched in multiple ways, including APD ID, peptide name, amino acid sequence, original location, PDB ID, structure, methods for structural determination, peptide length, charge, hydrophobic content, and antibacterial, antifungal, antiviral, anticancer, and hemolytic activity. Some results of this bioinformatics tool were reported in the 2004 database paper. The peptide data stored in the APD were gleaned manually from the literature (PubMed, PDB, Google, and Swiss-Prot) over more than a decade.
AGS delivers geoscience in several key areas, including surficial mapping, bedrock mapping, geological modelling, resource evaluation (hydrocarbons, minerals), groundwater, and geological hazards. We are also responsible for maintaining the Alberta Table of Formations and providing geoscience outreach to stakeholders ranging from professional colleagues and academia to the general public.