Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 2053 result(s)
The UniProt Reference Clusters (UniRef) provide clustered sets of sequences from the UniProt Knowledgebase (including isoforms) and selected UniParc records in order to obtain complete coverage of the sequence space at several resolutions while hiding redundant sequences (but not their descriptions) from view.
Content type(s)
The Lamont-Doherty Core Repository (LDCR) contains one of the world’s most unique and important collection of scientific samples from the deep sea. Sediment cores from every major ocean and sea are archived at the Core Repository. The collection contains approximately 72,000 meters of core composed of 9,700 piston cores; 7,000 trigger weight cores; and 2,000 other cores such as box, kasten, and large diameter gravity cores. We also hold 4,000 dredge and grab samples, including a large collection of manganese nodules, many of which were recovered by submersibles. Over 100,000 residues are stored and are available for sampling where core material is expended. In addition to physical samples, a database of the Lamont core collection has been maintained for nearly 50 years and contains information on the geographic location of each collection site, core length, mineralogy and paleontology, lithology, and structure, and more recently, the full text of megascopic descriptions.
PetDB, the Petrological Database, is a web-based data management system that provides on-line access to geochemical and petrological data. PetDB is a global synthesis of chemical, isotopic, and mineralogical data for rocks, minerals, and melt inclusions. PetDB's current content focuses on data for igneous and metamorphic rocks from the ocean floor, specifically mid-ocean ridge basalts and abyssal peridotites and xenolith samples from the Earth's mantle and lower crust. PetDB is maintained and continuously updated as part of the EarthChem data collections.
The Ocean Date and Information System provides information on physical, chemical, biological and geological parameters of ocean and coasts on spatial and temporal domains that is vital for both research and operational oceanography. In-situ and remote sensing data are included. The Ocean Information Bank is supported by the data received from Ocean Observing Systems in the Indian Ocean (both the in-situ platforms and satellites) as well as by a chain of Marine Data Centres. Ocean and coastal measurements are available. Data products are accessible through various portals on the site and are largely available by data type (in situ or remote sensing) and then by parameter.
A repository for high-quality gene models produced by the manual annotation of vertebrate genomes. The final update of Vega, version 68, was released in February 2017 and is now archived at We plan to maintain this resource until Feb 2020.
Seafloor data at these locations include multibeam bathymetry, high-resolution scanning altimetry, deep-towed sidescan sonar, seafloor digital photographs, sample locations, deep-towed magnetic data, and more. Each dataset is geospatially registered and incorporates metadata (e.g., sample geochemistry and time and depth of collection) that can be interactively accessed by the user. The user can control which datasets are shown on a map view of the area and the scale at which the map is viewed. Datasets with resolutions inappropriate for the scale at which they are being viewed are automatically removed from the map.
World Data Service for Oceanography is one component of a global network of discipline subcenters that facilitate international exchange of scientific data. Originally established as the World Data Center system during the International Geophysical Year of 1957-58, the new World Data System continues to function under the guidance of the International Council for Science (ICSU). World Data Service for Oceanography is collocated with, and operated by, the U.S. National Oceanographic Data Center (NODC).
Kochi Core Center (KCC) houses one of the 3 Inernationational Ocean Discovery Program (IODP) core repositories, accompanied by images and x-ray CT scanning data viewable by the Virtual Core Library. And it hosts Japan Agency for Marine-Earth Science and Technology (JAMSTEC) marine core samples and associated analytical data for general scientific or educational uses, after 2 years have passed since collection of core samples.
!!!! Retirement of UniProt Metagenomic and Environmental Sequences (UniMES): UniProt has retired UniMES as there is now a resource at the EBI that is dedicated to serving metagenomic researchers. Henceforth, we recommend using the EBI Metagenomics portal instead. In addition to providing a repository of metagenomics sequence data, EBI Metagenomics allows you to view functional and taxonomic analyses and to submit your own samples for analysis. !!! The UniProt Metagenomic and Environmental Sequences (UniMES) database is a repository specifically developed for metagenomic and environmental data. We provide UniMES clusters in order to obtain complete coverage of sequence space at different resolutions.
The Animal Sound Archive at the Museum für Naturkunde in Berlin is one of the oldest and largest collections of animal sounds. Presently, the collection consists of about 120,000 bioacoustical recordings comprising almost all groups of animals: 1.800 bird species 580 mammalian species more then150 species of invertebrates; some fishes, amphibians and reptiles
The PLANTS Database provides standardized information about the vascular plants, mosses, liverworts, hornworts, and lichens of the U.S. and its territories. It includes names, plant symbols, checklists, distributional data, species abstracts, characteristics, images, crop information, automated tools, onward Web links, and references. This information primarily promotes land conservation in the United States and its territories, but academic, educational, and general use is encouraged. PLANTS reduces government spending by minimizing duplication and making information exchange possible across agencies and disciplines.
This facility permits selective searches of some atomic data files compiled by R. L. Kurucz (Harvard-Smithsonian Center for Astrophysics). The data provided are: - vacuum wavelength (in nm) [above 200 nm calculated using Edlen, Metrologia, Vol. 2, No. 2, 1966]- air wavelength (in nm) above 200 nm- log(gf), - E [in cm-1], j, parity, and configuration for the levels (lower, upper), - information regarding the source of the data. CD-ROM 18 contains the spectrum synthesis programs ATLAS7V, SYNTHE, SPECTRV, ROTATE, BROADEN, PLOTSYN, etc. and sample runs found in directory PROGRAMS; Atomic line data files BELLHEAVY.DAT, BELLLIGHT.DAT, GFIRONLAB.DAT, GULLIVER.DAT, NLTELINES.DAT, GFIRONQ.DAT, obsolete, merged into GFALL, found in directory LINELISTS: Molecular line data files C2AX.ASC, C2BA.ASC, C2DA.ASC, C2EA.ASC, CNAX.ASC, CNBX.ASC, COAX.ASC, COXX.ASC, H2.ASC, HYDRIDES.ASC, SIOAX.ASC, SIOEX.ASC, SIOXX.ASC, found in directory LINELISTS; and my solar flux atlas for test calculations SOLARFLUX.ASC.
STARK-B is a database of calculated widths and shifts of isolated lines of atoms and ions due to electron and ion collisions. This database is devoted to modeling and spectroscopic diagnostics of stellar atmospheres and envelopes. In addition, it is also devoted to laboratory plasmas, laser equipments and technological plasmas. So, the domain of temperatures and densities covered by the tables is wide and depends on the ionization degree of the considered ion. The temperature can vary from several thousands for neutral atoms to several hundred thousands of Kelvin for highly charged ions. The electron or ion density can vary from 1012 (case of stellar atmospheres) to several 1019cm-3 (some white dwarfs and some laboratory plasmas).
The CCC method yields accurate excitation and ionisation cross sections for atomic and ionic targets which are well-modelled by one or two valence electrons above a Hartree-Fock core. Inner core ionisation can be a major contributor to the total ionisation cross section. Such contributions can be estimated using various forms of Born-based approximations.
DBT is the institutional repository of the FSU Jena, the TU Ilmenau and the University of Erfurt as well as members of the other Thuringian universities and colleges can publish scientific documents in the DBT. In individual cases, land users (via the ThULB Jena) can also archive documents in the DBT.
The International Maize and Wheat Improvement Center (CIMMYT) provides a free, open access repository of research software, studies, and datasets produced and developed by CIMMYT scientists as well as the results of the Seeds of Discovery project, which makes available genetic profiles of wheat and maize, two of mankind's three major cereal crops.
figshare allows researchers to publish all of their research outputs in an easily citable, sharable and discoverable manner. All file formats can be published, including videos and datasets. Optional peer review process. figshare uses creative commons licensing.
The National Sleep Research Resource (NSRR) offers free web access to large collections of de-identified physiological signals and clinical data elements collected in well-characterized research cohorts and clinical trials.
EIDA, an initiative within ORFEUS, is a distributed data centre established to (a) securely archive seismic waveform data and related metadata, gathered by European research infrastructures, and (b) provide transparent access to the archives by the geosciences research communities. EIDA nodes are data centres which collect and archive data from seismic networks deploying broad-band sensors, short period sensors, accelerometers, infrasound sensors and other geophysical instruments. Networks contributing data to EIDA are listed in the ORFEUS EIDA networklist ( Data from the ORFEUS Data Center (ODC), hosted by KNMI, are available through EIDA. Technically, EIDA is based on an underlying architecture developed by GFZ to provide transparent access to all nodes' data. Data within the distributed archives are accessible via the ArcLink protocol (
The geothermal information system (GeotIS) provides information and data compilations on deep aquifers in Germany relevant for geothermal exploitation. GeotIS is a public internet based information system and satisfies the demand for a comprehensive, largely scale-independent form of a geothermal atlas which can be continuously updated. GeotIS helps users identify geothermal potentials by visualizing temperature, hydraulic properties and depth levels of relevant stratigraphic units. A sophisticated map interface simplifies the navigation to all areas of interest. An additional component contains a catalogue of all geothermal installations in Germany. The primary objective of this project is to improve the quality of geothermal-plant project-planning and the estimation of the exploration risk for geothermal projects on selectable locations. However, concrete, location-specific analyses still remain the task of local feasibility studies.
LOVD portal provides LOVD software and access to a list of worldwide LOVD applications through Locus Specific Database list and List of Public LOVD installations. The LOVD installations that have indicated to be included in the global LOVD listing are included in the overall LOVD querying service, which is based on an API.
The OpenNeuro project (formerly known as the OpenfMRI project) was established in 2010 to provide a resource for researchers interested in making their neuroimaging data openly available to the research community. It is managed by Russ Poldrack and Chris Gorgolewski of the Center for Reproducible Neuroscience at Stanford University. The project has been developed with funding from the National Science Foundation, National Institute of Drug Abuse, and the Laura and John Arnold Foundation.