  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
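As a rough illustration of how these operators combine, the Python sketch below composes a query string from small helper functions. The helper names (`phrase`, `fuzzy`, `prefix`, `any_of`, `all_of`, `exclude`) are hypothetical and only build text. How the search service interprets the result is assumed from the list above, not verified against it.

```python
from typing import Optional

# Hypothetical helpers: they only assemble query text in the syntax listed
# above and do not contact any search service.

def phrase(text: str, slop: Optional[int] = None) -> str:
    """Quote a phrase; optionally append ~N as the slop amount."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"

def fuzzy(word: str, distance: int) -> str:
    """Append ~N to a word to request the given edit distance."""
    return f"{word}~{distance}"

def prefix(stem: str) -> str:
    """Append * to a keyword for a wildcard search."""
    return f"{stem}*"

def any_of(*terms: str) -> str:
    """Join terms with | (OR) and group them with parentheses."""
    return "(" + " | ".join(terms) + ")"

def all_of(*terms: str) -> str:
    """Join terms with + (AND)."""
    return " +".join(terms)

def exclude(term: str) -> str:
    """Prefix a term with - (NOT)."""
    return f"-{term}"

required = all_of(
    prefix("glaci"),
    any_of(phrase("climate change"), fuzzy("geomagnetism", 1)),
)
query = f"{required} {exclude('cotton')}"
print(query)
```

Grouping the OR terms in parentheses keeps the alternative between the phrase and the fuzzy word separate from the required wildcard term.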
Found 115 result(s)
TEAM is devoted to monitoring long-term trends in biodiversity, land cover change, climate and ecosystem services in tropical forests. Tropical forests received first billing because of their overwhelming significance to the global biosphere (e.g., their disproportionately large role in global carbon and energy cycles) and because of the extraordinary threats they face. About 50 percent of the species described on Earth, and an even larger proportion of species not yet described, occur in tropical forests. TEAM aims to measure and compare plants, terrestrial mammals, ground-dwelling birds and climate using a standard methodology in a range of tropical forests, from relatively pristine places to those most affected by people. TEAM currently operates in sixteen tropical forest sites across Africa, Asia and Latin America supporting a network of scientists committed to standardized methods of data collection to quantify how plants and animals respond to pressures such as climate change and human encroachment.
The WDC for Geomagnetism, a World Data Center, collects geomagnetic data from all over the globe and distributes them to researchers and other data users.
The long-term goal of this project is to implement a new strategy for preserving and providing access to the astrophysical data heritage. IA2 is an ambitious Italian astrophysical research infrastructure project that aims at co-ordinating different national initiatives to improve the quality of astrophysical data services and at facilitating access to these data for research purposes. The first working target is the implementation of the TNG Long-Term Archive (LTA). Its feasibility was demonstrated by the LTA pilot project prototype, funded by CNAA in 2001 and completed successfully in July 2002. The implementation of the TNG archive implies: interfacing with the Centro "Galileo Galilei" (CGG) for the acquisition of TNG data; long-term storage of scientific, technical and auxiliary data from the TNG; providing access for the CGG staff and the scientific community to original and derived data; and providing tools to support the life cycle of observing proposals. The second target of the proposal is to ensure harmonization with other projects related to archiving of data of astrophysical interest, with particular reference to projects involving the Italian astronomical community (LBT, VST, GSC-II, DPOSS, …), to the Italian Solar and Solar System Physics community (SOLAR, SOLRA, ARTHEMIS, which form SOLARNET, a future node of EGSO) and to the national and international coordination efforts fostering the idea of a multiwavelength Virtual Astronomical Observatory, and the use of the archived data through the Italian Astronomical Grid.
The National Digital Archive of Datasets (NDAD) provides access to archived datasets and documents from United Kingdom government departments which can be searched or browsed by subjects such as armed forces service or wills and death duties. Statistics and information gathered through census data as well as public records are used to compile the available datasets. All datasets are available to download and contain a record summary as well as custodial history, background on the source of the data and whether or not data may be added to the dataset in the future.
ConsensusPathDB integrates interaction networks in humans (and in the model organisms yeast and mouse), including binary and complex protein-protein, genetic, metabolic, signaling, gene regulatory and drug-target interactions, as well as biochemical pathways. Data originate from public interaction resources and from interactions curated from the literature. The interaction data are integrated in a complementary manner to avoid redundancies.
BioModels is a repository of mathematical models of biological and biomedical systems. It hosts a vast selection of existing literature-based physiologically and pharmaceutically relevant mechanistic models in standard formats. Our mission is to provide the systems modelling community with reproducible, high-quality, freely-accessible models published in the scientific literature.
The European Nucleotide Archive (ENA) captures and presents information relating to experimental workflows that are based around nucleotide sequencing. A typical workflow includes the isolation and preparation of material for sequencing, a run of a sequencing machine in which sequencing data are produced and a subsequent bioinformatic analysis pipeline. ENA records this information in a data model that covers input information (sample, experimental setup, machine configuration), output machine data (sequence traces, reads and quality scores) and interpreted information (assembly, mapping, functional annotation). Data arrive at ENA from a variety of sources. These include submissions of raw data, assembled sequences and annotation from small-scale sequencing efforts, data provision from the major European sequencing centres and routine and comprehensive exchange with our partners in the International Nucleotide Sequence Database Collaboration (INSDC). Provision of nucleotide sequence data to ENA or its INSDC partners has become a central and mandatory step in the dissemination of research findings to the scientific community. ENA works with publishers of scientific literature and funding bodies to ensure compliance with these principles and to provide optimal submission systems and data access tools that work seamlessly with the published literature.
Earthdata powered by EOSDIS (Earth Observing System Data and Information System) is a key core capability in NASA’s Earth Science Data Systems Program. It provides end-to-end capabilities for managing NASA’s Earth science data from various sources: satellites, aircraft, field measurements, and various other programs. EOSDIS uses the metadata and service discovery tool Earthdata Search. The capabilities of EOSDIS constituting the EOSDIS Science Operations are managed by NASA's Earth Science Data and Information System (ESDIS) Project. The capabilities include generation of higher level (Level 1-4) science data products for several satellite missions, and archiving and distribution of data products from Earth observation satellite missions, as well as aircraft and field measurement campaigns. The EOSDIS science operations are performed within a distributed system of many interconnected nodes: Science Investigator-led Processing Systems (SIPS) and distributed, discipline-specific Earth science Distributed Active Archive Centers (DAACs) with specific responsibilities for production, archiving, and distribution of Earth science data products. The DAACs serve a large and diverse user community by providing capabilities to search and access science data products and specialized services.
Biological collections are replete with taxonomic, geographic, temporal, numerical, and historical information. This information is crucial for understanding and properly managing biodiversity and ecosystems, but is often difficult to access. Canadensys, operated from the Université de Montréal Biodiversity Centre, is a Canada-wide effort to unlock the biodiversity information held in biological collections.
The World Glacier Monitoring Service (WGMS) collects standardized observations on changes in mass, volume, area and length of glaciers with time (glacier fluctuations), as well as statistical information on the distribution of perennial surface ice in space (glacier inventories). Such glacier fluctuation and inventory data are high priority key variables in climate system monitoring; they form a basis for hydrological modelling with respect to possible effects of atmospheric warming, and provide fundamental information in glaciology, glacial geomorphology and quaternary geology. The highest information density is found for the Alps and Scandinavia, where long and uninterrupted records are available. As a contribution to the Global Terrestrial/Climate Observing System (GTOS, GCOS), the Division of Early Warning and Assessment and the Global Environment Outlook of UNEP, and the International Hydrological Programme of UNESCO, the WGMS collects and publishes worldwide standardized glacier data.
The National Archives of the Netherlands (Nationaal Archief), which is situated in The Hague, holds over 3.5 million records that have been created by the central government, organisations and individuals and are of national significance. Many records relate to the colonial and trading history of the Netherlands in the period from 1600 to 1975. The Dutch presence in countries in North and South America, Africa and Asia is reflected within these collections.
The NCEP/NCAR Reanalysis Project is a joint project between the National Centers for Environmental Prediction (NCEP, formerly "NMC") and the National Center for Atmospheric Research (NCAR). The goal of this joint effort is to produce new atmospheric analyses using historical data (1948 onwards) and also to produce analyses of the current atmospheric state (Climate Data Assimilation System, CDAS).
The Substance Abuse and Mental Health Data Archive (SAMHDA) is an initiative funded under contract HHSS283201500001C with the Center for Behavioral Health Statistics and Quality (CBHSQ), Substance Abuse and Mental Health Services Administration (SAMHSA), U.S. Department of Health and Human Services (HHS). CBHSQ has primary responsibility for the collection, analysis, and dissemination of SAMHSA's behavioral health data. Public use files and restricted use files are provided. CBHSQ promotes the access and use of the nation's substance abuse and mental health data through SAMHDA. SAMHDA provides public-use data files, file documentation, and access to restricted-use data files to support a better understanding of this critical area of public health.
Measurements Of Pollution In The Troposphere (MOPITT) was launched into sun-synchronous polar orbit on December 18, 1999, aboard TERRA, a NASA satellite orbiting 705 km above the Earth. MOPITT monitors changes in pollution patterns and the effects on Earth’s troposphere. MOPITT uses near-infrared radiation at 2.3 µm and thermal-infrared radiation at 4.7 µm to calculate atmospheric profiles of CO.
CEEHRC represents a multi-stage funding commitment by the Canadian Institutes of Health Research (CIHR) and multiple Canadian and international partners. The overall aim is to position Canada at the forefront of international efforts to translate new discoveries in the field of epigenetics into improved human health. The two sites will focus on sequencing human reference epigenomes and developing new technologies and protocols; they will also serve as platforms for other CEEHRC funding initiatives, such as catalyst and team grants. The complementary reference epigenome mapping efforts of the two sites will focus on a range of common human diseases. The Vancouver group will focus on the role of epigenetics in the development of cancer, including lymphoma and cancers of the ovary, colon, breast, and thyroid. The Montreal team will focus on autoimmune / inflammatory, cardio-metabolic, and neuropsychiatric diseases, using studies of identical twins as well as animal models of human disease.
The Solar Dynamics Observatory (SDO) studies the solar atmosphere on small scales of space and time, in multiple wavelengths. This is a searchable database of all SDO data, including citizen scientist images, space weather and near real time data, and helioseismology data.
Social Computing Data Repository hosts data from a collection of many different social media sites, most of which have blogging capacity. Some of the prominent social media sites included in this repository are BlogCatalog, Twitter, MyBlogLog, Digg, StumbleUpon, MySpace, LiveJournal, The Unofficial Apple Weblog (TUAW), Reddit, etc. The repository contains various facets of blog data, including blog site metadata such as user-defined tags, predefined categories and blog site description; blog post level metadata such as user-defined tags and date and time of posting; blog posts; blog post mood (defined as the blogger's emotions when (s)he wrote the blog post); blogger name; blog post comments; and blogger social network.
SND is a service organisation for Swedish research within the humanities, social sciences and health sciences. SND helps Swedish and international researchers gain access to existing data within and outside of Sweden. SND provides support and guidance to researchers throughout the whole research process. SND is the Swedish node in an international network of data archives. This network is an important part of the research infrastructure.
The Cotton Database is provided by the Central Institute for Cotton Research in India. The database includes data on cotton production, protection, improvement, economy, and industry.
GenBank® is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.
The GeneDB project is a core part of the Sanger Institute's Pathogen Genomics activities. Its primary goals are to provide reliable access to the latest sequence data and annotation/curation for the whole range of organisms sequenced by the Pathogen group, and to develop the website and other tools to aid the community in accessing and obtaining the maximum value from these data.