  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
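The operators listed above can be combined into a single query string. The following is a minimal sketch in Python that only assembles such strings. The helper names (`prefix`, `phrase`, `fuzzy`, `all_of`, `any_of`, `exclude`) are hypothetical and chosen for illustration; the sketch does not contact the search service and makes no claim about how the site parses queries internally.

```python
from typing import Optional


def prefix(keyword: str) -> str:
    """Wildcard search: a trailing * matches terms starting with `keyword`."""
    return f"{keyword}*"


def phrase(words: str, slop: Optional[int] = None) -> str:
    """Quote a phrase; an optional ~N after the phrase sets the slop amount."""
    quoted = f'"{words}"'
    return quoted if slop is None else f"{quoted}~{slop}"


def fuzzy(word: str, distance: int) -> str:
    """~N after a word sets the desired edit distance (fuzziness)."""
    return f"{word}~{distance}"


def all_of(*terms: str) -> str:
    """AND search: join terms with + and group them with parentheses."""
    return "(" + " + ".join(terms) + ")"


def any_of(*terms: str) -> str:
    """OR search: join terms with | and group them with parentheses."""
    return "(" + " | ".join(terms) + ")"


def exclude(term: str) -> str:
    """NOT operation: prefix the term with -."""
    return f"-{term}"


# Example: (a "process" prefix OR a loose phrase) AND a misspelled word
# matched fuzzily, excluding one hypothetical keyword.
query = (
    all_of(
        any_of(prefix("process"), phrase("data processing", slop=2)),
        fuzzy("satelite", 1),
    )
    + " "
    + exclude("commercial")
)
print(query)
```

The parentheses are emitted for every AND and OR group so that priority is always explicit, even where the default AND behaviour would make them unnecessary.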
Found 113 result(s)
Datatang is a professional data pre-processing company. We collect, annotate, and customize data to meet our clients’ various needs. We help clients in university research labs and company R&D departments skip trivial yet necessary data processing steps and reach the highest-value data more efficiently.
SeaBASS is the publicly shared archive of in situ oceanographic and atmospheric data maintained by the NASA Ocean Biology Processing Group (OBPG). High-quality in situ measurements are a prerequisite for satellite data product validation, algorithm development, and many climate-related inquiries. As such, the OBPG maintains a local repository of in situ oceanographic and atmospheric data to support its regular scientific analyses. The SeaWiFS Project originally developed this system, SeaBASS, to catalog radiometric and phytoplankton pigment data used in its calibration and validation activities. To facilitate the assembly of a global data set, SeaBASS was expanded with oceanographic and atmospheric data collected by participants in the SIMBIOS Program, under NASA Research Announcements NRA-96 and NRA-99, which has aided considerably in minimizing spatial bias and maximizing data acquisition rates. Archived data include measurements of apparent and inherent optical properties, phytoplankton pigment concentrations, and other related oceanographic and atmospheric data, such as water temperature, salinity, stimulated fluorescence, and aerosol optical thickness. Data are collected using instrument packages from a number of different manufacturers, such as profilers, buoys, and hand-held instruments, on a variety of platforms, including ships and moorings.
The department specializes in developing complex distributed systems for satellite data processing. Its main task is the development, validation and implementation of satellite data processing methods in the form of information services and systems.
The Precipitation Processing System (PPS) evolved from the Tropical Rainfall Measuring Mission (TRMM) Science Data and Information System (TSDIS). The purpose of the PPS is to process, analyze and archive data from the Global Precipitation Measurement (GPM) mission, partner satellites and the TRMM mission. The PPS also supports TRMM by providing validation products from TRMM ground radar sites. All GPM, TRMM and partner public data products are available to the science community and the general public from the TRMM/GPM FTP Data Archive. Please note that you need to register to be able to access these data. Registered users can also search for GPM, partner and TRMM data, order custom subsets and set up subscriptions using our PPS Data Products Ordering Interface (STORM).
The Ocean Biology Processing Group (OBPG) serves as the Distributed Active Archive Center (DAAC) for all Ocean Biology (OB) data produced or collected under NASA’s Earth Observing System Data and Information System (EOSDIS). This website thus serves as the primary data access portal to the NASA OB.DAAC. The links below provide a variety of methods to access the holdings of the OB.DAAC, including visual browsers that enable point-and-click access by data levels and direct access for bulk download. In agreement with partner organizations, some data access requires user registration to enable better tracking of usage metrics.
IEEE DataPort™ is an online data repository created and supported by both the IEEE Signal Processing Society and the IEEE Big Data Initiative, and it enables data owners to upload their data without cost. DataPort makes data available in three ways: standard, open access, and data competition. By default, all "standard" datasets that are uploaded are accessible to paid DataPort subscribers. Data owners can also pay a fee (APC) to make a dataset accessible through open access to non-subscribers. The third option is to host a "data competition" and make a dataset accessible for free for a limited duration, with instructions on the competition and how to participate. DataPort provides workflows for receiving entries and managing the competitions. All datasets are stored on Amazon AWS S3, and each dataset can be allotted up to 2 TB.
The aim of the ‘Bridge of Data’ project is to design and build a platform for collecting, searching, analyzing and sharing open research data, and to populate it with unique data collected from the three most important Pomeranian universities: Gdańsk University of Technology, Medical University of Gdańsk and the University of Gdańsk. These data will be made available free of charge to the scientific community, entrepreneurs and the public. A bridge will be built to allow reuse of Open Research Data (ORD). The available research data will be described using standards developed by dedicated, experienced scientific teams. The metadata will allow external computer systems to interpret the collected data. ORD descriptions will also include data reuse or reduction scenarios to facilitate further processing.
GTS AI is an artificial intelligence company that offers services to its clients. We use high-definition images and high-quality data for analysis and to support machine learning. We are a dataset provider, and we collect data for artificial intelligence applications.
UNAVCO promotes research by providing access to data that our community of geodetic scientists uses for quantifying the motions of rock, ice and water that are monitored by a variety of sensor types at or near the Earth's surface. After processing, these data enable millimeter-scale surface motion detection and monitoring at discrete points, and high-resolution strain imagery over areas of tens of square meters to hundreds of square kilometers. The data types include GPS/GNSS, imaging data such as from SAR and TLS, strain and seismic borehole data, and meteorological data. Most of these can be accessed via web services. In addition, GPS/GNSS datasets, TLS datasets, and InSAR products are assigned digital object identifiers.
The Infrared Space Observatory (ISO) is designed to provide detailed infrared properties of selected Galactic and extragalactic sources. The sensitivity of the telescopic system is about one thousand times superior to that of the Infrared Astronomical Satellite (IRAS), since the ISO telescope enables integration of infrared flux from a source for several hours. Density waves in the interstellar medium, its role in star formation, the giant planets, asteroids, and comets of the solar system are among the objects of investigation. ISO was operated as an observatory with the majority of its observing time being distributed to the general astronomical community. One of the consequences of this is that the data set is not homogeneous, as would be expected from a survey. The observational data underwent sophisticated data processing, including validation and accuracy analysis. In total, the ISO Data Archive contains about 30,000 standard observations, 120,000 parallel, serendipity and calibration observations and 17,000 engineering measurements. In addition to the observational data products, the archive also contains satellite data, documentation, data of historic aspects and externally derived products, for a total of more than 400 GBytes stored on magnetic disks. The ISO Data Archive is constantly being improved both in contents and functionality throughout the Active Archive Phase, ending in December 2006.
The Data Center of Geography was created in 2011 within the framework of the interdisciplinary structure of world data systems to ensure the gathering, processing and conversion of data and to solve fundamental and applied problems in the geographical sciences.
The Norwegian Marine Data Centre (NMD) at the Institute of Marine Research was established as a national data centre dedicated to the professional processing and long-term storage of marine environmental and fisheries data and the production of data products. The Institute of Marine Research continuously collects large amounts of data from all Norwegian seas. Data are collected using vessels, observation buoys, manual measurements, and gliders, among others. NMD maintains the largest collection of marine environmental and fisheries data in Norway.
The Atmospheric Science Data Center (ASDC) at NASA Langley Research Center is responsible for processing, archiving, and distribution of NASA Earth science data in the areas of radiation budget, clouds, aerosols, and tropospheric chemistry. The ASDC specializes in atmospheric data important to understanding the causes and processes of global climate change and the consequences of human activities on the climate.
The UCI Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms. It is used by students, educators, and researchers all over the world as a primary source of machine learning data sets. As an indication of the impact of the archive, it has been cited over 1000 times.
C3Grid (Collaborative Climate Community Data and Processing Grid) was a common initiative of institutions of the German climate community. The aim of the effort was to develop an infrastructure for uniform access to heterogeneous data and distributed data processing. The work was structured in two projects funded by the Federal Ministry of Education and Research. The first project was part of the D-Grid initiative; it explored the potential of grid technology for climate research and developed a prototype infrastructure. Details about the C3Grid architecture are described in “Earth System Modelling – Volume 6”. In the second phase, "C3Grid - INAD: Towards an Infrastructure for General Access to Climate Data", this infrastructure was improved, especially with respect to interoperability with the Earth System Grid Federation (ESGF). Furthermore, the portfolio of available diagnostic workflows was expanded. These workflows can now be re-used in adjacent infrastructures: in the MiKlip Evaluation Tool and as Web Processes within the Birdhouse Framework. The Birdhouse Framework is now funded as part of the European Copernicus Climate Change Service, managed by ECMWF, and will be extended to provide scalable processing services for ESGF-hosted data at DKRZ as well as IPSL and BADC.
The Centro de Documentação e Pesquisa Digital de Pesquisa (CEDAP) of the Universidade Federal do Rio Grande do Sul (UFRGS) aims to gather the scientific data used in research, classified as long syllable, in the various areas of knowledge. The Scientific Data Repository of Research of CEDAP gathers these data together with their documentation, in order to provide an environment for studying methodologies of use and reuse of research data. It is maintained in partnership with the Data Processing Center (CPD) of UFRGS for the development of policies, planning, management, description, evaluation, storage, dissemination and reuse of research data. Created in June 2017, the Research Data Repository meets data sharing needs in Brazil.
The Historical Data Centre Saxony-Anhalt was founded in 2008. Its main tasks are the computer-aided provision, processing and evaluation of historical research data, the development of theoretically consolidated normative data and vocabularies as well as the further development of methods in the context of digital humanities, research data management and quality assurance. The "Historical Data Centre Saxony-Anhalt" sees itself as a central institution for the data service of historical data in the federal state of Saxony-Anhalt and is thus part of a nationally and internationally linked infrastructure for long-term data storage and use. The Centre primarily acquires individual-specific microdata for the analysis of life courses, employment biographies and biographies (primarily quantitative, but also qualitative data), which offer a broad interdisciplinary and international analytical framework and meet clearly defined methodological and technical requirements. The studies are processed, archived and - in compliance with data protection and copyright conditions - made available to the scientifically interested public in accordance with internationally recognized standards. The degree of preparation depends on the type and quality of the study and on demand. Reference studies and studies in high demand are comprehensively documented - often in cooperation with primary researchers or experts - and summarized in data collections. The Historical Data Centre supports researchers in meeting the high demands of research data management. This includes the advisory support of the entire life cycle of data, starting with data production, documentation, analysis, evaluation, publication, long-term archiving and finally the subsequent use of data. 
In cooperation with other infrastructure facilities of the state of Saxony-Anhalt as well as national and international, interdisciplinary data repositories, the Data Centre provides tools and infrastructures for the publication and long-term archiving of research data. Together with the University and State Library of Saxony-Anhalt, the Data Centre operates its own data repository as well as special workstations for the digitisation and analysis of data. The Historical Data Centre aims to be a contact point for very different users of historical sources. We collect data relating to historical persons, events and historical territorial units.
The IDR makes datasets that have never previously been accessible publicly available, allowing the community to search, view, mine and even process and analyze large, complex, multidimensional life sciences image data. Sharing data promotes the validation of experimental methods and scientific conclusions, the comparison with new data obtained by the global scientific community, and enables data reuse by developers of new analysis and processing tools.
The NASA-funded OpenAltimetry provides advanced discovery, processing, and visualization services for ICESat and ICESat-2 altimeter data.
IRSA is chartered to curate the calibrated science products from NASA's infrared and sub-millimeter missions, including five major large-area/all-sky surveys. IRSA exploits a reusable architecture to deploy cost-effective archives for customers, including: the Spitzer Space Telescope; the 2MASS and IRAS all-sky surveys; and multi-mission datasets such as COSMOS, WISE and the Planck mission.
The FDZ-BO at DIW Berlin is a central archive for quantitative and qualitative operational and organizational data. It archives these data, provides information about their existence, and supplies datasets for secondary analysis. The archiving of studies and datasets ensures long-term security and long-term availability of the data. In consultation with the responsible scientists, access to individual datasets is made possible as scientific use files, via remote data processing or as part of guest stays. The FDZ-BO offers detailed information on current research projects and develops concepts for research data management of organizational data. The study portal (public since March 2019) provides an overview of existing studies in the field of business and organizational research: content, methodology, information on data and data availability, and information on how to gain access to the data.
The National Population Health Data Center (NPHDC) is one of the 20 national science data centers approved by the Ministry of Science and Technology and the Ministry of Finance. The Population Health Data Archive (PHDA) is developed by NPHDC, relying on the Institute of Medical Information, Chinese Academy of Medical Sciences. PHDA mainly receives scientific data from science and technology projects supported by the national budget, and also collects data from multiple other sources such as medical and health institutions, research institutions and individuals, in line with the national big data strategy and the Healthy China strategy. The data resources cover basic medicine, clinical medicine, public health, traditional Chinese medicine and pharmacy, pharmacy, population and reproduction. PHDA supports data collection, archiving, processing, storage, curation, verification, certification and release in the field of population health. It provides multiple types of data sharing and application services for users at different levels and helps them find, access, interoperate and reuse the data in a safe and controlled environment. PHDA provides important support for promoting the open sharing of population health scientific data and for domestic and international cooperation.
The Informatics Research Data Repository (IDR) is a Japanese data repository that collects data on disciplines within informatics. Sub-categories include consumerism and information diffusion. The primary data within these data sets come from experiments run by IDR on how one group is linked to another.
As of 2017-05-17 the data catalog is no longer available. DataFed is web services-based software that non-intrusively mediates between autonomous, distributed data providers and users. The main goals of DataFed are to: aid air quality management and science through effective use of relevant data; facilitate the access and flow of atmospheric data from providers to users; and support the development of user-driven data processing value chains. The DataFed Catalog links searchable DataFed applications worldwide.