  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) group terms to set precedence
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
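The operators listed above can be combined into a single query string. The following is a minimal sketch in Python that only assembles such strings; the helper names (`prefix`, `phrase`, `fuzzy`, `any_of`, `all_of`, `exclude`) are hypothetical and not part of any search API, and the search terms are made up for illustration.

```python
from typing import Optional


def prefix(stem: str) -> str:
    """Wildcard search: * at the end of a keyword."""
    return f"{stem}*"


def phrase(text: str, slop: Optional[int] = None) -> str:
    """Phrase search in quotes, optionally followed by ~N (slop amount)."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"


def fuzzy(word: str, distance: int) -> str:
    """Fuzzy search: ~N after a word sets the edit distance."""
    return f"{word}~{distance}"


def any_of(*terms: str) -> str:
    """OR search, wrapped in parentheses to set precedence."""
    return "(" + " | ".join(terms) + ")"


def all_of(*terms: str) -> str:
    """AND search (the default operator, written out explicitly as +)."""
    return " + ".join(terms)


def exclude(term: str) -> str:
    """NOT operation: a leading - negates the term."""
    return f"-{term}"


# Hypothetical query: (ocean* OR "climate data" with slop 2) AND NOT genomics (fuzzy, distance 1)
query = all_of(
    any_of(prefix("ocean"), phrase("climate data", slop=2)),
    exclude(fuzzy("genomics", 1)),
)
print(query)  # (ocean* | "climate data"~2) + -genomics~1
```

The resulting string is what would be typed into the search box; how the search backend interprets it is not shown here.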
Found 148 result(s)
CLARIN.SI is the Slovenian node of the European CLARIN (Common Language Resources and Technology Infrastructure) Centers. The CLARIN.SI repository is hosted at the Jožef Stefan Institute and offers long-term preservation of deposited linguistic resources, along with their descriptive metadata. The integration of the repository with the CLARIN infrastructure gives the deposited resources wide exposure, so that they can be known, used and further developed beyond the lifetime of the projects in which they were produced. Among the resources currently available in the CLARIN.SI repository are the multilingual MULTEXT-East resources, the CC version of the Slovenian reference corpus Gigafida, the morphological lexicon Sloleks, the IMP corpora and lexicons of historical Slovenian, as well as many other resources for a variety of languages. Furthermore, several REST-based web services are provided for different corpus-linguistic and NLP tasks.
GroupLens is a research lab in the Department of Computer Science and Engineering at the University of Minnesota, Twin Cities, specializing in recommender systems, online communities, mobile and ubiquitous technologies, digital libraries, and local geographic information systems.
The purpose of the Dataset Catalogue is to enhance discovery of GNS Science datasets. At a minimum, users will be able to determine whether a dataset on a specific topic exists and then whether it pertains to a specific place and/or a specific date or period. Some datasets include a web link to an online resource. In addition, contact details are provided for the custodian of each dataset as well as conditions of use.
ReDATA is the research data repository for the University of Arizona and a sister repository to the UA Campus Repository (which is intended for document-based materials). The UA Research Data Repository (ReDATA) serves as the institutional repository for non-traditional scholarly outputs resulting from research activities by University of Arizona researchers. Depositing research materials (datasets, code, images, videos, etc.) associated with published articles and/or completed grants and research projects into ReDATA helps UA researchers ensure compliance with funder and journal data sharing policies as well as University data retention policies. ReDATA is designed for materials intended for public availability.
The US Department of Energy’s Atmospheric Radiation Measurement (ARM) Data Center is a long-term archive and distribution facility for various ground-based, aerial and model data products in support of atmospheric and climate research. The ARM facility currently operates over 400 instruments at various observatories (https://www.arm.gov/capabilities/observatories/). The ARM Data Center (ADC) Archive currently holds over 11,000 data products, with a total holding of over 3 petabytes of data dating back to 1993. These include data from instruments, value-added products, model outputs, field campaigns and PI-contributed data. The data center archive also includes data collected by ARM from related programs (e.g., external data such as NASA satellite data).
The BioCyc database collection of Pathway/Genome Databases (PGDBs) provides a reference on the genomes and metabolic pathways of thousands of sequenced organisms. BioCyc PGDBs are generated by software that predicts the metabolic pathways of completely sequenced organisms, predicts which genes code for missing enzymes in metabolic pathways, and predicts operons. BioCyc also integrates information from other bioinformatics databases, such as protein feature and Gene Ontology information from UniProt. The BioCyc website provides a suite of software tools for database searching and visualization, for omics data analysis, and for comparative genomics and comparative pathway questions. From 2016 on, access to the EcoCyc and MetaCyc databases will remain free. Subscriptions to the other 7,600 BioCyc databases will be available to institutions (e.g., libraries) and to individuals. Access to licensed databases via: https://biocyc.org/Product-summary.shtml.
<<<!!!<<< The demand for high-value environmental data and information has dramatically increased in recent years. To improve our ability to meet that demand, NOAA’s three former data centers (the National Climatic Data Center, the National Geophysical Data Center, and the National Oceanographic Data Center, which includes the National Coastal Data Development Center) have merged into the National Centers for Environmental Information (NCEI). >>>!!!>>> NCEI is responsible for hosting and providing access to one of the most significant archives on Earth, with comprehensive oceanic, atmospheric, and geophysical data. From the depths of the ocean to the surface of the sun and from million-year-old sediment records to near real-time satellite images, NCEI is the Nation's leading authority for environmental information. NCEI, which hosts the World Data Service for Oceanography, is a national environmental data center operated by the National Oceanic and Atmospheric Administration (NOAA) of the U.S. Department of Commerce. The primary mission of the World Data Service for Oceanography is to ensure that global oceanographic datasets collected at great cost are maintained in a permanent archive that is easily and openly accessible to the world science community and to other users.
<<<!!!<<< The demand for high-value environmental data and information has dramatically increased in recent years. To improve our ability to meet that demand, NOAA’s former three data centers—the National Climatic Data Center, the National Geophysical Data Center, and the National Oceanographic Data Center, which includes the National Coastal Data Development Center—have merged into the National Centers for Environmental Information (NCEI). >>>!!!>>> The National Oceanographic Data Center includes the National Coastal Data Development Center (NCDDC) and the NOAA Central Library, which are integrated to provide access to the world's most comprehensive sources of marine environmental data and information. NODC maintains and updates a national ocean archive with environmental data acquired from domestic and foreign activities and produces products and research from these data which help monitor global environmental changes. These data include physical, biological and chemical measurements derived from in situ oceanographic observations, satellite remote sensing of the oceans, and ocean model simulations.
The SICAS Medical Image Repository (SMIR) is a freely accessible repository containing medical research data including medical images, surface models, clinical data, genomics data and statistical shape models. The data can be freely organized and shared on SMIR and made publicly accessible with a DOI. Dedicated data sets are organized as collections of anatomical regions (e.g. Cochlea). The data can be filtered using a modular search and accessed on the web or through the SMIR API.
The GTN-P database is an object-related database open for a diverse range of data. Because of the complexity of the PAGE21 project, data provided in the GTN-P management system are extremely diverse, ranging from active-layer thickness measurements once per year to flux measurements every second and everything else in between. The data can be assigned to two broad categories. Quantitative data are all data that can be measured numerically; they comprise all in situ measurements, i.e. permafrost temperatures and active layer thickness (mechanical probing, frost/thaw tubes, soil temperature profiles). Qualitative data (knowledge products) are observations not based on measurements, such as observations on soils, vegetation, relief, etc.
The Biological Collection Access Service for Europe, BioCASE, is a transnational network of biological collections of all kinds. BioCASE enables widespread unified access to distributed and heterogeneous European collection and observational databases using open-source, system-independent software and open data standards and protocols.
The National Weather Service, Fairbanks provides weather data relating to and observed in the Fairbanks, AK area. Data includes current, past, and future weather. Databases are organized primarily by the type of data (e.g. weather data, climate data, hydrology, warning/hazard alerts) and then are searchable by research location.
The University of Lodz Repository is an institutional repository whose purpose is to disseminate the scholarly output of its staff and promote research conducted at the UL. It is an archive of its own electronic documents – each deposited publication or collection of research data is given a permanent identifier (handle), enabling the document to be cited and indexed in scientific databases. The University of Lodz Repository operates as an institutional repository based on Regulation No. 51 of the Rector of the University of Lodz of 31 March 2015.
ChEMBL is a database of bioactive drug-like small molecules. It contains 2-D structures, calculated properties (e.g. logP, Molecular Weight, Lipinski Parameters, etc.) and abstracted bioactivities (e.g. binding constants, pharmacology and ADMET data). The data are abstracted and curated from the primary scientific literature, and cover a significant fraction of the SAR and discovery of modern drugs. We attempt to normalise the bioactivities into a uniform set of end-points and units where possible, and also to tag the links between a molecular target and a published assay with a set of varying confidence levels. Additional data on clinical progress of compounds are currently being integrated into ChEMBL.
The United States Census Bureau (officially the Bureau of the Census, as defined in Title 13 U.S.C. § 11) is the government agency that is responsible for the United States Census. It also gathers other national demographic and economic data. As a part of the United States Department of Commerce, the Census Bureau serves as a leading source of data about America's people and economy. The most visible role of the Census Bureau is to perform the official decennial (every 10 years) count of people living in the U.S. The most important result is the reallocation of the number of seats each state is allowed in the House of Representatives, but the results also affect a range of government programs received by each state. The agency director is a political appointee selected by the President of the United States.
The International Food Policy Research Institute (IFPRI) seeks sustainable solutions for ending hunger and poverty. In collaboration with institutions throughout the world, IFPRI is often involved in the collection of primary data and the compilation and processing of secondary data. The resulting datasets provide a wealth of information at the local (household and community), national, and global levels. IFPRI freely distributes as many of these datasets as possible and encourages their use in research and policy analysis. IFPRI Dataverse contains the following dataverses: Agricultural Science and Knowledge Indicators - ASTI, HarvestChoice, Statistics on Public Expenditures for Economic Development - SPEED, International Model for Policy Analysis of Agricultural Commodities and Trade - IMPACT, Africa RISING Dataverse and Food Security Portal Dataverse.
The Centre’s vision is a rural transformation in the developing world as smallholder households strategically increase their use of trees in agricultural landscapes to improve their food security, nutrition, income, health, shelter, social cohesion, energy resources and environmental sustainability. The Centre’s mission is to generate science-based knowledge about the diverse roles that trees play in agricultural landscapes, and to use its research to advance policies and practices, and their implementation, that benefit the poor and the environment.
<<<!!!<<< This is an archive of the old NEBC site from nebc.nerc.ac.uk and is no longer updated. For new information regarding NERC Environmental Omics and the Bio-Linux system please see the EOS site at http://environmentalomics.org. >>>!!!>>> Ongoing NEBC activities, including the development of Bio-Linux, are being moved into the new EOS programme http://environmentalomics.org/portfolio/big-data-infrastructure/ . Once the current material from this website has been moved into EOS, this NEBC site will remain on-line as an archive. EnvBase is the searchable index to the data deposited through the NEBC, as well as related NERC experimental data. At present this is chiefly from the grants funded by the NERC Environmental Genomics Science Programme and the subsequent Post-genomics and Proteomics Science Programme, but more data from ongoing projects continue to be added.
The Ensembl project produces genome databases for vertebrates and other eukaryotic species. Ensembl is a joint project between the European Bioinformatics Institute (EBI) and the Wellcome Trust Sanger Institute (WTSI) to develop a software system that produces and maintains automatic annotation on selected genomes. The Ensembl project was started in 1999, some years before the draft human genome was completed. Even at that early stage it was clear that manual annotation of 3 billion base pairs of sequence would not be able to offer researchers timely access to the latest data. The goal of Ensembl was therefore to automatically annotate the genome, integrate this annotation with other available biological data and make all this publicly available via the web. Since the website's launch in July 2000, many more genomes have been added to Ensembl and the range of available data has also expanded to include comparative genomics, variation and regulatory data. The EBI is an outstation of the European Molecular Biology Laboratory (EMBL). Both institutes are located on the Wellcome Trust Genome Campus in Hinxton, south of the city of Cambridge, United Kingdom.
<<<!!!<<< 2020-08-28; the repository is no longer available >>>!!!>>> The South African Data Archive promotes and facilitates the sharing of research data and related documentation of computerised raw quantitative data of large scale regional, national and international research projects mainly in the humanities and social sciences. It makes these datasets available to the research community for further analysis, comparative studies, longitudinal studies, teaching and decision-making purposes.