Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
  • 1 (current)
Found 23 result(s)
Country
The World Atlas of Language Structures (WALS) is a large database of structural (phonological, grammatical, lexical) properties of languages gathered from descriptive materials (such as reference grammars) by a team of 55 authors (many of them the leading authorities on the subject).
The RIT DML captures, distributes and preserves RIT's digital products. Here you can find articles, working papers, preprints, technical reports, conference papers and data sets in various digital formats.
The Mikulski Archive for Space Telescopes (MAST) is a NASA funded project to support and provide to the astronomical community a variety of astronomical data archives, with the primary focus on scientifically related data sets in the optical, ultraviolet, and near-infrared parts of the spectrum. MAST is located at the Space Telescope Science Institute (STScI).
Country
DataverseNO (https://dataverse.no) is a curated, FAIR-aligned national generic repository for open research data from all academic disciplines. DataverseNO commits to facilitate that published data remain accessible and (re)usable in a long-term perspective. The repository is owned and operated by UiT The Arctic University of Norway. DataverseNO accepts submissions from researchers primarily from Norwegian research institutions. Datasets in DataverseNO are grouped into institutional collections as well as special collections. The technical infrastructure of the repository is based on the open source application Dataverse (https://dataverse.org), which is developed by an international developer and user community led by Harvard University.
The Cancer Genome Atlas (TCGA) Data Portal provides a platform for researchers to search, download, and analyze data sets generated by TCGA. It contains clinical information, genomic characterization data, and high level sequence analysis of the tumor genomes. The Data Coordinating Center (DCC) is the central provider of TCGA data. The DCC standardizes data formats and validates submitted data.
Country
jPOSTrepo (Japan ProteOme STandard Repository) is a repository of sharing MS raw/processed data. It consists of a high-speed file upload process, flexible file management system and easy-to-use interfaces. Users can release their "raw/processed" data via this site with a unique identifier number for the paper publication. Users also can suspend (or "embargo") their data until their paper is published. The file transfer from users’ computer to our repository server is very fast (roughly ten times faster than usual file transfer) and uses only web browsers – it does not require installing any additional software.
The Southern California Earthquake Data Center (SCEDC) operates at the Seismological Laboratory at Caltech and is the primary archive of seismological data for southern California. The 1932-to-present Caltech/USGS catalog maintained by the SCEDC is the most complete archive of seismic data for any region in the United States. Our mission is to maintain an easily accessible, well-organized, high-quality, searchable archive for research in seismology and earthquake engineering.
Country
The GAVO data centre at Zentrum für Astronomie Heidelberg publishes astronomical data of all kinds – e.g., catalogues, images, spectra, time series, simulation results – in accordance with Virtual Observatory standards, making them findable and immediately usable through popular clients like TOPCAT, Aladin, or programatically through the astropy-affiliated package pyVO or the Java library STIL. We pay particular attention to providing thorough metadata to the VO Registry in order to facilitate discovery and reuse. While we have a clear focus on data produced with German contributions, we will usually publish data of other provenance, too. See https://docs.g-vo.org/DaCHS/data_checklist.html for an overview of what resource-level metadata we ask for; contact us for further information on how to publish through the German Astronomical Virtual Observatory.
ANPERSANA is the digital library of IKER (UMR 5478), a research centre specialized in Basque language and texts. The online library platform receives and disseminates primary sources of data issued from research in Basque language and culture. As of today, two corpora of documents have been published. The first one, is a collection of private letters written in an 18th century variety of Basque, documented in and transcribed to modern standard Basque. The discovery of the collection, named Le Dauphin, has enabled the emerging of new questions about the history and sociology of writing in the domain of minority languages, not only in France, but also among the whole Atlantic Arc. The second of the two corpora is a selection of sound recordings about monodic chant in the Basque Country. The documents were collected as part of a PhD thesis research work that took place between 2003 and 2012. It's a total of 50 hours of interviews with francophone and bascophone cultural representatives carried out at either their workplace of the informers or in public areas. ANPERSANA is bundled with an advanced search engine. The documents have been indexed and geo-localized on an interactive map. The platform is engaged with open access and all the resources can be uploaded freely under the different Creative Commons (CC) licenses.
OpenWorm aims to build the first comprehensive computational model of the Caenorhabditis elegans (C. elegans), a microscopic roundworm. With only a thousand cells, it solves basic problems such as feeding, mate-finding and predator avoidance. Despite being extremely well studied in biology, this organism still eludes a deep, principled understanding of its biology. We are using a bottom-up approach, aimed at observing the worm behaviour emerge from a simulation of data derived from scientific experiments carried out over the past decade. To do so we are incorporating the data available in the scientific community into software models. We are engineering Geppetto and Sibernetic, open-source simulation platforms, to be able to run these different models in concert. We are also forging new collaborations with universities and research institutes to collect data that fill in the gaps All the code we produce in the OpenWorm project is Open Source and available on GitHub.
Country
DIAS aims at collecting and storing earth observation data; analyzing such data in combination with socio-economic data, and converting data into information useful for crisis management with respect to global-scale environmental disasters, and other threats; and to make this information available within Japan and overseas.
Country
Biodiversity Maps provides access to high quality information on Ireland's biological diversity. Use the system to find out what is known about the different species that occur in Ireland, where our protected and threatened species occur, and who is recoding biodiversity. Also find out what is known about the biodiversity of your locality. The National Biodiversity Data Centre endeavours to provide high quality information through this data portal.
Country
The UTM Data Centre is responsible for managing spatial data acquired during oceanographic cruises on board CSIC research vessels (RV Sarmiento de Gamboa, RV García del Cid) and RV Hespérides. The aim is, on the one hand, to disseminate which data exist and where, how and when they have been acquired. And on the other hand, to provide access to as much of the interoperable data as possible, following the FAIR principles, so that they can be used and reused. For this purpose, the UTM has a Spatial Data Infrastructure at a national level that consists of several services: Oceanographic Cruise and Data Catalogue Including metadata from more than 600 cruises carried out since 1991, with links to documentation associated to the cruise, navigation maps and datasets Geoportal Geospatial data mapping interface Underway Plot & QC Visualization, Quality Control and conversion to standard format of meteorological data and temperature and salinity of surface water At an international level, the UTM is a National Oceanographic Data Centre (NODC) of the Distributed European Marine Data Infrastructure SeaDataNet, to which the UTM provides metadata published in the Cruise Summary Report Catalog and in the data catalog Common Data Index Catalog, as well as public data to be shared.
The Tromsø Repository of Language and Linguistics (TROLLing) is a FAIR-aligned repository of linguistic data and statistical code. The archive is open access, which means that all information is available to everyone. All data are accompanied by searchable metadata that identify the researchers, the languages and linguistic phenomena involved, the statistical methods applied, and scholarly publications based on the data (where relevant). Linguists worldwide are invited to deposit data and statistical code used in their linguistic research. TROLLing is a special collection within DataverseNO (http://doi.org/10.17616/R3TV17), and C Centre within CLARIN (Common Language Resources and Technology Infrastructure, a networked federation of European data repositories; http://www.clarin.eu/), and harvested by their Virtual Language Observatory (VLO; https://vlo.clarin.eu/).
Europeana is the trusted source of cultural heritage brought to you by the Europeana Foundation and a large number of European cultural institutions, projects and partners. It’s a real piece of team work. Ideas and inspiration can be found within the millions of items on Europeana. These objects include: Images - paintings, drawings, maps, photos and pictures of museum objects Texts - books, newspapers, letters, diaries and archival papers Sounds - music and spoken word from cylinders, tapes, discs and radio broadcasts Videos - films, newsreels and TV broadcasts All texts are CC BY-SA, images and media licensed individually.
The Arabidopsis Information Resource (TAIR) maintains a database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana . Data available from TAIR includes the complete genome sequence along with gene structure, gene product information, metabolism, gene expression, DNA and seed stocks, genome maps, genetic and physical markers, publications, and information about the Arabidopsis research community. Gene product function data is updated every two weeks from the latest published research literature and community data submissions. Gene structures are updated 1-2 times per year using computational and manual methods as well as community submissions of new and updated genes. TAIR also provides extensive linkouts from our data pages to other Arabidopsis resources.
The World Register of Marine Species (WoRMS) integrates approximately 100 marine datbases to provide an authoritative and comprehensive list of marine organisms. WoRMS has an editorial system where taxonomic groups are managed by experts responsible for the quality of the information. WorMS register of marine species emerged from the European Register of Marine Species (ERMS) and the Flanders Marine Institute (VLIZ). WoRMS is a contribution to Lifewatch, Catalogue of Life, Encyclopedia of Life, Global Biodiversity Information Facility and the Census of Marine Life.
The 1000 Genomes Project is an international collaboration to produce an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts. This resource will support genome-wide association studies and other medical research studies. The genomes of about 2500 unidentified people from about 25 populations around the world will be sequenced using next-generation sequencing technologies. The results of the study will be freely and publicly accessible to researchers worldwide. The International Genome Sample Resource (IGSR) has been established at EMBL-EBI to continue supporting data generated by the 1000 Genomes Project, supplemented with new data and new analysis.
Country
The eAtlas is a website, mapping system and set of data visualisation tools for presenting research data in an accessible form that promotes greater use of this information. The eAtlas will serve as the primary data and knowledge repository for all NERP Tropical Ecosystems Hub projects, which focus on the on the Great Barrier Reef, Wet Tropics rainforest and Torres Strait. The eAtlas will capture and record research outcomes and make them available to research-users in a timely, readily accessible manner. It will host meta-data records and provide an enduring repository for raw data. It will also develop and host web visualisations to view information using a simple and intuitive interface. This will assist scientists with data discovery and allow environmental managers to access and investigate research data.
<<<!!!<<< This repository is no longer available. >>>!!!>>> In 2016, NSIDC partnered with the United States Antarctic Program - Data Center (USAP-DC) at Columbia University to consolidate NSF glaciology data into a central USAP Project Catalog and a Data Repository for research datasets derived from these projects. From 2016 to 2018, the AGDC data sets were transferred to USAP-DC. All AGDC data previously archived with NSIDC are now available via the USAP-DC https://www.re3data.org/repository/r3d100010660.
The National Weather Service, Fairbanks provides weather data relating to and observed in the Fairbanks, AK area. Data includes current, past, and future weather. Databases are organized primarily by the type of data (e.g. weather data, climate data, hydrology, warning/hazard alerts) and then are searchable by research location.
<<<!!!<<<This is an archive of the old NEBC site from nebc.nerc.ac.uk and is no longer updated. For new information regarding NERC Environmental Omics and the Bio-Linux system please see the EOS site at http://environmentalomics.org. >>>!!!>>> Ongoing NEBC activities, including the development of Bio-Linux, are being moved into the new EOS programme http://environmentalomics.org/portfolio/big-data-infrastructure/ . Once the current material from this website has been moved into EOS, this NEBC site will remain on-line as an archive. EnvBase is the searchable index to the data deposited through the NEBC, as well as related NERC experimental data. At present this is chiefly from the grants funded by the NERC Environmental Genomics Science Programme and the subsequent Post-genomics and Proteomics Science Programme, but more data from ongoing projects continues to be added