Filter
Reset all

Subjects

Content Types

Countries

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
  • 1 (current)
Found 15 result(s)
The Eurac Research CLARIN Centre (ERCC) is a dedicated repository for language data. It is hosted by the Institute for Applied Linguistics (IAL) at Eurac Research, a private research centre based in Bolzano, South Tyrol. The Centre is part of the Europe-wide CLARIN infrastructure, which means that it follows well-defined international standards for (meta)data and procedures and is well-embedded in the wider European Linguistics infrastructure. The repository hosts data collected at the IAL, but is also open for data deposits from external collaborators.
Content type(s)
Country
The Digital Collections include digitized manuscripts, prints, music, maps, photographs, newspapers and magazines from the rich holdings of the Bayerische Staatsbibliothek. Almost the entire content (>98%) is available for download for research purposes or via IIIF APIs, including all available OCR data.
AceView provides a curated, comprehensive and non-redundant sequence representation of all public mRNA sequences (mRNAs from GenBank or RefSeq, and single pass cDNA sequences from dbEST and Trace). These experimental cDNA sequences are first co-aligned on the genome then clustered into a minimal number of alternative transcript variants and grouped into genes. Using exhaustively and with high quality standards the available cDNA sequences evidences the beauty and complexity of mammals’ transcriptome, and the relative simplicity of the nematode and plant transcriptomes. Genes are classified according to their inferred coding potential; many presumably non-coding genes are discovered. Genes are named by Entrez Gene names when available, else by AceView gene names, stable from release to release. Alternative features (promoters, introns and exons, polyadenylation signals) and coding potential, including motifs, domains, and homologies are annotated in depth; tissues where expression has been observed are listed in order of representation; diseases, phenotypes, pathways, functions, localization or interactions are annotated by mining selected sources, in particular PubMed, GAD and Entrez Gene, and also by performing manual annotation, especially in the worm. In this way, both the anatomy and physiology of the experimentally cDNA supported human, mouse and nematode genes are thoroughly annotated.
A data repository and social network so that researchers can interact and collaborate, also offers tutorials and datasets for data science learning. "data.world is designed for data and the people who work with data. From professional projects to open data, data.world helps you host and share your data, collaborate with your team, and capture context and conclusions as you work."
This site is dedicated to making high value health data more accessible to entrepreneurs, researchers, and policy makers in the hopes of better health outcomes for all. In a recent article, Todd Park, United States Chief Technology Officer, captured the essence of what the Health Data Initiative is all about and why our efforts here are so important.
The NCBI Short Genetic Variations database, commonly known as dbSNP, catalogs short variations in nucleotide sequences from a wide range of organisms. These variations include single nucleotide variations, short nucleotide insertions and deletions, short tandem repeats and microsatellites. Short Genetic Variations may be common, thus representing true polymorphisms, or they may be rare. Some rare human entries have additional information associated withthem, including disease associations, genotype information and allele origin, as some variations are somatic rather than germline events. ***NCBI will phase out support for non-human organism data in dbSNP and dbVar beginning on September 1, 2017***
Spitzer is the final mission in NASA's Great Observatories Program - a family of four orbiting observatories, each observing the Universe in a different kind of light (visible, gamma rays, X-rays, and infrared). Spitzer is also a part of NASA's Astronomical Search for Origins Program, designed to provide information which will help us understand our cosmic roots, and how galaxies, stars and planets develop and form.
The NSIDC Distributed Active Archive Center (DAAC) processes, archives, documents, and distributes data from NASA's past and current Earth Observing System (EOS) satellites and field measurement programs. The NSIDC DAAC focuses on the study of the cryosphere. The NSIDC DAAC is one of NASA's Earth Observing System Data and Information System (EOSDIS) Data Centers.
The Online Data Portal (ODP) is an evolving project to support collaborative river restoration projects, such as the TRRP. The goal is to provide a centralized clearing house of documents and data for program partners, stakeholders, and the public. The functionality and data holdings will continue to be expanded over the next few years. The ability to store Data Packages is new as of Fall 2011 and holdings should expand substantially in the months afterward. A project to scan many older documents also began in December 2011. Simple time-series datasets have long been stored in the ODP, but holdings of these data are likely to increase as TRRP implements an upcoming Data Management and Utility Plan. Major upgrades to the Interactive Map are expected to start in winter and spring of 2012. The long term vision is that many data resources will be accessible both by text searches and via the Interactive Map. The ODP will be available for use by other river restoration programs. ODP is followed by TRRP DataPort.
Country
The Pig Expression Data Explorer (PEDE) database system stores full-length cDNA libraries of swine data accesible via keyword and ID searches. Data is publically available, and may specifically interest genetic researchers interested in disease sucsceptibly, and major and minor porcine specific antigens.
Country
CosmoHub is a web application based on Hadoop to perform interactive exploration and distribution of massive cosmological datasets