Repository types


NASA has officially launched a new resource to help the public search and download out-of-this-world images, videos and audio files by keyword and metadata searches. The NASA Image and Video Library website consolidates imagery spread across more than 60 collections into one searchable location. The NASA Image and Video Library allows users to search, discover and download a treasure trove of more than 140,000 NASA images, videos and audio files from across the agency’s many missions in aeronautics, astrophysics, Earth science, human spaceflight, and more. Users can browse the agency’s most recently uploaded files, as well as discover historic and the most popularly searched images, audio files and videos. Other features include: an interface that automatically scales for mobile phones and tablets; display of the EXIF/camera data, including exposure, lens used, and other information, when available from the original image; easy public access to high-resolution files; and a downloadable caption file for every video. The NASA Image and Video Library’s Application Programmers Interface (API) allows automation of imagery uploads for NASA, and gives members of the public the ability to embed content in their own sites and applications. This public site runs on NASA’s cloud-native “infrastructure-as-a-code” technology, enabling on-demand use in the cloud.
UCL Discovery is UCL's open access repository, showcasing and providing access to UCL research publications. UCL Discovery accepts small-scale datasets associated with publications.
Sinmin contains texts of different genres and styles of the modern and old Sinhala language. The main sources of electronic copies of texts for the corpus are online Sinhala newspapers, online Sinhala news sites, Sinhala school textbooks available online, online Sinhala magazines, Sinhala Wikipedia, Sinhala fiction available online, Mahawansa, Sinhala blogs, Sinhala subtitles and the Sri Lankan gazette.
The Ensembl project produces genome databases for vertebrates and other eukaryotic species. Ensembl is a joint project between the European Bioinformatics Institute (EBI) and the Wellcome Trust Sanger Institute (WTSI) to develop a software system that produces and maintains automatic annotation on selected genomes. The Ensembl project was started in 1999, some years before the draft human genome was completed. Even at that early stage it was clear that manual annotation of 3 billion base pairs of sequence would not be able to offer researchers timely access to the latest data. The goal of Ensembl was therefore to automatically annotate the genome, integrate this annotation with other available biological data and make all this publicly available via the web. Since the website's launch in July 2000, many more genomes have been added to Ensembl and the range of available data has also expanded to include comparative genomics, variation and regulatory data. The EBI is an outstation of the European Molecular Biology Laboratory (EMBL); both institutes are located on the Wellcome Trust Genome Campus in Hinxton, south of the city of Cambridge, United Kingdom.
ChEMBL is a database of bioactive drug-like small molecules. It contains 2-D structures, calculated properties (e.g. logP, Molecular Weight, Lipinski Parameters, etc.) and abstracted bioactivities (e.g. binding constants, pharmacology and ADMET data). The data is abstracted and curated from the primary scientific literature, and covers a significant fraction of the SAR and discovery of modern drugs. We attempt to normalise the bioactivities into a uniform set of end-points and units where possible, and also to tag the links between a molecular target and a published assay with a set of varying confidence levels. Additional data on the clinical progress of compounds is currently being integrated into ChEMBL.
AmphibiaWeb is an online system enabling any user to search and retrieve information relating to amphibian biology and conservation. This site was motivated by the global declines of amphibians, the study of which has been hindered by the lack of multidisciplinary studies and a lack of coordination in monitoring, in field studies, and in lab studies. We hope AmphibiaWeb will encourage a shared vision to collaboratively face the challenge of global amphibian declines and the conservation of remaining amphibians and their habitats.
CLARIN-UK is a consortium of centres of expertise involved in research and resource creation involving digital language data and tools. The consortium includes the national library, and academic departments and university centres in linguistics, languages, literature and computer science.
The Cancer in Young People in Canada (CYP-C) surveillance program collects in-depth data concerning risk factors, health outcomes, quality and accessibility of care, and late effects among children and youth with cancer. CYP-C represents a collaboration involving the C17 Council, the Canadian Partnership Against Cancer (CPAC), the Public Health Agency of Canada (PHAC), provincial and territorial cancer registries, Statistics Canada and non-governmental organizations.
Founded in May 2000, the BDEP stores, organizes and makes available geophysical, geological and geochemical information. Once processed and analyzed, the data help identify the areas of sedimentary basins with a higher probability of containing oil and natural gas. Acquiring and managing this collection secures Brazil's command of the knowledge generated about its hydrocarbon potential.
SwissLipids is an expert-curated resource that provides a framework for the integration of lipid and lipidomic data with biological knowledge and models.
InnateDB is a publicly available database of the genes, proteins, experimentally-verified interactions and signaling pathways involved in the innate immune response of humans, mice and bovines to microbial infection. The database captures an improved coverage of the innate immunity interactome by integrating known interactions and pathways from major public databases together with manually-curated data into a centralised resource. The database can be mined as a knowledgebase or used with our integrated bioinformatics and visualization tools for the systems level analysis of the innate immune response.
InterPro collects information about protein sequence analysis and classification, providing access to a database of predictive protein signatures used for the classification and automatic annotation of proteins and genomes. Sequences in InterPro are classified at the superfamily, family, and subfamily levels. InterPro predicts the occurrence of functional domains, repeats, and important sites, and adds in-depth annotation such as GO terms to the protein signatures.
Produced with the Terrestrial Systems Modeling Platform (TerrSysMP), this dataset consists of the first simulated long-term (1989-2018), high-resolution (~12.5 km) terrestrial system climatology over Europe, which comprises variables from groundwater across the land surface to the top of the atmosphere (G2A). The dataset constitutes a near-natural realization of the European terrestrial system, which cannot be obtained from observations, and can thus serve as a reference for global change simulations including human water use and climate change.
The Korea Polar Data Center (KPDC) is an organization dedicated to managing the different types of data acquired during the scientific research that South Korea carries out in the Antarctic and Arctic regions. South Korea, as an Antarctic Treaty Consultative Party (ATCP) and an accredited member of the Scientific Committee on Antarctic Research (SCAR), established the Center in 2003 as part of its effort to join international Antarctic research.
Alzforum is an independent research project to develop an online community resource to manage scientific knowledge, information, and data about Alzheimer disease (AD).
BacMap is a picture atlas of annotated bacterial genomes. It is an interactive visual database containing hundreds of fully labeled, zoomable, and searchable maps of bacterial genomes.
In collaboration with other centres in the CLARIN-D consortium, the UdS CLARIN-D centre enables eHumanities by providing a service for hosting and processing language resources (notably corpora) for members of the research community. The UdS CLARIN-D centre thus contributes to overcoming the fragmentation of language resources by assisting members of the research community in preparing language materials in such a way that easy discovery is ensured, interchange is facilitated and preservation is enabled, by enriching such materials with meta-information, transforming them into sustainable formats and hosting them. We have an explicit mission to archive language resources, especially multilingual corpora (parallel, comparable) and corpora including specific registers, collected both by associated researchers and by researchers who are not affiliated with us.