Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 179 result(s)
The 1000 Genomes Project is an international collaboration to produce an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts. This resource will support genome-wide association studies and other medical research studies. The genomes of about 2500 unidentified people from about 25 populations around the world will be sequenced using next-generation sequencing technologies. The results of the study will be freely and publicly accessible to researchers worldwide. The International Genome Sample Resource (IGSR) has been established at EMBL-EBI to continue supporting data generated by the 1000 Genomes Project, supplemented with new data and new analysis.
AlgaeBase is a database of information on algae that includes terrestrial, marine and freshwater organisms. At present, the data for the marine algae, particularly seaweeds, are the most complete.
The Allele Frequency Net Database (AFND) is a public database which contains frequency information of several immune genes such as Human Leukocyte Antigens (HLA), Killer-cell Immunoglobulin-like Receptors (KIR), Major histocompatibility complex class I chain-related (MIC) genes, and a number of cytokine gene polymorphisms. The Allele Frequency Net Database (AFND) provides a central source, freely available to all, for the storage of allele frequencies from different polymorphic areas in the Human Genome. Users can contribute the results of their work into one common database and can perform database searches on information already available. We have currently collected data in allele, haplotype and genotype format. However, the success of this website will depend on you to contribute your data.
Content type(s)
While focused on supporting the scientific community, ATCC activities range widely, from repository-related operations to providing specialized services, conducting in-house R&D and intellectual property management. ATCC serves U.S. and international researchers by characterizing cell lines, bacteria, viruses, fungi and protozoa, as well as developing and evaluating assays and techniques for validating research resources and preserving and distributing biological materials to the public and private sector research communities. Our management philosophy emphasizes customer satisfaction, value addition, cost-effective operations and competitive benchmarking for all areas of our enterprise.
Apollo (previously DSpace@Cambridge) is the University of Cambridge’s Institutional Repository (IR), preserving and providing access to content created by members of the University. The repository stores a range of content and provides different levels of access, but its primary focus is on providing open access to the University’s research publications.
The ADS is an accredited digital repository for heritage data that supports research, learning and teaching with freely available, high quality and dependable digital resources by preserving and disseminating digital data in the long term. The ADS also promotes good practice in the use of digital data, provides technical advice to the heritage community, and supports the deployment of digital technologies.
ArrayExpress is one of the major international repositories for high-throughput functional genomics data from both microarray and high-throughput sequencing studies, many of which are supported by peer-reviewed publications. Data sets are submitted directly to ArrayExpress and curated by a team of specialist biological curators. In the past (until 2018) datasets from the NCBI Gene Expression Omnibus database were imported on a weekly basis. Data is collected to MIAME and MINSEQE standards.
Aston Data Explorer is Aston University's repository for our research datasets. It is one of three services providing information about Aston University’s research. Aston Publications Explorer holds Aston's Open Access publications and Aston Research Explorer has broader information about Aston's research work including research staff, awards and activities, projects and research groups.
ALSPAC is a longitudinal birth cohort study which enrolled pregnant women who were resident in one of three Bristol-based health districts in the former County of Avon with an expected delivery date between 1st April 1991 and 31st December 1992. Around 14,000 pregnant women were initially recruited. Detailed information has been collected on these women, their partners and subsequent children using self-completion questionnaires, data extraction from medical notes, linkage to routine information systems and from hands-on research clinics. Additional cohorts of participants have since been enrolled in their own right including fathers, siblings, children of the children and grandparents of the children. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee (IRB00003312) and Local Research Ethics.
BOARD (Bicocca Open Archive Research Data) is the institutional data repository of the University of Milano-Bicocca. BOARD is an open, free-to-use research data repository, which enables members of University of Milano-Bicocca to make their research data publicly available. By depositing their research data in BOARD researchers can: - Make their research data citable - Share their data privately or publicly - Ensure long-term storage for their data - Keep access to all versions - Link their article to their data
Antarctic marine and terrestrial biodiversity data is widely scattered, patchy and often not readily accessible. In many cases the data is in danger of being irretrievably lost. Biodiversity.aq establishes and supports a distributed system of interoperable databases, giving easy access through a single internet portal to a set of resources relevant to research, conservation and management pertaining to Antarctic biodiversity. biodiversity.aq provides access to both marine and terrestrial Antarctic biodiversity data.
The BioImage Archive stores and distributes life sciences imaging datasets. It supports deposition of biological imaging data associated with publications for the whole research community, as well as reference imaging datasets. All data deposited to the BioImage Archive is made openly accessible to the scientific community.
This site offers an enormous collection of photographs of wild species and natural history objects. It covers most groups of organisms with the exception of birds and other vertebrates. The photographs are presented to illustrate biodiversity and as an aid to identification. The criterion for inclusion of a species is that it must have been, or might be expected to be, found in Britain or Ireland. BioImages follows the biological classification. Biota is a hierarchical system with species grouped in genera, genera in families, families in orders and so on up to kingdoms and superkingdoms. The datasets are linked to bioinfo: food webs and species interactions in the Biodiversity of UK and Ireland.
BioModels is a repository of mathematical models of biological and biomedical systems. It hosts a vast selection of existing literature-based physiologically and pharmaceutically relevant mechanistic models in standard formats. Our mission is to provide the systems modelling community with reproducible, high-quality, freely-accessible models published in the scientific literature.
Born in Bradford is one of the biggest and most important medical research studies undertaken in the UK. The project started in 2007 and is looking to answer questions about our health by tracking the lives of 13,500 babies and their families and will provide information for studies across the UK and around the world. The aim of Born in Bradford is to find out more about the causes of childhood illness by studying children from all cultures and backgrounds as their lives unfold.
Content type(s)
The British Ocean Sediment Core Research Facility (BOSCORF) is based at the Southampton site of the National Oceanography Centre and is Britain’s national deep-sea core repository. BOSCORF is responsible for long-term storage and curation of sediment cores collected through UKRI-NERC research programmes. We promote secondary usage of sediment core samples and analytical data relating to the sample collection.
The British Oceanographic Data Centre (BODC) is a national facility for looking after and distributing data concerning the marine environmentWe deal with biological, chemical, physical and geophysical data, and our databases contain measurements of nearly 22,000 different variables. Many of our staff have direct experience of marine data collection and analysis. They work alongside information technology specialists to ensure that data are documented and stored for current and future use.
The DMC is designed to provide registered users with access to non-confidential petroleum exploration and production data from offshore Nova Scotia, subject to certain conditions. The DMC is housed in the CNSOPB's Geoscience Research Centre located in Dartmouth, Nova Scotia. Initially, the DMC will manage and distribute the following digital petroleum data: well data (i.e. logs and reports), seismic image files (e.g. TIFF, PDF), and production data. In the future the DMC could be expanded to include operational, safety, environmental, fisheries data, etc.
The Centre for the Environment, Fisheries and Aquaculture Science (Cefas), as one of the world's longest-established marine research organisations, has provided advice on the sustainable exploitation of marine resources since 1902. Today Cefas works in support of a healthy environment and a growing blue economy providing innovative solutions for the aquatic environment, biodiversity and food security. The Cefas Data Hub provides access to over 2080 metadata records, with over 5500 data sets available to download and connect to in support of commitments to Open Science through the Data Portal. Datasets available are increasingly diverse and include many legacy datasets including those from fish, shellfish and plankton surveys from the 1980's to the present day. Other increasingly international datasets made available include species migration data from tagging activities and data on habitat and sediment, ecosystem change, human activities including marine litter, otolith sampling and fish stomach contents, oceanography, acoustics, health and water quality. Data is provided under Open Government License by default where feasible.
-----<<<<< The repository is no longer available. This record is out-dated. The Matter lab provides the archived database version of 2012 and 2013 at https://www.matter.toronto.edu/basic-content-page/data-download. Data linked from the World Community Grid - The Clean Energy Project see at https://www.worldcommunitygrid.org/research/cep1/overview.do and on fighshare https://figshare.com/articles/dataset/moldata_csv/9640427 >>>>>----- The Clean Energy Project Database (CEPDB) is a massive reference database for organic semiconductors with a particular emphasis on photovoltaic applications. It was created to store and provide access to data from computational as well as experimental studies, on both known and virtual compounds. It is a free and open resource designed to support researchers in the field of organic electronics in their scientific pursuits. The CEPDB was established as part of the Harvard Clean Energy Project (CEP), a virtual high-throughput screening initiative to identify promising new candidates for the next generation of carbon-based solar cell materials.
ChEMBL is a database of bioactive drug-like small molecules, it contains 2-D structures, calculated properties (e.g. logP, Molecular Weight, Lipinski Parameters, etc.) and abstracted bioactivities (e.g. binding constants, pharmacology and ADMET data). The data is abstracted and curated from the primary scientific literature, and cover a significant fraction of the SAR and discovery of modern drugs We attempt to normalise the bioactivities into a uniform set of end-points and units where possible, and also to tag the links between a molecular target and a published assay with a set of varying confidence levels. Additional data on clinical progress of compounds is being integrated into ChEMBL at the current time.
Chempound is a new generation repository architecture based on RDF, semantic dictionaries and linked data. It has been developed to hold any type of chemical object expressible in CML and is exemplified by crystallographic experiments and computational chemistry calculations. In both examples, the repository can hold >50k entries which can be searched by SPARQL endpoints and pre-indexing of key fields. The Chempound architecture is general and adaptable to other fields of data-rich science. The Chempound software is hosted at http://bitbucket.org/chempound and is available under the Apache License, Version 2.0
ChemSpider is a free chemical structure database providing fast access to over 58 million structures, properties and associated information. By integrating and linking compounds from more than 400 data sources, ChemSpider enables researchers to discover the most comprehensive view of freely available chemical data from a single online search. It is owned by the Royal Society of Chemistry. ChemSpider builds on the collected sources by adding additional properties, related information and links back to original data sources. ChemSpider offers text and structure searching to find compounds of interest and provides unique services to improve this data by curation and annotation and to integrate it with users’ applications.