Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 28 result(s)
OrthoMCL is a genome-scale algorithm for grouping orthologous protein sequences. It provides not only groups shared by two or more species/genomes, but also groups representing species-specific gene expansion families. So it serves as an important utility for automated eukaryotic genome annotation. OrthoMCL starts with reciprocal best hits within each genome as potential in-paralog/recent paralog pairs and reciprocal best hits across any two genomes as potential ortholog pairs. Related proteins are interlinked in a similarity graph. Then MCL (Markov Clustering algorithm,Van Dongen 2000; www.micans.org/mcl) is invoked to split mega-clusters. This process is analogous to the manual review in COG construction. MCL clustering is based on weights between each pair of proteins, so to correct for differences in evolutionary distance the weights are normalized before running MCL.
Reactome is a manually curated, peer-reviewed pathway database, annotated by expert biologists and cross-referenced to bioinformatics databases. Its aim is to share information in the visual representations of biological pathways in a computationally accessible format. Pathway annotations are authored by expert biologists, in collaboration with Reactome editorial staff and cross-referenced to many bioinformatics databases. These include NCBI Gene, Ensembl and UniProt databases, the UCSC and HapMap Genome Browsers, the KEGG Compound and ChEBI small molecule databases, PubMed, and Gene Ontology.
EnsemblPlants is a genome-centric portal for plant species. Ensembl Plants is developed in coordination with other plant genomics and bioinformatics groups via the EBI's role in the transPLANT consortium.
The Museum is committed to open access and open science, and has launched the Data Portal to make its research and collections datasets available online. It allows anyone to explore, download and reuse the data for their own research. Our natural history collection is one of the most important in the world, documenting 4.5 billion years of life, the Earth and the solar system. Almost all animal, plant, mineral and fossil groups are represented. These datasets will increase exponentially. Under the Museum's ambitious digital collections programme we aim to have 20 million specimens digitised in the next five years.
GENCODE is a scientific project in genome research and part of the ENCODE (ENCyclopedia Of DNA Elements) scale-up project. The GENCODE consortium was initially formed as part of the pilot phase of the ENCODE project to identify and map all protein-coding genes within the ENCODE regions (approx. 1% of Human genome). Given the initial success of the project, GENCODE now aims to build an “Encyclopedia of genes and genes variants” by identifying all gene features in the human and mouse genome using a combination of computational analysis, manual annotation, and experimental validation, and annotating all evidence-based gene features in the entire human genome at a high accuracy.
Launched in 2000, WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of C. elegans and some related nematodes. In addition to their curation work, all sites have ongoing programs in bioinformatics research to develop the next generations of WormBase structure, content and accessibility
CDC.gov is the Centers for Disease Control and Prevention primary online communication channel. CDC.gov provides users with credible, reliable health information on Data and Statistics, Diseases and Conditions, Emergencies and Disasters, Environmental Health, Healthy Living, Injury, Violence and Safety,Life Stages and Populations, Travelers' Health, Workplace Safety and Health
<<<!!!<<< This repository is no longer available. >>>!!!>>> BioVeL is a virtual e-laboratory that supports research on biodiversity issues using large amounts of data from cross-disciplinary sources. BioVeL supports the development and use of workflows to process data. It offers the possibility to either use already made workflows or create own. BioVeL workflows are stored in MyExperiment - Biovel Group http://www.myexperiment.org/groups/643/content. They are underpinned by a range of analytical and data processing functions (generally provided as Web Services or R scripts) to support common biodiversity analysis tasks. You can find the Web Services catalogued in the BiodiversityCatalogue.
The Centre for the Environment, Fisheries and Aquaculture Science (Cefas), as one of the world's longest-established marine research organisations, has provided advice on the sustainable exploitation of marine resources since 1902. Today Cefas works in support of a healthy environment and a growing blue economy providing innovative solutions for the aquatic environment, biodiversity and food security. The Cefas Data Hub provides access to over 2080 metadata records, with over 5500 data sets available to download and connect to in support of commitments to Open Science through the Data Portal. Datasets available are increasingly diverse and include many legacy datasets including those from fish, shellfish and plankton surveys from the 1980's to the present day. Other increasingly international datasets made available include species migration data from tagging activities and data on habitat and sediment, ecosystem change, human activities including marine litter, otolith sampling and fish stomach contents, oceanography, acoustics, health and water quality. Data is provided under Open Government License by default where feasible.
<<<!!!<<< As of 2023, support to maintain the www.modencode.org and intermine.modencode.org sites have been retired following the end of funding. To access data from the modENCODE project, or for questions regarding the data they make available, please visit these databases: Fly data: FlyBase: ModENCODE data at FlyBase: https://wiki.flybase.org/wiki/FlyBase:ModENCODE_data_at_FlyBase FlyBase: https://www.re3data.org/repository/r3d100010591 Worm data: WormBase https://www.re3data.org/repository/r3d100010424 Data, including modENCODE and modERN project data, is also available at the ENCODE Portal: https://www.re3data.org/repository/r3d100013051 (search metadata and view datasets for Drosophila and Caenorhabditis https://www.encodeproject.org/matrix/?type=Experiment&control_type!=*&status=released&replicates.library.biosample.donor.organism.scientific_name=Drosophila+melanogaster&replicates.library.biosample.donor.organism.scientific_name=Caenorhabditis+elegans&replicates.library.biosample.donor.organism.scientific_name=Drosophila+pseudoobscura&replicates.library.biosample.donor.organism.scientific_name=Drosophila+mojavensis). >>>!!!>>>
ArrayExpress is one of the major international repositories for high-throughput functional genomics data from both microarray and high-throughput sequencing studies, many of which are supported by peer-reviewed publications. Data sets are submitted directly to ArrayExpress and curated by a team of specialist biological curators. In the past (until 2018) datasets from the NCBI Gene Expression Omnibus database were imported on a weekly basis. Data is collected to MIAME and MINSEQE standards.
The Electron Microscopy Data Bank (EMDB) is a public repository for electron microscopy density maps of macromolecular complexes and subcellular structures. It covers a variety of techniques, including single-particle analysis, electron tomography, and electron (2D) crystallography.
EMAGE (e-Mouse Atlas of Gene Expression) is an online biological database of gene expression data in the developing mouse (Mus musculus) embryo. The data held in EMAGE is spatially annotated to a framework of 3D mouse embryo models produced by EMAP (e-Mouse Atlas Project). These spatial annotations allow users to query EMAGE by spatial pattern as well as by gene name, anatomy term or Gene Ontology (GO) term. EMAGE is a freely available web-based resource funded by the Medical Research Council (UK) and based at the MRC Human Genetics Unit in the Institute of Genetics and Molecular Medicine, Edinburgh, UK.
This site provides access to complete, annotated genomes from bacteria and archaea (present in the European Nucleotide Archive) through the Ensembl graphical user interface (genome browser). Ensembl Bacteria contains genomes from annotated INSDC records that are loaded into Ensembl multi-species databases, using the INSDC annotation import pipeline.
PDBe is the European resource for the collection, organisation and dissemination of data on biological macromolecular structures. In collaboration with the other worldwide Protein Data Bank (wwPDB) partners - the Research Collaboratory for Structural Bioinformatics (RCSB) and BioMagResBank (BMRB) in the USA and the Protein Data Bank of Japan (PDBj) - we work to collate, maintain and provide access to the global repository of macromolecular structure data. We develop tools, services and resources to make structure-related data more accessible to the biomedical community.
Country
SISSA Open Data is the Sissa repository for the research data managment. It is an institutional repository that captures, stores, preserves, and redistributes the data of the SISSA scientific community in digital form. SISSA Open Data is managed by the SISSA Library as a service to the SISSA scientific community.
Country
The SABIO-RK is a web-based application based on the SABIO relational database that contains information about biochemical reactions, their kinetic equations with their parameters, and the experimental conditions under which these parameters were measured. It aims to support modellers in the setting-up of models of biochemical networks, but it is also useful for experimentalists or researchers with interest in biochemical reactions and their kinetics. All the data are manually curated and annotated by biological experts, supported by automated consistency checks.
The Mouse Phenome Database (MPD; phenome.jax.org) has characterizations of hundreds of strains of laboratory mice to facilitate translational discoveries and to assist in selection of strains for experimental studies.
GeneLab is an interactive, open-access resource where scientists can upload, download, store, search, share, transfer, and analyze omics data from spaceflight and corresponding analogue experiments. Users can explore GeneLab datasets in the Data Repository, analyze data using the Analysis Platform, and create collaborative projects using the Collaborative Workspace. GeneLab promises to facilitate and improve information sharing, foster innovation, and increase the pace of scientific discovery from extremely rare and valuable space biology experiments. Discoveries made using GeneLab have begun and will continue to deepen our understanding of biology, advance the field of genomics, and help to discover cures for diseases, create better diagnostic tools, and ultimately allow astronauts to better withstand the rigors of long-duration spaceflight. GeneLab helps scientists understand how the fundamental building blocks of life itself – DNA, RNA, proteins, and metabolites – change from exposure to microgravity, radiation, and other aspects of the space environment. GeneLab does so by providing fully coordinated epigenomics, genomics, transcriptomics, proteomics, and metabolomics data alongside essential metadata describing each spaceflight and space-relevant experiment. By carefully curating and implementing best practices for data standards, users can combine individual GeneLab datasets to gain new, comprehensive insights about the effects of spaceflight on biology. In this way, GeneLab extends the scientific knowledge gained from each biological experiment conducted in space, allowing scientists from around the world to make novel discoveries and develop new hypotheses from these priceless data.
The PhenoGen website shares experimental data with a worldwide community of investigators and provides a flexible, integrated, multi-resolution repository of neuroscience transcriptomic genetic data for collaborative research on genomic disorders. The main development focus is on providing Hybrid Rat Diversity Panel transcriptomic data (sequencing, genome coverage, reconstructed totalRNA/smallRNA transcriptomes, quanification of the transcriptome, eQTLs, and WGCNA) and integrating additional tools to provide platform for visualization and analysis of HRDP transcriptome data.
The NF Data Portal is designed to help openly explore and share NF datasets, analysis tools, resources, and publications related to neurofibromatosis. Anyone can join the NF Open Science Initiative (NF-OSI) to participate! We welcome contributions from anyone in the neurofibromatosis and schwannomatosis research community, such as original datasets generated by the community or analyses of data from the NF Data Portal.