Content Types


AID systems



Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 28 result(s)
<<<!!!<<< 2019-12-23: the repository is offline >>>!!!>>> Introduction of genome-scale metabolic network: The completion of genome sequencing and subsequent functional annotation for a great number of species enables the reconstruction of genome-scale metabolic networks. These networks, together with in silico network analysis methods such as the constraint based methods (CBM) and graph theory methods, can provide us systems level understanding of cellular metabolism. Further more, they can be applied to many predictions of real biological application such as: gene essentiality analysis, drug target discovery and metabolic engineering
The database aims to bridge the gap between agent repositories and studies documenting the effect of antimicrobial combination therapies. Most notably, our primary aim is to compile data on the combination of antimicrobial agents, namely natural products such as AMP. To meet this purpose, we have developed a data curation workflow that combines text mining, manual expert curation and graph analysis and supports the reconstruction of AMP-Drug combinations.
I2D (Interologous Interaction Database) is an on-line database of known and predicted mammalian and eukaryotic protein-protein interactions. It has been built by mapping high-throughput (HTP) data between species. Thus, until experimentally verified, these interactions should be considered "predictions". It remains one of the most comprehensive sources of known and predicted eukaryotic PPI. I2D includes data for S. cerevisiae, C. elegans, D. melonogaster, R. norvegicus, M. musculus, and H. sapiens.
The NDEx Project provides an open-source framework where scientists and organizations can share, store, manipulate, and publish biological network knowledge. The NDEx Project maintains a free, public website; alternatively, users can also decide to run their own copies of the NDEx Server software in cases where the stored networks must be kept in a highly secure environment (such as for HIPAA compliance) or where high application load is incompatible with a shared public resource.
Ocean Networks Canada maintains several observatories installed in three different regions in the world's oceans. All three observatories are cabled systems that can provide power and high bandwidth communiction paths to sensors in the ocean. The infrastructure supports near real-time observations from multiple instruments and locations distributed across the Arctic, NEPTUNE and VENUS observatory networks. These observatories collect data on physical, chemical, biological, and geological aspects of the ocean over long time periods, supporting research on complex Earth processes in ways not previously possible.
The PRISM Climate Group gathers climate observations from a wide range of monitoring networks, applies sophisticated quality control measures, and develops spatial climate datasets to reveal short- and long-term climate patterns. The resulting datasets incorporate a variety of modeling techniques and are available at multiple spatial/temporal resolutions, covering the period from 1895 to the present. Whenever possible, we offer these datasets to the public, either free of charge or for a fee (depending on dataset size/complexity and funding available for the activity).
<<<!!!<<< This repository is no longer available. >>>!!!>>> NetPath is currently one of the largest open-source repository of human signaling pathways that is all set to become a community standard to meet the challenges in functional genomics and systems biology. Signaling networks are the key to deciphering many of the complex networks that govern the machinery inside the cell. Several signaling molecules play an important role in disease processes that are a direct result of their altered functioning and are now recognized as potential therapeutic targets. Understanding how to restore the proper functioning of these pathways that have become deregulated in disease, is needed for accelerating biomedical research. This resource is aimed at demystifying the biological pathways and highlights the key relationships and connections between them. Apart from this, pathways provide a way of reducing the dimensionality of high throughput data, by grouping thousands of genes, proteins and metabolites at functional level into just several hundreds of pathways for an experiment. Identifying the active pathways that differ between two conditions can have more explanatory power than just a simple list of differentially expressed genes and proteins.
ConsensusPathDB integrates interaction networks in humans (and in the model organisms - yeast and mouse) including binary and complex protein-protein, genetic, metabolic, signaling, gene regulatory and drug-target interactions, as well as biochemical pathways. Data originate from public resources for interactions and interactions curated from the literature. The interaction data are integrated in a complementary manner to avoid redundancies.
Content type(s)
The IDR makes datasets that have never previously been accessible publicly available, allowing the community to search, view, mine and even process and analyze large, complex, multidimensional life sciences image data. Sharing data promotes the validation of experimental methods and scientific conclusions, the comparison with new data obtained by the global scientific community, and enables data reuse by developers of new analysis and processing tools.
The DIP database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. The data stored within the DIP database were curated, both, manually by expert curators and also automatically using computational approaches that utilize the the knowledge about the protein-protein interaction networks extracted from the most reliable, core subset of the DIP data. Please, check the reference page to find articles describing the DIP database in greater detail. The Database of Ligand-Receptor Partners (DLRP) is a subset of DIP (Database of Interacting Proteins). The DLRP is a database of protein ligand and protein receptor pairs that are known to interact with each other. By interact we mean that the ligand and receptor are members of a ligand-receptor complex and, unless otherwise noted, transduce a signal. In some instances the ligand and/or receptor may form a heterocomplex with other ligands/receptors in order to be functional. We have entered the majority of interactions in DLRP as full DIP entries, with links to references and additional information
The SABIO-RK is a web-based application based on the SABIO relational database that contains information about biochemical reactions, their kinetic equations with their parameters, and the experimental conditions under which these parameters were measured. It aims to support modellers in the setting-up of models of biochemical networks, but it is also useful for experimentalists or researchers with interest in biochemical reactions and their kinetics. All the data are manually curated and annotated by biological experts, supported by automated consistency checks.
The CZO Multiscale TROPIcal CatchmentS (M-TROPICS) consists in the merging, in 2016, of two previously-existing CZOs: BVET (India and Cameroon) and MSEC (Laos and Vietnam). The CZO Multiscale TROPIcal CatchmentS (M-TROPICS) provides the international scientific community with unique decennial time series of meteorological, hydrological, geochemical, and ecological variables in tropical environments. The CZO M-TROPICS involves academic and governmental partners in tropical countries (Cameroun, India, Lao PDR, and Vietnam) and is included in the Research Infrastructure OZCAR, the French contribution to the international CZO initiative.
<<<!!!<<< This repository is no longer available. >>>!!!>>> Message since 2018-06: This virtual host is being reconstructed.
BiGG is a knowledgebase of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different organisms. BiGG can be used to browse model content, visualize metabolic pathway maps, and export SBML files of the models for further analysis by external software packages. Users may follow links from BiGG to several external databases to obtain additional information on genes, proteins, reactions, metabolites and citations of interest.
GeneWeaver combines cross-species data and gene entity integration, scalable hierarchical analysis of user data with a community-built and curated data archive of gene sets and gene networks, and tools for data driven comparison of user-defined biological, behavioral and disease concepts. Gene Weaver allows users to integrate gene sets across species, tissue and experimental platform. It differs from conventional gene set over-representation analysis tools in that it allows users to evaluate intersections among all combinations of a collection of gene sets, including, but not limited to annotations to controlled vocabularies. There are numerous applications of this approach. Sets can be stored, shared and compared privately, among user defined groups of investigators, and across all users.
WFCC-MIRCEN World Data Centre for Microorganisms (WDCM) provides a comprehensive directory of culture collections, databases on microbes and cell lines, and the gateway to biodiversity, molecular biology and genome projects.The WFCC is a Multidisciplinary Commission of the International Union of Biological Sciences (IUBS) and a Federation within the International Union of Microbiological Societies (IUMS). The WFCC is concerned with the collection, authentication, maintenance and distribution of cultures of microorganisms and cultured cells. Its aim is to promote and support the establishment of culture collections and related services, to provide liaison and set up an information network between the collections and their users, to organise workshops and conferences, publications and newsletters and work to ensure the long term perpetuation of important collections.
FANTOM stands for 'Functional Annotation of the Mammalian Genome' and is the name of an international research consortium organized by the RIKEN Omics Science Center. The FANTOM5 project aims to build a full understanding of transcriptional regulation in a human system by generating transcriptional regulatory networks that define every human cell type.
GenBase is a genetic sequence database that accepts user submissions (mRNA, genomic DNAs, ncRNA, or small genomes such as organelles, viruses, plasmids, phages from any organism) and integrates data from INSDC.
Rhea is a freely available and comprehensive resource of expert-curated biochemical reactions. It has been designed to provide a non-redundant set of chemical transformations for applications such as the functional annotation of enzymes, pathway inference and metabolic network reconstruction. There are three types of reaction participants (reactants and products): Small molecules, Rhea polymers, Generic compounds. All three types of reaction participants are linked to the ChEBI database (Chemical Entities of Biological Interest) which provides detailed information about structure, formula and charge. Rhea provides built-in validations that ensure both mass and charge balance of the reactions. We have populated the database with the reactions found in the enzyme classification (i.e. in the IntEnz and ENZYME databases), extending it with additional known reactions of biological interest. While the main focus of Rhea is enzyme-catalysed reactions, other biochemical reactions (including those that are often termed "spontaneous") also are included.
>>> !!! the repository is offline !!! <<< More information see: All NURSA-biocurated transcriptomic datasets have been preserved for data mining in SPP through an enhanced and expanded version of Transcriptomine named Ominer. To access these datasets, dkNET provides users with the information of 527 transcriptomic datasets that contain data related to nuclear receptors and nuclear receptor coregulators in the NURSA Datasets table view and redirects users to the current SPP dataset page. Once users find the specific dataset of research interest, users can download the dataset by clicking DOI and then clicking the Download Dataset button at the Signaling Pathways Project webpage. See
Starting September 2013, MINT uses the IntAct database infrastructure to limit the duplication of efforts and to optimise future software development. Data manually curated by the MINT curators can now be accessed from the IntAct homepage at the EBI. Data maintenance and release, MINT PSICQUIC and IMEx services are under the responsibility of the IntAct team, while curation effort will be carried by both groups. The MINT development team now focuses on two new developments: mentha that integrates protein interaction information curated by IMEx databases and SIGNOR a database of logic relationships between human proteins. MINT is a public repository for molecular interactions reported in peer-reviewed journals.IT is a collection of molecular interaction databases that can be used to search for, analyze and graphically display molecular interaction networks and pathways from a wide variety of species. MINT is comprised of separate database components. HomoMINT, is an inferred human protein interatction database. Domino, is database of domain peptide interactions. A new component has been added called VirusMINT that explores the interactions of viral proteins with human proteins.
The South African Marine Information Management System (MIMS) is an Open Archival Information System (OAIS) repository that plays a multifaceted role in archiving, publishing, and preserving marine-related datasets. As an IODE-accredited Associate Data Unit (ADU), MIMS serves as a national node for the IODE of the IOC of UNESCO. It archives and publishes collections and subsets of marine-related datasets for the National Department of Forestry, Fisheries, and the Environment (DFFE) and its regional partners. As an IOC member organization, DFFE is committed to supporting the long-term preservation and archival of marine and coastal data for South Africa and its regional partners, promoting open access to data, and encouraging scientific collaboration. Tasked with the long-term preservation of South Africa's marine and coastal data, MIMS functions as an institutional data repository. It provides primary access to all data collected by the DFFE Oceans and Coastal Research Directorate and acts as a trusted broker of scientific marine data for a wide range of South African institutions. MIMS hosts the IODE AFROBIS Node, an OBIS Node that coordinates and collates data management activities within the sub-Saharan African region. As part of the OBIS Steering Group, MIMS represents sub-Saharan Africa on issues around biological (biodiversity) data standards. It also facilitates data and metadata publishing for the region through the GBIF and OBIS networks. Operating on the Findable, Accessible, Interoperable, and Reusable (FAIR) data principles, MIMS aligns its practices to maximize ocean data exchange and use while respecting the conditions stipulated by the Data Provider. By integrating various functions and commitments, MIMS stands as a vital component in the marine and coastal data landscape, fostering collaboration, standardization, and accessibility in alignment with international standards and regional needs.
KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies