Filter

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 67 result(s)
KONECT (the Koblenz Network Collection) is a project to collect large network datasets of all types in order to perform research in network science and related fields, collected by the Institute of Web Science and Technologies at the University of Koblenz–Landau. KONECT contains over a hundred network datasets of various types, including directed, undirected, bipartite, weighted, unweighted, signed and rating networks. The networks of KONECT are collected from many diverse areas such as social networks, hyperlink networks, authorship networks, physical networks, interaction networks and communication networks. The KONECT project has developed network analysis tools which are used to compute network statistics, to draw plots and to implement various link prediction algorithms. The result of these analyses are presented on these pages. Whenever we are allowed to do so, we provide a download of the networks.
Stanford Network Analysis Platform (SNAP) is a general purpose network analysis and graph mining library. It is written in C++ and easily scales to massive networks with hundreds of millions of nodes, and billions of edges. It efficiently manipulates large graphs, calculates structural properties, generates regular and random graphs, and supports attributes on nodes and edges. SNAP is also available through the NodeXL which is a graphical front-end that integrates network analysis into Microsoft Office and Excel. The SNAP library is being actively developed since 2004 and is organically growing as a result of our research pursuits in analysis of large social and information networks. Largest network we analyzed so far using the library was the Microsoft Instant Messenger network from 2006 with 240 million nodes and 1.3 billion edges. The datasets available on the website were mostly collected (scraped) for the purposes of our research. The website was launched in July 2009.
CRAWDAD is the Community Resource for Archiving Wireless Data, a wireless network data resource for the research community. This archive has the capacity to store wireless trace data from many contributing locations, and staff to develop better tools for collecting, anonymizing, and analyzing the data. We work with community leaders to ensure that the archive meets the needs of the research community.
WISER is a self-service platform for data of the Global Networks of Isotopes in Precipitation (GNIP) and Rivers (GNIR), hosted within the IAEA's repository for technical resources (NUCLEUS). GNIP in WISER currently contains over 130,000 records, and stable isotopes are current to the end of 2013, and will be updated as verified data comes in. Parts of the GNIR water isotope data is online as well (synoptic/time series), although we are still in process of verifying and completing GNIR data uploads and for other isotopic parameters over the next year. Check back occasionally for GNIR updates. Tritium data after 2009 is in the process of being updated in the next year.
EIDA, an initiative within ORFEUS, is a distributed data centre established to (a) securely archive seismic waveform data and related metadata, gathered by European research infrastructures, and (b) provide transparent access to the archives by the geosciences research communities. EIDA nodes are data centres which collect and archive data from seismic networks deploying broad-band sensors, short period sensors, accelerometers, infrasound sensors and other geophysical instruments. Networks contributing data to EIDA are listed in the ORFEUS EIDA networklist (http://www.orfeus-eu.org/data/eida/networks/). Data from the ORFEUS Data Center (ODC), hosted by KNMI, are available through EIDA. Technically, EIDA is based on an underlying architecture developed by GFZ to provide transparent access to all nodes' data. Data within the distributed archives are accessible via the ArcLink protocol (http://www.seiscomp3.org/wiki/doc/applications/arclink).
Network Repository is the first interactive data repository for graph and network data. It hosts graph and network datasets, containing hundreds of real-world networks and benchmark datasets. Unlike other data repositories, Network Repository provides interactive analysis and visualization capabilities to allow researchers to explore, compare, and investigate graph data in real-time on the web.
Country
Introduction of genome-scale metabolic network: The completion of genome sequencing and subsequent functional annotation for a great number of species enables the reconstruction of genome-scale metabolic networks. These networks, together with in silico network analysis methods such as the constraint based methods (CBM) and graph theory methods, can provide us systems level understanding of cellular metabolism. Further more, they can be applied to many predictions of real biological application such as: gene essentiality analysis, drug target discovery and metabolic engineering
Country
VENUS coastal network, part of the Ocean Networks Canada Observatory, is a cabled undersea laboratory for ocean researchers and explorers. VENUS delivers real time information from seafloor instruments via fibre optic cables to the University of Victoria, BC. You can see ocean data live, recent and archived as well as learn more about on-going research
Country
Ocean Networks Canada maintains several observatories installed in three different regions in the world's oceans. All three observatories are cabled systems that can provide power and high bandwidth communiction paths to sensors in the ocean. The infrastructure supports near real-time observations from multiple instruments and locations distributed across the Arctic, NEPTUNE and VENUS observatory networks. These observatories collect data on physical, chemical, biological, and geological aspects of the ocean over long time periods, supporting research on complex Earth processes in ways not previously possible.
The PRISM Climate Group gathers climate observations from a wide range of monitoring networks, applies sophisticated quality control measures, and develops spatial climate datasets to reveal short- and long-term climate patterns. The resulting datasets incorporate a variety of modeling techniques and are available at multiple spatial/temporal resolutions, covering the period from 1895 to the present. Whenever possible, we offer these datasets to the public, either free of charge or for a fee (depending on dataset size/complexity and funding available for the activity).
Country
NEPTUNE Canada, the North-East Pacific Time-series Undersea Networked Experiments, is the world's first regional scale cabled deep ocean observing network. It consists of an 800km network of electro‐optic cable laid on the seabed over the northern Juan de Fuca tectonic plate, off the coast of British Columbia. This tectonic plate serves as an exceptional natural laboratory for ocean observation and experiments. NEPTUNE Canada instruments yield continuous real‐time data and imagery from the ocean surface to beneath the seafloor, and from the coast to the deep sea. They respond to events such as earthquakes, tsunamis, fish migrations, plankton blooms, storms and volcanic eruptions. NEPTUNE Canada offers a unique and exciting approach to ocean science.
The AERONET (AErosol RObotic NETwork) program is a federation of ground-based remote sensing aerosol networks established by NASA and PHOTONS (PHOtométrie pour le Traitement Opérationnel de Normalisation Satellitaire; Univ. of Lille 1, CNES, and CNRS-INSU) and is greatly expanded by networks (e.g., RIMA, AeroSpan, AEROCAN, and CARSNET) and collaborators from national agencies, institutes, universities, individual scientists, and partners. The program provides a long-term, continuous and readily accessible public domain database of aerosol optical, microphysical and radiative properties for aerosol research and characterization, validation of satellite retrievals, and synergism with other databases. The network imposes standardization of instruments, calibration, processing and distribution.
VertNet is a NSF-funded collaborative project that makes biodiversity data free and available on the web. VertNet is a tool designed to help people discover, capture, and publish biodiversity data. It is also the core of a collaboration between hundreds of biocollections that contribute biodiversity data and work together to improve it. VertNet is an engine for training current and future professionals to use and build upon best practices in data quality, curation, research, and data publishing. Yet, VertNet is still the aggregate of all of the information that it mobilizes. To us, VertNet is all of these things and more.
Country
The National Earthquake DataBase (NEDB) comprises a number of separate databases that together act as the national repository for all raw seismograph data, measurements, and derived parameters arising from the Canadian National Seismograph Network (CNSN), the Yellowknife Seismological Array (YKA), previous regional telemetered networks in eastern and western Canada (ECTN, WCTN), local telemetered networks (CLTN, SLTN), the Regional Analogue Network, and the former Standard Seismograph Network (CSN). It supports the efforts of Earthquakes Canada in Canadian seismicity monitoring, global seismic monitoring, verification of the Comprehensive nuclear Test Ban Treaty, and international data exchange. It also supports the [Nuclear Explosion Monitoring project][NEM].
Country
The Coriolis Data Centre handles operational oceanography measurements made in situ, complementing the measurement of the ocean surface made using instruments aboard satellites. This work is realised through the establishment of permanent networks with data collected by ships or autonomous systems that are either fixed or drifting. This data can be used to construct a snapshot of water mass structure and current intensity.
Country
The National Atmospheric Chemistry Database (NAtChem) is a data archival and analysis facility operated by the Science and Technology Branch of Environment and Climate Change Canada. The purpose of the NAtChem database is to enhance atmospheric research through the archival and analysis of North American air and precipitation chemistry data. Such research includes investigations into the chemical nature of the atmosphere, atmospheric processes, spatial and temporal patterns, source-receptor relationships and long range transport of air pollutants. The NAtChem Database contains air and precipitation chemistry data from many major regional-scale networks in North America. To contribute to NAtChem, networks must operate for a period of at least two years, must have wide area coverage, and must have regionally-representative sites (rural and background).
Country
GEOFON seeks to facilitate cooperation in seismological research and earthquake and tsunami hazard mitigation by providing rapid transnational access to seismological data and source parameters of large earthquakes, and keeping these data accessible in the long term. It pursues these aims by operating and maintaining a global network of permanent broadband stations in cooperation with local partners, facilitating real time access to data from this network and those of many partner networks and plate boundary observatories, providing a permanent and secure archive for seismological data. It also archives and makes accessible data from temporary experiments carried out by scientists at German universities and institutions, thereby fostering cooperation and encouraging the full exploitation of all acquired data and serving as the permanent archive for the Geophysical Instrument Pool at Potsdam (GIPP). It also organises the data exchange of real-time and archived data with partner institutions and international centres.
China Earthquake Data Center provides Seismic data, geomagnetic data, geoelectric data, terrain data and underground fluid change data. It is only open in the Seismological Bureau.
Country
FANTOM stands for 'Functional Annotation of the Mammalian Genome' and is the name of an international research consortium organized by the RIKEN Omics Science Center. The FANTOM5 project aims to build a full understanding of transcriptional regulation in a human system by generating transcriptional regulatory networks that define every human cell type.
Content type(s)
The Blue Obelisk Data Repository lists many important chemoinformatics data such as element and isotope properties, atomic radii, etc. including references to original literature. Developers can use this repository to make their software interoperable.
A collection of open content name datasets for Information Centric Networking. The "Content Name Collection" (CNC) lists and hosts open datasets of content names. These datasets are either derived from URL link databases or web traces. The names are typically used for research on Information Centric Networking (ICN), for example to measure cache hit/miss ratios in simulations.
BiGG is a knowledgebase of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different organisms. BiGG can be used to browse model content, visualize metabolic pathway maps, and export SBML files of the models for further analysis by external software packages. Users may follow links from BiGG to several external databases to obtain additional information on genes, proteins, reactions, metabolites and citations of interest.
Content type(s)
The IDR makes datasets that have never previously been accessible publicly available, allowing the community to search, view, mine and even process and analyze large, complex, multidimensional life sciences image data. Sharing data promotes the validation of experimental methods and scientific conclusions, the comparison with new data obtained by the global scientific community, and enables data reuse by developers of new analysis and processing tools.