Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 66 result(s)
Stanford Network Analysis Platform (SNAP) is a general purpose network analysis and graph mining library. It is written in C++ and easily scales to massive networks with hundreds of millions of nodes, and billions of edges. It efficiently manipulates large graphs, calculates structural properties, generates regular and random graphs, and supports attributes on nodes and edges. SNAP is also available through the NodeXL which is a graphical front-end that integrates network analysis into Microsoft Office and Excel. The SNAP library is being actively developed since 2004 and is organically growing as a result of our research pursuits in analysis of large social and information networks. Largest network we analyzed so far using the library was the Microsoft Instant Messenger network from 2006 with 240 million nodes and 1.3 billion edges. The datasets available on the website were mostly collected (scraped) for the purposes of our research. The website was launched in July 2009.
The University has followed all of the children born in Aberdeen in 1921, 1936, and 1950-1956 as they grow and age. Collectively these groups are known as the ABERDEEN BIRTH COHORTS, and are a jewel in the crown of Scottish health research and have helped to advance our understanding of aging well. The Children of the 1950s study is a population-based resource for the study of biological and social influences on health across the life-course and between generations.
The centerpiece of the Global Trade Analysis Project is a global data base describing bilateral trade patterns, production, consumption and intermediate use of commodities and services. The GTAP Data Base consists of bilateral trade, transport, and protection matrices that link individual country/regional economic data bases. The regional data bases are derived from individual country input-output tables, from varying years.
ICRISAT performs crop improvement research, using conventional as well as methods derived from biotechnology, on the following crops: Chickpea, Pigeonpea, Groundnut, Pearl millet,Sorghum and Small millets. ICRISAT's data repository collects, preserves and facilitates access to the datasets produced by ICRISAT researchers to all users who are interested in. Data includes Phenotypic, Genotypic, Social Science, and Spatial data, Soil and Weather.
The Health and Medical Care Archive (HMCA) is the data archive of the Robert Wood Johnson Foundation (RWJF), the largest philanthropy devoted exclusively to health and health care in the United States. Operated by the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan, HMCA preserves and disseminates data collected by selected research projects funded by the Foundation and facilitates secondary analyses of the data. Our goal is to increase understanding of health and health care in the United States through secondary analysis of RWJF-supported data collections
The DNB Household Survey (DHS) supplies longitudinal data to the international academic community, with a focus on the psychological and economic aspects of financial behavior. The study comprises information on work, pensions, housing, mortgages, income, assets, loans, health, economic and psychological concepts, and personal characteristics. The DHS data are collected from 2,000 households participating in the CentERpanel. The CentERpanel is an Internet panel that reflects the composition of the Dutch-speaking population in the Netherlands. Both the DHS as well as the CentERpanel, in which the study in conducted, are run by CentERdata
The NCAA Student-Athlete Experiences Data Archive provides access to data about student athletes and will grow to include a handful of user-friendly data collections related to graduation rates; team-level Academic Progress Rates in Division I; and individual-level data on the experiences of current and former student-athletes from the NCAA's Growth, Opportunities, Aspirations and Learning of Students in college study (GOALS), and the Study of College Outcomes and Recent Experiences (SCORE). In the long run, the NCAA expects to follow this initial release with the publication of as much data as possible from its archives. The data is used by college presidents, athletic personnel, faculty, student-athlete groups, media members, and researchers in looking at issues related to intercollegiate athletics and higher education.
ALSPAC is a longitudinal birth cohort study which enrolled pregnant women who were resident in one of three Bristol-based health districts in the former County of Avon with an expected delivery date between 1st April 1991 and 31st December 1992. Around 14,000 pregnant women were initially recruited. Detailed information has been collected on these women, their partners and subsequent children using self-completion questionnaires, data extraction from medical notes, linkage to routine information systems and from hands-on research clinics. Additional cohorts of participants have since been enrolled in their own right including fathers, siblings, children of the children and grandparents of the children. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee (IRB00003312) and Local Research Ethics.
The Canadian Opinion Research Archive at Queen's University makes available commercial and independent surveys to the academic, research and journalistic communities. Founded in 1992, CORA contains hundreds of surveys including thousands of discrete items collected by major commercial Canadian firms dating back to the 1970s. CORA is continually adding new surveys and is always soliciting new data from commercial research firms, independent think tanks, research institutes, NGOs, and academic researchers. This website also includes readily accessible results from these surveys, tracking Canadian opinion over time on frequently asked survey questions, as well as tabular results from recent Canadian surveys, and more general information on polling. This material is made available as a public service by CORA and its partners.
THIN has created a medical research database of anonymised patient records from information entered by general practices in their ViSion systems. THIN will supply anonymised data (with the identities of patients and practices fully protected) to approved researchers for drug safety and epidemiological studies. Such research will be approved by the appropriate ethics/scientific committee. The anonymised patient data will be collected from the practice's Vision clinical system, with the help of In Practice Systems, on a regular basis without interruption to the running of the system. CSD Medical Research UK can supply non-interventional, anonymised, longitudinal patient data for UK, France, Italy, Germany, Spain, Belgium and Australia. Data for the USA will be available in the near future.
Our lab investigates how cognition manifests in, and is influenced by, the social contexts in which it occurs. We focus: 1) on how conversational interactions can reshape memory, by promoting shared remembering and shared forgetting, and 2) on how socio-cognitive processes affect the formation of collective memories and beliefs, and the dynamics of collective decisions. In exploring these issues, while maintaining high ecological validity, our lab integrates a wide range of methodologies, including laboratory experiments, field studies, social network analysis, and agent-based simulations.
The Comparative Welfare Entitlements Dataset (CWED) contains information about the structure and generosity of social insurance benefits in 33 countries around the world. The data contained here are an updated and extended version of CWED 1, which has been available since 2004. This web site allows you to download customized portions of the CWED 2 data, browse the Working Paper Series or access documentary material.
The HSRC Research Data Service provides a digital repository facility for the HSRC's research data in support of evidence based human and social development in South Africa and the broader region. It includes both quantitative and qualitative data. Access to data is dependent on ethical requirements for protecting research participants, as well as on legal agreements with the owners, funders or in the case of data owned by the HSRC, the requirements of the depositors of the data.
The Nord-Trøndelag Health Study (The HUNT Study) is one of the largest health studies ever performed. It is a unique database of personal and family medical histories collected during three intensive studies. The fundamental strategy is to earn and maintain the confidence of the population we work in and with as is necessary for any successful population study. This strategy has been successful and has resulted in extraordinarily high participation rates. There is enthusiastic public and political support for HUNT and for the HUNT Research Centre. This has created a good basis for further health surveys in the county and an excellent research environment. Today, the HUNT Study is a database with information about approximately 120,000 people that integrates family data and individual data and can be linked to national health registries.
NACDA acquires and preserves data relevant to gerontological research, processing as needed to promote effective research use, disseminates them to researchers, and facilitates their use. By preserving and making available the largest library of electronic data on aging in the United States, NACDA offers opportunities for secondary analysis on major issues of scientific and policy relevance
The CEACS Data Library aims to support its research community to conduct quantitative research with primary and secondary data of the highest quality. The Data Library provides integrated access to an extensive collection of data for research and teaching. This collection comprises studies from major data centres as well as public collections and other datasets of special interest to members of CEACS. This section offers the possibility to search and browse the collection. The links go to records on the catalogue or the data directly on our servers or the web. If you cannot locate or access the data you are after please contact the Data Librarian for further assistance.
The UCI Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms. It is used by students, educators, and researchers all over the world as a primary source of machine learning data sets. As an indication of the impact of the archive, it has been cited over 1000 times.
TRAILS is a prospective cohort study, which started in 2001 with population cohort and 2004 with a clinical cohort (CC). Since then, a group of 2500 young people from the Northern part of the Netherlands has been closely monitored in order to chart and explain their mental, physical, and social development. These TRAILS participants have been measured every two to three years, by means of questionnaires, interviews, and all kinds of tests. By now, we have collected information that spans the total period from preadolescence up until young adulthood. One of the main goals of TRAILS is to contribute to the knowledge of the development of emotional and behavioral problems and the (social) functioning of preadolescents into adulthood, their determinants, and underlying mechanisms.
The Health and Retirement Study (HRS) is a longitudinal panel study that surveys a representative sample of more than 26,000 Americans over the age of 50 every two years. The study has collected information about income, work, assets, pension plans, health insurance, disability, physical health and functioning, cognitive functioning, genetic information and health care expenditures.
The Fragile Families & Child Wellbeing Study is following a cohort of nearly 5,000 children born in large U.S. cities between 1998 and 2000 (roughly three-quarters of whom were born to unmarried parents). We refer to unmarried parents and their children as “fragile families” to underscore that they are families and that they are at greater risk of breaking up and living in poverty than more traditional families. The core Study was originally designed to primarily address four questions of great interest to researchers and policy makers: (1) What are the conditions and capabilities of unmarried parents, especially fathers?; (2) What is the nature of the relationships between unmarried parents?; (3) How do children born into these families fare?; and (4) How do policies and environmental conditions affect families and children?
DR-NTU (Data) is the institutional open access research data repository for Nanyang Technological University (NTU). NTU researchers are encouraged to use DR-NTU (Data) to deposit, publish and archive their final research data in order to make their research data discoverable, accessible and reusable.
SEDAC, the Socioeconomic Data and Applications Center, is one of the Distributed Active Archive Centers (DAACs) in the Earth Observing System Data and Information System (EOSDIS) of the U.S. National Aeronautics and Space Administration. SEDAC is a regular member of the World Data System and focuses on human interactions in the environment. Its mission is to develop and operate applications that support the integration of socioeconomic and Earth science data and to serve as an "Information Gateway" between the Earth and social sciences.