  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) imply priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
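The operators listed above can be combined into a single query string. The following is a minimal sketch, assuming the search box accepts the operators exactly as listed. The helper names (`prefix`, `phrase`, `any_of`, `exclude`, `fuzzy`) are hypothetical and are not part of any site API; they only assemble text.

```python
# Hypothetical helpers that assemble a query string from the operators
# described in the list above. They only build text; nothing is sent anywhere.

def prefix(stem: str) -> str:
    """Wildcard search: '*' at the end of a keyword."""
    return f"{stem}*"

def phrase(text: str) -> str:
    """Phrase search: wrap the text in double quotes."""
    return f'"{text}"'

def any_of(*terms: str) -> str:
    """OR search: join terms with '|' and group them with parentheses."""
    return "(" + " | ".join(terms) + ")"

def exclude(term: str) -> str:
    """NOT operation: prefix the term with '-'."""
    return f"-{term}"

def fuzzy(word: str, distance: int) -> str:
    """Fuzzy search: '~N' after a word sets the edit distance."""
    return f"{word}~{distance}"

# AND is the default per the list, so the parts are joined with a space.
query = " ".join([
    prefix("genom"),
    any_of("ancient", phrase("open data")),
    exclude("microbiome"),
])
print(query)
```

Grouping the OR terms in parentheses keeps the default AND from binding to only one of the alternatives.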
Found 43 result(s)
OAGR captures and catalogues ancient human genome and microbiome data, including raw sequence and processed data, along with metadata about its provenance and production. Included datasets are generated from ancient samples studied at the Australian Centre for Ancient DNA, University of Adelaide, in collaboration with other research groups. Datasets and collections in OAGR are open data resources made freely available in a reusable form, using open file formats and licensed with minimal restrictions for reuse. Digital object identifiers (DOIs) are minted for included datasets and collections to facilitate persistent identification and citation.
The European Union Open Data Portal is the single point of access to a growing range of data from the institutions and other bodies of the European Union (EU). Data are free for you to use and reuse for commercial or non-commercial purposes. By providing easy and free access to data, the portal aims to promote their innovative use and unleash their economic potential. It also aims to help foster the transparency and the accountability of the institutions and other bodies of the EU. The EU Open Data Portal is managed by the Publications Office of the European Union. Implementation of the EU's open data policy is the responsibility of the Directorate-General for Communications Networks, Content and Technology of the European Commission.
The National Archives and Records Administration (NARA) is the nation's record keeper. Of all documents and materials created in the course of business conducted by the United States Federal government, only 1%-3% are so important for legal or historical reasons that they are kept by us forever. Those valuable records are preserved and are available to you, whether you want to see if they contain clues about your family’s history, need to prove a veteran’s military service, or are researching an historical topic that interests you.
Access analytical research reports and statistical information on citizenship and immigration trends. Research for Citizenship and Immigration Canada’s strategic research program furthers our understanding of the impact of immigration on Canadian society. Citizenship and Immigration Canada’s statistical publications provide information on permanent and temporary residents as well as immigration and citizenship programs. Older Research and Statistics reports are available from Library and Archives Canada. Key findings of external and internal projects related to public opinion are also provided.
Psi Open Data is an open repository for parapsychology research data, operated by the Society for Psychical Research. The datasets may be freely used, modified, and shared by anyone – subject, at most, to the requirement to attribute and/or share-alike (see the license attached to each dataset for details).
The Scholarly Database (SDB) at Indiana University aims to serve researchers and practitioners interested in the analysis, modeling, and visualization of large-scale scholarly datasets. The online interface provides access to six datasets: MEDLINE papers, registered Clinical Trials, U.S. Patent and Trademark Office (USPTO) patents, National Science Foundation (NSF) funding, National Institutes of Health (NIH) funding, and National Endowment for the Humanities funding, totaling over 26 million records.
MTSA is a Metropolitan Travel Survey Archive that stores, preserves, and makes publicly available, via the internet, travel surveys conducted by metropolitan areas, states and localities. Thanks to cooperation from several agencies, we have been able to post databases along with relevant documentation for many regions in the archive. The databases and the documentation can be obtained from this website. In addition to making these databases publicly available, we are converting all the databases to a common format to enhance the readability and usability of each survey, so that many surveys can be used online (see Analyze). The results from the first year of the project, along with issues related to archiving travel survey data, are provided on our reports page. Papers written by Yacov Zahavi, an instrumental figure in the development of travel surveys, are also provided here.
The Alternative Fuels Data Center (AFDC) is a comprehensive clearinghouse of information about advanced transportation technologies. The AFDC offers transportation decision makers unbiased information, data, and tools related to the deployment of alternative fuels and advanced vehicles. The AFDC launched in 1991 in response to the Alternative Motor Fuels Act of 1988 and the Clean Air Act Amendments of 1990. It originally served as a repository for alternative fuel performance data. The AFDC has since evolved to offer a broad array of information resources that support efforts to reduce petroleum use in transportation. The AFDC serves Clean Cities stakeholders, fleets regulated by the Energy Policy Act, businesses, policymakers, government agencies, and the general public.
Exposures in the period from conception to early childhood, including fetal growth, cell division, and organ functioning, may have a long-lasting impact on health and disease susceptibility. To investigate these issues the Danish National Birth Cohort (Better health in generations) was established. A large cohort of pregnant women with long-term follow-up of the offspring was the obvious choice because many of the exposures of interest cannot be reconstructed with sufficient validity back in time. The study needed to be large, and the aim was to recruit 100,000 women early in pregnancy, and to continue follow-up for decades. Exposure information was collected by computer-assisted telephone interviews with the women twice during pregnancy and when their children were six and 18 months old. Participants were also asked to fill in a self-administered food frequency questionnaire in mid-pregnancy. Furthermore, a biological bank has been set up with blood taken from the mother twice during pregnancy and blood from the umbilical cord taken shortly after birth.
The Cognitive Function and Ageing Studies (CFAS) are population-based studies of individuals aged 65 years and over living in the community, including institutions. CFAS is the only large multi-centred population-based study in the UK that has reached sufficient maturity. There are three main studies within the CFAS group. MRC CFAS, the original study, began in 1989, with three of its sites providing a parent subset for the comparison two decades later with CFAS II (2008 onwards). Subsequently another CFAS study, CFAS Wales, began in 2011.
A collection of data at the Agency for Healthcare Research and Quality (AHRQ) supporting research that helps people make more informed decisions and improves the quality of health care services. The portal contains the U.S. Health Information Knowledgebase (USHIK), the Systematic Review Data Repository (SRDR), and other sources concerning cost, quality, accessibility and evaluation of healthcare and medical insurance.
The data archive maintains a collection of social and economic datasets. It's a centralized source for numeric data files: their acquisition, storage, maintenance, and use. We support the research activities of social science faculty, students, and staff at Cornell University. The collection includes federal or state censuses, files based on administrative records, public opinion surveys, economic and social data from national and international organizations, and studies compiled by individual researchers. You can search our holdings or browse studies by subject area. Also see Locating and Using Archive Data.
heidICON is provided by Heidelberg University Library and is the "Virtual Slide Collection" of Heidelberg University, which is currently being built up. In addition to recording graphic material of current interest for research and teaching, the University departments and institutes can digitize and transfer their existing slide collections.
The U.S. Bureau of Labor Statistics collects, analyzes, and publishes reliable information on many aspects of the United States economy and society. They measure employment, compensation, worker safety, productivity, and price movements. This information is used by jobseekers, workers, business leaders, and others to assist them in making sound decisions at work and at home. Statistical data covers a wide range of topics about the labor market, economy and society in the U.S.; subject areas include: Inflation & Prices, Employment, Unemployment, Pay & Benefits, Spending & Time Use, Productivity, Workplace Injuries, International, and Regional Resources. Data is available in multiple formats including charts and tables as well as Bureau of Labor Statistics publications.
The Nord-Trøndelag Health Study (The HUNT Study) is one of the largest health studies ever performed. It is a unique database of personal and family medical histories collected during three intensive studies. The fundamental strategy is to earn and maintain the confidence of the population we work in and with as is necessary for any successful population study. This strategy has been successful and has resulted in extraordinarily high participation rates. There is enthusiastic public and political support for HUNT and for the HUNT Research Centre. This has created a good basis for further health surveys in the county and an excellent research environment. Today, the HUNT Study is a database with information about approximately 120,000 people that integrates family data and individual data and can be linked to national health registries.
The Canada Open Data Project provides Government of Canada data to the public as a potential driver for economic innovation. Searchable and browsable raw data is available for download, and the public can recommend that specific data be made available.
The CESSDA catalogue provides access to the national social science data archives of the CESSDA members across Europe. Having evolved from a network of European data service providers into a legal entity and large-scale infrastructure under the auspices of the European Strategy Forum on Research Infrastructures (ESFRI), CESSDA became an ERIC (European Research Infrastructure Consortium) in June 2017.
The Australian National Corpus collates and provides access to assorted examples of Australian English text, transcriptions, audio and audio-visual materials. Text analysis tools are embedded in the interface allowing analysis and downloads in *.CSV format.
ANPERSANA is the digital library of IKER (UMR 5478), a research centre specialized in Basque language and texts. The online library platform receives and disseminates primary sources of data arising from research in Basque language and culture. To date, two corpora of documents have been published. The first is a collection of private letters written in an 18th century variety of Basque, documented in and transcribed to modern standard Basque. The discovery of the collection, named Le Dauphin, has raised new questions about the history and sociology of writing in minority languages, not only in France but also across the whole Atlantic Arc. The second corpus is a selection of sound recordings about monodic chant in the Basque Country. The documents were collected as part of PhD thesis research that took place between 2003 and 2012. It comprises a total of 50 hours of interviews with francophone and bascophone cultural representatives, carried out either at the informants' workplaces or in public areas. ANPERSANA comes with an advanced search engine. The documents have been indexed and geo-localized on an interactive map. The platform is committed to open access, and all the resources can be uploaded freely under the different Creative Commons (CC) licenses.
The Centre conducts real-time data collection on all ongoing and incoming General and Assembly Elections, and diffuses data-driven analysis through print and electronic media. The coverage includes the analysis, contextualization, and visualisation of results and the profiling of the main parties’ candidates. For each election, we assemble a team of field researchers and scholars to complete and expand existing data. Besides the ECI results data, we collect information on the socio-demographic profile of the main parties’ candidates and on the sociological profile of constituencies.
Note (2018-06-27): no longer available on the given websites. THIN has created a medical research database of anonymised patient records from information entered by general practices in their Vision systems. THIN will supply anonymised data (with the identities of patients and practices fully protected) to approved researchers for drug safety and epidemiological studies. Such research will be approved by the appropriate ethics/scientific committee. The anonymised patient data will be collected from the practice's Vision clinical system, with the help of In Practice Systems, on a regular basis without interruption to the running of the system. CSD Medical Research UK can supply non-interventional, anonymised, longitudinal patient data for UK, France, Italy, Germany, Spain, Belgium and Australia. Data for the USA will be available in the near future.
The website of the Centers for Disease Control and Prevention is its primary online communication channel. It provides users with credible, reliable health information on Data and Statistics, Diseases and Conditions, Emergencies and Disasters, Environmental Health, Healthy Living, Injury, Violence and Safety, Life Stages and Populations, Travelers' Health, and Workplace Safety and Health.
Cell phones have become an important platform for the understanding of social dynamics and influence, because of their pervasiveness, sensing capabilities, and computational power. Many applications have emerged in recent years in mobile health, mobile banking, location-based services, media democracy, and social movements. With these new capabilities, we may be able to identify exact points and times of infection for diseases, determine who most influences us to gain weight or become healthier, know exactly how information flows among employees and how productivity emerges in our work spaces, and understand how rumors spread. In an attempt to address these challenges, we release several mobile data sets here in "Reality Commons" that contain the dynamics of several communities of about 100 people each. We invite researchers to propose and submit their own applications of the data to demonstrate the scientific and business value of these data sets, suggest how to meaningfully extend these experiments to larger populations, and develop the math that fits agent-based models or systems dynamics models to larger populations. These data sets were collected with tools developed in the MIT Human Dynamics Lab and are now available as open source projects or at cost.
The European Prospective Investigation into Cancer and Nutrition (EPIC) study is one of the largest cohort studies in the world, with more than half a million (521 000) participants recruited across 10 European countries and followed for almost 15 years. EPIC was designed to investigate the relationships between diet, nutritional status, lifestyle and environmental factors, and the incidence of cancer and other chronic diseases. EPIC investigators are active in all fields of epidemiology, and important contributions have been made in nutritional epidemiology using biomarker analysis and questionnaire information, as well as genetic and lifestyle investigations.
The National Data Archive on Child Abuse and Neglect (NDACAN) promotes scholarly exchange among researchers in the child maltreatment field. NDACAN acquires microdata from leading researchers and national data collection efforts and makes these datasets available to the research community for secondary analysis.