Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 44 result(s)
The Scholarly Database (SDB) at Indiana University aims to serve researchers and practitioners interested in the analysis, modeling, and visualization of large-scale scholarly datasets. The online interface provides access to six datasets: MEDLINE papers, registered Clinical Trials, U.S. Patent and Trademark Office patents (USPTO), National Science Foundation (NSF) funding, National Institutes of Health (NIH) funding, and National Endowment for the Humanities funding – over 26 million records in total.
The National Archives and Records Administration (NARA) is the nation's record keeper. Of all documents and materials created in the course of business conducted by the United States Federal government, only 1%-3% are so important for legal or historical reasons that they are kept by us forever. Those valuable records are preserved and are available to you, whether you want to see if they contain clues about your family’s history, need to prove a veteran’s military service, or are researching an historical topic that interests you.
A national study on socioeconomics and family health over lifetimes and across generations funded by National Science Foundation (NSF). It is the longest running longitudinal household survey in the world, started in 1968 with a nationally representative sample of over 18,000 individuals living in 5,000 families in the United States. It is recognizing the importance of the socioeconomic data, available on this website without cost to researchers and analysts.
The CMU Multi-Modal Activity Database (CMU-MMAC) database contains multimodal measures of the human activity of subjects performing the tasks involved in cooking and food preparation. The CMU-MMAC database was collected in Carnegie Mellon's Motion Capture Lab. A kitchen was built and to date twenty-five subjects have been recorded cooking five different recipes: brownies, pizza, sandwich, salad, and scrambled eggs.
The Supreme Court Database is the definitive source for researchers, students, journalists, and citizens interested in the U.S. Supreme Court. The Database contains over two hundred pieces of information about each case decided by the Court between the 1791 and 2015 terms. Examples include the identity of the court whose decision the Supreme Court reviewed, the parties to the suit, the legal provisions considered in the case, and the votes of the Justices. The project started with Spaeth's original database. The analysis tools allow you to select and summarize cases from the Modern or Legacy Database based on your needs.
The Wilson Center Digital Archive contains once-secret documents from governments all across the globe, uncovering new sources and providing fresh insights into the history of international relations and diplomacy. It contains newly declassified historical materials from archives around the world—much of it in translation and including diplomatic cables, high level correspondence, meeting minutes and more. It collects the research of three Wilson Center projects which focus on the interrelated histories of the Cold War, Korea, and Nuclear Proliferation.
WorldData.AI comes with a built-in workspace – the next-generation hyper-computing platform powered by a library of 3.3 billion curated external trends. WorldData.AI allows you to save your models in its “My Models Trained” section. You can make your models public and share them on social media with interesting images, model features, summary statistics, and feature comparisons. Empower others to leverage your models. For example, if you have discovered a previously unknown impact of interest rates on new-housing demand, you may want to share it through “My Models Trained.” Upload your data and combine it with external trends to build, train, and deploy predictive models with one click! WorldData.AI inspects your raw data, applies feature processors, chooses the best set of algorithms, trains and tunes multiple models, and then ranks model performance.
The Global Terrorism Database (GTD) is an open-source database including information on terrorist events around the world from 1970 through 2020 (with annual updates planned for the future). Unlike many other event databases, the GTD includes systematic data on domestic as well as international terrorist incidents that have occurred during this time period and now includes more than 200,000 cases.
The Metropolitan Travel Survey Archive (MTSA) includes travel surveys from numerous public agencies across the United States. The Transportation Secure Data Center has archived these surveys to ensure their continued public availability. The survey data have been converted to a standard file format and cleansed to remove personally identifiable information, including any detailed spatial data regarding individual trips.
The Roper Center has made available its entire collection of Primary exit polls. Primary exit polls datasets include standard demographic makeup of interviewee and questions pertinent to the issues of each state.
DSpace@MIT is a service of the MIT Libraries to provide MIT faculty, researchers and their supporting communities stable, long-term storage for their digital research and teaching output and to maximize exposure of their content to a world audience. DSpace@MIT content includes conference papers, images, peer-reviewed scholarly articles, preprints, technical reports, theses, working papers, research datasets and more. This collection of more than 60,000 high-quality works is recognized as among the world's premier scholarly repositories and receives, on average, more than 1 million downloads per month.
The Alternative Fuels Data Center (AFDC) is a comprehensive clearinghouse of information about advanced transportation technologies. The AFDC offers transportation decision makers unbiased information, data, and tools related to the deployment of alternative fuels and advanced vehicles. The AFDC launched in 1991 in response to the Alternative Motor Fuels Act of 1988 and the Clean Air Act Amendments of 1990. It originally served as a repository for alternative fuel performance data. The AFDC has since evolved to offer a broad array of information resources that support efforts to reduce petroleum use in transportation. The AFDC serves Clean Cities stakeholders, fleets regulated by the Energy Policy Act, businesses, policymakers, government agencies, and the general public.
The Buckeye Corpus of conversational speech contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer). Software for searching the transcription files is currently being written.
NC OneMap is a public service providing comprehensive discovery and access to North Carolina's geospatial data resources. NC OneMap, the State's Clearinghouse for geospatial information, relies on data sharing and partnerships.
PWT version 10.0 is a database with information on relative levels of income, output, input and productivity, covering 183 countries between 1950 and 2019.
DBpedia is a crowd-sourced community effort to extract structured information from Wikipedia and make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia, and to link the different data sets on the Web to Wikipedia data. We hope that this work will make it easier for the huge amount of information in Wikipedia to be used in some new interesting ways. Furthermore, it might inspire new mechanisms for navigating, linking, and improving the encyclopedia itself.
The U.S. Bureau of Labor Statistics collects, analyzes, and publishes reliable information on many aspects of the United States economy and society. They measure employment, compensation, worker safety, productivity, and price movements. This information is used by jobseekers, workers, business leaders, and others to assist them in making sound decisions at work and at home. Statistical data covers a wide range of topics about the labor market, economy and society in the U.S.; subject areas include: Inflation & Prices, Employment, Unemployment, Pay & Benefits, Spending & Time Use, Productivity, Workplace Injuries, International, and Regional Resources. Data is available in multiple formats including charts and tables as well as Bureau of Labor Statistics publications.
MICASE provides a collection of transcripts of academic speech events recorded at the University of Michigan. The original DAT audiotapes are held in the English Language Institute and may be consulted by bona fide researchers under special arrangements. Additional access: https://lsa.umich.edu/eli/language-resources/micase-micusp.html
Wharton Research Data Services (WRDS) is a web-based business data research service from The Wharton School at the University of Pennsylvania. Developed in 1993 to support faculty research at Wharton, the service has evolved to become a common tool for research for over 290 institutions around the world. WRDS is the de facto standard for business data, providing researchers worldwide with instant access to financial, economic, and marketing data through a uniform, web-based interface. This hosted data service has become the locus for quantitative data research and is recognized by the academic and financial research community around the world as the leading business intelligence tool. WRDS provides access to COMPUSTAT, CRSP, IBES, NYSE-TAQ, Bureau van Dijk, Global Insight, OptionMetrics and other important business research databases.
A premier source for United States cancer statistics, SEER gathers information related to incidence, prevalence, and survival from specific geographic areas that represent 28 percent of the population, as well as compiles related reports and reports on the national cancer mortality rates. Their aim is to provide information related to cancer statistics and decrease the burden of cancer in the national population. SEER has been collecting data from cancer cases since 1973.
CDC.gov is the Centers for Disease Control and Prevention primary online communication channel. CDC.gov provides users with credible, reliable health information on Data and Statistics, Diseases and Conditions, Emergencies and Disasters, Environmental Health, Healthy Living, Injury, Violence and Safety,Life Stages and Populations, Travelers' Health, Workplace Safety and Health
TIW’s Warehouse is a centralized, electronic database holding the most current details on the official, or “gold,” record for virtually all cleared and bilateral credit default swap (CDS) contracts outstanding in the global marketplace. The Warehouse contains more than 50,000 accounts representing derivatives counterparties across 95 countries.
The IMLS conducts annual surveys of public and state libraries in the US that have response rates near 100%. Data is compiled for states, library systems, and individual library branches and includes statistics for circulation, visits, staff, expenditures, and more. Data is available in two formats: MS Access and flat file, plain text. Data for museums is now included.