Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 221 result(s)
The Digital South Asia Library provides digital materials for reference and research on South Asia to scholars, public officials, business leaders, and other users. This program builds upon a two-year pilot project funded by the Association of Research Libraries' Global Resources Program with support from the Andrew W. Mellon Foundation.
The Mutopia Project offers sheet music editions of classical music for free download. These are based on editions in the public domain, and include works by Bach, Beethoven, Chopin, Handel, Mozart, and many others. A team of volunteers are involved in typesetting the music by computer using the LilyPond software. A growing number of modern editions, arrangements and new music are also available for download. The respective editors, arrangers and composers have chosen to make these works freely available.
Bildarchiv Foto Marburg is Germany's documentation center for art history. Its mission is to collect, index and make available photographs related to European art and architecture, as well as to conduct research on the history, practice and theory of how visual cultural assets are passed on, especially the accompanying transformation process as it relates to the media, the conditions of storing knowledge in visual form, and the significance to society of remembering visual culture. The inventory of Bildarchiv Foto Marburg, the greater part of which is digitally processed, and the inventories of further cultural organizations can be viewed on the internet from the image database: Image Index of Art and Architecture:|home
The CRC806-Database platform is the Research Data Management infrastructure of the SFB / CRC 806. The infrastructure is implemented using Open Source software, and implements Open Science, Open Access and Open Data principles. The Collaborative Research Centre (CRC; ‘Sonderforschungsbereich’ or SFB) is designed to capture the complex nature of chronology, regional structure, climatic, environmental and socio-cultural contexts of major intercontinental and transcontinental events of dispersal of Modern Man from Africa to Western Eurasia, and particularly to Europe (Cited from introductory text on:
It is the objective of our motion capture database HDM05 to supply free motion capture data for research purposes. HDM05 contains more than three hours of systematically recorded and well-documented motion capture data in the C3D as well as in the ASF/AMC data format. Furthermore, HDM05 contains for more than 70 motion classes in 10 to 50 realizations executed by various actors.
Sound and Vision has one of the largest audiovisual archives in Europe. The institute manages over 70 percent of the Dutch audiovisual heritage. The collection contains more than a million hours of television, radio, music and film from the beginning in 1898 until today. All programs of the Dutch public broadcasters come in digitally every day. Individuals and institutions entrust their collection to Sound and Vision as well. The institute ensures that the material is optimally preserved for (re)use. Broadcasters, producers and editors use the archive for the creation of new programs. The collection is also used to develop products and services for a wide audience, such as exhibitions, iPhone applications, DVD boxes and various websites. The collection of Sound and Vision contains the complete radio and television archives of the Dutch public broadcasters; films of virtually every leading Dutch documentary maker; newsreels; the national music depot; various audiovisual corporate collections; advertising, radio and video material of cultural and social organizations, of scientific institutes and of all kinds of educational institutions. There are also collections of images and articles from the history of Dutch broadcasting itself, like the elaborate collection of historical television sets.
COW seeks to facilitate the collection, dissemination, and use of accurate and reliable quantitative data in international relations. Key principles of the project include a commitment to standard scientific principles of replication, data reliability, documentation, review, and the transparency of data collection procedures. More specifically, we are committed to the free public release of data sets to the research community, to release data in a timely manner after data collection is completed, to provide version numbers for data set and replication tracking, to provide appropriate dataset documentation, and to attempt to update, document, and distribute follow-on versions of datasets where possible. We intend to use our website as the center of our data distribution efforts, to serve as central site for collection of possible error information and questions, to provide a forum for interaction with users of Correlates of War data, and as a way for the international relations community to contribute to the continuing development of the project.
The focus of PolMine is on texts published by public institutions in Germany. Corpora of parliamentary protocols are at the heart of the project: Parliamentary proceedings are available for long stretches of time, cover a broad set of public policies and are in the public domain, making them a valuable text resource for political science. The project develops repositories of textual data in a sustainable fashion to suit the research needs of political science. Concerning data, the focus is on converting text issued by public institutions into a sustainable digital format (TEI/XML).
This record is combined with 'NASA Socioeconomic Data and Applications Center' (see: ) The World Data Center for Human Interactions in the Environment has been superseded by the NASA Socioeconomic Data and Applications Center (SEDAC), which is a regular member of the World Data System (WDS). The International Council for Science (ICSU) replaced the World Data Centers (WDC) with the WDS, which supports the provision of trusted scientific data services by certifying its members to ensure that they maintain the organizational capabilities and infrastructure for managing the data products and services that they offer. SEDAC focuses on human interactions in the environment and is one of the Distributed Active Archive Centers (DAACs) in the NASA Earth Observing System Data and Information System (EOSDIS). The NASA Earth Science Data and Information System (ESDIS) Project, a WDS Network Member, manages the EOSDIS science systems.
ORTOLANG is an EQUIPEX project accepted in February 2012 in the framework of investissements d’avenir. Its aim is to construct a network infrastructure including a repository of language data (corpora, lexicons, dictionaries etc.) and readily available, well-documented tools for its processing. Expected outcomes comprize: promoting research on analysis, modelling and automatic processing of our language to their highest international levels thanks to effective resource pooling; facilitating the use and transfer of resources and tools set up within public laboratories to industrial partners, notably SMEs which often cannot develop such resources and tools for language processing given the cost of investment; promoting French language and the regional languages of France by sharing expertise acquired by public laboratories. ORTOLANG is a service for the language, which is complementary to the service offered by Huma-Num (très grande infrastructure de recherche). Ortolang gives access to SLDR for speech, and CNRTL for text resources.
York Digital Library (YODL) is a University-wide Digital Library service for multimedia resources used in or created through teaching, research and study at the University of York. YODL complements the University's research publications, held in White Rose Research Online and PURE, and the digital teaching materials in the University's Yorkshare Virtual Learning Environment. YODL contains a range of collections, including images, past exam papers, masters dissertations and audio. Some of these are available only to members of the University of York, whilst other material is available to the public. YODL is expanding with more content being added all the time
Historic Environment Scotland was formed in October 2015 following the merger between Historic Scotland and The Royal Commission on the Ancient and Historical Monuments of Scotland. Historic Environment Scotland is the lead public body established to investigate, care for and promote Scotland’s historic environment. We lead and enable Scotland’s first historic environment strategy Our Place in Time, which sets out how our historic environment will be managed. It ensures our historic environment is cared for, valued and enhanced, both now and for future generations.
The scores and libretti in this Virtual Collection include first and early editions and manuscript copies of music from the eighteenth and early nineteenth centuries by J.S. Bach and Bach family members, Mozart, Schubert and other composers, as well as multiple versions of nineteenth century opera scores, seminal works of musical modernism, and music of the Second Viennese School. Many, such as variant editions of nineteenth century operas and related libretti, fall into intellectually related sets that are meant to be seen and used together. As a group, they give scholars a window into the study of historical performance practice that cannot be duplicated using the holdings of any one other library.
The Population Research in Sexual Minority Health (PRISM) Data Archive is a collaborative project of the Center for Population Research in LGBT Health and the Inter-university Consortium for Political and Social Research (ICPSR). The PRISM data archive project is a primary initiative of the Center. PRISM makes high quality datasets useful for analysis of issues affecting sexual and gender minority populations in the United States available researchers, scholars, educators and practitioners.
The project is set up in order to improve the infrastructure for text-based linguistic research and development by building a huge, automatically annotated German text corpus and the corresponding tools for corpus annotation and exploitation. DeReKo constitutes the largest linguistically motivated collection of contemporary German texts, contains fictional, scientific and newspaper texts, as well as several other text types, contains only licenced texts, is encoded with rich meta-textual information, is fully annotated morphosyntactically (three concurrent annotations), is continually expanded, with a focus on size and stratification of data, may be analyzed free of charge via the query system COSMAS II, serves as a 'primordial sample' from which users may draw specialized sub-samples (socalled 'virtual corpora') to represent the language domain they wish to investigate.
The aim of the project was to compile a representative computerized corpus of German for the period 1650-1800. This is the first such corpus of early modern German and it is intended as a primary research resource in a number of disciplines. Its structure deliberately parallels that of extant historical corpora of English in order to facilitate systematic comparative studies. The regional dimension which was an essential feature of the projects also provides information about the link between language and changes in the relative cultural and political areas within Germany.
The CLARIN­-D repository at the University of Leipzig offers long­term preservation of digital resources, along with their descriptive metadata. The mission of the repository is to ensure the availability and long­term preservation of resources, to preserve knowledge gained in research, to aid the transfer of knowledge into new contexts, and to integrate new methods and resources into university curricula. Among the resources currently available in the Leipzig repository are a set of corpora of the Leipzig Corpora Collection (LCC), based on newspaper, Wikipedia and Web text. Furthermore several REST-based webservices are provided for a variety of different NLP-relevant tasks
The repository is part of the eScience infrastructure of the University of Tübingen, which is a core facility that strongly cooperates with the library and computing center of the university. Integration of the repository into the national CLARIN-D and international CLARIN infrastructures gives it wide exposure, increasing the likelihood that the resources will be used and further developed beyond the lifetime of the projects in which they were developed. Among the resources currently available in the Tübingen Center Repository, researchers can find widely used treebanks of German (e.g. TüBa-D/Z), the German wordnet (GermaNet), the first manually annotated digital treebank (Index Thomisticus), as well as descriptions of the tools used by the WebLicht ecosystem for natural language processing.
Welcome to the UCLA Phonetics Lab Archive. For over half a century, the UCLA Phonetics Laboratory has collected recordings of hundreds of languages from around the world, providing source materials for phonetic and phonological research, of value to scholars, speakers of the languages, and language learners alike. The materials on this site comprise audio recordings illustrating phonetic structures from over 200 languages with phonetic transcriptions, plus scans of original field notes where relevant.
In the Wolfenbüttel Digital Library the Herzog August Bibliothek presents in digital facsimile selected items from its collections which are rare, outstanding, frequently used, or currently most relevant for research. All digitized titles may be accessed not only here, but also via the PICA-OPAC as long as they are monographs. The OPAC allows you to search for digitized books separately by limiting the search options within the database using the term Online Resources. Projects which provide additional indexing comprise a project-specific database, an inventory of digitized titles, information about tools and techniques, and references to literature. Here the main objective is to provide search facilities outside the scope of usual bibliographic description, such as page-related indexing.
Isidore is a platform of search allowing the access to digital data of Humanities and Social Sciences. Open to all and especially to teachers, researchers, PhD students, and students, it relies on the principles of Web of data and provides access to data in free access (open access).
CLARIN.SI is the Slovenian node of the European CLARIN (Common Language Resources and Technology Infrastructure) Centers. The CLARIN.SI repository is hosted at the Jožef Stefan Institute and offers long-term preservation of deposited linguistic resources, along with their descriptive metadata. The integration of the repository with the CLARIN infrastructure gives the deposited resources wide exposure, so that they can be known, used and further developed beyond the lifetime of the projects in which they were produced. Among the resources currently available in the CLARIN.SI repository are the multilingual MULTEXT-East resources, the CC version of Slovenian reference corpus Gigafida, the morphological lexicon Sloleks, the IMP corpora and lexicons of historical Slovenian, as well as many other resources for a variety of languages. Furthermore, several REST-based web services are provided for different corpus-linguistic and NLP tasks.