Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 157 result(s)
ORTOLANG is an EQUIPEX project accepted in February 2012 in the framework of investissements d’avenir. Its aim is to construct a network infrastructure including a repository of language data (corpora, lexicons, dictionaries etc.) and readily available, well-documented tools for its processing. Expected outcomes comprize: promoting research on analysis, modelling and automatic processing of our language to their highest international levels thanks to effective resource pooling; facilitating the use and transfer of resources and tools set up within public laboratories to industrial partners, notably SMEs which often cannot develop such resources and tools for language processing given the cost of investment; promoting French language and the regional languages of France by sharing expertise acquired by public laboratories. ORTOLANG is a service for the language, which is complementary to the service offered by Huma-Num (très grande infrastructure de recherche). Ortolang gives access to SLDR for speech, and CNRTL for text resources.
<<<!!!<<< This repository is no longer available. >>>!!!>>> see https://beta.ukdataservice.ac.uk/datacatalogue/studies/study?id=7021#!/details and https://ota.bodleian.ox.ac.uk/repository/xmlui/discover?query=germanc&submit=Search&filtertype_1=title&filter_relational_operator_1=contains&filter_1=&query=germanc
Country
It is the objective of our motion capture database HDM05 to supply free motion capture data for research purposes. HDM05 contains more than three hours of systematically recorded and well-documented motion capture data in the C3D as well as in the ASF/AMC data format. Furthermore, HDM05 contains for more than 70 motion classes in 10 to 50 realizations executed by various actors.
CLARIN.SI is the Slovenian node of the European CLARIN (Common Language Resources and Technology Infrastructure) Centers. The CLARIN.SI repository is hosted at the Jožef Stefan Institute and offers long-term preservation of deposited linguistic resources, along with their descriptive metadata. The integration of the repository with the CLARIN infrastructure gives the deposited resources wide exposure, so that they can be known, used and further developed beyond the lifetime of the projects in which they were produced. Among the resources currently available in the CLARIN.SI repository are the multilingual MULTEXT-East resources, the CC version of Slovenian reference corpus Gigafida, the morphological lexicon Sloleks, the IMP corpora and lexicons of historical Slovenian, as well as many other resources for a variety of languages. Furthermore, several REST-based web services are provided for different corpus-linguistic and NLP tasks.
CLARIN-UK is a consortium of centres of expertise involved in research and resource creation involving digital language data and tools. The consortium includes the national library, and academic departments and university centres in linguistics, languages, literature and computer science.
The Phonogrammarchiv is a multi-disciplinary research sound and video archive, covering holdings from all continents. Since its foundation in 1899 the Phonogrammarchiv has been building up its holdings by cooperating with Austrian scholars and archiving their collected material, or by fieldwork conducted by staff members on special topics exploring new fields of methods and contents. The main tasks comprise the production, annotation, cataloguing and long-term preservation of audio-visual field recordings, making the cultural heritage available for future generations and enabling the dissemination of the recordings as well as technical developments in the field of AV recording and storage. Thus the Phonogrammarchiv adds to infrastructural performance valuable to both the scholarly community and the public at large.
Content type(s)
Country
The aim of the research project TOPOI II A-2-4 was to re-evaluate archaeological records and finds resulting from earlier investigations within the context of recent and ongoing research across the region.
Historic Environment Scotland was formed in October 2015 following the merger between Historic Scotland and The Royal Commission on the Ancient and Historical Monuments of Scotland. Historic Environment Scotland is the lead public body established to investigate, care for and promote Scotland’s historic environment. We lead and enable Scotland’s first historic environment strategy Our Place in Time, which sets out how our historic environment will be managed. It ensures our historic environment is cared for, valued and enhanced, both now and for future generations.
Content type(s)
Country
Database of ancient sources concerning Roman Water Law. Specific legal sources, e.g. from the Corpus Iuris Civilis or the Codex Theodosianus, and literary sources, for example from Cicero, Frontinus, Hyginus, Siculus Flaccus or Vitruvius, were collected to give an overview of water related legal problems in ancient Rome. Furthermore, the aim of the database is to classify these sources into different legal topics, in order to facilitate the research for sources concerning specific questions regarding Roman Water Law.
DIAMM (the Digital Image Archive of Medieval Music) is a leading resource for the study of medieval manuscripts. We present images and metadata for thousands of manuscripts on this website. We also provide a home for scholarly resources and editions, undertake digital restoration of damaged manuscripts and documents, publish high-quality facsimiles, and offer our expertise as consultants.
Country
The sources of the data sets include data sets donated by researchers, surveys carried out by SRDA, as well as by government department and other academic organizations. Prior to the release of data sets, the confidentiality and sensitivity of every survey data set are evaluated. Standard data management and cleaning procedures are applied to ensure data accuracy and completeness. In addition, metadata and relevant supplement files are also edited and attached.
The DARIAH-DE repository is a digital long-term archive for human and cultural-scientific research data. Each object described and stored in the DARIAH-DE Repository has a unique and lasting Persistent Identifier (DOI), with which it is permanently referenced, cited, and kept available for the long term. In addition, the DARIAH-DE Repository enables the sustainable and secure archiving of data collections. The DARIAH-DE Repository is not only to DARIAH-DE associated research projects, but also to individual researchers as well as research projects that want to save their research data persistently, referenceable and long-term archived and make it available to third parties. The main focus is the simple and user-oriented access to long-term storage of research data. To ensure its long term sustainability, the DARIAH-DE Repository is operated by the Humanities Data Centre.
Apollo (previously DSpace@Cambridge) is the University of Cambridge’s Institutional Repository (IR), preserving and providing access to content created by members of the University. The repository stores a range of content and provides different levels of access, but its primary focus is on providing open access to the University’s research publications.
The Association of Religion Data Archives (ARDA) strives to democratize access to the best data on religion. Founded as the American Religion Data Archive in 1997 and going online in 1998, the initial archive was targeted at researchers interested in American religion. The targeted audience and the data collection have both greatly expanded since 1998, now including American and international collections and developing features for educators, journalists, religious congregations, and researchers. Data included in the ARDA are submitted by the foremost religion scholars and research centers in the world.
Country
In the Wolfenbüttel Digital Library the Herzog August Bibliothek presents in digital facsimile selected items from its collections which are rare, outstanding, frequently used, or currently most relevant for research. All digitized titles may be accessed not only here, but also via the PICA-OPAC as long as they are monographs. The OPAC allows you to search for digitized books separately by limiting the search options within the database using the term Online Resources. Projects which provide additional indexing comprise a project-specific database, an inventory of digitized titles, information about tools and techniques, and references to literature. Here the main objective is to provide search facilities outside the scope of usual bibliographic description, such as page-related indexing.
The repository is part of the National Research Data Infrastructure initiative Text+, in which the University of Tübingen is a partner. It is housed at the Department of General and Computational Linguistics. The infrastructure is maintained in close cooperation with the Digital Humanities Centre, which is a core facility of the university, colaborating with the library and computing center of the university. Integration of the repository into the national CLARIN-D and international CLARIN infrastructures gives it wide exposure, increasing the likelihood that the resources will be used and further developed beyond the lifetime of the projects in which they were developed. Among the resources currently available in the Tübingen Center Repository, researchers can find widely used treebanks of German (e.g. TüBa-D/Z), the German wordnet (GermaNet), the first manually annotated digital treebank (Index Thomisticus), as well as descriptions of the tools used by the WebLicht ecosystem for natural language processing.
The scores and libretti in this Virtual Collection include first and early editions and manuscript copies of music from the eighteenth and early nineteenth centuries by J.S. Bach and Bach family members, Mozart, Schubert and other composers, as well as multiple versions of nineteenth century opera scores, seminal works of musical modernism, and music of the Second Viennese School. Many, such as variant editions of nineteenth century operas and related libretti, fall into intellectually related sets that are meant to be seen and used together. As a group, they give scholars a window into the study of historical performance practice that cannot be duplicated using the holdings of any one other library.
Welcome to the UCLA Phonetics Lab Archive. For over half a century, the UCLA Phonetics Laboratory has collected recordings of hundreds of languages from around the world, providing source materials for phonetic and phonological research, of value to scholars, speakers of the languages, and language learners alike. The materials on this site comprise audio recordings illustrating phonetic structures from over 200 languages with phonetic transcriptions, plus scans of original field notes where relevant.
Country
The UniSC Research Bank is the institutional research repository for the University of the Sunshine Coast. It provides an open access showcase of the University's scholarly research output ensuring that research is made available to the local, national and international communities. UniSC Research Bank is harvested by search engines, and is also indexed by the National Library of Australia's TROVE. By making research easily accessible, it also facilitates collaboration between researchers. Where possible, access to the full text of the publication is made available, in line with copyright permissions for each output. To access relevant research, use the Browse function, or specific records can be searched for by using the search box. Find research data by filtering by resource type 'Research Dataset'.
The Forensic Linguistic Databank (FoLD) is a permanent, controlled access online repository for forensic linguistic data, including malicious communication data, investigative interview data, and forensic evidence validation data for both speech and text. We broadly understand forensic linguistics as any academic research with a potential to improve the delivery of justice through the analysis of language. FoLD thus comprises a wide range of datasets with relevance to forensic linguistics and language and law, including commercial extortion letters, investigative interviews in police and other contexts, legal documents, forum posts from far-right online groups, and comment threads from political blogs. The intention for the databank is to not only further academic research into forensic linguistics by developing new methods and approaches but also to directly contribute to impact in assisting the delivery of justice. Therefore, research projects using this data will validate methods for forensic analysis, further the effectiveness of interviewing techniques used by British police, and help tackle internet crime and abuse on behalf of law enforcement beneficiaries, such as the National Crime Agency.
In collaboration with other centres in the Text+ consortium and in the CLARIN infrastructure, the CLARIND-UdS enables eHumanities by providing a service for hosting and processing language resources (notably corpora) for members of the research community. CLARIND-UdS centre thus contributes of lifting the fragmentation of language resources by assisting members of the research community in preparing language materials in such a way that easy discovery is ensured, interchange is facilitated and preservation is enabled by enriching such materials with meta-information, transforming them into sustainable formats and hosting them. We have an explicit mission to archive language resources especially multilingual corpora (parallel, comparable) and corpora including specific registers, both collected by associated researchers as well as researchers who are not affiliated with us.