Search | re3data.org

Filter

Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

Toogle short help

* at the end of a keyword allows wildcard searches
" quotes can be used for searching phrases
+ represents an AND search (default)
| represents an OR search
- represents a NOT operation
( and ) implies priority
~N after a word specifies the desired edit distance (fuzziness)
~N after a phrase specifies the desired slop amount

← Previous
1 (current)
2
Next →

Found 34 result(s)

META-SHARE

Subject(s)

Content type(s)

Country

European Union

META-SHARE, the open language resource exchange facility, is devoted to the sustainable sharing and dissemination of language resources (LRs) and aims at increasing access to such resources in a global scale. META-SHARE is an open, integrated, secure and interoperable sharing and exchange facility for LRs (datasets and tools) for the Human Language Technologies domain and other applicative domains where language plays a critical role. META-SHARE is implemented in the framework of the META-NET Network of Excellence. It is designed as a network of distributed repositories of LRs, including language data and basic language processing tools (e.g., morphological analysers, PoS taggers, speech recognisers, etc.). Data and tools can be both open and with restricted access rights, free and for-a-fee.

Spanish CLARIN K-Centre

Centro-K CLARIN

Subject(s)

Content type(s)

Country

Competence Centre IULA-UPF-CC CLARIN manages, disseminates and facilitates this catalogue, which provides access to reference information on the use of language technology projects and studies in different disciplines, especially with regard to Humanities and Social Sciences. The Catalog relates information that is organized by Áreas, (disciplines and research topics), Projects (of research that use or have used language technologies), Tasks (that make the tools), Tools (of language technology), Documentation (articles regarding the tools and how they are used) and resources such as Corpora (collections of annotated texts) and Lexica (collections of words for different uses).

Kikapu

University of the Western Cape Institutional Research Data Respository

Subject(s)

Content type(s)

Country

South Africa

The University of the Western Cape (UWC) uses Figshare for Institutions for their institutional research data repository. It is called Kikapu, and serves as a repository for storing and disseminating research data.

Språkbanken Text

Språkbanken

Subject(s)

Content type(s)

Country

Språkbanken was established in 1975 as a national center located in the Faculty of Arts, University of Gothenburg. Allén's groundbreaking corpus linguistic research resulted in the creation of one of the first large electronic text corpora in another language than English, with one million words of newspaper text. The task of Språkbanken is to collect, develop, and store (Swedish) text corpora, and to make linguistic data extracted from the corpora available to researchers and to the public.

bonndata

Subject(s)

Content type(s)

Country

Germany

bonndata is the institutional, FAIR-aligned and curated, cross-disciplinary research data repository for the publication of research data for all researchers at the University of Bonn. The repository is fully embedded into the University IT and Data Center and curated by the Research Data Service Center (https://www.forschungsdaten.uni-bonn.de/en). The software that bonndata is based on is the open source software Dataverse (https://dataverse.org)

CLARIN-ERIC

Common Language Resources and Technology Infrastructure - European Research Infrastructure Consortium

Subject(s)

Content type(s)

Country

CLARIN is a European Research Infrastructure for the Humanities and Social Sciences, focusing on language resources (data and tools). It is being implemented and constantly improved at leading institutions in a large and growing number of European countries, aiming at improving Europe's multi-linguality competence. CLARIN provides several services, such as access to language data and tools to analyze data, and offers to deposit research data, as well as direct access to knowledge about relevant topics in relation to (research on and with) language resources. The main tool is the 'Virtual Language Observatory' providing metadata and access to the different national CLARIN centers and their data.

Dipòsit Digital de la Universitat de Barcelona Dades

DD Dipòsit Digital

Subject(s)

Content type(s)

Country

Spain

The Universitat de Barcelona Digital Repository is an institutional resource containing open-access digital versions of publications related to the teaching, research and institutional activities of the UB's teaching staff and other members of the university community, including research data.

datastore by Universität Münster

Subject(s)

Content type(s)

Country

Germany

datastore is the cross-domain research data repository of the University Münster (Germany). In datastore, scientific members of the University Münster can publish their research data following the FAIR principles, including the assignment of a DOI for each dataset as a persistent identifier.

O2 Repositori UOC - Dades

Subject(s)

Content type(s)

Country

Spain

O2 is the UOC's institutional repository. Section 'Dades' contains primary data accompanying documents published in the Research and Institutional communities.

CLARIN-LT

CLARIN-LT Repository

Subject(s)

Content type(s)

Country

Lithuania became a full member of CLARIN ERIC in January of 2015 and soon CLARIN-LT consortium was founded by three partner universities: Vytautas Magnus University, Kaunas Technology University and Vilnius University. The main goal of the consortium is to become a CLARIN B centre, which will be able to serve language users in Lithuania and Europe for storing and accessing language resources.

melbourne.figshare.com

University of Melbourne data repository

Subject(s)

Content type(s)

Country

melbourne.figshare.com is a specialised service that has been tailored according to specific needs and requirements of the University and our community of researchers. The service offered at the University is free to use, provides 100GB of data, and stores all data on the University's storage system.

Repository CLARIN-D Centre CEDIFOR

Centre for the Digital Foundation of Research in the Humanities, Social, and Educational Sciences

Subject(s)

Content type(s)

Country

The CLARIN-D Centre CEDIFOR provides a repository for long-term storage of resources and meta-data. Resources hosted in the repository stem from research of members as well as associated research projects of CEDIFOR. This includes software and web-services as well as corpora of text, lexicons, images and other data.

University of Manchester figshare

Subject(s)

Content type(s)

Country

United Kingdom

The University selected figshare as a general purpose research data repository to enable researchers to share research data, facilitate open research practices and meet the evolving requirements of research funders and academic publishers. This is a public-facing platform for researchers to share their data and build, over time, a comprehensive representation of the research done at the University across all faculties and disciplines.

ANPERSANA

bibliothèque numérique

Subject(s)

Content type(s)

Country

ANPERSANA is the digital library of IKER (UMR 5478), a research centre specialized in Basque language and texts. The online library platform receives and disseminates primary sources of data issued from research in Basque language and culture. As of today, two corpora of documents have been published. The first one, is a collection of private letters written in an 18th century variety of Basque, documented in and transcribed to modern standard Basque. The discovery of the collection, named Le Dauphin, has enabled the emerging of new questions about the history and sociology of writing in the domain of minority languages, not only in France, but also among the whole Atlantic Arc. The second of the two corpora is a selection of sound recordings about monodic chant in the Basque Country. The documents were collected as part of a PhD thesis research work that took place between 2003 and 2012. It's a total of 50 hours of interviews with francophone and bascophone cultural representatives carried out at either their workplace of the informers or in public areas. ANPERSANA is bundled with an advanced search engine. The documents have been indexed and geo-localized on an interactive map. The platform is engaged with open access and all the resources can be uploaded freely under the different Creative Commons (CC) licenses.

ELRA Catalogue of Language Resources

ELRA Catalogue

Subject(s)

Content type(s)

Country

European Union

An increasing number of Language Resources (LT) in the various fields of Human Language Technology (HLT) are distributed on behalf of ELRA via its operational body ELDA, thanks to the contribution of various players of the HLT community. Our aim is to provide Language Resources, by means of this repository, so as to prevent researchers and developers from investing efforts to rebuild resources which already exist as well as help them identify and access those resources.

Czech National Corpus

CNC

Subject(s)

Content type(s)

Country

The aim of the project is systematic mapping of Czech and other languages in comparison with Czech. CNC corpora are accessible to everybody interested in studying the language after free registration.

St. Edward's University institutional repository

St. Edward’s University Figshare

Subject(s)

Content type(s)

Country

Additionally to the institutional repository, current St. Edward's faculty have the option of uploading their work directly to their own SEU accounts on stedwards.figshare.com. Projects created on Figshare will automatically be published on this website as well. For more information, please see documentation

Bath Spa University figshare

BathSPAdata

Subject(s)

Content type(s)

Country

United Kingdom

The University research data repository – BathSPAdata – enables staff to upload their research data into a secure space, and to share this data publicly where appropriate, or where funders or publishers require this as part of their conditions. Resources and toolkits for external use can be made available through this forum, and can be used by Schools, policy makers, business and industry, and the cultural sector.

Kielipankki

The Language Bank of Finland

Subject(s)

Content type(s)

Country

The Language Bank features text and speech corpora with different kinds of annotations in over 60 languages. There is also a selection of tools for working with them, from linguistic analyzers to programming environments. Corpora are also available via web interfaces, and users can be allowed to download some of them. The IP holders can monitor the use of their resources and view user statistics.

Open Research Data Online

ORDO

Subject(s)

Content type(s)

Country

United Kingdom

The figshare service for The Open University was launched in 2016 and allows researchers to store, share and publish research data. It helps the research data to be accessible by storing metadata alongside datasets. Additionally, every uploaded item receives a Digital Object Identifier (DOI), which allows the data to be citable and sustainable. If there are any ethical or copyright concerns about publishing a certain dataset, it is possible to publish the metadata associated with the dataset to help discoverability while sharing the data itself via a private channel through manual approval.

Repositório de Dados de Pesquisa Unifesp

Unifesp Data Repository

Subject(s)

Content type(s)

Country

Brazil

The Unifesp Research Data Repository is a platform for storing, preserving and accessing research data for the institution's academic community.

Center of Estonian Language Resources

Eesti Keeleressursside Keskus

Subject(s)

Content type(s)

Country

The goal of the Center of Estonian Language Resources (CELR) is to create and manage an infrastructure to make the Estonian language digital resources (dictionaries, corpora – both text and speech –, various language databases) and language technology tools (software) available to everyone working with digital language materials. CELR coordinates and organises the documentation and archiving of the resources as well as develops language technology standards and draws up necessary legal contracts and licences for different types of users (public, academic, commercial, etc.). In addition to collecting language resources, a system will be launched for introducing the resources to, informing and educating the potential users. The main users of CELR are researchers from Estonian R&D institutions and Social Sciences and Humanities researchers all over the world via the CLARIN ERIC network of similar centers in Europe. Access to data is provided through different sites: Public Repository https://entu.keeleressursid.ee/public-document , Language resources https://keeleressursid.ee/en/resources/corpora, and MetaShare CELR https://metashare.ut.ee/

Bibliothèques Virtuelles Humanistes

BVH

Subject(s)

Content type(s)

Country

France

The program "Humanist Virtual Libraries" distributes heritage documents and pursues research associating skills in human sciences and computer science. It aggregates several types of digital documents: A selection of facsimiles of Renaissance works digitized in the Central Region and in partner institutions, the Epistemon Textual Database, which offers digital editions in XML-TEI, and Transcripts or analyzes of notarial minutes and manuscripts

clarin:el inventory of language resources and services

Subject(s)

Content type(s)

Country

Greece

clarin:el is the Greek national network of language resources, a nation-wide Research Infrastructure devoted to the sustainable storage, sharing, dissemination and preservation of language resources. CLARIN EL infrastructure, which is a Greek nation-wide Research Infrastructure devoted to the sustainable storage, sharing, dissemination and preservation of language resources (LRs) and aims at increasing access to and augmentation of such resources at a national scale and beyond. It is an open, integrated, secure and interoperable storage, sharing and processing infrastructure for LRs (datasets, tools and processing services) for all domains domains and disciplines where language plays a critical role, notably. CLARIN EL is implemented in the framework of the CLARIN Attiki, national project in support of ESFRI/2006 Research Infrastructures.

ILC-CNR for CLARIN-IT repository

ILC4CLARIN

Subject(s)

Content type(s)

Country

ILC-CNR for CLARIN-IT repository is a library for linguistic data and tools. Including: Text Processing and Computational Philology; Natural Language Processing and Knowledge Extraction; Resources, Standards and Infrastructures; Computational Models of Language Usage. The studies carried out within each area are highly interdisciplinary and involve different professional skills and expertises that extend across the disciplines of Linguistics, Computational Linguistics, Computer Science and Bio-Engineering.

← Previous
1 (current)
2
Next →

Current projects
EOSC FAIR-IMPACT

re3data COREF

To the extent possible under law, re3data.org has waived all copyright and related or neighboring rights to the database entries of re3data.org.
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 4.0 International License .
Cite this service: re3data.org - Registry of Research Data Repositories. https://doi.org/10.17616/R3D last accessed: 2024-04-19