A valuable treasure of language data (texts, dictionaries and language tools) is now available to the public through the six repositories of the CLARIN:EL network. A multilingual glossary on the Greek crisis, educational texts, sessions of the Greek Parliament (2011-2015), the Greek corpus of student texts and Babiniotis archive are only some of the more than 400 language resources that have already been deposited at the CLARIN:EL network of repositories. Scientists, researchers, students, software developers and any interested parties have now access to a wide range of language resources, and the ability to download and process the language data they are interested in.
The members of the CLARIN:EL network (the Greek part of the CLARIN pan-European research infrastructure which today counts more than 130 members from 18 countries) have set up and maintain distinct but interconnected repositories, inviting researchers to contribute their language resources. The direct response of the research community indicates the dynamic character of the network, which is being enriched on an ongoing basis.
The organisations currently participating in the CLARIN:EL network with an institutional repository are the National and Kapodistrian University of Athens, the Aristotle University of Thessaloniki, the University of the Aegean, the Ionian University and the Centre for the Greek Language.
What kind of LRs can a researcher find in the CLARIN:EL infranstructure?
More than 436 language resources and 35 language tools are hosted on the platform of CLARIN:EL covering a wide range of scientific disciplines and subjects.
In the Repository of the University of Athens, the Babiniotis archive, which might be of great interest to the research community, includes the transcripts of authentic everyday conversations between students of the Department of Linguistics that took place during the period from 2001 to 2006.
In the Repository of the Ionian University, the user of CLARIN:EL can find a multilingual glossary on the Greek crisis. Updated on a regular basis, this glossary contains 820 terms in various languages, representing a valuable research tool.
Furthermore, the Centre for the Greek Language has deposited the Modern Greek Text Corpus of Educational Books which consists of 2 million words.
The Greek text corpus of students (ESKEIMATH), which is based on student examinations’ texts, can be found in the Institutional Repository of the Aristotle University of Thessaloniki.
A wealth of language material is now available through the Institutional Repository of the University of the Aegean. The Corpus of Greek Parliamentary discourse comprises around 28 million words covering the period 2011-2015.
In the Repository of NCSR Demokritos, the user of the CLARIN:EL infrastructure can find, among others, the Text Summarization Tool. This tool produces summaries from multiple different documents combining data mining techniques.
Last but not least, "Athena" RC (responsible for the coordination of the network and for the management of the central aggregator) has a great number of language resources such as corpora, lexical conceptual resources and tools or services. Orossimo Corpus, for example, is a corpus of academic texts belonging to 12 domains (Law, Medicine, Geology, Biology etc.). The terms contained in the texts were extracted for the creation of 15 bilingual terminological glossaries which will soon be available through the CLARIN:EL network. Moreover, the Word frequency counter and the Term extractor tool, which extracts terms from texts, are only a few among many language processing tools offered by the Repository of "Athena" RC.
Individuals wishing to contribute LRs can do so by using the Hosted Resources Repository (HRR), operated by the network coordinator ("Athena" RC).
The CLARIN:EL network of repositories, the Greek infrastructure for the collection, documentation, curation and distribution of high quality language resources for the Greek language aims at expanding its reach, through the promotion of the sharing culture and the cooperation and interconnection with other European research infrastructures.