infrastructure for language resources, technologies and services

to the clarin:el website



The National Center of Social Research (EKKE) has joined the clarin:el Research Infrastructure, an open network of repositories whose members are actively involved in language resources and technologies.

Digital language resources from the field of Social Sciences will be integrated in the...


South Africa officially joined CLARIN ERIC as an Observer in October 2018.

South Africa will be represented by North-West University, where the South African Centre for Digital Language Resources (SADiLaR) is based. SADiLaR has an enabling function, with a...


As of November 2018, Iceland is the fourth country to have joined CLARIN ERIC as an Observer.

The Árni Magnússon Institute for Icelandic Studies will represent Iceland in CLARIN ERIC and will lead the Icelandic CLARIN consortium. The consortium will be formally established...

CLARIN mobility grants help researchers and developers fund short visits (typical duration: one week) between representatives of CLARIN sites to collaborate on building and using the CLARIN infrastructure. The mobility grants are designed to promote: sharing of expertise between...


30/09/2019 to 02/10/2019
The CLARIN Annual Conference is the main annual event for those working on the construction and operation of CLARIN across Europe, as well as for representatives of the communities of use in the humanities and social sciences. In 2019, the CLARIN Annual Conference (CLARIN2019) will take place on...
21/08/2019 to 24/08/2019
Developments in historical corpus linguistics have followed a route similar to that of corpus-based research on present-day languages: from the creation of small reference corpora to increasingly larger databases, and from text-only to richly annotated resources. However, historical data have always posed...

Resource of the month

A rare collection of medieval texts from the 4th to the 16th century A.D. is available through the clarin:el infrastructure. Discover the whole collection by visiting the Institutional Repository of the Aegean University.

Greek Medieval Texts is a text corpus of about 3.5 million words. The texts are classified into the following categories: religious (563 records), poetical-literary (79 records), political-historical (77 records), hymns (41 records) and epigrams (164 records).

The full metadata description ...


  • language resources
  • web services
  • standards
  • infrastructure
  • dictionaries
  • terminology
  • metadata
  • lexical resources
  • open access
  • sharing
  • video
  • sound
  • language processing
  • datasets
  • availability
  • licence
  • lexical
  • alignment
  • digital resources
  • download
  • corpora