CLARIN:EL Research Infrastructure uses the CLARIN-SHARE metadata model for the description and documentation of Language Resources which is based on the META-SHARE metadata model.
The central entity of the CLARIN-SHARE ontology is the Language Resource per se. However, in the ontology, LRs are linked to other satellite entities such as
- reference documents related to the LR (papers, reports, manuals etc.),
- persons/organizations involved in its creation and use (creators, distributors etc.),
- related projects and activities (funding projects, activities of usage etc.),
- accompanying licenses, etc.
The interconnection between the LR and these satellite entities pictures the LR’s lifecycle from production to use.
LRs are classified along two main classification axes: Resource Type and Media Type (i.e. the medium on which the LR is implemented).
Each LR may take more than one mediaType values, since LRs can consist of parts belonging to different types of media: e.g., a multimodal corpus includes a video part (moving image), an audio part (dialogues) and a text part (subtitles and/or transcription of the dialogues). The mediaType values are: text, audio, video, image, textNumerical and textNgram. More information about the CLARIN-SHARE metadata model can be found here.