Files in this item
Download all files in item (17.26 MB)
This item is Publicly Available and licensed under: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- kas.term.json
- Size
- 13.47 MB
- Format
- Unknown
- Description
- Lexicon in JSON format
- MD5
- d162802ca09cd12d3b624d353ec22ed9
- Name
- kas.term.csv
- Size
- 3.47 MB
- Format
- CSV file
- Description
- Lexicon in CSV format
- MD5
- 434795ea3191e24c1627f7a28726cd20
- Name
- kas.term.txt
- Size
- 1.42 KB
- Format
- Text file
- Description
- Attribute descriptions
- MD5
- 3b4ea1dfab0b7bd725f254b3c02cd9de
Attribute descriptions
document_id - ID of the document (PhD thesis) the term candidate is extracted from
area - One of the three scientific areas the PhD thesis covers (Kemija: Chemistry, Politologija: Political Science, Računalništvo: Computer Science)
annotation_round - The annotation round in which the term candidate was annotated
lemma_sequence - Sequence of lemmas of the term candidate
most_frequent_sequence - Sequence of most frequent tokens of the term candidate (does not have to be the canonical form)
pattern - Morphosyntactic pattern the term candidate satisfies
length - Length of the term candidate
annotator_1 - Response of annotator 1 (annotator number is a pseudoidentifier of a human annotator throughout one area, different annotators were used for each area)
annotator_2 - Response of annotator 2 (t_termin: term, x_izvenpodročni: out-of-domain term, z_znanstveno: scientific term, n_nerelevantno: no term)
annotator_3 - Response of annotator 3
annotator_4 - Response of annotator 4
f . . .
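The attributes described above suggest a simple way to summarise the annotations in kas.term.csv. The following is only a sketch under assumptions that the listing does not confirm: that the CSV has a header row whose column names match the attribute names, that it is comma-delimited and UTF-8 encoded, and that annotator responses sit in the four columns annotator_1 to annotator_4 (the attribute list is truncated, so there may be more columns). The helper names `read_rows`, `majority_label`, and `label_counts_by_area` are hypothetical.

```python
import csv
from collections import Counter

# Assumed column names, taken from the attribute descriptions.
ANNOTATOR_COLUMNS = ["annotator_1", "annotator_2", "annotator_3", "annotator_4"]


def read_rows(path, delimiter=","):
    """Yield one dict per term candidate. The delimiter is an assumption."""
    with open(path, newline="", encoding="utf-8") as handle:
        yield from csv.DictReader(handle, delimiter=delimiter)


def majority_label(row):
    """Return the most common non-empty annotator response, or None on a tie."""
    votes = Counter(row[c] for c in ANNOTATOR_COLUMNS if row.get(c))
    if not votes:
        return None
    (label, count), *rest = votes.most_common(2)
    if rest and rest[0][1] == count:
        return None  # the two leading labels are tied
    return label


def label_counts_by_area(rows):
    """Count majority labels separately for each value of the 'area' column."""
    counts = {}
    for row in rows:
        label = majority_label(row)
        if label is not None:
            counts.setdefault(row["area"], Counter())[label] += 1
    return counts


if __name__ == "__main__":
    import os

    if os.path.isfile("kas.term.csv"):
        for area, counter in label_counts_by_area(read_rows("kas.term.csv")).items():
            print(area, dict(counter))
```

Ties are reported as None rather than broken arbitrarily, since the listing does not say how disagreements between annotators should be resolved.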
- Name
- Navodila_za_ocenjevanje_terminoloskih_kandidatov_KAS.pdf
- Size
- 331.51 KB
- Format
- Description
- Guidelines for annotation of term candidates (in Slovenian)
- MD5
- a3ee1395fc0557872d5e33bd94af25d9
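The MD5 values listed for each file can be used to check that a download is intact. A minimal sketch, assuming the files were saved under their listed names in one local directory; the helper names `md5_of` and `verify_downloads` are hypothetical, and the checksums are copied from the listing above.

```python
import hashlib
from pathlib import Path

# MD5 checksums as given in the file listing.
EXPECTED_MD5 = {
    "kas.term.json": "d162802ca09cd12d3b624d353ec22ed9",
    "kas.term.csv": "434795ea3191e24c1627f7a28726cd20",
    "kas.term.txt": "3b4ea1dfab0b7bd725f254b3c02cd9de",
    "Navodila_za_ocenjevanje_terminoloskih_kandidatov_KAS.pdf":
        "a3ee1395fc0557872d5e33bd94af25d9",
}


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed file to True (match), False (mismatch), or None (missing)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = (md5_of(path) == expected) if path.is_file() else None
    return results


if __name__ == "__main__":
    status_text = {True: "OK", False: "MISMATCH", None: "missing"}
    for name, status in verify_downloads().items():
        print(f"{status_text[status]:8} {name}")
```

Reading in chunks keeps memory use flat, which matters little for these file sizes but costs nothing.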