Files in this item

 Download all files in item (17.26 MB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
kas.term.json
Size
13.47 MB
Format
Unknown
Description
Lexicon in JSON format
MD5
d162802ca09cd12d3b624d353ec22ed9
 Download file
Icon
Name
kas.term.csv
Size
3.47 MB
Format
CSV file
Description
Lexicon in CSV format
MD5
434795ea3191e24c1627f7a28726cd20
 Download file
Icon
Name
kas.term.txt
Size
1.42 KB
Format
Text file
Description
Attribute descriptions
MD5
3b4ea1dfab0b7bd725f254b3c02cd9de
 Download file  Preview
 File Preview  
Attribute descriptions

document_id - ID of the document (PhD thesis) the term candidate is extracted from
area - One of the three scientific areas the PhD thesis covers (Kemija: Chemistry, Politologija: Political Science, Računalništvo: Computer Science)
annotation_round - The annotation round the term candidate was annotated
lemma_sequence - Sequence of lemmas of the term candidate
most_frequent_sequence - Sequence of most frequent tokens of the term candidate (does not have to be the canonical form)
pattern - Morphosyntactic pattern the term candidate satisfies
length - Length of the term candidate
annotator_1 - Response of annotator 1 (annotator number is a pseudoidentifier of a human annotator throughout one area, different annotators were used for each area)
annotator_2 - Response of annotator 2 (t_termin: term, x_izvenpodročni: out-of-domain term, z_znanstveno: scientific term, n_nerelevantno: no term)
annotator_3 - Response of annotator 3
annotator_4 - Response of annotator 4
f . . .
                                            
Icon
Name
Navodila_za_ocenjevanje_terminoloskih_kandidatov_KAS.pdf
Size
331.51 KB
Format
PDF
Description
Guidelines for annotation of term candidates (in Slovenian)
MD5
a3ee1395fc0557872d5e33bd94af25d9
 Download file