
 
dc.contributor.author Čibej, Jaka
dc.contributor.author Kosem, Iztok
dc.date.accessioned 2022-11-15T16:29:12Z
dc.date.available 2022-11-15T16:29:12Z
dc.date.issued 2022-10-28
dc.identifier.uri http://hdl.handle.net/11356/1712
dc.description This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical features) from the Trendi Monitor Corpus of Slovene (http://hdl.handle.net/11356/1590) for the period from 1 January 2021 to 31 December 2021, using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227). The Trendi 2021 frequency list was then compared with the frequency list of words from the Gigafida 2.0 Corpus of Slovene (http://hdl.handle.net/11356/1320), which covers the period between 1991 and 2018, and with the frequency list of words from Trendi for 2019–2020; together these cover the period 1991–2020. The words were compared using the simple maths formula implemented by SketchEngine (see https://www.sketchengine.eu/documentation/simple-maths/). The final list contains lemmas, their lexical features, their absolute and relative frequencies in the first (1991–2020) and second (2021) periods, and the simple maths value indicating whether the word is more frequent in 2021 (simple maths > 1.00) or in 1991–2020 (simple maths < 1.00). For frequency lists of words that are typical of previous years according to the simple maths measure (e.g. 2019 vs. 1991–2018), please refer to earlier versions of this entry.
dc.language.iso slv
dc.publisher Jožef Stefan Institute
dc.relation.replaces http://hdl.handle.net/11356/1705
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://sled.ijs.si/
dc.subject frequency list
dc.subject words
dc.subject monitor corpus
dc.title Frequency list of words from the Trendi corpus 2021
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType wordList
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Jaka Čibej jaka.cibej@ijs.si Jožef Stefan Institute
sponsor Ministry of Culture of the Republic of Slovenia JR-infrastruktura-SJ-2021-2022 SLED - Monitor corpus of Slovene and related resources nationalFunds
size.info 4918940 entries
files.count 1
files.size 26275925
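
The simple maths comparison named in dc.description can be illustrated with a minimal sketch. It assumes the formula as documented by SketchEngine, (fpm_focus + N) / (fpm_reference + N), where fpm is the frequency per million tokens and N is a smoothing parameter. The value N = 1, the function names, and all figures below are assumptions for illustration only; the record does not state which N was used or the corpus sizes.

```python
def per_million(abs_freq: int, corpus_size: int) -> float:
    """Normalise an absolute frequency to occurrences per million tokens."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return abs_freq * 1_000_000 / corpus_size


def simple_maths(fpm_focus: float, fpm_reference: float, n: float = 1.0) -> float:
    """Simple maths score: (fpm_focus + n) / (fpm_reference + n).

    n is the smoothing parameter; n = 1.0 is an assumed default here,
    not a value confirmed by the dataset record.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    return (fpm_focus + n) / (fpm_reference + n)


# Hypothetical figures, not taken from the dataset.
fpm_2021 = per_million(abs_freq=500, corpus_size=100_000_000)
fpm_1991_2020 = per_million(abs_freq=1_000, corpus_size=1_000_000_000)

score = simple_maths(fpm_2021, fpm_1991_2020)
print(score)  # 3.0, i.e. > 1.00, so the word is more frequent in 2021
```

A larger N damps the scores of rare words, so the ranking of the list depends on the smoothing value chosen.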


 Files in this item

This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Name: sled_words_2021_vs_1991-2020.zip
Size: 25.06 MB
Format: application/zip
Description: sled_words_2021_vs_1991-2020
MD5: 2767e1532c535fae74f33462048d73af
 File Preview
    • sled_words_2021_vs_1991-2020.tsv (248 MB)
    • 00README.txt (1 kB)
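
The MD5 value listed for the archive can be used to check a download. The following is a minimal sketch using Python's standard hashlib module; the local file path and the helper names md5_of_file and verify_download are assumptions for illustration, not part of the repository's tooling.

```python
import hashlib
from pathlib import Path

# MD5 checksum as listed in this record for the zip archive.
EXPECTED_MD5 = "2767e1532c535fae74f33462048d73af"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path: the archive saved in the current directory.
    archive = Path("sled_words_2021_vs_1991-2020.zip")
    if archive.exists():
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    else:
        print("archive not found")
```

MD5 is suitable here only as an integrity check against transfer errors, not as protection against deliberate tampering.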
