Show simple item record

 
dc.contributor.author Krsnik, Luka
dc.contributor.author Robnik-Šikonja, Marko
dc.contributor.author Šef, Tomaž
dc.contributor.author Krek, Simon
dc.date.accessioned 2018-05-08T01:59:30Z
dc.date.available 2018-05-08T01:59:30Z
dc.date.issued 2018-05-08
dc.identifier.uri http://hdl.handle.net/11356/1186
dc.description This lexicon is an extended version of Sloleks 1.2, http://hdl.handle.net/11356/1039. It contains all the original data from Sloleks with added information about the stress of each word form, which is included in two ways: information about stress location only, and information about stress location and type. Stress assignment was performed automatically, with algorithms based on deep neural networks which correctly predicted accent location in 91.5% and combined accent type and location in 88.5% of test data. Therefore not all accents are correct. This updated 1.1 version of the lexicon contains stress asignments with an improved algorithm, which reduces the error by about 1% against the previous 1.0 version.
dc.language.iso slv
dc.publisher Faculty of Computer and Information Science, University of Ljubljana
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.relation.isreferencedby http://videolectures.net/jota_krsnik_napovedovanje_naglasa/
dc.relation.isreferencedby https://repozitorij.uni-lj.si/IzpisGradiva.php?id=98276
dc.relation.replaces http://hdl.handle.net/11356/1156
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.label PUB
dc.source.uri https://gitea.cjvt.si/lkrsnik/stress_asignment
dc.subject word stress
dc.title Automatically stress labelled morphological lexicon Sloleks 1.2, version 1.1
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType computationalLexicon
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Luka Krsnik krsnik.luka92@gmail.com Faculty of Computer and Information Science, University of Ljubljana
size.info 2774745 words
size.info 100805 entries
files.count 2
files.size 58624194


 Files in this item

 Download all files in item (55.91 MB)
Icon
Name
accented_sloleks2.xml.zip
Size
37.22 MB
Format
application/zip
Description
Sloleks with accented words in LMF XML format (PoS tags in Slovenian).
MD5
7c6b102647fb1328677c23ab9d2dacae
 Download file  Preview
 File Preview  
    • accented_sloleks2.xml1 GB
Icon
Name
accented_sloleks.zip
Size
18.69 MB
Format
application/zip
Description
Sloleks with accented words in tabular format (PoS tags in Slovenian).
MD5
2ba78fc7395631541f6a323b634869cb
 Download file  Preview
 File Preview  
    • accented_sloleks.tab165 MB

Show simple item record