Show simple item record

 
dc.contributor.author Čibej, Jaka
dc.date.accessioned 2024-12-03T10:58:02Z
dc.date.available 2024-12-03T10:58:02Z
dc.date.issued 2024-11-30
dc.identifier.uri http://hdl.handle.net/11356/2000
dc.description ArboSloleks is a dataset containing Slovene word formation trees that have been automatically constructed from word relations (http://hdl.handle.net/11356/1986) extracted from Sloleks 2.0 (http://hdl.handle.net/11356/1230). Each word formation tree begins with a root lexeme from Sloleks (e.g. abolicionizem); morphologically related lexemes are then listed in pairs (original lexeme, related lexeme) along with the levels of word formation (e.g. abolicionizem – abolicionist (Level 1); abolicionist – abolicionistka (Level 2)). Version 1.0 includes 14.918 word formation trees constructed from 66.360 lexeme pairs. It is available in an ad-hoc .txt format – for information on the structure and how to parse the data, please consult 00README.txt.
dc.language.iso slv
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.publisher Faculty of Arts, University of Ljubljana
dc.publisher Faculty of Computer and Information Science, University of Ljubljana
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.subject word formation
dc.subject word relations
dc.subject word formation trees
dc.subject morphological rules
dc.subject morphology
dc.subject derivational morphology
dc.subject derivation
dc.title Dataset of Slovene word formation trees ArboSloleks 1.0
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType lexicon
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Jaka Čibej jaka.cibej@ff.uni-lj.si Faculty of Arts, University of Ljubljana
sponsor ARIS (Slovenian Research and Innovation Agency) NOO PoVeJMo research project (Adaptive Natural Language Processing with Large Language Models) nationalFunds
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
sponsor ARIS (Slovenian Research and Innovation Agency) GC-0002 LLM4DH: Large Language Models for Digital Humanities nationalFunds
size.info 14918 items
size.info 66360 elements
files.count 1
files.size 2652947


 Files in this item

This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
ArboSloleks_1.0.zip
Size
2.53 MB
Format
application/zip
Description
ArboSloleks 1.0 (TXT)
MD5
422873426dd5758ee7abf97e67d6b5dc
 Download file  Preview
 File Preview  

Show simple item record