dc.contributor.author | Čibej, Jaka |
dc.date.accessioned | 2024-12-03T10:58:02Z |
dc.date.available | 2024-12-03T10:58:02Z |
dc.date.issued | 2024-11-30 |
dc.identifier.uri | http://hdl.handle.net/11356/2000 |
dc.description | ArboSloleks is a dataset containing Slovene word formation trees that have been automatically constructed from word relations (http://hdl.handle.net/11356/1986) extracted from Sloleks 2.0 (http://hdl.handle.net/11356/1230). Each word formation tree begins with a root lexeme from Sloleks (e.g. abolicionizem); morphologically related lexemes are then listed in pairs (original lexeme, related lexeme) along with the levels of word formation (e.g. abolicionizem – abolicionist (Level 1); abolicionist – abolicionistka (Level 2)). Version 1.0 includes 14.918 word formation trees constructed from 66.360 lexeme pairs. It is available in an ad-hoc .txt format – for information on the structure and how to parse the data, please consult 00README.txt. |
dc.language.iso | slv |
dc.publisher | Centre for Language Resources and Technologies, University of Ljubljana |
dc.publisher | Faculty of Arts, University of Ljubljana |
dc.publisher | Faculty of Computer and Information Science, University of Ljubljana |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-sa/4.0/ |
dc.rights.label | PUB |
dc.subject | word formation |
dc.subject | word relations |
dc.subject | word formation trees |
dc.subject | morphological rules |
dc.subject | morphology |
dc.subject | derivational morphology |
dc.subject | derivation |
dc.title | Dataset of Slovene word formation trees ArboSloleks 1.0 |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType | lexicon |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Jaka Čibej jaka.cibej@ff.uni-lj.si Faculty of Arts, University of Ljubljana |
sponsor | ARIS (Slovenian Research and Innovation Agency) NOO PoVeJMo research project (Adaptive Natural Language Processing with Large Language Models) nationalFunds |
sponsor | ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds |
sponsor | ARIS (Slovenian Research and Innovation Agency) GC-0002 LLM4DH: Large Language Models for Digital Humanities nationalFunds |
size.info | 14918 items |
size.info | 66360 elements |
files.count | 1 |
files.size | 2652947 |
Files in this item
This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)




- Name
- ArboSloleks_1.0.zip
- Size
- 2.53 MB
- Format
- application/zip
- Description
- ArboSloleks 1.0 (TXT)
- MD5
- 422873426dd5758ee7abf97e67d6b5dc
- ArboSloleks_1.0
- ArboSloleks_1.0.txt11 MB
- 00README.txt3 kB