What's New
lexicalConceptualResource

Description:
This dataset contains lists of delexicalized dependency trees and subtrees extracted from the Slovenian UD corpora SSJ (written) and SST (spoken), version 2.15 (hdl:11234/1-5787), using the STARK tool (github.com/clarinsi/STARK). ...
Ta vnos vsebuje 6 datotek(e) (74.12
MB).
Publicly Available


toolService

Description:
Drevesnik (https://orodja.cjvt.si/drevesnik/) is an online service for querying Slovenian corpora parsed with the Universal Dependencies annotation scheme. It features an easy-to-use query language on the one hand and ...
Ta vnos vsebuje 1 datoteko (4.45
MB).
Publicly Available
toolService

Description:
STARK is a highly customizable tool designed for extracting different types of syntactic structures (trees) from parsed corpora (treebanks), aimed at corpus-driven linguistic investigations of syntactic and lexical phenomena ...
Ta vnos vsebuje 1 datoteko (3.17
MB).
Publicly Available
Največ ogledov
V preteklem tednu
lexicalConceptualResource

Description:
A lexicon of 751 emoji characters with automatically assigned sentiment.
The sentiment is computed from 70,000 tweets, labeled by 83 human annotators
in 13 European languages.
The process and analysis of emoji sentiment ...
Ta vnos vsebuje 3 datotek(e) (93.95
KB).
Publicly Available



lexicalConceptualResource

Description:
A list of headwords from the collection "Besede slovenskega jezika" (Words of Slovenian Language).
Ta vnos vsebuje 1 datoteko (997.48
KB).
Publicly Available



toolService

Description:
This Conformer CTC BPE E2E Automated Speech Recognition model was trained following the NVIDIA NeMo Conformer-CTC fine-tuning recipe (for details see the official NVIDIA NeMo NMT documentation, https://docs.nvidia.com/de ...
Ta vnos vsebuje 1 datoteko (430.87
MB).
Publicly Available