What's New
lexicalConceptualResource

Description:
SNES (Stalno naglašene enote iz Sloleksa; Constantly accentuated units from Sloleks) is a dataset containing Slovene final accentuated word parts (i.e., the ending part of an accentuated word from its last grapheme with ...
This item contains 1 file (525.54
KB).
Publicly Available



corpus

Description:
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 57 publishers. Trendi 2025-07 covers the period from January 2019 to July 2025, complementing the Gigafida ...
This item contains no files.
corpus

Description:
The ParlaSpeech corpora are built from the transcripts of parliamentary proceedings of Croatian, Serbian, Polish, and Czech parliaments available in the ParlaMint 4.0 corpus (http://hdl.handle.net/11356/1859), and the ...
This item contains 10 files (10.16
GB).
Publicly Available



Most Viewed Items
Top Last Week
toolService

Description:
The X-GENRE classifier is a text classification model that can be used for automatic genre identification. The model classifies texts to one of 9 genre labels: Information/Explanation, News, Instruction, Opinion/Argumentation, ...
This item contains 1 file (779.93
MB).
Publicly Available



corpus

Description:
Trilingual parallel corpus on general data protection regulation. The size of the corpus is 54,468 words in English, 42,566 words in Lithuanian, and 47,740 words in Danish.
This item contains no files.
lexicalConceptualResource

Description:
This dictionary has been prepared to support the Syrian Textbook prepared at the University of Vienna.
See also: https://hdl.handle.net/11022/0000-0007-C093-9
This item contains no files.