What's New

 lexicalConceptualResource 
lexicalConceptualResource
Author(s):
Description:
SNES (Stalno naglašene enote iz Sloleksa; Constantly accentuated units from Sloleks) is a dataset containing Slovene final accentuated word parts (i.e., the ending part of an accentuated word from its last grapheme with ...
 This item contains 1 file (525.54 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 corpus 
corpus
Description:
The Trendi corpus is a monitor corpus of Slovenian. It contains news articles from 106 media websites, published by 57 publishers. Trendi 2025-07 covers the period from January 2019 to July 2025, complementing the Gigafida ...
 This item contains no files.
 corpus 
corpus
Description:
The ParlaSpeech corpora are built from the transcripts of parliamentary proceedings of Croatian, Serbian, Polish, and Czech parliaments available in the ParlaMint 4.0 corpus (http://hdl.handle.net/11356/1859), and the ...
 This item contains 10 files (10.16 GB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike

Most Viewed Items

Top Last Week
 toolService 
toolService
Description:
The X-GENRE classifier is a text classification model that can be used for automatic genre identification. The model classifies texts to one of 9 genre labels: Information/Explanation, News, Instruction, Opinion/Argumentation, ...
 This item contains 1 file (779.93 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 corpus 
corpus
Description:
Trilingual parallel corpus on general data protection regulation. The size of the corpus is 54,468 words in English, 42,566 words in Lithuanian, and 47,740 words in Danish.
 This item contains no files.
 lexicalConceptualResource 
lexicalConceptualResource
Description:
This dictionary has been prepared to support the Syrian Textbook prepared at the University of Vienna. See also: https://hdl.handle.net/11022/0000-0007-C093-9
 This item contains no files.