What's New
lexicalConceptualResource

Description:
SrpEngPE - Serbian-English dictionary with terminology in the power engineering domain: automatically extracted from a domain parallel corpus http://jerteh.rs/biblisha/ListaDokumenata.aspx?JCID=13&lng=en , extracted monolingual ...
This item contains no files.
lexicalConceptualResource

Description:
SrpNemLex - Bilingual list of German-Serbian translated pairs of lexical units: automatically extracted from a parallel corpus that contains 14 novels http://jerteh.rs/biblisha/ListaDokumenata.aspx?JCID=11&lng=en , extracted ...
This item contains no files.
lexicalConceptualResource

Description:
This dataset contains Basque lemma-sense pairs with POS tags and definitions, extracted from Wikidata using this query: https://w.wiki/qWH . Other RDF statements related to Basque lexemes can be retrieved, such as links ...
This item contains no files.
Most Viewed Items
Top Last Week
corpus

Description:
The 24sata news portal consists of a main portal with daily news and several smaller portals covering specific topics, such as automotive news, health, culinary content, and lifestyle advice. The dataset contains over ...
This item contains 2 files (1.26 GB).
Publicly Available




corpus

Description:
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in the ...
This item contains 18 files (23.37 GB).
Publicly Available


lexicalConceptualResource

Description:
A lexicon of 751 emoji characters with automatically assigned sentiment. The sentiment is computed from 70,000 tweets, labeled by 83 human annotators in 13 European languages. The process and analysis of emoji sentiment ...
This item contains 3 files (93.95 KB).
Publicly Available


