  • CLARIN.SI repository
Selected Filters
 Author: Ljubešić, Nikola
Limit your search

Author  
    • Erjavec, Tomaž (41)
    • Fišer, Darja (24)
    • Toral, Antonio (22)
    • Esplà-Gomis, Miquel (21)
    • Rupnik, Peter (21)
    • Kuzman, Taja (17)
    • Bañón, Marta (16)
    • Forcada, Mikel L. (16)
    • García-Romero, Cristian (16)
    • Pla Sempere, Leopoldo (16)
    • Ramírez-Sánchez, Gema (16)
    • Suchomel, Vít (16)
    • van der Werff, Tobias (16)
    • van Noord, Rik (16)
    • Zaragoza, Jaume (16)
    • Borovič, Mladen (9)
    • Boškovič, Borko (9)
    • Dobrovoljc, Kaja (9)
    • Ferme, Marko (9)
Subject  
    • language model (31)
    • web corpus (27)
    • computer-mediated communication (20)
    • multilingual (17)
    • part-of-speech tagging (17)
    • lemmatisation (16)
    • parallel corpus (16)
    • TEI (16)
    • manual annotation (11)
    • academic writing (10)
    • named entities (10)
    • word normalisation (9)
    • BSc/BA theses (8)
    • MSc/MA theses (8)
    • PhD theses (8)
    • collocations (7)
    • named entity recognition (7)
    • parsing (7)
    • terminology (7)
    • word embeddings (6)
Rights  
    • PUB (104)
    • ACA (16)
Language (ISO)  
    • Slovenian (51)
    • Croatian (37)
    • English (31)
    • Serbian (22)
    • Bulgarian (10)
    • Bosnian (7)
    • Macedonian (7)
    • Dutch (6)
    • Finnish (5)
    • Icelandic (5)
    • Montenegrin (5)
    • Spanish (5)
    • Turkish (5)
    • Czech (4)
    • Danish (4)
    • French (4)
    • Hungarian (4)
    • Italian (4)
    • Latvian (4)
    • Lithuanian (4)
Type  
    • text (88)
    • corpus (73)
    • toolService (33)
    • lexicalConceptualResource (16)
    • audio (1)
Contain Files  
    • yes (120)
    • no (2)

Showing 1 through 60 out of 122 results


  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of academic Slovene KAS 2.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Author(s):
    Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 4 files (13.71 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Abstracts from the KAS corpus KAS-Abs 2.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Author(s):
    Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 1 file (83.48 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.1
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola ; Zdravkova, Katerina ; Stojanoska, Sanja ; Erjavec, Tomaž and Krsnik, Luka
     This item contains 2 files (146.86 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Bartolini, Roberto ; Cimino, Andrea ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (23.37 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (2.17 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of standard Serbian 1.2
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (87.49 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian Twitter training corpus ReLDI-NormTagNER-sr 2.1
    (Jožef Stefan Institute / 2019-07-28)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.51 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of standard Croatian 1.2
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (81.99 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Inflectional lexicon srLex 1.3
    (Jožef Stefan Institute / 2019-03-31)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (54.16 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Tag 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka ; Arhar Holdt, Špela ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 7 files (5.68 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Slovenian 1.3
    (Jožef Stefan Institute / 2022-01-07)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (313.32 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of standard Bulgarian 1.1
    (Jožef Stefan Institute; IICT-BAS / 2020-06-24)
    
    Author(s):
    Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 1 file (22.55 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Croatian 1.1
    (Jožef Stefan Institute / 2020-07-17)
    
    Author(s):
    Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (89.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Bulgarian 1.1
    (Jožef Stefan Institute; IICT-BAS / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola ; Osenova, Petya ; Simov, Kiril and Krsnik, Luka
     This item contains 2 files (167.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Croatian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (178.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Serbian 1.1
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Ljubešić, Nikola and Štefanec, Vanja
     This item contains 1 file (90.05 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Serbian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 2 files (160.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Slovenian 1.1
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (38.07 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.1
    (Jožef Stefan Institute / 2021-11-17)
    
    Author(s):
    Ljubešić, Nikola ; Fišer, Darja ; Erjavec, Tomaž and Šulc, Ajda
     This item contains 2 files (4.48 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Inflectional lexicon hrLex 1.3
    (Jožef Stefan Institute / 2019-03-31)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (51.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of standard Slovenian 1.4
    (Jožef Stefan Institute / 2022-01-10)
    
    Author(s):
    Ljubešić, Nikola and Krsnik, Luka
     This item contains 1 file (1.93 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    SimLex-999 Slovenian translation SimLex-999-sl 1.0
    (University of Ljubljana / 2020-05-15)
    
    Author(s):
    Pollak, Senja ; Vulić, Ivan ; Pelicon, Andraž ; Repar, Andraž ; Armendariz, Carlos ; Purver, Matthew ; Ljubešić, Nikola
     This item contains 3 files (37.3 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    A Resource for Evaluating Graded Word Similarity in Context: CoSimLex
    (Queen Mary University / 2020)
    
    Author(s):
    Armendariz, Carlos ; Purver, Matthew ; Ulčar, Matej ; Pollak, Senja ; Ljubešić, Nikola ; Robnik-Šikonja, Marko ; Granroth-Wilding, Mark ; Vaik, Kristiina
     This item contains 5 files (486.73 KB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Machine Translation datasets from the KAS corpus KAS-MT 1.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Author(s):
    Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 1 file (182.14 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Summarization datasets from the KAS corpus KAS-Sum 1.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Author(s):
    Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 1 file (4.11 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian Twitter hate speech dataset IMSyPP-sl
    (Jožef Stefan Institute / 2021-02-17)
    
    Author(s):
    Kralj Novak, Petra ; Mozetič, Igor and Ljubešić, Nikola
     This item contains 4 files (5.19 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Abstracts from the KAS corpus KAS-Abs 1.0
    (Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor / 2021-03-31)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 1 file (178.99 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Finnish web corpus fiWaC 1.0
    (Jožef Stefan Institute / 2016-09-20)
    
    Author(s):
    Ljubešić, Nikola ; Pirinen, Tommi and Toral, Antonio
     This item contains 38 files (15.28 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian-English parallel corpus srenWaC 1.0
    (Jožef Stefan Institute / 2016-03-09)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (70.94 MB).
     
    Academic Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Dataset and baseline model of moderated content FRENK-STYRIA-24sata 1.0
    (Jožef Stefan Institute / 2018-10-27)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (7.62 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for UD dependency parsing of standard Bulgarian 1.0
    (Jožef Stefan Institute; IICT-BAS / 2020-06-24)
    
    Author(s):
    Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 2 files (476.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    The LiLaH Emotion Lexicon of Croatian, Dutch and Slovene
    (Jožef Stefan Institute; Centre for Computational Linguistics and Psycholinguistics (CLiPS) / 2020-06-04)
    
    Author(s):
    Daelemans, Walter ; Fišer, Darja ; Franza, Jasmin ; Kranjčić, Denis ; Lemmens, Jens ; Ljubešić, Nikola ; Markov, Ilia ; Popič, Damjan
     This item contains 1 file (199.85 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Finnish-English parallel corpus fienWaC 1.0
    (Jožef Stefan Institute / 2016-03-09)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (283.67 MB).
     
    Academic Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Academic Slovene (PhD theses) KAS-dr 1.0
    (Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor / 2019-11-28)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 3 files (2.52 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    English-Montenegrin parallel corpus of subtitles Opus-MontenegrinSubs 1.0
    (Jožef Stefan Institute / 2018-03-20)
    
    Author(s):
    Božović, Petar ; Erjavec, Tomaž ; Tiedemann, Jörg ; Ljubešić, Nikola and Gorjanc, Vojko
     This item contains 2 files (12.86 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian language corpus Riznica 0.1
    (Institute of Croatian Language and Linguistics / 2018-03-07)
    
    Author(s):
    Brozović Rončević, Dunja ; Ćavar, Damir ; Ćavar, Małgorzata ; Stojanov, Tomislav ; Štrkalj Despot, Kristina ; Ljubešić, Nikola ; Erjavec, Tomaž
     This item contains 1 file (457.73 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Bulgarian 1.0
    (Jožef Stefan Institute; IICT-BAS / 2020-07-07)
    
    Author(s):
    Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 2 files (107.32 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Academic Slovene (BSc/BA theses) KAS-dipl 1.0
    (Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor / 2019-11-28)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 5 files (27.63 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Academic Slovene (MSc/MA theses) KAS-mag 1.0
    (Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor / 2019-11-28)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 3 files (11.97 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically constructed multiword lexicon slMWELex v0.5
    (Jožef Stefan Institute / 2015)
    
    Author(s):
    Ljubešić, Nikola ; Krek, Simon ; Dobrovoljc, Kaja and Erjavec, Tomaž
     This item contains 1 file (73.96 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically constructed multiword lexicon hrMWELex v0.5
    (Jožef Stefan Institute / 2015)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (152.39 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically constructed multiword lexicon srMWELex v0.5
    (Jožef Stefan Institute / 2015)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (40.26 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for JOS dependency parsing of standard Slovenian 1.0
    (Jožef Stefan Institute / 2020-06-24)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (1.53 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Slovenian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (106.12 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Croatian news portals ENGRI (2014-2018)
    (University of Rijeka, Faculty of Maritime Studies / 2021-03-14)
    
    Author(s):
    Bogunović, Irena ; Kučić, Mario ; Ljubešić, Nikola and Erjavec, Tomaž
     This item contains 12 files (8.48 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Tourism English-Croatian Parallel Corpus 2.0
    (Abu-MaTran project / 2016-01-28)
    
    Author(s):
    Toral, Antonio ; Esplà-Gomis, Miquel ; Klubička, Filip ; Ljubešić, Nikola ; Papavassiliou, Vassilis ; Prokopidis, Prokopis ; Rubino, Raphael ; Way, Andy
     This item contains 1 file (69.36 MB).
     
    Academic Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Slovene ontology of semantic types for nouns SLONEST-noun 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2020-10-26)
    
    Author(s):
    Kosem, Iztok ; Pori, Eva ; Gantar, Polona ; Logar, Nataša ; Krek, Simon ; Laskowski, Cyprian ; Arhar Holdt, Špela ; Čibej, Jaka ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Klemenc, Bojan ; Ljubešić, Nikola
     This item contains 1 file (58.7 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Dataset of normalised Slovene text KonvNormSl 1.0
    (Jožef Stefan Institute / 2016-09-19)
    
    Author(s):
    Ljubešić, Nikola ; Zupan, Katja ; Fišer, Darja and Erjavec, Tomaž
     This item contains 1 file (4.57 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of non-standard Croatian 1.0
    (Jožef Stefan Institute / 2020-08-07)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (46.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for named entity recognition of standard Croatian 1.0
    (Jožef Stefan Institute / 2020-06-19)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (106.34 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Concreteness and imageability lexicon MEGA.HR-Crossling
    (Jožef Stefan Institute; Faculty of Humanities and Social Sciences, University of Zagreb / 2018-05-28)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 1 file (164.76 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Written Standard Slovene Gigafida 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-06-13)
    
    Author(s):
    Krek, Simon ; Erjavec, Tomaž ; Repar, Andraž ; Čibej, Jaka ; Arhar Holdt, Špela ; Gantar, Polona ; Kosem, Iztok ; Robnik-Šikonja, Marko ; Ljubešić, Nikola ; Dobrovoljc, Kaja ; Laskowski, Cyprian ; Grčar, Miha ; Holozan, Peter ; Šuster, Simon ; Gorjanc, Vojko ; Stabej, Marko ; Logar, Nataša
     This item contains no files.

  • toolService
    CLARIN.SI data & tools
    toolService
    The Orange workflow for observing collocation trends ColTrend 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2020-10-26)
    
    Author(s):
    Kosem, Iztok ; Krek, Simon ; Čibej, Jaka ; Gantar, Polona ; Arhar Holdt, Špela ; Logar, Nataša ; Laskowski, Cyprian ; Klemenc, Bojan ; Ljubešić, Nikola ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Pori, Eva
     This item contains 1 file (70.03 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene-English parallel corpus slenWaC 1.0
    (Jožef Stefan Institute / 2016-03-10)
    
    Author(s):
    Ljubešić, Nikola ; Esplà-Gomis, Miquel ; Ortiz Rojas, Sergio ; Klubička, Filip and Toral, Antonio
     This item contains 1 file (94.44 MB).
     
    Academic Use Attribution Required Noncommercial

  • toolService
    CLARIN.SI data & tools
    toolService
    The Orange workflow for observing collocation clusters ColEmbed 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2020-10-26)
    
    Author(s):
    Kosem, Iztok ; Čibej, Jaka ; Ljubešić, Nikola ; Krek, Simon ; Gantar, Polona ; Arhar Holdt, Špela ; Logar, Nataša ; Laskowski, Cyprian ; Klemenc, Bojan ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Pori, Eva
     This item contains 1 file (86.32 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    English YouTube Hate Speech Corpus
    (Jožef Stefan Institute / 2021-10-14)
    
    Author(s):
    Ljubešić, Nikola ; Mozetič, Igor ; Cinelli, Matteo and Kralj Novak, Petra
     This item contains 3 files (30.59 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    The Twitter user dataset for discriminating between Bosnian, Croatian, Montenegrin and Serbian Twitter-HBS 1.0
    (Jožef Stefan Institute / 2022-01-26)
    
    Author(s):
    Ljubešić, Nikola and Rupnik, Peter
     This item contains 1 file (12.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for UD dependency parsing of standard Croatian
    (Jožef Stefan Institute / 2019-10-11)
    
    Author(s):
    Ljubešić, Nikola
     This item contains 2 files (1.13 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Bosnian web corpus bsWaC 1.1
    (Jožef Stefan Institute / 2016-05-12)
    
    Author(s):
    Ljubešić, Nikola and Klubička, Filip
     This item contains 3 files (1.85 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike


Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • Slovenian Language Technologies Society
  • Trojina, Institute for Applied Slovene Studies

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

This platform runs on the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, which is available on GitHub.

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia under the "Research Infrastructures" programme.