• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Type : corpus      Author : Krek, Simon     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Erjavec, Tomaž (19)
    • Dobrovoljc, Kaja (17)
    • Čibej, Jaka (16)
    • Kosem, Iztok (15)
    • Ponikvar, Primož (10)
    • Arhar Holdt, Špela (9)
    • Ljubešić, Nikola (9)
    • Gantar, Polona (8)
    • Stabej, Marko (8)
    • Šinkec, Mihael (7)
    • Laskowski, Cyprian (5)
    • Ledinek, Nina (5)
    • Munda, Tina (5)
    • Pori, Eva (5)
    • Terčon, Luka (5)
    • Holz, Nanika (4)
    • Jezeršek, Lucija (4)
    • Kavčič, Teja (4)
    • Klemenc, Bojan (4)
    • ... View More
Subject  
    • TEI (9)
    • monitor corpus (7)
    • news corpus (7)
    • temporal trends (7)
    • topic attribution (7)
    • universal dependencies (7)
    • manual annotation (6)
    • part-of-speech tagging (6)
    • CONLL-U (5)
    • named entities (5)
    • dependency treebank (4)
    • developmental corpus (4)
    • multilingual (4)
    • parsing (4)
    • semantic role labelling (4)
    • student writing (4)
    • tokenisation (4)
    • verbal multiword expressions (4)
    • error annotation (3)
    • parallel corpus (3)
    • ... View More
Rights  
    • PUB (19)
    • RES (1)
Language (ISO)  
    • Slovenian (27)
    • Italian (4)
    • Spanish (4)
    • Bulgarian (3)
    • Danish (3)
    • Dutch (3)
    • English (3)
    • Estonian (3)
    • Hungarian (3)
    • Portuguese (3)
    • German (1)
Type  
    • text (26)
    • audio (2)
Contain Files  
    • yes (20)
    • no (8)

Showing 1 through 10 out of 28 results

  • 1
  • 2
  • 3
  •  
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus jos1M 1.2
    (Jožef Stefan Institute / 2019-02-13)
    
    Author(s):
    Erjavec, Tomaž ; Krek, Simon and Dobrovoljc, Kaja
     This item contains 4 files (108.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Developmental corpus Šolar 3.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2022-09-05)
    
    Author(s):
    Arhar Holdt, Špela ; et al.show everyone Arhar Holdt, Špela ; Rozman, Tadeja ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Pori, Eva ; Goli, Teja ; Lavrič, Polona ; Laskowski, Cyprian ; Kocjančič, Polonca ; Klemenc, Bojan ; Krsnik, Luka ; Kosem, Iztok
     This item contains 4 files (194.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus SUK 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-08-22)
    
    Author(s):
    Arhar Holdt, Špela ; et al.show everyone Arhar Holdt, Špela ; Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Gantar, Polona ; Čibej, Jaka ; Pori, Eva ; Terčon, Luka ; Munda, Tina ; Žitnik, Slavko ; Robida, Nejc ; Blagus, Neli ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Kuzman, Taja ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 2 files (45.1 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos 2.1 (transcriptions)
    (Centre for Language Resources and Technologies, University of Ljubljana; Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; ZRC SAZU; Jožef Stefan Institute / 2023-08-28)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Zwitter Vitez, Ana ; Zemljarič Miklavčič, Jana ; Krek, Simon ; Stabej, Marko ; Erjavec, Tomaž ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Majhenič, Simona ; Žgank, Andrej ; Bizjak, Andreja ; Gril, Lucija ; Dobrišek, Simon ; Križaj, Janez ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Trojar, Mitja ; Bernjak, Mitja ; Dretnik, Naum ; Strle, Gregor ; Dobrovoljc, Kaja ; Ljubešić, Nikola ; Rupnik, Peter
     This item contains 4 files (117.83 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel sense-annotated corpus ELEXIS-WSD 1.3
    (Jožef Stefan Institute / 2025-05-06)
    
    Author(s):
    Čibej, Jaka ; et al.show everyone Čibej, Jaka ; Krek, Simon ; Tiberius, Carole ; Martelli, Federico ; Navigli, Roberto ; Kallas, Jelena ; Gantar, Polona ; Koeva, Svetla ; Nimb, Sanni ; Sandford Pedersen, Bolette ; Olsen, Sussi ; Langemets, Margit ; Koppel, Kristina ; Üksik, Tiiu ; Dobrovoljc, Kaja ; Ureña-Ruiz, Rafael ; Sancho-Sánchez, José-Luis ; Lipp, Veronika ; Váradi, Tamás ; Győrffy, András ; Simon, László ; Quochi, Valeria ; Monachini, Monica ; Frontini, Francesca ; Tempelaars, Rob ; Costa, Rute ; Salgado, Ana ; Munda, Tina ; Kosem, Iztok ; Roblek, Rebeka ; Kamenšek, Urška ; Zaranšek, Petra ; Zgaga, Karolina ; Ponikvar, Primož ; Terčon, Luka ; Jensen, Jonas ; Flörke, Ida ; Lorentzen, Henrik ; Troelsgård, Thomas ; Blagoeva, Diana ; Hristov, Dimitar ; Kolkovska, Sia
     This item contains 1 file (11.08 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Monitor corpus of Slovene Trendi 2025-04
    (Jožef Stefan Institute; Centre for Language Resources and Technologies, University of Ljubljana / 2025-05-06)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Čibej, Jaka ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Ljubešić, Nikola ; Ponikvar, Primož ; Šinkec, Mihael ; Krek, Simon
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Developmental corpus Šolar 2.0
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Arhar Holdt, Špela ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Pori, Eva ; Goli, Teja ; Lavrič, Polona ; Laskowski, Cyprian ; Kocjančič, Polonca ; Klemenc, Bojan ; Rozman, Tadeja
     This item contains 2 files (21.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Developmental corpus (without language corrections) Šolar 2.0 Clear
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Arhar Holdt, Špela ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Kocjančič, Polonca ; Laskowski, Cyprian ; Klemenc, Bojan ; Pori, Eva ; Rozman, Tadeja
     This item contains 2 files (29.22 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Error-annotated developmental corpus Šolar 2.0 Error
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    
    Author(s):
    Arhar Holdt, Špela ; et al.show everyone Arhar Holdt, Špela ; Goli, Teja ; Lavrič, Polona ; Laskowski, Cyprian ; Klemenc, Bojan ; Rozman, Tadeja ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Kosem, Iztok
     This item contains 2 files (10.77 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus ssj500k 2.3
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-07-07)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Gantar, Polona ; Kuzman, Taja ; Čibej, Jaka ; Arhar Holdt, Špela ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 4 files (42.85 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • 1
  • 2
  • 3
  •  
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

Partners

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".