  • CLARIN.SI Repository
 
Advanced search

Filters

Use the filters to narrow the search results.

Limit your search

Author  
    • Ljubešić, Nikola (206)
    • Erjavec, Tomaž (109)
    • Dobrovoljc, Kaja (73)
    • Rupnik, Peter (70)
    • Krek, Simon (68)
    • Kuzman, Taja (68)
    • Arhar Holdt, Špela (66)
    • Čibej, Jaka (56)
    • Toral, Antonio (50)
    • Esplà-Gomis, Miquel (49)
    • Kosem, Iztok (47)
    • Bañón, Marta (44)
    • Forcada, Mikel L. (44)
    • García-Romero, Cristian (44)
    • Pla Sempere, Leopoldo (44)
    • Ramírez-Sánchez, Gema (44)
    • Suchomel, Vít (44)
    • van Noord, Rik (44)
    • Fišer, Darja (43)
    • Terčon, Luka (41)
Keyword  
    • lexicographic resource (101)
    • TEI (72)
    • general dictionary (66)
    • modern dictionary (64)
    • monolingual dictionary (64)
    • web corpus (64)
    • language model (50)
    • manual annotation (47)
    • multilingual (47)
    • part-of-speech tagging (45)
    • parallel corpus (44)
    • lemmatisation (43)
    • computer-mediated communication (38)
    • specialised dictionary (35)
    • historical dictionary (34)
    • bilingual dictionary (29)
    • tokenisation (24)
    • spoken corpus (23)
    • parsing (22)
    • terminology (22)
Rights  
    • PUB (488)
    • ACA (26)
    • RES (1)
Language (ISO)  
    • Slovenian (352)
    • English (123)
    • Croatian (87)
    • Serbian (78)
    • French (37)
    • German (37)
    • Bulgarian (35)
    • Russian (28)
    • Bosnian (26)
    • Spanish (25)
    • Danish (24)
    • Lithuanian (23)
    • Macedonian (23)
    • Hungarian (22)
    • Dutch (21)
    • Estonian (21)
    • Italian (18)
    • Portuguese (18)
    • Latvian (17)
    • Polish (17)
Type  
    • text (553)
    • corpus (300)
    • lexicalConceptualResource (270)
    • toolService (91)
    • audio (18)
    • image (1)
    • languageDescription (1)
    • video (1)
Contains files  
    • yes (515)
    • no (147)

Showing 1–100 of 662 results


  • toolService
    CLARIN.SI data & tools
    toolService
    CroSloEngual BERT 1.1
    (Faculty of Computer and Information Science, University of Ljubljana / 2020-07-09)
    
    Authors:
    Ulčar, Matej and Robnik-Šikonja, Marko
     This entry contains 3 files (476.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • toolService
    CLARIN.SI data & tools
    toolService
    Slovenian RoBERTa contextual embeddings model: SloBERTa 2.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2021-01-17)
    
    Authors:
    Ulčar, Matej and Robnik-Šikonja, Marko
     This entry contains 2 files (1.29 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    MULTEXT-East non-commercial lexicons 4.0
    (Jožef Stefan Institute / 2010-05-14)
    
    Authors:
    Erjavec, Tomaž ; Derzhanski, Ivan ; Divjak, Dagmar ; Feldman, Anna ; Kopotev, Mikhail ; Kotsyba, Natalia ; Krstev, Cvetana ; Petrovski, Aleksandar ; QasemiZadeh, Behrang ; Radziszewski, Adam ; Sharoff, Serge ; Sokolovsky, Paul ; Vitas, Duško ; Zdravkova, Katerina
     This entry contains 6 files (12.05 MB).
     
    Academic Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Morphological lexicon Sloleks 3.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2022-12-05)
    
    Authors:
    Čibej, Jaka ; Gantar, Kaja ; Dobrovoljc, Kaja ; Krek, Simon ; Holozan, Peter ; Erjavec, Tomaž ; Romih, Miro ; Arhar Holdt, Špela ; Krsnik, Luka ; Robnik-Šikonja, Marko
     This entry contains 1 file (239.75 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel corpus EN-SL RSDO4 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-10-28)
    
    Authors:
    Repar, Andraž and Lebar Bajec, Iztok
     This entry contains 1 file (189.06 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of word parts from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Authors:
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This entry contains 1 file (33.41 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of words from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Authors:
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This entry contains 1 file (4.5 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Automatically stress labelled morphological lexicon Sloleks 1.2, version 1.1
    (Faculty of Computer and Information Science, University of Ljubljana; Centre for Language Resources and Technologies, University of Ljubljana / 2018-05-08)
    
    Authors:
    Krsnik, Luka ; Robnik-Šikonja, Marko ; Šef, Tomaž and Krek, Simon
     This entry contains 2 files (55.91 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Inflectional lexicon srLex 1.3
    (Jožef Stefan Institute / 2019-03-31)
    
    Authors:
    Ljubešić, Nikola
     This entry contains 1 file (54.16 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Authors:
    Terčon, Luka ; Čibej, Jaka and Ljubešić, Nikola
     This entry contains 1 file (2.09 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    ASR database ARTUR 1.0 (transcriptions)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; ZRC SAZU / 2023-02-22)
    
    Authors:
    Verdonik, Darinka ; Bizjak, Andreja ; Sepesy Maučec, Mirjam ; Gril, Lucija ; Dobrišek, Simon ; Križaj, Janez ; Strle, Gregor ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Lokovšek, Jure ; Trojar, Mitja ; Erjavec, Tomaž ; Bernjak, Mitja ; Žganec Gros, Jerneja ; Čakš, Peter ; Pucer, Matevž ; Cvetko, Mitja ; Pavlič, Jani ; Zelenik, Marijana ; Ivanovska, Marija ; Grm, Klemen ; Longyka, Jure ; Mihelič, Aleš ; Vesnicer, Boštjan ; Dretnik, Naum
     This entry contains 1 file (48.63 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of character-level n-grams from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Authors:
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This entry contains 1 file (2.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Authors:
    Ljubešić, Nikola ; Terčon, Luka and Čibej, Jaka
     This entry contains 2 files (509.87 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.1
    (Jožef Stefan Institute / 2021-02-02)
    
    Authors:
    Ljubešić, Nikola ; Zdravkova, Katerina ; Stojanoska, Sanja ; Erjavec, Tomaž and Krsnik, Luka
     This entry contains 2 files (146.86 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Serbian 1.1
    (Jožef Stefan Institute / 2020-09-15)
    
    Authors:
    Ljubešić, Nikola and Štefanec, Vanja
     This entry contains 1 file (90.05 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Croatian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Authors:
    Ljubešić, Nikola and Krsnik, Luka
     This entry contains 2 files (178.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Kres corpus n-grams 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-08-03)
    
    Authors:
    Dobrovoljc, Kaja
     This entry contains 3 files (2.34 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of word-level n-grams from the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Authors:
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This entry contains 3 files (287.52 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for lemmatisation of non-standard Croatian 1.1
    (Jožef Stefan Institute / 2020-07-17)
    
    Authors:
    Ljubešić, Nikola and Štefanec, Vanja
     This entry contains 1 file (89.98 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency list of words from the Trendi corpus 2021
    (Jožef Stefan Institute / 2022-10-28)
    
    Authors:
    Čibej, Jaka and Kosem, Iztok
     This entry contains 1 file (25.06 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Serbian 1.2
    (Jožef Stefan Institute / 2021-02-02)
    
    Authors:
    Ljubešić, Nikola and Krsnik, Luka
     This entry contains 2 files (160.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Gos corpus n-grams 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-08-03)
    
    Authors:
    Dobrovoljc, Kaja
     This entry contains 3 files (21.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for JOS dependency parsing of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (176.5 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for semantic role labeling of standard Slovenian 2.0
    (Jožef Stefan Institute / 2023-01-31)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 1 file (58.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Inflectional lexicon hrLex 1.3
    (Jožef Stefan Institute / 2019-03-31)
    
    Authors:
    Ljubešić, Nikola
     This entry contains 1 file (51.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene corpus for general relation extraction SloREL 1.1
    (Faculty of Computer and Information Science, University of Ljubljana / 2022-09-15)
    
    Authors:
    Štravs, Miha ; Knez, Timotej and Žitnik, Slavko
     This entry contains 1 file (38.71 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Frequency lists of word-level n-grams from the Trendi corpus 2021
    (Jožef Stefan Institute / 2022-10-28)
    
    Authors:
    Čibej, Jaka and Kosem, Iztok
     This entry contains 1 file (1.03 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Annotated Corpus of Pre-Standardized Balkan Slavic Literature 1.1
    (Slavic Seminary, University of Zurich / 2021-07-02)
    
    Authors:
    Šimko, Ivan
     This entry contains 5 files (3.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Slovenian 2.1
    (Jožef Stefan Institute / 2023-03-30)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (504.03 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Slovenian 2.1
    (Jožef Stefan Institute / 2023-03-30)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 1 file (2.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.sr 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (3.41 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    Word embeddings CLARIN.SI-embed.mk 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (1.71 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.sl 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola and Erjavec, Tomaž
     This entry contains 2 files (4.22 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Word embeddings CLARIN.SI-embed.hr 2.0
    (Jožef Stefan Institute / 2023-04-11)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (4.16 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene web corpus MaCoCu-sl 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-19)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 2 files (5.57 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian web corpus MaCoCu-mk 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 2 files (1.79 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian web corpus MaCoCu-bg 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 2 files (12.06 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese web corpus MaCoCu-mt 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 2 files (1.07 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese-English parallel corpus MaCoCu-mt-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 3 files (1.06 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 2 files (7.12 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian-English parallel corpus MaCoCu-mk-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 3 files (442.99 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian-English parallel corpus MaCoCu-hr-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 3 files (2.42 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish-English parallel corpus MaCoCu-tr-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 3 files (3.03 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian-English parallel corpus MaCoCu-bg-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 3 files (2.32 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene-English parallel corpus MaCoCu-sl-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 3 files (1.96 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Icelandic-English parallel corpus MaCoCu-is-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This entry contains 3 files (366.72 MB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (179.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (177.02 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (191.81 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 1 file (98.13 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This entry contains 2 files (172.92 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of non-standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This entry contains 2 files (179.88 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This entry contains 1 file (104.93 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 1 file (104.93 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Serbian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka and Ljubešić, Nikola
     This entry contains 2 files (189.8 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    ELMo embeddings models for seven languages
    (Faculty of Computer and Information Science, University of Ljubljana / 2019-11-25)
    
    Authors:
    Ulčar, Matej
     This entry contains 7 files (1.35 GB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of non-standard Croatian 2.1
    (Jožef Stefan Institute / 2023-05-10)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola and Štefanec, Vanja
     This entry contains 1 file (98.12 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    Q-CAT Corpus Annotation Tool 1.5
    (Jožef Stefan Institute / 2023-06-03)
    
    Authors:
    Brank, Janez
     This entry contains 1 file (7.58 MB).
     
    Publicly Available

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    MULTEXT-East free lexicons 4.0
    (Jožef Stefan Institute / 2010-05-14)
    
    Authors:
    Erjavec, Tomaž ; Bruda, Ştefan ; Derzhanski, Ivan ; Dimitrova, Ludmila ; Garabík, Radovan ; Holozan, Peter ; Ide, Nancy ; Kaalep, Heiki-Jaan ; Kotsyba, Natalia ; Oravecz, Csaba ; Petkevič, Vladimír ; Priest-Dorman, Greg ; Shevchenko, Igor ; Simov, Kiril ; Sinapova, Lydia ; Steenwijk, Han ; Tihanyi, Laszlo ; Tufiş, Dan ; Véronis, Jean
     This entry contains 12 files (16.27 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of comma placement Vejica 1.3
    (Amebis, d. o. o., Kamnik / 2018-04-15)
    
    Authors:
    Holozan, Peter
     This entry contains 2 files (3.8 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Macedonian 2.1
    (Jožef Stefan Institute / 2023-06-27)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola ; Zdravkova, Katerina and Erjavec, Tomaž
     This entry contains 1 file (2.19 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Bulgarian 2.1
    (Jožef Stefan Institute; IICT-BAS / 2023-06-27)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola ; Osenova, Petya ; Simov, Kiril and Krsnik, Luka
     This entry contains 2 files (163.09 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for UD dependency parsing of standard Bulgarian 2.1
    (Jožef Stefan Institute; IICT-BAS / 2023-06-27)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This entry contains 2 files (190.67 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for morphosyntactic annotation of standard Macedonian 2.1
    (Jožef Stefan Institute / 2023-06-27)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola ; Zdravkova, Katerina ; Stojanoska, Sanja and Erjavec, Tomaž
     This item contains 2 files (147.17 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The CLASSLA-Stanza model for lemmatisation of standard Bulgarian 2.1
    (Jožef Stefan Institute; IICT-BAS / 2023-06-27)
    
    Authors:
    Terčon, Luka ; Ljubešić, Nikola ; Osenova, Petya and Simov, Kiril
     This item contains 1 file (52.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of textbooks for learning Slovenian as L2 KUUS 2.0
    (Centre for Slovene as a Second and Foreign Language, University of Ljubljana; Centre for Language Resources and Technologies, University of Ljubljana / 2023-10-19)
    
    Authors:
    Klemen, Matej ; Kosem, Iztok ; Arhar Holdt, Špela ; Pollak, Senja ; Huber, Damjan ; Lutar, Mateja
     This item contains 3 files (38.95 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    IMP corpus n-grams 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2018-08-03)
    
    Authors:
    Dobrovoljc, Kaja
     This item contains 3 files (326.65 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Collocations Dictionary of Modern Slovene KSSS 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2023-12-31)
    
    Authors:
    Kosem, Iztok ; Arhar Holdt, Špela ; Krek, Simon ; Gantar, Polona ; Pori, Eva ; Čibej, Jaka ; Klemenc, Bojan ; Laskowski, Cyprian ; Dobrovoljc, Kaja ; Gorjanc, Vojko ; Ljubešić, Nikola ; Zgaga, Karolina ; Roblek, Rebeka
     This item contains 1 file (100.09 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    ASR database ARTUR 1.0 (audio)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; Alpineon d.o.o.; STA / 2023-02-27)
    
    Authors:
    Verdonik, Darinka ; Bizjak, Andreja ; Žgank, Andrej ; Bernjak, Mitja ; Antloga, Špela ; Majhenič, Simona ; Čakš, Peter ; Pucer, Matevž ; Cvetko, Mitja ; Zelenik, Marijana ; Pavlič, Jani ; Dobrišek, Simon ; Križaj, Janez ; Strle, Gregor ; Ivanovska, Marija ; Grm, Klemen ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Lokovšek, Jure ; Longyka, Jure ; Trojar, Mitja ; Žganec Gros, Jerneja ; Mihelič, Aleš ; Vesnicer, Boštjan ; Dretnik, Naum ; Bordon, David
     This item contains 39 files (324.53 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Thesaurus of Modern Slovene 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2023-11-15)
    
    Authors:
    Krek, Simon ; Laskowski, Cyprian ; Robnik-Šikonja, Marko ; Kosem, Iztok ; Arhar Holdt, Špela ; Gantar, Polona ; Čibej, Jaka ; Gorjanc, Vojko ; Klemenc, Bojan ; Dobrovoljc, Kaja ; Pori, Eva ; Roblek, Rebeka ; Zgaga, Karolina
     This item contains 1 file (10.53 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus jos1M 1.2
    (Jožef Stefan Institute / 2019-02-13)
    
    Authors:
    Erjavec, Tomaž ; Krek, Simon and Dobrovoljc, Kaja
     This item contains 4 files (108.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos VideoLectures 4.2 (transcription)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor / 2021-09-23)
    
    Authors:
    Verdonik, Darinka ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Erjavec, Tomaž ; Majhenič, Simona ; Žgank, Andrej
     This item contains 3 files (21.65 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of term-annotated texts RSDO5 1.1
    (ZRC SAZU / 2021-12-07)
    
    Authors:
    Jemec Tomazin, Mateja ; Trojar, Mitja ; Atelšek, Simon ; Fajfar, Tanja ; Erjavec, Tomaž ; Žagar Karer, Mojca
     This item contains 4 files (15.62 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Developmental corpus Šolar 3.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2022-09-05)
    
    Authors:
    Arhar Holdt, Špela ; Rozman, Tadeja ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Pori, Eva ; Goli, Teja ; Lavrič, Polona ; Laskowski, Cyprian ; Kocjančič, Polonca ; Klemenc, Bojan ; Krsnik, Luka ; Kosem, Iztok
     This item contains 4 files (194.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Tag 3.0
    (Jožef Stefan Institute / 2022-12-06)
    
    Authors:
    Lenardič, Jakob ; Čibej, Jaka ; Arhar Holdt, Špela ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 2 files (8.63 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Norm 3.0
    (Jožef Stefan Institute / 2022-12-06)
    
    Authors:
    Lenardič, Jakob ; Čibej, Jaka ; Arhar Holdt, Špela ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (12.16 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of combined Slovenian corpora metaFida 1.0
    (Jožef Stefan Institute / 2023-02-28)
    
    Authors:
    Erjavec, Tomaž
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Serbian Forms of Address 1.1
    (Department of Slavonic Languages and Literatures (Slavisches Seminar), University of Zurich / 2023-05-01)
    
    Authors:
    Lemmenmeier-Batinić, Dolores
     This item contains 3 files (6.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene learner corpus KOST 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2023-10-31)
    
    Authors:
    Stritar Kučuk, Mojca ; Šter, Helena ; Pisek, Staša ; Petric Lasnik, Ivana ; Kete Matičič, Jana ; Pirih Svetina, Nataša ; Preglau, Daniela ; Arhar Holdt, Špela ; Krsnik, Luka ; Erjavec, Tomaž ; Pegan, Jasmina ; Huber, Damjan
     This item contains 2 files (117.4 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Comprehensive Slovenian-Hungarian Dictionary 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-04-04)
    
    Authors:
    Kosem, Iztok ; Bálint Čeh, Júlia ; Ponikvar, Primož ; Zaranšek, Petra ; Kamenšek, Urška ; Koša, Peter ; Gróf, Annamária ; Böröcz, Nándor ; Harmat Császár, Jolanda ; Szíjártó, Imre ; Šantak, Borut ; Gantar, Polona ; Krek, Simon ; Roblek, Rebeka ; Zgaga, Karolina ; Logar, Urban ; Pori, Eva ; Arhar Holdt, Špela ; Gorjanc, Vojko ; Šešet, Jure ; Potoczky, Klára ; Laskowski, Cyprian ; Bombek, Miha ; Dragar, Luka
     This item contains 1 file (3.59 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.1
    (Jožef Stefan Institute / 2021-11-17)
    
    Authors:
    Ljubešić, Nikola ; Fišer, Darja ; Erjavec, Tomaž and Šulc, Ajda
     This item contains 2 files (4.48 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos VideoLectures 4.0 (audio)
    (VideoLectures.NET / 2019-03-26)
    
    Authors:
    VideoLectures.NET
     This item contains 6 files (8.85 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian parliamentary corpus (1990-2022) siParl 4.0
    (Institute of Contemporary History / 2024-06-05)
    
    Authors:
    Pančur, Andrej ; Meden, Katja ; Erjavec, Tomaž ; Ojsteršek, Mihael ; Šorn, Mojca ; Blaj Hribar, Neja
     This item contains 5 files (14.28 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    The Sarajevo Corpus of SMS Messages in Bosnian 1.1
    (University of Sarajevo – Faculty of Philosophy / 2024-07-16)
    
    Authors:
    Wasserscheidt, Philipp ; Bulić, Halid ; Durmišević, Elma ; Hodžić-Čavkić, Azra ; Bajraktarević, Enisa ; Ahmetspahić-Peljto, Azra ; Šabić, Belmin
     This item contains 1 file (1.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Albanian Spoken Corpus in Kosovo 1.0
    (University of Prishtina "Hasan Prishtina" / 2024-07-08)
    
    Authors:
    Wasserscheidt, Philipp ; Rugova, Bardh and Baftiu, Adelajda
     This item contains 1 file (1.76 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus SUK 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-08-22)
    
    Authors:
    Arhar Holdt, Špela ; Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Gantar, Polona ; Čibej, Jaka ; Pori, Eva ; Terčon, Luka ; Munda, Tina ; Žitnik, Slavko ; Robida, Nejc ; Blagus, Neli ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Kuzman, Taja ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 2 files (45.1 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The Trankit model for linguistic processing of standard written Slovenian 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-08-29)
    
    Authors:
    Krsnik, Luka ; Dobrovoljc, Kaja and Terčon, Luka
     This item contains 1 file (143.34 MB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    Corpus extraction tool LIST 1.3
    (Centre for Language Resources and Technologies, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; Jožef Stefan Institute / 2024-08-28)
    
    Authors:
    Krsnik, Luka ; Arhar Holdt, Špela ; Čibej, Jaka ; Dobrovoljc, Kaja ; Ključevšek, Aleksander ; Krek, Simon ; Robnik-Šikonja, Marko
     This item contains 1 file (231.07 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Icelandic web corpus MaCoCu-is 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-05-19)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (2.48 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish web corpus MaCoCu-tr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Authors:
    Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (15.07 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of academic Slovene KAS 2.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Authors:
    Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 4 files (13.71 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Abstracts from the KAS corpus KAS-Abs 2.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Authors:
    Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 1 file (83.48 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of 1968 Slovenian literature Maj68 3.0
    (ZRC SAZU / 2024-10-22)
    
    Authors:
    Juvan, Marko ; Žejn, Andrejka ; Šorli, Mojca ; Mandić, Lucija ; Tomažin, Andrej ; Jež, Andraž ; Balžalorsky Antić, Varja ; Erjavec, Tomaž
     This item contains 6 files (1.33 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian linguistic training corpus hr500k 2.0
    (Jožef Stefan Institute / 2023-04-13)
    
    Authors:
    Ljubešić, Nikola and Samardžić, Tanja
     This item contains 7 files (49.59 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 3.0
    (Jožef Stefan Institute / 2023-04-07)
    
    Authors:
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (8.54 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian linguistic training corpus SETimes.SR 2.0
    (Regional Linguistic Data Initiative Centre ReLDI; Jožef Stefan Institute / 2023-06-13)
    
    Authors:
    Batanović, Vuk ; Ljubešić, Nikola ; Samardžić, Tanja and Erjavec, Tomaž
     This item contains 4 files (9.4 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    List of word relations from the Sloleks 2.0 lexicon 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2024-11-07)
    
    Authors:
    Čibej, Jaka ; Arhar Holdt, Špela and Krek, Simon
     This item contains 1 file (2.84 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • lexicalConceptualResource
    CLARIN.SI data & tools
    lexicalConceptualResource
    Consonant-vowel structures in the GOS 1.0 corpus 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana; Jožef Stefan Institute / 2020-10-28)
    
    Authors:
    Čibej, Jaka ; Arhar Holdt, Špela ; Dobrovoljc, Kaja and Krek, Simon
     This item contains 7 files (3.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • toolService
    CLARIN.SI data & tools
    toolService
    The Trankit model for linguistic processing of written and spoken Slovenian 1.2
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-12-06)
    
    Authors:
    Krsnik, Luka ; Dobrovoljc, Kaja and Terčon, Luka
     This item contains 1 file (145.51 MB).
     
    Publicly Available

  • toolService
    CLARIN.SI data & tools
    toolService
    Trankit model for SST 2.15 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-12-06)
    
    Authors:
    Krsnik, Luka ; Dobrovoljc, Kaja and Terčon, Luka
     This item contains 1 file (138.81 MB).
     
    Publicly Available


Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Inštitut za novejšo zgodovino
  • Institut "Jožef Stefan"
  • Narodna in univerzitetna knjižnica Slovenije
  • Slovensko društvo za jezikovne tehnologije
  • Univerza v Ljubljani
  • Univerza v Mariboru
  • Univerza v Novi Gorici
  • Univerza na Primorskem
  • ZRC SAZU
  • ZRS Koper


The repository uses software that was developed for the LINDAT/CLARIAH-CZ linguistic repository and is available on GitHub.

CLARIN.SI is supported by the Ministry of Education, Science and Sport
within the "Research Infrastructures" programme.