• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Type : corpus     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Ljubešić, Nikola (138)
    • Erjavec, Tomaž (91)
    • Rupnik, Peter (70)
    • Kuzman, Taja (65)
    • Toral, Antonio (50)
    • Esplà-Gomis, Miquel (49)
    • Bañón, Marta (44)
    • Forcada, Mikel L. (44)
    • García-Romero, Cristian (44)
    • Pla Sempere, Leopoldo (44)
    • Ramírez-Sánchez, Gema (44)
    • Suchomel, Vít (44)
    • van Noord, Rik (44)
    • Fišer, Darja (37)
    • Chichirau, Malina (28)
    • Galiano-Jiménez, Aarón (28)
    • Zaragoza-Bernabeu, Jaume (28)
    • Arhar Holdt, Špela (25)
    • Krek, Simon (22)
    • Batanović, Vuk (18)
    • ... View More
Subject  
    • TEI (67)
    • web corpus (64)
    • parallel corpus (44)
    • manual annotation (41)
    • multilingual (39)
    • computer-mediated communication (27)
    • named entities (21)
    • parliamentary debates (21)
    • news corpus (20)
    • part-of-speech tagging (20)
    • tokenisation (18)
    • lemmatisation (16)
    • spoken corpus (16)
    • Parla-CLARIN (14)
    • word normalisation (14)
    • news comments (13)
    • Slovenian Parliament (13)
    • specialised corpus (12)
    • speech database (12)
    • speech transcription (12)
    • ... View More
Rights  
    • PUB (262)
    • ACA (24)
    • RES (1)
Language (ISO)  
    • Slovenian (165)
    • English (75)
    • Croatian (52)
    • Serbian (45)
    • Bosnian (22)
    • Bulgarian (20)
    • Spanish (16)
    • Estonian (15)
    • Russian (14)
    • French (13)
    • Hungarian (13)
    • Macedonian (13)
    • Dutch (12)
    • German (12)
    • Italian (12)
    • Czech (11)
    • Danish (11)
    • Icelandic (11)
    • Latvian (11)
    • Montenegrin (11)
    • ... View More
Type  
    • text (284)
    • audio (15)
    • video (1)
Contain Files  
    • yes (287)
    • no (13)

Showing 1 through 80 out of 300 results

  • 1
  • 2
  • 3
  •  
  • 4
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    • 10
    • 20
    • 40
    • 60
    •  80
    • 100

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel corpus EN-SL RSDO4 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-10-28)
    
    Author(s):
    Repar, Andraž and Lebar Bajec, Iztok
     This item contains 1 file (189.06 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    ASR database ARTUR 1.0 (transcriptions)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; ZRC SAZU / 2023-02-22)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Bizjak, Andreja ; Sepesy Maučec, Mirjam ; Gril, Lucija ; Dobrišek, Simon ; Križaj, Janez ; Strle, Gregor ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Lokovšek, Jure ; Trojar, Mitja ; Erjavec, Tomaž ; Bernjak, Mitja ; Žganec Gros, Jerneja ; Čakš, Peter ; Pucer, Matevž ; Cvetko, Mitja ; Pavlič, Jani ; Zelenik, Marijana ; Ivanovska, Marija ; Grm, Klemen ; Longyka, Jure ; Mihelič, Aleš ; Vesnicer, Boštjan ; Dretnik, Naum
     This item contains 1 file (48.63 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene corpus for general relation extraction SloREL 1.1
    (Faculty of Computer and Information Science, University of Ljubljana / 2022-09-15)
    
    Author(s):
    Štravs, Miha ; Knez, Timotej and Žitnik, Slavko
     This item contains 1 file (38.71 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Annotated Corpus of Pre-Standardized Balkan Slavic Literature 1.1
    (Slavic Seminary, University of Zurich / 2021-07-02)
    
    Author(s):
    Šimko, Ivan
     This item contains 5 files (3.58 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene web corpus MaCoCu-sl 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-19)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (5.57 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian web corpus MaCoCu-mk 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (1.79 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian web corpus MaCoCu-bg 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (12.06 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese web corpus MaCoCu-mt 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (1.07 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Maltese-English parallel corpus MaCoCu-mt-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (1.06 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian web corpus MaCoCu-hr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (7.12 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Macedonian-English parallel corpus MaCoCu-mk-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (442.99 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian-English parallel corpus MaCoCu-hr-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (2.42 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish-English parallel corpus MaCoCu-tr-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (3.03 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Bulgarian-English parallel corpus MaCoCu-bg-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (2.32 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene-English parallel corpus MaCoCu-sl-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (1.96 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Icelandic-English parallel corpus MaCoCu-is-en 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-26)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (366.72 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of comma placement Vejica 1.3
    (Amebis, d. o. o., Kamnik / 2018-04-15)
    
    Author(s):
    Holozan, Peter
     This item contains 2 files (3.8 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of textbooks for learning Slovenian as L2 KUUS 2.0
    (Centre for Slovene as a Second and Foreign Language, University of Ljubljana; Centre for Language Resources and Technologies, University of Ljubljana / 2023-10-19)
    
    Author(s):
    Klemen, Matej ; et al.show everyone Klemen, Matej ; Kosem, Iztok ; Arhar Holdt, Špela ; Pollak, Senja ; Huber, Damjan ; Lutar, Mateja
     This item contains 3 files (38.95 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    ASR database ARTUR 1.0 (audio)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; Alpineon d.o.o.; STA / 2023-02-27)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Bizjak, Andreja ; Žgank, Andrej ; Bernjak, Mitja ; Antloga, Špela ; Majhenič, Simona ; Čakš, Peter ; Pucer, Matevž ; Cvetko, Mitja ; Zelenik, Marijana ; Pavlič, Jani ; Dobrišek, Simon ; Križaj, Janez ; Strle, Gregor ; Ivanovska, Marija ; Grm, Klemen ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Lokovšek, Jure ; Longyka, Jure ; Trojar, Mitja ; Žganec Gros, Jerneja ; Mihelič, Aleš ; Vesnicer, Boštjan ; Dretnik, Naum ; Bordon, David
     This item contains 39 files (324.53 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus jos1M 1.2
    (Jožef Stefan Institute / 2019-02-13)
    
    Author(s):
    Erjavec, Tomaž ; Krek, Simon and Dobrovoljc, Kaja
     This item contains 4 files (108.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos VideoLectures 4.2 (transcription)
    (Faculty of Electrical Engineering and Computer Science, University of Maribor / 2021-09-23)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Erjavec, Tomaž ; Majhenič, Simona ; Žgank, Andrej
     This item contains 3 files (21.65 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of term-annotated texts RSDO5 1.1
    (ZRC SAZU / 2021-12-07)
    
    Author(s):
    Jemec Tomazin, Mateja ; et al.show everyone Jemec Tomazin, Mateja ; Trojar, Mitja ; Atelšek, Simon ; Fajfar, Tanja ; Erjavec, Tomaž ; Žagar Karer, Mojca
     This item contains 4 files (15.62 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Developmental corpus Šolar 3.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2022-09-05)
    
    Author(s):
    Arhar Holdt, Špela ; et al.show everyone Arhar Holdt, Špela ; Rozman, Tadeja ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Pori, Eva ; Goli, Teja ; Lavrič, Polona ; Laskowski, Cyprian ; Kocjančič, Polonca ; Klemenc, Bojan ; Krsnik, Luka ; Kosem, Iztok
     This item contains 4 files (194.6 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Tag 3.0
    (Jožef Stefan Institute / 2022-12-06)
    
    Author(s):
    Lenardič, Jakob ; et al.show everyone Lenardič, Jakob ; Čibej, Jaka ; Arhar Holdt, Špela ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 2 files (8.63 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Norm 3.0
    (Jožef Stefan Institute / 2022-12-06)
    
    Author(s):
    Lenardič, Jakob ; Čibej, Jaka ; Arhar Holdt, Špela ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (12.16 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of combined Slovenian corpora metaFida 1.0
    (Jožef Stefan Institute / 2023-02-28)
    
    Author(s):
    Erjavec, Tomaž
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of Serbian Forms of Address 1.1
    (Department of Slavonic Languages and Literatures (Slavisches Seminar), University of Zurich / 2023-05-01)
    
    Author(s):
    Lemmenmeier-Batinić, Dolores
     This item contains 3 files (6.43 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovene learner corpus KOST 2.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2023-10-31)
    
    Author(s):
    Stritar Kučuk, Mojca ; et al.show everyone Stritar Kučuk, Mojca ; Šter, Helena ; Pisek, Staša ; Petric Lasnik, Ivana ; Kete Matičič, Jana ; Pirih Svetina, Nataša ; Preglau, Daniela ; Arhar Holdt, Špela ; Krsnik, Luka ; Erjavec, Tomaž ; Pegan, Jasmina ; Huber, Damjan
     This item contains 2 files (117.4 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.1
    (Jožef Stefan Institute / 2021-11-17)
    
    Author(s):
    Ljubešić, Nikola ; Fišer, Darja ; Erjavec, Tomaž and Šulc, Ajda
     This item contains 2 files (4.48 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos VideoLectures 4.0 (audio)
    (VideoLectures.NET / 2019-03-26)
    
    Author(s):
    VideoLectures.NET
     This item contains 6 files (8.85 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian parliamentary corpus (1990-2022) siParl 4.0
    (Institute of Contemporary History / 2024-06-05)
    
    Author(s):
    Pančur, Andrej ; et al.show everyone Pančur, Andrej ; Meden, Katja ; Erjavec, Tomaž ; Ojsteršek, Mihael ; Šorn, Mojca ; Blaj Hribar, Neja
     This item contains 5 files (14.28 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    The Sarajevo Corpus of SMS Messages in Bosnian 1.1
    (University of Sarajevo – Faculty of Philosophy / 2024-07-16)
    
    Author(s):
    Wasserscheidt, Philipp ; et al.show everyone Wasserscheidt, Philipp ; Bulić, Halid ; Durmišević, Elma ; Hodžić-Čavkić, Azra ; Bajraktarević, Enisa ; Ahmetspahić-Peljto, Azra ; Šabić, Belmin
     This item contains 1 file (1.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Albanian Spoken Corpus in Kosovo 1.0
    (University of Prishtina "Hasan Prishtina" / 2024-07-08)
    
    Author(s):
    Wasserscheidt, Philipp ; Rugova, Bardh and Baftiu, Adelajda
     This item contains 1 file (1.76 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus SUK 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2024-08-22)
    
    Author(s):
    Arhar Holdt, Špela ; et al.show everyone Arhar Holdt, Špela ; Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Gantar, Polona ; Čibej, Jaka ; Pori, Eva ; Terčon, Luka ; Munda, Tina ; Žitnik, Slavko ; Robida, Nejc ; Blagus, Neli ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Kuzman, Taja ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 2 files (45.1 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Icelandic web corpus MaCoCu-is 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-05-19)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (2.48 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Turkish web corpus MaCoCu-tr 2.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-04-20)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (15.07 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of academic Slovene KAS 2.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Author(s):
    Žagar, Aleš ; et al.show everyone Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 4 files (13.71 GB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Abstracts from the KAS corpus KAS-Abs 2.0
    (Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Computer and Information Science, University of Ljubljana / 2022-02-04)
    
    Author(s):
    Žagar, Aleš ; et al.show everyone Žagar, Aleš ; Kavaš, Matic ; Robnik-Šikonja, Marko ; Erjavec, Tomaž ; Fišer, Darja ; Ljubešić, Nikola ; Ferme, Marko ; Borovič, Mladen ; Boškovič, Borko ; Ojsteršek, Milan ; Hrovat, Goran
     This item contains 1 file (83.48 MB).
     
    Academic Use Inform Before Use Attribution Required Noncommercial

  • corpus
    CLARIN.SI data & tools
    corpus
    Corpus of 1968 Slovenian literature Maj68 3.0
    (ZRC SAZU / 2024-10-22)
    
    Author(s):
    Juvan, Marko ; et al.show everyone Juvan, Marko ; Žejn, Andrejka ; Šorli, Mojca ; Mandić, Lucija ; Tomažin, Andrej ; Jež, Andraž ; Balžalorsky Antić, Varja ; Erjavec, Tomaž
     This item contains 6 files (1.33 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian linguistic training corpus hr500k 2.0
    (Jožef Stefan Institute / 2023-04-13)
    
    Author(s):
    Ljubešić, Nikola and Samardžić, Tanja
     This item contains 7 files (49.59 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 3.0
    (Jožef Stefan Institute / 2023-04-07)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (8.54 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian linguistic training corpus SETimes.SR 2.0
    (Regional Linguistic Data Initiative Centre ReLDI; Jožef Stefan Institute / 2023-06-13)
    
    Author(s):
    Batanović, Vuk ; Ljubešić, Nikola ; Samardžić, Tanja and Erjavec, Tomaž
     This item contains 4 files (9.4 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (65.97 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Multilingual comparable corpora of parliamentary debates ParlaMint 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Libano, Ruben ; Depoorter, Griet ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 30 files (5.87 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 4.1
    (CLARIN ERIC / 2024-06-03)
    
    Author(s):
    Kuzman, Taja ; et al.show everyone Kuzman, Taja ; Ljubešić, Nikola ; Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Rayson, Paul ; Vidler, John ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Regueira, Xosé Luís ; Rii, Andriana ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (53.36 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos 2.1 (transcriptions)
    (Centre for Language Resources and Technologies, University of Ljubljana; Faculty of Electrical Engineering and Computer Science, University of Maribor; Faculty of Electrical Engineering, University of Ljubljana; Faculty of Computer and Information Science, University of Ljubljana; ZRC SAZU; Jožef Stefan Institute / 2023-08-28)
    
    Author(s):
    Verdonik, Darinka ; et al.show everyone Verdonik, Darinka ; Zwitter Vitez, Ana ; Zemljarič Miklavčič, Jana ; Krek, Simon ; Stabej, Marko ; Erjavec, Tomaž ; Potočnik, Tomaž ; Sepesy Maučec, Mirjam ; Majhenič, Simona ; Žgank, Andrej ; Bizjak, Andreja ; Gril, Lucija ; Dobrišek, Simon ; Križaj, Janez ; Bajec, Marko ; Lebar Bajec, Iztok ; Jelovšek, Tjaša ; Trojar, Mitja ; Bernjak, Mitja ; Dretnik, Naum ; Strle, Gregor ; Dobrovoljc, Kaja ; Ljubešić, Nikola ; Rupnik, Peter
     This item contains 4 files (117.83 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Parliamentary spoken corpus of Croatian ParlaSpeech-HR 2.0
    (Jožef Stefan Institute / 2024-01-25)
    
    Author(s):
    Ljubešić, Nikola ; Koržinek, Danijel and Rupnik, Peter
     This item contains 8 files (207.33 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Collection of Slovenian paremiological units Pregovori 1.1
    (ZRC SAZU; Jožef Stefan Institute / 2023-09-30)
    
    Author(s):
    Babič, Saša ; et al.show everyone Babič, Saša ; Miha, Peče ; Erjavec, Tomaž ; Ivančič Kutin, Barbara ; Šrimpf Vendramin, Katarina ; Kropej Telban, Monika ; Jakop, Nataša ; Stanonik, Marija
     This item contains 3 files (22.19 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel sense-annotated corpus ELEXIS-WSD 1.3
    (Jožef Stefan Institute / 2025-05-06)
    
    Author(s):
    Čibej, Jaka ; et al.show everyone Čibej, Jaka ; Krek, Simon ; Tiberius, Carole ; Martelli, Federico ; Navigli, Roberto ; Kallas, Jelena ; Gantar, Polona ; Koeva, Svetla ; Nimb, Sanni ; Sandford Pedersen, Bolette ; Olsen, Sussi ; Langemets, Margit ; Koppel, Kristina ; Üksik, Tiiu ; Dobrovoljc, Kaja ; Ureña-Ruiz, Rafael ; Sancho-Sánchez, José-Luis ; Lipp, Veronika ; Váradi, Tamás ; Győrffy, András ; Simon, László ; Quochi, Valeria ; Monachini, Monica ; Frontini, Francesca ; Tempelaars, Rob ; Costa, Rute ; Salgado, Ana ; Munda, Tina ; Kosem, Iztok ; Roblek, Rebeka ; Kamenšek, Urška ; Zaranšek, Petra ; Zgaga, Karolina ; Ponikvar, Primož ; Terčon, Luka ; Jensen, Jonas ; Flörke, Ida ; Lorentzen, Henrik ; Troelsgård, Thomas ; Blagoeva, Diana ; Hristov, Dimitar ; Kolkovska, Sia
     This item contains 1 file (11.08 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian Twitter training corpus ReLDI-NormTagNER-sr 3.0
    (Jožef Stefan Institute / 2023-04-07)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (8.81 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Monitor corpus of Slovene Trendi 2025-05
    (Jožef Stefan Institute; Centre for Language Resources and Technologies, University of Ljubljana / 2025-06-05)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Čibej, Jaka ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Ljubešić, Nikola ; Ponikvar, Primož ; Šinkec, Mihael ; Krek, Simon
     This item contains no files.

  • corpus
    CLARIN.SI data & tools
    corpus
    Developmental corpus Šolar 2.0
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Arhar Holdt, Špela ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Pori, Eva ; Goli, Teja ; Lavrič, Polona ; Laskowski, Cyprian ; Kocjančič, Polonca ; Klemenc, Bojan ; Rozman, Tadeja
     This item contains 2 files (21.69 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Developmental corpus (without language corrections) Šolar 2.0 Clear
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    
    Author(s):
    Kosem, Iztok ; et al.show everyone Kosem, Iztok ; Arhar Holdt, Špela ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Kocjančič, Polonca ; Laskowski, Cyprian ; Klemenc, Bojan ; Pori, Eva ; Rozman, Tadeja
     This item contains 2 files (29.22 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Error-annotated developmental corpus Šolar 2.0 Error
    (Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana / 2019-07-08)
    
    Author(s):
    Arhar Holdt, Špela ; et al.show everyone Arhar Holdt, Špela ; Goli, Teja ; Lavrič, Polona ; Laskowski, Cyprian ; Klemenc, Bojan ; Rozman, Tadeja ; Stritar Kučuk, Mojca ; Krek, Simon ; Krapš Vodopivec, Irena ; Stabej, Marko ; Kosem, Iztok
     This item contains 2 files (10.77 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian parliamentary corpus (1990-1992) SlovParl 2.0
    (Institute of Contemporary History / 2017-11-24)
    
    Author(s):
    Pančur, Andrej ; Šorn, Mojca and Erjavec, Tomaž
     This item contains 3 files (169.71 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (2.17 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Norm 1.2
    (Jožef Stefan Institute / 2016-12-30)
    
    Author(s):
    Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka and Arhar Holdt, Špela
     This item contains 4 files (4.01 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    CMC training corpus Janes-Tag 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Fišer, Darja ; Čibej, Jaka ; Arhar Holdt, Špela ; Ljubešić, Nikola ; Zupan, Katja ; Dobrovoljc, Kaja
     This item contains 7 files (5.68 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
    (CLARIN ERIC / 2021-06-18)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešić, Nikola ; Simov, Kiril ; Grigorova, Vladislava ; Rudolf, Michał ; Pančur, Andrej ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; van der Pol, Henk ; Depoorter, Griet ; de Does, Jesse ; Jongejan, Bart ; Haltrup Hansen, Dorte ; Navarretta, Costanza ; Calzada Pérez, María ; de Macedo, Luciana D. ; van Heusden, Ruben ; Marx, Maarten ; Çöltekin, Çağrı ; Coole, Matthew ; Agnoloni, Tommaso ; Frontini, Francesca ; Montemagni, Simonetta ; Quochi, Valeria ; Venturi, Giulia ; Ruisi, Manuela ; Marchetti, Carlo ; Battistoni, Roberto ; Sebők, Miklós ; Ring, Orsolya ; Darģis, Roberts ; Utka, Andrius ; Petkevičius, Mindaugas ; Briedienė, Monika ; Krilavičius, Tomas ; Morkevičius, Vaidas ; Bartolini, Roberto ; Cimino, Andrea ; Diwersy, Sascha ; Luxardo, Giancarlo ; Rayson, Paul
     This item contains 18 files (23.37 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus ssj500k 2.3
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-07-07)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Gantar, Polona ; Kuzman, Taja ; Čibej, Jaka ; Arhar Holdt, Špela ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 4 files (42.85 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.0
    (CLARIN ERIC / 2023-10-24)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (61.05 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Multilingual comparable corpora of parliamentary debates ParlaMint 4.0
    (CLARIN ERIC / 2023-10-24)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Libano, Ruben ; Depoorter, Griet ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Ljubešić, Nikola ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Rayson, Paul ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 30 files (5.67 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 4.0
    (CLARIN ERIC / 2023-11-14)
    
    Author(s):
    Kuzman, Taja ; et al.show everyone Kuzman, Taja ; Ljubešić, Nikola ; Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Rayson, Paul ; Vidler, John ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Roberts ; de Does, Jesse ; de Libano, Ruben ; Depoorter, Griet ; Depuydt, Katrien ; Diwersy, Sascha ; Dodé, Réka ; Fernandez, Kike ; Fernández Rei, Elisa ; Frontini, Francesca ; Garcia, Marcos ; García Díaz, Noelia ; García Louzao, Pedro ; Gavriilidou, Maria ; Gkoumas, Dimitris ; Grigorov, Ilko ; Grigorova, Vladislava ; Haltrup Hansen, Dorte ; Iruskieta, Mikel ; Jarlbrink, Johan ; Jelencsik-Mátyus, Kinga ; Jongejan, Bart ; Kahusk, Neeme ; Kirnbauer, Martin ; Kryvenko, Anna ; Ligeti-Nagy, Noémi ; Luxardo, Giancarlo ; Magariños, Carmen ; Magnusson, Måns ; Marchetti, Carlo ; Marx, Maarten ; Meden, Katja ; Mendes, Amália ; Mochtak, Michal ; Mölder, Martin ; Montemagni, Simonetta ; Navarretta, Costanza ; Nitoń, Bartłomiej ; Norén, Fredrik Mohammadi ; Nwadukwe, Amanda ; Ojsteršek, Mihael ; Pančur, Andrej ; Papavassiliou, Vassilis ; Pereira, Rui ; Pérez Lago, María ; Piperidis, Stelios ; Pirker, Hannes ; Pisani, Marilina ; Pol, Henk van der ; Prokopidis, Prokopis ; Quochi, Valeria ; Regueira, Xosé Luís ; Rudolf, Michał ; Ruisi, Manuela ; Rupnik, Peter ; Schopper, Daniel ; Simov, Kiril ; Sinikallio, Laura ; Skubic, Jure ; Tamper, Minna ; Tungland, Lars Magne ; Tuominen, Jouni ; van Heusden, Ruben ; Varga, Zsófia ; Vázquez Abuín, Marta ; Venturi, Giulia ; Vidal Miguéns, Adrián ; Vider, Kadri ; Vivel Couso, Ainhoa ; Vladu, Adina Ioana ; Wissik, Tanja ; Yrjänäinen, Väinö ; Zevallos, Rodolfo ; Fišer, Darja
     This item contains 31 files (67 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian parliamentary corpus (1990-2022) siParl 3.0
    (Institute of Contemporary History / 2022-12-06)
    
    Author(s):
    Pančur, Andrej ; et al.show everyone Pančur, Andrej ; Erjavec, Tomaž ; Meden, Katja ; Ojsteršek, Mihael ; Šorn, Mojca ; Blaj Hribar, Neja
     This item contains 2 files (5.63 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus SUK 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2022-12-05)
    
    Author(s):
    Arhar Holdt, Špela ; et al.show everyone Arhar Holdt, Špela ; Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Gantar, Polona ; Čibej, Jaka ; Pori, Eva ; Terčon, Luka ; Munda, Tina ; Žitnik, Slavko ; Robida, Nejc ; Blagus, Neli ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Kuzman, Taja ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 2 files (43.14 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Serbian Twitter training corpus ReLDI-NormTagNER-sr 2.1
    (Jožef Stefan Institute / 2019-07-28)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.51 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.1
    (Jožef Stefan Institute / 2019-09-11)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž ; Batanović, Vuk ; Miličević, Maja and Samardžić, Tanja
     This item contains 4 files (4.56 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Spoken corpus Gos 1.1
    (Centre for Language Resources and Technologies, University of Ljubljana / 2021-09-23)
    
    Author(s):
    Zwitter Vitez, Ana ; Zemljarič Miklavčič, Jana ; Krek, Simon ; Stabej, Marko and Erjavec, Tomaž
     This item contains 2 files (22.1 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Training corpus ssj500k 2.2
    (Centre for Language Resources and Technologies, University of Ljubljana / 2019-01-26)
    
    Author(s):
    Krek, Simon ; et al.show everyone Krek, Simon ; Dobrovoljc, Kaja ; Erjavec, Tomaž ; Može, Sara ; Ledinek, Nina ; Holz, Nanika ; Zupan, Katja ; Gantar, Polona ; Kuzman, Taja ; Čibej, Jaka ; Arhar Holdt, Špela ; Kavčič, Teja ; Škrjanec, Iza ; Marko, Dafne ; Jezeršek, Lucija ; Zajc, Anja
     This item contains 4 files (40.95 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel sense-annotated corpus ELEXIS-WSD 1.1
    (Jožef Stefan Institute / 2023-05-22)
    
    Author(s):
    Martelli, Federico ; et al.show everyone Martelli, Federico ; Navigli, Roberto ; Krek, Simon ; Kallas, Jelena ; Gantar, Polona ; Koeva, Svetla ; Nimb, Sanni ; Sandford Pedersen, Bolette ; Olsen, Sussi ; Langemets, Margit ; Koppel, Kristina ; Üksik, Tiiu ; Dobrovoljc, Kaja ; Ureña-Ruiz, Rafael ; Sancho-Sánchez, José-Luis ; Lipp, Veronika ; Váradi, Tamás ; Győrffy, András ; Simon, László ; Quochi, Valeria ; Monachini, Monica ; Frontini, Francesca ; Tiberius, Carole ; Tempelaars, Rob ; Costa, Rute ; Salgado, Ana ; Čibej, Jaka ; Munda, Tina ; Kosem, Iztok ; Roblek, Rebeka ; Kamenšek, Urška ; Zaranšek, Petra ; Zgaga, Karolina ; Ponikvar, Primož ; Terčon, Luka ; Jensen, Jonas ; Flörke, Ida ; Lorentzen, Henrik ; Troelsgård, Thomas ; Blagoeva, Diana ; Hristov, Dimitar ; Kolkovska, Sia
     This item contains 1 file (9.28 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Parallel sense-annotated corpus ELEXIS-WSD 1.2
    (Jožef Stefan Institute / 2025-04-04)
    
    Author(s):
    Čibej, Jaka ; et al.show everyone Čibej, Jaka ; Krek, Simon ; Tiberius, Carole ; Martelli, Federico ; Navigli, Roberto ; Kallas, Jelena ; Gantar, Polona ; Koeva, Svetla ; Nimb, Sanni ; Sandford Pedersen, Bolette ; Olsen, Sussi ; Langemets, Margit ; Koppel, Kristina ; Üksik, Tiiu ; Dobrovoljc, Kaja ; Ureña-Ruiz, Rafael ; Sancho-Sánchez, José-Luis ; Lipp, Veronika ; Váradi, Tamás ; Győrffy, András ; Simon, László ; Quochi, Valeria ; Monachini, Monica ; Frontini, Francesca ; Tempelaars, Rob ; Costa, Rute ; Salgado, Ana ; Munda, Tina ; Kosem, Iztok ; Roblek, Rebeka ; Kamenšek, Urška ; Zaranšek, Petra ; Zgaga, Karolina ; Ponikvar, Primož ; Terčon, Luka ; Jensen, Jonas ; Flörke, Ida ; Lorentzen, Henrik ; Troelsgård, Thomas ; Blagoeva, Diana ; Hristov, Dimitar ; Kolkovska, Sia
     This item contains 1 file (11.08 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    MULTEXT-East "1984" document corpus 4.0
    (Jožef Stefan Institute / 2010-05-14)
    
    Author(s):
    Erjavec, Tomaž ; et al.show everyone Erjavec, Tomaž ; Bruda, Ştefan ; Dimitrova, Ludmila ; Ide, Nancy ; Kaalep, Heiki-Jaan ; Krstev, Cvetana ; Orav, Heili ; Oravecz, Csaba ; Paldre, Leho ; Petkevič, Vladimír ; Priest-Dorman, Greg ; Simov, Kiril ; Sinapova, Lydia ; Sokolovsky, Paul ; Sryvkin, Sergey ; Tufiş, Dan ; Utka, Andrius ; Villandi, Viire ; Vitas, Duško ; Vuković, Olga
     This item contains 1 file (4.62 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Post-edited and error annotated machine translation corpus PErr 1.0
    (Insight Centre for Data Analytics, National University of Ireland, Galway / 2016-05-24)
    
    Author(s):
    Popović, Maja and Arčan, Mihael
     This item contains 1 file (364.69 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Sentiment Annotated Dataset of Croatian News
    (Jožef Stefan Institute / 2020-09-15)
    
    Author(s):
    Pelicon, Andraž ; Pranjić, Marko ; Miljković, Dragana ; Škrlj, Blaž and Pollak, Senja
     This item contains 1 file (85.6 KB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Dataset of Slovene idiomatic expressions SloIE
    (Faculty of Computer and Information Science, University of Ljubljana / 2020-07-27)
    
    Author(s):
    Škvorc, Tadej ; Gantar, Polona and Robnik-Šikonja, Marko
     This item contains 1 file (4.22 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Ekspress news article archive (in Estonian and Russian) 1.0
    (Ekspress Meedia Group / 2021-04-19)
    
    Author(s):
    Purver, Matthew ; et al.show everyone Purver, Matthew ; Pollak, Senja ; Freienthal, Linda ; Kuulmets, Hele-Andra ; Krustok, Ivar ; Shekhar, Ravi
     This item contains 6 files (2.32 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Latvian Delfi article archive (in Latvian and Russian) 1.0
    (Ekspress Meedia Group / 2021-04-19)
    
    Author(s):
    Pollak, Senja ; et al.show everyone Pollak, Senja ; Purver, Matthew ; Shekhar, Ravi ; Freienthal, Linda ; Kuulmets, Hele-Andra ; Krustok, Ivar
     This item contains 3 files (395.39 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Latvian user comment dataset 1.0
    (Ekspress Meedia Group / 2021-04-19)
    
    Author(s):
    Shekhar, Ravi ; Purver, Matthew ; Pollak, Senja ; Pelicon, Andraž and Krustok, Ivar
     This item contains 7 files (3.67 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    EMBEDDIA tools output example corpus of Estonian, Croatian and Latvian news articles 1.0
    (Ekspress Meedia Group; Styria Media Group / 2022-02-10)
    
    Author(s):
    Freienthal, Linda ; et al.show everyone Freienthal, Linda ; Pelicon, Andraž ; Martinc, Matej ; Škrlj, Blaž ; Krustok, Ivar ; Pranjić, Marko ; Cabrera-Diego, Luis Adrián ; Purver, Matthew ; Pollak, Senja ; Kuulmets, Hele-Andra ; Shekhar, Ravi ; Koloski, Boshko
     This item contains 1 file (434.28 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial No Derivative Works

  • corpus
    CLARIN.SI data & tools
    corpus
    Slovenian keyword extraction dataset from SentiNews 1.0
    (Jožef Stefan Institute / 2022-03-28)
    
    Author(s):
    Koloski, Boshko ; Martinc, Matej ; Tavchioski, Ilija ; Škrlj, Blaž and Pollak, Senja
     This item contains 2 files (6.05 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • 1
  • 2
  • 3
  •  
  • 4
  •    
    • Sort items by
    •  Relevance
    • Title Asc
    • Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    • 10
    • 20
    • 40
    • 60
    •  80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

Partners

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".