• Repository
  • About
  • Contact
  • CLARIN
  •  Login
  • English Slovenščina
  • CLARIN.SI repository
  • Search
  • CLARIN logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 

 
Selected Filters
 Type : corpus     Clear All
Advanced Search

Filters

Use filters to refine the search results.

Current Filters:
New Filters:

Limit your search

Author  
    • Ljubešić, Nikola (138)
    • Erjavec, Tomaž (91)
    • Rupnik, Peter (70)
    • Kuzman, Taja (65)
    • Toral, Antonio (50)
    • Esplà-Gomis, Miquel (49)
    • Bañón, Marta (44)
    • Forcada, Mikel L. (44)
    • García-Romero, Cristian (44)
    • Pla Sempere, Leopoldo (44)
    • Ramírez-Sánchez, Gema (44)
    • Suchomel, Vít (44)
    • van Noord, Rik (44)
    • Fišer, Darja (37)
    • Chichirau, Malina (28)
    • Galiano-Jiménez, Aarón (28)
    • Zaragoza-Bernabeu, Jaume (28)
    • Arhar Holdt, Špela (25)
    • Krek, Simon (22)
    • Batanović, Vuk (18)
    • ... View More
Subject  
    • TEI (67)
    • web corpus (64)
    • parallel corpus (44)
    • manual annotation (41)
    • multilingual (39)
    • computer-mediated communication (27)
    • named entities (21)
    • parliamentary debates (21)
    • news corpus (20)
    • part-of-speech tagging (20)
    • tokenisation (18)
    • lemmatisation (16)
    • spoken corpus (16)
    • Parla-CLARIN (14)
    • word normalisation (14)
    • news comments (13)
    • Slovenian Parliament (13)
    • specialised corpus (12)
    • speech database (12)
    • speech transcription (12)
    • ... View More
Rights  
    • PUB (262)
    • ACA (24)
    • RES (1)
Language (ISO)  
    • Slovenian (165)
    • English (75)
    • Croatian (52)
    • Serbian (45)
    • Bosnian (22)
    • Bulgarian (20)
    • Spanish (16)
    • Estonian (15)
    • Russian (14)
    • French (13)
    • Hungarian (13)
    • Macedonian (13)
    • Dutch (12)
    • German (12)
    • Italian (12)
    • Czech (11)
    • Danish (11)
    • Icelandic (11)
    • Latvian (11)
    • Montenegrin (11)
    • ... View More
Type  
    • text (284)
    • audio (15)
    • video (1)
Contain Files  
    • yes (287)
    • no (13)

Showing 1 through 10 out of 300 results

  • 1
  • 2
  • 3
  •  
  • 30
  •    
    • Sort items by
    • Relevance
    • Title Asc
    •  Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100

  • corpus
    CLARIN.SI data & tools
    corpus
    ŠUSS archive of questions and answers about the Slovenian language (1998-2010)
    (CLARIN.SI / 2019-09-15)
    
    Author(s):
    Marušič, Franc Lanko ; et al.show everyone Marušič, Franc Lanko ; Marvin, Tatjana ; Potrato, Tina ; Saksida, Amanda ; Tomažin, Petra ; Verovnik, Tina ; Žaucer, Rok ; Železnikar, Jaka ; Benčina, Barbara ; Vekjet, Ivana ; Jejčič, Irena ; Mišmaš, Petra ; Marc, Neva ; Leban, Ivana ; Kobal, Elena ; Halilović, Amra ; Krošelj, Sara ; Gaši, Elbasana ; Papler, Urša ; Koglot, Marina ; Žnidarčič, Mateja ; Bajc, Sara ; Brus, Karmen ; Adamlje, Sara ; Šušanj, Špela ; Vodopivec, Ana ; Erjavec, Tomaž
     This item contains 2 files (5.46 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    xLiMe Twitter Corpus XTC 1.0.1
    (Jožef Stefan Institute / 2016-11-28)
    
    Author(s):
    Rei, Luis ; Krek, Simon and Mladenić, Dunja
     This item contains 2 files (6.29 MB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Written corpus ccKres 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2013-09-30)
    
    Author(s):
    Logar, Nataša ; Erjavec, Tomaž ; Krek, Simon ; Grčar, Miha and Holozan, Peter
     This item contains 3 files (201.59 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Written corpus ccGigafida 1.0
    (Centre for Language Resources and Technologies, University of Ljubljana / 2013-09-30)
    
    Author(s):
    Logar, Nataša ; Erjavec, Tomaž ; Krek, Simon ; Grčar, Miha and Holozan, Peter
     This item contains 3 files (1.89 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Noncommercial Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Word-sense disambiguation corpus SloDicWSD 1.0
    (Faculty of Computer and Information Science, University of Ljubljana / 2025-01-24)
    
    Author(s):
    Škvorc, Tadej and Robnik-Šikonja, Marko
     This item contains 1 file (3.01 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Wikipedia talk corpus Janes-Wiki 1.0
    (Jožef Stefan Institute / 2017-08-28)
    
    Author(s):
    Ljubešić, Nikola ; Erjavec, Tomaž and Fišer, Darja
     This item contains 2 files (55.35 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • corpus
    CLARIN.SI data & tools
    corpus
    Ukrainian-English parallel corpus MaCoCu-uk-en 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-07-07)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 3 files (8.18 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Ukrainian web corpus MaCoCu-uk 1.0
    (Jožef Stefan Institute; Prompsit; Rijksuniversiteit Groningen; Universitat d'Alacant / 2023-05-24)
    
    Author(s):
    Bañón, Marta ; et al.show everyone Bañón, Marta ; Chichirau, Malina ; Esplà-Gomis, Miquel ; Forcada, Mikel L. ; Galiano-Jiménez, Aarón ; García-Romero, Cristian ; Kuzman, Taja ; Ljubešić, Nikola ; van Noord, Rik ; Pla Sempere, Leopoldo ; Ramírez-Sánchez, Gema ; Rupnik, Peter ; Suchomel, Vít ; Toral, Antonio ; Zaragoza-Bernabeu, Jaume
     This item contains 2 files (24.58 GB).
     
    Publicly Available

  • corpus
    CLARIN.SI data & tools
    corpus
    Ukrainian parliamentary corpus ParlaMint-UA 4.0.1
    (CLARIN.SI / 2023-11-29)
    
    Author(s):
    Kopp, Matyáš ; Kryvenko, Anna and Rii, Andriana
     This item contains 4 files (3.84 GB).
     
    Publicly Available Distributed under Creative Commons Attribution Required

  • corpus
    CLARIN.SI data & tools
    corpus
    Twitter sentiment for 15 European languages
    (Jožef Stefan Institute / 2016-02-23)
    
    Author(s):
    Mozetič, Igor ; Grčar, Miha and Smailović, Jasmina
     This item contains 16 files (49.38 MB).
     
    Publicly Available Distributed under Creative Commons Attribution Required Share Alike

  • 1
  • 2
  • 3
  •  
  • 30
  •    
    • Sort items by
    • Relevance
    • Title Asc
    •  Title Desc
    • Issue Date Asc
    • Issue Date Desc
    •  
    • Results/page
    • 5
    •  10
    • 20
    • 40
    • 60
    • 80
    • 100
 

Partners

  • Alpineon, d.o.o.
  • Amebis, d.o.o.
  • Institute of Contemporary History
  • Jožef Stefan Institute
  • National and University Library of Slovenia
  • Slovenian Language Technologies Society

Partners

  • University of Ljubljana
  • University of Maribor
  • University of Nova Gorica
  • University of Primorska
  • ZRC SAZU
  • ZRS Koper

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

This platform runs under the software developed for the LINDAT/CLARIAH-CZ repository for linguistics, available on GitHub

CLARIN.SI is supported by the Ministry of Education, Science and Sport of the Republic of Slovenia
under the Programme of "Research Infrastructures".