Show simple item record

 
dc.contributor.author Bitenc, Maja
dc.contributor.author Erjavec, Tomaž
dc.date.accessioned 2025-07-25T07:17:22Z
dc.date.available 2025-07-25T07:17:22Z
dc.date.issued 2025-07-23
dc.identifier.uri http://hdl.handle.net/11356/2037
dc.description The "Mobile Languages" corpus documents in-depth, semi-structured sociolinguistic interviews with speakers from two Slovene regions and distinctive dialects: Idrija (Cerkno dialect, Rovte dialect group) and Ribnica (Lower Carniola dialect, Lower Carniola dialect group), who study or work in the Slovenian capital, Ljubljana, and thus navigate daily between dialectal and standard language use. Interview topics include narratives of personal (linguistic) history, reflections on past and present language practices, attitudes towards their own dialects and other Slovene varieties, experiences of dialect perception in the Ljubljana context and of standard-like speech in local environments, linguistic identity, stereotypes and prejudices, intergenerational language use (especially with children), and language behaviour in educational settings. The corpus includes: – Idrija group: 5 speakers (3 women, 2 men; 3 adults, 2 secondary-school students), recorded between 2009 and 2013; 1,112 transcribed utterances, 31,506 transcribed words. – Ribnica group: 11 speakers (3 primary informants and 8 close contacts, including family members, friends, and colleagues), recorded between 2020 and 2022; 2,889 transcribed utterances, 47,364 transcribed words. The transcriptions are orthographic, with selected non-standard features preserved using special symbols to capture salient dialectal elements (e.g., the fricative [γ] and the bilabial glide [w] in the Cerkno variety). Speaker names have been anonymised. While transcription prioritised content and was performed by multiple transcribers, consistency in the phonetic rendering of dialectal features was not systematically verified. Users should be aware that detailed phonological analysis may require additional checking. The interviews were conducted within the framework of broader sociolinguistic research, which also encompassed informants’ self-recordings of spontaneous speech in diverse everyday situations and a quantitative variationist analysis of five phonological variables (dialect-specific) across various communicative contexts. The interview data enable comparisons between speakers’ metalinguistic commentary and their actual language use as documented in the recordings. The findings of the Cerkno and Ribnica studies are comprehensively presented in two scientific publications: * Bitenc, Maja, 2016: Z jezikom na poti med Idrijskim in Ljubljano [With Language on the Move Between Idrija and Ljubljana]. Ljubljana: Znanstvena založba Filozofske fakultete. * Bitenc, Maja (in press): Govor v gibanju med Ribnico in Ljubljano [Speech in Motion Between Ribnica and Ljubljana]. Ljubljana: Znanstvena založba Filozofske fakultete. The corpus speech files for speakers who have consented to the publication of their recordings are available as a separate entry: The "Mobile languages" corpus MoJezik 1.0 (audio), http://hdl.handle.net/11356/2042.
dc.language.iso slv
dc.publisher Faculty of Arts, University of Ljubljana
dc.relation.isreferencedby https://doi.org/10.4312/slo2.0.2016.1.118-123
dc.relation.isreferencedby https://www.ff.uni-lj.si/publikacije/z-jezikom-na-poti-med-idrijskim-ljubljano
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.source.uri https://www.ff.uni-lj.si/raziskovanje/sociolingvisticna-variantnost-govorjene-slovenscine
dc.subject dialect
dc.subject spoken corpus
dc.subject spoken language
dc.subject spoken Slovene
dc.subject TEI
dc.subject sociolinguistic interview
dc.subject Idrija
dc.subject Cerkno dialect
dc.subject Rovte dialect group
dc.subject Ribnica
dc.subject Lower Carniola dialect
dc.subject Lower Carniola dialect group
dc.title The "Mobile languages" corpus MoJezik 1.0 (transcription)
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Maja Bitenc Maja.Bitenc@ff.uni-lj.si Faculty of Arts, University of Ljubljana
sponsor ARIS Z6-9371 Sociolinguistic Variation of Spoken Slovene in the Case of Mobile Speakers: Resources, Methods and Analysis nationalFunds
size.info 4001 utterances
size.info 78871 words
files.count 3
files.size 2492473
featuredService.noske search|https://www.clarin.si/ske/#dashboard?corpname=mojezik


 Files in this item

 Download all files in item (2.38 MB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Distributed under Creative Commons Attribution Required
Icon
Name
MoJezik.TEI.zip
Size
793.61 KB
Format
application/zip
Description
Corpus in canonical TEI encoding
MD5
a50afcb146c343b622ca029f9878ffd7
 Download file  Preview
 File Preview  
  • MoJezik.TEI
    • Ribnica.ana.xml2 MB
    • Idrija.xml283 kB
    • Idrija.ana.xml1 MB
    • Schema
      • tei_odds.rng665 kB
      • tei_clarin.rnc331 kB
      • tei_clarin.dtd240 kB
      • tei_clarin.rng707 kB
      • tei_clarin.xsd756 kB
      • dcr.tmp1 kB
      • xml.tmp2 kB
    • Ribnica.xml531 kB
    • 00README.txt3 kB
Icon
Name
MoJezik.PDF.zip
Size
1.4 MB
Format
application/zip
Description
Corpus files formatted for reading and transcription guidelines
MD5
159e70f9405e3114cc9b2357051c177b
 Download file  Preview
 File Preview  
  • MoJezik.PDF
    • MoJezik-Ribnica.pdf863 kB
    • MoJezik-opis_transkripcije.pdf68 kB
    • 00README.txt2 kB
    • MoJezik-Idrija.pdf606 kB
Icon
Name
MoJezik.vert.zip
Size
209.67 KB
Format
application/zip
Description
Corpus in derived vertical format
MD5
d17f982f66540ff07df1f36d53b8fb52
 Download file  Preview
 File Preview  
  • MoJezik.vert
    • Makefile.vert403 B
    • ribnica.vert946 kB
    • idrija.vert451 kB
    • tei2vert.xsl5 kB
    • mojezik.regi1 kB
    • xml2vert.pl556 B
    • 00README.txt3 kB

Show simple item record