dc.contributor.author | Kosem, Iztok |
dc.contributor.author | Rozman, Tadeja |
dc.contributor.author | Pori, Eva |
dc.contributor.author | Arhar Holdt, Špela |
dc.contributor.author | Kocjančič, Polonca |
dc.contributor.author | Laskowski, Cyprian |
dc.contributor.author | Klemenc, Bojan |
dc.date.accessioned | 2019-07-16T11:14:43Z |
dc.date.available | 2019-07-16T11:14:43Z |
dc.date.issued | 2019-07-16 |
dc.identifier.uri | http://hdl.handle.net/11356/1224 |
dc.description | The ccŠolar 1.0 corpus contains 1693 texts collected during 2016-2018 as part of the project to upgrade the Šolar corpus. The project aimed to increase the size of the Šolar 1.0 corpus and to improve the balance of texts across regions and education levels. For each text, the following information is provided: school (elementary or secondary), subject, level (grade or year), type of text, region, and date of production. The ccŠolar 1.0 corpus is offered separately because the new texts were collected under the CC BY 4.0 licence, which is more open than the licence of the earlier texts. |
dc.language.iso | slv |
dc.publisher | Trojina, Institute for Applied Slovene Studies |
dc.publisher | Centre for Language Resources and Technologies, University of Ljubljana |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://www.cjvt.si/raziskovalno-delo/projekti-cjvt/korpus-solar/ |
dc.subject | developmental corpus |
dc.subject | student writing |
dc.title | Developmental corpus ccŠolar 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Iztok Kosem iztok.kosem@ff.uni-lj.si Centre for Language Resources and Technologies, University of Ljubljana |
sponsor | Ministry of Culture 3340-15-141006 Upgrade of Šolar Corpus nationalFunds |
sponsor | ARRS (Slovenian Research Agency) I0-0051 Centre for Applied Linguistics (CUJ) nationalFunds |
sponsor | University of Ljubljana I0-0022 Network of Research Infrastructure Centres (MRIC) nationalFunds |
size.info | 1693 texts |
size.info | 468821 words |
size.info | 540868 tokens |
files.count | 1 |
files.size | 6111742 |
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)



Name | ccSolar1.0.zip |
Size | 5.83 MB |
Format | application/zip |
Description | Corpus in TEI format |
MD5 | 03390cae483db47a1ad69e99611451b9 |
- ccSolar1.0
- ccSolar.xml (43 MB)
- schema
- tei_clarin.zip (87 kB)
- tei_clarin.rnc (291 kB)
- tei_clarin.dtd (233 kB)
- tei_clarin.rng (592 kB)
- 00README.txt (214 B)