dc.contributor.author | Borovič, Mladen |
dc.contributor.author | Žagar, Kristjan |
dc.contributor.author | Ferme, Marko |
dc.contributor.author | Majninger, Sandi |
dc.contributor.author | Ojsteršek, Milan |
dc.contributor.author | Žagar, Aleš |
dc.contributor.author | Robnik-Šikonja, Marko |
dc.date.accessioned | 2022-11-07T15:09:19Z |
dc.date.available | 2022-11-07T15:09:19Z |
dc.date.issued | 2022-11-07 |
dc.identifier.uri | http://hdl.handle.net/11356/1704 |
dc.description | SuperGLUE is a benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a public leaderboard. It is comprised of 8 corpora (BoolQ, CB, COPA, MultiRC, ReCoRD, RTE, WiC, WSC), which cover 4 different types of tasks (QA, NLI, WSD, coref.). Slovene translation of SuperGLUE consists of machine and human translations of the benchmark. ReCoRD is completely translated by the Google Machine Translation service. Questions and answers from the project "Slovene in the Palm of your Hand (Slovenščina na dlani)" are also included for the BoolQ, MultiRC and ReCoRD tasks and are in form of extensions to the existing datasets. The data is provided in jsonl format. |
dc.language.iso | slv |
dc.publisher | Faculty of Electrical Engineering and Computer Science, University of Maribor |
dc.relation.isreferencedby | https://super.gluebenchmark.com/ |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://rsdo.slovenscina.eu/en/semantic-resources-and-technologies |
dc.subject | dataset |
dc.subject | natural language processing |
dc.subject | Q&A |
dc.subject | SuperGLUE |
dc.title | Extensions to the Slovene translation of SuperGLUE |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Mladen Borovič mladen.borovic@um.si Faculty of Electrical Engineering and Computer Science, University of Maribor |
sponsor | Ministry of Culture C3340-20-278001 Development of Slovene in a Digital Environment Other |
sponsor | Slovene Ministry of Culture and European Social Fund C3340-17-208002 Slovenščina na dlani Other |
files.count | 4 |
files.size | 61359637 |
Files in this item
Download all files in item (58.52 MB)This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- SuperGLUE-multirc-ext-SND.zip
- Size
- 306.41 KB
- Format
- application/zip
- Description
- Questions and answers from the project Slovenščina na dlani for the MultiRC task
- MD5
- 391ad993236c97ce33af893b1f300d33
- Name
- SuperGLUE-record-GoogleMT.zip
- Size
- 50.21 MB
- Format
- application/zip
- Description
- Google translated dataset for the ReCoRD task
- MD5
- 3f5deb474eb1cfa8ef392ea0f8049051
- Name
- SuperGLUE-record-ext-SND.zip
- Size
- 514.41 KB
- Format
- application/zip
- Description
- Questions and answers from the project Slovenščina na dlani for the ReCoRD task
- MD5
- f52dd6ecf27f0b117cc897e20f90bcb7
- Name
- SuperGLUE-boolq-ext-SND.zip
- Size
- 7.51 MB
- Format
- application/zip
- Description
- Questions and answers from the project Slovenščina na dlani for the BoolQ task
- MD5
- ef92b2a1a390ad84b805092b001128c5