| dc.contributor.author | Popović, Maja |
| dc.contributor.author | Arčan, Mihael |
| dc.date.accessioned | 2016-05-29T15:22:49Z |
| dc.date.available | 2016-05-29T15:22:49Z |
| dc.date.issued | 2016-05-24 |
| dc.identifier.uri | http://hdl.handle.net/11356/1065 |
| dc.description | The PE²rr corpus contains source language texts from different domains along with their automatically generated translations into several morphologically rich languages, their post-edited versions, and error annotations of the performed post-edit operations. The main advantage of the corpus is the fusion of post-editing and error classification tasks, which have usually been seen as two independent tasks, although naturally they are not. |
| dc.language.iso | slv |
| dc.language.iso | srp |
| dc.language.iso | deu |
| dc.language.iso | spa |
| dc.language.iso | eng |
| dc.publisher | Insight Centre for Data Analytics, National University of Ireland, Galway |
| dc.relation | info:eu-repo/grantAgreement/EC/H2020/644333 |
| dc.relation.isreferencedby | http://www.lrec-conf.org/proceedings/lrec2016/summaries/405.html |
| dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
| dc.rights.uri | https://creativecommons.org/licenses/by-sa/4.0/ |
| dc.rights.label | PUB |
| dc.subject | parallel corpus |
| dc.subject | machine translation |
| dc.subject | post-editing |
| dc.subject | error annotation |
| dc.subject | manual annotation |
| dc.subject | multilingual |
| dc.title | Post-edited and error annotated machine translation corpus PErr 1.0 |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | yes |
| branding | CLARIN.SI data & tools |
| contact.person | Maja Popović maja.popovic@hu-berlin.de Humboldt University of Berlin |
| contact.person | Mihael Arčan mihael.arcan@insight-centre.org Insight Centre for Data Analytics, National University of Ireland, Galway |
| sponsor | European Union EC/H2020/644333 TraMOOC - Translation for Massive Open Online Courses euFunds info:eu-repo/grantAgreement/EC/H2020/644333 |
| sponsor | Science Foundation Ireland SFI/12/RC/2289 Insight nationalFunds |
| size.info | 2896 units |
| size.info | 43938 words |
| files.count | 1 |
| files.size | 373440 |
Files in this item
This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- pe2rr_dataset.tgz
- Size
- 364.69 KB
- Format
- Unknown
- Description
- 11 files (each for one source / language pair), tab-separated, with one translation unit per line.
- MD5
- cb5dd1552c5a3c28008a7ceb294aafb2