dc.contributor.author | Klemen, Matej |
dc.contributor.author | Čebular, Martin |
dc.contributor.author | Žitnik, Slavko |
dc.date.accessioned | 2023-02-17T10:17:27Z |
dc.date.available | 2023-02-17T10:17:27Z |
dc.date.issued | 2023-02-17 |
dc.identifier.uri | http://hdl.handle.net/11356/1773 |
dc.description | Slovenian model for coreference resolution: a neural network based on a customized transformer architecture, usable with the code published on https://github.com/matejklemen/slovene-coreference-resolution. The model is based on the Slovenian CroSloEngual BERT 1.1 model (http://hdl.handle.net/11356/1330). It was trained on the SUK 1.0 training corpus (http://hdl.handle.net/11356/1747), specifically the SentiCoref subcorpus. Using the evaluation setting where entity mentions are assumed to be correctly pre-detected, the model achieves the following metric values: MUC: precision = 0.931, recall = 0.957, F1 = 0.943 BCubed: precision = 0.887, recall = 0.947, F1 = 0.914 CEAFe: precision = 0.945, recall = 0.893, F1 = 0.916 CoNLL-12: precision = 0.921, recall = 0.932, F1 = 0.924 |
dc.language.iso | slv |
dc.publisher | Faculty of Computer and Information Science, University of Ljubljana |
dc.relation.isreferencedby | https://doi.org/10.2298/CSIS201120060K |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://rsdo.slovenscina.eu/en/semantic-resources-and-technologies |
dc.subject | Slovenian |
dc.subject | coreference resolution |
dc.subject | neural networks |
dc.title | PyTorch model for Slovenian Coreference Resolution |
dc.type | toolService |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
has.files | yes |
branding | CLARIN.SI data & tools |
demo.uri | https://slovenscina.eu/odkrivanje-koreferencnosti |
contact.person | Matej Klemen matej.klemen@fri.uni-lj.si Faculty of Computer and Information Science, University of Ljubljana |
sponsor | Ministry of Culture C3340-20-278001 Development of Slovene in a Digital Environment Other |
files.count | 1 |
files.size | 491548090 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- slo_coref.zip
- Size
- 468.78 MB
- Format
- application/zip
- Description
- Model weights and tokenizer settings
- MD5
- affeea1ca76bead23c8262526003a618
- slo_coref
- config.json-1 B
- special_tokens_map.json-1 B
- tokenizer_config.json-1 B
- pytorch_model.bin-1 B
- scorer.th-1 B
- vocab.txt-1 B
- controller_config.json-1 B