
 
dc.contributor.author Klemen, Matej
dc.contributor.author Čebular, Martin
dc.contributor.author Žitnik, Slavko
dc.date.accessioned 2023-02-17T10:17:27Z
dc.date.available 2023-02-17T10:17:27Z
dc.date.issued 2023-02-17
dc.identifier.uri http://hdl.handle.net/11356/1773
dc.description Slovenian model for coreference resolution: a neural network based on a customized transformer architecture, usable with the code published at https://github.com/matejklemen/slovene-coreference-resolution. The model is based on the Slovenian CroSloEngual BERT 1.1 model (http://hdl.handle.net/11356/1330). It was trained on the SUK 1.0 training corpus (http://hdl.handle.net/11356/1747), specifically the SentiCoref subcorpus. In the evaluation setting where entity mentions are assumed to be correctly pre-detected, the model achieves the following metric values: MUC: precision = 0.931, recall = 0.957, F1 = 0.943; BCubed: precision = 0.887, recall = 0.947, F1 = 0.914; CEAFe: precision = 0.945, recall = 0.893, F1 = 0.916; CoNLL-12: precision = 0.921, recall = 0.932, F1 = 0.924.
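The CoNLL-12 row in the description can be cross-checked from the other three rows. The sketch below assumes the usual convention that the CoNLL-12 score is the unweighted mean of the MUC, BCubed, and CEAFe values; the record itself does not state how the row was computed, and the helper name `macro_average` is illustrative only.

```python
# Metric values copied from the dc.description field of this record.
scores = {
    "MUC":    {"precision": 0.931, "recall": 0.957, "f1": 0.943},
    "BCubed": {"precision": 0.887, "recall": 0.947, "f1": 0.914},
    "CEAFe":  {"precision": 0.945, "recall": 0.893, "f1": 0.916},
}


def macro_average(metric_scores, key):
    """Unweighted mean of one value (e.g. "f1") across all metrics.

    Assumption: CoNLL-12 is taken as the plain average of the three
    metrics; this is not confirmed by the record.
    """
    return sum(m[key] for m in metric_scores.values()) / len(metric_scores)


# Round to three decimals, the precision used in the description.
conll = {
    key: round(macro_average(scores, key), 3)
    for key in ("precision", "recall", "f1")
}
print(conll)
```

Under that assumption the averages agree with the reported CoNLL-12 values of 0.921, 0.932, and 0.924 to three decimals.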
dc.language.iso slv
dc.publisher Faculty of Computer and Information Science, University of Ljubljana
dc.relation.isreferencedby https://doi.org/10.2298/CSIS201120060K
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.source.uri https://rsdo.slovenscina.eu/en/semantic-resources-and-technologies
dc.subject Slovenian
dc.subject coreference resolution
dc.subject neural networks
dc.title PyTorch model for Slovenian Coreference Resolution
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding CLARIN.SI data & tools
demo.uri https://slovenscina.eu/odkrivanje-koreferencnosti
contact.person Matej Klemen; matej.klemen@fri.uni-lj.si; Faculty of Computer and Information Science, University of Ljubljana
sponsor Ministry of Culture; C3340-20-278001; Development of Slovene in a Digital Environment; Other
files.count 1
files.size 491548090


 Files in this item

This item is Publicly Available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
Name: slo_coref.zip
Size: 468.78 MB
Format: application/zip
Description: Model weights and tokenizer settings
MD5: affeea1ca76bead23c8262526003a618
 File Preview  
  • slo_coref
    • config.json (1 B)
    • special_tokens_map.json (1 B)
    • tokenizer_config.json (1 B)
    • pytorch_model.bin (1 B)
    • scorer.th (1 B)
    • vocab.txt (1 B)
    • controller_config.json (1 B)
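The file preview can also serve as a checklist for the downloaded archive. The sketch below assumes that the members are stored under a `slo_coref/` prefix, as the preview tree suggests; the actual paths inside the archive may differ, and `missing_members` is a hypothetical helper name.

```python
import zipfile

# File names taken from the preview above; the "slo_coref/" prefix is an
# assumption based on how the preview tree is nested.
EXPECTED_MEMBERS = {
    "slo_coref/config.json",
    "slo_coref/special_tokens_map.json",
    "slo_coref/tokenizer_config.json",
    "slo_coref/pytorch_model.bin",
    "slo_coref/scorer.th",
    "slo_coref/vocab.txt",
    "slo_coref/controller_config.json",
}


def missing_members(archive_path, expected=EXPECTED_MEMBERS):
    """Return the sorted list of expected files absent from the archive."""
    with zipfile.ZipFile(archive_path) as archive:
        present = set(archive.namelist())
    return sorted(set(expected) - present)
```

An empty list from `missing_members("slo_coref.zip")` would indicate that every file named in the preview is present under the assumed prefix.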
