| dc.contributor.author | Klemen, Matej |
| dc.contributor.author | Čebular, Martin |
| dc.contributor.author | Žitnik, Slavko |
| dc.date.accessioned | 2023-02-17T10:17:27Z |
| dc.date.available | 2023-02-17T10:17:27Z |
| dc.date.issued | 2023-02-17 |
| dc.identifier.uri | http://hdl.handle.net/11356/1773 |
| dc.description | Slovenian model for coreference resolution: a neural network based on a customized transformer architecture, usable with the code published on https://github.com/matejklemen/slovene-coreference-resolution. The model is based on the Slovenian CroSloEngual BERT 1.1 model (http://hdl.handle.net/11356/1330). It was trained on the SUK 1.0 training corpus (http://hdl.handle.net/11356/1747), specifically the SentiCoref subcorpus. Using the evaluation setting where entity mentions are assumed to be correctly pre-detected, the model achieves the following metric values: MUC: precision = 0.931, recall = 0.957, F1 = 0.943 BCubed: precision = 0.887, recall = 0.947, F1 = 0.914 CEAFe: precision = 0.945, recall = 0.893, F1 = 0.916 CoNLL-12: precision = 0.921, recall = 0.932, F1 = 0.924 |
| dc.language.iso | slv |
| dc.publisher | Faculty of Computer and Information Science, University of Ljubljana |
| dc.relation.isreferencedby | https://doi.org/10.2298/CSIS201120060K |
| dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
| dc.rights.label | PUB |
| dc.source.uri | https://rsdo.slovenscina.eu/en/semantic-resources-and-technologies |
| dc.subject | Slovenian |
| dc.subject | coreference resolution |
| dc.subject | neural networks |
| dc.title | PyTorch model for Slovenian Coreference Resolution |
| dc.type | toolService |
| metashare.ResourceInfo#ContentInfo.detailedType | tool |
| metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
| has.files | yes |
| branding | CLARIN.SI data & tools |
| demo.uri | https://slovenscina.eu/odkrivanje-koreferencnosti |
| contact.person | Matej Klemen matej.klemen@fri.uni-lj.si Faculty of Computer and Information Science, University of Ljubljana |
| sponsor | Ministry of Culture C3340-20-278001 Development of Slovene in a Digital Environment Other |
| files.count | 1 |
| files.size | 491548090 |
Datoteke v tem vnosu
To je vnos
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
z licenco:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Ime
- slo_coref.zip
- Velikost
- 468.78 MB
- Format
- application/zip
- Opis
- Model weights and tokenizer settings
- MD5
- affeea1ca76bead23c8262526003a618
- slo_coref
- config.json-1 B
- special_tokens_map.json-1 B
- tokenizer_config.json-1 B
- pytorch_model.bin-1 B
- scorer.th-1 B
- vocab.txt-1 B
- controller_config.json-1 B