dc.contributor.author Brank, Janez
dc.date.accessioned 2023-06-04T16:13:54Z
dc.date.available 2023-06-04T16:13:54Z
dc.date.issued 2023-06-03
dc.identifier.uri http://hdl.handle.net/11356/1844
dc.description Q-CAT (Querying-Supported Corpus Annotation Tool) is a tool for manual linguistic annotation of corpora that also supports advanced queries over these annotations. The tool has been used in various annotation campaigns related to the ssj500k reference training corpus of Slovenian (http://hdl.handle.net/11356/1210), covering named entities, dependency syntax, semantic roles and multi-word expressions, but it can also be used for adding new annotation layers of various types to this or other language corpora. Q-CAT is a .NET application that runs on the Windows operating system. Version 1.1 enables the automatic attribution of token IDs and personalized font adjustments. Version 1.2 supports the CoNLL-U format and working with UD POS tags. Version 1.3 supports adding new layers of annotation on top of CoNLL-U (and then saving the corpus as XML TEI). Version 1.4 introduces new features in command line mode (filtering by sentence ID, multiple link type visualizations). Version 1.5 supports listening to audio recordings (provided in the # sound_url comment line in CoNLL-U).
dc.publisher Jožef Stefan Institute
dc.relation.isreferencedby http://slovnica.ijs.si/wp-content/uploads/2019/10/Q-CAT_prirocnik.pdf
dc.relation.isreferencedby https://nl.ijs.si/jtdh20/pdf/JT-DH_2020_Krek-et-al_The-ssj500k-Training-Corpus-for-Slovene-Language-Processing.pdf
dc.relation.replaces http://hdl.handle.net/11356/1684
dc.rights Apache License 2.0
dc.rights.uri https://opensource.org/licenses/Apache-2.0
dc.rights.label PUB
dc.source.uri https://slovenscina.eu/
dc.subject manual annotation
dc.subject corpus annotation
dc.subject annotation tool
dc.subject corpus querying
dc.subject corpus linguistics
dc.title Q-CAT Corpus Annotation Tool 1.5
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent false
has.files yes
branding CLARIN.SI data & tools
contact.person Kaja Dobrovoljc kaja.dobrovoljc@ff.uni-lj.si Faculty of Arts, University of Ljubljana
sponsor ARRS (Slovenian Research Agency) J6-8256 New grammar of contemporary standard Slovene: sources and methods nationalFunds
sponsor Ministry of Education, Science and Sport 3311-08-986003 Communication in Slovene Other
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
sponsor Ministry of Culture C3340-20-278001 Development of Slovene in a Digital Environment Other
sponsor ARRS (Slovenian Research Agency) Z6-4617 Treebank-Driven Approach to the Study of Spoken Slovenian nationalFunds
files.count 1
files.size 7943680
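
The description above notes that version 1.5 plays audio recordings referenced in the # sound_url comment line of a CoNLL-U file. The following is a minimal Python sketch of how such comment lines could be collected per sentence. It assumes the standard CoNLL-U sentence-comment convention "# key = value"; the exact syntax Q-CAT expects is not stated in this record, and the function name iter_sound_urls, the sample sentences and the example.org URL are hypothetical illustrations only.

```python
import io
from typing import Dict, Iterator, Optional, TextIO


def iter_sound_urls(conllu: TextIO) -> Iterator[Dict[str, Optional[str]]]:
    """Yield one dict per sentence with its sent_id and sound_url comments.

    Assumes "# key = value" comment lines and blank lines between sentences,
    as in standard CoNLL-U; a sentence without a sound_url yields None.
    """
    meta: Dict[str, str] = {}
    has_tokens = False
    for raw in conllu:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence block.
            if meta or has_tokens:
                yield {"sent_id": meta.get("sent_id"),
                       "sound_url": meta.get("sound_url")}
            meta, has_tokens = {}, False
        elif line.startswith("#"):
            # Split at the first "=" only, so URLs containing "=" stay intact.
            key, sep, value = line[1:].partition("=")
            if sep:
                meta[key.strip()] = value.strip()
        else:
            has_tokens = True
    # Flush the last sentence if the file does not end with a blank line.
    if meta or has_tokens:
        yield {"sent_id": meta.get("sent_id"),
               "sound_url": meta.get("sound_url")}


# Hypothetical two-sentence sample; the URL is a placeholder, not a real recording.
SAMPLE = """\
# sent_id = s1
# sound_url = https://example.org/audio/s1.mp3
# text = Dober dan.
1\tDober\tdober\tADJ\t_\t_\t2\tamod\t_\t_
2\tdan\tdan\tNOUN\t_\t_\t0\troot\t_\tSpaceAfter=No
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_

# sent_id = s2
# text = Hvala.
1\tHvala\thvala\tINTJ\t_\t_\t0\troot\t_\tSpaceAfter=No
2\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_
"""

records = list(iter_sound_urls(io.StringIO(SAMPLE)))
for record in records:
    print(record["sent_id"], record["sound_url"])
```

A check of this kind may help confirm, before opening a corpus in the tool, which sentences actually carry an audio link.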


 Files in this item

This item is Publicly Available and licensed under: Apache License 2.0
Name Q-CAT.msi
Size 7.58 MB
Format Unknown
Description Q-CAT Windows installer package file
MD5 844016b95078b53a0efd718cf5149d61
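
The MD5 value listed for Q-CAT.msi can be used to check that a download is intact. The following is a minimal Python sketch using the standard hashlib module; the helper names md5_of_file and matches_listed_md5 are hypothetical, and the local file path is whatever location the installer was saved to.

```python
import hashlib

# MD5 checksum as listed in this record for Q-CAT.msi.
EXPECTED_MD5 = "844016b95078b53a0efd718cf5149d61"


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path: str, expected: str = EXPECTED_MD5) -> bool:
    """Compare a file's MD5 digest with the expected value, ignoring case."""
    return md5_of_file(path) == expected.strip().lower()
```

For example, matches_listed_md5("Q-CAT.msi") would return True only if the downloaded file's digest equals the listed value. MD5 is suitable here for detecting accidental corruption, not for protecting against deliberate tampering.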