dc.contributor.author | Ljubešić, Nikola |
dc.date.accessioned | 2018-05-28T11:22:02Z |
dc.date.available | 2018-05-28T11:22:02Z |
dc.date.issued | 2018-05-28 |
dc.identifier.uri | http://hdl.handle.net/11356/1187 |
dc.description | The lexicon contains concreteness and imageability predictions for words in 77 languages. The resource is built via supervised machine learning, using average human responses obtained for Croatian lexemes within the MEGAHR project (http://megahr.ffzg.unizg.hr) as the response variable, and the Facebook cross-lingual word embeddings (https://github.com/Babylonpartners/fastText_multilingual) as explanatory variables. The Spearman correlation between human responses and automatic annotations on the Croatian-English language pair is ~0.8 for concreteness and ~0.7 for imageability. |
dc.language.iso | afr |
dc.language.iso | ara |
dc.language.iso | aze |
dc.language.iso | bel |
dc.language.iso | bul |
dc.language.iso | ben |
dc.language.iso | bos |
dc.language.iso | cat |
dc.language.iso | ceb |
dc.language.iso | ces |
dc.language.iso | cym |
dc.language.iso | dan |
dc.language.iso | deu |
dc.language.iso | ell |
dc.language.iso | eng |
dc.language.iso | epo |
dc.language.iso | spa |
dc.language.iso | est |
dc.language.iso | eus |
dc.language.iso | fas |
dc.language.iso | fin |
dc.language.iso | fra |
dc.language.iso | fry |
dc.language.iso | glg |
dc.language.iso | guj |
dc.language.iso | heb |
dc.language.iso | hin |
dc.language.iso | hrv |
dc.language.iso | hun |
dc.language.iso | hye |
dc.language.iso | ind |
dc.language.iso | isl |
dc.language.iso | ita |
dc.language.iso | jpn |
dc.language.iso | kat |
dc.language.iso | kaz |
dc.language.iso | khm |
dc.language.iso | kan |
dc.language.iso | kor |
dc.language.iso | kir |
dc.language.iso | lat |
dc.language.iso | ltz |
dc.language.iso | lit |
dc.language.iso | lav |
dc.language.iso | mlg |
dc.language.iso | mkd |
dc.language.iso | mal |
dc.language.iso | mon |
dc.language.iso | mar |
dc.language.iso | msa |
dc.language.iso | mya |
dc.language.iso | nep |
dc.language.iso | nld |
dc.language.iso | nor |
dc.language.iso | pan |
dc.language.iso | pol |
dc.language.iso | por |
dc.language.iso | ron |
dc.language.iso | rus |
dc.language.iso | hbs |
dc.language.iso | sin |
dc.language.iso | slk |
dc.language.iso | slv |
dc.language.iso | sqi |
dc.language.iso | srp |
dc.language.iso | swe |
dc.language.iso | tam |
dc.language.iso | tel |
dc.language.iso | tgk |
dc.language.iso | tha |
dc.language.iso | tgl |
dc.language.iso | tur |
dc.language.iso | ukr |
dc.language.iso | urd |
dc.language.iso | uzb |
dc.language.iso | vie |
dc.language.iso | zho |
dc.publisher | Jožef Stefan Institute |
dc.publisher | Faculty of Humanities and Social Sciences, University of Zagreb |
dc.relation.isreferencedby | https://arxiv.org/abs/1807.02903 |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://github.com/clarinsi/megahr-crossling |
dc.subject | concreteness |
dc.subject | imageability |
dc.subject | multilingual |
dc.title | Concreteness and imageability lexicon MEGA.HR-Crossling |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType | lexicon |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute |
sponsor | Croatian Science Foundation HRZZ-IP-2016-06-1210 MEGAHR nationalFunds |
size.info | 7237589 entries |
files.count | 1 |
files.size | 172760569 |
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name: megahr-crossling.zip
- Size: 164.76 MB
- Format: application/zip
- Description: Lexicons in tab-separated format
- MD5: 0ffe2b901465b3eb807ac302a308f6ca
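The MD5 value listed for megahr-crossling.zip can be used to check that a download is intact. The following is a minimal sketch, not part of the resource itself: the function names `md5_of_file` and `matches_published_md5` are hypothetical helpers introduced here, and the archive is assumed to have been saved locally under its original name.

```python
import hashlib

# MD5 checksum published in this record for megahr-crossling.zip.
EXPECTED_MD5 = "0ffe2b901465b3eb807ac302a308f6ca"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that the roughly 165 MB archive is never held in memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps calling read() until it returns b"".
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=EXPECTED_MD5):
    """Compare a local file's MD5 digest with the published value."""
    return md5_of_file(path) == expected.lower()
```

Calling `matches_published_md5("megahr-crossling.zip")` on the downloaded archive should return `True` if the file matches the checksum above; the local path is an assumption and should be adjusted to wherever the file was saved.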
- megahr.pt (4 MB)
- megahr.de (4 MB)
- megahr.pl (4 MB)
- megahr.da (4 MB)
- megahr.ja (4 MB)
- megahr.vi (4 MB)
- megahr.cy (2 MB)
- megahr.pa (2 MB)
- megahr.cs (4 MB)
- megahr.it (4 MB)
- megahr.is (4 MB)
- megahr.uz (4 MB)
- megahr.ur (4 MB)
- megahr.ca (4 MB)
- megahr.id (4 MB)
- megahr.uk (5 MB)
- megahr.hy (5 MB)
- megahr.bs (4 MB)
- megahr.hu (4 MB)
- 00README (181 B)
- megahr.bn (5 MB)
- megahr.hr (4 MB)
- megahr.bg (4 MB)
- megahr.be (5 MB)
- megahr.no (4 MB)
- megahr.hi (5 MB)
- megahr.tr (4 MB)
- megahr.nl (4 MB)
- megahr.he (4 MB)
- megahr.tl (2 MB)
- megahr.az (4 MB)
- megahr.ne (3 MB)
- megahr.th (4 MB)
- megahr.tg (2 MB)
- megahr.te (5 MB)
- megahr.zh (4 MB)
- megahr.ar (4 MB)
- megahr.ta (6 MB)
- megahr.gu (3 MB)
- megahr.my (5 MB)
- megahr.ms (4 MB)
- megahr.mr (5 MB)
- megahr.gl (4 MB)
- megahr.sv (4 MB)
- megahr.af (4 MB)
- megahr.mn (3 MB)
- megahr.sr (4 MB)
- megahr.ml (5 MB)
- megahr.sq (4 MB)
- megahr.mk (5 MB)
- megahr.ceb (4 MB)
- megahr.mg (1 MB)
- megahr.sl (4 MB)
- megahr.sk (4 MB)
- megahr.si (4 MB)
- megahr.sh (4 MB)
- megahr.fy (3 MB)
- megahr.fr (4 MB)
- megahr.lv (4 MB)
- megahr.lt (4 MB)
- megahr.ru (5 MB)
- megahr.fi (4 MB)
- megahr.ro (4 MB)
- megahr.fa (4 MB)
- megahr.lb (3 MB)
- megahr.la (4 MB)
- megahr.eu (4 MB)
- megahr.et (4 MB)
- megahr.ky (4 MB)
- megahr.es (4 MB)
- megahr.eo (4 MB)
- megahr.en (4 MB)
- megahr.el (5 MB)
- megahr.ko (4 MB)
- megahr.kn (5 MB)
- megahr.km (2 MB)
- megahr.kk (5 MB)
- megahr.ka (5 MB)
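Since the per-language files are described as tab-separated, one of them can be read straight from the archive without unpacking it. The sketch below makes several assumptions that this record does not confirm: that the files are UTF-8 encoded, that they may sit inside a subdirectory of the archive, and that fields need no quoting. The column layout is not stated here (the bundled 00README presumably documents it), so rows are returned as plain lists of strings. The function name `iter_lexicon_rows` is a hypothetical helper.

```python
import csv
import io
import zipfile


def iter_lexicon_rows(zip_path, basename):
    """Yield each line of one lexicon file as a list of tab-separated fields.

    `basename` is a file name from the listing above, e.g. "megahr.hr".
    The member is matched by its final path component, because the
    directory layout inside the archive is an assumption.
    """
    with zipfile.ZipFile(zip_path) as archive:
        matches = [name for name in archive.namelist()
                   if name.rsplit("/", 1)[-1] == basename]
        if not matches:
            raise KeyError(f"{basename} not found in {zip_path}")
        with archive.open(matches[0]) as raw:
            # UTF-8 is assumed; newline="" lets the csv module handle line ends.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            reader = csv.reader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
            for row in reader:
                yield row
```

For example, `next(iter_lexicon_rows("megahr-crossling.zip", "megahr.hr"))` would return the first line of the Croatian file as a list of fields, which is a quick way to inspect the layout before committing to column names.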