Files in this item

 Download all files in item (7.43 MB)
This item is publicly available and licensed under Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0).
Name                      Size       Format     Description                         MD5
ParlaSent_BCS.jsonl       1.13 MB    Unknown    BCS train file                      c8b59c84c476b031cc553bc3c768e627
ParlaSent_CZ.jsonl        1.15 MB    Unknown    Czech train file                    ff633c11f3d0e1e8fc544db0732e8104
ParlaSent_EN.jsonl        1.1 MB     Unknown    English train file                  9c011abd994c14dc53afb37013fdac05
ParlaSent_SK.jsonl        1.13 MB    Unknown    Slovak train file                   2e2944d8edaa2021b361e3ec3d23a5ee
ParlaSent_BCS_test.jsonl  948.03 KB  Unknown    BCS test file                       ee8699a4a7b1a834f79fe74b8ebdfaf1
ParlaSent_EN_test.jsonl   940.29 KB  Unknown    English test file                   003f0aeded7001574e79c49b09401e83
ParlaSent_SL.jsonl        1.07 MB    Unknown    Slovenian train file                1117ec542bd1812681a2fff7f0eae1e2
README.txt                2.15 KB    Text file  README with attribute descriptions  583856c8d470334e5638f6a078f727d5
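
The MD5 checksums listed above can be used to confirm that a download is intact. The following is a minimal sketch, assuming the files have been saved under their original names in a single local directory; the function names `md5_of_file` and `verify_downloads` are hypothetical helpers introduced here for illustration and are not part of the dataset.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this item.
EXPECTED_MD5 = {
    "ParlaSent_BCS.jsonl": "c8b59c84c476b031cc553bc3c768e627",
    "ParlaSent_CZ.jsonl": "ff633c11f3d0e1e8fc544db0732e8104",
    "ParlaSent_EN.jsonl": "9c011abd994c14dc53afb37013fdac05",
    "ParlaSent_SK.jsonl": "2e2944d8edaa2021b361e3ec3d23a5ee",
    "ParlaSent_BCS_test.jsonl": "ee8699a4a7b1a834f79fe74b8ebdfaf1",
    "ParlaSent_EN_test.jsonl": "003f0aeded7001574e79c49b09401e83",
    "ParlaSent_SL.jsonl": "1117ec542bd1812681a2fff7f0eae1e2",
    "README.txt": "583856c8d470334e5638f6a078f727d5",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to True (match), False (mismatch),
    or None (file not found in the directory)."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        results[name] = md5_of_file(path) == checksum if path.is_file() else None
    return results
```

Calling `verify_downloads("path/to/download/dir")` returns a dictionary that can be scanned for `False` or `None` entries before the data is used.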
File preview (README.txt):
The multilingual sentiment dataset of parliamentary debates ParlaSent 1.0
http://hdl.handle.net/11356/1868

The dataset consists of five training datasets and two test sets. The test sets have a _test.jsonl suffix.

The attributes in training data are the following:
- sentence - the sentence labeled for sentiment
- country - the country of the parliament the sentence comes from
- annotator1 - first annotator's annotation
- annotator2 - second annotator's annotation
- reconciliation - the final label agreed upon after reconciliation
- label - three-level (positive, negative, neutral) label based on the reconciliation label
- document_id - internal identifier of the document the sentence comes from
- sentence_id - internal identifier of the sentence inside the document
- term - the term of the parliament the sentence comes from
- date - the date the sentence was uttered as part of a speech in the parliament
- name - name of the MP giving the speech
- party - the party of the MP
- gender . . .
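
As an illustration of how the attributes listed above might be read, here is a minimal sketch that assumes each line of a .jsonl file is one JSON object carrying those attribute names as keys. The helpers `read_jsonl`, `label_distribution` and `annotator_agreement` are hypothetical names introduced here, and the exact spelling of the label values in the files is not stated in this preview, so the code does not depend on it.

```python
import json
from collections import Counter


def read_jsonl(path):
    """Yield one dict per non-empty line of a JSON Lines file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def label_distribution(records):
    """Count how often each value of the 'label' attribute occurs."""
    return Counter(record.get("label") for record in records)


def annotator_agreement(records):
    """Return the fraction of records in which annotator1 and annotator2
    gave the same annotation, or None if no record has both attributes."""
    pairs = [
        (record["annotator1"], record["annotator2"])
        for record in records
        if "annotator1" in record and "annotator2" in record
    ]
    if not pairs:
        return None
    return sum(first == second for first, second in pairs) / len(pairs)
```

For example, `label_distribution(read_jsonl("ParlaSent_EN.jsonl"))` would give the label counts for the English training file, assuming it has been downloaded to the working directory. The attribute list above is given for the training data, so the test files may need to be checked for matching keys before the same helpers are applied to them.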