Files in this item
All files in item: 7.43 MB
This item is Publicly Available and licensed under: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name: ParlaSent_BCS.jsonl
  Size: 1.13 MB
  Format: Unknown
  Description: BCS train file
  MD5: c8b59c84c476b031cc553bc3c768e627
- Name: ParlaSent_CZ.jsonl
  Size: 1.15 MB
  Format: Unknown
  Description: Czech train file
  MD5: ff633c11f3d0e1e8fc544db0732e8104
- Name: ParlaSent_EN.jsonl
  Size: 1.1 MB
  Format: Unknown
  Description: English train file
  MD5: 9c011abd994c14dc53afb37013fdac05
- Name: ParlaSent_SK.jsonl
  Size: 1.13 MB
  Format: Unknown
  Description: Slovak train file
  MD5: 2e2944d8edaa2021b361e3ec3d23a5ee
- Name: ParlaSent_BCS_test.jsonl
  Size: 948.03 KB
  Format: Unknown
  Description: BCS test file
  MD5: ee8699a4a7b1a834f79fe74b8ebdfaf1
- Name: ParlaSent_EN_test.jsonl
  Size: 940.29 KB
  Format: Unknown
  Description: English test file
  MD5: 003f0aeded7001574e79c49b09401e83
- Name: ParlaSent_SL.jsonl
  Size: 1.07 MB
  Format: Unknown
  Description: Slovenian train file
  MD5: 1117ec542bd1812681a2fff7f0eae1e2
- Name: README.txt
  Size: 2.15 KB
  Format: Text file
  Description: README with attribute descriptions
  MD5: 583856c8d470334e5638f6a078f727d5
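The MD5 values in the file listing above can be used to check that a download completed intact. The following is a minimal sketch in Python, not part of the dataset distribution: the helper names `md5_of_file` and `verify_downloads` are hypothetical, and it assumes the files were saved under their listed names in a single local directory.

```python
import hashlib
from pathlib import Path

# MD5 checksums copied from the file listing of this item.
EXPECTED_MD5 = {
    "ParlaSent_BCS.jsonl": "c8b59c84c476b031cc553bc3c768e627",
    "ParlaSent_CZ.jsonl": "ff633c11f3d0e1e8fc544db0732e8104",
    "ParlaSent_EN.jsonl": "9c011abd994c14dc53afb37013fdac05",
    "ParlaSent_SK.jsonl": "2e2944d8edaa2021b361e3ec3d23a5ee",
    "ParlaSent_BCS_test.jsonl": "ee8699a4a7b1a834f79fe74b8ebdfaf1",
    "ParlaSent_EN_test.jsonl": "003f0aeded7001574e79c49b09401e83",
    "ParlaSent_SL.jsonl": "1117ec542bd1812681a2fff7f0eae1e2",
    "README.txt": "583856c8d470334e5638f6a078f727d5",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed file name to True (checksum matches),
    False (checksum differs) or None (file not found locally)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if path.is_file():
            results[name] = md5_of_file(path) == expected
        else:
            results[name] = None
    return results
```

Calling `verify_downloads("path/to/download")` returns one entry per listed file; a `False` value suggests a corrupted or modified copy that is worth downloading again.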
The multilingual sentiment dataset of parliamentary debates ParlaSent 1.0
http://hdl.handle.net/11356/1868
The dataset consists of five training datasets and two test sets. The test sets have a _test.jsonl suffix.
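The split into training and test files by the _test.jsonl suffix can be sketched in Python as follows. This is an illustration under stated assumptions, not code shipped with the dataset: it assumes each file is UTF-8 encoded JSON Lines (one JSON object per line, as the .jsonl extension suggests), and the helper names `read_jsonl` and `split_train_test` are hypothetical.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Read a JSON Lines file into a list of dicts.

    Assumes one JSON object per line; blank lines are skipped.
    """
    records = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records


def split_train_test(directory="."):
    """Separate ParlaSent_*.jsonl files into training and test files.

    A file counts as a test set if its name ends in _test.jsonl.
    """
    train_files, test_files = [], []
    for path in sorted(Path(directory).glob("ParlaSent_*.jsonl")):
        if path.name.endswith("_test.jsonl"):
            test_files.append(path)
        else:
            train_files.append(path)
    return train_files, test_files
```

With all seven data files in one directory, `split_train_test` should return five training paths and two test paths, and `read_jsonl` can then be applied to each path in turn.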
The attributes in training data are the following:
- sentence - the sentence labeled for sentiment
- country - the country of the parliament the sentence comes from
- annotator1 - first annotator's annotation
- annotator2 - second annotator's annotation
- reconciliation - the final label agreed upon after reconciliation
- label - three-level label (positive, negative, neutral) derived from the reconciliation label
- document_id - internal identifier of the document the sentence comes from
- sentence_id - internal identifier of the sentence inside the document
- term - the term of the parliament the sentence comes from
- date - the date the sentence was uttered as part of a speech in the parliament
- name - name of the MP giving the speech
- party - the party of the MP
- gender . . .