
Annotated Datasets

Here you can find gold-standard datasets created as part of the DecarboNet project. The data was originally downloaded from the Media Watch for Climate Change.

The datasets are made available as dehydrated JSON files, one per corpus, to comply with the rules on redistributing tweets. To rehydrate them, please download the scripts from GitHub and follow the instructions provided there.
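As a rough illustration of what rehydration involves, the sketch below joins dehydrated records with tweet text you have retrieved yourself. It is not the project's script: the function `rehydrate`, the field names `id`, `sentiment`, and `text`, and the sample records are all assumptions made for illustration, so check the actual files and the GitHub instructions for the real layout.

```python
import json


def rehydrate(dehydrated, texts_by_id, id_key="id", text_key="text"):
    """Attach tweet text to dehydrated records (hypothetical helper).

    dehydrated:  list of dicts, each holding a tweet ID plus annotations.
    texts_by_id: dict mapping tweet ID (as a string) to text you fetched yourself.
    Returns (rehydrated_records, missing_ids).
    """
    rehydrated, missing = [], []
    for record in dehydrated:
        tweet_id = str(record[id_key])
        if tweet_id in texts_by_id:
            # Copy the record and add the text; the input is left unmodified.
            rehydrated.append({**record, text_key: texts_by_id[tweet_id]})
        else:
            # Tweets that were deleted or made private cannot be recovered.
            missing.append(tweet_id)
    return rehydrated, missing


# Made-up sample data; the field names are assumptions, not the real schema.
sample = json.loads(
    '[{"id": "1", "sentiment": "positive"}, {"id": "2", "sentiment": "negative"}]'
)
fetched = {"1": "Solar capacity is growing fast."}
records, missing = rehydrate(sample, fetched)
```

Records whose tweets can no longer be retrieved are reported separately rather than silently dropped, so you can see how much of a corpus survived rehydration.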

License: the annotations are provided under a CC-BY license, while Twitter retains ownership of and rights to the content of the tweets.

1. Corpora annotated with Sentiment

2. Corpora annotated with Environmental Terms