A service that tags abusive utterances in text. Each abusive phrase carries a "type" feature indicating the kind of abuse (for example sexist or racist) and a "target" feature indicating whether the abuse was aimed at the addressee or at some other party. The service can be run on any English-language text.
It also tags UK members of parliament (MPs) for the 2015, 2017 and 2019 general elections, and candidates for the 2017 and 2019 elections. Where an individual has run for election or been elected more than once, several "Politician" annotations appear, each with a different "minorType" feature, so a person's recent political career can be tracked. The current parliament is the 58th, and earlier parliaments count down from there: MPs with a minorType of "mp55" are those who were MPs before the 2015 general election.
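The numbering convention can be decoded mechanically. The following is a minimal Python sketch, not part of the service: the helper names and the election-year table are illustrative assumptions derived only from the description above (the 58th parliament followed the 2019 election, counting down). It only understands values of the form "mp" followed by a number; any other minorType values, such as those used for candidates, are not documented here and are reported as unknown.

```python
import re
from typing import Optional

# General election that returned each parliament, inferred from the text:
# 58th <- 2019, 57th <- 2017, 56th <- 2015. Earlier parliaments are left
# out because the text does not state their election years.
ELECTION_YEAR = {56: 2015, 57: 2017, 58: 2019}

# Assumption: MP minorType values look like "mp55", "mp56", ...
_MP_PATTERN = re.compile(r"mp(\d+)")


def parliament_from_minor_type(minor_type: str) -> Optional[int]:
    """Return the parliament number encoded in an "mpNN" value, else None."""
    match = _MP_PATTERN.fullmatch(minor_type)
    return int(match.group(1)) if match else None


def describe(minor_type: str) -> str:
    """Give a human-readable description of a Politician minorType value."""
    number = parliament_from_minor_type(minor_type)
    if number is None:
        return f"{minor_type}: not an MP value this sketch understands"
    year = ELECTION_YEAR.get(number)
    if year is None:
        return f"{minor_type}: MP in parliament {number}"
    return f"{minor_type}: MP in parliament {number} (returned by the {year} general election)"


if __name__ == "__main__":
    for value in ("mp55", "mp58"):
        print(describe(value))
```

Collecting the parliament numbers from all "Politician" annotations for one person gives a rough timeline of the parliaments in which that person sat.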
The service also tags a range of politically relevant topics, entities such as persons, locations and organizations, and Twitter entities such as hashtags and user mentions. It is designed to run on tweets in the original Twitter JSON input format, for which it also produces metadata such as whether the tweet is a reply or a retweet. You can upload your own tweets or harvest some with our Twitter Collector. The service can, however, be run on any text.
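To illustrate the kind of reply and retweet metadata mentioned above, here is a small Python sketch that classifies a tweet directly from its JSON. It is not the service's implementation: the function name `tweet_kind` is hypothetical, and the sketch assumes the input follows the classic Twitter v1.1 tweet object, where a retweet carries a `retweeted_status` object and a reply has a non-null `in_reply_to_status_id`.

```python
import json


def tweet_kind(tweet: dict) -> str:
    """Classify a tweet as 'retweet', 'reply' or 'original'.

    Assumes Twitter v1.1 JSON: retweets embed the original tweet under
    "retweeted_status"; replies have a non-null "in_reply_to_status_id".
    """
    if "retweeted_status" in tweet:
        return "retweet"
    if tweet.get("in_reply_to_status_id") is not None:
        return "reply"
    return "original"


if __name__ == "__main__":
    # A made-up tweet used purely for illustration.
    sample = json.loads(
        '{"id_str": "1", "text": "hello", "in_reply_to_status_id": null}'
    )
    print(tweet_kind(sample))
```

Checking for a retweet first is a deliberate choice in this sketch, so that a retweet of a reply is reported as a retweet.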
Default annotations | |
:Abuse | Abusive phrases. Includes a feature, "type", indicating the type of abuse, such as racist, religious etc., and a "target" feature indicating whether the abuse is intended for the addressee or some other party. |
:Topic | Mentions of topics relevant to UK politics, based largely on the topic classification used on gov.uk. |
:Politician | Recognised UK politicians: MPs, parliamentary candidates and other significant individuals, such as party leaders, who are not MPs. MP mentions include a feature identifying the parliament they sat in (as of 2020, the current parliament is the 58th). An individual elected to more than one parliament receives one annotation per parliament. |
:Party | UK political parties. |
:Hashtag | Taken from the original tweet, when the service is run on tweets. |
:UserID | Taken from the original tweet, when the service is run on tweets. |
:URL | Taken from the original tweet, when the service is run on tweets. |
:Organization | Entities found by GATE's ANNIE named entity recogniser. |
:Person | Entities found by GATE's ANNIE named entity recogniser. |
:Address | Entities found by GATE's ANNIE named entity recogniser. |
:Date | Entities found by GATE's ANNIE named entity recogniser. |
:Location | Entities found by GATE's ANNIE named entity recogniser. |
:Money | Entities found by GATE's ANNIE named entity recogniser. |
:Percent | Entities found by GATE's ANNIE named entity recogniser. |
Additional annotations available if selected | |
:Sentence | Sentences |
:Tweet | Original tweet data |
When the input is Twitter JSON and the output is saved as GATE XML or sent to Mímir, the following additional information is extracted from the tweet metadata and made available as document-level features:
https://twitter.com/{author}/status/{id}
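The URL template above can be filled in from a tweet's JSON. The snippet below is a minimal Python sketch under stated assumptions: that `{author}` is the author's screen name and `{id}` is the tweet ID, read from the `user.screen_name` and `id_str` fields of a Twitter v1.1 tweet object. The function names are hypothetical, and the service's own feature names are not shown here.

```python
def tweet_url(author: str, tweet_id: str) -> str:
    """Fill the https://twitter.com/{author}/status/{id} template."""
    return f"https://twitter.com/{author}/status/{tweet_id}"


def tweet_url_from_json(tweet: dict) -> str:
    """Build the tweet URL from a Twitter v1.1 style tweet object.

    Assumes the author's screen name is at tweet["user"]["screen_name"]
    and the tweet ID (as a string) is at tweet["id_str"].
    """
    return tweet_url(tweet["user"]["screen_name"], tweet["id_str"])


if __name__ == "__main__":
    # A made-up tweet used purely for illustration.
    sample = {"id_str": "12345", "user": {"screen_name": "example_user"}}
    print(tweet_url_from_json(sample))
```

Using `id_str` rather than the numeric `id` avoids precision problems in tools that parse large integers as floating-point numbers.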