TwitIE is a named entity recognition pipeline tuned specifically for Twitter data. It produces the annotations listed below.
Default annotations | |
:Person | Standard named entity type |
:Location | Standard named entity type |
:Organization | Standard named entity type |
:Date | Standard named entity type |
:Address | Includes email and IP addresses as well as street addresses |
:Token | The individual tokens of the text, each with a "category" feature holding its part-of-speech (POS) tag |
:Emoticon | Emoticons such as :-) |
:Hashtag | Hashtags, including the leading # character |
:URL | URL mentions |
:UserID | The username part of @user mentions, not including the leading @ sign |
Additional annotations available if selected | |
:Money | Monetary amounts |
:Percent | Expressions representing percentages |
:SpaceToken | The spaces between tokens |
:Sentence | Sentences detected by the sentence splitter |
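
To illustrate how the annotation types in the table differ in what they cover, here is a minimal Python sketch. It assumes the pipeline output has been saved as JSON with a `text` string and an `entities` map from annotation type to a list of annotations carrying character offsets in `indices`. That layout, the sample tweet, and the helper name `spans_by_type` are assumptions made for illustration and may not match the output format you actually receive.

```python
# Hypothetical pipeline output. The "text"/"entities"/"indices" layout is an
# assumption for this sketch, not a confirmed response format.
response = {
    "text": "Thanks @alice for the #NLP talk :-)",
    "entities": {
        "UserID": [{"indices": [8, 13]}],     # username only, no leading @
        "Hashtag": [{"indices": [22, 26]}],   # includes the leading #
        "Emoticon": [{"indices": [32, 35]}],
    },
}


def spans_by_type(doc):
    """Map each annotation type to the text covered by its annotations."""
    text = doc["text"]
    return {
        ann_type: [text[a["indices"][0]:a["indices"][1]] for a in anns]
        for ann_type, anns in doc.get("entities", {}).items()
    }


for ann_type, spans in spans_by_type(response).items():
    print(ann_type, spans)
```

In this sample the `UserID` span covers only the username, while the `Hashtag` span keeps its `#`, matching the descriptions in the table.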