ANNIE is a named entity recognition pipeline that identifies basic entity types, such as Person, Location, and Organization, as well as Money amounts and Time and Date expressions.
It is the prototypical information extraction (IE) pipeline distributed with the GATE framework, and it forms the basis of many more complex GATE-based IE applications.
Default annotations | |
:Person | Standard named entity types |
:Location | |
:Organization | |
:Date | |
:Address | Includes email and IP addresses as well as street addresses |
Additional annotations available if selected | |
:Money | Monetary amounts |
:Percent | Expressions representing percentages |
:Token | The individual tokens of the text, with "category" feature for POS |
:SpaceToken | The spaces between tokens |
:Sentence | Sentences detected by the sentence splitter |
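
As a rough illustration of how the default and optional annotation types in the table relate, the following Python sketch builds a list of annotation selectors. The type names and the `:Type` notation are taken from the table; the helper names `build_selection` and `to_selectors` are hypothetical and are not part of GATE or any GATE client library. How the resulting selection is passed to ANNIE depends on the interface you use and is not shown here.

```python
# Annotation types as listed in the table above.
DEFAULT_TYPES = ("Person", "Location", "Organization", "Date", "Address")
OPTIONAL_TYPES = ("Money", "Percent", "Token", "SpaceToken", "Sentence")


def build_selection(extra=()):
    """Return the default types plus the requested optional types.

    Hypothetical helper: optional types are appended in table order,
    and any name that is not a known optional type is rejected.
    """
    unknown = [t for t in extra if t not in OPTIONAL_TYPES]
    if unknown:
        raise ValueError(f"not an optional ANNIE annotation type: {unknown}")
    return list(DEFAULT_TYPES) + [t for t in OPTIONAL_TYPES if t in extra]


def to_selectors(types):
    """Render type names in the ':Type' notation used in the table."""
    return [":" + t for t in types]


# Example: the defaults plus monetary amounts and sentence boundaries.
selection = build_selection(extra=["Money", "Sentence"])
print(to_selectors(selection))
```

Keeping the two groups in separate constants mirrors the table: callers get the five default types without asking, and opt in to the heavier annotations such as `Token` and `SpaceToken` only when they need them.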