{ "cells": [ { "cell_type": "code", "execution_count": 1, "metadata": { "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "# Notes for configuring the presentation:\n", "# reveal/rise config is in the notebook metadata (see Edit - Edit Notebook Metadata, \"rise\" section)\n", "# possible themes: https://revealjs.com/themes/\n", "# - for us: simple, white\n", "# possible transitions: https://revealjs.com/transitions/\n", "# - for us: none, slide, zoom\n", "# More: see https://github.com/damianavila/RISE/blob/master/doc/customize.md#how-to-customize\n", "# Documentation: https://rise.readthedocs.io/en/latest/" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "##
GATE COURSE MODULE 11
GATE & PYTHON
\n", "
Johann Petrak and Mehmet Bakir

Last updated: 2022-06-24 for GateNLP v1.0.8a1 or later
\n", "\n", "* Online slides: https://gatenlp.github.io/python-gatenlp/training/module11-python.slides.html\n", "* Slides License: [CC BY-NC-SA 3.0](https://creativecommons.org/licenses/by-nc-sa/3.0/)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "\n", "# GATE & PYTHON\n", "\n", "This tutorial covers Python tools related to GATE:\n", "\n", "1. **Python GateNLP:** Python package for NLP similar to Java GATE\n", "2. **Python GateNLP GateWorker:** run Java/GATE from Python \n", "2. **GATE Python Plugin:** Java GATE plugin to process GATE documents with Python and Python GateNLP\n", "3. **Format BDOC Plugin:** Java GATE plugin for support of loading/saving documents in JSON/YAML/MsgPack format\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Python GateNLP\n", "\n", "Aims:\n", "\n", "* NLP framework written in pure Python. \n", "* Similar concepts as Java GATE: documents, document features, annotation sets, annotations, ...\n", "* But \"pythonic\" API, try to make basic things very simple (e.g. loading/saving of documents)\n", "* Does NOT try to be a full multilingual NLP processing package, rather COMBINE:\n", " * Use existing tools and solutions: Spacy, Stanford Stanza \n", " * Add own tools and improvements where needed\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Python GateNLP: status\n", "\n", "* Current release: 1.0.x\n", "* All 1.0.x: get community feedback:\n", " * how to improve API, abstractions, conventions, find bugs\n", " * what is most important to still get added?\n", " * API may slightly change, parameter names may get consolidated\n", "* Planned 1.1.x releases and onwards: stable API" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Python GateNLP: Info and Feedback\n", "\n", "* Documentation: https://gatenlp.github.io/python-gatenlp/\n", "* Sources: https://github.com/GateNLP/python-gatenlp \n", "* Report a bug, request a feature with issue tracker: https://github.com/GateNLP/python-gatenlp/issues\n", "* Discuss, ask: \n", " * discussions forum at https://github.com/GateNLP/python-gatenlp/discussions\n", " * GATE mailing list https://groups.io/g/gate-users\n", "* Developers Chat: https://gitter.im/GateNLP/python-gatenlp" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Preparation: Install Python\n", "\n", "* see also https://gatenlp.github.io/python-gatenlp/installation.html\n", "* Recommended: \n", " * [Anaconda](https://www.anaconda.com/products/individual) / [Miniconda](https://docs.conda.io/en/latest/miniconda.html)\n", " * [Miniforge](https://github.com/conda-forge/miniforge) \n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Preparation: install Miniconda (Linux)\n", "\n", "* [Download](https://docs.conda.io/en/latest/miniconda.html) the Python 3.8 (or later) installer (64-bit) for your OS\n", "* Run the installer \n", "* respond \"yes\" to \"running conda init?\"\n", "* start a new command line \n", "* Create environment: `conda create -y -n gatenlp python=3.9`\n", "* activate environment: `conda activate gatenlp`\n", "\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Preparation: install Miniconda (Windows)\n", "\n", "* [Download](https://docs.conda.io/en/latest/miniconda.html) the Python 3.8 (or later) installer (64-bit) for your OS\n", "* Run the installer, install for \"just me\", register as default Python, \n", "* start the \"Anaconda Prompt\" or \"Anaconda Powershell Prompt\"\n", "* Create environment: `conda create -y -n gatenlp python=3.9`\n", "* activate environment: `conda activate gatenlp`\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Install gatenlp\n", "\n", "To install most recent release and install **all** dependencies \n", "(without `[all]` only minimum dependencies are installed!):\n", "\n", "`pip install -U gatenlp[all]`\n", "\n", "Also install support for jupyter notebookd and for showing the slides:\n", "\n", "`pip install jupyter notebook ipython ipykernel RISE`\n", "\n", "Create kernel for the conda environment:\n", "\n", "`python -m ipykernel install --user --name gatenlp --display-name \"Python-gatenlp\"`\n", "\n", "See also GateNLP documentation: [installation](https://gatenlp.github.io/python-gatenlp/installation.html)\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Java GATE\n", "\n", "* Java is installed\n", " * e.g. [Adoptium](https://adoptium.net/) \n", " * java on the path and **registered**!\n", "* GATE 9.0 of later is installed\n", " * needed later: where (which directory) is it installed in? (`GATE_HOME`)\n", " * Windows: right-click icon, \"Properties\", \"Target\": directory that contains \"gate.exe\"" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Follow along\n", "\n", "* [Online slides](https://gatenlp.github.io/python-gatenlp/training/module11-python.slides.html)\n", "* Download the [handouts](https://gate.ac.uk/sale/talks/gate-course-feb21/module-11-python/hands-on-python.zip) zip file and extract the directory\n", "* Within the directory you can either:\n", " * to follow in the original notebook: run `jupyter notebook module11-python.ipynb`\n", " * to explore in a new Notebook: run `jupyter notebook`, choose New -> Python-gatenlp \n", " * to explore interactively: run `ipython` and enter python code\n", " \n", "If kernel error in Jupyter, try something like ([Anaconda bug](https://github.com/jupyter/jupyter_core/issues/215), apparently):\n", "\n", "`python C:\\Users\\USERNAME\\miniconda3\\envs\\gatenlp\\Scripts\\pywin32_postinstall.py -install`\n", "\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Python GateNLP: Main Concepts\n", "\n", "* A [**document**](https://gatenlp.github.io/python-gatenlp/documents) represents some text and\n", " * any number of named [**annotation sets**](https://gatenlp.github.io/python-gatenlp/annotationsets) \n", " * any number of **features**\n", "* An **annotation set** can have \n", " * any number of [**annotations**](https://gatenlp.github.io/python-gatenlp/annotations)\n", "* **Annotations** describe a span of a document and have\n", " * any number of **features**\n", " * an **annotation type**\n", " * **from** and **to** offsets that describe the span of the annotation\n", "* An **Annotator** is something that processes a document (and usually adds or changes annotations)\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Documents" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "GateNLP version: 1.0.8.dev3\n" ] } ], "source": [ "# Import gatenlp to check gatenlp version:\n", "import gatenlp\n", "print(\"GateNLP version:\", gatenlp.__version__)\n", "from gatenlp import Document" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Create a document from some text/string and print it:" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Document(This is a small test document,features=Features({}),anns=[])\n" ] } ], "source": [ "doc1 = Document(\"This is a small test document\")\n", "print(doc1)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Documents\n", "\n", "In a notebook, documents are visualized using the **html-viewer** when a document is the last value of a cell or when `display(doc1)` or when `document.show()` is used:" ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "scrolled": false, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a small test document,features=Features({}),anns={})" ] }, "execution_count": 4, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# from IPython.display import display\n", "doc1" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Documents\n", "\n", "The `show()` method can be used to influence and parametrize the viewer" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "doc1.show(doc_style=\"color: blue; font-weight: bold;\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Documents: load\n", "\n", "* to load documents use `Document.load(some_location, ...)`\n", "* document format is auto-detected from the extension or specified using the `fmt` parameter\n", "* one standard format for saving/loading GateNLP is \"bdocjs\" (a JSON serialization)\n", "* `some_location` can be file or URL" ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "scrolled": false, "slideshow": { "slide_type": "slide" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a test document.\n", "\n", "It contains just a few sentences. \n", "Here is a sentence that mentions a few named entities like \n", "the persons Barack Obama or Ursula von der Leyen, locations\n", "like New York City, Vienna or Beijing or companies like \n", "Google, UniCredit or Huawei. And here is Donald Trump, it may not be the real one :P\n", "\n", "Lets say Boris Johnson aka Bojo tweets from his BorisJohnson account, would be nice to match them!\n", "\n", "Here we include a URL https://gatenlp.github.io/python-gatenlp/ \n", "and a fake email address john.doe@hiscoolserver.com as well \n", "as #some #cool #hastags and a bunch of emojis like 😽 (a kissing cat),\n", "👩‍🏫 (a woman teacher), 🧬 (DNA), \n", "🧗 (a person climbing), \n", "\n", "Here we test a few different scripts, e.g. Hangul 한글 or \n", "simplified Hanzi 汉字 or Farsi فارسی and Arabic ,اَلْعَرَبِيَّةُ, which goes from right to left.\n", "\n", ",features=Features({}),anns={})" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "doc2 = Document.load('./data/document-testing.txt')\n", "doc2" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Documents: save (JSON)\n", "\n", "* use `thedocument.save(location, ...)` \n", "* format is inferred from the extension or specified using `fmt` parameter\n", "* Formats: `bdocjs` (JSON, default), `bdocym` (YAML, slow), `bdocmp` (MessagePack, compact)" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "{\"annotation_sets\": {}, \"text\": \"This is a small test document\", \"features\": {}, \"offset_type\": \"p\", \"name\": \"\"}\n" ] } ], "source": [ "doc1.save(\"myfirstdocument.bdocjs\")\n", "\n", "with open(\"myfirstdocument.bdocjs\", \"rt\", encoding=\"utf-8\") as infp:\n", " print(infp.read())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Document: save (YAML)" ] }, { "cell_type": "code", "execution_count": 8, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "annotation_sets: {}\n", "features: {}\n", "name: ''\n", "offset_type: p\n", "text: This is a small test document\n", "\n" ] } ], "source": [ "doc1.save(\"myfirstdocument.bdocym\") # use YAML serialization\n", "\n", "with open(\"myfirstdocument.bdocym\", \"rt\", encoding=\"utf-8\") as infp:\n", " print(infp.read())" ] }, { "cell_type": "code", "execution_count": 9, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "'{\"annotation_sets\": {}, \"text\": \"This is a small test document\", \"features\": {}, \"offset_type\": \"p\", \"name\": \"\"}'" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# Can also \"save\" to memory/string, here the format is needed!\n", "doc1.save_mem(fmt=\"bdocjs\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Document features\n", "\n", "* Documents can have arbitrary **features** (similar to Python dictionaries)\n", "* key/name (string) maps to some value\n", "* value *should* be JSON serializable\n", "* name starting with single underscore: \"private value\"\n", "* name starting with double underscore: \"private/transient value\" (not saved by default, not shown in viewer)\n" ] }, { "cell_type": "code", "execution_count": 10, "metadata": { "scrolled": false, "slideshow": { "slide_type": "slide" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a small test document,features=Features({'loading_date': '2022-07-02 21:14:50.692354', 'purpose': 'Testing gatenlp.', 'numeric_value': 22, 'dict_of_objects': {'dict_key': 'dict_value', 'a_list': [1, 2, 3, 4, 5]}, '_tmp1': 'some value', '__tmp2': 12345}),anns={})" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "import datetime\n", "doc1.features[\"loading_date\"] = str(datetime.datetime.now())\n", "doc1.features[\"purpose\"] = \"Testing gatenlp.\"\n", "doc1.features[\"numeric_value\"] = 22\n", "doc1.features[\"dict_of_objects\"] = {\"dict_key\": \"dict_value\", \"a_list\": [1,2,3,4,5]}\n", "doc1.features[\"_tmp1\"] = \"some value\"\n", "doc1.features[\"__tmp2\"] = 12345\n", "doc1" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Features: API\n", "\n", "* inherits from [UserDict](https://docs.python.org/3/library/collections.html#collections.UserDict)\n", "* same API as `dict`\n", "* but Document is aware of what changes are made (needed for updating a ChangeLog as we will see later!)" ] }, { "cell_type": "code", "execution_count": 11, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "1: Testing gatenlp.\n", "2: None\n", "3: NA!\n" ] } ], "source": [ "print(\"1:\", doc1.features[\"purpose\"])\n", "print(\"2:\", doc1.features.get(\"doesntexist\"))\n", "print(\"3:\", doc1.features.get(\"doesntexist\", \"NA!\"))" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Features: API" ] }, { "cell_type": "code", "execution_count": 12, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "loading_date: 2022-07-02 21:14:50.692354\n", "purpose: Testing gatenlp.\n", "numeric_value: 22\n", "dict_of_objects: {'dict_key': 'dict_value', 'a_list': [1, 2, 3, 4, 5]}\n", "_tmp1: some value\n", "__tmp2: 12345\n" ] } ], "source": [ "for name, value in doc1.features.items():\n", " print(f\"{name}: {value}\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Features: serialization\n", "\n", "Lets check how the document with features is serialized to \"bdocjs\" (JSON) format:" ] }, { "cell_type": "code", "execution_count": 13, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "{'annotation_sets': {},\n", " 'features': {'_tmp1': 'some value',\n", " 'dict_of_objects': {'a_list': [1, 2, 3, 4, 5],\n", " 'dict_key': 'dict_value'},\n", " 'loading_date': '2022-07-02 21:14:50.692354',\n", " 'numeric_value': 22,\n", " 'purpose': 'Testing gatenlp.'},\n", " 'name': '',\n", " 'offset_type': 'p',\n", " 'text': 'This is a small test document'}\n" ] } ], "source": [ "import pprint, json\n", "\n", "js_str = doc1.save_mem(fmt=\"bdocjs\")\n", "js = json.loads(js_str)\n", "pprint.pprint(js)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotations & Annotation Sets & Spans\n", "\n", "* Span: a range of offsets \n", "* Annotation: information about a range of offsets, has\n", " * annotation type \n", " * features\n", " * unique integer annotation id \n", "* Annotation set: named collection of annotations \n", " * \"set\": only one annotation per set with the same annotation id\n", " * but ordered by insertion order or offset\n", " * \"default\" annotation set has name \"\" (empty string)\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Adding annotations\n", "\n", "* first get the annotation set we want to add the annotation to\n", "* then create the annotation using the `add` method of the set" ] }, { "cell_type": "code", "execution_count": 14, "metadata": { "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "# reset so annotations do not accumulate and annids are correct when running several times\n", "annset=doc1.annset(\"Set1\")\n", "annset.clear()\n", "annset._next_annid=0" ] }, { "cell_type": "code", "execution_count": 15, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "Annotation(0,4,AnnType1,features=Features({}),id=0)" ] }, "execution_count": 15, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# create and get an annotation set with the name \"Set1\"\n", "annset = doc1.annset(\"Set1\")\n", "#Now, add an annotation, this method returns the newly created annotation\n", "annset.add(0,4,\"AnnType1\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotations\n", "\n", "* The annotation covers the characters 0, 1, 2, and 3, a text of length 4 (to - from = len)\n", "* the \"to\" offset is the offset *after* the last covered character\n", "* in Python ALL unicode code points are represented by 1 character\n", " * In Java: UTF-16 code units\n", " * -> Offsets different between Java and Python!" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotations" ] }, { "cell_type": "code", "execution_count": 16, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Annotation(0,4,AnnType1,features=Features({}),id=0)\n", "Annotation(0,4,Token,features=Features({'id': \"token1'\"}),id=1)\n", "Annotation(0,29,Sentence,features=Features({'what': \"The first 'sentence' annotation\"}),id=7)\n", "Annotation(5,7,Token,features=Features({'id': \"token2'\"}),id=2)\n", "Annotation(8,9,Token,features=Features({'id': \"token3'\"}),id=3)\n", "Annotation(10,15,Token,features=Features({'id': \"token4'\"}),id=4)\n", "Annotation(16,20,Token,features=Features({'id': 'token5'}),id=5)\n", "Annotation(21,29,Token,features=Features({'id': 'token6'}),id=6)\n" ] } ], "source": [ "# add a few more\n", "annset.add(0, 4, \"Token\", {\"id\": \"token1'\"})\n", "annset.add(5, 7, \"Token\", {\"id\": \"token2'\"})\n", "annset.add(8, 9, \"Token\", {\"id\": \"token3'\"})\n", "annset.add(10, 15, \"Token\", {\"id\": \"token4'\"})\n", "annset.add(16, 20, \"Token\", {\"id\": \"token5\"})\n", "annset.add(21, 29, \"Token\", {\"id\": \"token6\"})\n", "annset.add(0, 29, \"Sentence\", {\"what\": \"The first 'sentence' annotation\"});\n", "for ann in annset:\n", " print(ann)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotations: document viewer" ] }, { "cell_type": "code", "execution_count": 17, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "doc1.show(preselect=[(\"Set1\", [\"AnnType1\", \"Sentence\"])])" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "* show all annotations for a type by clicking the type name\n", "* clicking annotation shows annotation features instead of document features\n", "* clicking \"Document\" shows the document features again\n", "* when multiple annotations overlap, need to select first which to view" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotations/sets: remove" ] }, { "cell_type": "code", "execution_count": 18, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Annotation id=0: Annotation(0,4,AnnType1,features=Features({}),id=0)\n", "Annotation id=1: Annotation(0,4,Token,features=Features({'id': \"token1'\"}),id=1)\n", "After some anns removed AnnotationSet([Annotation(0,29,Sentence,features=Features({'what': \"The first 'sentence' annotation\"}),id=7), Annotation(16,20,Token,features=Features({'id': 'token5'}),id=5), Annotation(21,29,Token,features=Features({'id': 'token6'}),id=6)])\n", "After set cleared: AnnotationSet([])\n" ] } ], "source": [ "ann0 = annset.get(0) # get by annotation id\n", "print(\"Annotation id=0:\", ann0)\n", "annset.remove(ann0) # remove the annotation with the annotation id of ann1\n", "ann1 = annset.get(1)\n", "print(\"Annotation id=1:\", ann1)\n", "annset.remove(1) # remove the annotation with the given id\n", "annset.remove([2,3,4]) # remove a whole list of annotations\n", "print(\"After some anns removed \", annset)\n", "annset.clear()\n", "print(\"After set cleared: \", annset)\n", "doc1.remove_annset(\"Set1\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotation Relations\n", "\n", "* Annotations can overlap arbitrarily\n", "* Annotation API has methods to check **how** they relate to each other\n", " * overlap, within, covering, before, after, rightoverlapping, startingat, endingwith, coextensive ...\n", "* Annotation API implements ordering by start offset and annotation id\n", "\n", "![Annotation Relations](data/ann-relations-cropped.png)\n", "\n", "* Ann1 overlaps with all others, covers all but Ann2 and Ann4 \n", "* Ann5 is directly before Ann3, is before Ann6\n", "* Ann10 starts at Ann1, Ann12 ends with Ann1, Ann3 and Ann9 are coextensive " ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotation Relations\n", "\n", "![Annotation Relations](data/ann-relations-cropped.png)\n", "\n", "Let's load and view an example document to demonstrate this:" ] }, { "cell_type": "code", "execution_count": 19, "metadata": { "slideshow": { "slide_type": "skip" } }, "outputs": [ { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# Make sure the html ann viewer is smaller\n", "from IPython.core.display import display, HTML\n", "display(HTML(\"\"))\n" ] }, { "cell_type": "code", "execution_count": 20, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "doc3 = Document.load(\"data/ann-relations.bdocjs\")\n", "doc3.show(htmlid=\"view1\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Annotation Relations API \n", "\n", "![Annotation Relations](data/ann-relations-cropped.png)" ] }, { "cell_type": "code", "execution_count": 21, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "# make a variable for each annotation type\n", "for anntype in list(doc3.annset(\"set1\").type_names):\n", " vars()[anntype.lower()] = doc3.annset(\"set1\").with_type(anntype).for_idx(0)" ] }, { "cell_type": "code", "execution_count": 22, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Ann2 isoverlapping Ann1: True\n", "Ann2 isbefore Ann3: True\n", "Ann3 isafter Ann2: True\n", "Ann1 iscovering Ann5: True\n", "Ann3 iscoextensive Ann9: True\n", "Ann6 iswithin Ann1: True\n", "Ann4 isrightoverlapping Ann1: True\n" ] } ], "source": [ "print(\"Ann2 isoverlapping Ann1:\", ann2.isoverlapping(ann1))\n", "print(\"Ann2 isbefore Ann3:\", ann2.isbefore(ann3))\n", "print(\"Ann3 isafter Ann2:\", ann3.isafter(ann2))\n", "print(\"Ann1 iscovering Ann5:\", ann1.iscovering(ann5))\n", "print(\"Ann3 iscoextensive Ann9:\", ann3.iscoextensive(ann9))\n", "print(\"Ann6 iswithin Ann1:\", ann6.iswithin(ann1))\n", "print(\"Ann4 isrightoverlapping Ann1:\", ann4.isrightoverlapping(ann1))" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Spans\n", "\n", "* Objects that describe offset ranges\n", "* similar API for relations\n", "* can get from annotations, use when only the span of an annotation is needed\n" ] }, { "cell_type": "code", "execution_count": 23, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "['span0: Span(3,4)', 'span1: Span(0,6)', 'span2: Span(0,45)', 'span3: Span(12,18)']\n" ] } ], "source": [ "from gatenlp import Span\n", "span1 = Span(3,4)\n", "span2 = ann2.span\n", "span3 = doc3.annset(\"set1\").span\n", "span4 = Span(ann5)\n", "print([f\"span{i}: {s}\" for i, s in enumerate([span1, span2, span3, span4])])\n", " " ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## AnnotationSet: retrieve by relation\n", "\n", "* get all annotations that overlap/are before/start at/... an annotation/span/annotation set\n", "* returns a new annotation set\n", "* returned set is **detached**: not part of document, changes do set not affect document \n", "* returned set is initially **immutable**: set cannot be changed\n", "* but annotations **are mutable** and still the same as in the set!\n", "* possible to \"detach\" annotations by (deep)copying them" ] }, { "cell_type": "code", "execution_count": 24, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Within Ann1: ['Ann10', 'Ann5', 'Ann3', 'Ann7', 'Ann9', 'Ann11', 'Ann6', 'Ann8', 'Ann12']\n", "Coextensive with Ann3: ['Ann9']\n", "Coextensive with span of Ann3: ['Ann3', 'Ann9']\n" ] } ], "source": [ "set1 = doc3.annset(\"set1\") # \"attached\" set\n", "print(\"Within Ann1: \", [a.type for a in set1.within(ann1)])\n", "print(\"Coextensive with Ann3:\", [a.type for a in set1.coextensive(ann3)])\n", "print(\"Coextensive with span of Ann3:\", [a.type for a in set1.coextensive(ann3.span)])" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## AnnotationSet: detached / immutable" ] }, { "cell_type": "code", "execution_count": 25, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Size of set1: 12\n", "Size of subset1: 9\n" ] } ], "source": [ "print(\"Size of set1:\", len(set1))\n", "subset1 = set1.within(ann1)\n", "print(\"Size of subset1:\", len(subset1))" ] }, { "cell_type": "code", "execution_count": 26, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Got exception: Cannot add an annotation to an immutable annotation set\n" ] } ], "source": [ "# try to add an annotation to subset1:\n", "try:\n", " subset1.add(2,3,\"ANewOne\")\n", "except Exception as ex:\n", " print(\"Got exception:\", ex)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## AnnotationSet: detached / immutable" ] }, { "cell_type": "code", "execution_count": 27, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Size of set1: 12\n", "Size of subset1: 10\n", "Is set1 detached: False\n", "Is subset1 detached: True\n" ] } ], "source": [ "# make the set mutable and try again\n", "subset1.immutable = False\n", "subset1.add(2,3,\"ANewOne\")\n", "print(\"Size of set1:\", len(set1))\n", "print(\"Size of subset1:\", len(subset1))\n", "print(\"Is set1 detached:\", set1.isdetached())\n", "print(\"Is subset1 detached:\", subset1.isdetached())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "* annotation only got added to `subset1`, NOT the original set\n", "* detached sets can**not** get attached again\n", "* annotations in the detached set are **the same** as in the document, so changing their features will affect the document!\n", "* detached set can also detach its annotations using `subset1.clone_anns()`" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Document loading/saving\n", "\n", "Supported formats:\n", "* bdocjs, bdocym, bdocmp: load/save (aliasing: only bdocym)\n", "* GATE xml: load (but only basic data types, no aliasing)\n", "* HTML: load and create annotations for HTML entities\n", "* plain text: load / save\n", "* tweet: load v1 format, WIP!\n", "* pickle: load/save\n", "* html-ann-viewer: save (also used for displaying in jupyter)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Document: load HTML" ] }, { "cell_type": "code", "execution_count": 28, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(\n", "Python GateNLP | python-gatenlp\n", "Python GateNLP\n", "Home\n", "Search\n", "GitHub\n", "PyPi\n", "PythonDoc\n", "Python GateNLP\n", "A Python NLP framework\n", "Python GateNLP is a natural language processing (NLP) and text processing framework implemented in Python.\n", "It provides very flexible representations of documents, stand-off annotaitons with arbitrary types and features, grouped into arbitrary annotations sets,\n", "spans, corpora, annotators, pipelines and more. Documents, annotations and corpora can be easily and interactively visualized in notebooks. \n", "It provides the ability to use existing NLP tools for annotating documents out of the box: Spacy, Stanza as well as online services like \n", "Gate Cloud, ELG, Google NLP, IBM Watson and others. The result of these tools get represented as GateNLP annotations making it easy to \n", "write code that works with all of these tools in the same way or compares or combines the results of these tools.\n", "In addition, GateNLP provides its own annotator tools: string-based and token based gazetteers, regular-expression-based annotators, and a very powerful\n", "and flexible rule-based annotator (PAMPAC) which allows to match complex pattern of annotations and text.\n", "Python GateNLP represents documents and stand-off annotations very similar to \n", "the Java GATE framework: Annotations describe arbitrary character ranges in the text and each annotation can have an arbitrary number of features. Documents can have arbitrary features and an arbitrary number of named annotation sets, where each annotation set can have an arbitrary number of annotations which can overlap in any way. Python GateNLP documents can be exchanged with Java GATE by using the bdocjs/bdocym/bdocmp formats which are supported in Java GATE via the Format Bdoc Plugin\n", "Other than many other Python NLP tools, GateNLP does not require a specific way of how text is split up into tokens, tokens can be represented by annotations in any way, and a document can have different ways of tokenization simultanously, if needed. Similarly, entities can be represented by annotations without restriction: they do not need to start or end at token boundaries and can overlap arbitrarily.\n", "GateNLP provides ways to process text and create annotations using annotating pipelines, which are sequences of one or more annotators. \n", "There are gazetteer annotators for matching text against gazetteer lists and annotators for a rule-like matching of complex annotation and text sequences (see PAMPAC).\n", "There is also support for creating GateNLP annotations with other NLP packages like Spacy or Stanford Stanza.\n", "The GateNLP document representation also optionally allows to track all changes\n", "done to the document in a “change log”. \n", "Such changes can later be applied to other Python GateNLP or to Java GATE documents.\n", "This library also implements the functionality for the interaction with\n", "a Java GATE process in two different ways:\n", "The Java GATE Python plugin can invoke a process running Python GateNLP to annotate GATE documents.\n", "Python code can remote-control a Jave GATE instance via the GateNLP GateWorker\n", "Installation\n", "Install GateNLP with all optional dependencies:\n", "pip install -U gatenlp[all]\n", "For more details see Installation\n", "Overview of the documentation:\n", "NOTE: most of the documentation pages below can be viewed as HTML, as a Jupyter notebook (NB), and the Jupyter notebook can be downloaded \n", "for running on your own computer (NB-DL).\n", "Installation\n", "Getting Started / NB / NB-DL\n", "The Document class and classes related to components of a document:\n", " \n", "Annotation / NB / NB-DL\n", "AnnotationSet / NB) / NB-DL\n", "Documents / NB) / NB-DL\n", "The Changelog class for recording changes to a document\n", " \n", "ChangeLogs / NB) / NB-DL\n", "A comparison with the Java GATE API\n", "The module for running Java GATE code from Python\n", " \n", "GateWorker / NB) / NB-DL\n", "Modules for interaction with other NLP packages and converting their documents\n", " \n", "lib_spacy\n", " / NB / NB-DL for interacting with Spacy\n", "lib_stanza\n", " / NB / NB-DL for interacting with Stanza\n", "Connecting to annotation services on the web (clients): annotators which connect to a web/cloud service to annotate documents.\n", " \n", "GateCloud Client / NB / NB-DL\n", "ELG (European Language Grid) Client / NB / NB-DL\n", "IBM Natural Language Understanding Client / NB / NB-DL\n", "Google NLP Client / NB / NB-DL\n", "TagMe Client / NB / NB-DL\n", "TextRazor Client / NB / NB-DL\n", "Modules related to NLP processing:\n", " \n", "Corpora and Document Sources / Destinations / NB / NB-DL\n", "Processing / NB / NB-DL\n", "Tokenizers / NB / NB-DL\n", "Matching strings and token sequences:\n", " \n", "Token/String Gazetteers / NB / NB-DL\n", "Regular Expressions Annotator / NB / NB-DL\n", "Complex Annotation Patterns for matching text and annotation sequences:\n", " \n", "PAMPAC / NB / NB-DL\n", "PAMPAC Reference\n", "Viewers and object representation / NB / NB-DL\n", "Overview of supported formats for loading and saving documents\n", "Miscellaneous Recipes and Examples\n", "Course Materials\n", "Gate Course 2021 - Module 11 Slides\n", "Change Log\n", "Change Log: show major changes in each release since 1.0.1\n", "Python API\n", "The Generated Python Documentation\n", "python-gatenlp is maintained by GateNLP.\n", "This page was generated by GitHub Pages.\n", ",features=Features({}),anns={'Original markups': AnnotationSet([Annotation(0,5122,[document],features=Features({}),id=0), Annotation(1,5122,html,features=Features({'lang': 'en-US'}),id=1), Annotation(1,33,head,features=Features({}),id=2), Annotation(1,1,meta,features=Features({'charset': 'UTF-8'}),id=3), Annotation(1,32,title,features=Features({}),id=4), Annotation(33,33,meta,features=Features({'name': 'generator', 'content': 'Jekyll v3.9.2'}),id=5), Annotation(33,33,meta,features=Features({'property': 'og:title', 'content': 'Python GateNLP'}),id=6), Annotation(33,33,meta,features=Features({'property': 'og:locale', 'content': 'en_US'}),id=7), Annotation(33,33,meta,features=Features({'name': 'description', 'content': 'Python text processing, pattern matching, and NLP framework'}),id=8), Annotation(33,33,meta,features=Features({'property': 'og:description', 'content': 'Python text processing, pattern matching, and NLP framework'}),id=9), Annotation(33,33,link,features=Features({'rel': 'canonical', 'href': 'https://gatenlp.github.io/python-gatenlp/'}),id=10), Annotation(33,33,meta,features=Features({'property': 'og:url', 'content': 'https://gatenlp.github.io/python-gatenlp/'}),id=11), Annotation(33,33,meta,features=Features({'property': 'og:site_name', 'content': 'python-gatenlp'}),id=12), Annotation(33,33,meta,features=Features({'property': 'og:type', 'content': 'website'}),id=13), Annotation(33,33,meta,features=Features({'name': 'twitter:card', 'content': 'summary'}),id=14), Annotation(33,33,meta,features=Features({'property': 'twitter:title', 'content': 'Python GateNLP'}),id=15), Annotation(33,33,meta,features=Features({'name': 'viewport', 'content': 'width=device-width, initial-scale=1'}),id=16), Annotation(33,33,meta,features=Features({'name': 'theme-color', 'content': '#157878'}),id=17), Annotation(33,33,meta,features=Features({'name': 'apple-mobile-web-app-status-bar-style', 'content': 'black-translucent'}),id=18), Annotation(33,33,link,features=Features({'rel': 'stylesheet', 'href': '/python-gatenlp/assets/css/style.css?v=fcbf7590ee23bcfa8c2359565c14aff59ad2e6fb'}),id=19), Annotation(33,5122,body,features=Features({}),id=20), Annotation(33,82,header,features=Features({'class': 'page-header', 'role': 'banner', 'style': 'padding: 2rem 3rem'}),id=21), Annotation(33,48,h2,features=Features({'class': 'project-name', 'style': 'color:white'}),id=22), Annotation(48,52,a,features=Features({'style': 'color:white;font-size:150%;padding:0px 12px', 'href': '/python-gatenlp/'}),id=23), Annotation(53,59,a,features=Features({'style': 'color:white;font-size:150%;padding:0px 12px', 'href': '/python-gatenlp/site-search'}),id=24), Annotation(60,66,a,features=Features({'style': 'color:white;font-size:150%;padding:0px 12px', 'href': 'https://github.com/GateNLP/python-gatenlp'}),id=25), Annotation(67,71,a,features=Features({'style': 'color:white;font-size:150%;padding:0px 12px', 'href': 'https://pypi.org/project/gatenlp/'}),id=26), Annotation(72,81,a,features=Features({'style': 'color:white;font-size:150%;padding:0px 12px', 'href': 'https://gatenlp.github.io/python-gatenlp/pythondoc/gatenlp/'}),id=27), Annotation(82,5122,main,features=Features({'id': 'content', 'class': 'main-content', 'role': 'main'}),id=28), Annotation(82,97,h1,features=Features({'id': 'python-gatenlp'}),id=29), Annotation(97,120,h2,features=Features({'id': 'a-python-nlp-framework'}),id=30), Annotation(120,227,p,features=Features({}),id=31), Annotation(227,910,p,features=Features({}),id=32), Annotation(910,1169,p,features=Features({}),id=33), Annotation(1169,1743,p,features=Features({}),id=34), Annotation(1252,1271,a,features=Features({'href': 'https://gate.ac.uk/'}),id=35), Annotation(1273,1284,a,features=Features({'href': 'annotations'}),id=36), Annotation(1385,1393,em,features=Features({}),id=37), Annotation(1396,1405,a,features=Features({'href': 'documents'}),id=38), Annotation(1467,1482,a,features=Features({'href': 'annotationsets'}),id=39), Annotation(1467,1482,em,features=Features({}),id=40), Annotation(1724,1742,a,features=Features({'href': 'https://gatenlp.github.io/gateplugin-Format_Bdoc/'}),id=41), Annotation(1743,2151,p,features=Features({}),id=42), Annotation(2151,2456,p,features=Features({}),id=43), Annotation(2218,2238,a,features=Features({'href': 'processing'}),id=44), Annotation(2298,2318,a,features=Features({'href': 'gazetteers'}),id=45), Annotation(2447,2453,a,features=Features({'href': 'pampac'}),id=46), Annotation(2456,2566,p,features=Features({}),id=47), Annotation(2566,2773,p,features=Features({}),id=48), Annotation(2672,2684,a,features=Features({'href': 'changelogs'}),id=49), Annotation(2773,2888,p,features=Features({}),id=50), Annotation(2888,3067,ul,features=Features({}),id=51), Annotation(2888,2988,li,features=Features({}),id=52), Annotation(2892,2915,a,features=Features({'href': 'http://gatenlp.github.io/gateplugin-Python/'}),id=53), Annotation(2988,3067,li,features=Features({}),id=54), Annotation(3048,3066,a,features=Features({'href': 'gateworker'}),id=55), Annotation(3067,3080,h2,features=Features({'id': 'installation'}),id=56), Annotation(3080,3128,p,features=Features({}),id=57), Annotation(3128,3156,p,features=Features({}),id=58), Annotation(3128,3156,code,features=Features({'class': 'language-plaintext highlighter-rouge'}),id=59), Annotation(3156,3190,p,features=Features({}),id=60), Annotation(3177,3189,a,features=Features({'href': '/python-gatenlp/installation.html'}),id=61), Annotation(3190,3221,h2,features=Features({'id': 'overview-of-the-documentation'}),id=62), Annotation(3221,3402,p,features=Features({}),id=63), Annotation(3402,4871,ul,features=Features({}),id=64), Annotation(3402,3415,li,features=Features({}),id=65), Annotation(3402,3414,a,features=Features({'href': '/python-gatenlp/installation.html'}),id=66), Annotation(3415,3444,li,features=Features({}),id=67), Annotation(3415,3430,a,features=Features({'href': 'getting-started'}),id=68), Annotation(3433,3435,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/getting-started.ipynb'}),id=69), Annotation(3438,3443,a,features=Features({'href': '/python-gatenlp/getting-started.ipynb'}),id=70), Annotation(3444,3593,li,features=Features({}),id=71), Annotation(3516,3593,ul,features=Features({}),id=72), Annotation(3517,3541,li,features=Features({}),id=73), Annotation(3517,3527,a,features=Features({'href': 'annotations'}),id=74), Annotation(3530,3532,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/annotations.ipynb'}),id=75), Annotation(3535,3540,a,features=Features({'href': '/python-gatenlp/annotations.ipynb'}),id=76), Annotation(3541,3569,li,features=Features({}),id=77), Annotation(3541,3554,a,features=Features({'href': 'annotationsets'}),id=78), Annotation(3557,3559,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/annotationsets.ipynb'}),id=79), Annotation(3563,3568,a,features=Features({'href': '/python-gatenlp/annotationsets.ipynb'}),id=80), Annotation(3569,3593,li,features=Features({}),id=81), Annotation(3569,3578,a,features=Features({'href': 'documents'}),id=82), Annotation(3581,3583,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/documents.ipynb'}),id=83), Annotation(3587,3592,a,features=Features({'href': '/python-gatenlp/documents.ipynb'}),id=84), Annotation(3593,3679,li,features=Features({}),id=85), Annotation(3653,3679,ul,features=Features({}),id=86), Annotation(3654,3679,li,features=Features({}),id=87), Annotation(3654,3664,a,features=Features({'href': 'changelogs'}),id=88), Annotation(3667,3669,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/changelogs.ipynb'}),id=89), Annotation(3673,3678,a,features=Features({'href': '/python-gatenlp/changelogs.ipynb'}),id=90), Annotation(3679,3715,li,features=Features({}),id=91), Annotation(3681,3714,a,features=Features({'href': 'diffs2gate'}),id=92), Annotation(3715,3795,li,features=Features({}),id=93), Annotation(3769,3795,ul,features=Features({}),id=94), Annotation(3770,3795,li,features=Features({}),id=95), Annotation(3770,3780,a,features=Features({'href': 'gateworker'}),id=96), Annotation(3783,3785,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/gateworker.ipynb'}),id=97), Annotation(3789,3794,a,features=Features({'href': '/python-gatenlp/gateworker.ipynb'}),id=98), Annotation(3795,3983,li,features=Features({}),id=99), Annotation(3878,3983,ul,features=Features({}),id=100), Annotation(3879,3930,li,features=Features({}),id=101), Annotation(3879,3889,a,features=Features({'href': 'lib_spacy'}),id=102), Annotation(3879,3889,code,features=Features({'class': 'language-plaintext highlighter-rouge'}),id=103), Annotation(3892,3894,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/lib_spacy.ipynb'}),id=104), Annotation(3897,3902,a,features=Features({'href': '/python-gatenlp/lib_spacy.ipynb'}),id=105), Annotation(3924,3929,a,features=Features({'href': 'spacy.io/'}),id=106), Annotation(3930,3983,li,features=Features({}),id=107), Annotation(3930,3941,a,features=Features({'href': 'lib_stanza'}),id=108), Annotation(3930,3941,code,features=Features({'class': 'language-plaintext highlighter-rouge'}),id=109), Annotation(3944,3946,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/lib_stanza.ipynb'}),id=110), Annotation(3949,3954,a,features=Features({'href': '/python-gatenlp/lib_stanza.ipynb'}),id=111), Annotation(3976,3982,a,features=Features({'href': 'https://stanfordnlp.github.io/stanza/'}),id=112), Annotation(3983,4336,li,features=Features({}),id=113), Annotation(4114,4336,ul,features=Features({}),id=114), Annotation(4115,4145,li,features=Features({}),id=115), Annotation(4115,4131,a,features=Features({'href': 'client_gatecloud'}),id=116), Annotation(4134,4136,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/client_gatecloud.ipynb'}),id=117), Annotation(4139,4144,a,features=Features({'href': '/python-gatenlp/client_gatecloud.ipynb'}),id=118), Annotation(4145,4194,li,features=Features({}),id=119), Annotation(4145,4180,a,features=Features({'href': 'client_elg'}),id=120), Annotation(4183,4185,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/client_elg.ipynb'}),id=121), Annotation(4188,4193,a,features=Features({'href': '/python-gatenlp/client_elg.ipynb'}),id=122), Annotation(4194,4249,li,features=Features({}),id=123), Annotation(4194,4235,a,features=Features({'href': 'client_ibmnlu'}),id=124), Annotation(4238,4240,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/client_ibmnlu.ipynb'}),id=125), Annotation(4243,4248,a,features=Features({'href': '/python-gatenlp/client_ibmnlu.ipynb'}),id=126), Annotation(4249,4280,li,features=Features({}),id=127), Annotation(4249,4266,a,features=Features({'href': 'client_googlenlp'}),id=128), Annotation(4269,4271,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/client_googlenlp.ipynb'}),id=129), Annotation(4274,4279,a,features=Features({'href': '/python-gatenlp/client_googlenlp.ipynb'}),id=130), Annotation(4280,4306,li,features=Features({}),id=131), Annotation(4280,4292,a,features=Features({'href': 'client_tagme'}),id=132), Annotation(4295,4297,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/client_tagme.ipynb'}),id=133), Annotation(4300,4305,a,features=Features({'href': '/python-gatenlp/client_tagme.ipynb'}),id=134), Annotation(4306,4336,li,features=Features({}),id=135), Annotation(4306,4322,a,features=Features({'href': 'client_textrazor'}),id=136), Annotation(4325,4327,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/client_textrazor.ipynb'}),id=137), Annotation(4330,4335,a,features=Features({'href': '/python-gatenlp/client_textrazor.ipynb'}),id=138), Annotation(4336,4726,li,features=Features({}),id=139), Annotation(4375,4726,ul,features=Features({}),id=140), Annotation(4376,4433,li,features=Features({}),id=141), Annotation(4376,4419,a,features=Features({'href': 'corpora'}),id=142), Annotation(4422,4424,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/corpora.ipynb'}),id=143), Annotation(4427,4432,a,features=Features({'href': '/python-gatenlp/corpora.ipynb'}),id=144), Annotation(4433,4457,li,features=Features({}),id=145), Annotation(4433,4443,a,features=Features({'href': 'processing'}),id=146), Annotation(4446,4448,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/processing.ipynb'}),id=147), Annotation(4451,4456,a,features=Features({'href': '/python-gatenlp/processing.ipynb'}),id=148), Annotation(4457,4481,li,features=Features({}),id=149), Annotation(4457,4467,a,features=Features({'href': 'tokenizers'}),id=150), Annotation(4470,4472,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/tokenizers.ipynb'}),id=151), Annotation(4475,4480,a,features=Features({'href': '/python-gatenlp/tokenizers.ipynb'}),id=152), Annotation(4481,4608,li,features=Features({}),id=153), Annotation(4527,4608,ul,features=Features({}),id=154), Annotation(4528,4565,li,features=Features({}),id=155), Annotation(4528,4551,a,features=Features({'href': 'gazetteers'}),id=156), Annotation(4554,4556,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/gazetteers.ipynb'}),id=157), Annotation(4559,4564,a,features=Features({'href': '/python-gatenlp/gazetteers.ipynb'}),id=158), Annotation(4565,4608,li,features=Features({}),id=159), Annotation(4565,4594,a,features=Features({'href': 'stringregex'}),id=160), Annotation(4597,4599,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/stringregex.ipynb'}),id=161), Annotation(4602,4607,a,features=Features({'href': '/python-gatenlp/stringregex.ipynb'}),id=162), Annotation(4608,4726,li,features=Features({}),id=163), Annotation(4688,4726,ul,features=Features({}),id=164), Annotation(4689,4709,li,features=Features({}),id=165), Annotation(4689,4695,a,features=Features({'href': 'pampac'}),id=166), Annotation(4698,4700,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/pampac.ipynb'}),id=167), Annotation(4703,4708,a,features=Features({'href': '/python-gatenlp/pampac.ipynb'}),id=168), Annotation(4709,4726,li,features=Features({}),id=169), Annotation(4709,4725,a,features=Features({'href': 'pampac-reference'}),id=170), Annotation(4726,4773,li,features=Features({}),id=171), Annotation(4726,4759,a,features=Features({'href': 'visualization'}),id=172), Annotation(4762,4764,a,features=Features({'href': 'https://nbviewer.jupyter.org/urls/gatenlp.github.io/python-gatenlp/visualization.ipynb'}),id=173), Annotation(4767,4772,a,features=Features({'href': '/python-gatenlp/visualization.ipynb'}),id=174), Annotation(4773,4836,li,features=Features({}),id=175), Annotation(4773,4835,a,features=Features({'href': 'formats'}),id=176), Annotation(4836,4871,li,features=Features({}),id=177), Annotation(4836,4870,a,features=Features({'href': 'recipes'}),id=178), Annotation(4871,4888,h2,features=Features({'id': 'course-materials'}),id=179), Annotation(4888,4924,ul,features=Features({}),id=180), Annotation(4888,4924,li,features=Features({}),id=181), Annotation(4888,4923,a,features=Features({'href': '/python-gatenlp/training/module11-python.slides.html'}),id=182), Annotation(4924,4935,h2,features=Features({'id': 'change-log'}),id=183), Annotation(4935,4994,ul,features=Features({}),id=184), Annotation(4935,4994,li,features=Features({}),id=185), Annotation(4935,4945,a,features=Features({'href': 'changes'}),id=186), Annotation(4994,5005,h2,features=Features({'id': 'python-api'}),id=187), Annotation(5005,5040,p,features=Features({}),id=188), Annotation(5005,5039,a,features=Features({'href': 'pythondoc/gatenlp'}),id=189), Annotation(5040,5122,footer,features=Features({'class': 'site-footer'}),id=190), Annotation(5040,5080,span,features=Features({'class': 'site-footer-owner'}),id=191), Annotation(5040,5054,a,features=Features({'href': 'https://github.com/GateNLP/python-gatenlp'}),id=192), Annotation(5072,5079,a,features=Features({'href': 'https://github.com/GateNLP'}),id=193), Annotation(5081,5121,span,features=Features({'class': 'site-footer-credits'}),id=194), Annotation(5108,5120,a,features=Features({'href': 'https://pages.github.com'}),id=195)])})" ] }, "execution_count": 28, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# lets load and view the main GateNLP documentation page:\n", "doc4 = Document.load(\"https://gatenlp.github.io/python-gatenlp/\", fmt=\"html\")\n", "doc4" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Document: view sets/types\n", "\n", "Use: `doc.show(annspec=[\"set1\", (\"set2\", \"type1\"), (\"set3\", [\"type1\", \"type2\"])]`" ] }, { "cell_type": "code", "execution_count": 29, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "doc4.show(annspec=[(\"Original markups\", [\"h1\",\"h2\",\"a\",\"li\"])])" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Document: save html-ann-viewer" ] }, { "cell_type": "code", "execution_count": 30, "metadata": { "scrolled": false, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "\n", " \n", " " ], "text/plain": [ "" ] }, "execution_count": 30, "metadata": {}, "output_type": "execute_result" } ], "source": [ "doc4.save(\"gatenlp-doc.html\", fmt=\"html-ann-viewer\", notebook=False, stretch_height=True)\n", "from IPython.display import IFrame\n", "IFrame(\"gatenlp-doc.html\", 900,400)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Exchange Documents with Java GATE\n", "\n", "* Python GateNLP can read Java GATE XML format\n", "* GATE plugin [Format_Bdoc](https://gatenlp.github.io/gateplugin-Format_Bdoc/) provides support for loading/saving formats bdocjs, bdocym and bdocmp in Java GATE\n", "* Offsets differ between GATE and GateNLP: \n", " * Java: offsets refer to UTF-16 encoding, possibly a *surrogate pair* of UTF-16 characters\n", " * Python: offsets refer to Unicode code points \n", " * bdocjs/bdocym/bdocmp automatically convert the offsets on either side\n", " * field `offset_type` is either `p` or `j` " ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Corpus\n", "\n", "* a list-like collection of a fixed number of documents which can be retrieved and stored by index:
\n", " get: `doc = corpus[2]` set: `corpus[3] = doc`\n", "* on retrieval, the index gets stored in a document feature\n", "* implements `store(doc)` to save a document to the index stored in the document feature\n", "* some implementations: `append(doc)` to add a new document to the corpus\n", "* some implementations: store/retrieve `None`\n", " * on retrieveal: `None` indicates absence of document\n", " * on storing: `None` indicates that document should get removed or should not get updated\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## ListCorpus\n", "\n", "* wrap a Python list-like data structure\n", "* but provide the `store` method\n" ] }, { "cell_type": "code", "execution_count": 31, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Features({'__idx_140542403809872': 1})\n" ] } ], "source": [ "from gatenlp.corpora import ListCorpus\n", "texts = [\"this is text one\", \"here is text two\", \"and this is text three\"]\n", "docs = [Document(t) for t in texts]\n", "lcorp = ListCorpus(docs)\n", "doc1 = lcorp[1]\n", "print(doc1.features)\n", "lcorp.store(doc1)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## DirFilesCorpus\n", "\n", "* all (recursive) files in a directory with some specific extension\n", "* specify some specific format or infer from file extension\n", "* stores the relative file path as a document feature\n" ] }, { "cell_type": "code", "execution_count": 32, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Number of documents: 4\n", "Text for idx=2: This is another document for testing which mentions John Smith.\n", "Features for idx=2: Features({'__idx_140542403601232': 2})\n" ] } ], "source": [ "from gatenlp.corpora import DirFilesCorpus\n", "corp1 = DirFilesCorpus(\"data/dir1\") # get all the matching filenames from the directory\n", "print(\"Number of documents:\", len(corp1))\n", "doc1 = corp1[2] # actually read the document from the directory\n", "print(\"Text for idx=2:\", doc1.text)\n", "print(\"Features for idx=2:\", doc1.features)\n", "doc1.annset().add(0,len(doc1.text), \"Document\", dict(what=\"test document\"))\n", "# this writes the document back to the file:\n", "corp1.store(doc1)\n", "# could also have used: corp1[2] = doc1\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Corpus Viewer" ] }, { "cell_type": "code", "execution_count": 33, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "application/vnd.jupyter.widget-view+json": { "model_id": "8f7a01f874f64df5b1bdfc24ac482659", "version_major": 2, "version_minor": 0 }, "text/plain": [ "HBox(children=(Button(icon='arrow-left', layout=Layout(width='5em'), style=ButtonStyle()), IntSlider(value=0, …" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "from gatenlp.visualization import CorpusViewer\n", "cviewer = CorpusViewer(corp1)\n", "cviewer.show()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Other Corpus Classes\n", "\n", "* `NumberedDirFilesCorpus`: create a directory tree where the path represents digits of a large number\n", " * e.g. `000/002/341.bdoc` for element number 2341 of 600000000 total\n", "* `EveryNthCorpus`: wrap a corpus and access only elements $k*i + o$ for $i = 0..\\lfloor(n/k)\\rfloor$\n", " * $k$: every that many elements\n", " * $o$: start with this element ($o < k$)\n", " * e.g.: get elements 3, 7, 11, 15 from a corpus with 17 elements\n", " * useful for processing files in a DirFilesCorpus with multiple processes\n", "* `ShuffledCorpus`: random re-ordering of the elements in the wrapped corpus\n", "* `CachedCorpus`: store retrieved elements from a (slow) base corpus in a (fast) cache corpus\n", "* Still work in progress" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Source, Destination\n", "\n", "* Document Source: something that can be iterated over to get one Document after the other\n", " * unknown size\n", " * a Corpus may also function as a Source\n", "* Document Destination: something that has `append(doc)` to add Document instances\n", " * unknown final size\n", " * also has `close()` to end writing\n", " * may implement the `with documentdestination as dest:` pattern\n", " * an appendable Corpus may also function as a Destination\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Source, Destination examples\n", "\n", "* `BdocjsLinesFileSource/Destination`: one line of bdocjs serialization per document\n", "* `TsvFileSource`: one column in a TSV file contains the text, other columns can be stored in features\n", "* `PandasDfSource`: similar to TSV source, but for a Pandas data frame\n", "* Still work in progress: improvements/more to come!" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "### TsvFileSource" ] }, { "cell_type": "code", "execution_count": 34, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Document(This is the text of the first row. It has several sentences.,features=Features({'src': 'source1', 'year': '2005'}),anns=[])\n", "Document(Text of the second row.,features=Features({'src': 'source1', 'year': '2006'}),anns=[])\n", "Document(Another text, this time of the third row. ,features=Features({'src': 'source2', 'year': '2001'}),anns=[])\n", "Document(And here another, from the fourth row.,features=Features({'src': 'source3', 'year': '2013'}),anns=[])\n" ] } ], "source": [ "from gatenlp.corpora import TsvFileSource\n", "tsvsrc1 = TsvFileSource(\"data/mytsvfile.tsv\", text_col=\"text\", feature_cols=dict(src=\"source\",year=\"year\"))\n", "for doc in tsvsrc1:\n", " print(doc)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "### PandasDfSource" ] }, { "cell_type": "code", "execution_count": 35, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Document(This is the text of the first row. It has several sentences.,features=Features({'__data': {'source': 'source1', 'year': 2005}}),anns=[])\n", "Document(Text of the second row.,features=Features({'__data': {'source': 'source1', 'year': 2006}}),anns=[])\n", "Document(Another text, this time of the third row. ,features=Features({'__data': {'source': 'source2', 'year': 2001}}),anns=[])\n", "Document(And here another, from the fourth row.,features=Features({'__data': {'source': 'source3', 'year': 2013}}),anns=[])\n" ] } ], "source": [ "from gatenlp.corpora import PandasDfSource\n", "try: # this requires Pandas!\n", " import pandas as pd, csv\n", " df = pd.read_csv(\"data/mytsvfile.tsv\", sep=\"\\t\", quotechar=None, index_col=None, quoting=csv.QUOTE_NONE)\n", " pdsrc1 = PandasDfSource(df, text_col=\"text\", data_cols=[\"source\", \"year\"])\n", " for doc in pdsrc1:\n", " print(doc)\n", "except:\n", " print(\"Pandas not installed\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Conll-U Source\n", "\n", "* Read in one of the many multilingual corpora from https://universaldependencies.org/\n", "* create documents from k sentences, paragraphs conll documents\n", "* use original text hints or space hints, if available\n", "* Example: first few lines of `ar-ud-train.conllu`" ] }, { "cell_type": "code", "execution_count": 36, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "3\n" ] } ], "source": [ "from gatenlp.corpora.conll import ConllUFileSource\n", "src = ConllUFileSource(\"data/ar-tiny.conllu\", group_by=\"doc\", group_by_n=1)\n", "corp = list(src)\n", "print(len(corp))" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Conll-U Source" ] }, { "cell_type": "code", "execution_count": 37, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "corp[0].show(doc_style=\"direction: rtl; font-size: 1.5em; line-height: 1.5;\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Annotators, Executors\n", "\n", "* Annotator: a *callable* that accepts a document to process and either:\n", " * returns the same or a different document (most common situation)\n", " * returns None: something went wrong or the document should get filtered\n", " * returns a list of zero to n documents: filter, error, split documents\n", " * may be just a function, but usually a subclass of `Annotator`\n", " * standard methods for handling over-a-corpus results\n", "* Pipeline: a special annotator that recursively runs other annotators in sequence\n", "* Executor: a class that runs an annotator\n", " * on a corpus\n", " * on a source and optional destination\n", " * takes care of handling None, lists of returned documents\n", "\n", " \n", "\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Example 1/3" ] }, { "cell_type": "code", "execution_count": 38, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "from gatenlp.corpora import ListCorpus\n", "from gatenlp.processing.pipeline import Pipeline \n", "from gatenlp.processing.annotator import AnnotatorFunction\n", "from gatenlp.processing.executor import SerialCorpusExecutor\n", "\n", "texts = [\"Some text.\", \"Another text.\", \"Also some text here.\", \"And this is also some text.\"]\n", "docs = [Document(t) for t in texts]\n", "corp = ListCorpus(docs)\n", "\n", "def annfunc1(doc):\n", " doc.annset().add(0,3,\"Ann1\")\n", " return doc\n", "def annfunc2(doc):\n", " doc.annset(\"set1\").add(1,4,\"Type1\")\n", " return doc\n", "ann1 = AnnotatorFunction(annfunc1)\n", "ann2 = AnnotatorFunction(annfunc2)\n", "pipeline = Pipeline()\n", "pipeline.add(ann1, name=\"FirstAnnotator\")\n", "pipeline.add(ann2, name=\"SecondAnnotator\")\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Example 2/3" ] }, { "cell_type": "code", "execution_count": 39, "metadata": { "scrolled": false, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(Also some text here.,features=Features({'__idx_140542587401168': 2}),anns={'': AnnotationSet([Annotation(0,3,Ann1,features=Features({}),id=0)]), 'set1': AnnotationSet([Annotation(1,4,Type1,features=Features({}),id=0)])})" ] }, "execution_count": 39, "metadata": {}, "output_type": "execute_result" } ], "source": [ "exe = SerialCorpusExecutor(pipeline, corpus=corp)\n", "exe()\n", "corp[2]" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Example 3/3" ] }, { "cell_type": "code", "execution_count": 40, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Length of corpnew: 4\n", "in=4, out=4, none=0, ok=4, err=0\n" ] }, { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(Also some text here.,features=Features({'__idx_140542587401168': 2, '__idx_140541279480784': 2}),anns={'': AnnotationSet([Annotation(0,3,Ann1,features=Features({}),id=0), Annotation(0,3,Ann1,features=Features({}),id=1)]), 'set1': AnnotationSet([Annotation(1,4,Type1,features=Features({}),id=0), Annotation(1,4,Type1,features=Features({}),id=1)])})" ] }, "execution_count": 40, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# use corp as source and create another ListCorpus as destination\n", "corpnew = ListCorpus([])\n", "exe2 = SerialCorpusExecutor(pipeline, source=corp, destination=corpnew)\n", "exe2()\n", "print(\"Length of corpnew:\", len(corpnew))\n", "print(f\"in={exe2.n_in}, out={exe2.n_out}, none={exe2.n_none}, ok={exe2.n_ok}, err={exe2.n_err}\")\n", "corpnew[2]" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Spacy Annotator\n", "\n", "* Use a [SpaCy](https://spacy.io/) pipeline to annotate a document\n", "* convert spacy tokens, entities etc into Annotations, convert token attributes into annotation features\n", "* makes it much easier to add own annotations and features, no need to keep vocab files around\n", "* but possibly not as optimized/fast as Spacy\n", "\n", "Preparation:\n", "* make sure spacy dependency is installed for your gatenlp environment:
\n", "`pip install -U spacy` (not necessary if `gatenlp[all]` was used)\n", "* make sure the [model for the language](https://spacy.io/models) is installed:
\n", " English: `python -m spacy download en_core_web_sm`\n", "* To use in notebook, need to restart kernel after installation! " ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Spacy Annotator\n" ] }, { "cell_type": "code", "execution_count": 41, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "/home/johann/software/anaconda/envs/gatenlp-37/lib/python3.7/site-packages/torch/cuda/__init__.py:80: UserWarning: CUDA initialization: The NVIDIA driver on your system is too old (found version 9010). Please update your GPU driver by downloading and installing a new version from the URL: http://www.nvidia.com/Download/index.aspx Alternatively, go to: https://pytorch.org to install a PyTorch version that has been compiled with your version of the CUDA driver. (Triggered internally at ../c10/cuda/CUDAFunctions.cpp:112.)\n", " return torch._C._cuda_getDeviceCount() > 0\n" ] }, { "name": "stdout", "output_type": "stream", "text": [ "Spacy version: 3.3.1\n" ] }, { "name": "stderr", "output_type": "stream", "text": [ "/home/johann/software/anaconda/envs/gatenlp-37/lib/python3.7/site-packages/spacy/util.py:837: UserWarning: [W095] Model 'en_core_web_sm' (3.2.0) was trained with spaCy v3.2 and may not be 100% compatible with the current version (3.3.1). If you see errors or degraded performance, download a newer compatible model or retrain your custom model with the current spaCy version. For more details and available updates, run: python -m spacy validate\n", " warnings.warn(warn_msg)\n" ] } ], "source": [ "import spacy\n", "print(\"Spacy version:\", spacy.__version__)\n", "from gatenlp.lib_spacy import AnnSpacy\n", "\n", "nlp = spacy.load(\"en_core_web_sm\")\n", "annotator = AnnSpacy(pipeline=nlp, outsetname=\"Spacy\")\n", "doc2.annset(\"Spacy\").clear() # avoid annotation duplication when running several times\n", "doc2 = annotator(doc2)\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Spacy Annotator" ] }, { "cell_type": "code", "execution_count": 42, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# Adapt size of viewer\n", "from IPython.core.display import display, HTML\n", "display(HTML(\"\"))\n", "\n", "\n", "doc2.show(htmlid=\"view2\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Stanza Annotator\n", "\n", "* Use a [Stanza](https://stanfordnlp.github.io/stanza/) pipeline to annotate a document\n", "* convert stanza tokens, entities etc into Annotations, convert token attributes into annotation features\n", "\n", "Preparation:\n", "* make sure stanza dependency is installed for your gatenlp environment:
\n", "`pip install -U stanza` (not necessary if `gatenlp[all]` was used)\n", "* make sure the [model for the language](https://spacy.io/models) is installed:
\n", " English: `python -c \"import stanza; stanza.download('en')\" `" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Stanza Annotator" ] }, { "cell_type": "code", "execution_count": 43, "metadata": { "scrolled": false, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Stanza version: 1.3.0\n" ] } ], "source": [ "import stanza\n", "print(\"Stanza version:\",stanza.__version__)\n", "from gatenlp.lib_stanza import AnnStanza\n", "\n", "nlpstanza = stanza.Pipeline(logging_level=\"ERROR\")\n", "annotatorstanza = AnnStanza(pipeline=nlpstanza, outsetname=\"Stanza\")\n", "doc2.annset(\"Stanza\").clear() # avoid annotation duplication when running several times\n", "doc2 = annotatorstanza(doc2)\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Stanza Annotator" ] }, { "cell_type": "code", "execution_count": 44, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# change size of document viewer\n", "from IPython.core.display import display, HTML\n", "display(HTML(\"\"))\n", "\n", "doc2.show(htmlid=\"view3\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Gazetteers\n", "\n", "* Look up tokens/words/strings/phrases in a long list (\"gazetteer list\")\n", "* **StringGazetteer**: match document text against list of known strings\n", "* **TokenGazetter**: \n", " * Match sequences of tokens\n", " * Tokens in documents: based on sequences of annotations\n", " * Use underlying document text or some feature value per annotation (\"Token\")\n", " * gazetteer list: each entry should be a list of tokens as well!\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## StringGazetteer: gazetteer list" ] }, { "cell_type": "code", "execution_count": 45, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "from gatenlp.processing.gazetteer import StringGazetteer\n", "\n", "# Strings we want to find\n", "sgazlist1 = [\n", " (\"Barack Obama\", dict(url=\"https://en.wikipedia.org/wiki/Barack_Obama\")),\n", " (\"Obama\", dict(url=\"https://en.wikipedia.org/wiki/Barack_Obama\")),\n", " (\"Donald Trump\", dict(url=\"https://en.wikipedia.org/wiki/Donald_Trump\")),\n", " (\"Trump\", dict(url=\"https://en.wikipedia.org/wiki/Donald_Trump\")),\n", " (\"George W. Bush\", dict(url=\"https://en.wikipedia.org/wiki/George_W._Bush\")),\n", " (\"George Bush\", dict(url=\"https://en.wikipedia.org/wiki/George_W._Bush\")),\n", " (\"Bush\", dict(url=\"https://en.wikipedia.org/wiki/George_W._Bush\")),\n", " (\" Bill Clinton \", dict(url=\"https://en.wikipedia.org/wiki/Bill_Clinton\")),\n", " (\"Clinton\", dict(url=\"https://en.wikipedia.org/wiki/Bill_Clinton\")),\n", "]" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## StringGazetteer: document " ] }, { "cell_type": "code", "execution_count": 46, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(Barack Obama was the 44th president of the US and he followed George W. Bush and\n", " was followed by Donald Trump. Before Bush, Bill Clinton was president.\n", " Also, lets include a sentence about South Korea which is called 대한민국 in Korean.\n", " And a sentence with the full name of Iran in Farsi: جمهوری اسلامی ایران and also with \n", " just the word \"Iran\" in Farsi: ایران \n", " Also barack obama in all lower case and SOUTH KOREA in all upper case\n", " ,features=Features({}),anns={})" ] }, "execution_count": 46, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# Document with some text mentioning some of the names in the gazeteer for testing\n", "text = \"\"\"Barack Obama was the 44th president of the US and he followed George W. Bush and\n", " was followed by Donald Trump. Before Bush, Bill Clinton was president.\n", " Also, lets include a sentence about South Korea which is called 대한민국 in Korean.\n", " And a sentence with the full name of Iran in Farsi: جمهوری اسلامی ایران and also with \n", " just the word \"Iran\" in Farsi: ایران \n", " Also barack obama in all lower case and SOUTH KOREA in all upper case\n", " \"\"\"\n", "doc0 = Document(text)\n", "doc0" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## StringGazetteer: match" ] }, { "cell_type": "code", "execution_count": 47, "metadata": { "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "doc0.annset().clear()" ] }, { "cell_type": "code", "execution_count": 48, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(Barack Obama was the 44th president of the US and he followed George W. Bush and\n", " was followed by Donald Trump. Before Bush, Bill Clinton was president.\n", " Also, lets include a sentence about South Korea which is called 대한민국 in Korean.\n", " And a sentence with the full name of Iran in Farsi: جمهوری اسلامی ایران and also with \n", " just the word \"Iran\" in Farsi: ایران \n", " Also barack obama in all lower case and SOUTH KOREA in all upper case\n", " ,features=Features({}),anns={'': AnnotationSet([Annotation(0,12,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/Barack_Obama'}),id=0), Annotation(7,12,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/Barack_Obama'}),id=1), Annotation(62,76,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/George_W._Bush'}),id=2), Annotation(72,76,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/George_W._Bush'}),id=3), Annotation(99,111,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/Donald_Trump'}),id=4), Annotation(106,111,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/Donald_Trump'}),id=5), Annotation(120,124,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/George_W._Bush'}),id=6), Annotation(126,138,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/Bill_Clinton'}),id=7), Annotation(131,138,Lookup,features=Features({'url': 'https://en.wikipedia.org/wiki/Bill_Clinton'}),id=8)])})" ] }, "execution_count": 48, "metadata": {}, "output_type": "execute_result" } ], "source": [ "sgaz1 = StringGazetteer(source=sgazlist1, source_fmt=\"gazlist\")\n", "doc0 = sgaz1(doc0)\n", "doc0" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Document Tokenization\n", "\n", "* Stanza and Spacy both produce Tokens\n", "* Also possible to use NLTKTokenizer: wrap any of the NLTK tokenizers" ] }, { "cell_type": "code", "execution_count": 49, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "from gatenlp.processing.tokenizer import NLTKTokenizer\n", "from nltk.tokenize.destructive import NLTKWordTokenizer # get some tokenizer to use\n", "nltk_tokenizer = NLTKTokenizer(nltk_tokenizer=NLTKWordTokenizer(), token_type=\"Token\")\n", "doc2.annset().clear() \n", "doc2 = nltk_tokenizer(doc2)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Document Tokenization" ] }, { "cell_type": "code", "execution_count": 50, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a test document.\n", "\n", "It contains just a few sentences. \n", "Here is a sentence that mentions a few named entities like \n", "the persons Barack Obama or Ursula von der Leyen, locations\n", "like New York City, Vienna or Beijing or companies like \n", "Google, UniCredit or Huawei. And here is Donald Trump, it may not be the real one :P\n", "\n", "Lets say Boris Johnson aka Bojo tweets from his BorisJohnson account, would be nice to match them!\n", "\n", "Here we include a URL https://gatenlp.github.io/python-gatenlp/ \n", "and a fake email address john.doe@hiscoolserver.com as well \n", "as #some #cool #hastags and a bunch of emojis like 😽 (a kissing cat),\n", "👩‍🏫 (a woman teacher), 🧬 (DNA), \n", "🧗 (a person climbing), \n", "\n", "Here we test a few different scripts, e.g. Hangul 한글 or \n", "simplified Hanzi 汉字 or Farsi فارسی and Arabic ,اَلْعَرَبِيَّةُ, which goes from right to left.\n", "\n", ",features=Features({}),anns={'Spacy': AnnotationSet([Annotation(0,4,Token,features=Features({'_i': 0, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'this', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12943039165150086467, 'pos': 'PRON', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'Xxxx', 'suffix': 'his', 'ent_type': '', 'dep': 'nsubj', 'head': 2, 'left_edge': 0, 'right_edge': 0}),id=0), Annotation(0,24,Sentence,features=Features({}),id=336), Annotation(0,4,NounChunk,features=Features({}),id=342), Annotation(4,5,SpaceToken,features=Features({'is_space': True}),id=1), Annotation(5,7,Token,features=Features({'_i': 1, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ROOT', 'head': 2, 'left_edge': 0, 'right_edge': 9}),id=2), Annotation(7,8,SpaceToken,features=Features({'is_space': True}),id=3), Annotation(8,9,Token,features=Features({'_i': 2, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 8, 'left_edge': 4, 'right_edge': 4}),id=4), Annotation(8,23,NounChunk,features=Features({}),id=343), Annotation(9,10,SpaceToken,features=Features({'is_space': True}),id=5), Annotation(10,14,Token,features=Features({'_i': 3, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'compound', 'head': 8, 'left_edge': 6, 'right_edge': 6}),id=6), Annotation(14,15,SpaceToken,features=Features({'is_space': True}),id=7), Annotation(15,23,Token,features=Features({'_i': 4, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'document', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2376024544662810659, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'attr', 'head': 2, 'left_edge': 4, 'right_edge': 8}),id=8), Annotation(23,24,Token,features=Features({'_i': 5, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 2, 'left_edge': 9, 'right_edge': 9}),id=9), Annotation(24,26,SpaceToken,features=Features({'_i': 6, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 13, 'left_edge': 10, 'right_edge': 10}),id=10), Annotation(24,59,Sentence,features=Features({}),id=337), Annotation(26,28,Token,features=Features({'_i': 7, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7859011591137717335, 'pos': 'PRON', 'prefix': 'I', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'Xx', 'suffix': 'It', 'ent_type': '', 'dep': 'nsubj', 'head': 13, 'left_edge': 11, 'right_edge': 11}),id=11), Annotation(26,28,NounChunk,features=Features({}),id=344), Annotation(28,29,SpaceToken,features=Features({'is_space': True}),id=12), Annotation(29,37,Token,features=Features({'_i': 8, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'contain', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4189568596636683296, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ins', 'ent_type': '', 'dep': 'ROOT', 'head': 13, 'left_edge': 10, 'right_edge': 22}),id=13), Annotation(37,38,SpaceToken,features=Features({'is_space': True}),id=14), Annotation(38,42,Token,features=Features({'_i': 9, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'just', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7148522813498185515, 'pos': 'ADV', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ust', 'ent_type': '', 'dep': 'advmod', 'head': 19, 'left_edge': 15, 'right_edge': 15}),id=15), Annotation(38,58,NounChunk,features=Features({}),id=345), Annotation(42,43,SpaceToken,features=Features({'is_space': True}),id=16), Annotation(43,44,Token,features=Features({'_i': 10, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 19, 'left_edge': 17, 'right_edge': 17}),id=17), Annotation(44,45,SpaceToken,features=Features({'is_space': True}),id=18), Annotation(45,48,Token,features=Features({'_i': 11, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 21, 'left_edge': 15, 'right_edge': 19}),id=19), Annotation(48,49,SpaceToken,features=Features({'is_space': True}),id=20), Annotation(49,58,Token,features=Features({'_i': 12, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5257340109698985342, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ces', 'ent_type': '', 'dep': 'dobj', 'head': 13, 'left_edge': 15, 'right_edge': 21}),id=21), Annotation(58,59,Token,features=Features({'_i': 13, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 13, 'left_edge': 22, 'right_edge': 22}),id=22), Annotation(59,60,SpaceToken,features=Features({'is_space': True}),id=23), Annotation(60,61,SpaceToken,features=Features({'_i': 14, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 27, 'left_edge': 24, 'right_edge': 24}),id=24), Annotation(60,266,Sentence,features=Features({}),id=338), Annotation(61,65,Token,features=Features({'_i': 15, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 27, 'left_edge': 25, 'right_edge': 25}),id=25), Annotation(65,66,SpaceToken,features=Features({'is_space': True}),id=26), Annotation(66,68,Token,features=Features({'_i': 16, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 68, 'left_edge': 24, 'right_edge': 65}),id=27), Annotation(68,69,SpaceToken,features=Features({'is_space': True}),id=28), Annotation(69,70,Token,features=Features({'_i': 17, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 31, 'left_edge': 29, 'right_edge': 29}),id=29), Annotation(69,79,NounChunk,features=Features({}),id=346), Annotation(70,71,SpaceToken,features=Features({'is_space': True}),id=30), Annotation(71,79,Token,features=Features({'_i': 18, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18108853898452662235, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nce', 'ent_type': '', 'dep': 'attr', 'head': 27, 'left_edge': 29, 'right_edge': 64}),id=31), Annotation(79,80,SpaceToken,features=Features({'is_space': True}),id=32), Annotation(80,84,Token,features=Features({'_i': 19, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'that', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4380130941430378203, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'hat', 'ent_type': '', 'dep': 'nsubj', 'head': 35, 'left_edge': 33, 'right_edge': 33}),id=33), Annotation(80,84,NounChunk,features=Features({}),id=347), Annotation(84,85,SpaceToken,features=Features({'is_space': True}),id=34), Annotation(85,93,Token,features=Features({'_i': 20, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'mention', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 834570530775529781, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'relcl', 'head': 31, 'left_edge': 33, 'right_edge': 64}),id=35), Annotation(93,94,SpaceToken,features=Features({'is_space': True}),id=36), Annotation(94,95,Token,features=Features({'_i': 21, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 43, 'left_edge': 37, 'right_edge': 37}),id=37), Annotation(94,114,NounChunk,features=Features({}),id=348), Annotation(95,96,SpaceToken,features=Features({'is_space': True}),id=38), Annotation(96,99,Token,features=Features({'_i': 22, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 39, 'right_edge': 39}),id=39), Annotation(99,100,SpaceToken,features=Features({'is_space': True}),id=40), Annotation(100,105,Token,features=Features({'_i': 23, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'name', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6719370519630147158, 'pos': 'VERB', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'med', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 41, 'right_edge': 41}),id=41), Annotation(105,106,SpaceToken,features=Features({'is_space': True}),id=42), Annotation(106,114,Token,features=Features({'_i': 24, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'entity', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7701268449352692754, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'dobj', 'head': 35, 'left_edge': 37, 'right_edge': 64}),id=43), Annotation(114,115,SpaceToken,features=Features({'is_space': True}),id=44), Annotation(115,119,Token,features=Features({'_i': 25, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 43, 'left_edge': 45, 'right_edge': 64}),id=45), Annotation(119,120,SpaceToken,features=Features({'is_space': True}),id=46), Annotation(120,121,SpaceToken,features=Features({'_i': 26, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 45, 'left_edge': 47, 'right_edge': 47}),id=47), Annotation(121,124,Token,features=Features({'_i': 27, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 50, 'left_edge': 48, 'right_edge': 48}),id=48), Annotation(121,132,NounChunk,features=Features({}),id=349), Annotation(124,125,SpaceToken,features=Features({'is_space': True}),id=49), Annotation(125,132,Token,features=Features({'_i': 28, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17520808660558581486, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'pobj', 'head': 45, 'left_edge': 48, 'right_edge': 64}),id=50), Annotation(132,133,SpaceToken,features=Features({'is_space': True}),id=51), Annotation(133,139,Token,features=Features({'_i': 29, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Barack', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15388493565120789335, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ack', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 54, 'left_edge': 52, 'right_edge': 52}),id=52), Annotation(133,145,PERSON,features=Features({'lemma': 'Barack Obama'}),id=318), Annotation(133,145,NounChunk,features=Features({}),id=350), Annotation(139,140,SpaceToken,features=Features({'is_space': True}),id=53), Annotation(140,145,Token,features=Features({'_i': 30, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Obama', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4857242187112322394, 'pos': 'PROPN', 'prefix': 'O', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ama', 'ent_type': 'PERSON', 'dep': 'appos', 'head': 50, 'left_edge': 52, 'right_edge': 64}),id=54), Annotation(145,146,SpaceToken,features=Features({'is_space': True}),id=55), Annotation(146,148,Token,features=Features({'_i': 31, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 54, 'left_edge': 56, 'right_edge': 56}),id=56), Annotation(148,149,SpaceToken,features=Features({'is_space': True}),id=57), Annotation(149,155,Token,features=Features({'_i': 32, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Ursula', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13791600832320669420, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ula', 'ent_type': 'GPE', 'dep': 'compound', 'head': 62, 'left_edge': 58, 'right_edge': 58}),id=58), Annotation(149,155,GPE,features=Features({'lemma': 'Ursula'}),id=319), Annotation(149,169,NounChunk,features=Features({}),id=351), Annotation(155,156,SpaceToken,features=Features({'is_space': True}),id=59), Annotation(156,159,Token,features=Features({'_i': 33, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'von', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7079615728476843702, 'pos': 'PROPN', 'prefix': 'v', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxx', 'suffix': 'von', 'ent_type': '', 'dep': 'compound', 'head': 62, 'left_edge': 60, 'right_edge': 60}),id=60), Annotation(159,160,SpaceToken,features=Features({'is_space': True}),id=61), Annotation(160,163,Token,features=Features({'_i': 34, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'der', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9250722957692387333, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'der', 'ent_type': '', 'dep': 'compound', 'head': 64, 'left_edge': 58, 'right_edge': 62}),id=62), Annotation(163,164,SpaceToken,features=Features({'is_space': True}),id=63), Annotation(164,169,Token,features=Features({'_i': 35, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Leyen', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13266048868047277473, 'pos': 'PROPN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'yen', 'ent_type': 'GPE', 'dep': 'conj', 'head': 54, 'left_edge': 58, 'right_edge': 64}),id=64), Annotation(164,169,GPE,features=Features({'lemma': 'Leyen'}),id=320), Annotation(169,170,Token,features=Features({'_i': 36, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 27, 'left_edge': 65, 'right_edge': 65}),id=65), Annotation(170,171,SpaceToken,features=Features({'is_space': True}),id=66), Annotation(171,180,Token,features=Features({'_i': 37, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'location', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13063876858105367867, 'pos': 'NOUN', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'nsubj', 'head': 68, 'left_edge': 67, 'right_edge': 67}),id=67), Annotation(171,180,NounChunk,features=Features({}),id=352), Annotation(180,181,SpaceToken,features=Features({'_i': 38, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 68, 'left_edge': 24, 'right_edge': 99}),id=68), Annotation(181,185,Token,features=Features({'_i': 39, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 68, 'left_edge': 69, 'right_edge': 98}),id=69), Annotation(185,186,SpaceToken,features=Features({'is_space': True}),id=70), Annotation(186,189,Token,features=Features({'_i': 40, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'New', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7503827727184870577, 'pos': 'PROPN', 'prefix': 'N', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxx', 'suffix': 'New', 'ent_type': 'GPE', 'dep': 'compound', 'head': 73, 'left_edge': 71, 'right_edge': 71}),id=71), Annotation(186,199,GPE,features=Features({'lemma': 'New York City'}),id=321), Annotation(186,199,NounChunk,features=Features({}),id=353), Annotation(189,190,SpaceToken,features=Features({'is_space': True}),id=72), Annotation(190,194,Token,features=Features({'_i': 41, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'York', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7898044819112200372, 'pos': 'PROPN', 'prefix': 'Y', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ork', 'ent_type': 'GPE', 'dep': 'compound', 'head': 75, 'left_edge': 71, 'right_edge': 73}),id=73), Annotation(194,195,SpaceToken,features=Features({'is_space': True}),id=74), Annotation(195,199,Token,features=Features({'_i': 42, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'City', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 728339108046626067, 'pos': 'PROPN', 'prefix': 'C', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ity', 'ent_type': 'GPE', 'dep': 'pobj', 'head': 69, 'left_edge': 71, 'right_edge': 98}),id=75), Annotation(199,200,Token,features=Features({'_i': 43, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 75, 'left_edge': 76, 'right_edge': 76}),id=76), Annotation(200,201,SpaceToken,features=Features({'is_space': True}),id=77), Annotation(201,207,Token,features=Features({'_i': 44, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Vienna', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10141676973022394345, 'pos': 'PROPN', 'prefix': 'V', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nna', 'ent_type': 'GPE', 'dep': 'conj', 'head': 75, 'left_edge': 78, 'right_edge': 98}),id=78), Annotation(201,207,GPE,features=Features({'lemma': 'Vienna'}),id=322), Annotation(201,207,NounChunk,features=Features({}),id=354), Annotation(207,208,SpaceToken,features=Features({'is_space': True}),id=79), Annotation(208,210,Token,features=Features({'_i': 45, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 80, 'right_edge': 80}),id=80), Annotation(210,211,SpaceToken,features=Features({'is_space': True}),id=81), Annotation(211,218,Token,features=Features({'_i': 46, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Beijing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2949147885855558195, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ing', 'ent_type': 'GPE', 'dep': 'conj', 'head': 78, 'left_edge': 82, 'right_edge': 82}),id=82), Annotation(211,218,GPE,features=Features({'lemma': 'Beijing'}),id=323), Annotation(211,218,NounChunk,features=Features({}),id=355), Annotation(218,219,SpaceToken,features=Features({'is_space': True}),id=83), Annotation(219,221,Token,features=Features({'_i': 47, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 84, 'right_edge': 84}),id=84), Annotation(221,222,SpaceToken,features=Features({'is_space': True}),id=85), Annotation(222,231,Token,features=Features({'_i': 48, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'company', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8026612326651866097, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'conj', 'head': 78, 'left_edge': 86, 'right_edge': 98}),id=86), Annotation(222,231,NounChunk,features=Features({}),id=356), Annotation(231,232,SpaceToken,features=Features({'is_space': True}),id=87), Annotation(232,236,Token,features=Features({'_i': 49, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 86, 'left_edge': 88, 'right_edge': 98}),id=88), Annotation(236,237,SpaceToken,features=Features({'is_space': True}),id=89), Annotation(237,238,SpaceToken,features=Features({'_i': 50, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 88, 'left_edge': 90, 'right_edge': 90}),id=90), Annotation(238,244,Token,features=Features({'_i': 51, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Google', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11578853341595296054, 'pos': 'PROPN', 'prefix': 'G', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gle', 'ent_type': '', 'dep': 'pobj', 'head': 88, 'left_edge': 91, 'right_edge': 98}),id=91), Annotation(238,244,NounChunk,features=Features({}),id=357), Annotation(244,245,Token,features=Features({'_i': 52, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 91, 'left_edge': 92, 'right_edge': 92}),id=92), Annotation(245,246,SpaceToken,features=Features({'is_space': True}),id=93), Annotation(246,255,Token,features=Features({'_i': 53, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'UniCredit', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7663443051765273511, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxXxxxx', 'suffix': 'dit', 'ent_type': 'ORG', 'dep': 'conj', 'head': 91, 'left_edge': 94, 'right_edge': 98}),id=94), Annotation(246,255,ORG,features=Features({'lemma': 'UniCredit'}),id=324), Annotation(246,255,NounChunk,features=Features({}),id=358), Annotation(255,256,SpaceToken,features=Features({'is_space': True}),id=95), Annotation(256,258,Token,features=Features({'_i': 54, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 94, 'left_edge': 96, 'right_edge': 96}),id=96), Annotation(258,259,SpaceToken,features=Features({'is_space': True}),id=97), Annotation(259,265,Token,features=Features({'_i': 55, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Huawei', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4639919019609272215, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'wei', 'ent_type': 'ORG', 'dep': 'conj', 'head': 94, 'left_edge': 98, 'right_edge': 98}),id=98), Annotation(259,265,ORG,features=Features({'lemma': 'Huawei'}),id=325), Annotation(259,265,NounChunk,features=Features({}),id=359), Annotation(265,266,Token,features=Features({'_i': 56, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 68, 'left_edge': 99, 'right_edge': 99}),id=99), Annotation(266,267,SpaceToken,features=Features({'is_space': True}),id=100), Annotation(267,270,Token,features=Features({'_i': 57, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12172435438170721471, 'pos': 'CCONJ', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'Xxx', 'suffix': 'And', 'ent_type': '', 'dep': 'cc', 'head': 105, 'left_edge': 101, 'right_edge': 101}),id=101), Annotation(267,422,Sentence,features=Features({}),id=339), Annotation(270,271,SpaceToken,features=Features({'is_space': True}),id=102), Annotation(271,275,Token,features=Features({'_i': 58, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 411390626470654571, 'pos': 'ADV', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 105, 'left_edge': 103, 'right_edge': 103}),id=103), Annotation(275,276,SpaceToken,features=Features({'is_space': True}),id=104), Annotation(276,278,Token,features=Features({'_i': 59, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 118, 'left_edge': 101, 'right_edge': 109}),id=105), Annotation(278,279,SpaceToken,features=Features({'is_space': True}),id=106), Annotation(279,285,Token,features=Features({'_i': 60, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Donald', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16889399016836222064, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ald', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 109, 'left_edge': 107, 'right_edge': 107}),id=107), Annotation(279,291,PERSON,features=Features({'lemma': 'Donald Trump'}),id=326), Annotation(279,291,NounChunk,features=Features({}),id=360), Annotation(285,286,SpaceToken,features=Features({'is_space': True}),id=108), Annotation(286,291,Token,features=Features({'_i': 61, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Trump', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1134333841961332695, 'pos': 'PROPN', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ump', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 105, 'left_edge': 107, 'right_edge': 109}),id=109), Annotation(291,292,Token,features=Features({'_i': 62, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 118, 'left_edge': 110, 'right_edge': 110}),id=110), Annotation(292,293,SpaceToken,features=Features({'is_space': True}),id=111), Annotation(293,295,Token,features=Features({'_i': 63, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10239237003504588839, 'pos': 'PRON', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'it', 'ent_type': '', 'dep': 'nsubj', 'head': 118, 'left_edge': 112, 'right_edge': 112}),id=112), Annotation(293,295,NounChunk,features=Features({}),id=361), Annotation(295,296,SpaceToken,features=Features({'is_space': True}),id=113), Annotation(296,299,Token,features=Features({'_i': 64, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'may', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14378475389916013800, 'pos': 'AUX', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxx', 'suffix': 'may', 'ent_type': '', 'dep': 'aux', 'head': 118, 'left_edge': 114, 'right_edge': 114}),id=114), Annotation(299,300,SpaceToken,features=Features({'is_space': True}),id=115), Annotation(300,303,Token,features=Features({'_i': 65, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'not', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 447765159362469301, 'pos': 'PART', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'not', 'ent_type': '', 'dep': 'neg', 'head': 118, 'left_edge': 116, 'right_edge': 116}),id=116), Annotation(303,304,SpaceToken,features=Features({'is_space': True}),id=117), Annotation(304,306,Token,features=Features({'_i': 66, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ccomp', 'head': 130, 'left_edge': 101, 'right_edge': 127}),id=118), Annotation(306,307,SpaceToken,features=Features({'is_space': True}),id=119), Annotation(307,310,Token,features=Features({'_i': 67, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 124, 'left_edge': 120, 'right_edge': 120}),id=120), Annotation(310,311,SpaceToken,features=Features({'is_space': True}),id=121), Annotation(311,315,Token,features=Features({'_i': 68, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'real', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7050692189315705983, 'pos': 'ADJ', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'eal', 'ent_type': '', 'dep': 'amod', 'head': 124, 'left_edge': 122, 'right_edge': 122}),id=122), Annotation(315,316,SpaceToken,features=Features({'is_space': True}),id=123), Annotation(316,319,Token,features=Features({'_i': 69, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'one', 'like_email': False, 'like_num': True, 'like_url': False, 'orth': 17454115351911680600, 'pos': 'NUM', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xxx', 'suffix': 'one', 'ent_type': '', 'dep': 'attr', 'head': 118, 'left_edge': 120, 'right_edge': 127}),id=124), Annotation(319,320,SpaceToken,features=Features({'is_space': True}),id=125), Annotation(320,322,Token,features=Features({'_i': 70, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': True, 'lang': 'en', 'lemma': ':P', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11282459806373970386, 'pos': 'PUNCT', 'prefix': ':', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ':', 'shape': ':X', 'suffix': ':P', 'ent_type': '', 'dep': 'punct', 'head': 124, 'left_edge': 126, 'right_edge': 126}),id=126), Annotation(322,324,SpaceToken,features=Features({'_i': 71, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 124, 'left_edge': 127, 'right_edge': 127}),id=127), Annotation(324,328,Token,features=Features({'_i': 72, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'let', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8631549979455155821, 'pos': 'NOUN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'Xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'nsubj', 'head': 130, 'left_edge': 128, 'right_edge': 128}),id=128), Annotation(324,328,NounChunk,features=Features({}),id=362), Annotation(328,329,SpaceToken,features=Features({'is_space': True}),id=129), Annotation(329,332,Token,features=Features({'_i': 73, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'say', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8685289367999165211, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxx', 'suffix': 'say', 'ent_type': '', 'dep': 'parataxis', 'head': 153, 'left_edge': 101, 'right_edge': 149}),id=130), Annotation(332,333,SpaceToken,features=Features({'is_space': True}),id=131), Annotation(333,338,Token,features=Features({'_i': 74, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Boris', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3926703778135926261, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ris', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 134, 'left_edge': 132, 'right_edge': 132}),id=132), Annotation(333,346,PERSON,features=Features({'lemma': 'Boris Johnson'}),id=327), Annotation(333,346,NounChunk,features=Features({}),id=363), Annotation(338,339,SpaceToken,features=Features({'is_space': True}),id=133), Annotation(339,346,Token,features=Features({'_i': 75, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Johnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14020170261514038406, 'pos': 'PROPN', 'prefix': 'J', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'son', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 130, 'left_edge': 132, 'right_edge': 148}),id=134), Annotation(346,347,SpaceToken,features=Features({'is_space': True}),id=135), Annotation(347,350,Token,features=Features({'_i': 76, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'aka', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6309978511773728404, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'aka', 'ent_type': '', 'dep': 'advmod', 'head': 140, 'left_edge': 136, 'right_edge': 136}),id=136), Annotation(347,362,NounChunk,features=Features({}),id=364), Annotation(350,351,SpaceToken,features=Features({'is_space': True}),id=137), Annotation(351,355,Token,features=Features({'_i': 77, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Bojo', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1381500864330977487, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ojo', 'ent_type': '', 'dep': 'compound', 'head': 140, 'left_edge': 138, 'right_edge': 138}),id=138), Annotation(355,356,SpaceToken,features=Features({'is_space': True}),id=139), Annotation(356,362,Token,features=Features({'_i': 78, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'tweet', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9158814828460388352, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'appos', 'head': 134, 'left_edge': 136, 'right_edge': 148}),id=140), Annotation(362,363,SpaceToken,features=Features({'is_space': True}),id=141), Annotation(363,367,Token,features=Features({'_i': 79, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 140, 'left_edge': 142, 'right_edge': 148}),id=142), Annotation(367,368,SpaceToken,features=Features({'is_space': True}),id=143), Annotation(368,371,Token,features=Features({'_i': 80, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'his', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2661093235354845946, 'pos': 'PRON', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP$', 'shape': 'xxx', 'suffix': 'his', 'ent_type': '', 'dep': 'poss', 'head': 148, 'left_edge': 144, 'right_edge': 144}),id=144), Annotation(368,392,NounChunk,features=Features({}),id=365), Annotation(371,372,SpaceToken,features=Features({'is_space': True}),id=145), Annotation(372,384,Token,features=Features({'_i': 81, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'BorisJohnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6152028365162425637, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxxxXxxxx', 'suffix': 'son', 'ent_type': 'ORG', 'dep': 'compound', 'head': 148, 'left_edge': 146, 'right_edge': 146}),id=146), Annotation(372,384,ORG,features=Features({'lemma': 'BorisJohnson'}),id=328), Annotation(384,385,SpaceToken,features=Features({'is_space': True}),id=147), Annotation(385,392,Token,features=Features({'_i': 82, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'account', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16051117652154763083, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'unt', 'ent_type': '', 'dep': 'pobj', 'head': 142, 'left_edge': 144, 'right_edge': 148}),id=148), Annotation(392,393,Token,features=Features({'_i': 83, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 130, 'left_edge': 149, 'right_edge': 149}),id=149), Annotation(393,394,SpaceToken,features=Features({'is_space': True}),id=150), Annotation(394,399,Token,features=Features({'_i': 84, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'would', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6992604926141104606, 'pos': 'AUX', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxxx', 'suffix': 'uld', 'ent_type': '', 'dep': 'aux', 'head': 153, 'left_edge': 151, 'right_edge': 151}),id=151), Annotation(399,400,SpaceToken,features=Features({'is_space': True}),id=152), Annotation(400,402,Token,features=Features({'_i': 85, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ROOT', 'head': 153, 'left_edge': 101, 'right_edge': 162}),id=153), Annotation(402,403,SpaceToken,features=Features({'is_space': True}),id=154), Annotation(403,407,Token,features=Features({'_i': 86, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'nice', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14121509715367036122, 'pos': 'ADJ', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ice', 'ent_type': '', 'dep': 'acomp', 'head': 153, 'left_edge': 155, 'right_edge': 155}),id=155), Annotation(407,408,SpaceToken,features=Features({'is_space': True}),id=156), Annotation(408,410,Token,features=Features({'_i': 87, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'PART', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'TO', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 159, 'left_edge': 157, 'right_edge': 157}),id=157), Annotation(410,411,SpaceToken,features=Features({'is_space': True}),id=158), Annotation(411,416,Token,features=Features({'_i': 88, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'match', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16065818573247886523, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xxxx', 'suffix': 'tch', 'ent_type': '', 'dep': 'xcomp', 'head': 153, 'left_edge': 157, 'right_edge': 161}),id=159), Annotation(416,417,SpaceToken,features=Features({'is_space': True}),id=160), Annotation(417,421,Token,features=Features({'_i': 89, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'they', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7976265300568155021, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xxxx', 'suffix': 'hem', 'ent_type': '', 'dep': 'dobj', 'head': 159, 'left_edge': 161, 'right_edge': 161}),id=161), Annotation(417,421,NounChunk,features=Features({}),id=366), Annotation(421,422,Token,features=Features({'_i': 90, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '!', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17494803046312582752, 'pos': 'PUNCT', 'prefix': '!', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '!', 'suffix': '!', 'ent_type': '', 'dep': 'punct', 'head': 153, 'left_edge': 162, 'right_edge': 162}),id=162), Annotation(422,424,SpaceToken,features=Features({'_i': 91, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 163, 'right_edge': 163}),id=163), Annotation(422,829,Sentence,features=Features({}),id=340), Annotation(424,428,Token,features=Features({'_i': 92, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 164, 'right_edge': 164}),id=164), Annotation(428,429,SpaceToken,features=Features({'is_space': True}),id=165), Annotation(429,431,Token,features=Features({'_i': 93, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 168, 'left_edge': 166, 'right_edge': 166}),id=166), Annotation(429,431,NounChunk,features=Features({}),id=367), Annotation(431,432,SpaceToken,features=Features({'is_space': True}),id=167), Annotation(432,439,Token,features=Features({'_i': 94, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'include', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14049642289933595219, 'pos': 'VERB', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'ude', 'ent_type': '', 'dep': 'ccomp', 'head': 265, 'left_edge': 163, 'right_edge': 257}),id=168), Annotation(439,440,SpaceToken,features=Features({'is_space': True}),id=169), Annotation(440,441,Token,features=Features({'_i': 95, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 172, 'left_edge': 170, 'right_edge': 170}),id=170), Annotation(441,442,SpaceToken,features=Features({'is_space': True}),id=171), Annotation(442,445,Token,features=Features({'_i': 96, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'url', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2582013287274679728, 'pos': 'NOUN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'XXX', 'suffix': 'URL', 'ent_type': '', 'dep': 'nmod', 'head': 176, 'left_edge': 170, 'right_edge': 174}),id=172), Annotation(445,446,SpaceToken,features=Features({'is_space': True}),id=173), Annotation(446,487,Token,features=Features({'_i': 97, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'like_email': False, 'like_num': False, 'like_url': True, 'orth': 616995933894448768, 'pos': 'X', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'FW', 'shape': 'xxxx://xxxx.xxxx.xx/xxxx-xxxx/', 'suffix': 'lp/', 'ent_type': '', 'dep': 'nummod', 'head': 172, 'left_edge': 174, 'right_edge': 174}),id=174), Annotation(487,488,SpaceToken,features=Features({'is_space': True}),id=175), Annotation(488,489,SpaceToken,features=Features({'_i': 98, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 170, 'right_edge': 187}),id=176), Annotation(489,492,Token,features=Features({'_i': 99, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 176, 'left_edge': 177, 'right_edge': 177}),id=177), Annotation(492,493,SpaceToken,features=Features({'is_space': True}),id=178), Annotation(493,494,Token,features=Features({'_i': 100, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 185, 'left_edge': 179, 'right_edge': 179}),id=179), Annotation(494,495,SpaceToken,features=Features({'is_space': True}),id=180), Annotation(495,499,Token,features=Features({'_i': 101, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'fake', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 513889413363281732, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ake', 'ent_type': '', 'dep': 'amod', 'head': 185, 'left_edge': 181, 'right_edge': 181}),id=181), Annotation(499,500,SpaceToken,features=Features({'is_space': True}),id=182), Annotation(500,505,Token,features=Features({'_i': 102, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'email', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7320900731437023467, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ail', 'ent_type': '', 'dep': 'compound', 'head': 185, 'left_edge': 183, 'right_edge': 183}),id=183), Annotation(505,506,SpaceToken,features=Features({'is_space': True}),id=184), Annotation(506,513,Token,features=Features({'_i': 103, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'address', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14349763737243715186, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ess', 'ent_type': '', 'dep': 'conj', 'head': 176, 'left_edge': 179, 'right_edge': 187}),id=185), Annotation(513,514,SpaceToken,features=Features({'is_space': True}),id=186), Annotation(514,540,Token,features=Features({'_i': 104, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'john.doe@hiscoolserver.com', 'like_email': True, 'like_num': False, 'like_url': False, 'orth': 8439314673651519470, 'pos': 'PROPN', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxxx.xxx@xxxx.xxx', 'suffix': 'com', 'ent_type': '', 'dep': 'appos', 'head': 185, 'left_edge': 187, 'right_edge': 187}),id=187), Annotation(514,540,NounChunk,features=Features({}),id=368), Annotation(540,541,SpaceToken,features=Features({'is_space': True}),id=188), Annotation(541,543,Token,features=Features({'_i': 105, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'advmod', 'head': 191, 'left_edge': 189, 'right_edge': 189}),id=189), Annotation(543,544,SpaceToken,features=Features({'is_space': True}),id=190), Annotation(544,548,Token,features=Features({'_i': 106, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'well', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4525988469032889948, 'pos': 'ADV', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ell', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 189, 'right_edge': 191}),id=191), Annotation(548,549,SpaceToken,features=Features({'is_space': True}),id=192), Annotation(549,550,SpaceToken,features=Features({'_i': 107, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 193, 'right_edge': 193}),id=193), Annotation(550,552,Token,features=Features({'_i': 108, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADP', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'prep', 'head': 168, 'left_edge': 194, 'right_edge': 203}),id=194), Annotation(552,553,SpaceToken,features=Features({'is_space': True}),id=195), Annotation(553,554,Token,features=Features({'_i': 109, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 199, 'left_edge': 196, 'right_edge': 196}),id=196), Annotation(553,560,MONEY,features=Features({'lemma': '#some #'}),id=329), Annotation(553,573,NounChunk,features=Features({}),id=369), Annotation(554,558,Token,features=Features({'_i': 110, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'some', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7000492816108906599, 'pos': 'DET', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxxx', 'suffix': 'ome', 'ent_type': 'MONEY', 'dep': 'det', 'head': 199, 'left_edge': 197, 'right_edge': 197}),id=197), Annotation(558,559,SpaceToken,features=Features({'is_space': True}),id=198), Annotation(559,560,Token,features=Features({'_i': 111, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'ADV', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RBR', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 203, 'left_edge': 196, 'right_edge': 200}),id=199), Annotation(560,564,Token,features=Features({'_i': 112, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cool', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16540862642162215401, 'pos': 'ADJ', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ool', 'ent_type': '', 'dep': 'amod', 'head': 199, 'left_edge': 200, 'right_edge': 200}),id=200), Annotation(564,565,SpaceToken,features=Features({'is_space': True}),id=201), Annotation(565,566,Token,features=Features({'_i': 113, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': '', 'dep': 'nmod', 'head': 203, 'left_edge': 202, 'right_edge': 202}),id=202), Annotation(566,573,Token,features=Features({'_i': 114, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'hastag', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12335920655888895378, 'pos': 'NOUN', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ags', 'ent_type': '', 'dep': 'pobj', 'head': 194, 'left_edge': 196, 'right_edge': 203}),id=203), Annotation(573,574,SpaceToken,features=Features({'is_space': True}),id=204), Annotation(574,577,Token,features=Features({'_i': 115, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 168, 'left_edge': 205, 'right_edge': 205}),id=205), Annotation(577,578,SpaceToken,features=Features({'is_space': True}),id=206), Annotation(578,579,Token,features=Features({'_i': 116, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 209, 'left_edge': 207, 'right_edge': 207}),id=207), Annotation(579,580,SpaceToken,features=Features({'is_space': True}),id=208), Annotation(580,585,Token,features=Features({'_i': 117, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'bunch', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16097686030507304001, 'pos': 'NOUN', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nch', 'ent_type': '', 'dep': 'conj', 'head': 168, 'left_edge': 207, 'right_edge': 225}),id=209), Annotation(585,586,SpaceToken,features=Features({'is_space': True}),id=210), Annotation(586,588,Token,features=Features({'_i': 118, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'of', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 886050111519832510, 'pos': 'ADP', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'of', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 211, 'right_edge': 213}),id=211), Annotation(588,589,SpaceToken,features=Features({'is_space': True}),id=212), Annotation(589,595,Token,features=Features({'_i': 119, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'emoji', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15308488336638733982, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'jis', 'ent_type': '', 'dep': 'pobj', 'head': 211, 'left_edge': 213, 'right_edge': 213}),id=213), Annotation(589,595,NounChunk,features=Features({}),id=370), Annotation(595,596,SpaceToken,features=Features({'is_space': True}),id=214), Annotation(596,600,Token,features=Features({'_i': 120, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 215, 'right_edge': 224}),id=215), Annotation(600,601,SpaceToken,features=Features({'is_space': True}),id=216), Annotation(601,602,Token,features=Features({'_i': 121, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '😽', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13410134130775049117, 'pos': 'PROPN', 'prefix': '😽', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '😽', 'suffix': '😽', 'ent_type': '', 'dep': 'pobj', 'head': 215, 'left_edge': 217, 'right_edge': 224}),id=217), Annotation(601,602,NounChunk,features=Features({}),id=371), Annotation(602,603,SpaceToken,features=Features({'is_space': True}),id=218), Annotation(603,604,Token,features=Features({'_i': 122, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 217, 'left_edge': 219, 'right_edge': 219}),id=219), Annotation(604,605,Token,features=Features({'_i': 123, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 224, 'left_edge': 220, 'right_edge': 220}),id=220), Annotation(604,617,NounChunk,features=Features({}),id=372), Annotation(605,606,SpaceToken,features=Features({'is_space': True}),id=221), Annotation(606,613,Token,features=Features({'_i': 124, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'kissing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17741139538205582276, 'pos': 'NOUN', 'prefix': 'k', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'amod', 'head': 224, 'left_edge': 222, 'right_edge': 222}),id=222), Annotation(613,614,SpaceToken,features=Features({'is_space': True}),id=223), Annotation(614,617,Token,features=Features({'_i': 125, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cat', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5439657043933447811, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'cat', 'ent_type': '', 'dep': 'appos', 'head': 217, 'left_edge': 220, 'right_edge': 224}),id=224), Annotation(617,618,Token,features=Features({'_i': 126, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 209, 'left_edge': 225, 'right_edge': 225}),id=225), Annotation(618,619,Token,features=Features({'_i': 127, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 168, 'left_edge': 226, 'right_edge': 226}),id=226), Annotation(619,620,SpaceToken,features=Features({'_i': 128, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 227, 'right_edge': 227}),id=227), Annotation(620,621,Token,features=Features({'_i': 129, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '👩', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13533745690575315714, 'pos': 'X', 'prefix': '👩', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '👩', 'suffix': '👩', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 228, 'right_edge': 228}),id=228), Annotation(621,622,Token,features=Features({'_i': 130, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\u200d', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17442142028047717517, 'pos': 'X', 'prefix': '\\u200d', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '\\u200d', 'suffix': '\\u200d', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 229, 'right_edge': 246}),id=229), Annotation(622,623,Token,features=Features({'_i': 131, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🏫', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7253895586487469997, 'pos': 'NOUN', 'prefix': '🏫', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': '🏫', 'suffix': '🏫', 'ent_type': '', 'dep': 'dobj', 'head': 229, 'left_edge': 230, 'right_edge': 246}),id=230), Annotation(622,623,NounChunk,features=Features({}),id=373), Annotation(623,624,SpaceToken,features=Features({'is_space': True}),id=231), Annotation(624,625,Token,features=Features({'_i': 132, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 232, 'right_edge': 232}),id=232), Annotation(625,626,Token,features=Features({'_i': 133, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 237, 'left_edge': 233, 'right_edge': 233}),id=233), Annotation(625,640,NounChunk,features=Features({}),id=374), Annotation(626,627,SpaceToken,features=Features({'is_space': True}),id=234), Annotation(627,632,Token,features=Features({'_i': 134, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'woman', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18308318389743014884, 'pos': 'NOUN', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'man', 'ent_type': '', 'dep': 'compound', 'head': 237, 'left_edge': 235, 'right_edge': 235}),id=235), Annotation(632,633,SpaceToken,features=Features({'is_space': True}),id=236), Annotation(633,640,Token,features=Features({'_i': 135, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'teacher', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 78820239457828971, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'her', 'ent_type': '', 'dep': 'appos', 'head': 230, 'left_edge': 233, 'right_edge': 237}),id=237), Annotation(640,641,Token,features=Features({'_i': 136, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 238, 'right_edge': 238}),id=238), Annotation(641,642,Token,features=Features({'_i': 137, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 239, 'right_edge': 239}),id=239), Annotation(642,643,SpaceToken,features=Features({'is_space': True}),id=240), Annotation(643,644,Token,features=Features({'_i': 138, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧬', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9031423621242036097, 'pos': 'PROPN', 'prefix': '🧬', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧬', 'suffix': '🧬', 'ent_type': 'PERSON', 'dep': 'conj', 'head': 230, 'left_edge': 241, 'right_edge': 245}),id=241), Annotation(643,644,PERSON,features=Features({'lemma': '🧬'}),id=330), Annotation(643,644,NounChunk,features=Features({}),id=375), Annotation(644,645,SpaceToken,features=Features({'is_space': True}),id=242), Annotation(645,646,Token,features=Features({'_i': 139, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 243, 'right_edge': 243}),id=243), Annotation(646,649,Token,features=Features({'_i': 140, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'DNA', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14386531691665947480, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XXX', 'suffix': 'DNA', 'ent_type': '', 'dep': 'appos', 'head': 241, 'left_edge': 244, 'right_edge': 244}),id=244), Annotation(646,649,NounChunk,features=Features({}),id=376), Annotation(649,650,Token,features=Features({'_i': 141, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 245, 'right_edge': 245}),id=245), Annotation(650,651,Token,features=Features({'_i': 142, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 246, 'right_edge': 246}),id=246), Annotation(651,652,SpaceToken,features=Features({'is_space': True}),id=247), Annotation(652,653,SpaceToken,features=Features({'_i': 143, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 256, 'left_edge': 248, 'right_edge': 248}),id=248), Annotation(653,654,Token,features=Features({'_i': 144, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧗', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7724749230788735883, 'pos': 'PROPN', 'prefix': '🧗', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧗', 'suffix': '🧗', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 249, 'right_edge': 249}),id=249), Annotation(653,654,NounChunk,features=Features({}),id=377), Annotation(654,655,SpaceToken,features=Features({'is_space': True}),id=250), Annotation(655,656,Token,features=Features({'_i': 145, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 251, 'right_edge': 251}),id=251), Annotation(656,657,Token,features=Features({'_i': 146, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 254, 'left_edge': 252, 'right_edge': 252}),id=252), Annotation(656,664,NounChunk,features=Features({}),id=378), Annotation(657,658,SpaceToken,features=Features({'is_space': True}),id=253), Annotation(658,664,Token,features=Features({'_i': 147, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14800503047316267216, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'son', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 252, 'right_edge': 254}),id=254), Annotation(664,665,SpaceToken,features=Features({'is_space': True}),id=255), Annotation(665,673,Token,features=Features({'_i': 148, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'climb', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 992864226358664410, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBG', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'ccomp', 'head': 168, 'left_edge': 248, 'right_edge': 257}),id=256), Annotation(673,674,Token,features=Features({'_i': 149, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 257, 'right_edge': 257}),id=257), Annotation(674,675,Token,features=Features({'_i': 150, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 258, 'right_edge': 258}),id=258), Annotation(675,676,SpaceToken,features=Features({'is_space': True}),id=259), Annotation(676,678,SpaceToken,features=Features({'_i': 151, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 265, 'left_edge': 260, 'right_edge': 260}),id=260), Annotation(678,682,Token,features=Features({'_i': 152, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 265, 'left_edge': 261, 'right_edge': 261}),id=261), Annotation(682,683,SpaceToken,features=Features({'is_space': True}),id=262), Annotation(683,685,Token,features=Features({'_i': 153, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 265, 'left_edge': 263, 'right_edge': 263}),id=263), Annotation(683,685,NounChunk,features=Features({}),id=379), Annotation(685,686,SpaceToken,features=Features({'is_space': True}),id=264), Annotation(686,690,Token,features=Features({'_i': 154, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'VERB', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'ROOT', 'head': 265, 'left_edge': 163, 'right_edge': 316}),id=265), Annotation(690,691,SpaceToken,features=Features({'is_space': True}),id=266), Annotation(691,692,Token,features=Features({'_i': 155, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 269, 'left_edge': 267, 'right_edge': 267}),id=267), Annotation(691,714,NounChunk,features=Features({}),id=380), Annotation(692,693,SpaceToken,features=Features({'is_space': True}),id=268), Annotation(693,696,Token,features=Features({'_i': 156, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 273, 'left_edge': 267, 'right_edge': 269}),id=269), Annotation(696,697,SpaceToken,features=Features({'is_space': True}),id=270), Annotation(697,706,Token,features=Features({'_i': 157, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'different', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17379251038783395147, 'pos': 'ADJ', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'amod', 'head': 273, 'left_edge': 271, 'right_edge': 271}),id=271), Annotation(706,707,SpaceToken,features=Features({'is_space': True}),id=272), Annotation(707,714,Token,features=Features({'_i': 158, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'script', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5755375431925009742, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'pts', 'ent_type': '', 'dep': 'dobj', 'head': 265, 'left_edge': 267, 'right_edge': 315}),id=273), Annotation(714,715,Token,features=Features({'_i': 159, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 273, 'left_edge': 274, 'right_edge': 274}),id=274), Annotation(715,716,SpaceToken,features=Features({'is_space': True}),id=275), Annotation(716,720,Token,features=Features({'_i': 160, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'e.g.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2360781233418247603, 'pos': 'ADV', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'x.x.', 'suffix': '.g.', 'ent_type': '', 'dep': 'advmod', 'head': 280, 'left_edge': 276, 'right_edge': 276}),id=276), Annotation(716,730,NounChunk,features=Features({}),id=381), Annotation(720,721,SpaceToken,features=Features({'is_space': True}),id=277), Annotation(721,727,Token,features=Features({'_i': 161, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hangul', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13901273419332601270, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gul', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 280, 'left_edge': 278, 'right_edge': 278}),id=278), Annotation(721,727,PERSON,features=Features({'lemma': 'Hangul'}),id=331), Annotation(727,728,SpaceToken,features=Features({'is_space': True}),id=279), Annotation(728,730,Token,features=Features({'_i': 162, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '한글', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14226613016943586999, 'pos': 'PROPN', 'prefix': '한', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '한글', 'ent_type': '', 'dep': 'appos', 'head': 273, 'left_edge': 276, 'right_edge': 282}),id=280), Annotation(730,731,SpaceToken,features=Features({'is_space': True}),id=281), Annotation(731,733,Token,features=Features({'_i': 163, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 280, 'left_edge': 282, 'right_edge': 282}),id=282), Annotation(733,734,SpaceToken,features=Features({'is_space': True}),id=283), Annotation(734,735,SpaceToken,features=Features({'_i': 164, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 273, 'left_edge': 284, 'right_edge': 284}),id=284), Annotation(735,745,Token,features=Features({'_i': 165, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'simplify', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15400283218552125021, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'ied', 'ent_type': '', 'dep': 'amod', 'head': 289, 'left_edge': 285, 'right_edge': 285}),id=285), Annotation(735,754,NounChunk,features=Features({}),id=382), Annotation(745,746,SpaceToken,features=Features({'is_space': True}),id=286), Annotation(746,751,Token,features=Features({'_i': 166, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hanzi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7162399416189955260, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nzi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 289, 'left_edge': 287, 'right_edge': 287}),id=287), Annotation(746,751,PERSON,features=Features({'lemma': 'Hanzi'}),id=332), Annotation(751,752,SpaceToken,features=Features({'is_space': True}),id=288), Annotation(752,754,Token,features=Features({'_i': 167, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '汉字', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2455570606503990670, 'pos': 'PROPN', 'prefix': '汉', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '汉字', 'ent_type': '', 'dep': 'conj', 'head': 273, 'left_edge': 285, 'right_edge': 315}),id=289), Annotation(754,755,SpaceToken,features=Features({'is_space': True}),id=290), Annotation(755,757,Token,features=Features({'_i': 168, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 289, 'left_edge': 291, 'right_edge': 291}),id=291), Annotation(757,758,SpaceToken,features=Features({'is_space': True}),id=292), Annotation(758,763,Token,features=Features({'_i': 169, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Farsi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1631855311088449743, 'pos': 'PROPN', 'prefix': 'F', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'rsi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 295, 'left_edge': 293, 'right_edge': 293}),id=293), Annotation(758,763,PERSON,features=Features({'lemma': 'Farsi'}),id=333), Annotation(758,769,NounChunk,features=Features({}),id=383), Annotation(763,764,SpaceToken,features=Features({'is_space': True}),id=294), Annotation(764,769,Token,features=Features({'_i': 170, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'فارسی', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10192536850399601059, 'pos': 'NOUN', 'prefix': 'ف', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'رسی', 'ent_type': '', 'dep': 'conj', 'head': 289, 'left_edge': 293, 'right_edge': 299}),id=295), Annotation(769,770,SpaceToken,features=Features({'is_space': True}),id=296), Annotation(770,773,Token,features=Features({'_i': 171, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 295, 'left_edge': 297, 'right_edge': 297}),id=297), Annotation(773,774,SpaceToken,features=Features({'is_space': True}),id=298), Annotation(774,780,Token,features=Features({'_i': 172, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Arabic', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6558098436421695236, 'pos': 'PROPN', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'bic', 'ent_type': 'LANGUAGE', 'dep': 'conj', 'head': 295, 'left_edge': 299, 'right_edge': 299}),id=299), Annotation(774,780,LANGUAGE,features=Features({'lemma': 'Arabic'}),id=334), Annotation(774,780,NounChunk,features=Features({}),id=384), Annotation(780,781,SpaceToken,features=Features({'is_space': True}),id=300), Annotation(781,782,Token,features=Features({'_i': 173, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 301, 'right_edge': 301}),id=301), Annotation(782,797,Token,features=Features({'_i': 174, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'اَلْعَرَبِيَّةُ', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16849633969024660305, 'pos': 'NUM', 'prefix': 'ا', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xَxْxَxَxِxَّxُ', 'suffix': 'ّةُ', 'ent_type': 'GPE', 'dep': 'appos', 'head': 289, 'left_edge': 302, 'right_edge': 302}),id=302), Annotation(782,797,GPE,features=Features({'lemma': 'اَلْعَرَبِيَّةُ'}),id=335), Annotation(797,798,Token,features=Features({'_i': 175, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 303, 'right_edge': 303}),id=303), Annotation(798,799,SpaceToken,features=Features({'is_space': True}),id=304), Annotation(799,804,Token,features=Features({'_i': 176, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'which', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7063653163634019529, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'ich', 'ent_type': '', 'dep': 'nsubj', 'head': 307, 'left_edge': 305, 'right_edge': 305}),id=305), Annotation(799,804,NounChunk,features=Features({}),id=385), Annotation(804,805,SpaceToken,features=Features({'is_space': True}),id=306), Annotation(805,809,Token,features=Features({'_i': 177, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'go', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18224404223277189725, 'pos': 'VERB', 'prefix': 'g', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'oes', 'ent_type': '', 'dep': 'relcl', 'head': 289, 'left_edge': 305, 'right_edge': 315}),id=307), Annotation(809,810,SpaceToken,features=Features({'is_space': True}),id=308), Annotation(810,814,Token,features=Features({'_i': 178, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 307, 'left_edge': 309, 'right_edge': 315}),id=309), Annotation(814,815,SpaceToken,features=Features({'is_space': True}),id=310), Annotation(815,820,Token,features=Features({'_i': 179, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'right', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5943797630011647483, 'pos': 'NOUN', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ght', 'ent_type': '', 'dep': 'advmod', 'head': 313, 'left_edge': 311, 'right_edge': 311}),id=311), Annotation(820,821,SpaceToken,features=Features({'is_space': True}),id=312), Annotation(821,823,Token,features=Features({'_i': 180, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'AUX', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 315, 'left_edge': 311, 'right_edge': 313}),id=313), Annotation(823,824,SpaceToken,features=Features({'is_space': True}),id=314), Annotation(824,828,Token,features=Features({'_i': 181, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'leave', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4003250512168697582, 'pos': 'VERB', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'eft', 'ent_type': '', 'dep': 'pcomp', 'head': 309, 'left_edge': 311, 'right_edge': 315}),id=315), Annotation(828,829,Token,features=Features({'_i': 182, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 316, 'right_edge': 316}),id=316), Annotation(829,831,SpaceToken,features=Features({'_i': 183, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 317, 'left_edge': 317, 'right_edge': 317}),id=317), Annotation(829,831,Sentence,features=Features({}),id=341)]), 'Stanza': AnnotationSet([Annotation(0,4,Token,features=Features({'text': 'This', 'lemma': 'this', 'upos': 'PRON', 'xpos': 'DT', 'Number': 'Sing', 'PronType': 'Dem', 'head': 4, 'deprel': 'nsubj', 'ner': 'O'}),id=0), Annotation(0,24,Sentence,features=Features({}),id=6), Annotation(5,7,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 4, 'deprel': 'cop', 'ner': 'O'}),id=1), Annotation(8,9,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 4, 'deprel': 'det', 'ner': 'O'}),id=2), Annotation(10,14,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 4, 'deprel': 'compound', 'ner': 'O'}),id=3), Annotation(15,23,Token,features=Features({'text': 'document', 'lemma': 'document', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 6, 'deprel': 'root', 'ner': 'O'}),id=4), Annotation(23,24,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 4, 'deprel': 'punct', 'ner': 'O'}),id=5), Annotation(26,28,Token,features=Features({'text': 'It', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 8, 'deprel': 'nsubj', 'ner': 'O'}),id=7), Annotation(26,59,Sentence,features=Features({}),id=14), Annotation(29,37,Token,features=Features({'text': 'contains', 'lemma': 'contain', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 14, 'deprel': 'root', 'ner': 'O'}),id=8), Annotation(38,42,Token,features=Features({'text': 'just', 'lemma': 'just', 'upos': 'ADV', 'xpos': 'RB', 'head': 12, 'deprel': 'advmod', 'ner': 'O'}),id=9), Annotation(43,44,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 12, 'deprel': 'det', 'ner': 'O'}),id=10), Annotation(45,48,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 12, 'deprel': 'amod', 'ner': 'O'}),id=11), Annotation(49,58,Token,features=Features({'text': 'sentences', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 8, 'deprel': 'obj', 'ner': 'O'}),id=12), Annotation(58,59,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 8, 'deprel': 'punct', 'ner': 'O'}),id=13), Annotation(61,65,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 54, 'deprel': 'root', 'ner': 'O'}),id=15), Annotation(61,266,Sentence,features=Features({}),id=54), Annotation(66,68,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 15, 'deprel': 'cop', 'ner': 'O'}),id=16), Annotation(69,70,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 18, 'deprel': 'det', 'ner': 'O'}),id=17), Annotation(71,79,Token,features=Features({'text': 'sentence', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 15, 'deprel': 'nsubj', 'ner': 'O'}),id=18), Annotation(80,84,Token,features=Features({'text': 'that', 'lemma': 'that', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 20, 'deprel': 'nsubj', 'ner': 'O'}),id=19), Annotation(85,93,Token,features=Features({'text': 'mentions', 'lemma': 'mention', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 18, 'deprel': 'acl:relcl', 'ner': 'O'}),id=20), Annotation(94,95,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 24, 'deprel': 'det', 'ner': 'O'}),id=21), Annotation(96,99,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=22), Annotation(100,105,Token,features=Features({'text': 'named', 'lemma': 'name', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=23), Annotation(106,114,Token,features=Features({'text': 'entities', 'lemma': 'entity', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 20, 'deprel': 'obj', 'ner': 'O'}),id=24), Annotation(115,119,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 27, 'deprel': 'case', 'ner': 'O'}),id=25), Annotation(121,124,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 27, 'deprel': 'det', 'ner': 'O'}),id=26), Annotation(125,132,Token,features=Features({'text': 'persons', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 24, 'deprel': 'nmod', 'ner': 'O'}),id=27), Annotation(133,139,Token,features=Features({'text': 'Barack', 'lemma': 'Barack', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 27, 'deprel': 'appos', 'ner': 'B-PERSON'}),id=28), Annotation(133,145,PERSON,features=Features({}),id=175), Annotation(140,145,Token,features=Features({'text': 'Obama', 'lemma': 'Obama', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=29), Annotation(146,148,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 31, 'deprel': 'cc', 'ner': 'O'}),id=30), Annotation(149,155,Token,features=Features({'text': 'Ursula', 'lemma': 'Ursula', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'conj', 'ner': 'B-PERSON'}),id=31), Annotation(149,169,PERSON,features=Features({}),id=176), Annotation(156,159,Token,features=Features({'text': 'von', 'lemma': 'von', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=32), Annotation(160,163,Token,features=Features({'text': 'der', 'lemma': 'der', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=33), Annotation(164,169,Token,features=Features({'text': 'Leyen', 'lemma': 'Leyen', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=34), Annotation(169,170,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 36, 'deprel': 'punct', 'ner': 'O'}),id=35), Annotation(171,180,Token,features=Features({'text': 'locations', 'lemma': 'location', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 27, 'deprel': 'conj', 'ner': 'O'}),id=36), Annotation(181,185,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 40, 'deprel': 'case', 'ner': 'O'}),id=37), Annotation(186,189,Token,features=Features({'text': 'New', 'lemma': 'New', 'upos': 'ADJ', 'xpos': 'NNP', 'Degree': 'Pos', 'head': 40, 'deprel': 'amod', 'ner': 'B-GPE'}),id=38), Annotation(186,199,GPE,features=Features({}),id=177), Annotation(190,194,Token,features=Features({'text': 'York', 'lemma': 'York', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'compound', 'ner': 'I-GPE'}),id=39), Annotation(195,199,Token,features=Features({'text': 'City', 'lemma': 'City', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 36, 'deprel': 'nmod', 'ner': 'E-GPE'}),id=40), Annotation(199,200,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 42, 'deprel': 'punct', 'ner': 'O'}),id=41), Annotation(201,207,Token,features=Features({'text': 'Vienna', 'lemma': 'Vienna', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=42), Annotation(201,207,GPE,features=Features({}),id=178), Annotation(208,210,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 44, 'deprel': 'cc', 'ner': 'O'}),id=43), Annotation(211,218,Token,features=Features({'text': 'Beijing', 'lemma': 'Beijing', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=44), Annotation(211,218,GPE,features=Features({}),id=179), Annotation(219,221,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 46, 'deprel': 'cc', 'ner': 'O'}),id=45), Annotation(222,231,Token,features=Features({'text': 'companies', 'lemma': 'company', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 40, 'deprel': 'conj', 'ner': 'O'}),id=46), Annotation(232,236,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 48, 'deprel': 'case', 'ner': 'O'}),id=47), Annotation(238,244,Token,features=Features({'text': 'Google', 'lemma': 'Google', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 46, 'deprel': 'nmod', 'ner': 'S-ORG'}),id=48), Annotation(238,244,ORG,features=Features({}),id=180), Annotation(244,245,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 50, 'deprel': 'punct', 'ner': 'O'}),id=49), Annotation(246,255,Token,features=Features({'text': 'UniCredit', 'lemma': 'UniCredit', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=50), Annotation(246,255,ORG,features=Features({}),id=181), Annotation(256,258,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 52, 'deprel': 'cc', 'ner': 'O'}),id=51), Annotation(259,265,Token,features=Features({'text': 'Huawei', 'lemma': 'Huawei', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=52), Annotation(259,265,ORG,features=Features({}),id=182), Annotation(265,266,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 15, 'deprel': 'punct', 'ner': 'O'}),id=53), Annotation(267,270,Token,features=Features({'text': 'And', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 67, 'deprel': 'cc', 'ner': 'O'}),id=55), Annotation(267,322,Sentence,features=Features({}),id=69), Annotation(271,275,Token,features=Features({'text': 'here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 69, 'deprel': 'root', 'ner': 'O'}),id=56), Annotation(276,278,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 56, 'deprel': 'cop', 'ner': 'O'}),id=57), Annotation(279,285,Token,features=Features({'text': 'Donald', 'lemma': 'Donald', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 56, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=58), Annotation(279,291,PERSON,features=Features({}),id=183), Annotation(286,291,Token,features=Features({'text': 'Trump', 'lemma': 'Trump', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 58, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=59), Annotation(291,292,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 56, 'deprel': 'punct', 'ner': 'O'}),id=60), Annotation(293,295,Token,features=Features({'text': 'it', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 67, 'deprel': 'nsubj', 'ner': 'O'}),id=61), Annotation(296,299,Token,features=Features({'text': 'may', 'lemma': 'may', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 67, 'deprel': 'aux', 'ner': 'O'}),id=62), Annotation(300,303,Token,features=Features({'text': 'not', 'lemma': 'not', 'upos': 'PART', 'xpos': 'RB', 'head': 67, 'deprel': 'advmod', 'ner': 'O'}),id=63), Annotation(304,306,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 67, 'deprel': 'cop', 'ner': 'O'}),id=64), Annotation(307,310,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 67, 'deprel': 'det', 'ner': 'O'}),id=65), Annotation(311,315,Token,features=Features({'text': 'real', 'lemma': 'real', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 67, 'deprel': 'amod', 'ner': 'O'}),id=66), Annotation(316,319,Token,features=Features({'text': 'one', 'lemma': 'one', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 56, 'deprel': 'parataxis', 'ner': 'O'}),id=67), Annotation(320,322,Token,features=Features({'text': ':P', 'lemma': ':p', 'upos': 'SYM', 'xpos': 'NFP', 'head': 56, 'deprel': 'discourse', 'ner': 'O'}),id=68), Annotation(324,328,Token,features=Features({'text': 'Lets', 'lemma': 'let', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 89, 'deprel': 'root', 'ner': 'O'}),id=70), Annotation(324,422,Sentence,features=Features({}),id=89), Annotation(329,332,Token,features=Features({'text': 'say', 'lemma': 'say', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 70, 'deprel': 'xcomp', 'ner': 'O'}),id=71), Annotation(333,338,Token,features=Features({'text': 'Boris', 'lemma': 'Boris', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 84, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=72), Annotation(333,346,PERSON,features=Features({}),id=184), Annotation(339,346,Token,features=Features({'text': 'Johnson', 'lemma': 'Johnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 72, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=73), Annotation(347,350,Token,features=Features({'text': 'aka', 'lemma': 'aka', 'upos': 'ADP', 'xpos': 'IN', 'head': 76, 'deprel': 'case', 'ner': 'O'}),id=74), Annotation(351,355,Token,features=Features({'text': 'Bojo', 'lemma': 'Bojo', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 76, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=75), Annotation(351,355,PERSON,features=Features({}),id=185), Annotation(356,362,Token,features=Features({'text': 'tweets', 'lemma': 'tweet', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 72, 'deprel': 'nmod', 'ner': 'O'}),id=76), Annotation(363,367,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 80, 'deprel': 'case', 'ner': 'O'}),id=77), Annotation(368,371,Token,features=Features({'text': 'his', 'lemma': 'he', 'upos': 'PRON', 'xpos': 'PRP$', 'Gender': 'Masc', 'Number': 'Sing', 'Person': '3', 'Poss': 'Yes', 'PronType': 'Prs', 'head': 80, 'deprel': 'nmod:poss', 'ner': 'O'}),id=78), Annotation(372,384,Token,features=Features({'text': 'BorisJohnson', 'lemma': 'BorisJohnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 80, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=79), Annotation(372,384,PERSON,features=Features({}),id=186), Annotation(385,392,Token,features=Features({'text': 'account', 'lemma': 'account', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 76, 'deprel': 'nmod', 'ner': 'O'}),id=80), Annotation(392,393,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 84, 'deprel': 'punct', 'ner': 'O'}),id=81), Annotation(394,399,Token,features=Features({'text': 'would', 'lemma': 'would', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 84, 'deprel': 'aux', 'ner': 'O'}),id=82), Annotation(400,402,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'cop', 'ner': 'O'}),id=83), Annotation(403,407,Token,features=Features({'text': 'nice', 'lemma': 'nice', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 70, 'deprel': 'parataxis', 'ner': 'O'}),id=84), Annotation(408,410,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'PART', 'xpos': 'TO', 'head': 86, 'deprel': 'mark', 'ner': 'O'}),id=85), Annotation(411,416,Token,features=Features({'text': 'match', 'lemma': 'match', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'advcl', 'ner': 'O'}),id=86), Annotation(417,421,Token,features=Features({'text': 'them', 'lemma': 'they', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Acc', 'Number': 'Plur', 'Person': '3', 'PronType': 'Prs', 'head': 86, 'deprel': 'obj', 'ner': 'O'}),id=87), Annotation(421,422,Token,features=Features({'text': '!', 'lemma': '!', 'upos': 'PUNCT', 'xpos': '.', 'head': 70, 'deprel': 'punct', 'ner': 'O'}),id=88), Annotation(424,428,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 92, 'deprel': 'advmod', 'ner': 'O'}),id=90), Annotation(424,675,Sentence,features=Features({}),id=143), Annotation(429,431,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 92, 'deprel': 'nsubj', 'ner': 'O'}),id=91), Annotation(432,439,Token,features=Features({'text': 'include', 'lemma': 'include', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 143, 'deprel': 'root', 'ner': 'O'}),id=92), Annotation(440,441,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 94, 'deprel': 'det', 'ner': 'O'}),id=93), Annotation(442,445,Token,features=Features({'text': 'URL', 'lemma': 'URL', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 92, 'deprel': 'obj', 'ner': 'O'}),id=94), Annotation(446,487,Token,features=Features({'text': 'https://gatenlp.github.io/python-gatenlp/', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'upos': 'X', 'xpos': 'ADD', 'head': 94, 'deprel': 'appos', 'ner': 'O'}),id=95), Annotation(489,492,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 100, 'deprel': 'cc', 'ner': 'O'}),id=96), Annotation(493,494,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 100, 'deprel': 'det', 'ner': 'O'}),id=97), Annotation(495,499,Token,features=Features({'text': 'fake', 'lemma': 'fake', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 100, 'deprel': 'amod', 'ner': 'O'}),id=98), Annotation(500,505,Token,features=Features({'text': 'email', 'lemma': 'email', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 100, 'deprel': 'compound', 'ner': 'O'}),id=99), Annotation(506,513,Token,features=Features({'text': 'address', 'lemma': 'address', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=100), Annotation(514,540,Token,features=Features({'text': 'john.doe@hiscoolserver.com', 'lemma': 'john.doe@hiscoolserver.com', 'upos': 'X', 'xpos': 'ADD', 'head': 100, 'deprel': 'appos', 'ner': 'O'}),id=101), Annotation(541,543,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADV', 'xpos': 'RB', 'head': 110, 'deprel': 'cc', 'ner': 'O'}),id=102), Annotation(544,548,Token,features=Features({'text': 'well', 'lemma': 'well', 'upos': 'ADV', 'xpos': 'RB', 'Degree': 'Pos', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=103), Annotation(550,552,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADP', 'xpos': 'IN', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=104), Annotation(553,554,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=105), Annotation(554,558,Token,features=Features({'text': 'some', 'lemma': 'some', 'upos': 'DET', 'xpos': 'DT', 'head': 110, 'deprel': 'det', 'ner': 'O'}),id=106), Annotation(559,560,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=107), Annotation(560,564,Token,features=Features({'text': 'cool', 'lemma': 'cool', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 110, 'deprel': 'amod', 'ner': 'O'}),id=108), Annotation(565,566,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=109), Annotation(566,573,Token,features=Features({'text': 'hastags', 'lemma': 'hastag', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=110), Annotation(574,577,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 113, 'deprel': 'cc', 'ner': 'O'}),id=111), Annotation(578,579,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 113, 'deprel': 'det', 'ner': 'O'}),id=112), Annotation(580,585,Token,features=Features({'text': 'bunch', 'lemma': 'bunch', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=113), Annotation(586,588,Token,features=Features({'text': 'of', 'lemma': 'of', 'upos': 'ADP', 'xpos': 'IN', 'head': 115, 'deprel': 'case', 'ner': 'O'}),id=114), Annotation(589,595,Token,features=Features({'text': 'emojis', 'lemma': 'emojis', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 113, 'deprel': 'nmod', 'ner': 'O'}),id=115), Annotation(596,600,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 117, 'deprel': 'case', 'ner': 'O'}),id=116), Annotation(601,602,Token,features=Features({'text': '😽', 'lemma': '😽', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 115, 'deprel': 'nmod', 'ner': 'O'}),id=117), Annotation(603,604,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=118), Annotation(604,605,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 121, 'deprel': 'det', 'ner': 'O'}),id=119), Annotation(606,613,Token,features=Features({'text': 'kissing', 'lemma': 'kissing', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 121, 'deprel': 'compound', 'ner': 'O'}),id=120), Annotation(614,617,Token,features=Features({'text': 'cat', 'lemma': 'cat', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'appos', 'ner': 'O'}),id=121), Annotation(617,618,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=122), Annotation(618,619,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 124, 'deprel': 'punct', 'ner': 'O'}),id=123), Annotation(620,623,Token,features=Features({'text': '👩\\u200d🏫', 'lemma': '👩\\u200d🏫', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=124), Annotation(624,625,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=125), Annotation(625,626,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 128, 'deprel': 'det', 'ner': 'O'}),id=126), Annotation(627,632,Token,features=Features({'text': 'woman', 'lemma': 'woman', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 128, 'deprel': 'compound', 'ner': 'O'}),id=127), Annotation(633,640,Token,features=Features({'text': 'teacher', 'lemma': 'teacher', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 124, 'deprel': 'appos', 'ner': 'O'}),id=128), Annotation(640,641,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=129), Annotation(641,642,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 131, 'deprel': 'punct', 'ner': 'O'}),id=130), Annotation(643,644,Token,features=Features({'text': '🧬', 'lemma': '🧬', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=131), Annotation(645,646,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=132), Annotation(646,649,Token,features=Features({'text': 'DNA', 'lemma': 'dna', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 131, 'deprel': 'appos', 'ner': 'O'}),id=133), Annotation(649,650,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=134), Annotation(650,651,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 136, 'deprel': 'punct', 'ner': 'O'}),id=135), Annotation(653,654,Token,features=Features({'text': '🧗', 'lemma': '🧗', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=136), Annotation(655,656,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=137), Annotation(656,657,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 139, 'deprel': 'det', 'ner': 'O'}),id=138), Annotation(658,664,Token,features=Features({'text': 'person', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 140, 'deprel': 'nsubj', 'ner': 'O'}),id=139), Annotation(665,673,Token,features=Features({'text': 'climbing', 'lemma': 'climb', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 136, 'deprel': 'appos', 'ner': 'O'}),id=140), Annotation(673,674,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=141), Annotation(674,675,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 92, 'deprel': 'punct', 'ner': 'O'}),id=142), Annotation(678,682,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 146, 'deprel': 'advmod', 'ner': 'O'}),id=144), Annotation(678,829,Sentence,features=Features({}),id=174), Annotation(683,685,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 146, 'deprel': 'nsubj', 'ner': 'O'}),id=145), Annotation(686,690,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Number': 'Plur', 'Person': '1', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 174, 'deprel': 'root', 'ner': 'O'}),id=146), Annotation(691,692,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 150, 'deprel': 'det', 'ner': 'O'}),id=147), Annotation(693,696,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=148), Annotation(697,706,Token,features=Features({'text': 'different', 'lemma': 'different', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=149), Annotation(707,714,Token,features=Features({'text': 'scripts', 'lemma': 'script', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 146, 'deprel': 'obj', 'ner': 'O'}),id=150), Annotation(714,715,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 153, 'deprel': 'punct', 'ner': 'O'}),id=151), Annotation(716,720,Token,features=Features({'text': 'e.g.', 'lemma': 'e.g.', 'upos': 'ADV', 'xpos': 'FW', 'Abbr': 'Yes', 'head': 153, 'deprel': 'advmod', 'ner': 'O'}),id=152), Annotation(721,727,Token,features=Features({'text': 'Hangul', 'lemma': 'Hangul', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'compound', 'ner': 'O'}),id=153), Annotation(728,730,Token,features=Features({'text': '한글', 'lemma': '한글', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=154), Annotation(731,733,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 158, 'deprel': 'cc', 'ner': 'O'}),id=155), Annotation(735,745,Token,features=Features({'text': 'simplified', 'lemma': 'simplify', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 158, 'deprel': 'amod', 'ner': 'O'}),id=156), Annotation(746,751,Token,features=Features({'text': 'Hanzi', 'lemma': 'Hanzi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 158, 'deprel': 'compound', 'ner': 'O'}),id=157), Annotation(752,754,Token,features=Features({'text': '汉字', 'lemma': '汉字', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=158), Annotation(755,757,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 161, 'deprel': 'cc', 'ner': 'O'}),id=159), Annotation(758,763,Token,features=Features({'text': 'Farsi', 'lemma': 'Farsi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'compound', 'ner': 'S-LANGUAGE'}),id=160), Annotation(758,763,LANGUAGE,features=Features({}),id=187), Annotation(764,769,Token,features=Features({'text': 'فارسی', 'lemma': 'فارسی', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=161), Annotation(770,773,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 163, 'deprel': 'cc', 'ner': 'O'}),id=162), Annotation(774,780,Token,features=Features({'text': 'Arabic', 'lemma': 'Arabic', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'conj', 'ner': 'S-LANGUAGE'}),id=163), Annotation(774,780,LANGUAGE,features=Features({}),id=188), Annotation(781,782,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 165, 'deprel': 'punct', 'ner': 'O'}),id=164), Annotation(782,797,Token,features=Features({'text': 'اَلْعَرَبِيَّةُ', 'lemma': 'gamommamgeramam', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=165), Annotation(797,798,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 168, 'deprel': 'punct', 'ner': 'O'}),id=166), Annotation(799,804,Token,features=Features({'text': 'which', 'lemma': 'which', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 168, 'deprel': 'nsubj', 'ner': 'O'}),id=167), Annotation(805,809,Token,features=Features({'text': 'goes', 'lemma': 'go', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 150, 'deprel': 'acl:relcl', 'ner': 'O'}),id=168), Annotation(810,814,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 170, 'deprel': 'case', 'ner': 'O'}),id=169), Annotation(815,820,Token,features=Features({'text': 'right', 'lemma': 'right', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 168, 'deprel': 'obl', 'ner': 'O'}),id=170), Annotation(821,823,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'ADP', 'xpos': 'IN', 'head': 172, 'deprel': 'case', 'ner': 'O'}),id=171), Annotation(824,828,Token,features=Features({'text': 'left', 'lemma': 'leave', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 168, 'deprel': 'advcl', 'ner': 'O'}),id=172), Annotation(828,829,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 146, 'deprel': 'punct', 'ner': 'O'}),id=173)]), '': AnnotationSet([Annotation(0,4,Token,features=Features({}),id=0), Annotation(5,7,Token,features=Features({}),id=1), Annotation(8,9,Token,features=Features({}),id=2), Annotation(10,14,Token,features=Features({}),id=3), Annotation(15,24,Token,features=Features({}),id=4), Annotation(26,28,Token,features=Features({}),id=5), Annotation(29,37,Token,features=Features({}),id=6), Annotation(38,42,Token,features=Features({}),id=7), Annotation(43,44,Token,features=Features({}),id=8), Annotation(45,48,Token,features=Features({}),id=9), Annotation(49,59,Token,features=Features({}),id=10), Annotation(61,65,Token,features=Features({}),id=11), Annotation(66,68,Token,features=Features({}),id=12), Annotation(69,70,Token,features=Features({}),id=13), Annotation(71,79,Token,features=Features({}),id=14), Annotation(80,84,Token,features=Features({}),id=15), Annotation(85,93,Token,features=Features({}),id=16), Annotation(94,95,Token,features=Features({}),id=17), Annotation(96,99,Token,features=Features({}),id=18), Annotation(100,105,Token,features=Features({}),id=19), Annotation(106,114,Token,features=Features({}),id=20), Annotation(115,119,Token,features=Features({}),id=21), Annotation(121,124,Token,features=Features({}),id=22), Annotation(125,132,Token,features=Features({}),id=23), Annotation(133,139,Token,features=Features({}),id=24), Annotation(140,145,Token,features=Features({}),id=25), Annotation(146,148,Token,features=Features({}),id=26), Annotation(149,155,Token,features=Features({}),id=27), Annotation(156,159,Token,features=Features({}),id=28), Annotation(160,163,Token,features=Features({}),id=29), Annotation(164,169,Token,features=Features({}),id=30), Annotation(169,170,Token,features=Features({}),id=31), Annotation(171,180,Token,features=Features({}),id=32), Annotation(181,185,Token,features=Features({}),id=33), Annotation(186,189,Token,features=Features({}),id=34), Annotation(190,194,Token,features=Features({}),id=35), Annotation(195,199,Token,features=Features({}),id=36), Annotation(199,200,Token,features=Features({}),id=37), Annotation(201,207,Token,features=Features({}),id=38), Annotation(208,210,Token,features=Features({}),id=39), Annotation(211,218,Token,features=Features({}),id=40), Annotation(219,221,Token,features=Features({}),id=41), Annotation(222,231,Token,features=Features({}),id=42), Annotation(232,236,Token,features=Features({}),id=43), Annotation(238,244,Token,features=Features({}),id=44), Annotation(244,245,Token,features=Features({}),id=45), Annotation(246,255,Token,features=Features({}),id=46), Annotation(256,258,Token,features=Features({}),id=47), Annotation(259,266,Token,features=Features({}),id=48), Annotation(267,270,Token,features=Features({}),id=49), Annotation(271,275,Token,features=Features({}),id=50), Annotation(276,278,Token,features=Features({}),id=51), Annotation(279,285,Token,features=Features({}),id=52), Annotation(286,291,Token,features=Features({}),id=53), Annotation(291,292,Token,features=Features({}),id=54), Annotation(293,295,Token,features=Features({}),id=55), Annotation(296,299,Token,features=Features({}),id=56), Annotation(300,303,Token,features=Features({}),id=57), Annotation(304,306,Token,features=Features({}),id=58), Annotation(307,310,Token,features=Features({}),id=59), Annotation(311,315,Token,features=Features({}),id=60), Annotation(316,319,Token,features=Features({}),id=61), Annotation(320,321,Token,features=Features({}),id=62), Annotation(321,322,Token,features=Features({}),id=63), Annotation(324,328,Token,features=Features({}),id=64), Annotation(329,332,Token,features=Features({}),id=65), Annotation(333,338,Token,features=Features({}),id=66), Annotation(339,346,Token,features=Features({}),id=67), Annotation(347,350,Token,features=Features({}),id=68), Annotation(351,355,Token,features=Features({}),id=69), Annotation(356,362,Token,features=Features({}),id=70), Annotation(363,367,Token,features=Features({}),id=71), Annotation(368,371,Token,features=Features({}),id=72), Annotation(372,384,Token,features=Features({}),id=73), Annotation(385,392,Token,features=Features({}),id=74), Annotation(392,393,Token,features=Features({}),id=75), Annotation(394,399,Token,features=Features({}),id=76), Annotation(400,402,Token,features=Features({}),id=77), Annotation(403,407,Token,features=Features({}),id=78), Annotation(408,410,Token,features=Features({}),id=79), Annotation(411,416,Token,features=Features({}),id=80), Annotation(417,421,Token,features=Features({}),id=81), Annotation(421,422,Token,features=Features({}),id=82), Annotation(424,428,Token,features=Features({}),id=83), Annotation(429,431,Token,features=Features({}),id=84), Annotation(432,439,Token,features=Features({}),id=85), Annotation(440,441,Token,features=Features({}),id=86), Annotation(442,445,Token,features=Features({}),id=87), Annotation(446,451,Token,features=Features({}),id=88), Annotation(451,452,Token,features=Features({}),id=89), Annotation(452,487,Token,features=Features({}),id=90), Annotation(489,492,Token,features=Features({}),id=91), Annotation(493,494,Token,features=Features({}),id=92), Annotation(495,499,Token,features=Features({}),id=93), Annotation(500,505,Token,features=Features({}),id=94), Annotation(506,513,Token,features=Features({}),id=95), Annotation(514,522,Token,features=Features({}),id=96), Annotation(522,523,Token,features=Features({}),id=97), Annotation(523,540,Token,features=Features({}),id=98), Annotation(541,543,Token,features=Features({}),id=99), Annotation(544,548,Token,features=Features({}),id=100), Annotation(550,552,Token,features=Features({}),id=101), Annotation(553,554,Token,features=Features({}),id=102), Annotation(554,558,Token,features=Features({}),id=103), Annotation(559,560,Token,features=Features({}),id=104), Annotation(560,564,Token,features=Features({}),id=105), Annotation(565,566,Token,features=Features({}),id=106), Annotation(566,573,Token,features=Features({}),id=107), Annotation(574,577,Token,features=Features({}),id=108), Annotation(578,579,Token,features=Features({}),id=109), Annotation(580,585,Token,features=Features({}),id=110), Annotation(586,588,Token,features=Features({}),id=111), Annotation(589,595,Token,features=Features({}),id=112), Annotation(596,600,Token,features=Features({}),id=113), Annotation(601,602,Token,features=Features({}),id=114), Annotation(603,604,Token,features=Features({}),id=115), Annotation(604,605,Token,features=Features({}),id=116), Annotation(606,613,Token,features=Features({}),id=117), Annotation(614,617,Token,features=Features({}),id=118), Annotation(617,618,Token,features=Features({}),id=119), Annotation(618,619,Token,features=Features({}),id=120), Annotation(620,623,Token,features=Features({}),id=121), Annotation(624,625,Token,features=Features({}),id=122), Annotation(625,626,Token,features=Features({}),id=123), Annotation(627,632,Token,features=Features({}),id=124), Annotation(633,640,Token,features=Features({}),id=125), Annotation(640,641,Token,features=Features({}),id=126), Annotation(641,642,Token,features=Features({}),id=127), Annotation(643,644,Token,features=Features({}),id=128), Annotation(645,646,Token,features=Features({}),id=129), Annotation(646,649,Token,features=Features({}),id=130), Annotation(649,650,Token,features=Features({}),id=131), Annotation(650,651,Token,features=Features({}),id=132), Annotation(653,654,Token,features=Features({}),id=133), Annotation(655,656,Token,features=Features({}),id=134), Annotation(656,657,Token,features=Features({}),id=135), Annotation(658,664,Token,features=Features({}),id=136), Annotation(665,673,Token,features=Features({}),id=137), Annotation(673,674,Token,features=Features({}),id=138), Annotation(674,675,Token,features=Features({}),id=139), Annotation(678,682,Token,features=Features({}),id=140), Annotation(683,685,Token,features=Features({}),id=141), Annotation(686,690,Token,features=Features({}),id=142), Annotation(691,692,Token,features=Features({}),id=143), Annotation(693,696,Token,features=Features({}),id=144), Annotation(697,706,Token,features=Features({}),id=145), Annotation(707,714,Token,features=Features({}),id=146), Annotation(714,715,Token,features=Features({}),id=147), Annotation(716,720,Token,features=Features({}),id=148), Annotation(721,727,Token,features=Features({}),id=149), Annotation(728,730,Token,features=Features({}),id=150), Annotation(731,733,Token,features=Features({}),id=151), Annotation(735,745,Token,features=Features({}),id=152), Annotation(746,751,Token,features=Features({}),id=153), Annotation(752,754,Token,features=Features({}),id=154), Annotation(755,757,Token,features=Features({}),id=155), Annotation(758,763,Token,features=Features({}),id=156), Annotation(764,769,Token,features=Features({}),id=157), Annotation(770,773,Token,features=Features({}),id=158), Annotation(774,780,Token,features=Features({}),id=159), Annotation(781,782,Token,features=Features({}),id=160), Annotation(782,797,Token,features=Features({}),id=161), Annotation(797,798,Token,features=Features({}),id=162), Annotation(799,804,Token,features=Features({}),id=163), Annotation(805,809,Token,features=Features({}),id=164), Annotation(810,814,Token,features=Features({}),id=165), Annotation(815,820,Token,features=Features({}),id=166), Annotation(821,823,Token,features=Features({}),id=167), Annotation(824,828,Token,features=Features({}),id=168), Annotation(828,829,Token,features=Features({}),id=169)])})" ] }, "execution_count": 50, "metadata": {}, "output_type": "execute_result" } ], "source": [ "doc2" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## TokenGazetteer: list\n", "\n", "1) Use prepared list with already tokenized entries\n" ] }, { "cell_type": "code", "execution_count": 51, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "from gatenlp.processing.gazetteer import TokenGazetteer\n", "gazlist1 = [\n", " ([\"Donald\", \"Trump\"], dict(what=\"person\", country=\"US\")),\n", " ([\"Boris\", \"Johnson\"], dict(what=\"person\", country=\"UK\")),\n", " ([\"Google\"], dict(what=\"company\", country=\"Everywhere, really!\")) \n", "]\n", "tgaz1 = TokenGazetteer(source=gazlist1, source_fmt=\"gazlist\", \n", " annset_name=\"\", outset_name=\"TGaz1\", ann_type=\"Lookup\")\n", "\n", "doc2 = tgaz1(doc2)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## TokenGazetteer: match" ] }, { "cell_type": "code", "execution_count": 52, "metadata": { "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "doc2.annset(\"TGaz1\").clear()" ] }, { "cell_type": "code", "execution_count": 53, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a test document.\n", "\n", "It contains just a few sentences. \n", "Here is a sentence that mentions a few named entities like \n", "the persons Barack Obama or Ursula von der Leyen, locations\n", "like New York City, Vienna or Beijing or companies like \n", "Google, UniCredit or Huawei. And here is Donald Trump, it may not be the real one :P\n", "\n", "Lets say Boris Johnson aka Bojo tweets from his BorisJohnson account, would be nice to match them!\n", "\n", "Here we include a URL https://gatenlp.github.io/python-gatenlp/ \n", "and a fake email address john.doe@hiscoolserver.com as well \n", "as #some #cool #hastags and a bunch of emojis like 😽 (a kissing cat),\n", "👩‍🏫 (a woman teacher), 🧬 (DNA), \n", "🧗 (a person climbing), \n", "\n", "Here we test a few different scripts, e.g. Hangul 한글 or \n", "simplified Hanzi 汉字 or Farsi فارسی and Arabic ,اَلْعَرَبِيَّةُ, which goes from right to left.\n", "\n", ",features=Features({}),anns={'Spacy': AnnotationSet([Annotation(0,4,Token,features=Features({'_i': 0, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'this', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12943039165150086467, 'pos': 'PRON', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'Xxxx', 'suffix': 'his', 'ent_type': '', 'dep': 'nsubj', 'head': 2, 'left_edge': 0, 'right_edge': 0}),id=0), Annotation(0,24,Sentence,features=Features({}),id=336), Annotation(0,4,NounChunk,features=Features({}),id=342), Annotation(4,5,SpaceToken,features=Features({'is_space': True}),id=1), Annotation(5,7,Token,features=Features({'_i': 1, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ROOT', 'head': 2, 'left_edge': 0, 'right_edge': 9}),id=2), Annotation(7,8,SpaceToken,features=Features({'is_space': True}),id=3), Annotation(8,9,Token,features=Features({'_i': 2, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 8, 'left_edge': 4, 'right_edge': 4}),id=4), Annotation(8,23,NounChunk,features=Features({}),id=343), Annotation(9,10,SpaceToken,features=Features({'is_space': True}),id=5), Annotation(10,14,Token,features=Features({'_i': 3, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'compound', 'head': 8, 'left_edge': 6, 'right_edge': 6}),id=6), Annotation(14,15,SpaceToken,features=Features({'is_space': True}),id=7), Annotation(15,23,Token,features=Features({'_i': 4, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'document', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2376024544662810659, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'attr', 'head': 2, 'left_edge': 4, 'right_edge': 8}),id=8), Annotation(23,24,Token,features=Features({'_i': 5, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 2, 'left_edge': 9, 'right_edge': 9}),id=9), Annotation(24,26,SpaceToken,features=Features({'_i': 6, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 13, 'left_edge': 10, 'right_edge': 10}),id=10), Annotation(24,59,Sentence,features=Features({}),id=337), Annotation(26,28,Token,features=Features({'_i': 7, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7859011591137717335, 'pos': 'PRON', 'prefix': 'I', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'Xx', 'suffix': 'It', 'ent_type': '', 'dep': 'nsubj', 'head': 13, 'left_edge': 11, 'right_edge': 11}),id=11), Annotation(26,28,NounChunk,features=Features({}),id=344), Annotation(28,29,SpaceToken,features=Features({'is_space': True}),id=12), Annotation(29,37,Token,features=Features({'_i': 8, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'contain', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4189568596636683296, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ins', 'ent_type': '', 'dep': 'ROOT', 'head': 13, 'left_edge': 10, 'right_edge': 22}),id=13), Annotation(37,38,SpaceToken,features=Features({'is_space': True}),id=14), Annotation(38,42,Token,features=Features({'_i': 9, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'just', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7148522813498185515, 'pos': 'ADV', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ust', 'ent_type': '', 'dep': 'advmod', 'head': 19, 'left_edge': 15, 'right_edge': 15}),id=15), Annotation(38,58,NounChunk,features=Features({}),id=345), Annotation(42,43,SpaceToken,features=Features({'is_space': True}),id=16), Annotation(43,44,Token,features=Features({'_i': 10, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 19, 'left_edge': 17, 'right_edge': 17}),id=17), Annotation(44,45,SpaceToken,features=Features({'is_space': True}),id=18), Annotation(45,48,Token,features=Features({'_i': 11, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 21, 'left_edge': 15, 'right_edge': 19}),id=19), Annotation(48,49,SpaceToken,features=Features({'is_space': True}),id=20), Annotation(49,58,Token,features=Features({'_i': 12, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5257340109698985342, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ces', 'ent_type': '', 'dep': 'dobj', 'head': 13, 'left_edge': 15, 'right_edge': 21}),id=21), Annotation(58,59,Token,features=Features({'_i': 13, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 13, 'left_edge': 22, 'right_edge': 22}),id=22), Annotation(59,60,SpaceToken,features=Features({'is_space': True}),id=23), Annotation(60,61,SpaceToken,features=Features({'_i': 14, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 27, 'left_edge': 24, 'right_edge': 24}),id=24), Annotation(60,266,Sentence,features=Features({}),id=338), Annotation(61,65,Token,features=Features({'_i': 15, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 27, 'left_edge': 25, 'right_edge': 25}),id=25), Annotation(65,66,SpaceToken,features=Features({'is_space': True}),id=26), Annotation(66,68,Token,features=Features({'_i': 16, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 68, 'left_edge': 24, 'right_edge': 65}),id=27), Annotation(68,69,SpaceToken,features=Features({'is_space': True}),id=28), Annotation(69,70,Token,features=Features({'_i': 17, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 31, 'left_edge': 29, 'right_edge': 29}),id=29), Annotation(69,79,NounChunk,features=Features({}),id=346), Annotation(70,71,SpaceToken,features=Features({'is_space': True}),id=30), Annotation(71,79,Token,features=Features({'_i': 18, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18108853898452662235, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nce', 'ent_type': '', 'dep': 'attr', 'head': 27, 'left_edge': 29, 'right_edge': 64}),id=31), Annotation(79,80,SpaceToken,features=Features({'is_space': True}),id=32), Annotation(80,84,Token,features=Features({'_i': 19, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'that', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4380130941430378203, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'hat', 'ent_type': '', 'dep': 'nsubj', 'head': 35, 'left_edge': 33, 'right_edge': 33}),id=33), Annotation(80,84,NounChunk,features=Features({}),id=347), Annotation(84,85,SpaceToken,features=Features({'is_space': True}),id=34), Annotation(85,93,Token,features=Features({'_i': 20, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'mention', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 834570530775529781, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'relcl', 'head': 31, 'left_edge': 33, 'right_edge': 64}),id=35), Annotation(93,94,SpaceToken,features=Features({'is_space': True}),id=36), Annotation(94,95,Token,features=Features({'_i': 21, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 43, 'left_edge': 37, 'right_edge': 37}),id=37), Annotation(94,114,NounChunk,features=Features({}),id=348), Annotation(95,96,SpaceToken,features=Features({'is_space': True}),id=38), Annotation(96,99,Token,features=Features({'_i': 22, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 39, 'right_edge': 39}),id=39), Annotation(99,100,SpaceToken,features=Features({'is_space': True}),id=40), Annotation(100,105,Token,features=Features({'_i': 23, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'name', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6719370519630147158, 'pos': 'VERB', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'med', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 41, 'right_edge': 41}),id=41), Annotation(105,106,SpaceToken,features=Features({'is_space': True}),id=42), Annotation(106,114,Token,features=Features({'_i': 24, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'entity', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7701268449352692754, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'dobj', 'head': 35, 'left_edge': 37, 'right_edge': 64}),id=43), Annotation(114,115,SpaceToken,features=Features({'is_space': True}),id=44), Annotation(115,119,Token,features=Features({'_i': 25, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 43, 'left_edge': 45, 'right_edge': 64}),id=45), Annotation(119,120,SpaceToken,features=Features({'is_space': True}),id=46), Annotation(120,121,SpaceToken,features=Features({'_i': 26, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 45, 'left_edge': 47, 'right_edge': 47}),id=47), Annotation(121,124,Token,features=Features({'_i': 27, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 50, 'left_edge': 48, 'right_edge': 48}),id=48), Annotation(121,132,NounChunk,features=Features({}),id=349), Annotation(124,125,SpaceToken,features=Features({'is_space': True}),id=49), Annotation(125,132,Token,features=Features({'_i': 28, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17520808660558581486, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'pobj', 'head': 45, 'left_edge': 48, 'right_edge': 64}),id=50), Annotation(132,133,SpaceToken,features=Features({'is_space': True}),id=51), Annotation(133,139,Token,features=Features({'_i': 29, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Barack', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15388493565120789335, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ack', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 54, 'left_edge': 52, 'right_edge': 52}),id=52), Annotation(133,145,PERSON,features=Features({'lemma': 'Barack Obama'}),id=318), Annotation(133,145,NounChunk,features=Features({}),id=350), Annotation(139,140,SpaceToken,features=Features({'is_space': True}),id=53), Annotation(140,145,Token,features=Features({'_i': 30, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Obama', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4857242187112322394, 'pos': 'PROPN', 'prefix': 'O', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ama', 'ent_type': 'PERSON', 'dep': 'appos', 'head': 50, 'left_edge': 52, 'right_edge': 64}),id=54), Annotation(145,146,SpaceToken,features=Features({'is_space': True}),id=55), Annotation(146,148,Token,features=Features({'_i': 31, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 54, 'left_edge': 56, 'right_edge': 56}),id=56), Annotation(148,149,SpaceToken,features=Features({'is_space': True}),id=57), Annotation(149,155,Token,features=Features({'_i': 32, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Ursula', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13791600832320669420, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ula', 'ent_type': 'GPE', 'dep': 'compound', 'head': 62, 'left_edge': 58, 'right_edge': 58}),id=58), Annotation(149,155,GPE,features=Features({'lemma': 'Ursula'}),id=319), Annotation(149,169,NounChunk,features=Features({}),id=351), Annotation(155,156,SpaceToken,features=Features({'is_space': True}),id=59), Annotation(156,159,Token,features=Features({'_i': 33, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'von', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7079615728476843702, 'pos': 'PROPN', 'prefix': 'v', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxx', 'suffix': 'von', 'ent_type': '', 'dep': 'compound', 'head': 62, 'left_edge': 60, 'right_edge': 60}),id=60), Annotation(159,160,SpaceToken,features=Features({'is_space': True}),id=61), Annotation(160,163,Token,features=Features({'_i': 34, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'der', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9250722957692387333, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'der', 'ent_type': '', 'dep': 'compound', 'head': 64, 'left_edge': 58, 'right_edge': 62}),id=62), Annotation(163,164,SpaceToken,features=Features({'is_space': True}),id=63), Annotation(164,169,Token,features=Features({'_i': 35, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Leyen', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13266048868047277473, 'pos': 'PROPN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'yen', 'ent_type': 'GPE', 'dep': 'conj', 'head': 54, 'left_edge': 58, 'right_edge': 64}),id=64), Annotation(164,169,GPE,features=Features({'lemma': 'Leyen'}),id=320), Annotation(169,170,Token,features=Features({'_i': 36, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 27, 'left_edge': 65, 'right_edge': 65}),id=65), Annotation(170,171,SpaceToken,features=Features({'is_space': True}),id=66), Annotation(171,180,Token,features=Features({'_i': 37, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'location', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13063876858105367867, 'pos': 'NOUN', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'nsubj', 'head': 68, 'left_edge': 67, 'right_edge': 67}),id=67), Annotation(171,180,NounChunk,features=Features({}),id=352), Annotation(180,181,SpaceToken,features=Features({'_i': 38, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 68, 'left_edge': 24, 'right_edge': 99}),id=68), Annotation(181,185,Token,features=Features({'_i': 39, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 68, 'left_edge': 69, 'right_edge': 98}),id=69), Annotation(185,186,SpaceToken,features=Features({'is_space': True}),id=70), Annotation(186,189,Token,features=Features({'_i': 40, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'New', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7503827727184870577, 'pos': 'PROPN', 'prefix': 'N', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxx', 'suffix': 'New', 'ent_type': 'GPE', 'dep': 'compound', 'head': 73, 'left_edge': 71, 'right_edge': 71}),id=71), Annotation(186,199,GPE,features=Features({'lemma': 'New York City'}),id=321), Annotation(186,199,NounChunk,features=Features({}),id=353), Annotation(189,190,SpaceToken,features=Features({'is_space': True}),id=72), Annotation(190,194,Token,features=Features({'_i': 41, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'York', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7898044819112200372, 'pos': 'PROPN', 'prefix': 'Y', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ork', 'ent_type': 'GPE', 'dep': 'compound', 'head': 75, 'left_edge': 71, 'right_edge': 73}),id=73), Annotation(194,195,SpaceToken,features=Features({'is_space': True}),id=74), Annotation(195,199,Token,features=Features({'_i': 42, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'City', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 728339108046626067, 'pos': 'PROPN', 'prefix': 'C', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ity', 'ent_type': 'GPE', 'dep': 'pobj', 'head': 69, 'left_edge': 71, 'right_edge': 98}),id=75), Annotation(199,200,Token,features=Features({'_i': 43, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 75, 'left_edge': 76, 'right_edge': 76}),id=76), Annotation(200,201,SpaceToken,features=Features({'is_space': True}),id=77), Annotation(201,207,Token,features=Features({'_i': 44, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Vienna', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10141676973022394345, 'pos': 'PROPN', 'prefix': 'V', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nna', 'ent_type': 'GPE', 'dep': 'conj', 'head': 75, 'left_edge': 78, 'right_edge': 98}),id=78), Annotation(201,207,GPE,features=Features({'lemma': 'Vienna'}),id=322), Annotation(201,207,NounChunk,features=Features({}),id=354), Annotation(207,208,SpaceToken,features=Features({'is_space': True}),id=79), Annotation(208,210,Token,features=Features({'_i': 45, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 80, 'right_edge': 80}),id=80), Annotation(210,211,SpaceToken,features=Features({'is_space': True}),id=81), Annotation(211,218,Token,features=Features({'_i': 46, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Beijing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2949147885855558195, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ing', 'ent_type': 'GPE', 'dep': 'conj', 'head': 78, 'left_edge': 82, 'right_edge': 82}),id=82), Annotation(211,218,GPE,features=Features({'lemma': 'Beijing'}),id=323), Annotation(211,218,NounChunk,features=Features({}),id=355), Annotation(218,219,SpaceToken,features=Features({'is_space': True}),id=83), Annotation(219,221,Token,features=Features({'_i': 47, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 84, 'right_edge': 84}),id=84), Annotation(221,222,SpaceToken,features=Features({'is_space': True}),id=85), Annotation(222,231,Token,features=Features({'_i': 48, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'company', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8026612326651866097, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'conj', 'head': 78, 'left_edge': 86, 'right_edge': 98}),id=86), Annotation(222,231,NounChunk,features=Features({}),id=356), Annotation(231,232,SpaceToken,features=Features({'is_space': True}),id=87), Annotation(232,236,Token,features=Features({'_i': 49, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 86, 'left_edge': 88, 'right_edge': 98}),id=88), Annotation(236,237,SpaceToken,features=Features({'is_space': True}),id=89), Annotation(237,238,SpaceToken,features=Features({'_i': 50, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 88, 'left_edge': 90, 'right_edge': 90}),id=90), Annotation(238,244,Token,features=Features({'_i': 51, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Google', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11578853341595296054, 'pos': 'PROPN', 'prefix': 'G', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gle', 'ent_type': '', 'dep': 'pobj', 'head': 88, 'left_edge': 91, 'right_edge': 98}),id=91), Annotation(238,244,NounChunk,features=Features({}),id=357), Annotation(244,245,Token,features=Features({'_i': 52, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 91, 'left_edge': 92, 'right_edge': 92}),id=92), Annotation(245,246,SpaceToken,features=Features({'is_space': True}),id=93), Annotation(246,255,Token,features=Features({'_i': 53, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'UniCredit', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7663443051765273511, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxXxxxx', 'suffix': 'dit', 'ent_type': 'ORG', 'dep': 'conj', 'head': 91, 'left_edge': 94, 'right_edge': 98}),id=94), Annotation(246,255,ORG,features=Features({'lemma': 'UniCredit'}),id=324), Annotation(246,255,NounChunk,features=Features({}),id=358), Annotation(255,256,SpaceToken,features=Features({'is_space': True}),id=95), Annotation(256,258,Token,features=Features({'_i': 54, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 94, 'left_edge': 96, 'right_edge': 96}),id=96), Annotation(258,259,SpaceToken,features=Features({'is_space': True}),id=97), Annotation(259,265,Token,features=Features({'_i': 55, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Huawei', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4639919019609272215, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'wei', 'ent_type': 'ORG', 'dep': 'conj', 'head': 94, 'left_edge': 98, 'right_edge': 98}),id=98), Annotation(259,265,ORG,features=Features({'lemma': 'Huawei'}),id=325), Annotation(259,265,NounChunk,features=Features({}),id=359), Annotation(265,266,Token,features=Features({'_i': 56, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 68, 'left_edge': 99, 'right_edge': 99}),id=99), Annotation(266,267,SpaceToken,features=Features({'is_space': True}),id=100), Annotation(267,270,Token,features=Features({'_i': 57, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12172435438170721471, 'pos': 'CCONJ', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'Xxx', 'suffix': 'And', 'ent_type': '', 'dep': 'cc', 'head': 105, 'left_edge': 101, 'right_edge': 101}),id=101), Annotation(267,422,Sentence,features=Features({}),id=339), Annotation(270,271,SpaceToken,features=Features({'is_space': True}),id=102), Annotation(271,275,Token,features=Features({'_i': 58, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 411390626470654571, 'pos': 'ADV', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 105, 'left_edge': 103, 'right_edge': 103}),id=103), Annotation(275,276,SpaceToken,features=Features({'is_space': True}),id=104), Annotation(276,278,Token,features=Features({'_i': 59, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 118, 'left_edge': 101, 'right_edge': 109}),id=105), Annotation(278,279,SpaceToken,features=Features({'is_space': True}),id=106), Annotation(279,285,Token,features=Features({'_i': 60, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Donald', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16889399016836222064, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ald', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 109, 'left_edge': 107, 'right_edge': 107}),id=107), Annotation(279,291,PERSON,features=Features({'lemma': 'Donald Trump'}),id=326), Annotation(279,291,NounChunk,features=Features({}),id=360), Annotation(285,286,SpaceToken,features=Features({'is_space': True}),id=108), Annotation(286,291,Token,features=Features({'_i': 61, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Trump', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1134333841961332695, 'pos': 'PROPN', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ump', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 105, 'left_edge': 107, 'right_edge': 109}),id=109), Annotation(291,292,Token,features=Features({'_i': 62, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 118, 'left_edge': 110, 'right_edge': 110}),id=110), Annotation(292,293,SpaceToken,features=Features({'is_space': True}),id=111), Annotation(293,295,Token,features=Features({'_i': 63, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10239237003504588839, 'pos': 'PRON', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'it', 'ent_type': '', 'dep': 'nsubj', 'head': 118, 'left_edge': 112, 'right_edge': 112}),id=112), Annotation(293,295,NounChunk,features=Features({}),id=361), Annotation(295,296,SpaceToken,features=Features({'is_space': True}),id=113), Annotation(296,299,Token,features=Features({'_i': 64, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'may', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14378475389916013800, 'pos': 'AUX', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxx', 'suffix': 'may', 'ent_type': '', 'dep': 'aux', 'head': 118, 'left_edge': 114, 'right_edge': 114}),id=114), Annotation(299,300,SpaceToken,features=Features({'is_space': True}),id=115), Annotation(300,303,Token,features=Features({'_i': 65, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'not', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 447765159362469301, 'pos': 'PART', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'not', 'ent_type': '', 'dep': 'neg', 'head': 118, 'left_edge': 116, 'right_edge': 116}),id=116), Annotation(303,304,SpaceToken,features=Features({'is_space': True}),id=117), Annotation(304,306,Token,features=Features({'_i': 66, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ccomp', 'head': 130, 'left_edge': 101, 'right_edge': 127}),id=118), Annotation(306,307,SpaceToken,features=Features({'is_space': True}),id=119), Annotation(307,310,Token,features=Features({'_i': 67, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 124, 'left_edge': 120, 'right_edge': 120}),id=120), Annotation(310,311,SpaceToken,features=Features({'is_space': True}),id=121), Annotation(311,315,Token,features=Features({'_i': 68, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'real', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7050692189315705983, 'pos': 'ADJ', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'eal', 'ent_type': '', 'dep': 'amod', 'head': 124, 'left_edge': 122, 'right_edge': 122}),id=122), Annotation(315,316,SpaceToken,features=Features({'is_space': True}),id=123), Annotation(316,319,Token,features=Features({'_i': 69, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'one', 'like_email': False, 'like_num': True, 'like_url': False, 'orth': 17454115351911680600, 'pos': 'NUM', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xxx', 'suffix': 'one', 'ent_type': '', 'dep': 'attr', 'head': 118, 'left_edge': 120, 'right_edge': 127}),id=124), Annotation(319,320,SpaceToken,features=Features({'is_space': True}),id=125), Annotation(320,322,Token,features=Features({'_i': 70, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': True, 'lang': 'en', 'lemma': ':P', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11282459806373970386, 'pos': 'PUNCT', 'prefix': ':', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ':', 'shape': ':X', 'suffix': ':P', 'ent_type': '', 'dep': 'punct', 'head': 124, 'left_edge': 126, 'right_edge': 126}),id=126), Annotation(322,324,SpaceToken,features=Features({'_i': 71, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 124, 'left_edge': 127, 'right_edge': 127}),id=127), Annotation(324,328,Token,features=Features({'_i': 72, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'let', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8631549979455155821, 'pos': 'NOUN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'Xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'nsubj', 'head': 130, 'left_edge': 128, 'right_edge': 128}),id=128), Annotation(324,328,NounChunk,features=Features({}),id=362), Annotation(328,329,SpaceToken,features=Features({'is_space': True}),id=129), Annotation(329,332,Token,features=Features({'_i': 73, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'say', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8685289367999165211, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxx', 'suffix': 'say', 'ent_type': '', 'dep': 'parataxis', 'head': 153, 'left_edge': 101, 'right_edge': 149}),id=130), Annotation(332,333,SpaceToken,features=Features({'is_space': True}),id=131), Annotation(333,338,Token,features=Features({'_i': 74, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Boris', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3926703778135926261, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ris', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 134, 'left_edge': 132, 'right_edge': 132}),id=132), Annotation(333,346,PERSON,features=Features({'lemma': 'Boris Johnson'}),id=327), Annotation(333,346,NounChunk,features=Features({}),id=363), Annotation(338,339,SpaceToken,features=Features({'is_space': True}),id=133), Annotation(339,346,Token,features=Features({'_i': 75, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Johnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14020170261514038406, 'pos': 'PROPN', 'prefix': 'J', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'son', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 130, 'left_edge': 132, 'right_edge': 148}),id=134), Annotation(346,347,SpaceToken,features=Features({'is_space': True}),id=135), Annotation(347,350,Token,features=Features({'_i': 76, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'aka', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6309978511773728404, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'aka', 'ent_type': '', 'dep': 'advmod', 'head': 140, 'left_edge': 136, 'right_edge': 136}),id=136), Annotation(347,362,NounChunk,features=Features({}),id=364), Annotation(350,351,SpaceToken,features=Features({'is_space': True}),id=137), Annotation(351,355,Token,features=Features({'_i': 77, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Bojo', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1381500864330977487, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ojo', 'ent_type': '', 'dep': 'compound', 'head': 140, 'left_edge': 138, 'right_edge': 138}),id=138), Annotation(355,356,SpaceToken,features=Features({'is_space': True}),id=139), Annotation(356,362,Token,features=Features({'_i': 78, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'tweet', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9158814828460388352, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'appos', 'head': 134, 'left_edge': 136, 'right_edge': 148}),id=140), Annotation(362,363,SpaceToken,features=Features({'is_space': True}),id=141), Annotation(363,367,Token,features=Features({'_i': 79, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 140, 'left_edge': 142, 'right_edge': 148}),id=142), Annotation(367,368,SpaceToken,features=Features({'is_space': True}),id=143), Annotation(368,371,Token,features=Features({'_i': 80, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'his', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2661093235354845946, 'pos': 'PRON', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP$', 'shape': 'xxx', 'suffix': 'his', 'ent_type': '', 'dep': 'poss', 'head': 148, 'left_edge': 144, 'right_edge': 144}),id=144), Annotation(368,392,NounChunk,features=Features({}),id=365), Annotation(371,372,SpaceToken,features=Features({'is_space': True}),id=145), Annotation(372,384,Token,features=Features({'_i': 81, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'BorisJohnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6152028365162425637, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxxxXxxxx', 'suffix': 'son', 'ent_type': 'ORG', 'dep': 'compound', 'head': 148, 'left_edge': 146, 'right_edge': 146}),id=146), Annotation(372,384,ORG,features=Features({'lemma': 'BorisJohnson'}),id=328), Annotation(384,385,SpaceToken,features=Features({'is_space': True}),id=147), Annotation(385,392,Token,features=Features({'_i': 82, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'account', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16051117652154763083, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'unt', 'ent_type': '', 'dep': 'pobj', 'head': 142, 'left_edge': 144, 'right_edge': 148}),id=148), Annotation(392,393,Token,features=Features({'_i': 83, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 130, 'left_edge': 149, 'right_edge': 149}),id=149), Annotation(393,394,SpaceToken,features=Features({'is_space': True}),id=150), Annotation(394,399,Token,features=Features({'_i': 84, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'would', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6992604926141104606, 'pos': 'AUX', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxxx', 'suffix': 'uld', 'ent_type': '', 'dep': 'aux', 'head': 153, 'left_edge': 151, 'right_edge': 151}),id=151), Annotation(399,400,SpaceToken,features=Features({'is_space': True}),id=152), Annotation(400,402,Token,features=Features({'_i': 85, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ROOT', 'head': 153, 'left_edge': 101, 'right_edge': 162}),id=153), Annotation(402,403,SpaceToken,features=Features({'is_space': True}),id=154), Annotation(403,407,Token,features=Features({'_i': 86, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'nice', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14121509715367036122, 'pos': 'ADJ', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ice', 'ent_type': '', 'dep': 'acomp', 'head': 153, 'left_edge': 155, 'right_edge': 155}),id=155), Annotation(407,408,SpaceToken,features=Features({'is_space': True}),id=156), Annotation(408,410,Token,features=Features({'_i': 87, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'PART', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'TO', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 159, 'left_edge': 157, 'right_edge': 157}),id=157), Annotation(410,411,SpaceToken,features=Features({'is_space': True}),id=158), Annotation(411,416,Token,features=Features({'_i': 88, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'match', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16065818573247886523, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xxxx', 'suffix': 'tch', 'ent_type': '', 'dep': 'xcomp', 'head': 153, 'left_edge': 157, 'right_edge': 161}),id=159), Annotation(416,417,SpaceToken,features=Features({'is_space': True}),id=160), Annotation(417,421,Token,features=Features({'_i': 89, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'they', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7976265300568155021, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xxxx', 'suffix': 'hem', 'ent_type': '', 'dep': 'dobj', 'head': 159, 'left_edge': 161, 'right_edge': 161}),id=161), Annotation(417,421,NounChunk,features=Features({}),id=366), Annotation(421,422,Token,features=Features({'_i': 90, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '!', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17494803046312582752, 'pos': 'PUNCT', 'prefix': '!', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '!', 'suffix': '!', 'ent_type': '', 'dep': 'punct', 'head': 153, 'left_edge': 162, 'right_edge': 162}),id=162), Annotation(422,424,SpaceToken,features=Features({'_i': 91, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 163, 'right_edge': 163}),id=163), Annotation(422,829,Sentence,features=Features({}),id=340), Annotation(424,428,Token,features=Features({'_i': 92, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 164, 'right_edge': 164}),id=164), Annotation(428,429,SpaceToken,features=Features({'is_space': True}),id=165), Annotation(429,431,Token,features=Features({'_i': 93, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 168, 'left_edge': 166, 'right_edge': 166}),id=166), Annotation(429,431,NounChunk,features=Features({}),id=367), Annotation(431,432,SpaceToken,features=Features({'is_space': True}),id=167), Annotation(432,439,Token,features=Features({'_i': 94, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'include', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14049642289933595219, 'pos': 'VERB', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'ude', 'ent_type': '', 'dep': 'ccomp', 'head': 265, 'left_edge': 163, 'right_edge': 257}),id=168), Annotation(439,440,SpaceToken,features=Features({'is_space': True}),id=169), Annotation(440,441,Token,features=Features({'_i': 95, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 172, 'left_edge': 170, 'right_edge': 170}),id=170), Annotation(441,442,SpaceToken,features=Features({'is_space': True}),id=171), Annotation(442,445,Token,features=Features({'_i': 96, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'url', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2582013287274679728, 'pos': 'NOUN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'XXX', 'suffix': 'URL', 'ent_type': '', 'dep': 'nmod', 'head': 176, 'left_edge': 170, 'right_edge': 174}),id=172), Annotation(445,446,SpaceToken,features=Features({'is_space': True}),id=173), Annotation(446,487,Token,features=Features({'_i': 97, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'like_email': False, 'like_num': False, 'like_url': True, 'orth': 616995933894448768, 'pos': 'X', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'FW', 'shape': 'xxxx://xxxx.xxxx.xx/xxxx-xxxx/', 'suffix': 'lp/', 'ent_type': '', 'dep': 'nummod', 'head': 172, 'left_edge': 174, 'right_edge': 174}),id=174), Annotation(487,488,SpaceToken,features=Features({'is_space': True}),id=175), Annotation(488,489,SpaceToken,features=Features({'_i': 98, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 170, 'right_edge': 187}),id=176), Annotation(489,492,Token,features=Features({'_i': 99, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 176, 'left_edge': 177, 'right_edge': 177}),id=177), Annotation(492,493,SpaceToken,features=Features({'is_space': True}),id=178), Annotation(493,494,Token,features=Features({'_i': 100, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 185, 'left_edge': 179, 'right_edge': 179}),id=179), Annotation(494,495,SpaceToken,features=Features({'is_space': True}),id=180), Annotation(495,499,Token,features=Features({'_i': 101, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'fake', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 513889413363281732, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ake', 'ent_type': '', 'dep': 'amod', 'head': 185, 'left_edge': 181, 'right_edge': 181}),id=181), Annotation(499,500,SpaceToken,features=Features({'is_space': True}),id=182), Annotation(500,505,Token,features=Features({'_i': 102, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'email', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7320900731437023467, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ail', 'ent_type': '', 'dep': 'compound', 'head': 185, 'left_edge': 183, 'right_edge': 183}),id=183), Annotation(505,506,SpaceToken,features=Features({'is_space': True}),id=184), Annotation(506,513,Token,features=Features({'_i': 103, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'address', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14349763737243715186, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ess', 'ent_type': '', 'dep': 'conj', 'head': 176, 'left_edge': 179, 'right_edge': 187}),id=185), Annotation(513,514,SpaceToken,features=Features({'is_space': True}),id=186), Annotation(514,540,Token,features=Features({'_i': 104, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'john.doe@hiscoolserver.com', 'like_email': True, 'like_num': False, 'like_url': False, 'orth': 8439314673651519470, 'pos': 'PROPN', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxxx.xxx@xxxx.xxx', 'suffix': 'com', 'ent_type': '', 'dep': 'appos', 'head': 185, 'left_edge': 187, 'right_edge': 187}),id=187), Annotation(514,540,NounChunk,features=Features({}),id=368), Annotation(540,541,SpaceToken,features=Features({'is_space': True}),id=188), Annotation(541,543,Token,features=Features({'_i': 105, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'advmod', 'head': 191, 'left_edge': 189, 'right_edge': 189}),id=189), Annotation(543,544,SpaceToken,features=Features({'is_space': True}),id=190), Annotation(544,548,Token,features=Features({'_i': 106, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'well', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4525988469032889948, 'pos': 'ADV', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ell', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 189, 'right_edge': 191}),id=191), Annotation(548,549,SpaceToken,features=Features({'is_space': True}),id=192), Annotation(549,550,SpaceToken,features=Features({'_i': 107, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 193, 'right_edge': 193}),id=193), Annotation(550,552,Token,features=Features({'_i': 108, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADP', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'prep', 'head': 168, 'left_edge': 194, 'right_edge': 203}),id=194), Annotation(552,553,SpaceToken,features=Features({'is_space': True}),id=195), Annotation(553,554,Token,features=Features({'_i': 109, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 199, 'left_edge': 196, 'right_edge': 196}),id=196), Annotation(553,560,MONEY,features=Features({'lemma': '#some #'}),id=329), Annotation(553,573,NounChunk,features=Features({}),id=369), Annotation(554,558,Token,features=Features({'_i': 110, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'some', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7000492816108906599, 'pos': 'DET', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxxx', 'suffix': 'ome', 'ent_type': 'MONEY', 'dep': 'det', 'head': 199, 'left_edge': 197, 'right_edge': 197}),id=197), Annotation(558,559,SpaceToken,features=Features({'is_space': True}),id=198), Annotation(559,560,Token,features=Features({'_i': 111, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'ADV', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RBR', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 203, 'left_edge': 196, 'right_edge': 200}),id=199), Annotation(560,564,Token,features=Features({'_i': 112, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cool', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16540862642162215401, 'pos': 'ADJ', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ool', 'ent_type': '', 'dep': 'amod', 'head': 199, 'left_edge': 200, 'right_edge': 200}),id=200), Annotation(564,565,SpaceToken,features=Features({'is_space': True}),id=201), Annotation(565,566,Token,features=Features({'_i': 113, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': '', 'dep': 'nmod', 'head': 203, 'left_edge': 202, 'right_edge': 202}),id=202), Annotation(566,573,Token,features=Features({'_i': 114, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'hastag', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12335920655888895378, 'pos': 'NOUN', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ags', 'ent_type': '', 'dep': 'pobj', 'head': 194, 'left_edge': 196, 'right_edge': 203}),id=203), Annotation(573,574,SpaceToken,features=Features({'is_space': True}),id=204), Annotation(574,577,Token,features=Features({'_i': 115, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 168, 'left_edge': 205, 'right_edge': 205}),id=205), Annotation(577,578,SpaceToken,features=Features({'is_space': True}),id=206), Annotation(578,579,Token,features=Features({'_i': 116, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 209, 'left_edge': 207, 'right_edge': 207}),id=207), Annotation(579,580,SpaceToken,features=Features({'is_space': True}),id=208), Annotation(580,585,Token,features=Features({'_i': 117, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'bunch', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16097686030507304001, 'pos': 'NOUN', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nch', 'ent_type': '', 'dep': 'conj', 'head': 168, 'left_edge': 207, 'right_edge': 225}),id=209), Annotation(585,586,SpaceToken,features=Features({'is_space': True}),id=210), Annotation(586,588,Token,features=Features({'_i': 118, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'of', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 886050111519832510, 'pos': 'ADP', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'of', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 211, 'right_edge': 213}),id=211), Annotation(588,589,SpaceToken,features=Features({'is_space': True}),id=212), Annotation(589,595,Token,features=Features({'_i': 119, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'emoji', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15308488336638733982, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'jis', 'ent_type': '', 'dep': 'pobj', 'head': 211, 'left_edge': 213, 'right_edge': 213}),id=213), Annotation(589,595,NounChunk,features=Features({}),id=370), Annotation(595,596,SpaceToken,features=Features({'is_space': True}),id=214), Annotation(596,600,Token,features=Features({'_i': 120, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 215, 'right_edge': 224}),id=215), Annotation(600,601,SpaceToken,features=Features({'is_space': True}),id=216), Annotation(601,602,Token,features=Features({'_i': 121, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '😽', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13410134130775049117, 'pos': 'PROPN', 'prefix': '😽', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '😽', 'suffix': '😽', 'ent_type': '', 'dep': 'pobj', 'head': 215, 'left_edge': 217, 'right_edge': 224}),id=217), Annotation(601,602,NounChunk,features=Features({}),id=371), Annotation(602,603,SpaceToken,features=Features({'is_space': True}),id=218), Annotation(603,604,Token,features=Features({'_i': 122, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 217, 'left_edge': 219, 'right_edge': 219}),id=219), Annotation(604,605,Token,features=Features({'_i': 123, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 224, 'left_edge': 220, 'right_edge': 220}),id=220), Annotation(604,617,NounChunk,features=Features({}),id=372), Annotation(605,606,SpaceToken,features=Features({'is_space': True}),id=221), Annotation(606,613,Token,features=Features({'_i': 124, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'kissing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17741139538205582276, 'pos': 'NOUN', 'prefix': 'k', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'amod', 'head': 224, 'left_edge': 222, 'right_edge': 222}),id=222), Annotation(613,614,SpaceToken,features=Features({'is_space': True}),id=223), Annotation(614,617,Token,features=Features({'_i': 125, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cat', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5439657043933447811, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'cat', 'ent_type': '', 'dep': 'appos', 'head': 217, 'left_edge': 220, 'right_edge': 224}),id=224), Annotation(617,618,Token,features=Features({'_i': 126, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 209, 'left_edge': 225, 'right_edge': 225}),id=225), Annotation(618,619,Token,features=Features({'_i': 127, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 168, 'left_edge': 226, 'right_edge': 226}),id=226), Annotation(619,620,SpaceToken,features=Features({'_i': 128, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 227, 'right_edge': 227}),id=227), Annotation(620,621,Token,features=Features({'_i': 129, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '👩', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13533745690575315714, 'pos': 'X', 'prefix': '👩', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '👩', 'suffix': '👩', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 228, 'right_edge': 228}),id=228), Annotation(621,622,Token,features=Features({'_i': 130, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\u200d', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17442142028047717517, 'pos': 'X', 'prefix': '\\u200d', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '\\u200d', 'suffix': '\\u200d', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 229, 'right_edge': 246}),id=229), Annotation(622,623,Token,features=Features({'_i': 131, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🏫', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7253895586487469997, 'pos': 'NOUN', 'prefix': '🏫', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': '🏫', 'suffix': '🏫', 'ent_type': '', 'dep': 'dobj', 'head': 229, 'left_edge': 230, 'right_edge': 246}),id=230), Annotation(622,623,NounChunk,features=Features({}),id=373), Annotation(623,624,SpaceToken,features=Features({'is_space': True}),id=231), Annotation(624,625,Token,features=Features({'_i': 132, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 232, 'right_edge': 232}),id=232), Annotation(625,626,Token,features=Features({'_i': 133, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 237, 'left_edge': 233, 'right_edge': 233}),id=233), Annotation(625,640,NounChunk,features=Features({}),id=374), Annotation(626,627,SpaceToken,features=Features({'is_space': True}),id=234), Annotation(627,632,Token,features=Features({'_i': 134, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'woman', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18308318389743014884, 'pos': 'NOUN', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'man', 'ent_type': '', 'dep': 'compound', 'head': 237, 'left_edge': 235, 'right_edge': 235}),id=235), Annotation(632,633,SpaceToken,features=Features({'is_space': True}),id=236), Annotation(633,640,Token,features=Features({'_i': 135, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'teacher', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 78820239457828971, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'her', 'ent_type': '', 'dep': 'appos', 'head': 230, 'left_edge': 233, 'right_edge': 237}),id=237), Annotation(640,641,Token,features=Features({'_i': 136, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 238, 'right_edge': 238}),id=238), Annotation(641,642,Token,features=Features({'_i': 137, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 239, 'right_edge': 239}),id=239), Annotation(642,643,SpaceToken,features=Features({'is_space': True}),id=240), Annotation(643,644,Token,features=Features({'_i': 138, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧬', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9031423621242036097, 'pos': 'PROPN', 'prefix': '🧬', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧬', 'suffix': '🧬', 'ent_type': 'PERSON', 'dep': 'conj', 'head': 230, 'left_edge': 241, 'right_edge': 245}),id=241), Annotation(643,644,PERSON,features=Features({'lemma': '🧬'}),id=330), Annotation(643,644,NounChunk,features=Features({}),id=375), Annotation(644,645,SpaceToken,features=Features({'is_space': True}),id=242), Annotation(645,646,Token,features=Features({'_i': 139, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 243, 'right_edge': 243}),id=243), Annotation(646,649,Token,features=Features({'_i': 140, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'DNA', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14386531691665947480, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XXX', 'suffix': 'DNA', 'ent_type': '', 'dep': 'appos', 'head': 241, 'left_edge': 244, 'right_edge': 244}),id=244), Annotation(646,649,NounChunk,features=Features({}),id=376), Annotation(649,650,Token,features=Features({'_i': 141, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 245, 'right_edge': 245}),id=245), Annotation(650,651,Token,features=Features({'_i': 142, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 246, 'right_edge': 246}),id=246), Annotation(651,652,SpaceToken,features=Features({'is_space': True}),id=247), Annotation(652,653,SpaceToken,features=Features({'_i': 143, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 256, 'left_edge': 248, 'right_edge': 248}),id=248), Annotation(653,654,Token,features=Features({'_i': 144, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧗', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7724749230788735883, 'pos': 'PROPN', 'prefix': '🧗', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧗', 'suffix': '🧗', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 249, 'right_edge': 249}),id=249), Annotation(653,654,NounChunk,features=Features({}),id=377), Annotation(654,655,SpaceToken,features=Features({'is_space': True}),id=250), Annotation(655,656,Token,features=Features({'_i': 145, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 251, 'right_edge': 251}),id=251), Annotation(656,657,Token,features=Features({'_i': 146, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 254, 'left_edge': 252, 'right_edge': 252}),id=252), Annotation(656,664,NounChunk,features=Features({}),id=378), Annotation(657,658,SpaceToken,features=Features({'is_space': True}),id=253), Annotation(658,664,Token,features=Features({'_i': 147, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14800503047316267216, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'son', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 252, 'right_edge': 254}),id=254), Annotation(664,665,SpaceToken,features=Features({'is_space': True}),id=255), Annotation(665,673,Token,features=Features({'_i': 148, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'climb', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 992864226358664410, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBG', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'ccomp', 'head': 168, 'left_edge': 248, 'right_edge': 257}),id=256), Annotation(673,674,Token,features=Features({'_i': 149, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 257, 'right_edge': 257}),id=257), Annotation(674,675,Token,features=Features({'_i': 150, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 258, 'right_edge': 258}),id=258), Annotation(675,676,SpaceToken,features=Features({'is_space': True}),id=259), Annotation(676,678,SpaceToken,features=Features({'_i': 151, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 265, 'left_edge': 260, 'right_edge': 260}),id=260), Annotation(678,682,Token,features=Features({'_i': 152, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 265, 'left_edge': 261, 'right_edge': 261}),id=261), Annotation(682,683,SpaceToken,features=Features({'is_space': True}),id=262), Annotation(683,685,Token,features=Features({'_i': 153, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 265, 'left_edge': 263, 'right_edge': 263}),id=263), Annotation(683,685,NounChunk,features=Features({}),id=379), Annotation(685,686,SpaceToken,features=Features({'is_space': True}),id=264), Annotation(686,690,Token,features=Features({'_i': 154, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'VERB', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'ROOT', 'head': 265, 'left_edge': 163, 'right_edge': 316}),id=265), Annotation(690,691,SpaceToken,features=Features({'is_space': True}),id=266), Annotation(691,692,Token,features=Features({'_i': 155, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 269, 'left_edge': 267, 'right_edge': 267}),id=267), Annotation(691,714,NounChunk,features=Features({}),id=380), Annotation(692,693,SpaceToken,features=Features({'is_space': True}),id=268), Annotation(693,696,Token,features=Features({'_i': 156, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 273, 'left_edge': 267, 'right_edge': 269}),id=269), Annotation(696,697,SpaceToken,features=Features({'is_space': True}),id=270), Annotation(697,706,Token,features=Features({'_i': 157, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'different', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17379251038783395147, 'pos': 'ADJ', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'amod', 'head': 273, 'left_edge': 271, 'right_edge': 271}),id=271), Annotation(706,707,SpaceToken,features=Features({'is_space': True}),id=272), Annotation(707,714,Token,features=Features({'_i': 158, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'script', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5755375431925009742, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'pts', 'ent_type': '', 'dep': 'dobj', 'head': 265, 'left_edge': 267, 'right_edge': 315}),id=273), Annotation(714,715,Token,features=Features({'_i': 159, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 273, 'left_edge': 274, 'right_edge': 274}),id=274), Annotation(715,716,SpaceToken,features=Features({'is_space': True}),id=275), Annotation(716,720,Token,features=Features({'_i': 160, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'e.g.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2360781233418247603, 'pos': 'ADV', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'x.x.', 'suffix': '.g.', 'ent_type': '', 'dep': 'advmod', 'head': 280, 'left_edge': 276, 'right_edge': 276}),id=276), Annotation(716,730,NounChunk,features=Features({}),id=381), Annotation(720,721,SpaceToken,features=Features({'is_space': True}),id=277), Annotation(721,727,Token,features=Features({'_i': 161, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hangul', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13901273419332601270, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gul', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 280, 'left_edge': 278, 'right_edge': 278}),id=278), Annotation(721,727,PERSON,features=Features({'lemma': 'Hangul'}),id=331), Annotation(727,728,SpaceToken,features=Features({'is_space': True}),id=279), Annotation(728,730,Token,features=Features({'_i': 162, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '한글', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14226613016943586999, 'pos': 'PROPN', 'prefix': '한', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '한글', 'ent_type': '', 'dep': 'appos', 'head': 273, 'left_edge': 276, 'right_edge': 282}),id=280), Annotation(730,731,SpaceToken,features=Features({'is_space': True}),id=281), Annotation(731,733,Token,features=Features({'_i': 163, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 280, 'left_edge': 282, 'right_edge': 282}),id=282), Annotation(733,734,SpaceToken,features=Features({'is_space': True}),id=283), Annotation(734,735,SpaceToken,features=Features({'_i': 164, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 273, 'left_edge': 284, 'right_edge': 284}),id=284), Annotation(735,745,Token,features=Features({'_i': 165, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'simplify', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15400283218552125021, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'ied', 'ent_type': '', 'dep': 'amod', 'head': 289, 'left_edge': 285, 'right_edge': 285}),id=285), Annotation(735,754,NounChunk,features=Features({}),id=382), Annotation(745,746,SpaceToken,features=Features({'is_space': True}),id=286), Annotation(746,751,Token,features=Features({'_i': 166, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hanzi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7162399416189955260, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nzi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 289, 'left_edge': 287, 'right_edge': 287}),id=287), Annotation(746,751,PERSON,features=Features({'lemma': 'Hanzi'}),id=332), Annotation(751,752,SpaceToken,features=Features({'is_space': True}),id=288), Annotation(752,754,Token,features=Features({'_i': 167, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '汉字', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2455570606503990670, 'pos': 'PROPN', 'prefix': '汉', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '汉字', 'ent_type': '', 'dep': 'conj', 'head': 273, 'left_edge': 285, 'right_edge': 315}),id=289), Annotation(754,755,SpaceToken,features=Features({'is_space': True}),id=290), Annotation(755,757,Token,features=Features({'_i': 168, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 289, 'left_edge': 291, 'right_edge': 291}),id=291), Annotation(757,758,SpaceToken,features=Features({'is_space': True}),id=292), Annotation(758,763,Token,features=Features({'_i': 169, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Farsi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1631855311088449743, 'pos': 'PROPN', 'prefix': 'F', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'rsi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 295, 'left_edge': 293, 'right_edge': 293}),id=293), Annotation(758,763,PERSON,features=Features({'lemma': 'Farsi'}),id=333), Annotation(758,769,NounChunk,features=Features({}),id=383), Annotation(763,764,SpaceToken,features=Features({'is_space': True}),id=294), Annotation(764,769,Token,features=Features({'_i': 170, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'فارسی', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10192536850399601059, 'pos': 'NOUN', 'prefix': 'ف', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'رسی', 'ent_type': '', 'dep': 'conj', 'head': 289, 'left_edge': 293, 'right_edge': 299}),id=295), Annotation(769,770,SpaceToken,features=Features({'is_space': True}),id=296), Annotation(770,773,Token,features=Features({'_i': 171, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 295, 'left_edge': 297, 'right_edge': 297}),id=297), Annotation(773,774,SpaceToken,features=Features({'is_space': True}),id=298), Annotation(774,780,Token,features=Features({'_i': 172, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Arabic', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6558098436421695236, 'pos': 'PROPN', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'bic', 'ent_type': 'LANGUAGE', 'dep': 'conj', 'head': 295, 'left_edge': 299, 'right_edge': 299}),id=299), Annotation(774,780,LANGUAGE,features=Features({'lemma': 'Arabic'}),id=334), Annotation(774,780,NounChunk,features=Features({}),id=384), Annotation(780,781,SpaceToken,features=Features({'is_space': True}),id=300), Annotation(781,782,Token,features=Features({'_i': 173, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 301, 'right_edge': 301}),id=301), Annotation(782,797,Token,features=Features({'_i': 174, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'اَلْعَرَبِيَّةُ', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16849633969024660305, 'pos': 'NUM', 'prefix': 'ا', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xَxْxَxَxِxَّxُ', 'suffix': 'ّةُ', 'ent_type': 'GPE', 'dep': 'appos', 'head': 289, 'left_edge': 302, 'right_edge': 302}),id=302), Annotation(782,797,GPE,features=Features({'lemma': 'اَلْعَرَبِيَّةُ'}),id=335), Annotation(797,798,Token,features=Features({'_i': 175, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 303, 'right_edge': 303}),id=303), Annotation(798,799,SpaceToken,features=Features({'is_space': True}),id=304), Annotation(799,804,Token,features=Features({'_i': 176, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'which', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7063653163634019529, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'ich', 'ent_type': '', 'dep': 'nsubj', 'head': 307, 'left_edge': 305, 'right_edge': 305}),id=305), Annotation(799,804,NounChunk,features=Features({}),id=385), Annotation(804,805,SpaceToken,features=Features({'is_space': True}),id=306), Annotation(805,809,Token,features=Features({'_i': 177, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'go', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18224404223277189725, 'pos': 'VERB', 'prefix': 'g', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'oes', 'ent_type': '', 'dep': 'relcl', 'head': 289, 'left_edge': 305, 'right_edge': 315}),id=307), Annotation(809,810,SpaceToken,features=Features({'is_space': True}),id=308), Annotation(810,814,Token,features=Features({'_i': 178, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 307, 'left_edge': 309, 'right_edge': 315}),id=309), Annotation(814,815,SpaceToken,features=Features({'is_space': True}),id=310), Annotation(815,820,Token,features=Features({'_i': 179, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'right', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5943797630011647483, 'pos': 'NOUN', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ght', 'ent_type': '', 'dep': 'advmod', 'head': 313, 'left_edge': 311, 'right_edge': 311}),id=311), Annotation(820,821,SpaceToken,features=Features({'is_space': True}),id=312), Annotation(821,823,Token,features=Features({'_i': 180, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'AUX', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 315, 'left_edge': 311, 'right_edge': 313}),id=313), Annotation(823,824,SpaceToken,features=Features({'is_space': True}),id=314), Annotation(824,828,Token,features=Features({'_i': 181, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'leave', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4003250512168697582, 'pos': 'VERB', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'eft', 'ent_type': '', 'dep': 'pcomp', 'head': 309, 'left_edge': 311, 'right_edge': 315}),id=315), Annotation(828,829,Token,features=Features({'_i': 182, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 316, 'right_edge': 316}),id=316), Annotation(829,831,SpaceToken,features=Features({'_i': 183, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 317, 'left_edge': 317, 'right_edge': 317}),id=317), Annotation(829,831,Sentence,features=Features({}),id=341)]), 'Stanza': AnnotationSet([Annotation(0,4,Token,features=Features({'text': 'This', 'lemma': 'this', 'upos': 'PRON', 'xpos': 'DT', 'Number': 'Sing', 'PronType': 'Dem', 'head': 4, 'deprel': 'nsubj', 'ner': 'O'}),id=0), Annotation(0,24,Sentence,features=Features({}),id=6), Annotation(5,7,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 4, 'deprel': 'cop', 'ner': 'O'}),id=1), Annotation(8,9,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 4, 'deprel': 'det', 'ner': 'O'}),id=2), Annotation(10,14,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 4, 'deprel': 'compound', 'ner': 'O'}),id=3), Annotation(15,23,Token,features=Features({'text': 'document', 'lemma': 'document', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 6, 'deprel': 'root', 'ner': 'O'}),id=4), Annotation(23,24,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 4, 'deprel': 'punct', 'ner': 'O'}),id=5), Annotation(26,28,Token,features=Features({'text': 'It', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 8, 'deprel': 'nsubj', 'ner': 'O'}),id=7), Annotation(26,59,Sentence,features=Features({}),id=14), Annotation(29,37,Token,features=Features({'text': 'contains', 'lemma': 'contain', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 14, 'deprel': 'root', 'ner': 'O'}),id=8), Annotation(38,42,Token,features=Features({'text': 'just', 'lemma': 'just', 'upos': 'ADV', 'xpos': 'RB', 'head': 12, 'deprel': 'advmod', 'ner': 'O'}),id=9), Annotation(43,44,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 12, 'deprel': 'det', 'ner': 'O'}),id=10), Annotation(45,48,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 12, 'deprel': 'amod', 'ner': 'O'}),id=11), Annotation(49,58,Token,features=Features({'text': 'sentences', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 8, 'deprel': 'obj', 'ner': 'O'}),id=12), Annotation(58,59,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 8, 'deprel': 'punct', 'ner': 'O'}),id=13), Annotation(61,65,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 54, 'deprel': 'root', 'ner': 'O'}),id=15), Annotation(61,266,Sentence,features=Features({}),id=54), Annotation(66,68,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 15, 'deprel': 'cop', 'ner': 'O'}),id=16), Annotation(69,70,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 18, 'deprel': 'det', 'ner': 'O'}),id=17), Annotation(71,79,Token,features=Features({'text': 'sentence', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 15, 'deprel': 'nsubj', 'ner': 'O'}),id=18), Annotation(80,84,Token,features=Features({'text': 'that', 'lemma': 'that', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 20, 'deprel': 'nsubj', 'ner': 'O'}),id=19), Annotation(85,93,Token,features=Features({'text': 'mentions', 'lemma': 'mention', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 18, 'deprel': 'acl:relcl', 'ner': 'O'}),id=20), Annotation(94,95,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 24, 'deprel': 'det', 'ner': 'O'}),id=21), Annotation(96,99,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=22), Annotation(100,105,Token,features=Features({'text': 'named', 'lemma': 'name', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=23), Annotation(106,114,Token,features=Features({'text': 'entities', 'lemma': 'entity', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 20, 'deprel': 'obj', 'ner': 'O'}),id=24), Annotation(115,119,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 27, 'deprel': 'case', 'ner': 'O'}),id=25), Annotation(121,124,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 27, 'deprel': 'det', 'ner': 'O'}),id=26), Annotation(125,132,Token,features=Features({'text': 'persons', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 24, 'deprel': 'nmod', 'ner': 'O'}),id=27), Annotation(133,139,Token,features=Features({'text': 'Barack', 'lemma': 'Barack', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 27, 'deprel': 'appos', 'ner': 'B-PERSON'}),id=28), Annotation(133,145,PERSON,features=Features({}),id=175), Annotation(140,145,Token,features=Features({'text': 'Obama', 'lemma': 'Obama', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=29), Annotation(146,148,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 31, 'deprel': 'cc', 'ner': 'O'}),id=30), Annotation(149,155,Token,features=Features({'text': 'Ursula', 'lemma': 'Ursula', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'conj', 'ner': 'B-PERSON'}),id=31), Annotation(149,169,PERSON,features=Features({}),id=176), Annotation(156,159,Token,features=Features({'text': 'von', 'lemma': 'von', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=32), Annotation(160,163,Token,features=Features({'text': 'der', 'lemma': 'der', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=33), Annotation(164,169,Token,features=Features({'text': 'Leyen', 'lemma': 'Leyen', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=34), Annotation(169,170,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 36, 'deprel': 'punct', 'ner': 'O'}),id=35), Annotation(171,180,Token,features=Features({'text': 'locations', 'lemma': 'location', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 27, 'deprel': 'conj', 'ner': 'O'}),id=36), Annotation(181,185,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 40, 'deprel': 'case', 'ner': 'O'}),id=37), Annotation(186,189,Token,features=Features({'text': 'New', 'lemma': 'New', 'upos': 'ADJ', 'xpos': 'NNP', 'Degree': 'Pos', 'head': 40, 'deprel': 'amod', 'ner': 'B-GPE'}),id=38), Annotation(186,199,GPE,features=Features({}),id=177), Annotation(190,194,Token,features=Features({'text': 'York', 'lemma': 'York', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'compound', 'ner': 'I-GPE'}),id=39), Annotation(195,199,Token,features=Features({'text': 'City', 'lemma': 'City', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 36, 'deprel': 'nmod', 'ner': 'E-GPE'}),id=40), Annotation(199,200,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 42, 'deprel': 'punct', 'ner': 'O'}),id=41), Annotation(201,207,Token,features=Features({'text': 'Vienna', 'lemma': 'Vienna', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=42), Annotation(201,207,GPE,features=Features({}),id=178), Annotation(208,210,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 44, 'deprel': 'cc', 'ner': 'O'}),id=43), Annotation(211,218,Token,features=Features({'text': 'Beijing', 'lemma': 'Beijing', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=44), Annotation(211,218,GPE,features=Features({}),id=179), Annotation(219,221,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 46, 'deprel': 'cc', 'ner': 'O'}),id=45), Annotation(222,231,Token,features=Features({'text': 'companies', 'lemma': 'company', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 40, 'deprel': 'conj', 'ner': 'O'}),id=46), Annotation(232,236,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 48, 'deprel': 'case', 'ner': 'O'}),id=47), Annotation(238,244,Token,features=Features({'text': 'Google', 'lemma': 'Google', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 46, 'deprel': 'nmod', 'ner': 'S-ORG'}),id=48), Annotation(238,244,ORG,features=Features({}),id=180), Annotation(244,245,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 50, 'deprel': 'punct', 'ner': 'O'}),id=49), Annotation(246,255,Token,features=Features({'text': 'UniCredit', 'lemma': 'UniCredit', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=50), Annotation(246,255,ORG,features=Features({}),id=181), Annotation(256,258,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 52, 'deprel': 'cc', 'ner': 'O'}),id=51), Annotation(259,265,Token,features=Features({'text': 'Huawei', 'lemma': 'Huawei', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=52), Annotation(259,265,ORG,features=Features({}),id=182), Annotation(265,266,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 15, 'deprel': 'punct', 'ner': 'O'}),id=53), Annotation(267,270,Token,features=Features({'text': 'And', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 67, 'deprel': 'cc', 'ner': 'O'}),id=55), Annotation(267,322,Sentence,features=Features({}),id=69), Annotation(271,275,Token,features=Features({'text': 'here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 69, 'deprel': 'root', 'ner': 'O'}),id=56), Annotation(276,278,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 56, 'deprel': 'cop', 'ner': 'O'}),id=57), Annotation(279,285,Token,features=Features({'text': 'Donald', 'lemma': 'Donald', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 56, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=58), Annotation(279,291,PERSON,features=Features({}),id=183), Annotation(286,291,Token,features=Features({'text': 'Trump', 'lemma': 'Trump', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 58, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=59), Annotation(291,292,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 56, 'deprel': 'punct', 'ner': 'O'}),id=60), Annotation(293,295,Token,features=Features({'text': 'it', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 67, 'deprel': 'nsubj', 'ner': 'O'}),id=61), Annotation(296,299,Token,features=Features({'text': 'may', 'lemma': 'may', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 67, 'deprel': 'aux', 'ner': 'O'}),id=62), Annotation(300,303,Token,features=Features({'text': 'not', 'lemma': 'not', 'upos': 'PART', 'xpos': 'RB', 'head': 67, 'deprel': 'advmod', 'ner': 'O'}),id=63), Annotation(304,306,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 67, 'deprel': 'cop', 'ner': 'O'}),id=64), Annotation(307,310,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 67, 'deprel': 'det', 'ner': 'O'}),id=65), Annotation(311,315,Token,features=Features({'text': 'real', 'lemma': 'real', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 67, 'deprel': 'amod', 'ner': 'O'}),id=66), Annotation(316,319,Token,features=Features({'text': 'one', 'lemma': 'one', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 56, 'deprel': 'parataxis', 'ner': 'O'}),id=67), Annotation(320,322,Token,features=Features({'text': ':P', 'lemma': ':p', 'upos': 'SYM', 'xpos': 'NFP', 'head': 56, 'deprel': 'discourse', 'ner': 'O'}),id=68), Annotation(324,328,Token,features=Features({'text': 'Lets', 'lemma': 'let', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 89, 'deprel': 'root', 'ner': 'O'}),id=70), Annotation(324,422,Sentence,features=Features({}),id=89), Annotation(329,332,Token,features=Features({'text': 'say', 'lemma': 'say', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 70, 'deprel': 'xcomp', 'ner': 'O'}),id=71), Annotation(333,338,Token,features=Features({'text': 'Boris', 'lemma': 'Boris', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 84, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=72), Annotation(333,346,PERSON,features=Features({}),id=184), Annotation(339,346,Token,features=Features({'text': 'Johnson', 'lemma': 'Johnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 72, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=73), Annotation(347,350,Token,features=Features({'text': 'aka', 'lemma': 'aka', 'upos': 'ADP', 'xpos': 'IN', 'head': 76, 'deprel': 'case', 'ner': 'O'}),id=74), Annotation(351,355,Token,features=Features({'text': 'Bojo', 'lemma': 'Bojo', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 76, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=75), Annotation(351,355,PERSON,features=Features({}),id=185), Annotation(356,362,Token,features=Features({'text': 'tweets', 'lemma': 'tweet', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 72, 'deprel': 'nmod', 'ner': 'O'}),id=76), Annotation(363,367,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 80, 'deprel': 'case', 'ner': 'O'}),id=77), Annotation(368,371,Token,features=Features({'text': 'his', 'lemma': 'he', 'upos': 'PRON', 'xpos': 'PRP$', 'Gender': 'Masc', 'Number': 'Sing', 'Person': '3', 'Poss': 'Yes', 'PronType': 'Prs', 'head': 80, 'deprel': 'nmod:poss', 'ner': 'O'}),id=78), Annotation(372,384,Token,features=Features({'text': 'BorisJohnson', 'lemma': 'BorisJohnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 80, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=79), Annotation(372,384,PERSON,features=Features({}),id=186), Annotation(385,392,Token,features=Features({'text': 'account', 'lemma': 'account', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 76, 'deprel': 'nmod', 'ner': 'O'}),id=80), Annotation(392,393,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 84, 'deprel': 'punct', 'ner': 'O'}),id=81), Annotation(394,399,Token,features=Features({'text': 'would', 'lemma': 'would', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 84, 'deprel': 'aux', 'ner': 'O'}),id=82), Annotation(400,402,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'cop', 'ner': 'O'}),id=83), Annotation(403,407,Token,features=Features({'text': 'nice', 'lemma': 'nice', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 70, 'deprel': 'parataxis', 'ner': 'O'}),id=84), Annotation(408,410,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'PART', 'xpos': 'TO', 'head': 86, 'deprel': 'mark', 'ner': 'O'}),id=85), Annotation(411,416,Token,features=Features({'text': 'match', 'lemma': 'match', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'advcl', 'ner': 'O'}),id=86), Annotation(417,421,Token,features=Features({'text': 'them', 'lemma': 'they', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Acc', 'Number': 'Plur', 'Person': '3', 'PronType': 'Prs', 'head': 86, 'deprel': 'obj', 'ner': 'O'}),id=87), Annotation(421,422,Token,features=Features({'text': '!', 'lemma': '!', 'upos': 'PUNCT', 'xpos': '.', 'head': 70, 'deprel': 'punct', 'ner': 'O'}),id=88), Annotation(424,428,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 92, 'deprel': 'advmod', 'ner': 'O'}),id=90), Annotation(424,675,Sentence,features=Features({}),id=143), Annotation(429,431,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 92, 'deprel': 'nsubj', 'ner': 'O'}),id=91), Annotation(432,439,Token,features=Features({'text': 'include', 'lemma': 'include', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 143, 'deprel': 'root', 'ner': 'O'}),id=92), Annotation(440,441,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 94, 'deprel': 'det', 'ner': 'O'}),id=93), Annotation(442,445,Token,features=Features({'text': 'URL', 'lemma': 'URL', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 92, 'deprel': 'obj', 'ner': 'O'}),id=94), Annotation(446,487,Token,features=Features({'text': 'https://gatenlp.github.io/python-gatenlp/', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'upos': 'X', 'xpos': 'ADD', 'head': 94, 'deprel': 'appos', 'ner': 'O'}),id=95), Annotation(489,492,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 100, 'deprel': 'cc', 'ner': 'O'}),id=96), Annotation(493,494,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 100, 'deprel': 'det', 'ner': 'O'}),id=97), Annotation(495,499,Token,features=Features({'text': 'fake', 'lemma': 'fake', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 100, 'deprel': 'amod', 'ner': 'O'}),id=98), Annotation(500,505,Token,features=Features({'text': 'email', 'lemma': 'email', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 100, 'deprel': 'compound', 'ner': 'O'}),id=99), Annotation(506,513,Token,features=Features({'text': 'address', 'lemma': 'address', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=100), Annotation(514,540,Token,features=Features({'text': 'john.doe@hiscoolserver.com', 'lemma': 'john.doe@hiscoolserver.com', 'upos': 'X', 'xpos': 'ADD', 'head': 100, 'deprel': 'appos', 'ner': 'O'}),id=101), Annotation(541,543,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADV', 'xpos': 'RB', 'head': 110, 'deprel': 'cc', 'ner': 'O'}),id=102), Annotation(544,548,Token,features=Features({'text': 'well', 'lemma': 'well', 'upos': 'ADV', 'xpos': 'RB', 'Degree': 'Pos', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=103), Annotation(550,552,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADP', 'xpos': 'IN', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=104), Annotation(553,554,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=105), Annotation(554,558,Token,features=Features({'text': 'some', 'lemma': 'some', 'upos': 'DET', 'xpos': 'DT', 'head': 110, 'deprel': 'det', 'ner': 'O'}),id=106), Annotation(559,560,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=107), Annotation(560,564,Token,features=Features({'text': 'cool', 'lemma': 'cool', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 110, 'deprel': 'amod', 'ner': 'O'}),id=108), Annotation(565,566,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=109), Annotation(566,573,Token,features=Features({'text': 'hastags', 'lemma': 'hastag', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=110), Annotation(574,577,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 113, 'deprel': 'cc', 'ner': 'O'}),id=111), Annotation(578,579,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 113, 'deprel': 'det', 'ner': 'O'}),id=112), Annotation(580,585,Token,features=Features({'text': 'bunch', 'lemma': 'bunch', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=113), Annotation(586,588,Token,features=Features({'text': 'of', 'lemma': 'of', 'upos': 'ADP', 'xpos': 'IN', 'head': 115, 'deprel': 'case', 'ner': 'O'}),id=114), Annotation(589,595,Token,features=Features({'text': 'emojis', 'lemma': 'emojis', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 113, 'deprel': 'nmod', 'ner': 'O'}),id=115), Annotation(596,600,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 117, 'deprel': 'case', 'ner': 'O'}),id=116), Annotation(601,602,Token,features=Features({'text': '😽', 'lemma': '😽', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 115, 'deprel': 'nmod', 'ner': 'O'}),id=117), Annotation(603,604,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=118), Annotation(604,605,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 121, 'deprel': 'det', 'ner': 'O'}),id=119), Annotation(606,613,Token,features=Features({'text': 'kissing', 'lemma': 'kissing', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 121, 'deprel': 'compound', 'ner': 'O'}),id=120), Annotation(614,617,Token,features=Features({'text': 'cat', 'lemma': 'cat', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'appos', 'ner': 'O'}),id=121), Annotation(617,618,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=122), Annotation(618,619,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 124, 'deprel': 'punct', 'ner': 'O'}),id=123), Annotation(620,623,Token,features=Features({'text': '👩\\u200d🏫', 'lemma': '👩\\u200d🏫', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=124), Annotation(624,625,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=125), Annotation(625,626,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 128, 'deprel': 'det', 'ner': 'O'}),id=126), Annotation(627,632,Token,features=Features({'text': 'woman', 'lemma': 'woman', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 128, 'deprel': 'compound', 'ner': 'O'}),id=127), Annotation(633,640,Token,features=Features({'text': 'teacher', 'lemma': 'teacher', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 124, 'deprel': 'appos', 'ner': 'O'}),id=128), Annotation(640,641,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=129), Annotation(641,642,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 131, 'deprel': 'punct', 'ner': 'O'}),id=130), Annotation(643,644,Token,features=Features({'text': '🧬', 'lemma': '🧬', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=131), Annotation(645,646,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=132), Annotation(646,649,Token,features=Features({'text': 'DNA', 'lemma': 'dna', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 131, 'deprel': 'appos', 'ner': 'O'}),id=133), Annotation(649,650,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=134), Annotation(650,651,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 136, 'deprel': 'punct', 'ner': 'O'}),id=135), Annotation(653,654,Token,features=Features({'text': '🧗', 'lemma': '🧗', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=136), Annotation(655,656,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=137), Annotation(656,657,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 139, 'deprel': 'det', 'ner': 'O'}),id=138), Annotation(658,664,Token,features=Features({'text': 'person', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 140, 'deprel': 'nsubj', 'ner': 'O'}),id=139), Annotation(665,673,Token,features=Features({'text': 'climbing', 'lemma': 'climb', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 136, 'deprel': 'appos', 'ner': 'O'}),id=140), Annotation(673,674,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=141), Annotation(674,675,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 92, 'deprel': 'punct', 'ner': 'O'}),id=142), Annotation(678,682,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 146, 'deprel': 'advmod', 'ner': 'O'}),id=144), Annotation(678,829,Sentence,features=Features({}),id=174), Annotation(683,685,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 146, 'deprel': 'nsubj', 'ner': 'O'}),id=145), Annotation(686,690,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Number': 'Plur', 'Person': '1', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 174, 'deprel': 'root', 'ner': 'O'}),id=146), Annotation(691,692,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 150, 'deprel': 'det', 'ner': 'O'}),id=147), Annotation(693,696,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=148), Annotation(697,706,Token,features=Features({'text': 'different', 'lemma': 'different', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=149), Annotation(707,714,Token,features=Features({'text': 'scripts', 'lemma': 'script', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 146, 'deprel': 'obj', 'ner': 'O'}),id=150), Annotation(714,715,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 153, 'deprel': 'punct', 'ner': 'O'}),id=151), Annotation(716,720,Token,features=Features({'text': 'e.g.', 'lemma': 'e.g.', 'upos': 'ADV', 'xpos': 'FW', 'Abbr': 'Yes', 'head': 153, 'deprel': 'advmod', 'ner': 'O'}),id=152), Annotation(721,727,Token,features=Features({'text': 'Hangul', 'lemma': 'Hangul', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'compound', 'ner': 'O'}),id=153), Annotation(728,730,Token,features=Features({'text': '한글', 'lemma': '한글', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=154), Annotation(731,733,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 158, 'deprel': 'cc', 'ner': 'O'}),id=155), Annotation(735,745,Token,features=Features({'text': 'simplified', 'lemma': 'simplify', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 158, 'deprel': 'amod', 'ner': 'O'}),id=156), Annotation(746,751,Token,features=Features({'text': 'Hanzi', 'lemma': 'Hanzi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 158, 'deprel': 'compound', 'ner': 'O'}),id=157), Annotation(752,754,Token,features=Features({'text': '汉字', 'lemma': '汉字', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=158), Annotation(755,757,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 161, 'deprel': 'cc', 'ner': 'O'}),id=159), Annotation(758,763,Token,features=Features({'text': 'Farsi', 'lemma': 'Farsi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'compound', 'ner': 'S-LANGUAGE'}),id=160), Annotation(758,763,LANGUAGE,features=Features({}),id=187), Annotation(764,769,Token,features=Features({'text': 'فارسی', 'lemma': 'فارسی', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=161), Annotation(770,773,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 163, 'deprel': 'cc', 'ner': 'O'}),id=162), Annotation(774,780,Token,features=Features({'text': 'Arabic', 'lemma': 'Arabic', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'conj', 'ner': 'S-LANGUAGE'}),id=163), Annotation(774,780,LANGUAGE,features=Features({}),id=188), Annotation(781,782,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 165, 'deprel': 'punct', 'ner': 'O'}),id=164), Annotation(782,797,Token,features=Features({'text': 'اَلْعَرَبِيَّةُ', 'lemma': 'gamommamgeramam', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=165), Annotation(797,798,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 168, 'deprel': 'punct', 'ner': 'O'}),id=166), Annotation(799,804,Token,features=Features({'text': 'which', 'lemma': 'which', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 168, 'deprel': 'nsubj', 'ner': 'O'}),id=167), Annotation(805,809,Token,features=Features({'text': 'goes', 'lemma': 'go', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 150, 'deprel': 'acl:relcl', 'ner': 'O'}),id=168), Annotation(810,814,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 170, 'deprel': 'case', 'ner': 'O'}),id=169), Annotation(815,820,Token,features=Features({'text': 'right', 'lemma': 'right', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 168, 'deprel': 'obl', 'ner': 'O'}),id=170), Annotation(821,823,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'ADP', 'xpos': 'IN', 'head': 172, 'deprel': 'case', 'ner': 'O'}),id=171), Annotation(824,828,Token,features=Features({'text': 'left', 'lemma': 'leave', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 168, 'deprel': 'advcl', 'ner': 'O'}),id=172), Annotation(828,829,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 146, 'deprel': 'punct', 'ner': 'O'}),id=173)]), '': AnnotationSet([Annotation(0,4,Token,features=Features({}),id=0), Annotation(5,7,Token,features=Features({}),id=1), Annotation(8,9,Token,features=Features({}),id=2), Annotation(10,14,Token,features=Features({}),id=3), Annotation(15,24,Token,features=Features({}),id=4), Annotation(26,28,Token,features=Features({}),id=5), Annotation(29,37,Token,features=Features({}),id=6), Annotation(38,42,Token,features=Features({}),id=7), Annotation(43,44,Token,features=Features({}),id=8), Annotation(45,48,Token,features=Features({}),id=9), Annotation(49,59,Token,features=Features({}),id=10), Annotation(61,65,Token,features=Features({}),id=11), Annotation(66,68,Token,features=Features({}),id=12), Annotation(69,70,Token,features=Features({}),id=13), Annotation(71,79,Token,features=Features({}),id=14), Annotation(80,84,Token,features=Features({}),id=15), Annotation(85,93,Token,features=Features({}),id=16), Annotation(94,95,Token,features=Features({}),id=17), Annotation(96,99,Token,features=Features({}),id=18), Annotation(100,105,Token,features=Features({}),id=19), Annotation(106,114,Token,features=Features({}),id=20), Annotation(115,119,Token,features=Features({}),id=21), Annotation(121,124,Token,features=Features({}),id=22), Annotation(125,132,Token,features=Features({}),id=23), Annotation(133,139,Token,features=Features({}),id=24), Annotation(140,145,Token,features=Features({}),id=25), Annotation(146,148,Token,features=Features({}),id=26), Annotation(149,155,Token,features=Features({}),id=27), Annotation(156,159,Token,features=Features({}),id=28), Annotation(160,163,Token,features=Features({}),id=29), Annotation(164,169,Token,features=Features({}),id=30), Annotation(169,170,Token,features=Features({}),id=31), Annotation(171,180,Token,features=Features({}),id=32), Annotation(181,185,Token,features=Features({}),id=33), Annotation(186,189,Token,features=Features({}),id=34), Annotation(190,194,Token,features=Features({}),id=35), Annotation(195,199,Token,features=Features({}),id=36), Annotation(199,200,Token,features=Features({}),id=37), Annotation(201,207,Token,features=Features({}),id=38), Annotation(208,210,Token,features=Features({}),id=39), Annotation(211,218,Token,features=Features({}),id=40), Annotation(219,221,Token,features=Features({}),id=41), Annotation(222,231,Token,features=Features({}),id=42), Annotation(232,236,Token,features=Features({}),id=43), Annotation(238,244,Token,features=Features({}),id=44), Annotation(244,245,Token,features=Features({}),id=45), Annotation(246,255,Token,features=Features({}),id=46), Annotation(256,258,Token,features=Features({}),id=47), Annotation(259,266,Token,features=Features({}),id=48), Annotation(267,270,Token,features=Features({}),id=49), Annotation(271,275,Token,features=Features({}),id=50), Annotation(276,278,Token,features=Features({}),id=51), Annotation(279,285,Token,features=Features({}),id=52), Annotation(286,291,Token,features=Features({}),id=53), Annotation(291,292,Token,features=Features({}),id=54), Annotation(293,295,Token,features=Features({}),id=55), Annotation(296,299,Token,features=Features({}),id=56), Annotation(300,303,Token,features=Features({}),id=57), Annotation(304,306,Token,features=Features({}),id=58), Annotation(307,310,Token,features=Features({}),id=59), Annotation(311,315,Token,features=Features({}),id=60), Annotation(316,319,Token,features=Features({}),id=61), Annotation(320,321,Token,features=Features({}),id=62), Annotation(321,322,Token,features=Features({}),id=63), Annotation(324,328,Token,features=Features({}),id=64), Annotation(329,332,Token,features=Features({}),id=65), Annotation(333,338,Token,features=Features({}),id=66), Annotation(339,346,Token,features=Features({}),id=67), Annotation(347,350,Token,features=Features({}),id=68), Annotation(351,355,Token,features=Features({}),id=69), Annotation(356,362,Token,features=Features({}),id=70), Annotation(363,367,Token,features=Features({}),id=71), Annotation(368,371,Token,features=Features({}),id=72), Annotation(372,384,Token,features=Features({}),id=73), Annotation(385,392,Token,features=Features({}),id=74), Annotation(392,393,Token,features=Features({}),id=75), Annotation(394,399,Token,features=Features({}),id=76), Annotation(400,402,Token,features=Features({}),id=77), Annotation(403,407,Token,features=Features({}),id=78), Annotation(408,410,Token,features=Features({}),id=79), Annotation(411,416,Token,features=Features({}),id=80), Annotation(417,421,Token,features=Features({}),id=81), Annotation(421,422,Token,features=Features({}),id=82), Annotation(424,428,Token,features=Features({}),id=83), Annotation(429,431,Token,features=Features({}),id=84), Annotation(432,439,Token,features=Features({}),id=85), Annotation(440,441,Token,features=Features({}),id=86), Annotation(442,445,Token,features=Features({}),id=87), Annotation(446,451,Token,features=Features({}),id=88), Annotation(451,452,Token,features=Features({}),id=89), Annotation(452,487,Token,features=Features({}),id=90), Annotation(489,492,Token,features=Features({}),id=91), Annotation(493,494,Token,features=Features({}),id=92), Annotation(495,499,Token,features=Features({}),id=93), Annotation(500,505,Token,features=Features({}),id=94), Annotation(506,513,Token,features=Features({}),id=95), Annotation(514,522,Token,features=Features({}),id=96), Annotation(522,523,Token,features=Features({}),id=97), Annotation(523,540,Token,features=Features({}),id=98), Annotation(541,543,Token,features=Features({}),id=99), Annotation(544,548,Token,features=Features({}),id=100), Annotation(550,552,Token,features=Features({}),id=101), Annotation(553,554,Token,features=Features({}),id=102), Annotation(554,558,Token,features=Features({}),id=103), Annotation(559,560,Token,features=Features({}),id=104), Annotation(560,564,Token,features=Features({}),id=105), Annotation(565,566,Token,features=Features({}),id=106), Annotation(566,573,Token,features=Features({}),id=107), Annotation(574,577,Token,features=Features({}),id=108), Annotation(578,579,Token,features=Features({}),id=109), Annotation(580,585,Token,features=Features({}),id=110), Annotation(586,588,Token,features=Features({}),id=111), Annotation(589,595,Token,features=Features({}),id=112), Annotation(596,600,Token,features=Features({}),id=113), Annotation(601,602,Token,features=Features({}),id=114), Annotation(603,604,Token,features=Features({}),id=115), Annotation(604,605,Token,features=Features({}),id=116), Annotation(606,613,Token,features=Features({}),id=117), Annotation(614,617,Token,features=Features({}),id=118), Annotation(617,618,Token,features=Features({}),id=119), Annotation(618,619,Token,features=Features({}),id=120), Annotation(620,623,Token,features=Features({}),id=121), Annotation(624,625,Token,features=Features({}),id=122), Annotation(625,626,Token,features=Features({}),id=123), Annotation(627,632,Token,features=Features({}),id=124), Annotation(633,640,Token,features=Features({}),id=125), Annotation(640,641,Token,features=Features({}),id=126), Annotation(641,642,Token,features=Features({}),id=127), Annotation(643,644,Token,features=Features({}),id=128), Annotation(645,646,Token,features=Features({}),id=129), Annotation(646,649,Token,features=Features({}),id=130), Annotation(649,650,Token,features=Features({}),id=131), Annotation(650,651,Token,features=Features({}),id=132), Annotation(653,654,Token,features=Features({}),id=133), Annotation(655,656,Token,features=Features({}),id=134), Annotation(656,657,Token,features=Features({}),id=135), Annotation(658,664,Token,features=Features({}),id=136), Annotation(665,673,Token,features=Features({}),id=137), Annotation(673,674,Token,features=Features({}),id=138), Annotation(674,675,Token,features=Features({}),id=139), Annotation(678,682,Token,features=Features({}),id=140), Annotation(683,685,Token,features=Features({}),id=141), Annotation(686,690,Token,features=Features({}),id=142), Annotation(691,692,Token,features=Features({}),id=143), Annotation(693,696,Token,features=Features({}),id=144), Annotation(697,706,Token,features=Features({}),id=145), Annotation(707,714,Token,features=Features({}),id=146), Annotation(714,715,Token,features=Features({}),id=147), Annotation(716,720,Token,features=Features({}),id=148), Annotation(721,727,Token,features=Features({}),id=149), Annotation(728,730,Token,features=Features({}),id=150), Annotation(731,733,Token,features=Features({}),id=151), Annotation(735,745,Token,features=Features({}),id=152), Annotation(746,751,Token,features=Features({}),id=153), Annotation(752,754,Token,features=Features({}),id=154), Annotation(755,757,Token,features=Features({}),id=155), Annotation(758,763,Token,features=Features({}),id=156), Annotation(764,769,Token,features=Features({}),id=157), Annotation(770,773,Token,features=Features({}),id=158), Annotation(774,780,Token,features=Features({}),id=159), Annotation(781,782,Token,features=Features({}),id=160), Annotation(782,797,Token,features=Features({}),id=161), Annotation(797,798,Token,features=Features({}),id=162), Annotation(799,804,Token,features=Features({}),id=163), Annotation(805,809,Token,features=Features({}),id=164), Annotation(810,814,Token,features=Features({}),id=165), Annotation(815,820,Token,features=Features({}),id=166), Annotation(821,823,Token,features=Features({}),id=167), Annotation(824,828,Token,features=Features({}),id=168), Annotation(828,829,Token,features=Features({}),id=169)]), 'TGaz1': AnnotationSet([Annotation(238,244,Lookup,features=Features({'what': 'company', 'country': 'Everywhere, really!'}),id=3), Annotation(279,291,Lookup,features=Features({'what': 'person', 'country': 'US'}),id=4), Annotation(333,346,Lookup,features=Features({'what': 'person', 'country': 'UK'}),id=5)])})" ] }, "execution_count": 53, "metadata": {}, "output_type": "execute_result" } ], "source": [ "doc2 = tgaz1(doc2)\n", "doc2" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## TokenGazetteer: listfile\n", "\n", "2) Load a list from a file, using JAVA GATE \"def\" format (https://gate.ac.uk/userguide/sec:annie:gazetteer):\n", "\n", "* A somthing.def file contains one line for each list file to use\n", "* Each somename.lst file contains one line with entry to match and arbitrary features\n", "\n", "E.g. `data/gaz1.def`:\n", "```\n", "persons.lst\n", "companies.lst\n", "```\n", "\n", "`data/persons.lst`:\n", "```\n", "Donald Trump what=person country=US\n", "Boris Johnson what=person country=UK\n", "```\n", "\n", "`data/companies.lst`:\n", "```\n", "Google where=Everywhere, really!\n", "```\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Gazetteer List (GATE def)" ] }, { "cell_type": "code", "execution_count": 54, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "tgaz2 = TokenGazetteer(\n", " source=\"data/gaz1.def\", source_fmt=\"gate-def\", annset_name=\"\", outset_name=\"TGaz2\", \n", " ann_type=\"Lookup\", source_tokenizer=nltk_tokenizer)\n", "doc2.annset(\"TGaz2\").clear()\n", "doc2 = tgaz2(doc2)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Gazetteer List\n", "\n", "Result when using the loaded GATE-style gazetteer files:" ] }, { "cell_type": "code", "execution_count": 55, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a test document.\n", "\n", "It contains just a few sentences. \n", "Here is a sentence that mentions a few named entities like \n", "the persons Barack Obama or Ursula von der Leyen, locations\n", "like New York City, Vienna or Beijing or companies like \n", "Google, UniCredit or Huawei. And here is Donald Trump, it may not be the real one :P\n", "\n", "Lets say Boris Johnson aka Bojo tweets from his BorisJohnson account, would be nice to match them!\n", "\n", "Here we include a URL https://gatenlp.github.io/python-gatenlp/ \n", "and a fake email address john.doe@hiscoolserver.com as well \n", "as #some #cool #hastags and a bunch of emojis like 😽 (a kissing cat),\n", "👩‍🏫 (a woman teacher), 🧬 (DNA), \n", "🧗 (a person climbing), \n", "\n", "Here we test a few different scripts, e.g. Hangul 한글 or \n", "simplified Hanzi 汉字 or Farsi فارسی and Arabic ,اَلْعَرَبِيَّةُ, which goes from right to left.\n", "\n", ",features=Features({}),anns={'Spacy': AnnotationSet([Annotation(0,4,Token,features=Features({'_i': 0, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'this', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12943039165150086467, 'pos': 'PRON', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'Xxxx', 'suffix': 'his', 'ent_type': '', 'dep': 'nsubj', 'head': 2, 'left_edge': 0, 'right_edge': 0}),id=0), Annotation(0,24,Sentence,features=Features({}),id=336), Annotation(0,4,NounChunk,features=Features({}),id=342), Annotation(4,5,SpaceToken,features=Features({'is_space': True}),id=1), Annotation(5,7,Token,features=Features({'_i': 1, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ROOT', 'head': 2, 'left_edge': 0, 'right_edge': 9}),id=2), Annotation(7,8,SpaceToken,features=Features({'is_space': True}),id=3), Annotation(8,9,Token,features=Features({'_i': 2, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 8, 'left_edge': 4, 'right_edge': 4}),id=4), Annotation(8,23,NounChunk,features=Features({}),id=343), Annotation(9,10,SpaceToken,features=Features({'is_space': True}),id=5), Annotation(10,14,Token,features=Features({'_i': 3, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'compound', 'head': 8, 'left_edge': 6, 'right_edge': 6}),id=6), Annotation(14,15,SpaceToken,features=Features({'is_space': True}),id=7), Annotation(15,23,Token,features=Features({'_i': 4, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'document', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2376024544662810659, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'attr', 'head': 2, 'left_edge': 4, 'right_edge': 8}),id=8), Annotation(23,24,Token,features=Features({'_i': 5, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 2, 'left_edge': 9, 'right_edge': 9}),id=9), Annotation(24,26,SpaceToken,features=Features({'_i': 6, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 13, 'left_edge': 10, 'right_edge': 10}),id=10), Annotation(24,59,Sentence,features=Features({}),id=337), Annotation(26,28,Token,features=Features({'_i': 7, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7859011591137717335, 'pos': 'PRON', 'prefix': 'I', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'Xx', 'suffix': 'It', 'ent_type': '', 'dep': 'nsubj', 'head': 13, 'left_edge': 11, 'right_edge': 11}),id=11), Annotation(26,28,NounChunk,features=Features({}),id=344), Annotation(28,29,SpaceToken,features=Features({'is_space': True}),id=12), Annotation(29,37,Token,features=Features({'_i': 8, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'contain', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4189568596636683296, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ins', 'ent_type': '', 'dep': 'ROOT', 'head': 13, 'left_edge': 10, 'right_edge': 22}),id=13), Annotation(37,38,SpaceToken,features=Features({'is_space': True}),id=14), Annotation(38,42,Token,features=Features({'_i': 9, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'just', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7148522813498185515, 'pos': 'ADV', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ust', 'ent_type': '', 'dep': 'advmod', 'head': 19, 'left_edge': 15, 'right_edge': 15}),id=15), Annotation(38,58,NounChunk,features=Features({}),id=345), Annotation(42,43,SpaceToken,features=Features({'is_space': True}),id=16), Annotation(43,44,Token,features=Features({'_i': 10, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 19, 'left_edge': 17, 'right_edge': 17}),id=17), Annotation(44,45,SpaceToken,features=Features({'is_space': True}),id=18), Annotation(45,48,Token,features=Features({'_i': 11, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 21, 'left_edge': 15, 'right_edge': 19}),id=19), Annotation(48,49,SpaceToken,features=Features({'is_space': True}),id=20), Annotation(49,58,Token,features=Features({'_i': 12, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5257340109698985342, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ces', 'ent_type': '', 'dep': 'dobj', 'head': 13, 'left_edge': 15, 'right_edge': 21}),id=21), Annotation(58,59,Token,features=Features({'_i': 13, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 13, 'left_edge': 22, 'right_edge': 22}),id=22), Annotation(59,60,SpaceToken,features=Features({'is_space': True}),id=23), Annotation(60,61,SpaceToken,features=Features({'_i': 14, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 27, 'left_edge': 24, 'right_edge': 24}),id=24), Annotation(60,266,Sentence,features=Features({}),id=338), Annotation(61,65,Token,features=Features({'_i': 15, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 27, 'left_edge': 25, 'right_edge': 25}),id=25), Annotation(65,66,SpaceToken,features=Features({'is_space': True}),id=26), Annotation(66,68,Token,features=Features({'_i': 16, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 68, 'left_edge': 24, 'right_edge': 65}),id=27), Annotation(68,69,SpaceToken,features=Features({'is_space': True}),id=28), Annotation(69,70,Token,features=Features({'_i': 17, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 31, 'left_edge': 29, 'right_edge': 29}),id=29), Annotation(69,79,NounChunk,features=Features({}),id=346), Annotation(70,71,SpaceToken,features=Features({'is_space': True}),id=30), Annotation(71,79,Token,features=Features({'_i': 18, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18108853898452662235, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nce', 'ent_type': '', 'dep': 'attr', 'head': 27, 'left_edge': 29, 'right_edge': 64}),id=31), Annotation(79,80,SpaceToken,features=Features({'is_space': True}),id=32), Annotation(80,84,Token,features=Features({'_i': 19, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'that', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4380130941430378203, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'hat', 'ent_type': '', 'dep': 'nsubj', 'head': 35, 'left_edge': 33, 'right_edge': 33}),id=33), Annotation(80,84,NounChunk,features=Features({}),id=347), Annotation(84,85,SpaceToken,features=Features({'is_space': True}),id=34), Annotation(85,93,Token,features=Features({'_i': 20, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'mention', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 834570530775529781, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'relcl', 'head': 31, 'left_edge': 33, 'right_edge': 64}),id=35), Annotation(93,94,SpaceToken,features=Features({'is_space': True}),id=36), Annotation(94,95,Token,features=Features({'_i': 21, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 43, 'left_edge': 37, 'right_edge': 37}),id=37), Annotation(94,114,NounChunk,features=Features({}),id=348), Annotation(95,96,SpaceToken,features=Features({'is_space': True}),id=38), Annotation(96,99,Token,features=Features({'_i': 22, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 39, 'right_edge': 39}),id=39), Annotation(99,100,SpaceToken,features=Features({'is_space': True}),id=40), Annotation(100,105,Token,features=Features({'_i': 23, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'name', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6719370519630147158, 'pos': 'VERB', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'med', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 41, 'right_edge': 41}),id=41), Annotation(105,106,SpaceToken,features=Features({'is_space': True}),id=42), Annotation(106,114,Token,features=Features({'_i': 24, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'entity', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7701268449352692754, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'dobj', 'head': 35, 'left_edge': 37, 'right_edge': 64}),id=43), Annotation(114,115,SpaceToken,features=Features({'is_space': True}),id=44), Annotation(115,119,Token,features=Features({'_i': 25, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 43, 'left_edge': 45, 'right_edge': 64}),id=45), Annotation(119,120,SpaceToken,features=Features({'is_space': True}),id=46), Annotation(120,121,SpaceToken,features=Features({'_i': 26, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 45, 'left_edge': 47, 'right_edge': 47}),id=47), Annotation(121,124,Token,features=Features({'_i': 27, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 50, 'left_edge': 48, 'right_edge': 48}),id=48), Annotation(121,132,NounChunk,features=Features({}),id=349), Annotation(124,125,SpaceToken,features=Features({'is_space': True}),id=49), Annotation(125,132,Token,features=Features({'_i': 28, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17520808660558581486, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'pobj', 'head': 45, 'left_edge': 48, 'right_edge': 64}),id=50), Annotation(132,133,SpaceToken,features=Features({'is_space': True}),id=51), Annotation(133,139,Token,features=Features({'_i': 29, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Barack', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15388493565120789335, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ack', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 54, 'left_edge': 52, 'right_edge': 52}),id=52), Annotation(133,145,PERSON,features=Features({'lemma': 'Barack Obama'}),id=318), Annotation(133,145,NounChunk,features=Features({}),id=350), Annotation(139,140,SpaceToken,features=Features({'is_space': True}),id=53), Annotation(140,145,Token,features=Features({'_i': 30, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Obama', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4857242187112322394, 'pos': 'PROPN', 'prefix': 'O', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ama', 'ent_type': 'PERSON', 'dep': 'appos', 'head': 50, 'left_edge': 52, 'right_edge': 64}),id=54), Annotation(145,146,SpaceToken,features=Features({'is_space': True}),id=55), Annotation(146,148,Token,features=Features({'_i': 31, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 54, 'left_edge': 56, 'right_edge': 56}),id=56), Annotation(148,149,SpaceToken,features=Features({'is_space': True}),id=57), Annotation(149,155,Token,features=Features({'_i': 32, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Ursula', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13791600832320669420, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ula', 'ent_type': 'GPE', 'dep': 'compound', 'head': 62, 'left_edge': 58, 'right_edge': 58}),id=58), Annotation(149,155,GPE,features=Features({'lemma': 'Ursula'}),id=319), Annotation(149,169,NounChunk,features=Features({}),id=351), Annotation(155,156,SpaceToken,features=Features({'is_space': True}),id=59), Annotation(156,159,Token,features=Features({'_i': 33, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'von', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7079615728476843702, 'pos': 'PROPN', 'prefix': 'v', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxx', 'suffix': 'von', 'ent_type': '', 'dep': 'compound', 'head': 62, 'left_edge': 60, 'right_edge': 60}),id=60), Annotation(159,160,SpaceToken,features=Features({'is_space': True}),id=61), Annotation(160,163,Token,features=Features({'_i': 34, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'der', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9250722957692387333, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'der', 'ent_type': '', 'dep': 'compound', 'head': 64, 'left_edge': 58, 'right_edge': 62}),id=62), Annotation(163,164,SpaceToken,features=Features({'is_space': True}),id=63), Annotation(164,169,Token,features=Features({'_i': 35, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Leyen', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13266048868047277473, 'pos': 'PROPN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'yen', 'ent_type': 'GPE', 'dep': 'conj', 'head': 54, 'left_edge': 58, 'right_edge': 64}),id=64), Annotation(164,169,GPE,features=Features({'lemma': 'Leyen'}),id=320), Annotation(169,170,Token,features=Features({'_i': 36, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 27, 'left_edge': 65, 'right_edge': 65}),id=65), Annotation(170,171,SpaceToken,features=Features({'is_space': True}),id=66), Annotation(171,180,Token,features=Features({'_i': 37, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'location', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13063876858105367867, 'pos': 'NOUN', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'nsubj', 'head': 68, 'left_edge': 67, 'right_edge': 67}),id=67), Annotation(171,180,NounChunk,features=Features({}),id=352), Annotation(180,181,SpaceToken,features=Features({'_i': 38, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 68, 'left_edge': 24, 'right_edge': 99}),id=68), Annotation(181,185,Token,features=Features({'_i': 39, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 68, 'left_edge': 69, 'right_edge': 98}),id=69), Annotation(185,186,SpaceToken,features=Features({'is_space': True}),id=70), Annotation(186,189,Token,features=Features({'_i': 40, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'New', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7503827727184870577, 'pos': 'PROPN', 'prefix': 'N', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxx', 'suffix': 'New', 'ent_type': 'GPE', 'dep': 'compound', 'head': 73, 'left_edge': 71, 'right_edge': 71}),id=71), Annotation(186,199,GPE,features=Features({'lemma': 'New York City'}),id=321), Annotation(186,199,NounChunk,features=Features({}),id=353), Annotation(189,190,SpaceToken,features=Features({'is_space': True}),id=72), Annotation(190,194,Token,features=Features({'_i': 41, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'York', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7898044819112200372, 'pos': 'PROPN', 'prefix': 'Y', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ork', 'ent_type': 'GPE', 'dep': 'compound', 'head': 75, 'left_edge': 71, 'right_edge': 73}),id=73), Annotation(194,195,SpaceToken,features=Features({'is_space': True}),id=74), Annotation(195,199,Token,features=Features({'_i': 42, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'City', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 728339108046626067, 'pos': 'PROPN', 'prefix': 'C', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ity', 'ent_type': 'GPE', 'dep': 'pobj', 'head': 69, 'left_edge': 71, 'right_edge': 98}),id=75), Annotation(199,200,Token,features=Features({'_i': 43, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 75, 'left_edge': 76, 'right_edge': 76}),id=76), Annotation(200,201,SpaceToken,features=Features({'is_space': True}),id=77), Annotation(201,207,Token,features=Features({'_i': 44, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Vienna', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10141676973022394345, 'pos': 'PROPN', 'prefix': 'V', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nna', 'ent_type': 'GPE', 'dep': 'conj', 'head': 75, 'left_edge': 78, 'right_edge': 98}),id=78), Annotation(201,207,GPE,features=Features({'lemma': 'Vienna'}),id=322), Annotation(201,207,NounChunk,features=Features({}),id=354), Annotation(207,208,SpaceToken,features=Features({'is_space': True}),id=79), Annotation(208,210,Token,features=Features({'_i': 45, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 80, 'right_edge': 80}),id=80), Annotation(210,211,SpaceToken,features=Features({'is_space': True}),id=81), Annotation(211,218,Token,features=Features({'_i': 46, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Beijing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2949147885855558195, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ing', 'ent_type': 'GPE', 'dep': 'conj', 'head': 78, 'left_edge': 82, 'right_edge': 82}),id=82), Annotation(211,218,GPE,features=Features({'lemma': 'Beijing'}),id=323), Annotation(211,218,NounChunk,features=Features({}),id=355), Annotation(218,219,SpaceToken,features=Features({'is_space': True}),id=83), Annotation(219,221,Token,features=Features({'_i': 47, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 84, 'right_edge': 84}),id=84), Annotation(221,222,SpaceToken,features=Features({'is_space': True}),id=85), Annotation(222,231,Token,features=Features({'_i': 48, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'company', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8026612326651866097, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'conj', 'head': 78, 'left_edge': 86, 'right_edge': 98}),id=86), Annotation(222,231,NounChunk,features=Features({}),id=356), Annotation(231,232,SpaceToken,features=Features({'is_space': True}),id=87), Annotation(232,236,Token,features=Features({'_i': 49, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 86, 'left_edge': 88, 'right_edge': 98}),id=88), Annotation(236,237,SpaceToken,features=Features({'is_space': True}),id=89), Annotation(237,238,SpaceToken,features=Features({'_i': 50, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 88, 'left_edge': 90, 'right_edge': 90}),id=90), Annotation(238,244,Token,features=Features({'_i': 51, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Google', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11578853341595296054, 'pos': 'PROPN', 'prefix': 'G', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gle', 'ent_type': '', 'dep': 'pobj', 'head': 88, 'left_edge': 91, 'right_edge': 98}),id=91), Annotation(238,244,NounChunk,features=Features({}),id=357), Annotation(244,245,Token,features=Features({'_i': 52, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 91, 'left_edge': 92, 'right_edge': 92}),id=92), Annotation(245,246,SpaceToken,features=Features({'is_space': True}),id=93), Annotation(246,255,Token,features=Features({'_i': 53, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'UniCredit', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7663443051765273511, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxXxxxx', 'suffix': 'dit', 'ent_type': 'ORG', 'dep': 'conj', 'head': 91, 'left_edge': 94, 'right_edge': 98}),id=94), Annotation(246,255,ORG,features=Features({'lemma': 'UniCredit'}),id=324), Annotation(246,255,NounChunk,features=Features({}),id=358), Annotation(255,256,SpaceToken,features=Features({'is_space': True}),id=95), Annotation(256,258,Token,features=Features({'_i': 54, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 94, 'left_edge': 96, 'right_edge': 96}),id=96), Annotation(258,259,SpaceToken,features=Features({'is_space': True}),id=97), Annotation(259,265,Token,features=Features({'_i': 55, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Huawei', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4639919019609272215, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'wei', 'ent_type': 'ORG', 'dep': 'conj', 'head': 94, 'left_edge': 98, 'right_edge': 98}),id=98), Annotation(259,265,ORG,features=Features({'lemma': 'Huawei'}),id=325), Annotation(259,265,NounChunk,features=Features({}),id=359), Annotation(265,266,Token,features=Features({'_i': 56, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 68, 'left_edge': 99, 'right_edge': 99}),id=99), Annotation(266,267,SpaceToken,features=Features({'is_space': True}),id=100), Annotation(267,270,Token,features=Features({'_i': 57, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12172435438170721471, 'pos': 'CCONJ', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'Xxx', 'suffix': 'And', 'ent_type': '', 'dep': 'cc', 'head': 105, 'left_edge': 101, 'right_edge': 101}),id=101), Annotation(267,422,Sentence,features=Features({}),id=339), Annotation(270,271,SpaceToken,features=Features({'is_space': True}),id=102), Annotation(271,275,Token,features=Features({'_i': 58, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 411390626470654571, 'pos': 'ADV', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 105, 'left_edge': 103, 'right_edge': 103}),id=103), Annotation(275,276,SpaceToken,features=Features({'is_space': True}),id=104), Annotation(276,278,Token,features=Features({'_i': 59, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 118, 'left_edge': 101, 'right_edge': 109}),id=105), Annotation(278,279,SpaceToken,features=Features({'is_space': True}),id=106), Annotation(279,285,Token,features=Features({'_i': 60, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Donald', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16889399016836222064, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ald', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 109, 'left_edge': 107, 'right_edge': 107}),id=107), Annotation(279,291,PERSON,features=Features({'lemma': 'Donald Trump'}),id=326), Annotation(279,291,NounChunk,features=Features({}),id=360), Annotation(285,286,SpaceToken,features=Features({'is_space': True}),id=108), Annotation(286,291,Token,features=Features({'_i': 61, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Trump', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1134333841961332695, 'pos': 'PROPN', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ump', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 105, 'left_edge': 107, 'right_edge': 109}),id=109), Annotation(291,292,Token,features=Features({'_i': 62, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 118, 'left_edge': 110, 'right_edge': 110}),id=110), Annotation(292,293,SpaceToken,features=Features({'is_space': True}),id=111), Annotation(293,295,Token,features=Features({'_i': 63, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10239237003504588839, 'pos': 'PRON', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'it', 'ent_type': '', 'dep': 'nsubj', 'head': 118, 'left_edge': 112, 'right_edge': 112}),id=112), Annotation(293,295,NounChunk,features=Features({}),id=361), Annotation(295,296,SpaceToken,features=Features({'is_space': True}),id=113), Annotation(296,299,Token,features=Features({'_i': 64, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'may', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14378475389916013800, 'pos': 'AUX', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxx', 'suffix': 'may', 'ent_type': '', 'dep': 'aux', 'head': 118, 'left_edge': 114, 'right_edge': 114}),id=114), Annotation(299,300,SpaceToken,features=Features({'is_space': True}),id=115), Annotation(300,303,Token,features=Features({'_i': 65, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'not', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 447765159362469301, 'pos': 'PART', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'not', 'ent_type': '', 'dep': 'neg', 'head': 118, 'left_edge': 116, 'right_edge': 116}),id=116), Annotation(303,304,SpaceToken,features=Features({'is_space': True}),id=117), Annotation(304,306,Token,features=Features({'_i': 66, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ccomp', 'head': 130, 'left_edge': 101, 'right_edge': 127}),id=118), Annotation(306,307,SpaceToken,features=Features({'is_space': True}),id=119), Annotation(307,310,Token,features=Features({'_i': 67, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 124, 'left_edge': 120, 'right_edge': 120}),id=120), Annotation(310,311,SpaceToken,features=Features({'is_space': True}),id=121), Annotation(311,315,Token,features=Features({'_i': 68, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'real', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7050692189315705983, 'pos': 'ADJ', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'eal', 'ent_type': '', 'dep': 'amod', 'head': 124, 'left_edge': 122, 'right_edge': 122}),id=122), Annotation(315,316,SpaceToken,features=Features({'is_space': True}),id=123), Annotation(316,319,Token,features=Features({'_i': 69, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'one', 'like_email': False, 'like_num': True, 'like_url': False, 'orth': 17454115351911680600, 'pos': 'NUM', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xxx', 'suffix': 'one', 'ent_type': '', 'dep': 'attr', 'head': 118, 'left_edge': 120, 'right_edge': 127}),id=124), Annotation(319,320,SpaceToken,features=Features({'is_space': True}),id=125), Annotation(320,322,Token,features=Features({'_i': 70, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': True, 'lang': 'en', 'lemma': ':P', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11282459806373970386, 'pos': 'PUNCT', 'prefix': ':', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ':', 'shape': ':X', 'suffix': ':P', 'ent_type': '', 'dep': 'punct', 'head': 124, 'left_edge': 126, 'right_edge': 126}),id=126), Annotation(322,324,SpaceToken,features=Features({'_i': 71, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 124, 'left_edge': 127, 'right_edge': 127}),id=127), Annotation(324,328,Token,features=Features({'_i': 72, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'let', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8631549979455155821, 'pos': 'NOUN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'Xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'nsubj', 'head': 130, 'left_edge': 128, 'right_edge': 128}),id=128), Annotation(324,328,NounChunk,features=Features({}),id=362), Annotation(328,329,SpaceToken,features=Features({'is_space': True}),id=129), Annotation(329,332,Token,features=Features({'_i': 73, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'say', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8685289367999165211, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxx', 'suffix': 'say', 'ent_type': '', 'dep': 'parataxis', 'head': 153, 'left_edge': 101, 'right_edge': 149}),id=130), Annotation(332,333,SpaceToken,features=Features({'is_space': True}),id=131), Annotation(333,338,Token,features=Features({'_i': 74, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Boris', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3926703778135926261, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ris', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 134, 'left_edge': 132, 'right_edge': 132}),id=132), Annotation(333,346,PERSON,features=Features({'lemma': 'Boris Johnson'}),id=327), Annotation(333,346,NounChunk,features=Features({}),id=363), Annotation(338,339,SpaceToken,features=Features({'is_space': True}),id=133), Annotation(339,346,Token,features=Features({'_i': 75, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Johnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14020170261514038406, 'pos': 'PROPN', 'prefix': 'J', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'son', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 130, 'left_edge': 132, 'right_edge': 148}),id=134), Annotation(346,347,SpaceToken,features=Features({'is_space': True}),id=135), Annotation(347,350,Token,features=Features({'_i': 76, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'aka', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6309978511773728404, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'aka', 'ent_type': '', 'dep': 'advmod', 'head': 140, 'left_edge': 136, 'right_edge': 136}),id=136), Annotation(347,362,NounChunk,features=Features({}),id=364), Annotation(350,351,SpaceToken,features=Features({'is_space': True}),id=137), Annotation(351,355,Token,features=Features({'_i': 77, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Bojo', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1381500864330977487, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ojo', 'ent_type': '', 'dep': 'compound', 'head': 140, 'left_edge': 138, 'right_edge': 138}),id=138), Annotation(355,356,SpaceToken,features=Features({'is_space': True}),id=139), Annotation(356,362,Token,features=Features({'_i': 78, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'tweet', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9158814828460388352, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'appos', 'head': 134, 'left_edge': 136, 'right_edge': 148}),id=140), Annotation(362,363,SpaceToken,features=Features({'is_space': True}),id=141), Annotation(363,367,Token,features=Features({'_i': 79, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 140, 'left_edge': 142, 'right_edge': 148}),id=142), Annotation(367,368,SpaceToken,features=Features({'is_space': True}),id=143), Annotation(368,371,Token,features=Features({'_i': 80, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'his', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2661093235354845946, 'pos': 'PRON', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP$', 'shape': 'xxx', 'suffix': 'his', 'ent_type': '', 'dep': 'poss', 'head': 148, 'left_edge': 144, 'right_edge': 144}),id=144), Annotation(368,392,NounChunk,features=Features({}),id=365), Annotation(371,372,SpaceToken,features=Features({'is_space': True}),id=145), Annotation(372,384,Token,features=Features({'_i': 81, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'BorisJohnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6152028365162425637, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxxxXxxxx', 'suffix': 'son', 'ent_type': 'ORG', 'dep': 'compound', 'head': 148, 'left_edge': 146, 'right_edge': 146}),id=146), Annotation(372,384,ORG,features=Features({'lemma': 'BorisJohnson'}),id=328), Annotation(384,385,SpaceToken,features=Features({'is_space': True}),id=147), Annotation(385,392,Token,features=Features({'_i': 82, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'account', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16051117652154763083, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'unt', 'ent_type': '', 'dep': 'pobj', 'head': 142, 'left_edge': 144, 'right_edge': 148}),id=148), Annotation(392,393,Token,features=Features({'_i': 83, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 130, 'left_edge': 149, 'right_edge': 149}),id=149), Annotation(393,394,SpaceToken,features=Features({'is_space': True}),id=150), Annotation(394,399,Token,features=Features({'_i': 84, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'would', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6992604926141104606, 'pos': 'AUX', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxxx', 'suffix': 'uld', 'ent_type': '', 'dep': 'aux', 'head': 153, 'left_edge': 151, 'right_edge': 151}),id=151), Annotation(399,400,SpaceToken,features=Features({'is_space': True}),id=152), Annotation(400,402,Token,features=Features({'_i': 85, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ROOT', 'head': 153, 'left_edge': 101, 'right_edge': 162}),id=153), Annotation(402,403,SpaceToken,features=Features({'is_space': True}),id=154), Annotation(403,407,Token,features=Features({'_i': 86, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'nice', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14121509715367036122, 'pos': 'ADJ', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ice', 'ent_type': '', 'dep': 'acomp', 'head': 153, 'left_edge': 155, 'right_edge': 155}),id=155), Annotation(407,408,SpaceToken,features=Features({'is_space': True}),id=156), Annotation(408,410,Token,features=Features({'_i': 87, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'PART', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'TO', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 159, 'left_edge': 157, 'right_edge': 157}),id=157), Annotation(410,411,SpaceToken,features=Features({'is_space': True}),id=158), Annotation(411,416,Token,features=Features({'_i': 88, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'match', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16065818573247886523, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xxxx', 'suffix': 'tch', 'ent_type': '', 'dep': 'xcomp', 'head': 153, 'left_edge': 157, 'right_edge': 161}),id=159), Annotation(416,417,SpaceToken,features=Features({'is_space': True}),id=160), Annotation(417,421,Token,features=Features({'_i': 89, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'they', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7976265300568155021, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xxxx', 'suffix': 'hem', 'ent_type': '', 'dep': 'dobj', 'head': 159, 'left_edge': 161, 'right_edge': 161}),id=161), Annotation(417,421,NounChunk,features=Features({}),id=366), Annotation(421,422,Token,features=Features({'_i': 90, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '!', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17494803046312582752, 'pos': 'PUNCT', 'prefix': '!', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '!', 'suffix': '!', 'ent_type': '', 'dep': 'punct', 'head': 153, 'left_edge': 162, 'right_edge': 162}),id=162), Annotation(422,424,SpaceToken,features=Features({'_i': 91, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 163, 'right_edge': 163}),id=163), Annotation(422,829,Sentence,features=Features({}),id=340), Annotation(424,428,Token,features=Features({'_i': 92, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 164, 'right_edge': 164}),id=164), Annotation(428,429,SpaceToken,features=Features({'is_space': True}),id=165), Annotation(429,431,Token,features=Features({'_i': 93, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 168, 'left_edge': 166, 'right_edge': 166}),id=166), Annotation(429,431,NounChunk,features=Features({}),id=367), Annotation(431,432,SpaceToken,features=Features({'is_space': True}),id=167), Annotation(432,439,Token,features=Features({'_i': 94, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'include', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14049642289933595219, 'pos': 'VERB', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'ude', 'ent_type': '', 'dep': 'ccomp', 'head': 265, 'left_edge': 163, 'right_edge': 257}),id=168), Annotation(439,440,SpaceToken,features=Features({'is_space': True}),id=169), Annotation(440,441,Token,features=Features({'_i': 95, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 172, 'left_edge': 170, 'right_edge': 170}),id=170), Annotation(441,442,SpaceToken,features=Features({'is_space': True}),id=171), Annotation(442,445,Token,features=Features({'_i': 96, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'url', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2582013287274679728, 'pos': 'NOUN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'XXX', 'suffix': 'URL', 'ent_type': '', 'dep': 'nmod', 'head': 176, 'left_edge': 170, 'right_edge': 174}),id=172), Annotation(445,446,SpaceToken,features=Features({'is_space': True}),id=173), Annotation(446,487,Token,features=Features({'_i': 97, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'like_email': False, 'like_num': False, 'like_url': True, 'orth': 616995933894448768, 'pos': 'X', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'FW', 'shape': 'xxxx://xxxx.xxxx.xx/xxxx-xxxx/', 'suffix': 'lp/', 'ent_type': '', 'dep': 'nummod', 'head': 172, 'left_edge': 174, 'right_edge': 174}),id=174), Annotation(487,488,SpaceToken,features=Features({'is_space': True}),id=175), Annotation(488,489,SpaceToken,features=Features({'_i': 98, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 170, 'right_edge': 187}),id=176), Annotation(489,492,Token,features=Features({'_i': 99, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 176, 'left_edge': 177, 'right_edge': 177}),id=177), Annotation(492,493,SpaceToken,features=Features({'is_space': True}),id=178), Annotation(493,494,Token,features=Features({'_i': 100, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 185, 'left_edge': 179, 'right_edge': 179}),id=179), Annotation(494,495,SpaceToken,features=Features({'is_space': True}),id=180), Annotation(495,499,Token,features=Features({'_i': 101, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'fake', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 513889413363281732, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ake', 'ent_type': '', 'dep': 'amod', 'head': 185, 'left_edge': 181, 'right_edge': 181}),id=181), Annotation(499,500,SpaceToken,features=Features({'is_space': True}),id=182), Annotation(500,505,Token,features=Features({'_i': 102, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'email', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7320900731437023467, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ail', 'ent_type': '', 'dep': 'compound', 'head': 185, 'left_edge': 183, 'right_edge': 183}),id=183), Annotation(505,506,SpaceToken,features=Features({'is_space': True}),id=184), Annotation(506,513,Token,features=Features({'_i': 103, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'address', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14349763737243715186, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ess', 'ent_type': '', 'dep': 'conj', 'head': 176, 'left_edge': 179, 'right_edge': 187}),id=185), Annotation(513,514,SpaceToken,features=Features({'is_space': True}),id=186), Annotation(514,540,Token,features=Features({'_i': 104, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'john.doe@hiscoolserver.com', 'like_email': True, 'like_num': False, 'like_url': False, 'orth': 8439314673651519470, 'pos': 'PROPN', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxxx.xxx@xxxx.xxx', 'suffix': 'com', 'ent_type': '', 'dep': 'appos', 'head': 185, 'left_edge': 187, 'right_edge': 187}),id=187), Annotation(514,540,NounChunk,features=Features({}),id=368), Annotation(540,541,SpaceToken,features=Features({'is_space': True}),id=188), Annotation(541,543,Token,features=Features({'_i': 105, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'advmod', 'head': 191, 'left_edge': 189, 'right_edge': 189}),id=189), Annotation(543,544,SpaceToken,features=Features({'is_space': True}),id=190), Annotation(544,548,Token,features=Features({'_i': 106, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'well', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4525988469032889948, 'pos': 'ADV', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ell', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 189, 'right_edge': 191}),id=191), Annotation(548,549,SpaceToken,features=Features({'is_space': True}),id=192), Annotation(549,550,SpaceToken,features=Features({'_i': 107, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 193, 'right_edge': 193}),id=193), Annotation(550,552,Token,features=Features({'_i': 108, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADP', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'prep', 'head': 168, 'left_edge': 194, 'right_edge': 203}),id=194), Annotation(552,553,SpaceToken,features=Features({'is_space': True}),id=195), Annotation(553,554,Token,features=Features({'_i': 109, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 199, 'left_edge': 196, 'right_edge': 196}),id=196), Annotation(553,560,MONEY,features=Features({'lemma': '#some #'}),id=329), Annotation(553,573,NounChunk,features=Features({}),id=369), Annotation(554,558,Token,features=Features({'_i': 110, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'some', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7000492816108906599, 'pos': 'DET', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxxx', 'suffix': 'ome', 'ent_type': 'MONEY', 'dep': 'det', 'head': 199, 'left_edge': 197, 'right_edge': 197}),id=197), Annotation(558,559,SpaceToken,features=Features({'is_space': True}),id=198), Annotation(559,560,Token,features=Features({'_i': 111, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'ADV', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RBR', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 203, 'left_edge': 196, 'right_edge': 200}),id=199), Annotation(560,564,Token,features=Features({'_i': 112, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cool', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16540862642162215401, 'pos': 'ADJ', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ool', 'ent_type': '', 'dep': 'amod', 'head': 199, 'left_edge': 200, 'right_edge': 200}),id=200), Annotation(564,565,SpaceToken,features=Features({'is_space': True}),id=201), Annotation(565,566,Token,features=Features({'_i': 113, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': '', 'dep': 'nmod', 'head': 203, 'left_edge': 202, 'right_edge': 202}),id=202), Annotation(566,573,Token,features=Features({'_i': 114, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'hastag', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12335920655888895378, 'pos': 'NOUN', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ags', 'ent_type': '', 'dep': 'pobj', 'head': 194, 'left_edge': 196, 'right_edge': 203}),id=203), Annotation(573,574,SpaceToken,features=Features({'is_space': True}),id=204), Annotation(574,577,Token,features=Features({'_i': 115, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 168, 'left_edge': 205, 'right_edge': 205}),id=205), Annotation(577,578,SpaceToken,features=Features({'is_space': True}),id=206), Annotation(578,579,Token,features=Features({'_i': 116, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 209, 'left_edge': 207, 'right_edge': 207}),id=207), Annotation(579,580,SpaceToken,features=Features({'is_space': True}),id=208), Annotation(580,585,Token,features=Features({'_i': 117, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'bunch', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16097686030507304001, 'pos': 'NOUN', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nch', 'ent_type': '', 'dep': 'conj', 'head': 168, 'left_edge': 207, 'right_edge': 225}),id=209), Annotation(585,586,SpaceToken,features=Features({'is_space': True}),id=210), Annotation(586,588,Token,features=Features({'_i': 118, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'of', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 886050111519832510, 'pos': 'ADP', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'of', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 211, 'right_edge': 213}),id=211), Annotation(588,589,SpaceToken,features=Features({'is_space': True}),id=212), Annotation(589,595,Token,features=Features({'_i': 119, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'emoji', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15308488336638733982, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'jis', 'ent_type': '', 'dep': 'pobj', 'head': 211, 'left_edge': 213, 'right_edge': 213}),id=213), Annotation(589,595,NounChunk,features=Features({}),id=370), Annotation(595,596,SpaceToken,features=Features({'is_space': True}),id=214), Annotation(596,600,Token,features=Features({'_i': 120, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 215, 'right_edge': 224}),id=215), Annotation(600,601,SpaceToken,features=Features({'is_space': True}),id=216), Annotation(601,602,Token,features=Features({'_i': 121, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '😽', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13410134130775049117, 'pos': 'PROPN', 'prefix': '😽', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '😽', 'suffix': '😽', 'ent_type': '', 'dep': 'pobj', 'head': 215, 'left_edge': 217, 'right_edge': 224}),id=217), Annotation(601,602,NounChunk,features=Features({}),id=371), Annotation(602,603,SpaceToken,features=Features({'is_space': True}),id=218), Annotation(603,604,Token,features=Features({'_i': 122, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 217, 'left_edge': 219, 'right_edge': 219}),id=219), Annotation(604,605,Token,features=Features({'_i': 123, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 224, 'left_edge': 220, 'right_edge': 220}),id=220), Annotation(604,617,NounChunk,features=Features({}),id=372), Annotation(605,606,SpaceToken,features=Features({'is_space': True}),id=221), Annotation(606,613,Token,features=Features({'_i': 124, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'kissing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17741139538205582276, 'pos': 'NOUN', 'prefix': 'k', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'amod', 'head': 224, 'left_edge': 222, 'right_edge': 222}),id=222), Annotation(613,614,SpaceToken,features=Features({'is_space': True}),id=223), Annotation(614,617,Token,features=Features({'_i': 125, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cat', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5439657043933447811, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'cat', 'ent_type': '', 'dep': 'appos', 'head': 217, 'left_edge': 220, 'right_edge': 224}),id=224), Annotation(617,618,Token,features=Features({'_i': 126, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 209, 'left_edge': 225, 'right_edge': 225}),id=225), Annotation(618,619,Token,features=Features({'_i': 127, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 168, 'left_edge': 226, 'right_edge': 226}),id=226), Annotation(619,620,SpaceToken,features=Features({'_i': 128, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 227, 'right_edge': 227}),id=227), Annotation(620,621,Token,features=Features({'_i': 129, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '👩', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13533745690575315714, 'pos': 'X', 'prefix': '👩', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '👩', 'suffix': '👩', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 228, 'right_edge': 228}),id=228), Annotation(621,622,Token,features=Features({'_i': 130, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\u200d', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17442142028047717517, 'pos': 'X', 'prefix': '\\u200d', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '\\u200d', 'suffix': '\\u200d', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 229, 'right_edge': 246}),id=229), Annotation(622,623,Token,features=Features({'_i': 131, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🏫', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7253895586487469997, 'pos': 'NOUN', 'prefix': '🏫', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': '🏫', 'suffix': '🏫', 'ent_type': '', 'dep': 'dobj', 'head': 229, 'left_edge': 230, 'right_edge': 246}),id=230), Annotation(622,623,NounChunk,features=Features({}),id=373), Annotation(623,624,SpaceToken,features=Features({'is_space': True}),id=231), Annotation(624,625,Token,features=Features({'_i': 132, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 232, 'right_edge': 232}),id=232), Annotation(625,626,Token,features=Features({'_i': 133, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 237, 'left_edge': 233, 'right_edge': 233}),id=233), Annotation(625,640,NounChunk,features=Features({}),id=374), Annotation(626,627,SpaceToken,features=Features({'is_space': True}),id=234), Annotation(627,632,Token,features=Features({'_i': 134, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'woman', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18308318389743014884, 'pos': 'NOUN', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'man', 'ent_type': '', 'dep': 'compound', 'head': 237, 'left_edge': 235, 'right_edge': 235}),id=235), Annotation(632,633,SpaceToken,features=Features({'is_space': True}),id=236), Annotation(633,640,Token,features=Features({'_i': 135, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'teacher', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 78820239457828971, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'her', 'ent_type': '', 'dep': 'appos', 'head': 230, 'left_edge': 233, 'right_edge': 237}),id=237), Annotation(640,641,Token,features=Features({'_i': 136, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 238, 'right_edge': 238}),id=238), Annotation(641,642,Token,features=Features({'_i': 137, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 239, 'right_edge': 239}),id=239), Annotation(642,643,SpaceToken,features=Features({'is_space': True}),id=240), Annotation(643,644,Token,features=Features({'_i': 138, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧬', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9031423621242036097, 'pos': 'PROPN', 'prefix': '🧬', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧬', 'suffix': '🧬', 'ent_type': 'PERSON', 'dep': 'conj', 'head': 230, 'left_edge': 241, 'right_edge': 245}),id=241), Annotation(643,644,PERSON,features=Features({'lemma': '🧬'}),id=330), Annotation(643,644,NounChunk,features=Features({}),id=375), Annotation(644,645,SpaceToken,features=Features({'is_space': True}),id=242), Annotation(645,646,Token,features=Features({'_i': 139, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 243, 'right_edge': 243}),id=243), Annotation(646,649,Token,features=Features({'_i': 140, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'DNA', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14386531691665947480, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XXX', 'suffix': 'DNA', 'ent_type': '', 'dep': 'appos', 'head': 241, 'left_edge': 244, 'right_edge': 244}),id=244), Annotation(646,649,NounChunk,features=Features({}),id=376), Annotation(649,650,Token,features=Features({'_i': 141, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 245, 'right_edge': 245}),id=245), Annotation(650,651,Token,features=Features({'_i': 142, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 246, 'right_edge': 246}),id=246), Annotation(651,652,SpaceToken,features=Features({'is_space': True}),id=247), Annotation(652,653,SpaceToken,features=Features({'_i': 143, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 256, 'left_edge': 248, 'right_edge': 248}),id=248), Annotation(653,654,Token,features=Features({'_i': 144, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧗', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7724749230788735883, 'pos': 'PROPN', 'prefix': '🧗', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧗', 'suffix': '🧗', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 249, 'right_edge': 249}),id=249), Annotation(653,654,NounChunk,features=Features({}),id=377), Annotation(654,655,SpaceToken,features=Features({'is_space': True}),id=250), Annotation(655,656,Token,features=Features({'_i': 145, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 251, 'right_edge': 251}),id=251), Annotation(656,657,Token,features=Features({'_i': 146, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 254, 'left_edge': 252, 'right_edge': 252}),id=252), Annotation(656,664,NounChunk,features=Features({}),id=378), Annotation(657,658,SpaceToken,features=Features({'is_space': True}),id=253), Annotation(658,664,Token,features=Features({'_i': 147, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14800503047316267216, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'son', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 252, 'right_edge': 254}),id=254), Annotation(664,665,SpaceToken,features=Features({'is_space': True}),id=255), Annotation(665,673,Token,features=Features({'_i': 148, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'climb', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 992864226358664410, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBG', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'ccomp', 'head': 168, 'left_edge': 248, 'right_edge': 257}),id=256), Annotation(673,674,Token,features=Features({'_i': 149, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 257, 'right_edge': 257}),id=257), Annotation(674,675,Token,features=Features({'_i': 150, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 258, 'right_edge': 258}),id=258), Annotation(675,676,SpaceToken,features=Features({'is_space': True}),id=259), Annotation(676,678,SpaceToken,features=Features({'_i': 151, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 265, 'left_edge': 260, 'right_edge': 260}),id=260), Annotation(678,682,Token,features=Features({'_i': 152, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 265, 'left_edge': 261, 'right_edge': 261}),id=261), Annotation(682,683,SpaceToken,features=Features({'is_space': True}),id=262), Annotation(683,685,Token,features=Features({'_i': 153, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 265, 'left_edge': 263, 'right_edge': 263}),id=263), Annotation(683,685,NounChunk,features=Features({}),id=379), Annotation(685,686,SpaceToken,features=Features({'is_space': True}),id=264), Annotation(686,690,Token,features=Features({'_i': 154, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'VERB', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'ROOT', 'head': 265, 'left_edge': 163, 'right_edge': 316}),id=265), Annotation(690,691,SpaceToken,features=Features({'is_space': True}),id=266), Annotation(691,692,Token,features=Features({'_i': 155, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 269, 'left_edge': 267, 'right_edge': 267}),id=267), Annotation(691,714,NounChunk,features=Features({}),id=380), Annotation(692,693,SpaceToken,features=Features({'is_space': True}),id=268), Annotation(693,696,Token,features=Features({'_i': 156, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 273, 'left_edge': 267, 'right_edge': 269}),id=269), Annotation(696,697,SpaceToken,features=Features({'is_space': True}),id=270), Annotation(697,706,Token,features=Features({'_i': 157, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'different', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17379251038783395147, 'pos': 'ADJ', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'amod', 'head': 273, 'left_edge': 271, 'right_edge': 271}),id=271), Annotation(706,707,SpaceToken,features=Features({'is_space': True}),id=272), Annotation(707,714,Token,features=Features({'_i': 158, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'script', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5755375431925009742, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'pts', 'ent_type': '', 'dep': 'dobj', 'head': 265, 'left_edge': 267, 'right_edge': 315}),id=273), Annotation(714,715,Token,features=Features({'_i': 159, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 273, 'left_edge': 274, 'right_edge': 274}),id=274), Annotation(715,716,SpaceToken,features=Features({'is_space': True}),id=275), Annotation(716,720,Token,features=Features({'_i': 160, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'e.g.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2360781233418247603, 'pos': 'ADV', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'x.x.', 'suffix': '.g.', 'ent_type': '', 'dep': 'advmod', 'head': 280, 'left_edge': 276, 'right_edge': 276}),id=276), Annotation(716,730,NounChunk,features=Features({}),id=381), Annotation(720,721,SpaceToken,features=Features({'is_space': True}),id=277), Annotation(721,727,Token,features=Features({'_i': 161, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hangul', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13901273419332601270, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gul', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 280, 'left_edge': 278, 'right_edge': 278}),id=278), Annotation(721,727,PERSON,features=Features({'lemma': 'Hangul'}),id=331), Annotation(727,728,SpaceToken,features=Features({'is_space': True}),id=279), Annotation(728,730,Token,features=Features({'_i': 162, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '한글', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14226613016943586999, 'pos': 'PROPN', 'prefix': '한', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '한글', 'ent_type': '', 'dep': 'appos', 'head': 273, 'left_edge': 276, 'right_edge': 282}),id=280), Annotation(730,731,SpaceToken,features=Features({'is_space': True}),id=281), Annotation(731,733,Token,features=Features({'_i': 163, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 280, 'left_edge': 282, 'right_edge': 282}),id=282), Annotation(733,734,SpaceToken,features=Features({'is_space': True}),id=283), Annotation(734,735,SpaceToken,features=Features({'_i': 164, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 273, 'left_edge': 284, 'right_edge': 284}),id=284), Annotation(735,745,Token,features=Features({'_i': 165, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'simplify', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15400283218552125021, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'ied', 'ent_type': '', 'dep': 'amod', 'head': 289, 'left_edge': 285, 'right_edge': 285}),id=285), Annotation(735,754,NounChunk,features=Features({}),id=382), Annotation(745,746,SpaceToken,features=Features({'is_space': True}),id=286), Annotation(746,751,Token,features=Features({'_i': 166, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hanzi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7162399416189955260, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nzi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 289, 'left_edge': 287, 'right_edge': 287}),id=287), Annotation(746,751,PERSON,features=Features({'lemma': 'Hanzi'}),id=332), Annotation(751,752,SpaceToken,features=Features({'is_space': True}),id=288), Annotation(752,754,Token,features=Features({'_i': 167, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '汉字', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2455570606503990670, 'pos': 'PROPN', 'prefix': '汉', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '汉字', 'ent_type': '', 'dep': 'conj', 'head': 273, 'left_edge': 285, 'right_edge': 315}),id=289), Annotation(754,755,SpaceToken,features=Features({'is_space': True}),id=290), Annotation(755,757,Token,features=Features({'_i': 168, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 289, 'left_edge': 291, 'right_edge': 291}),id=291), Annotation(757,758,SpaceToken,features=Features({'is_space': True}),id=292), Annotation(758,763,Token,features=Features({'_i': 169, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Farsi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1631855311088449743, 'pos': 'PROPN', 'prefix': 'F', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'rsi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 295, 'left_edge': 293, 'right_edge': 293}),id=293), Annotation(758,763,PERSON,features=Features({'lemma': 'Farsi'}),id=333), Annotation(758,769,NounChunk,features=Features({}),id=383), Annotation(763,764,SpaceToken,features=Features({'is_space': True}),id=294), Annotation(764,769,Token,features=Features({'_i': 170, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'فارسی', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10192536850399601059, 'pos': 'NOUN', 'prefix': 'ف', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'رسی', 'ent_type': '', 'dep': 'conj', 'head': 289, 'left_edge': 293, 'right_edge': 299}),id=295), Annotation(769,770,SpaceToken,features=Features({'is_space': True}),id=296), Annotation(770,773,Token,features=Features({'_i': 171, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 295, 'left_edge': 297, 'right_edge': 297}),id=297), Annotation(773,774,SpaceToken,features=Features({'is_space': True}),id=298), Annotation(774,780,Token,features=Features({'_i': 172, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Arabic', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6558098436421695236, 'pos': 'PROPN', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'bic', 'ent_type': 'LANGUAGE', 'dep': 'conj', 'head': 295, 'left_edge': 299, 'right_edge': 299}),id=299), Annotation(774,780,LANGUAGE,features=Features({'lemma': 'Arabic'}),id=334), Annotation(774,780,NounChunk,features=Features({}),id=384), Annotation(780,781,SpaceToken,features=Features({'is_space': True}),id=300), Annotation(781,782,Token,features=Features({'_i': 173, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 301, 'right_edge': 301}),id=301), Annotation(782,797,Token,features=Features({'_i': 174, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'اَلْعَرَبِيَّةُ', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16849633969024660305, 'pos': 'NUM', 'prefix': 'ا', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xَxْxَxَxِxَّxُ', 'suffix': 'ّةُ', 'ent_type': 'GPE', 'dep': 'appos', 'head': 289, 'left_edge': 302, 'right_edge': 302}),id=302), Annotation(782,797,GPE,features=Features({'lemma': 'اَلْعَرَبِيَّةُ'}),id=335), Annotation(797,798,Token,features=Features({'_i': 175, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 303, 'right_edge': 303}),id=303), Annotation(798,799,SpaceToken,features=Features({'is_space': True}),id=304), Annotation(799,804,Token,features=Features({'_i': 176, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'which', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7063653163634019529, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'ich', 'ent_type': '', 'dep': 'nsubj', 'head': 307, 'left_edge': 305, 'right_edge': 305}),id=305), Annotation(799,804,NounChunk,features=Features({}),id=385), Annotation(804,805,SpaceToken,features=Features({'is_space': True}),id=306), Annotation(805,809,Token,features=Features({'_i': 177, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'go', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18224404223277189725, 'pos': 'VERB', 'prefix': 'g', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'oes', 'ent_type': '', 'dep': 'relcl', 'head': 289, 'left_edge': 305, 'right_edge': 315}),id=307), Annotation(809,810,SpaceToken,features=Features({'is_space': True}),id=308), Annotation(810,814,Token,features=Features({'_i': 178, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 307, 'left_edge': 309, 'right_edge': 315}),id=309), Annotation(814,815,SpaceToken,features=Features({'is_space': True}),id=310), Annotation(815,820,Token,features=Features({'_i': 179, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'right', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5943797630011647483, 'pos': 'NOUN', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ght', 'ent_type': '', 'dep': 'advmod', 'head': 313, 'left_edge': 311, 'right_edge': 311}),id=311), Annotation(820,821,SpaceToken,features=Features({'is_space': True}),id=312), Annotation(821,823,Token,features=Features({'_i': 180, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'AUX', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 315, 'left_edge': 311, 'right_edge': 313}),id=313), Annotation(823,824,SpaceToken,features=Features({'is_space': True}),id=314), Annotation(824,828,Token,features=Features({'_i': 181, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'leave', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4003250512168697582, 'pos': 'VERB', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'eft', 'ent_type': '', 'dep': 'pcomp', 'head': 309, 'left_edge': 311, 'right_edge': 315}),id=315), Annotation(828,829,Token,features=Features({'_i': 182, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 316, 'right_edge': 316}),id=316), Annotation(829,831,SpaceToken,features=Features({'_i': 183, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 317, 'left_edge': 317, 'right_edge': 317}),id=317), Annotation(829,831,Sentence,features=Features({}),id=341)]), 'Stanza': AnnotationSet([Annotation(0,4,Token,features=Features({'text': 'This', 'lemma': 'this', 'upos': 'PRON', 'xpos': 'DT', 'Number': 'Sing', 'PronType': 'Dem', 'head': 4, 'deprel': 'nsubj', 'ner': 'O'}),id=0), Annotation(0,24,Sentence,features=Features({}),id=6), Annotation(5,7,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 4, 'deprel': 'cop', 'ner': 'O'}),id=1), Annotation(8,9,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 4, 'deprel': 'det', 'ner': 'O'}),id=2), Annotation(10,14,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 4, 'deprel': 'compound', 'ner': 'O'}),id=3), Annotation(15,23,Token,features=Features({'text': 'document', 'lemma': 'document', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 6, 'deprel': 'root', 'ner': 'O'}),id=4), Annotation(23,24,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 4, 'deprel': 'punct', 'ner': 'O'}),id=5), Annotation(26,28,Token,features=Features({'text': 'It', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 8, 'deprel': 'nsubj', 'ner': 'O'}),id=7), Annotation(26,59,Sentence,features=Features({}),id=14), Annotation(29,37,Token,features=Features({'text': 'contains', 'lemma': 'contain', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 14, 'deprel': 'root', 'ner': 'O'}),id=8), Annotation(38,42,Token,features=Features({'text': 'just', 'lemma': 'just', 'upos': 'ADV', 'xpos': 'RB', 'head': 12, 'deprel': 'advmod', 'ner': 'O'}),id=9), Annotation(43,44,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 12, 'deprel': 'det', 'ner': 'O'}),id=10), Annotation(45,48,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 12, 'deprel': 'amod', 'ner': 'O'}),id=11), Annotation(49,58,Token,features=Features({'text': 'sentences', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 8, 'deprel': 'obj', 'ner': 'O'}),id=12), Annotation(58,59,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 8, 'deprel': 'punct', 'ner': 'O'}),id=13), Annotation(61,65,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 54, 'deprel': 'root', 'ner': 'O'}),id=15), Annotation(61,266,Sentence,features=Features({}),id=54), Annotation(66,68,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 15, 'deprel': 'cop', 'ner': 'O'}),id=16), Annotation(69,70,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 18, 'deprel': 'det', 'ner': 'O'}),id=17), Annotation(71,79,Token,features=Features({'text': 'sentence', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 15, 'deprel': 'nsubj', 'ner': 'O'}),id=18), Annotation(80,84,Token,features=Features({'text': 'that', 'lemma': 'that', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 20, 'deprel': 'nsubj', 'ner': 'O'}),id=19), Annotation(85,93,Token,features=Features({'text': 'mentions', 'lemma': 'mention', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 18, 'deprel': 'acl:relcl', 'ner': 'O'}),id=20), Annotation(94,95,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 24, 'deprel': 'det', 'ner': 'O'}),id=21), Annotation(96,99,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=22), Annotation(100,105,Token,features=Features({'text': 'named', 'lemma': 'name', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=23), Annotation(106,114,Token,features=Features({'text': 'entities', 'lemma': 'entity', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 20, 'deprel': 'obj', 'ner': 'O'}),id=24), Annotation(115,119,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 27, 'deprel': 'case', 'ner': 'O'}),id=25), Annotation(121,124,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 27, 'deprel': 'det', 'ner': 'O'}),id=26), Annotation(125,132,Token,features=Features({'text': 'persons', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 24, 'deprel': 'nmod', 'ner': 'O'}),id=27), Annotation(133,139,Token,features=Features({'text': 'Barack', 'lemma': 'Barack', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 27, 'deprel': 'appos', 'ner': 'B-PERSON'}),id=28), Annotation(133,145,PERSON,features=Features({}),id=175), Annotation(140,145,Token,features=Features({'text': 'Obama', 'lemma': 'Obama', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=29), Annotation(146,148,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 31, 'deprel': 'cc', 'ner': 'O'}),id=30), Annotation(149,155,Token,features=Features({'text': 'Ursula', 'lemma': 'Ursula', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'conj', 'ner': 'B-PERSON'}),id=31), Annotation(149,169,PERSON,features=Features({}),id=176), Annotation(156,159,Token,features=Features({'text': 'von', 'lemma': 'von', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=32), Annotation(160,163,Token,features=Features({'text': 'der', 'lemma': 'der', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=33), Annotation(164,169,Token,features=Features({'text': 'Leyen', 'lemma': 'Leyen', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=34), Annotation(169,170,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 36, 'deprel': 'punct', 'ner': 'O'}),id=35), Annotation(171,180,Token,features=Features({'text': 'locations', 'lemma': 'location', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 27, 'deprel': 'conj', 'ner': 'O'}),id=36), Annotation(181,185,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 40, 'deprel': 'case', 'ner': 'O'}),id=37), Annotation(186,189,Token,features=Features({'text': 'New', 'lemma': 'New', 'upos': 'ADJ', 'xpos': 'NNP', 'Degree': 'Pos', 'head': 40, 'deprel': 'amod', 'ner': 'B-GPE'}),id=38), Annotation(186,199,GPE,features=Features({}),id=177), Annotation(190,194,Token,features=Features({'text': 'York', 'lemma': 'York', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'compound', 'ner': 'I-GPE'}),id=39), Annotation(195,199,Token,features=Features({'text': 'City', 'lemma': 'City', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 36, 'deprel': 'nmod', 'ner': 'E-GPE'}),id=40), Annotation(199,200,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 42, 'deprel': 'punct', 'ner': 'O'}),id=41), Annotation(201,207,Token,features=Features({'text': 'Vienna', 'lemma': 'Vienna', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=42), Annotation(201,207,GPE,features=Features({}),id=178), Annotation(208,210,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 44, 'deprel': 'cc', 'ner': 'O'}),id=43), Annotation(211,218,Token,features=Features({'text': 'Beijing', 'lemma': 'Beijing', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=44), Annotation(211,218,GPE,features=Features({}),id=179), Annotation(219,221,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 46, 'deprel': 'cc', 'ner': 'O'}),id=45), Annotation(222,231,Token,features=Features({'text': 'companies', 'lemma': 'company', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 40, 'deprel': 'conj', 'ner': 'O'}),id=46), Annotation(232,236,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 48, 'deprel': 'case', 'ner': 'O'}),id=47), Annotation(238,244,Token,features=Features({'text': 'Google', 'lemma': 'Google', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 46, 'deprel': 'nmod', 'ner': 'S-ORG'}),id=48), Annotation(238,244,ORG,features=Features({}),id=180), Annotation(244,245,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 50, 'deprel': 'punct', 'ner': 'O'}),id=49), Annotation(246,255,Token,features=Features({'text': 'UniCredit', 'lemma': 'UniCredit', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=50), Annotation(246,255,ORG,features=Features({}),id=181), Annotation(256,258,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 52, 'deprel': 'cc', 'ner': 'O'}),id=51), Annotation(259,265,Token,features=Features({'text': 'Huawei', 'lemma': 'Huawei', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=52), Annotation(259,265,ORG,features=Features({}),id=182), Annotation(265,266,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 15, 'deprel': 'punct', 'ner': 'O'}),id=53), Annotation(267,270,Token,features=Features({'text': 'And', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 67, 'deprel': 'cc', 'ner': 'O'}),id=55), Annotation(267,322,Sentence,features=Features({}),id=69), Annotation(271,275,Token,features=Features({'text': 'here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 69, 'deprel': 'root', 'ner': 'O'}),id=56), Annotation(276,278,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 56, 'deprel': 'cop', 'ner': 'O'}),id=57), Annotation(279,285,Token,features=Features({'text': 'Donald', 'lemma': 'Donald', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 56, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=58), Annotation(279,291,PERSON,features=Features({}),id=183), Annotation(286,291,Token,features=Features({'text': 'Trump', 'lemma': 'Trump', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 58, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=59), Annotation(291,292,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 56, 'deprel': 'punct', 'ner': 'O'}),id=60), Annotation(293,295,Token,features=Features({'text': 'it', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 67, 'deprel': 'nsubj', 'ner': 'O'}),id=61), Annotation(296,299,Token,features=Features({'text': 'may', 'lemma': 'may', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 67, 'deprel': 'aux', 'ner': 'O'}),id=62), Annotation(300,303,Token,features=Features({'text': 'not', 'lemma': 'not', 'upos': 'PART', 'xpos': 'RB', 'head': 67, 'deprel': 'advmod', 'ner': 'O'}),id=63), Annotation(304,306,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 67, 'deprel': 'cop', 'ner': 'O'}),id=64), Annotation(307,310,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 67, 'deprel': 'det', 'ner': 'O'}),id=65), Annotation(311,315,Token,features=Features({'text': 'real', 'lemma': 'real', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 67, 'deprel': 'amod', 'ner': 'O'}),id=66), Annotation(316,319,Token,features=Features({'text': 'one', 'lemma': 'one', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 56, 'deprel': 'parataxis', 'ner': 'O'}),id=67), Annotation(320,322,Token,features=Features({'text': ':P', 'lemma': ':p', 'upos': 'SYM', 'xpos': 'NFP', 'head': 56, 'deprel': 'discourse', 'ner': 'O'}),id=68), Annotation(324,328,Token,features=Features({'text': 'Lets', 'lemma': 'let', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 89, 'deprel': 'root', 'ner': 'O'}),id=70), Annotation(324,422,Sentence,features=Features({}),id=89), Annotation(329,332,Token,features=Features({'text': 'say', 'lemma': 'say', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 70, 'deprel': 'xcomp', 'ner': 'O'}),id=71), Annotation(333,338,Token,features=Features({'text': 'Boris', 'lemma': 'Boris', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 84, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=72), Annotation(333,346,PERSON,features=Features({}),id=184), Annotation(339,346,Token,features=Features({'text': 'Johnson', 'lemma': 'Johnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 72, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=73), Annotation(347,350,Token,features=Features({'text': 'aka', 'lemma': 'aka', 'upos': 'ADP', 'xpos': 'IN', 'head': 76, 'deprel': 'case', 'ner': 'O'}),id=74), Annotation(351,355,Token,features=Features({'text': 'Bojo', 'lemma': 'Bojo', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 76, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=75), Annotation(351,355,PERSON,features=Features({}),id=185), Annotation(356,362,Token,features=Features({'text': 'tweets', 'lemma': 'tweet', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 72, 'deprel': 'nmod', 'ner': 'O'}),id=76), Annotation(363,367,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 80, 'deprel': 'case', 'ner': 'O'}),id=77), Annotation(368,371,Token,features=Features({'text': 'his', 'lemma': 'he', 'upos': 'PRON', 'xpos': 'PRP$', 'Gender': 'Masc', 'Number': 'Sing', 'Person': '3', 'Poss': 'Yes', 'PronType': 'Prs', 'head': 80, 'deprel': 'nmod:poss', 'ner': 'O'}),id=78), Annotation(372,384,Token,features=Features({'text': 'BorisJohnson', 'lemma': 'BorisJohnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 80, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=79), Annotation(372,384,PERSON,features=Features({}),id=186), Annotation(385,392,Token,features=Features({'text': 'account', 'lemma': 'account', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 76, 'deprel': 'nmod', 'ner': 'O'}),id=80), Annotation(392,393,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 84, 'deprel': 'punct', 'ner': 'O'}),id=81), Annotation(394,399,Token,features=Features({'text': 'would', 'lemma': 'would', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 84, 'deprel': 'aux', 'ner': 'O'}),id=82), Annotation(400,402,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'cop', 'ner': 'O'}),id=83), Annotation(403,407,Token,features=Features({'text': 'nice', 'lemma': 'nice', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 70, 'deprel': 'parataxis', 'ner': 'O'}),id=84), Annotation(408,410,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'PART', 'xpos': 'TO', 'head': 86, 'deprel': 'mark', 'ner': 'O'}),id=85), Annotation(411,416,Token,features=Features({'text': 'match', 'lemma': 'match', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'advcl', 'ner': 'O'}),id=86), Annotation(417,421,Token,features=Features({'text': 'them', 'lemma': 'they', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Acc', 'Number': 'Plur', 'Person': '3', 'PronType': 'Prs', 'head': 86, 'deprel': 'obj', 'ner': 'O'}),id=87), Annotation(421,422,Token,features=Features({'text': '!', 'lemma': '!', 'upos': 'PUNCT', 'xpos': '.', 'head': 70, 'deprel': 'punct', 'ner': 'O'}),id=88), Annotation(424,428,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 92, 'deprel': 'advmod', 'ner': 'O'}),id=90), Annotation(424,675,Sentence,features=Features({}),id=143), Annotation(429,431,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 92, 'deprel': 'nsubj', 'ner': 'O'}),id=91), Annotation(432,439,Token,features=Features({'text': 'include', 'lemma': 'include', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 143, 'deprel': 'root', 'ner': 'O'}),id=92), Annotation(440,441,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 94, 'deprel': 'det', 'ner': 'O'}),id=93), Annotation(442,445,Token,features=Features({'text': 'URL', 'lemma': 'URL', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 92, 'deprel': 'obj', 'ner': 'O'}),id=94), Annotation(446,487,Token,features=Features({'text': 'https://gatenlp.github.io/python-gatenlp/', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'upos': 'X', 'xpos': 'ADD', 'head': 94, 'deprel': 'appos', 'ner': 'O'}),id=95), Annotation(489,492,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 100, 'deprel': 'cc', 'ner': 'O'}),id=96), Annotation(493,494,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 100, 'deprel': 'det', 'ner': 'O'}),id=97), Annotation(495,499,Token,features=Features({'text': 'fake', 'lemma': 'fake', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 100, 'deprel': 'amod', 'ner': 'O'}),id=98), Annotation(500,505,Token,features=Features({'text': 'email', 'lemma': 'email', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 100, 'deprel': 'compound', 'ner': 'O'}),id=99), Annotation(506,513,Token,features=Features({'text': 'address', 'lemma': 'address', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=100), Annotation(514,540,Token,features=Features({'text': 'john.doe@hiscoolserver.com', 'lemma': 'john.doe@hiscoolserver.com', 'upos': 'X', 'xpos': 'ADD', 'head': 100, 'deprel': 'appos', 'ner': 'O'}),id=101), Annotation(541,543,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADV', 'xpos': 'RB', 'head': 110, 'deprel': 'cc', 'ner': 'O'}),id=102), Annotation(544,548,Token,features=Features({'text': 'well', 'lemma': 'well', 'upos': 'ADV', 'xpos': 'RB', 'Degree': 'Pos', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=103), Annotation(550,552,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADP', 'xpos': 'IN', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=104), Annotation(553,554,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=105), Annotation(554,558,Token,features=Features({'text': 'some', 'lemma': 'some', 'upos': 'DET', 'xpos': 'DT', 'head': 110, 'deprel': 'det', 'ner': 'O'}),id=106), Annotation(559,560,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=107), Annotation(560,564,Token,features=Features({'text': 'cool', 'lemma': 'cool', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 110, 'deprel': 'amod', 'ner': 'O'}),id=108), Annotation(565,566,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=109), Annotation(566,573,Token,features=Features({'text': 'hastags', 'lemma': 'hastag', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=110), Annotation(574,577,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 113, 'deprel': 'cc', 'ner': 'O'}),id=111), Annotation(578,579,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 113, 'deprel': 'det', 'ner': 'O'}),id=112), Annotation(580,585,Token,features=Features({'text': 'bunch', 'lemma': 'bunch', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=113), Annotation(586,588,Token,features=Features({'text': 'of', 'lemma': 'of', 'upos': 'ADP', 'xpos': 'IN', 'head': 115, 'deprel': 'case', 'ner': 'O'}),id=114), Annotation(589,595,Token,features=Features({'text': 'emojis', 'lemma': 'emojis', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 113, 'deprel': 'nmod', 'ner': 'O'}),id=115), Annotation(596,600,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 117, 'deprel': 'case', 'ner': 'O'}),id=116), Annotation(601,602,Token,features=Features({'text': '😽', 'lemma': '😽', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 115, 'deprel': 'nmod', 'ner': 'O'}),id=117), Annotation(603,604,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=118), Annotation(604,605,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 121, 'deprel': 'det', 'ner': 'O'}),id=119), Annotation(606,613,Token,features=Features({'text': 'kissing', 'lemma': 'kissing', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 121, 'deprel': 'compound', 'ner': 'O'}),id=120), Annotation(614,617,Token,features=Features({'text': 'cat', 'lemma': 'cat', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'appos', 'ner': 'O'}),id=121), Annotation(617,618,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=122), Annotation(618,619,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 124, 'deprel': 'punct', 'ner': 'O'}),id=123), Annotation(620,623,Token,features=Features({'text': '👩\\u200d🏫', 'lemma': '👩\\u200d🏫', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=124), Annotation(624,625,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=125), Annotation(625,626,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 128, 'deprel': 'det', 'ner': 'O'}),id=126), Annotation(627,632,Token,features=Features({'text': 'woman', 'lemma': 'woman', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 128, 'deprel': 'compound', 'ner': 'O'}),id=127), Annotation(633,640,Token,features=Features({'text': 'teacher', 'lemma': 'teacher', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 124, 'deprel': 'appos', 'ner': 'O'}),id=128), Annotation(640,641,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=129), Annotation(641,642,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 131, 'deprel': 'punct', 'ner': 'O'}),id=130), Annotation(643,644,Token,features=Features({'text': '🧬', 'lemma': '🧬', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=131), Annotation(645,646,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=132), Annotation(646,649,Token,features=Features({'text': 'DNA', 'lemma': 'dna', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 131, 'deprel': 'appos', 'ner': 'O'}),id=133), Annotation(649,650,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=134), Annotation(650,651,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 136, 'deprel': 'punct', 'ner': 'O'}),id=135), Annotation(653,654,Token,features=Features({'text': '🧗', 'lemma': '🧗', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=136), Annotation(655,656,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=137), Annotation(656,657,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 139, 'deprel': 'det', 'ner': 'O'}),id=138), Annotation(658,664,Token,features=Features({'text': 'person', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 140, 'deprel': 'nsubj', 'ner': 'O'}),id=139), Annotation(665,673,Token,features=Features({'text': 'climbing', 'lemma': 'climb', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 136, 'deprel': 'appos', 'ner': 'O'}),id=140), Annotation(673,674,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=141), Annotation(674,675,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 92, 'deprel': 'punct', 'ner': 'O'}),id=142), Annotation(678,682,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 146, 'deprel': 'advmod', 'ner': 'O'}),id=144), Annotation(678,829,Sentence,features=Features({}),id=174), Annotation(683,685,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 146, 'deprel': 'nsubj', 'ner': 'O'}),id=145), Annotation(686,690,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Number': 'Plur', 'Person': '1', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 174, 'deprel': 'root', 'ner': 'O'}),id=146), Annotation(691,692,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 150, 'deprel': 'det', 'ner': 'O'}),id=147), Annotation(693,696,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=148), Annotation(697,706,Token,features=Features({'text': 'different', 'lemma': 'different', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=149), Annotation(707,714,Token,features=Features({'text': 'scripts', 'lemma': 'script', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 146, 'deprel': 'obj', 'ner': 'O'}),id=150), Annotation(714,715,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 153, 'deprel': 'punct', 'ner': 'O'}),id=151), Annotation(716,720,Token,features=Features({'text': 'e.g.', 'lemma': 'e.g.', 'upos': 'ADV', 'xpos': 'FW', 'Abbr': 'Yes', 'head': 153, 'deprel': 'advmod', 'ner': 'O'}),id=152), Annotation(721,727,Token,features=Features({'text': 'Hangul', 'lemma': 'Hangul', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'compound', 'ner': 'O'}),id=153), Annotation(728,730,Token,features=Features({'text': '한글', 'lemma': '한글', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=154), Annotation(731,733,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 158, 'deprel': 'cc', 'ner': 'O'}),id=155), Annotation(735,745,Token,features=Features({'text': 'simplified', 'lemma': 'simplify', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 158, 'deprel': 'amod', 'ner': 'O'}),id=156), Annotation(746,751,Token,features=Features({'text': 'Hanzi', 'lemma': 'Hanzi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 158, 'deprel': 'compound', 'ner': 'O'}),id=157), Annotation(752,754,Token,features=Features({'text': '汉字', 'lemma': '汉字', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=158), Annotation(755,757,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 161, 'deprel': 'cc', 'ner': 'O'}),id=159), Annotation(758,763,Token,features=Features({'text': 'Farsi', 'lemma': 'Farsi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'compound', 'ner': 'S-LANGUAGE'}),id=160), Annotation(758,763,LANGUAGE,features=Features({}),id=187), Annotation(764,769,Token,features=Features({'text': 'فارسی', 'lemma': 'فارسی', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=161), Annotation(770,773,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 163, 'deprel': 'cc', 'ner': 'O'}),id=162), Annotation(774,780,Token,features=Features({'text': 'Arabic', 'lemma': 'Arabic', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'conj', 'ner': 'S-LANGUAGE'}),id=163), Annotation(774,780,LANGUAGE,features=Features({}),id=188), Annotation(781,782,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 165, 'deprel': 'punct', 'ner': 'O'}),id=164), Annotation(782,797,Token,features=Features({'text': 'اَلْعَرَبِيَّةُ', 'lemma': 'gamommamgeramam', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=165), Annotation(797,798,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 168, 'deprel': 'punct', 'ner': 'O'}),id=166), Annotation(799,804,Token,features=Features({'text': 'which', 'lemma': 'which', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 168, 'deprel': 'nsubj', 'ner': 'O'}),id=167), Annotation(805,809,Token,features=Features({'text': 'goes', 'lemma': 'go', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 150, 'deprel': 'acl:relcl', 'ner': 'O'}),id=168), Annotation(810,814,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 170, 'deprel': 'case', 'ner': 'O'}),id=169), Annotation(815,820,Token,features=Features({'text': 'right', 'lemma': 'right', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 168, 'deprel': 'obl', 'ner': 'O'}),id=170), Annotation(821,823,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'ADP', 'xpos': 'IN', 'head': 172, 'deprel': 'case', 'ner': 'O'}),id=171), Annotation(824,828,Token,features=Features({'text': 'left', 'lemma': 'leave', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 168, 'deprel': 'advcl', 'ner': 'O'}),id=172), Annotation(828,829,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 146, 'deprel': 'punct', 'ner': 'O'}),id=173)]), '': AnnotationSet([Annotation(0,4,Token,features=Features({}),id=0), Annotation(5,7,Token,features=Features({}),id=1), Annotation(8,9,Token,features=Features({}),id=2), Annotation(10,14,Token,features=Features({}),id=3), Annotation(15,24,Token,features=Features({}),id=4), Annotation(26,28,Token,features=Features({}),id=5), Annotation(29,37,Token,features=Features({}),id=6), Annotation(38,42,Token,features=Features({}),id=7), Annotation(43,44,Token,features=Features({}),id=8), Annotation(45,48,Token,features=Features({}),id=9), Annotation(49,59,Token,features=Features({}),id=10), Annotation(61,65,Token,features=Features({}),id=11), Annotation(66,68,Token,features=Features({}),id=12), Annotation(69,70,Token,features=Features({}),id=13), Annotation(71,79,Token,features=Features({}),id=14), Annotation(80,84,Token,features=Features({}),id=15), Annotation(85,93,Token,features=Features({}),id=16), Annotation(94,95,Token,features=Features({}),id=17), Annotation(96,99,Token,features=Features({}),id=18), Annotation(100,105,Token,features=Features({}),id=19), Annotation(106,114,Token,features=Features({}),id=20), Annotation(115,119,Token,features=Features({}),id=21), Annotation(121,124,Token,features=Features({}),id=22), Annotation(125,132,Token,features=Features({}),id=23), Annotation(133,139,Token,features=Features({}),id=24), Annotation(140,145,Token,features=Features({}),id=25), Annotation(146,148,Token,features=Features({}),id=26), Annotation(149,155,Token,features=Features({}),id=27), Annotation(156,159,Token,features=Features({}),id=28), Annotation(160,163,Token,features=Features({}),id=29), Annotation(164,169,Token,features=Features({}),id=30), Annotation(169,170,Token,features=Features({}),id=31), Annotation(171,180,Token,features=Features({}),id=32), Annotation(181,185,Token,features=Features({}),id=33), Annotation(186,189,Token,features=Features({}),id=34), Annotation(190,194,Token,features=Features({}),id=35), Annotation(195,199,Token,features=Features({}),id=36), Annotation(199,200,Token,features=Features({}),id=37), Annotation(201,207,Token,features=Features({}),id=38), Annotation(208,210,Token,features=Features({}),id=39), Annotation(211,218,Token,features=Features({}),id=40), Annotation(219,221,Token,features=Features({}),id=41), Annotation(222,231,Token,features=Features({}),id=42), Annotation(232,236,Token,features=Features({}),id=43), Annotation(238,244,Token,features=Features({}),id=44), Annotation(244,245,Token,features=Features({}),id=45), Annotation(246,255,Token,features=Features({}),id=46), Annotation(256,258,Token,features=Features({}),id=47), Annotation(259,266,Token,features=Features({}),id=48), Annotation(267,270,Token,features=Features({}),id=49), Annotation(271,275,Token,features=Features({}),id=50), Annotation(276,278,Token,features=Features({}),id=51), Annotation(279,285,Token,features=Features({}),id=52), Annotation(286,291,Token,features=Features({}),id=53), Annotation(291,292,Token,features=Features({}),id=54), Annotation(293,295,Token,features=Features({}),id=55), Annotation(296,299,Token,features=Features({}),id=56), Annotation(300,303,Token,features=Features({}),id=57), Annotation(304,306,Token,features=Features({}),id=58), Annotation(307,310,Token,features=Features({}),id=59), Annotation(311,315,Token,features=Features({}),id=60), Annotation(316,319,Token,features=Features({}),id=61), Annotation(320,321,Token,features=Features({}),id=62), Annotation(321,322,Token,features=Features({}),id=63), Annotation(324,328,Token,features=Features({}),id=64), Annotation(329,332,Token,features=Features({}),id=65), Annotation(333,338,Token,features=Features({}),id=66), Annotation(339,346,Token,features=Features({}),id=67), Annotation(347,350,Token,features=Features({}),id=68), Annotation(351,355,Token,features=Features({}),id=69), Annotation(356,362,Token,features=Features({}),id=70), Annotation(363,367,Token,features=Features({}),id=71), Annotation(368,371,Token,features=Features({}),id=72), Annotation(372,384,Token,features=Features({}),id=73), Annotation(385,392,Token,features=Features({}),id=74), Annotation(392,393,Token,features=Features({}),id=75), Annotation(394,399,Token,features=Features({}),id=76), Annotation(400,402,Token,features=Features({}),id=77), Annotation(403,407,Token,features=Features({}),id=78), Annotation(408,410,Token,features=Features({}),id=79), Annotation(411,416,Token,features=Features({}),id=80), Annotation(417,421,Token,features=Features({}),id=81), Annotation(421,422,Token,features=Features({}),id=82), Annotation(424,428,Token,features=Features({}),id=83), Annotation(429,431,Token,features=Features({}),id=84), Annotation(432,439,Token,features=Features({}),id=85), Annotation(440,441,Token,features=Features({}),id=86), Annotation(442,445,Token,features=Features({}),id=87), Annotation(446,451,Token,features=Features({}),id=88), Annotation(451,452,Token,features=Features({}),id=89), Annotation(452,487,Token,features=Features({}),id=90), Annotation(489,492,Token,features=Features({}),id=91), Annotation(493,494,Token,features=Features({}),id=92), Annotation(495,499,Token,features=Features({}),id=93), Annotation(500,505,Token,features=Features({}),id=94), Annotation(506,513,Token,features=Features({}),id=95), Annotation(514,522,Token,features=Features({}),id=96), Annotation(522,523,Token,features=Features({}),id=97), Annotation(523,540,Token,features=Features({}),id=98), Annotation(541,543,Token,features=Features({}),id=99), Annotation(544,548,Token,features=Features({}),id=100), Annotation(550,552,Token,features=Features({}),id=101), Annotation(553,554,Token,features=Features({}),id=102), Annotation(554,558,Token,features=Features({}),id=103), Annotation(559,560,Token,features=Features({}),id=104), Annotation(560,564,Token,features=Features({}),id=105), Annotation(565,566,Token,features=Features({}),id=106), Annotation(566,573,Token,features=Features({}),id=107), Annotation(574,577,Token,features=Features({}),id=108), Annotation(578,579,Token,features=Features({}),id=109), Annotation(580,585,Token,features=Features({}),id=110), Annotation(586,588,Token,features=Features({}),id=111), Annotation(589,595,Token,features=Features({}),id=112), Annotation(596,600,Token,features=Features({}),id=113), Annotation(601,602,Token,features=Features({}),id=114), Annotation(603,604,Token,features=Features({}),id=115), Annotation(604,605,Token,features=Features({}),id=116), Annotation(606,613,Token,features=Features({}),id=117), Annotation(614,617,Token,features=Features({}),id=118), Annotation(617,618,Token,features=Features({}),id=119), Annotation(618,619,Token,features=Features({}),id=120), Annotation(620,623,Token,features=Features({}),id=121), Annotation(624,625,Token,features=Features({}),id=122), Annotation(625,626,Token,features=Features({}),id=123), Annotation(627,632,Token,features=Features({}),id=124), Annotation(633,640,Token,features=Features({}),id=125), Annotation(640,641,Token,features=Features({}),id=126), Annotation(641,642,Token,features=Features({}),id=127), Annotation(643,644,Token,features=Features({}),id=128), Annotation(645,646,Token,features=Features({}),id=129), Annotation(646,649,Token,features=Features({}),id=130), Annotation(649,650,Token,features=Features({}),id=131), Annotation(650,651,Token,features=Features({}),id=132), Annotation(653,654,Token,features=Features({}),id=133), Annotation(655,656,Token,features=Features({}),id=134), Annotation(656,657,Token,features=Features({}),id=135), Annotation(658,664,Token,features=Features({}),id=136), Annotation(665,673,Token,features=Features({}),id=137), Annotation(673,674,Token,features=Features({}),id=138), Annotation(674,675,Token,features=Features({}),id=139), Annotation(678,682,Token,features=Features({}),id=140), Annotation(683,685,Token,features=Features({}),id=141), Annotation(686,690,Token,features=Features({}),id=142), Annotation(691,692,Token,features=Features({}),id=143), Annotation(693,696,Token,features=Features({}),id=144), Annotation(697,706,Token,features=Features({}),id=145), Annotation(707,714,Token,features=Features({}),id=146), Annotation(714,715,Token,features=Features({}),id=147), Annotation(716,720,Token,features=Features({}),id=148), Annotation(721,727,Token,features=Features({}),id=149), Annotation(728,730,Token,features=Features({}),id=150), Annotation(731,733,Token,features=Features({}),id=151), Annotation(735,745,Token,features=Features({}),id=152), Annotation(746,751,Token,features=Features({}),id=153), Annotation(752,754,Token,features=Features({}),id=154), Annotation(755,757,Token,features=Features({}),id=155), Annotation(758,763,Token,features=Features({}),id=156), Annotation(764,769,Token,features=Features({}),id=157), Annotation(770,773,Token,features=Features({}),id=158), Annotation(774,780,Token,features=Features({}),id=159), Annotation(781,782,Token,features=Features({}),id=160), Annotation(782,797,Token,features=Features({}),id=161), Annotation(797,798,Token,features=Features({}),id=162), Annotation(799,804,Token,features=Features({}),id=163), Annotation(805,809,Token,features=Features({}),id=164), Annotation(810,814,Token,features=Features({}),id=165), Annotation(815,820,Token,features=Features({}),id=166), Annotation(821,823,Token,features=Features({}),id=167), Annotation(824,828,Token,features=Features({}),id=168), Annotation(828,829,Token,features=Features({}),id=169)]), 'TGaz1': AnnotationSet([Annotation(238,244,Lookup,features=Features({'what': 'company', 'country': 'Everywhere, really!'}),id=3), Annotation(279,291,Lookup,features=Features({'what': 'person', 'country': 'US'}),id=4), Annotation(333,346,Lookup,features=Features({'what': 'person', 'country': 'UK'}),id=5)]), 'TGaz2': AnnotationSet([Annotation(238,244,Lookup,features=Features({'where': 'Everywhere, really!'}),id=0), Annotation(279,291,Lookup,features=Features({'what': 'person', 'country': 'US'}),id=1), Annotation(333,346,Lookup,features=Features({'what': 'person', 'country': 'UK'}),id=2)])})" ] }, "execution_count": 55, "metadata": {}, "output_type": "execute_result" } ], "source": [ "doc2" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## TokenGazetteer\n", "\n", "Other features:\n", "\n", "* Match each gazetteer token against the feature of a document token, not the underlying document text (e.g. the lemma)\n", "* only match within another annotation type (e.g. only in noun phrases)\n", "* More to come: still WIP!\n", " * handle space tokens\n", " * handle split tokens (do not cross them!)\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Regexp Annotator\n", "\n", "* match multiple regular expressions\n", "* use macros to build complex REs from simpler parts\n", "* different strategies for how to handle multiple matches\n", "* assign annotation features from RE groups" ] }, { "cell_type": "code", "execution_count": 56, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "from gatenlp.processing.gazetteer import StringRegexAnnotator" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## RegexpAnnotator: rules" ] }, { "cell_type": "code", "execution_count": 57, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "rules = \"\"\"\n", "year=(19[0-9]{2}|20[0-9]{2})\n", "month=(0[0-9]|10|11|12)\n", "day=([012][0-9]|3[01])\n", "\n", "// The ISO date:\n", "|{{year}}-{{month}}-{{day}}\n", "0 => Date type=\"iso\", year=G1, month=G2, day=G3\n", "\n", "# The traditional way of writing a date:\n", "|{{day}}/({{month}})/{{year}}\n", "0 => Date type=\"traditional\", year=G3, month=G2, day=G1\n", "\"\"\"\n", "\n", "re_anntr = StringRegexAnnotator(source=rules, source_fmt=\"string\")\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## RegexpAnnotator: match" ] }, { "cell_type": "code", "execution_count": 58, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(\n", "A document that contains a date here: 2013-01-12 and also here: 14/02/1991. This should not \n", "get matched: 1833-12-21 and nor should this 45/03/2012 but this should 13/12/2012 and also\n", "this, despite not being a valid data: 31/02/2000\n", ",features=Features({}),anns={'': AnnotationSet([Annotation(39,49,Date,features=Features({'type': 'iso', 'year': '2013', 'month': '01', 'day': '12'}),id=0), Annotation(65,75,Date,features=Features({'type': 'traditional', 'year': '02', 'month': '02', 'day': '14'}),id=1), Annotation(165,175,Date,features=Features({'type': 'traditional', 'year': '12', 'month': '12', 'day': '13'}),id=2), Annotation(223,233,Date,features=Features({'type': 'traditional', 'year': '02', 'month': '02', 'day': '31'}),id=3)])})" ] }, "execution_count": 58, "metadata": {}, "output_type": "execute_result" } ], "source": [ "redoc = Document(\"\"\"\n", "A document that contains a date here: 2013-01-12 and also here: 14/02/1991. This should not \n", "get matched: 1833-12-21 and nor should this 45/03/2012 but this should 13/12/2012 and also\n", "this, despite not being a valid data: 31/02/2000\n", "\"\"\")\n", "re_anntr(redoc)\n", "redoc" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# PAMPAC\n", "\n", "* PAttern Matching through PArser Combinators\n", "* A pattern language for matching annotations and text\n", "* DSL (domain specific language) implemented as Python classes\n", "* Purpose: similar to Java GATE JAPE / JAPE Plus, but:\n", " * does not need a separate language\n", " * more powerful (e.g. directly match text AND annotations)\n", "\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PAMPAC - How to use\n", "\n", "* define **Rules**\n", "* each Rule consists of:\n", " * a **Pattern** to match in the document\n", " * an **Action** to perform if the pattern matches\n", "* create a **Pampac** instance from the rules\n", "* **run** the Pampac instance on the document, using a set of annotations and specify the output annotation set\n", " " ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PAMPAC - Example 1\n", "\n", "Lets create a rule that annotates any Token which is within a PERSON or ORG annotation:" ] }, { "cell_type": "code", "execution_count": 59, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "15" ] }, "execution_count": 59, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from gatenlp.pam.pampac import Ann, AnnAt, Rule, Pampac, AddAnn, N, Seq, Or\n", "from gatenlp.pam.matcher import FeatureMatcher, IfNot\n", "\n", "r1 = Rule(\n", " # first the pattern\n", " Or ( Ann(\"Token\", name=\"tok\").within(\"ORG\"),\n", " Ann(\"Token\", name=\"tok\").within(\"PERSON\")\n", " ),\n", " # then the action for the pattern\n", " AddAnn(name=\"tok\", type=\"PersOrOrg\")\n", ")\n", "# get the annotations we want to use for matching\n", "anns2match = doc2.annset(\"Stanza\").with_type([\"Token\", \"PERSON\", \"ORG\"])\n", "\n", "outset = doc2.annset(\"Pampac1\")\n", "outset.clear()\n", "# Create the Pampac instance from the single rule and run it on the annotations, also specify output set\n", "# The run method returns the list of offsets and the action return values where the rule matches in the doc\n", "Pampac(r1).run(doc2, anns2match, outset=outset)\n", "len(outset)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PAMPAC - Example 1\n", "\n" ] }, { "cell_type": "code", "execution_count": 60, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a test document.\n", "\n", "It contains just a few sentences. \n", "Here is a sentence that mentions a few named entities like \n", "the persons Barack Obama or Ursula von der Leyen, locations\n", "like New York City, Vienna or Beijing or companies like \n", "Google, UniCredit or Huawei. And here is Donald Trump, it may not be the real one :P\n", "\n", "Lets say Boris Johnson aka Bojo tweets from his BorisJohnson account, would be nice to match them!\n", "\n", "Here we include a URL https://gatenlp.github.io/python-gatenlp/ \n", "and a fake email address john.doe@hiscoolserver.com as well \n", "as #some #cool #hastags and a bunch of emojis like 😽 (a kissing cat),\n", "👩‍🏫 (a woman teacher), 🧬 (DNA), \n", "🧗 (a person climbing), \n", "\n", "Here we test a few different scripts, e.g. Hangul 한글 or \n", "simplified Hanzi 汉字 or Farsi فارسی and Arabic ,اَلْعَرَبِيَّةُ, which goes from right to left.\n", "\n", ",features=Features({}),anns={'Spacy': AnnotationSet([Annotation(0,4,Token,features=Features({'_i': 0, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'this', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12943039165150086467, 'pos': 'PRON', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'Xxxx', 'suffix': 'his', 'ent_type': '', 'dep': 'nsubj', 'head': 2, 'left_edge': 0, 'right_edge': 0}),id=0), Annotation(0,24,Sentence,features=Features({}),id=336), Annotation(0,4,NounChunk,features=Features({}),id=342), Annotation(4,5,SpaceToken,features=Features({'is_space': True}),id=1), Annotation(5,7,Token,features=Features({'_i': 1, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ROOT', 'head': 2, 'left_edge': 0, 'right_edge': 9}),id=2), Annotation(7,8,SpaceToken,features=Features({'is_space': True}),id=3), Annotation(8,9,Token,features=Features({'_i': 2, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 8, 'left_edge': 4, 'right_edge': 4}),id=4), Annotation(8,23,NounChunk,features=Features({}),id=343), Annotation(9,10,SpaceToken,features=Features({'is_space': True}),id=5), Annotation(10,14,Token,features=Features({'_i': 3, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'compound', 'head': 8, 'left_edge': 6, 'right_edge': 6}),id=6), Annotation(14,15,SpaceToken,features=Features({'is_space': True}),id=7), Annotation(15,23,Token,features=Features({'_i': 4, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'document', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2376024544662810659, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'attr', 'head': 2, 'left_edge': 4, 'right_edge': 8}),id=8), Annotation(23,24,Token,features=Features({'_i': 5, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 2, 'left_edge': 9, 'right_edge': 9}),id=9), Annotation(24,26,SpaceToken,features=Features({'_i': 6, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 13, 'left_edge': 10, 'right_edge': 10}),id=10), Annotation(24,59,Sentence,features=Features({}),id=337), Annotation(26,28,Token,features=Features({'_i': 7, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7859011591137717335, 'pos': 'PRON', 'prefix': 'I', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'Xx', 'suffix': 'It', 'ent_type': '', 'dep': 'nsubj', 'head': 13, 'left_edge': 11, 'right_edge': 11}),id=11), Annotation(26,28,NounChunk,features=Features({}),id=344), Annotation(28,29,SpaceToken,features=Features({'is_space': True}),id=12), Annotation(29,37,Token,features=Features({'_i': 8, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'contain', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4189568596636683296, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ins', 'ent_type': '', 'dep': 'ROOT', 'head': 13, 'left_edge': 10, 'right_edge': 22}),id=13), Annotation(37,38,SpaceToken,features=Features({'is_space': True}),id=14), Annotation(38,42,Token,features=Features({'_i': 9, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'just', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7148522813498185515, 'pos': 'ADV', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ust', 'ent_type': '', 'dep': 'advmod', 'head': 19, 'left_edge': 15, 'right_edge': 15}),id=15), Annotation(38,58,NounChunk,features=Features({}),id=345), Annotation(42,43,SpaceToken,features=Features({'is_space': True}),id=16), Annotation(43,44,Token,features=Features({'_i': 10, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 19, 'left_edge': 17, 'right_edge': 17}),id=17), Annotation(44,45,SpaceToken,features=Features({'is_space': True}),id=18), Annotation(45,48,Token,features=Features({'_i': 11, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 21, 'left_edge': 15, 'right_edge': 19}),id=19), Annotation(48,49,SpaceToken,features=Features({'is_space': True}),id=20), Annotation(49,58,Token,features=Features({'_i': 12, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5257340109698985342, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ces', 'ent_type': '', 'dep': 'dobj', 'head': 13, 'left_edge': 15, 'right_edge': 21}),id=21), Annotation(58,59,Token,features=Features({'_i': 13, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 13, 'left_edge': 22, 'right_edge': 22}),id=22), Annotation(59,60,SpaceToken,features=Features({'is_space': True}),id=23), Annotation(60,61,SpaceToken,features=Features({'_i': 14, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 27, 'left_edge': 24, 'right_edge': 24}),id=24), Annotation(60,266,Sentence,features=Features({}),id=338), Annotation(61,65,Token,features=Features({'_i': 15, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 27, 'left_edge': 25, 'right_edge': 25}),id=25), Annotation(65,66,SpaceToken,features=Features({'is_space': True}),id=26), Annotation(66,68,Token,features=Features({'_i': 16, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 68, 'left_edge': 24, 'right_edge': 65}),id=27), Annotation(68,69,SpaceToken,features=Features({'is_space': True}),id=28), Annotation(69,70,Token,features=Features({'_i': 17, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 31, 'left_edge': 29, 'right_edge': 29}),id=29), Annotation(69,79,NounChunk,features=Features({}),id=346), Annotation(70,71,SpaceToken,features=Features({'is_space': True}),id=30), Annotation(71,79,Token,features=Features({'_i': 18, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18108853898452662235, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nce', 'ent_type': '', 'dep': 'attr', 'head': 27, 'left_edge': 29, 'right_edge': 64}),id=31), Annotation(79,80,SpaceToken,features=Features({'is_space': True}),id=32), Annotation(80,84,Token,features=Features({'_i': 19, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'that', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4380130941430378203, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'hat', 'ent_type': '', 'dep': 'nsubj', 'head': 35, 'left_edge': 33, 'right_edge': 33}),id=33), Annotation(80,84,NounChunk,features=Features({}),id=347), Annotation(84,85,SpaceToken,features=Features({'is_space': True}),id=34), Annotation(85,93,Token,features=Features({'_i': 20, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'mention', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 834570530775529781, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'relcl', 'head': 31, 'left_edge': 33, 'right_edge': 64}),id=35), Annotation(93,94,SpaceToken,features=Features({'is_space': True}),id=36), Annotation(94,95,Token,features=Features({'_i': 21, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 43, 'left_edge': 37, 'right_edge': 37}),id=37), Annotation(94,114,NounChunk,features=Features({}),id=348), Annotation(95,96,SpaceToken,features=Features({'is_space': True}),id=38), Annotation(96,99,Token,features=Features({'_i': 22, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 39, 'right_edge': 39}),id=39), Annotation(99,100,SpaceToken,features=Features({'is_space': True}),id=40), Annotation(100,105,Token,features=Features({'_i': 23, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'name', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6719370519630147158, 'pos': 'VERB', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'med', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 41, 'right_edge': 41}),id=41), Annotation(105,106,SpaceToken,features=Features({'is_space': True}),id=42), Annotation(106,114,Token,features=Features({'_i': 24, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'entity', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7701268449352692754, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'dobj', 'head': 35, 'left_edge': 37, 'right_edge': 64}),id=43), Annotation(114,115,SpaceToken,features=Features({'is_space': True}),id=44), Annotation(115,119,Token,features=Features({'_i': 25, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 43, 'left_edge': 45, 'right_edge': 64}),id=45), Annotation(119,120,SpaceToken,features=Features({'is_space': True}),id=46), Annotation(120,121,SpaceToken,features=Features({'_i': 26, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 45, 'left_edge': 47, 'right_edge': 47}),id=47), Annotation(121,124,Token,features=Features({'_i': 27, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 50, 'left_edge': 48, 'right_edge': 48}),id=48), Annotation(121,132,NounChunk,features=Features({}),id=349), Annotation(124,125,SpaceToken,features=Features({'is_space': True}),id=49), Annotation(125,132,Token,features=Features({'_i': 28, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17520808660558581486, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'pobj', 'head': 45, 'left_edge': 48, 'right_edge': 64}),id=50), Annotation(132,133,SpaceToken,features=Features({'is_space': True}),id=51), Annotation(133,139,Token,features=Features({'_i': 29, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Barack', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15388493565120789335, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ack', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 54, 'left_edge': 52, 'right_edge': 52}),id=52), Annotation(133,145,PERSON,features=Features({'lemma': 'Barack Obama'}),id=318), Annotation(133,145,NounChunk,features=Features({}),id=350), Annotation(139,140,SpaceToken,features=Features({'is_space': True}),id=53), Annotation(140,145,Token,features=Features({'_i': 30, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Obama', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4857242187112322394, 'pos': 'PROPN', 'prefix': 'O', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ama', 'ent_type': 'PERSON', 'dep': 'appos', 'head': 50, 'left_edge': 52, 'right_edge': 64}),id=54), Annotation(145,146,SpaceToken,features=Features({'is_space': True}),id=55), Annotation(146,148,Token,features=Features({'_i': 31, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 54, 'left_edge': 56, 'right_edge': 56}),id=56), Annotation(148,149,SpaceToken,features=Features({'is_space': True}),id=57), Annotation(149,155,Token,features=Features({'_i': 32, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Ursula', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13791600832320669420, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ula', 'ent_type': 'GPE', 'dep': 'compound', 'head': 62, 'left_edge': 58, 'right_edge': 58}),id=58), Annotation(149,155,GPE,features=Features({'lemma': 'Ursula'}),id=319), Annotation(149,169,NounChunk,features=Features({}),id=351), Annotation(155,156,SpaceToken,features=Features({'is_space': True}),id=59), Annotation(156,159,Token,features=Features({'_i': 33, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'von', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7079615728476843702, 'pos': 'PROPN', 'prefix': 'v', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxx', 'suffix': 'von', 'ent_type': '', 'dep': 'compound', 'head': 62, 'left_edge': 60, 'right_edge': 60}),id=60), Annotation(159,160,SpaceToken,features=Features({'is_space': True}),id=61), Annotation(160,163,Token,features=Features({'_i': 34, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'der', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9250722957692387333, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'der', 'ent_type': '', 'dep': 'compound', 'head': 64, 'left_edge': 58, 'right_edge': 62}),id=62), Annotation(163,164,SpaceToken,features=Features({'is_space': True}),id=63), Annotation(164,169,Token,features=Features({'_i': 35, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Leyen', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13266048868047277473, 'pos': 'PROPN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'yen', 'ent_type': 'GPE', 'dep': 'conj', 'head': 54, 'left_edge': 58, 'right_edge': 64}),id=64), Annotation(164,169,GPE,features=Features({'lemma': 'Leyen'}),id=320), Annotation(169,170,Token,features=Features({'_i': 36, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 27, 'left_edge': 65, 'right_edge': 65}),id=65), Annotation(170,171,SpaceToken,features=Features({'is_space': True}),id=66), Annotation(171,180,Token,features=Features({'_i': 37, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'location', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13063876858105367867, 'pos': 'NOUN', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'nsubj', 'head': 68, 'left_edge': 67, 'right_edge': 67}),id=67), Annotation(171,180,NounChunk,features=Features({}),id=352), Annotation(180,181,SpaceToken,features=Features({'_i': 38, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 68, 'left_edge': 24, 'right_edge': 99}),id=68), Annotation(181,185,Token,features=Features({'_i': 39, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 68, 'left_edge': 69, 'right_edge': 98}),id=69), Annotation(185,186,SpaceToken,features=Features({'is_space': True}),id=70), Annotation(186,189,Token,features=Features({'_i': 40, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'New', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7503827727184870577, 'pos': 'PROPN', 'prefix': 'N', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxx', 'suffix': 'New', 'ent_type': 'GPE', 'dep': 'compound', 'head': 73, 'left_edge': 71, 'right_edge': 71}),id=71), Annotation(186,199,GPE,features=Features({'lemma': 'New York City'}),id=321), Annotation(186,199,NounChunk,features=Features({}),id=353), Annotation(189,190,SpaceToken,features=Features({'is_space': True}),id=72), Annotation(190,194,Token,features=Features({'_i': 41, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'York', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7898044819112200372, 'pos': 'PROPN', 'prefix': 'Y', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ork', 'ent_type': 'GPE', 'dep': 'compound', 'head': 75, 'left_edge': 71, 'right_edge': 73}),id=73), Annotation(194,195,SpaceToken,features=Features({'is_space': True}),id=74), Annotation(195,199,Token,features=Features({'_i': 42, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'City', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 728339108046626067, 'pos': 'PROPN', 'prefix': 'C', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ity', 'ent_type': 'GPE', 'dep': 'pobj', 'head': 69, 'left_edge': 71, 'right_edge': 98}),id=75), Annotation(199,200,Token,features=Features({'_i': 43, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 75, 'left_edge': 76, 'right_edge': 76}),id=76), Annotation(200,201,SpaceToken,features=Features({'is_space': True}),id=77), Annotation(201,207,Token,features=Features({'_i': 44, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Vienna', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10141676973022394345, 'pos': 'PROPN', 'prefix': 'V', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nna', 'ent_type': 'GPE', 'dep': 'conj', 'head': 75, 'left_edge': 78, 'right_edge': 98}),id=78), Annotation(201,207,GPE,features=Features({'lemma': 'Vienna'}),id=322), Annotation(201,207,NounChunk,features=Features({}),id=354), Annotation(207,208,SpaceToken,features=Features({'is_space': True}),id=79), Annotation(208,210,Token,features=Features({'_i': 45, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 80, 'right_edge': 80}),id=80), Annotation(210,211,SpaceToken,features=Features({'is_space': True}),id=81), Annotation(211,218,Token,features=Features({'_i': 46, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Beijing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2949147885855558195, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ing', 'ent_type': 'GPE', 'dep': 'conj', 'head': 78, 'left_edge': 82, 'right_edge': 82}),id=82), Annotation(211,218,GPE,features=Features({'lemma': 'Beijing'}),id=323), Annotation(211,218,NounChunk,features=Features({}),id=355), Annotation(218,219,SpaceToken,features=Features({'is_space': True}),id=83), Annotation(219,221,Token,features=Features({'_i': 47, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 84, 'right_edge': 84}),id=84), Annotation(221,222,SpaceToken,features=Features({'is_space': True}),id=85), Annotation(222,231,Token,features=Features({'_i': 48, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'company', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8026612326651866097, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'conj', 'head': 78, 'left_edge': 86, 'right_edge': 98}),id=86), Annotation(222,231,NounChunk,features=Features({}),id=356), Annotation(231,232,SpaceToken,features=Features({'is_space': True}),id=87), Annotation(232,236,Token,features=Features({'_i': 49, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 86, 'left_edge': 88, 'right_edge': 98}),id=88), Annotation(236,237,SpaceToken,features=Features({'is_space': True}),id=89), Annotation(237,238,SpaceToken,features=Features({'_i': 50, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 88, 'left_edge': 90, 'right_edge': 90}),id=90), Annotation(238,244,Token,features=Features({'_i': 51, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Google', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11578853341595296054, 'pos': 'PROPN', 'prefix': 'G', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gle', 'ent_type': '', 'dep': 'pobj', 'head': 88, 'left_edge': 91, 'right_edge': 98}),id=91), Annotation(238,244,NounChunk,features=Features({}),id=357), Annotation(244,245,Token,features=Features({'_i': 52, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 91, 'left_edge': 92, 'right_edge': 92}),id=92), Annotation(245,246,SpaceToken,features=Features({'is_space': True}),id=93), Annotation(246,255,Token,features=Features({'_i': 53, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'UniCredit', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7663443051765273511, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxXxxxx', 'suffix': 'dit', 'ent_type': 'ORG', 'dep': 'conj', 'head': 91, 'left_edge': 94, 'right_edge': 98}),id=94), Annotation(246,255,ORG,features=Features({'lemma': 'UniCredit'}),id=324), Annotation(246,255,NounChunk,features=Features({}),id=358), Annotation(255,256,SpaceToken,features=Features({'is_space': True}),id=95), Annotation(256,258,Token,features=Features({'_i': 54, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 94, 'left_edge': 96, 'right_edge': 96}),id=96), Annotation(258,259,SpaceToken,features=Features({'is_space': True}),id=97), Annotation(259,265,Token,features=Features({'_i': 55, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Huawei', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4639919019609272215, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'wei', 'ent_type': 'ORG', 'dep': 'conj', 'head': 94, 'left_edge': 98, 'right_edge': 98}),id=98), Annotation(259,265,ORG,features=Features({'lemma': 'Huawei'}),id=325), Annotation(259,265,NounChunk,features=Features({}),id=359), Annotation(265,266,Token,features=Features({'_i': 56, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 68, 'left_edge': 99, 'right_edge': 99}),id=99), Annotation(266,267,SpaceToken,features=Features({'is_space': True}),id=100), Annotation(267,270,Token,features=Features({'_i': 57, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12172435438170721471, 'pos': 'CCONJ', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'Xxx', 'suffix': 'And', 'ent_type': '', 'dep': 'cc', 'head': 105, 'left_edge': 101, 'right_edge': 101}),id=101), Annotation(267,422,Sentence,features=Features({}),id=339), Annotation(270,271,SpaceToken,features=Features({'is_space': True}),id=102), Annotation(271,275,Token,features=Features({'_i': 58, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 411390626470654571, 'pos': 'ADV', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 105, 'left_edge': 103, 'right_edge': 103}),id=103), Annotation(275,276,SpaceToken,features=Features({'is_space': True}),id=104), Annotation(276,278,Token,features=Features({'_i': 59, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 118, 'left_edge': 101, 'right_edge': 109}),id=105), Annotation(278,279,SpaceToken,features=Features({'is_space': True}),id=106), Annotation(279,285,Token,features=Features({'_i': 60, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Donald', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16889399016836222064, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ald', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 109, 'left_edge': 107, 'right_edge': 107}),id=107), Annotation(279,291,PERSON,features=Features({'lemma': 'Donald Trump'}),id=326), Annotation(279,291,NounChunk,features=Features({}),id=360), Annotation(285,286,SpaceToken,features=Features({'is_space': True}),id=108), Annotation(286,291,Token,features=Features({'_i': 61, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Trump', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1134333841961332695, 'pos': 'PROPN', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ump', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 105, 'left_edge': 107, 'right_edge': 109}),id=109), Annotation(291,292,Token,features=Features({'_i': 62, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 118, 'left_edge': 110, 'right_edge': 110}),id=110), Annotation(292,293,SpaceToken,features=Features({'is_space': True}),id=111), Annotation(293,295,Token,features=Features({'_i': 63, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10239237003504588839, 'pos': 'PRON', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'it', 'ent_type': '', 'dep': 'nsubj', 'head': 118, 'left_edge': 112, 'right_edge': 112}),id=112), Annotation(293,295,NounChunk,features=Features({}),id=361), Annotation(295,296,SpaceToken,features=Features({'is_space': True}),id=113), Annotation(296,299,Token,features=Features({'_i': 64, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'may', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14378475389916013800, 'pos': 'AUX', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxx', 'suffix': 'may', 'ent_type': '', 'dep': 'aux', 'head': 118, 'left_edge': 114, 'right_edge': 114}),id=114), Annotation(299,300,SpaceToken,features=Features({'is_space': True}),id=115), Annotation(300,303,Token,features=Features({'_i': 65, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'not', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 447765159362469301, 'pos': 'PART', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'not', 'ent_type': '', 'dep': 'neg', 'head': 118, 'left_edge': 116, 'right_edge': 116}),id=116), Annotation(303,304,SpaceToken,features=Features({'is_space': True}),id=117), Annotation(304,306,Token,features=Features({'_i': 66, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ccomp', 'head': 130, 'left_edge': 101, 'right_edge': 127}),id=118), Annotation(306,307,SpaceToken,features=Features({'is_space': True}),id=119), Annotation(307,310,Token,features=Features({'_i': 67, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 124, 'left_edge': 120, 'right_edge': 120}),id=120), Annotation(310,311,SpaceToken,features=Features({'is_space': True}),id=121), Annotation(311,315,Token,features=Features({'_i': 68, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'real', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7050692189315705983, 'pos': 'ADJ', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'eal', 'ent_type': '', 'dep': 'amod', 'head': 124, 'left_edge': 122, 'right_edge': 122}),id=122), Annotation(315,316,SpaceToken,features=Features({'is_space': True}),id=123), Annotation(316,319,Token,features=Features({'_i': 69, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'one', 'like_email': False, 'like_num': True, 'like_url': False, 'orth': 17454115351911680600, 'pos': 'NUM', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xxx', 'suffix': 'one', 'ent_type': '', 'dep': 'attr', 'head': 118, 'left_edge': 120, 'right_edge': 127}),id=124), Annotation(319,320,SpaceToken,features=Features({'is_space': True}),id=125), Annotation(320,322,Token,features=Features({'_i': 70, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': True, 'lang': 'en', 'lemma': ':P', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11282459806373970386, 'pos': 'PUNCT', 'prefix': ':', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ':', 'shape': ':X', 'suffix': ':P', 'ent_type': '', 'dep': 'punct', 'head': 124, 'left_edge': 126, 'right_edge': 126}),id=126), Annotation(322,324,SpaceToken,features=Features({'_i': 71, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 124, 'left_edge': 127, 'right_edge': 127}),id=127), Annotation(324,328,Token,features=Features({'_i': 72, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'let', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8631549979455155821, 'pos': 'NOUN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'Xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'nsubj', 'head': 130, 'left_edge': 128, 'right_edge': 128}),id=128), Annotation(324,328,NounChunk,features=Features({}),id=362), Annotation(328,329,SpaceToken,features=Features({'is_space': True}),id=129), Annotation(329,332,Token,features=Features({'_i': 73, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'say', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8685289367999165211, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxx', 'suffix': 'say', 'ent_type': '', 'dep': 'parataxis', 'head': 153, 'left_edge': 101, 'right_edge': 149}),id=130), Annotation(332,333,SpaceToken,features=Features({'is_space': True}),id=131), Annotation(333,338,Token,features=Features({'_i': 74, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Boris', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3926703778135926261, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ris', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 134, 'left_edge': 132, 'right_edge': 132}),id=132), Annotation(333,346,PERSON,features=Features({'lemma': 'Boris Johnson'}),id=327), Annotation(333,346,NounChunk,features=Features({}),id=363), Annotation(338,339,SpaceToken,features=Features({'is_space': True}),id=133), Annotation(339,346,Token,features=Features({'_i': 75, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Johnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14020170261514038406, 'pos': 'PROPN', 'prefix': 'J', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'son', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 130, 'left_edge': 132, 'right_edge': 148}),id=134), Annotation(346,347,SpaceToken,features=Features({'is_space': True}),id=135), Annotation(347,350,Token,features=Features({'_i': 76, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'aka', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6309978511773728404, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'aka', 'ent_type': '', 'dep': 'advmod', 'head': 140, 'left_edge': 136, 'right_edge': 136}),id=136), Annotation(347,362,NounChunk,features=Features({}),id=364), Annotation(350,351,SpaceToken,features=Features({'is_space': True}),id=137), Annotation(351,355,Token,features=Features({'_i': 77, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Bojo', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1381500864330977487, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ojo', 'ent_type': '', 'dep': 'compound', 'head': 140, 'left_edge': 138, 'right_edge': 138}),id=138), Annotation(355,356,SpaceToken,features=Features({'is_space': True}),id=139), Annotation(356,362,Token,features=Features({'_i': 78, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'tweet', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9158814828460388352, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'appos', 'head': 134, 'left_edge': 136, 'right_edge': 148}),id=140), Annotation(362,363,SpaceToken,features=Features({'is_space': True}),id=141), Annotation(363,367,Token,features=Features({'_i': 79, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 140, 'left_edge': 142, 'right_edge': 148}),id=142), Annotation(367,368,SpaceToken,features=Features({'is_space': True}),id=143), Annotation(368,371,Token,features=Features({'_i': 80, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'his', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2661093235354845946, 'pos': 'PRON', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP$', 'shape': 'xxx', 'suffix': 'his', 'ent_type': '', 'dep': 'poss', 'head': 148, 'left_edge': 144, 'right_edge': 144}),id=144), Annotation(368,392,NounChunk,features=Features({}),id=365), Annotation(371,372,SpaceToken,features=Features({'is_space': True}),id=145), Annotation(372,384,Token,features=Features({'_i': 81, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'BorisJohnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6152028365162425637, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxxxXxxxx', 'suffix': 'son', 'ent_type': 'ORG', 'dep': 'compound', 'head': 148, 'left_edge': 146, 'right_edge': 146}),id=146), Annotation(372,384,ORG,features=Features({'lemma': 'BorisJohnson'}),id=328), Annotation(384,385,SpaceToken,features=Features({'is_space': True}),id=147), Annotation(385,392,Token,features=Features({'_i': 82, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'account', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16051117652154763083, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'unt', 'ent_type': '', 'dep': 'pobj', 'head': 142, 'left_edge': 144, 'right_edge': 148}),id=148), Annotation(392,393,Token,features=Features({'_i': 83, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 130, 'left_edge': 149, 'right_edge': 149}),id=149), Annotation(393,394,SpaceToken,features=Features({'is_space': True}),id=150), Annotation(394,399,Token,features=Features({'_i': 84, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'would', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6992604926141104606, 'pos': 'AUX', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxxx', 'suffix': 'uld', 'ent_type': '', 'dep': 'aux', 'head': 153, 'left_edge': 151, 'right_edge': 151}),id=151), Annotation(399,400,SpaceToken,features=Features({'is_space': True}),id=152), Annotation(400,402,Token,features=Features({'_i': 85, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ROOT', 'head': 153, 'left_edge': 101, 'right_edge': 162}),id=153), Annotation(402,403,SpaceToken,features=Features({'is_space': True}),id=154), Annotation(403,407,Token,features=Features({'_i': 86, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'nice', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14121509715367036122, 'pos': 'ADJ', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ice', 'ent_type': '', 'dep': 'acomp', 'head': 153, 'left_edge': 155, 'right_edge': 155}),id=155), Annotation(407,408,SpaceToken,features=Features({'is_space': True}),id=156), Annotation(408,410,Token,features=Features({'_i': 87, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'PART', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'TO', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 159, 'left_edge': 157, 'right_edge': 157}),id=157), Annotation(410,411,SpaceToken,features=Features({'is_space': True}),id=158), Annotation(411,416,Token,features=Features({'_i': 88, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'match', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16065818573247886523, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xxxx', 'suffix': 'tch', 'ent_type': '', 'dep': 'xcomp', 'head': 153, 'left_edge': 157, 'right_edge': 161}),id=159), Annotation(416,417,SpaceToken,features=Features({'is_space': True}),id=160), Annotation(417,421,Token,features=Features({'_i': 89, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'they', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7976265300568155021, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xxxx', 'suffix': 'hem', 'ent_type': '', 'dep': 'dobj', 'head': 159, 'left_edge': 161, 'right_edge': 161}),id=161), Annotation(417,421,NounChunk,features=Features({}),id=366), Annotation(421,422,Token,features=Features({'_i': 90, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '!', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17494803046312582752, 'pos': 'PUNCT', 'prefix': '!', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '!', 'suffix': '!', 'ent_type': '', 'dep': 'punct', 'head': 153, 'left_edge': 162, 'right_edge': 162}),id=162), Annotation(422,424,SpaceToken,features=Features({'_i': 91, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 163, 'right_edge': 163}),id=163), Annotation(422,829,Sentence,features=Features({}),id=340), Annotation(424,428,Token,features=Features({'_i': 92, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 164, 'right_edge': 164}),id=164), Annotation(428,429,SpaceToken,features=Features({'is_space': True}),id=165), Annotation(429,431,Token,features=Features({'_i': 93, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 168, 'left_edge': 166, 'right_edge': 166}),id=166), Annotation(429,431,NounChunk,features=Features({}),id=367), Annotation(431,432,SpaceToken,features=Features({'is_space': True}),id=167), Annotation(432,439,Token,features=Features({'_i': 94, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'include', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14049642289933595219, 'pos': 'VERB', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'ude', 'ent_type': '', 'dep': 'ccomp', 'head': 265, 'left_edge': 163, 'right_edge': 257}),id=168), Annotation(439,440,SpaceToken,features=Features({'is_space': True}),id=169), Annotation(440,441,Token,features=Features({'_i': 95, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 172, 'left_edge': 170, 'right_edge': 170}),id=170), Annotation(441,442,SpaceToken,features=Features({'is_space': True}),id=171), Annotation(442,445,Token,features=Features({'_i': 96, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'url', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2582013287274679728, 'pos': 'NOUN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'XXX', 'suffix': 'URL', 'ent_type': '', 'dep': 'nmod', 'head': 176, 'left_edge': 170, 'right_edge': 174}),id=172), Annotation(445,446,SpaceToken,features=Features({'is_space': True}),id=173), Annotation(446,487,Token,features=Features({'_i': 97, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'like_email': False, 'like_num': False, 'like_url': True, 'orth': 616995933894448768, 'pos': 'X', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'FW', 'shape': 'xxxx://xxxx.xxxx.xx/xxxx-xxxx/', 'suffix': 'lp/', 'ent_type': '', 'dep': 'nummod', 'head': 172, 'left_edge': 174, 'right_edge': 174}),id=174), Annotation(487,488,SpaceToken,features=Features({'is_space': True}),id=175), Annotation(488,489,SpaceToken,features=Features({'_i': 98, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 170, 'right_edge': 187}),id=176), Annotation(489,492,Token,features=Features({'_i': 99, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 176, 'left_edge': 177, 'right_edge': 177}),id=177), Annotation(492,493,SpaceToken,features=Features({'is_space': True}),id=178), Annotation(493,494,Token,features=Features({'_i': 100, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 185, 'left_edge': 179, 'right_edge': 179}),id=179), Annotation(494,495,SpaceToken,features=Features({'is_space': True}),id=180), Annotation(495,499,Token,features=Features({'_i': 101, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'fake', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 513889413363281732, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ake', 'ent_type': '', 'dep': 'amod', 'head': 185, 'left_edge': 181, 'right_edge': 181}),id=181), Annotation(499,500,SpaceToken,features=Features({'is_space': True}),id=182), Annotation(500,505,Token,features=Features({'_i': 102, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'email', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7320900731437023467, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ail', 'ent_type': '', 'dep': 'compound', 'head': 185, 'left_edge': 183, 'right_edge': 183}),id=183), Annotation(505,506,SpaceToken,features=Features({'is_space': True}),id=184), Annotation(506,513,Token,features=Features({'_i': 103, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'address', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14349763737243715186, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ess', 'ent_type': '', 'dep': 'conj', 'head': 176, 'left_edge': 179, 'right_edge': 187}),id=185), Annotation(513,514,SpaceToken,features=Features({'is_space': True}),id=186), Annotation(514,540,Token,features=Features({'_i': 104, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'john.doe@hiscoolserver.com', 'like_email': True, 'like_num': False, 'like_url': False, 'orth': 8439314673651519470, 'pos': 'PROPN', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxxx.xxx@xxxx.xxx', 'suffix': 'com', 'ent_type': '', 'dep': 'appos', 'head': 185, 'left_edge': 187, 'right_edge': 187}),id=187), Annotation(514,540,NounChunk,features=Features({}),id=368), Annotation(540,541,SpaceToken,features=Features({'is_space': True}),id=188), Annotation(541,543,Token,features=Features({'_i': 105, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'advmod', 'head': 191, 'left_edge': 189, 'right_edge': 189}),id=189), Annotation(543,544,SpaceToken,features=Features({'is_space': True}),id=190), Annotation(544,548,Token,features=Features({'_i': 106, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'well', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4525988469032889948, 'pos': 'ADV', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ell', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 189, 'right_edge': 191}),id=191), Annotation(548,549,SpaceToken,features=Features({'is_space': True}),id=192), Annotation(549,550,SpaceToken,features=Features({'_i': 107, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 193, 'right_edge': 193}),id=193), Annotation(550,552,Token,features=Features({'_i': 108, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADP', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'prep', 'head': 168, 'left_edge': 194, 'right_edge': 203}),id=194), Annotation(552,553,SpaceToken,features=Features({'is_space': True}),id=195), Annotation(553,554,Token,features=Features({'_i': 109, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 199, 'left_edge': 196, 'right_edge': 196}),id=196), Annotation(553,560,MONEY,features=Features({'lemma': '#some #'}),id=329), Annotation(553,573,NounChunk,features=Features({}),id=369), Annotation(554,558,Token,features=Features({'_i': 110, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'some', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7000492816108906599, 'pos': 'DET', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxxx', 'suffix': 'ome', 'ent_type': 'MONEY', 'dep': 'det', 'head': 199, 'left_edge': 197, 'right_edge': 197}),id=197), Annotation(558,559,SpaceToken,features=Features({'is_space': True}),id=198), Annotation(559,560,Token,features=Features({'_i': 111, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'ADV', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RBR', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 203, 'left_edge': 196, 'right_edge': 200}),id=199), Annotation(560,564,Token,features=Features({'_i': 112, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cool', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16540862642162215401, 'pos': 'ADJ', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ool', 'ent_type': '', 'dep': 'amod', 'head': 199, 'left_edge': 200, 'right_edge': 200}),id=200), Annotation(564,565,SpaceToken,features=Features({'is_space': True}),id=201), Annotation(565,566,Token,features=Features({'_i': 113, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': '', 'dep': 'nmod', 'head': 203, 'left_edge': 202, 'right_edge': 202}),id=202), Annotation(566,573,Token,features=Features({'_i': 114, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'hastag', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12335920655888895378, 'pos': 'NOUN', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ags', 'ent_type': '', 'dep': 'pobj', 'head': 194, 'left_edge': 196, 'right_edge': 203}),id=203), Annotation(573,574,SpaceToken,features=Features({'is_space': True}),id=204), Annotation(574,577,Token,features=Features({'_i': 115, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 168, 'left_edge': 205, 'right_edge': 205}),id=205), Annotation(577,578,SpaceToken,features=Features({'is_space': True}),id=206), Annotation(578,579,Token,features=Features({'_i': 116, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 209, 'left_edge': 207, 'right_edge': 207}),id=207), Annotation(579,580,SpaceToken,features=Features({'is_space': True}),id=208), Annotation(580,585,Token,features=Features({'_i': 117, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'bunch', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16097686030507304001, 'pos': 'NOUN', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nch', 'ent_type': '', 'dep': 'conj', 'head': 168, 'left_edge': 207, 'right_edge': 225}),id=209), Annotation(585,586,SpaceToken,features=Features({'is_space': True}),id=210), Annotation(586,588,Token,features=Features({'_i': 118, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'of', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 886050111519832510, 'pos': 'ADP', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'of', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 211, 'right_edge': 213}),id=211), Annotation(588,589,SpaceToken,features=Features({'is_space': True}),id=212), Annotation(589,595,Token,features=Features({'_i': 119, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'emoji', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15308488336638733982, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'jis', 'ent_type': '', 'dep': 'pobj', 'head': 211, 'left_edge': 213, 'right_edge': 213}),id=213), Annotation(589,595,NounChunk,features=Features({}),id=370), Annotation(595,596,SpaceToken,features=Features({'is_space': True}),id=214), Annotation(596,600,Token,features=Features({'_i': 120, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 215, 'right_edge': 224}),id=215), Annotation(600,601,SpaceToken,features=Features({'is_space': True}),id=216), Annotation(601,602,Token,features=Features({'_i': 121, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '😽', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13410134130775049117, 'pos': 'PROPN', 'prefix': '😽', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '😽', 'suffix': '😽', 'ent_type': '', 'dep': 'pobj', 'head': 215, 'left_edge': 217, 'right_edge': 224}),id=217), Annotation(601,602,NounChunk,features=Features({}),id=371), Annotation(602,603,SpaceToken,features=Features({'is_space': True}),id=218), Annotation(603,604,Token,features=Features({'_i': 122, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 217, 'left_edge': 219, 'right_edge': 219}),id=219), Annotation(604,605,Token,features=Features({'_i': 123, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 224, 'left_edge': 220, 'right_edge': 220}),id=220), Annotation(604,617,NounChunk,features=Features({}),id=372), Annotation(605,606,SpaceToken,features=Features({'is_space': True}),id=221), Annotation(606,613,Token,features=Features({'_i': 124, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'kissing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17741139538205582276, 'pos': 'NOUN', 'prefix': 'k', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'amod', 'head': 224, 'left_edge': 222, 'right_edge': 222}),id=222), Annotation(613,614,SpaceToken,features=Features({'is_space': True}),id=223), Annotation(614,617,Token,features=Features({'_i': 125, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cat', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5439657043933447811, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'cat', 'ent_type': '', 'dep': 'appos', 'head': 217, 'left_edge': 220, 'right_edge': 224}),id=224), Annotation(617,618,Token,features=Features({'_i': 126, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 209, 'left_edge': 225, 'right_edge': 225}),id=225), Annotation(618,619,Token,features=Features({'_i': 127, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 168, 'left_edge': 226, 'right_edge': 226}),id=226), Annotation(619,620,SpaceToken,features=Features({'_i': 128, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 227, 'right_edge': 227}),id=227), Annotation(620,621,Token,features=Features({'_i': 129, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '👩', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13533745690575315714, 'pos': 'X', 'prefix': '👩', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '👩', 'suffix': '👩', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 228, 'right_edge': 228}),id=228), Annotation(621,622,Token,features=Features({'_i': 130, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\u200d', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17442142028047717517, 'pos': 'X', 'prefix': '\\u200d', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '\\u200d', 'suffix': '\\u200d', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 229, 'right_edge': 246}),id=229), Annotation(622,623,Token,features=Features({'_i': 131, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🏫', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7253895586487469997, 'pos': 'NOUN', 'prefix': '🏫', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': '🏫', 'suffix': '🏫', 'ent_type': '', 'dep': 'dobj', 'head': 229, 'left_edge': 230, 'right_edge': 246}),id=230), Annotation(622,623,NounChunk,features=Features({}),id=373), Annotation(623,624,SpaceToken,features=Features({'is_space': True}),id=231), Annotation(624,625,Token,features=Features({'_i': 132, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 232, 'right_edge': 232}),id=232), Annotation(625,626,Token,features=Features({'_i': 133, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 237, 'left_edge': 233, 'right_edge': 233}),id=233), Annotation(625,640,NounChunk,features=Features({}),id=374), Annotation(626,627,SpaceToken,features=Features({'is_space': True}),id=234), Annotation(627,632,Token,features=Features({'_i': 134, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'woman', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18308318389743014884, 'pos': 'NOUN', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'man', 'ent_type': '', 'dep': 'compound', 'head': 237, 'left_edge': 235, 'right_edge': 235}),id=235), Annotation(632,633,SpaceToken,features=Features({'is_space': True}),id=236), Annotation(633,640,Token,features=Features({'_i': 135, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'teacher', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 78820239457828971, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'her', 'ent_type': '', 'dep': 'appos', 'head': 230, 'left_edge': 233, 'right_edge': 237}),id=237), Annotation(640,641,Token,features=Features({'_i': 136, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 238, 'right_edge': 238}),id=238), Annotation(641,642,Token,features=Features({'_i': 137, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 239, 'right_edge': 239}),id=239), Annotation(642,643,SpaceToken,features=Features({'is_space': True}),id=240), Annotation(643,644,Token,features=Features({'_i': 138, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧬', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9031423621242036097, 'pos': 'PROPN', 'prefix': '🧬', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧬', 'suffix': '🧬', 'ent_type': 'PERSON', 'dep': 'conj', 'head': 230, 'left_edge': 241, 'right_edge': 245}),id=241), Annotation(643,644,PERSON,features=Features({'lemma': '🧬'}),id=330), Annotation(643,644,NounChunk,features=Features({}),id=375), Annotation(644,645,SpaceToken,features=Features({'is_space': True}),id=242), Annotation(645,646,Token,features=Features({'_i': 139, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 243, 'right_edge': 243}),id=243), Annotation(646,649,Token,features=Features({'_i': 140, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'DNA', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14386531691665947480, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XXX', 'suffix': 'DNA', 'ent_type': '', 'dep': 'appos', 'head': 241, 'left_edge': 244, 'right_edge': 244}),id=244), Annotation(646,649,NounChunk,features=Features({}),id=376), Annotation(649,650,Token,features=Features({'_i': 141, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 245, 'right_edge': 245}),id=245), Annotation(650,651,Token,features=Features({'_i': 142, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 246, 'right_edge': 246}),id=246), Annotation(651,652,SpaceToken,features=Features({'is_space': True}),id=247), Annotation(652,653,SpaceToken,features=Features({'_i': 143, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 256, 'left_edge': 248, 'right_edge': 248}),id=248), Annotation(653,654,Token,features=Features({'_i': 144, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧗', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7724749230788735883, 'pos': 'PROPN', 'prefix': '🧗', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧗', 'suffix': '🧗', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 249, 'right_edge': 249}),id=249), Annotation(653,654,NounChunk,features=Features({}),id=377), Annotation(654,655,SpaceToken,features=Features({'is_space': True}),id=250), Annotation(655,656,Token,features=Features({'_i': 145, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 251, 'right_edge': 251}),id=251), Annotation(656,657,Token,features=Features({'_i': 146, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 254, 'left_edge': 252, 'right_edge': 252}),id=252), Annotation(656,664,NounChunk,features=Features({}),id=378), Annotation(657,658,SpaceToken,features=Features({'is_space': True}),id=253), Annotation(658,664,Token,features=Features({'_i': 147, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14800503047316267216, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'son', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 252, 'right_edge': 254}),id=254), Annotation(664,665,SpaceToken,features=Features({'is_space': True}),id=255), Annotation(665,673,Token,features=Features({'_i': 148, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'climb', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 992864226358664410, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBG', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'ccomp', 'head': 168, 'left_edge': 248, 'right_edge': 257}),id=256), Annotation(673,674,Token,features=Features({'_i': 149, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 257, 'right_edge': 257}),id=257), Annotation(674,675,Token,features=Features({'_i': 150, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 258, 'right_edge': 258}),id=258), Annotation(675,676,SpaceToken,features=Features({'is_space': True}),id=259), Annotation(676,678,SpaceToken,features=Features({'_i': 151, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 265, 'left_edge': 260, 'right_edge': 260}),id=260), Annotation(678,682,Token,features=Features({'_i': 152, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 265, 'left_edge': 261, 'right_edge': 261}),id=261), Annotation(682,683,SpaceToken,features=Features({'is_space': True}),id=262), Annotation(683,685,Token,features=Features({'_i': 153, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 265, 'left_edge': 263, 'right_edge': 263}),id=263), Annotation(683,685,NounChunk,features=Features({}),id=379), Annotation(685,686,SpaceToken,features=Features({'is_space': True}),id=264), Annotation(686,690,Token,features=Features({'_i': 154, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'VERB', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'ROOT', 'head': 265, 'left_edge': 163, 'right_edge': 316}),id=265), Annotation(690,691,SpaceToken,features=Features({'is_space': True}),id=266), Annotation(691,692,Token,features=Features({'_i': 155, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 269, 'left_edge': 267, 'right_edge': 267}),id=267), Annotation(691,714,NounChunk,features=Features({}),id=380), Annotation(692,693,SpaceToken,features=Features({'is_space': True}),id=268), Annotation(693,696,Token,features=Features({'_i': 156, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 273, 'left_edge': 267, 'right_edge': 269}),id=269), Annotation(696,697,SpaceToken,features=Features({'is_space': True}),id=270), Annotation(697,706,Token,features=Features({'_i': 157, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'different', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17379251038783395147, 'pos': 'ADJ', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'amod', 'head': 273, 'left_edge': 271, 'right_edge': 271}),id=271), Annotation(706,707,SpaceToken,features=Features({'is_space': True}),id=272), Annotation(707,714,Token,features=Features({'_i': 158, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'script', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5755375431925009742, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'pts', 'ent_type': '', 'dep': 'dobj', 'head': 265, 'left_edge': 267, 'right_edge': 315}),id=273), Annotation(714,715,Token,features=Features({'_i': 159, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 273, 'left_edge': 274, 'right_edge': 274}),id=274), Annotation(715,716,SpaceToken,features=Features({'is_space': True}),id=275), Annotation(716,720,Token,features=Features({'_i': 160, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'e.g.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2360781233418247603, 'pos': 'ADV', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'x.x.', 'suffix': '.g.', 'ent_type': '', 'dep': 'advmod', 'head': 280, 'left_edge': 276, 'right_edge': 276}),id=276), Annotation(716,730,NounChunk,features=Features({}),id=381), Annotation(720,721,SpaceToken,features=Features({'is_space': True}),id=277), Annotation(721,727,Token,features=Features({'_i': 161, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hangul', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13901273419332601270, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gul', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 280, 'left_edge': 278, 'right_edge': 278}),id=278), Annotation(721,727,PERSON,features=Features({'lemma': 'Hangul'}),id=331), Annotation(727,728,SpaceToken,features=Features({'is_space': True}),id=279), Annotation(728,730,Token,features=Features({'_i': 162, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '한글', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14226613016943586999, 'pos': 'PROPN', 'prefix': '한', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '한글', 'ent_type': '', 'dep': 'appos', 'head': 273, 'left_edge': 276, 'right_edge': 282}),id=280), Annotation(730,731,SpaceToken,features=Features({'is_space': True}),id=281), Annotation(731,733,Token,features=Features({'_i': 163, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 280, 'left_edge': 282, 'right_edge': 282}),id=282), Annotation(733,734,SpaceToken,features=Features({'is_space': True}),id=283), Annotation(734,735,SpaceToken,features=Features({'_i': 164, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 273, 'left_edge': 284, 'right_edge': 284}),id=284), Annotation(735,745,Token,features=Features({'_i': 165, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'simplify', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15400283218552125021, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'ied', 'ent_type': '', 'dep': 'amod', 'head': 289, 'left_edge': 285, 'right_edge': 285}),id=285), Annotation(735,754,NounChunk,features=Features({}),id=382), Annotation(745,746,SpaceToken,features=Features({'is_space': True}),id=286), Annotation(746,751,Token,features=Features({'_i': 166, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hanzi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7162399416189955260, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nzi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 289, 'left_edge': 287, 'right_edge': 287}),id=287), Annotation(746,751,PERSON,features=Features({'lemma': 'Hanzi'}),id=332), Annotation(751,752,SpaceToken,features=Features({'is_space': True}),id=288), Annotation(752,754,Token,features=Features({'_i': 167, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '汉字', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2455570606503990670, 'pos': 'PROPN', 'prefix': '汉', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '汉字', 'ent_type': '', 'dep': 'conj', 'head': 273, 'left_edge': 285, 'right_edge': 315}),id=289), Annotation(754,755,SpaceToken,features=Features({'is_space': True}),id=290), Annotation(755,757,Token,features=Features({'_i': 168, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 289, 'left_edge': 291, 'right_edge': 291}),id=291), Annotation(757,758,SpaceToken,features=Features({'is_space': True}),id=292), Annotation(758,763,Token,features=Features({'_i': 169, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Farsi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1631855311088449743, 'pos': 'PROPN', 'prefix': 'F', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'rsi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 295, 'left_edge': 293, 'right_edge': 293}),id=293), Annotation(758,763,PERSON,features=Features({'lemma': 'Farsi'}),id=333), Annotation(758,769,NounChunk,features=Features({}),id=383), Annotation(763,764,SpaceToken,features=Features({'is_space': True}),id=294), Annotation(764,769,Token,features=Features({'_i': 170, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'فارسی', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10192536850399601059, 'pos': 'NOUN', 'prefix': 'ف', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'رسی', 'ent_type': '', 'dep': 'conj', 'head': 289, 'left_edge': 293, 'right_edge': 299}),id=295), Annotation(769,770,SpaceToken,features=Features({'is_space': True}),id=296), Annotation(770,773,Token,features=Features({'_i': 171, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 295, 'left_edge': 297, 'right_edge': 297}),id=297), Annotation(773,774,SpaceToken,features=Features({'is_space': True}),id=298), Annotation(774,780,Token,features=Features({'_i': 172, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Arabic', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6558098436421695236, 'pos': 'PROPN', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'bic', 'ent_type': 'LANGUAGE', 'dep': 'conj', 'head': 295, 'left_edge': 299, 'right_edge': 299}),id=299), Annotation(774,780,LANGUAGE,features=Features({'lemma': 'Arabic'}),id=334), Annotation(774,780,NounChunk,features=Features({}),id=384), Annotation(780,781,SpaceToken,features=Features({'is_space': True}),id=300), Annotation(781,782,Token,features=Features({'_i': 173, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 301, 'right_edge': 301}),id=301), Annotation(782,797,Token,features=Features({'_i': 174, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'اَلْعَرَبِيَّةُ', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16849633969024660305, 'pos': 'NUM', 'prefix': 'ا', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xَxْxَxَxِxَّxُ', 'suffix': 'ّةُ', 'ent_type': 'GPE', 'dep': 'appos', 'head': 289, 'left_edge': 302, 'right_edge': 302}),id=302), Annotation(782,797,GPE,features=Features({'lemma': 'اَلْعَرَبِيَّةُ'}),id=335), Annotation(797,798,Token,features=Features({'_i': 175, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 303, 'right_edge': 303}),id=303), Annotation(798,799,SpaceToken,features=Features({'is_space': True}),id=304), Annotation(799,804,Token,features=Features({'_i': 176, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'which', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7063653163634019529, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'ich', 'ent_type': '', 'dep': 'nsubj', 'head': 307, 'left_edge': 305, 'right_edge': 305}),id=305), Annotation(799,804,NounChunk,features=Features({}),id=385), Annotation(804,805,SpaceToken,features=Features({'is_space': True}),id=306), Annotation(805,809,Token,features=Features({'_i': 177, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'go', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18224404223277189725, 'pos': 'VERB', 'prefix': 'g', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'oes', 'ent_type': '', 'dep': 'relcl', 'head': 289, 'left_edge': 305, 'right_edge': 315}),id=307), Annotation(809,810,SpaceToken,features=Features({'is_space': True}),id=308), Annotation(810,814,Token,features=Features({'_i': 178, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 307, 'left_edge': 309, 'right_edge': 315}),id=309), Annotation(814,815,SpaceToken,features=Features({'is_space': True}),id=310), Annotation(815,820,Token,features=Features({'_i': 179, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'right', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5943797630011647483, 'pos': 'NOUN', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ght', 'ent_type': '', 'dep': 'advmod', 'head': 313, 'left_edge': 311, 'right_edge': 311}),id=311), Annotation(820,821,SpaceToken,features=Features({'is_space': True}),id=312), Annotation(821,823,Token,features=Features({'_i': 180, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'AUX', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 315, 'left_edge': 311, 'right_edge': 313}),id=313), Annotation(823,824,SpaceToken,features=Features({'is_space': True}),id=314), Annotation(824,828,Token,features=Features({'_i': 181, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'leave', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4003250512168697582, 'pos': 'VERB', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'eft', 'ent_type': '', 'dep': 'pcomp', 'head': 309, 'left_edge': 311, 'right_edge': 315}),id=315), Annotation(828,829,Token,features=Features({'_i': 182, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 316, 'right_edge': 316}),id=316), Annotation(829,831,SpaceToken,features=Features({'_i': 183, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 317, 'left_edge': 317, 'right_edge': 317}),id=317), Annotation(829,831,Sentence,features=Features({}),id=341)]), 'Stanza': AnnotationSet([Annotation(0,4,Token,features=Features({'text': 'This', 'lemma': 'this', 'upos': 'PRON', 'xpos': 'DT', 'Number': 'Sing', 'PronType': 'Dem', 'head': 4, 'deprel': 'nsubj', 'ner': 'O'}),id=0), Annotation(0,24,Sentence,features=Features({}),id=6), Annotation(5,7,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 4, 'deprel': 'cop', 'ner': 'O'}),id=1), Annotation(8,9,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 4, 'deprel': 'det', 'ner': 'O'}),id=2), Annotation(10,14,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 4, 'deprel': 'compound', 'ner': 'O'}),id=3), Annotation(15,23,Token,features=Features({'text': 'document', 'lemma': 'document', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 6, 'deprel': 'root', 'ner': 'O'}),id=4), Annotation(23,24,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 4, 'deprel': 'punct', 'ner': 'O'}),id=5), Annotation(26,28,Token,features=Features({'text': 'It', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 8, 'deprel': 'nsubj', 'ner': 'O'}),id=7), Annotation(26,59,Sentence,features=Features({}),id=14), Annotation(29,37,Token,features=Features({'text': 'contains', 'lemma': 'contain', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 14, 'deprel': 'root', 'ner': 'O'}),id=8), Annotation(38,42,Token,features=Features({'text': 'just', 'lemma': 'just', 'upos': 'ADV', 'xpos': 'RB', 'head': 12, 'deprel': 'advmod', 'ner': 'O'}),id=9), Annotation(43,44,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 12, 'deprel': 'det', 'ner': 'O'}),id=10), Annotation(45,48,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 12, 'deprel': 'amod', 'ner': 'O'}),id=11), Annotation(49,58,Token,features=Features({'text': 'sentences', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 8, 'deprel': 'obj', 'ner': 'O'}),id=12), Annotation(58,59,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 8, 'deprel': 'punct', 'ner': 'O'}),id=13), Annotation(61,65,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 54, 'deprel': 'root', 'ner': 'O'}),id=15), Annotation(61,266,Sentence,features=Features({}),id=54), Annotation(66,68,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 15, 'deprel': 'cop', 'ner': 'O'}),id=16), Annotation(69,70,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 18, 'deprel': 'det', 'ner': 'O'}),id=17), Annotation(71,79,Token,features=Features({'text': 'sentence', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 15, 'deprel': 'nsubj', 'ner': 'O'}),id=18), Annotation(80,84,Token,features=Features({'text': 'that', 'lemma': 'that', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 20, 'deprel': 'nsubj', 'ner': 'O'}),id=19), Annotation(85,93,Token,features=Features({'text': 'mentions', 'lemma': 'mention', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 18, 'deprel': 'acl:relcl', 'ner': 'O'}),id=20), Annotation(94,95,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 24, 'deprel': 'det', 'ner': 'O'}),id=21), Annotation(96,99,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=22), Annotation(100,105,Token,features=Features({'text': 'named', 'lemma': 'name', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=23), Annotation(106,114,Token,features=Features({'text': 'entities', 'lemma': 'entity', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 20, 'deprel': 'obj', 'ner': 'O'}),id=24), Annotation(115,119,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 27, 'deprel': 'case', 'ner': 'O'}),id=25), Annotation(121,124,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 27, 'deprel': 'det', 'ner': 'O'}),id=26), Annotation(125,132,Token,features=Features({'text': 'persons', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 24, 'deprel': 'nmod', 'ner': 'O'}),id=27), Annotation(133,139,Token,features=Features({'text': 'Barack', 'lemma': 'Barack', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 27, 'deprel': 'appos', 'ner': 'B-PERSON'}),id=28), Annotation(133,145,PERSON,features=Features({}),id=175), Annotation(140,145,Token,features=Features({'text': 'Obama', 'lemma': 'Obama', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=29), Annotation(146,148,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 31, 'deprel': 'cc', 'ner': 'O'}),id=30), Annotation(149,155,Token,features=Features({'text': 'Ursula', 'lemma': 'Ursula', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'conj', 'ner': 'B-PERSON'}),id=31), Annotation(149,169,PERSON,features=Features({}),id=176), Annotation(156,159,Token,features=Features({'text': 'von', 'lemma': 'von', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=32), Annotation(160,163,Token,features=Features({'text': 'der', 'lemma': 'der', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=33), Annotation(164,169,Token,features=Features({'text': 'Leyen', 'lemma': 'Leyen', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=34), Annotation(169,170,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 36, 'deprel': 'punct', 'ner': 'O'}),id=35), Annotation(171,180,Token,features=Features({'text': 'locations', 'lemma': 'location', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 27, 'deprel': 'conj', 'ner': 'O'}),id=36), Annotation(181,185,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 40, 'deprel': 'case', 'ner': 'O'}),id=37), Annotation(186,189,Token,features=Features({'text': 'New', 'lemma': 'New', 'upos': 'ADJ', 'xpos': 'NNP', 'Degree': 'Pos', 'head': 40, 'deprel': 'amod', 'ner': 'B-GPE'}),id=38), Annotation(186,199,GPE,features=Features({}),id=177), Annotation(190,194,Token,features=Features({'text': 'York', 'lemma': 'York', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'compound', 'ner': 'I-GPE'}),id=39), Annotation(195,199,Token,features=Features({'text': 'City', 'lemma': 'City', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 36, 'deprel': 'nmod', 'ner': 'E-GPE'}),id=40), Annotation(199,200,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 42, 'deprel': 'punct', 'ner': 'O'}),id=41), Annotation(201,207,Token,features=Features({'text': 'Vienna', 'lemma': 'Vienna', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=42), Annotation(201,207,GPE,features=Features({}),id=178), Annotation(208,210,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 44, 'deprel': 'cc', 'ner': 'O'}),id=43), Annotation(211,218,Token,features=Features({'text': 'Beijing', 'lemma': 'Beijing', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=44), Annotation(211,218,GPE,features=Features({}),id=179), Annotation(219,221,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 46, 'deprel': 'cc', 'ner': 'O'}),id=45), Annotation(222,231,Token,features=Features({'text': 'companies', 'lemma': 'company', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 40, 'deprel': 'conj', 'ner': 'O'}),id=46), Annotation(232,236,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 48, 'deprel': 'case', 'ner': 'O'}),id=47), Annotation(238,244,Token,features=Features({'text': 'Google', 'lemma': 'Google', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 46, 'deprel': 'nmod', 'ner': 'S-ORG'}),id=48), Annotation(238,244,ORG,features=Features({}),id=180), Annotation(244,245,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 50, 'deprel': 'punct', 'ner': 'O'}),id=49), Annotation(246,255,Token,features=Features({'text': 'UniCredit', 'lemma': 'UniCredit', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=50), Annotation(246,255,ORG,features=Features({}),id=181), Annotation(256,258,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 52, 'deprel': 'cc', 'ner': 'O'}),id=51), Annotation(259,265,Token,features=Features({'text': 'Huawei', 'lemma': 'Huawei', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=52), Annotation(259,265,ORG,features=Features({}),id=182), Annotation(265,266,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 15, 'deprel': 'punct', 'ner': 'O'}),id=53), Annotation(267,270,Token,features=Features({'text': 'And', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 67, 'deprel': 'cc', 'ner': 'O'}),id=55), Annotation(267,322,Sentence,features=Features({}),id=69), Annotation(271,275,Token,features=Features({'text': 'here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 69, 'deprel': 'root', 'ner': 'O'}),id=56), Annotation(276,278,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 56, 'deprel': 'cop', 'ner': 'O'}),id=57), Annotation(279,285,Token,features=Features({'text': 'Donald', 'lemma': 'Donald', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 56, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=58), Annotation(279,291,PERSON,features=Features({}),id=183), Annotation(286,291,Token,features=Features({'text': 'Trump', 'lemma': 'Trump', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 58, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=59), Annotation(291,292,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 56, 'deprel': 'punct', 'ner': 'O'}),id=60), Annotation(293,295,Token,features=Features({'text': 'it', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 67, 'deprel': 'nsubj', 'ner': 'O'}),id=61), Annotation(296,299,Token,features=Features({'text': 'may', 'lemma': 'may', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 67, 'deprel': 'aux', 'ner': 'O'}),id=62), Annotation(300,303,Token,features=Features({'text': 'not', 'lemma': 'not', 'upos': 'PART', 'xpos': 'RB', 'head': 67, 'deprel': 'advmod', 'ner': 'O'}),id=63), Annotation(304,306,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 67, 'deprel': 'cop', 'ner': 'O'}),id=64), Annotation(307,310,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 67, 'deprel': 'det', 'ner': 'O'}),id=65), Annotation(311,315,Token,features=Features({'text': 'real', 'lemma': 'real', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 67, 'deprel': 'amod', 'ner': 'O'}),id=66), Annotation(316,319,Token,features=Features({'text': 'one', 'lemma': 'one', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 56, 'deprel': 'parataxis', 'ner': 'O'}),id=67), Annotation(320,322,Token,features=Features({'text': ':P', 'lemma': ':p', 'upos': 'SYM', 'xpos': 'NFP', 'head': 56, 'deprel': 'discourse', 'ner': 'O'}),id=68), Annotation(324,328,Token,features=Features({'text': 'Lets', 'lemma': 'let', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 89, 'deprel': 'root', 'ner': 'O'}),id=70), Annotation(324,422,Sentence,features=Features({}),id=89), Annotation(329,332,Token,features=Features({'text': 'say', 'lemma': 'say', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 70, 'deprel': 'xcomp', 'ner': 'O'}),id=71), Annotation(333,338,Token,features=Features({'text': 'Boris', 'lemma': 'Boris', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 84, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=72), Annotation(333,346,PERSON,features=Features({}),id=184), Annotation(339,346,Token,features=Features({'text': 'Johnson', 'lemma': 'Johnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 72, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=73), Annotation(347,350,Token,features=Features({'text': 'aka', 'lemma': 'aka', 'upos': 'ADP', 'xpos': 'IN', 'head': 76, 'deprel': 'case', 'ner': 'O'}),id=74), Annotation(351,355,Token,features=Features({'text': 'Bojo', 'lemma': 'Bojo', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 76, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=75), Annotation(351,355,PERSON,features=Features({}),id=185), Annotation(356,362,Token,features=Features({'text': 'tweets', 'lemma': 'tweet', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 72, 'deprel': 'nmod', 'ner': 'O'}),id=76), Annotation(363,367,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 80, 'deprel': 'case', 'ner': 'O'}),id=77), Annotation(368,371,Token,features=Features({'text': 'his', 'lemma': 'he', 'upos': 'PRON', 'xpos': 'PRP$', 'Gender': 'Masc', 'Number': 'Sing', 'Person': '3', 'Poss': 'Yes', 'PronType': 'Prs', 'head': 80, 'deprel': 'nmod:poss', 'ner': 'O'}),id=78), Annotation(372,384,Token,features=Features({'text': 'BorisJohnson', 'lemma': 'BorisJohnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 80, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=79), Annotation(372,384,PERSON,features=Features({}),id=186), Annotation(385,392,Token,features=Features({'text': 'account', 'lemma': 'account', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 76, 'deprel': 'nmod', 'ner': 'O'}),id=80), Annotation(392,393,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 84, 'deprel': 'punct', 'ner': 'O'}),id=81), Annotation(394,399,Token,features=Features({'text': 'would', 'lemma': 'would', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 84, 'deprel': 'aux', 'ner': 'O'}),id=82), Annotation(400,402,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'cop', 'ner': 'O'}),id=83), Annotation(403,407,Token,features=Features({'text': 'nice', 'lemma': 'nice', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 70, 'deprel': 'parataxis', 'ner': 'O'}),id=84), Annotation(408,410,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'PART', 'xpos': 'TO', 'head': 86, 'deprel': 'mark', 'ner': 'O'}),id=85), Annotation(411,416,Token,features=Features({'text': 'match', 'lemma': 'match', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'advcl', 'ner': 'O'}),id=86), Annotation(417,421,Token,features=Features({'text': 'them', 'lemma': 'they', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Acc', 'Number': 'Plur', 'Person': '3', 'PronType': 'Prs', 'head': 86, 'deprel': 'obj', 'ner': 'O'}),id=87), Annotation(421,422,Token,features=Features({'text': '!', 'lemma': '!', 'upos': 'PUNCT', 'xpos': '.', 'head': 70, 'deprel': 'punct', 'ner': 'O'}),id=88), Annotation(424,428,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 92, 'deprel': 'advmod', 'ner': 'O'}),id=90), Annotation(424,675,Sentence,features=Features({}),id=143), Annotation(429,431,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 92, 'deprel': 'nsubj', 'ner': 'O'}),id=91), Annotation(432,439,Token,features=Features({'text': 'include', 'lemma': 'include', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 143, 'deprel': 'root', 'ner': 'O'}),id=92), Annotation(440,441,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 94, 'deprel': 'det', 'ner': 'O'}),id=93), Annotation(442,445,Token,features=Features({'text': 'URL', 'lemma': 'URL', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 92, 'deprel': 'obj', 'ner': 'O'}),id=94), Annotation(446,487,Token,features=Features({'text': 'https://gatenlp.github.io/python-gatenlp/', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'upos': 'X', 'xpos': 'ADD', 'head': 94, 'deprel': 'appos', 'ner': 'O'}),id=95), Annotation(489,492,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 100, 'deprel': 'cc', 'ner': 'O'}),id=96), Annotation(493,494,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 100, 'deprel': 'det', 'ner': 'O'}),id=97), Annotation(495,499,Token,features=Features({'text': 'fake', 'lemma': 'fake', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 100, 'deprel': 'amod', 'ner': 'O'}),id=98), Annotation(500,505,Token,features=Features({'text': 'email', 'lemma': 'email', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 100, 'deprel': 'compound', 'ner': 'O'}),id=99), Annotation(506,513,Token,features=Features({'text': 'address', 'lemma': 'address', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=100), Annotation(514,540,Token,features=Features({'text': 'john.doe@hiscoolserver.com', 'lemma': 'john.doe@hiscoolserver.com', 'upos': 'X', 'xpos': 'ADD', 'head': 100, 'deprel': 'appos', 'ner': 'O'}),id=101), Annotation(541,543,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADV', 'xpos': 'RB', 'head': 110, 'deprel': 'cc', 'ner': 'O'}),id=102), Annotation(544,548,Token,features=Features({'text': 'well', 'lemma': 'well', 'upos': 'ADV', 'xpos': 'RB', 'Degree': 'Pos', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=103), Annotation(550,552,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADP', 'xpos': 'IN', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=104), Annotation(553,554,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=105), Annotation(554,558,Token,features=Features({'text': 'some', 'lemma': 'some', 'upos': 'DET', 'xpos': 'DT', 'head': 110, 'deprel': 'det', 'ner': 'O'}),id=106), Annotation(559,560,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=107), Annotation(560,564,Token,features=Features({'text': 'cool', 'lemma': 'cool', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 110, 'deprel': 'amod', 'ner': 'O'}),id=108), Annotation(565,566,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=109), Annotation(566,573,Token,features=Features({'text': 'hastags', 'lemma': 'hastag', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=110), Annotation(574,577,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 113, 'deprel': 'cc', 'ner': 'O'}),id=111), Annotation(578,579,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 113, 'deprel': 'det', 'ner': 'O'}),id=112), Annotation(580,585,Token,features=Features({'text': 'bunch', 'lemma': 'bunch', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=113), Annotation(586,588,Token,features=Features({'text': 'of', 'lemma': 'of', 'upos': 'ADP', 'xpos': 'IN', 'head': 115, 'deprel': 'case', 'ner': 'O'}),id=114), Annotation(589,595,Token,features=Features({'text': 'emojis', 'lemma': 'emojis', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 113, 'deprel': 'nmod', 'ner': 'O'}),id=115), Annotation(596,600,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 117, 'deprel': 'case', 'ner': 'O'}),id=116), Annotation(601,602,Token,features=Features({'text': '😽', 'lemma': '😽', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 115, 'deprel': 'nmod', 'ner': 'O'}),id=117), Annotation(603,604,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=118), Annotation(604,605,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 121, 'deprel': 'det', 'ner': 'O'}),id=119), Annotation(606,613,Token,features=Features({'text': 'kissing', 'lemma': 'kissing', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 121, 'deprel': 'compound', 'ner': 'O'}),id=120), Annotation(614,617,Token,features=Features({'text': 'cat', 'lemma': 'cat', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'appos', 'ner': 'O'}),id=121), Annotation(617,618,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=122), Annotation(618,619,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 124, 'deprel': 'punct', 'ner': 'O'}),id=123), Annotation(620,623,Token,features=Features({'text': '👩\\u200d🏫', 'lemma': '👩\\u200d🏫', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=124), Annotation(624,625,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=125), Annotation(625,626,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 128, 'deprel': 'det', 'ner': 'O'}),id=126), Annotation(627,632,Token,features=Features({'text': 'woman', 'lemma': 'woman', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 128, 'deprel': 'compound', 'ner': 'O'}),id=127), Annotation(633,640,Token,features=Features({'text': 'teacher', 'lemma': 'teacher', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 124, 'deprel': 'appos', 'ner': 'O'}),id=128), Annotation(640,641,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=129), Annotation(641,642,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 131, 'deprel': 'punct', 'ner': 'O'}),id=130), Annotation(643,644,Token,features=Features({'text': '🧬', 'lemma': '🧬', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=131), Annotation(645,646,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=132), Annotation(646,649,Token,features=Features({'text': 'DNA', 'lemma': 'dna', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 131, 'deprel': 'appos', 'ner': 'O'}),id=133), Annotation(649,650,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=134), Annotation(650,651,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 136, 'deprel': 'punct', 'ner': 'O'}),id=135), Annotation(653,654,Token,features=Features({'text': '🧗', 'lemma': '🧗', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=136), Annotation(655,656,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=137), Annotation(656,657,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 139, 'deprel': 'det', 'ner': 'O'}),id=138), Annotation(658,664,Token,features=Features({'text': 'person', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 140, 'deprel': 'nsubj', 'ner': 'O'}),id=139), Annotation(665,673,Token,features=Features({'text': 'climbing', 'lemma': 'climb', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 136, 'deprel': 'appos', 'ner': 'O'}),id=140), Annotation(673,674,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=141), Annotation(674,675,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 92, 'deprel': 'punct', 'ner': 'O'}),id=142), Annotation(678,682,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 146, 'deprel': 'advmod', 'ner': 'O'}),id=144), Annotation(678,829,Sentence,features=Features({}),id=174), Annotation(683,685,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 146, 'deprel': 'nsubj', 'ner': 'O'}),id=145), Annotation(686,690,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Number': 'Plur', 'Person': '1', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 174, 'deprel': 'root', 'ner': 'O'}),id=146), Annotation(691,692,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 150, 'deprel': 'det', 'ner': 'O'}),id=147), Annotation(693,696,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=148), Annotation(697,706,Token,features=Features({'text': 'different', 'lemma': 'different', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=149), Annotation(707,714,Token,features=Features({'text': 'scripts', 'lemma': 'script', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 146, 'deprel': 'obj', 'ner': 'O'}),id=150), Annotation(714,715,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 153, 'deprel': 'punct', 'ner': 'O'}),id=151), Annotation(716,720,Token,features=Features({'text': 'e.g.', 'lemma': 'e.g.', 'upos': 'ADV', 'xpos': 'FW', 'Abbr': 'Yes', 'head': 153, 'deprel': 'advmod', 'ner': 'O'}),id=152), Annotation(721,727,Token,features=Features({'text': 'Hangul', 'lemma': 'Hangul', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'compound', 'ner': 'O'}),id=153), Annotation(728,730,Token,features=Features({'text': '한글', 'lemma': '한글', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=154), Annotation(731,733,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 158, 'deprel': 'cc', 'ner': 'O'}),id=155), Annotation(735,745,Token,features=Features({'text': 'simplified', 'lemma': 'simplify', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 158, 'deprel': 'amod', 'ner': 'O'}),id=156), Annotation(746,751,Token,features=Features({'text': 'Hanzi', 'lemma': 'Hanzi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 158, 'deprel': 'compound', 'ner': 'O'}),id=157), Annotation(752,754,Token,features=Features({'text': '汉字', 'lemma': '汉字', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=158), Annotation(755,757,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 161, 'deprel': 'cc', 'ner': 'O'}),id=159), Annotation(758,763,Token,features=Features({'text': 'Farsi', 'lemma': 'Farsi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'compound', 'ner': 'S-LANGUAGE'}),id=160), Annotation(758,763,LANGUAGE,features=Features({}),id=187), Annotation(764,769,Token,features=Features({'text': 'فارسی', 'lemma': 'فارسی', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=161), Annotation(770,773,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 163, 'deprel': 'cc', 'ner': 'O'}),id=162), Annotation(774,780,Token,features=Features({'text': 'Arabic', 'lemma': 'Arabic', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'conj', 'ner': 'S-LANGUAGE'}),id=163), Annotation(774,780,LANGUAGE,features=Features({}),id=188), Annotation(781,782,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 165, 'deprel': 'punct', 'ner': 'O'}),id=164), Annotation(782,797,Token,features=Features({'text': 'اَلْعَرَبِيَّةُ', 'lemma': 'gamommamgeramam', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=165), Annotation(797,798,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 168, 'deprel': 'punct', 'ner': 'O'}),id=166), Annotation(799,804,Token,features=Features({'text': 'which', 'lemma': 'which', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 168, 'deprel': 'nsubj', 'ner': 'O'}),id=167), Annotation(805,809,Token,features=Features({'text': 'goes', 'lemma': 'go', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 150, 'deprel': 'acl:relcl', 'ner': 'O'}),id=168), Annotation(810,814,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 170, 'deprel': 'case', 'ner': 'O'}),id=169), Annotation(815,820,Token,features=Features({'text': 'right', 'lemma': 'right', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 168, 'deprel': 'obl', 'ner': 'O'}),id=170), Annotation(821,823,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'ADP', 'xpos': 'IN', 'head': 172, 'deprel': 'case', 'ner': 'O'}),id=171), Annotation(824,828,Token,features=Features({'text': 'left', 'lemma': 'leave', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 168, 'deprel': 'advcl', 'ner': 'O'}),id=172), Annotation(828,829,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 146, 'deprel': 'punct', 'ner': 'O'}),id=173)]), '': AnnotationSet([Annotation(0,4,Token,features=Features({}),id=0), Annotation(5,7,Token,features=Features({}),id=1), Annotation(8,9,Token,features=Features({}),id=2), Annotation(10,14,Token,features=Features({}),id=3), Annotation(15,24,Token,features=Features({}),id=4), Annotation(26,28,Token,features=Features({}),id=5), Annotation(29,37,Token,features=Features({}),id=6), Annotation(38,42,Token,features=Features({}),id=7), Annotation(43,44,Token,features=Features({}),id=8), Annotation(45,48,Token,features=Features({}),id=9), Annotation(49,59,Token,features=Features({}),id=10), Annotation(61,65,Token,features=Features({}),id=11), Annotation(66,68,Token,features=Features({}),id=12), Annotation(69,70,Token,features=Features({}),id=13), Annotation(71,79,Token,features=Features({}),id=14), Annotation(80,84,Token,features=Features({}),id=15), Annotation(85,93,Token,features=Features({}),id=16), Annotation(94,95,Token,features=Features({}),id=17), Annotation(96,99,Token,features=Features({}),id=18), Annotation(100,105,Token,features=Features({}),id=19), Annotation(106,114,Token,features=Features({}),id=20), Annotation(115,119,Token,features=Features({}),id=21), Annotation(121,124,Token,features=Features({}),id=22), Annotation(125,132,Token,features=Features({}),id=23), Annotation(133,139,Token,features=Features({}),id=24), Annotation(140,145,Token,features=Features({}),id=25), Annotation(146,148,Token,features=Features({}),id=26), Annotation(149,155,Token,features=Features({}),id=27), Annotation(156,159,Token,features=Features({}),id=28), Annotation(160,163,Token,features=Features({}),id=29), Annotation(164,169,Token,features=Features({}),id=30), Annotation(169,170,Token,features=Features({}),id=31), Annotation(171,180,Token,features=Features({}),id=32), Annotation(181,185,Token,features=Features({}),id=33), Annotation(186,189,Token,features=Features({}),id=34), Annotation(190,194,Token,features=Features({}),id=35), Annotation(195,199,Token,features=Features({}),id=36), Annotation(199,200,Token,features=Features({}),id=37), Annotation(201,207,Token,features=Features({}),id=38), Annotation(208,210,Token,features=Features({}),id=39), Annotation(211,218,Token,features=Features({}),id=40), Annotation(219,221,Token,features=Features({}),id=41), Annotation(222,231,Token,features=Features({}),id=42), Annotation(232,236,Token,features=Features({}),id=43), Annotation(238,244,Token,features=Features({}),id=44), Annotation(244,245,Token,features=Features({}),id=45), Annotation(246,255,Token,features=Features({}),id=46), Annotation(256,258,Token,features=Features({}),id=47), Annotation(259,266,Token,features=Features({}),id=48), Annotation(267,270,Token,features=Features({}),id=49), Annotation(271,275,Token,features=Features({}),id=50), Annotation(276,278,Token,features=Features({}),id=51), Annotation(279,285,Token,features=Features({}),id=52), Annotation(286,291,Token,features=Features({}),id=53), Annotation(291,292,Token,features=Features({}),id=54), Annotation(293,295,Token,features=Features({}),id=55), Annotation(296,299,Token,features=Features({}),id=56), Annotation(300,303,Token,features=Features({}),id=57), Annotation(304,306,Token,features=Features({}),id=58), Annotation(307,310,Token,features=Features({}),id=59), Annotation(311,315,Token,features=Features({}),id=60), Annotation(316,319,Token,features=Features({}),id=61), Annotation(320,321,Token,features=Features({}),id=62), Annotation(321,322,Token,features=Features({}),id=63), Annotation(324,328,Token,features=Features({}),id=64), Annotation(329,332,Token,features=Features({}),id=65), Annotation(333,338,Token,features=Features({}),id=66), Annotation(339,346,Token,features=Features({}),id=67), Annotation(347,350,Token,features=Features({}),id=68), Annotation(351,355,Token,features=Features({}),id=69), Annotation(356,362,Token,features=Features({}),id=70), Annotation(363,367,Token,features=Features({}),id=71), Annotation(368,371,Token,features=Features({}),id=72), Annotation(372,384,Token,features=Features({}),id=73), Annotation(385,392,Token,features=Features({}),id=74), Annotation(392,393,Token,features=Features({}),id=75), Annotation(394,399,Token,features=Features({}),id=76), Annotation(400,402,Token,features=Features({}),id=77), Annotation(403,407,Token,features=Features({}),id=78), Annotation(408,410,Token,features=Features({}),id=79), Annotation(411,416,Token,features=Features({}),id=80), Annotation(417,421,Token,features=Features({}),id=81), Annotation(421,422,Token,features=Features({}),id=82), Annotation(424,428,Token,features=Features({}),id=83), Annotation(429,431,Token,features=Features({}),id=84), Annotation(432,439,Token,features=Features({}),id=85), Annotation(440,441,Token,features=Features({}),id=86), Annotation(442,445,Token,features=Features({}),id=87), Annotation(446,451,Token,features=Features({}),id=88), Annotation(451,452,Token,features=Features({}),id=89), Annotation(452,487,Token,features=Features({}),id=90), Annotation(489,492,Token,features=Features({}),id=91), Annotation(493,494,Token,features=Features({}),id=92), Annotation(495,499,Token,features=Features({}),id=93), Annotation(500,505,Token,features=Features({}),id=94), Annotation(506,513,Token,features=Features({}),id=95), Annotation(514,522,Token,features=Features({}),id=96), Annotation(522,523,Token,features=Features({}),id=97), Annotation(523,540,Token,features=Features({}),id=98), Annotation(541,543,Token,features=Features({}),id=99), Annotation(544,548,Token,features=Features({}),id=100), Annotation(550,552,Token,features=Features({}),id=101), Annotation(553,554,Token,features=Features({}),id=102), Annotation(554,558,Token,features=Features({}),id=103), Annotation(559,560,Token,features=Features({}),id=104), Annotation(560,564,Token,features=Features({}),id=105), Annotation(565,566,Token,features=Features({}),id=106), Annotation(566,573,Token,features=Features({}),id=107), Annotation(574,577,Token,features=Features({}),id=108), Annotation(578,579,Token,features=Features({}),id=109), Annotation(580,585,Token,features=Features({}),id=110), Annotation(586,588,Token,features=Features({}),id=111), Annotation(589,595,Token,features=Features({}),id=112), Annotation(596,600,Token,features=Features({}),id=113), Annotation(601,602,Token,features=Features({}),id=114), Annotation(603,604,Token,features=Features({}),id=115), Annotation(604,605,Token,features=Features({}),id=116), Annotation(606,613,Token,features=Features({}),id=117), Annotation(614,617,Token,features=Features({}),id=118), Annotation(617,618,Token,features=Features({}),id=119), Annotation(618,619,Token,features=Features({}),id=120), Annotation(620,623,Token,features=Features({}),id=121), Annotation(624,625,Token,features=Features({}),id=122), Annotation(625,626,Token,features=Features({}),id=123), Annotation(627,632,Token,features=Features({}),id=124), Annotation(633,640,Token,features=Features({}),id=125), Annotation(640,641,Token,features=Features({}),id=126), Annotation(641,642,Token,features=Features({}),id=127), Annotation(643,644,Token,features=Features({}),id=128), Annotation(645,646,Token,features=Features({}),id=129), Annotation(646,649,Token,features=Features({}),id=130), Annotation(649,650,Token,features=Features({}),id=131), Annotation(650,651,Token,features=Features({}),id=132), Annotation(653,654,Token,features=Features({}),id=133), Annotation(655,656,Token,features=Features({}),id=134), Annotation(656,657,Token,features=Features({}),id=135), Annotation(658,664,Token,features=Features({}),id=136), Annotation(665,673,Token,features=Features({}),id=137), Annotation(673,674,Token,features=Features({}),id=138), Annotation(674,675,Token,features=Features({}),id=139), Annotation(678,682,Token,features=Features({}),id=140), Annotation(683,685,Token,features=Features({}),id=141), Annotation(686,690,Token,features=Features({}),id=142), Annotation(691,692,Token,features=Features({}),id=143), Annotation(693,696,Token,features=Features({}),id=144), Annotation(697,706,Token,features=Features({}),id=145), Annotation(707,714,Token,features=Features({}),id=146), Annotation(714,715,Token,features=Features({}),id=147), Annotation(716,720,Token,features=Features({}),id=148), Annotation(721,727,Token,features=Features({}),id=149), Annotation(728,730,Token,features=Features({}),id=150), Annotation(731,733,Token,features=Features({}),id=151), Annotation(735,745,Token,features=Features({}),id=152), Annotation(746,751,Token,features=Features({}),id=153), Annotation(752,754,Token,features=Features({}),id=154), Annotation(755,757,Token,features=Features({}),id=155), Annotation(758,763,Token,features=Features({}),id=156), Annotation(764,769,Token,features=Features({}),id=157), Annotation(770,773,Token,features=Features({}),id=158), Annotation(774,780,Token,features=Features({}),id=159), Annotation(781,782,Token,features=Features({}),id=160), Annotation(782,797,Token,features=Features({}),id=161), Annotation(797,798,Token,features=Features({}),id=162), Annotation(799,804,Token,features=Features({}),id=163), Annotation(805,809,Token,features=Features({}),id=164), Annotation(810,814,Token,features=Features({}),id=165), Annotation(815,820,Token,features=Features({}),id=166), Annotation(821,823,Token,features=Features({}),id=167), Annotation(824,828,Token,features=Features({}),id=168), Annotation(828,829,Token,features=Features({}),id=169)]), 'TGaz1': AnnotationSet([Annotation(238,244,Lookup,features=Features({'what': 'company', 'country': 'Everywhere, really!'}),id=3), Annotation(279,291,Lookup,features=Features({'what': 'person', 'country': 'US'}),id=4), Annotation(333,346,Lookup,features=Features({'what': 'person', 'country': 'UK'}),id=5)]), 'TGaz2': AnnotationSet([Annotation(238,244,Lookup,features=Features({'where': 'Everywhere, really!'}),id=0), Annotation(279,291,Lookup,features=Features({'what': 'person', 'country': 'US'}),id=1), Annotation(333,346,Lookup,features=Features({'what': 'person', 'country': 'UK'}),id=2)]), 'Pampac1': AnnotationSet([Annotation(133,139,PersOrOrg,features=Features({}),id=0), Annotation(140,145,PersOrOrg,features=Features({}),id=1), Annotation(149,155,PersOrOrg,features=Features({}),id=2), Annotation(156,159,PersOrOrg,features=Features({}),id=3), Annotation(160,163,PersOrOrg,features=Features({}),id=4), Annotation(164,169,PersOrOrg,features=Features({}),id=5), Annotation(238,244,PersOrOrg,features=Features({}),id=6), Annotation(246,255,PersOrOrg,features=Features({}),id=7), Annotation(259,265,PersOrOrg,features=Features({}),id=8), Annotation(279,285,PersOrOrg,features=Features({}),id=9), Annotation(286,291,PersOrOrg,features=Features({}),id=10), Annotation(333,338,PersOrOrg,features=Features({}),id=11), Annotation(339,346,PersOrOrg,features=Features({}),id=12), Annotation(351,355,PersOrOrg,features=Features({}),id=13), Annotation(372,384,PersOrOrg,features=Features({}),id=14)])})" ] }, "execution_count": 60, "metadata": {}, "output_type": "execute_result" } ], "source": [ "doc2" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PAMPAC - Example 2\n", "\n", "Create a rule that annotates any Sequence of two or more Token annotations which have a \"upos\" tag of \"PROPN\", separated by at most one other arbitrary token:" ] }, { "cell_type": "code", "execution_count": 61, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "7" ] }, "execution_count": 61, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from gatenlp.pam.pampac import Ann, AnnAt, Rule, Pampac, AddAnn, N, Seq\n", "from gatenlp.pam.matcher import FeatureMatcher, IfNot\n", "\n", "feat = FeatureMatcher(upos=\"PROPN\")\n", "r1 = Rule(\n", " # first the pattern\n", " Seq( Ann(\"Token\", features=feat),\n", " N( Seq( N(Ann(\"Token\", features=IfNot(feat)), min=0, max=1),\n", " Ann(\"Token\", features=feat)),\n", " min=1, max=99),\n", " name=\"seq1\"\n", " ),\n", " # then the action for the pattern\n", " AddAnn(name=\"seq1\", type=\"PROPNSEQ\")\n", ")\n", "# get the annotations we want to use for matching\n", "anns2match = doc2.annset(\"Stanza\").with_type(\"Token\")\n", "\n", "outset = doc2.annset(\"Pampac2\")\n", "outset.clear()\n", "# Create the Pampac instance from the single rule and run it on the annotations, also specify output set\n", "# The run method returns the list of offsets and the action return values where the rule matches in the doc\n", "Pampac(r1).run(doc2, anns2match, outset=outset)\n", "len(outset)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PAMPAC - Example 2\n", "\n", "Result: found 8 matches and added annotations for them:" ] }, { "cell_type": "code", "execution_count": 62, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is a test document.\n", "\n", "It contains just a few sentences. \n", "Here is a sentence that mentions a few named entities like \n", "the persons Barack Obama or Ursula von der Leyen, locations\n", "like New York City, Vienna or Beijing or companies like \n", "Google, UniCredit or Huawei. And here is Donald Trump, it may not be the real one :P\n", "\n", "Lets say Boris Johnson aka Bojo tweets from his BorisJohnson account, would be nice to match them!\n", "\n", "Here we include a URL https://gatenlp.github.io/python-gatenlp/ \n", "and a fake email address john.doe@hiscoolserver.com as well \n", "as #some #cool #hastags and a bunch of emojis like 😽 (a kissing cat),\n", "👩‍🏫 (a woman teacher), 🧬 (DNA), \n", "🧗 (a person climbing), \n", "\n", "Here we test a few different scripts, e.g. Hangul 한글 or \n", "simplified Hanzi 汉字 or Farsi فارسی and Arabic ,اَلْعَرَبِيَّةُ, which goes from right to left.\n", "\n", ",features=Features({}),anns={'Spacy': AnnotationSet([Annotation(0,4,Token,features=Features({'_i': 0, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'this', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12943039165150086467, 'pos': 'PRON', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'Xxxx', 'suffix': 'his', 'ent_type': '', 'dep': 'nsubj', 'head': 2, 'left_edge': 0, 'right_edge': 0}),id=0), Annotation(0,24,Sentence,features=Features({}),id=336), Annotation(0,4,NounChunk,features=Features({}),id=342), Annotation(4,5,SpaceToken,features=Features({'is_space': True}),id=1), Annotation(5,7,Token,features=Features({'_i': 1, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ROOT', 'head': 2, 'left_edge': 0, 'right_edge': 9}),id=2), Annotation(7,8,SpaceToken,features=Features({'is_space': True}),id=3), Annotation(8,9,Token,features=Features({'_i': 2, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 8, 'left_edge': 4, 'right_edge': 4}),id=4), Annotation(8,23,NounChunk,features=Features({}),id=343), Annotation(9,10,SpaceToken,features=Features({'is_space': True}),id=5), Annotation(10,14,Token,features=Features({'_i': 3, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'compound', 'head': 8, 'left_edge': 6, 'right_edge': 6}),id=6), Annotation(14,15,SpaceToken,features=Features({'is_space': True}),id=7), Annotation(15,23,Token,features=Features({'_i': 4, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'document', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2376024544662810659, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'attr', 'head': 2, 'left_edge': 4, 'right_edge': 8}),id=8), Annotation(23,24,Token,features=Features({'_i': 5, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 2, 'left_edge': 9, 'right_edge': 9}),id=9), Annotation(24,26,SpaceToken,features=Features({'_i': 6, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 13, 'left_edge': 10, 'right_edge': 10}),id=10), Annotation(24,59,Sentence,features=Features({}),id=337), Annotation(26,28,Token,features=Features({'_i': 7, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7859011591137717335, 'pos': 'PRON', 'prefix': 'I', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'Xx', 'suffix': 'It', 'ent_type': '', 'dep': 'nsubj', 'head': 13, 'left_edge': 11, 'right_edge': 11}),id=11), Annotation(26,28,NounChunk,features=Features({}),id=344), Annotation(28,29,SpaceToken,features=Features({'is_space': True}),id=12), Annotation(29,37,Token,features=Features({'_i': 8, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'contain', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4189568596636683296, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ins', 'ent_type': '', 'dep': 'ROOT', 'head': 13, 'left_edge': 10, 'right_edge': 22}),id=13), Annotation(37,38,SpaceToken,features=Features({'is_space': True}),id=14), Annotation(38,42,Token,features=Features({'_i': 9, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'just', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7148522813498185515, 'pos': 'ADV', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ust', 'ent_type': '', 'dep': 'advmod', 'head': 19, 'left_edge': 15, 'right_edge': 15}),id=15), Annotation(38,58,NounChunk,features=Features({}),id=345), Annotation(42,43,SpaceToken,features=Features({'is_space': True}),id=16), Annotation(43,44,Token,features=Features({'_i': 10, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 19, 'left_edge': 17, 'right_edge': 17}),id=17), Annotation(44,45,SpaceToken,features=Features({'is_space': True}),id=18), Annotation(45,48,Token,features=Features({'_i': 11, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 21, 'left_edge': 15, 'right_edge': 19}),id=19), Annotation(48,49,SpaceToken,features=Features({'is_space': True}),id=20), Annotation(49,58,Token,features=Features({'_i': 12, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5257340109698985342, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ces', 'ent_type': '', 'dep': 'dobj', 'head': 13, 'left_edge': 15, 'right_edge': 21}),id=21), Annotation(58,59,Token,features=Features({'_i': 13, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 13, 'left_edge': 22, 'right_edge': 22}),id=22), Annotation(59,60,SpaceToken,features=Features({'is_space': True}),id=23), Annotation(60,61,SpaceToken,features=Features({'_i': 14, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 27, 'left_edge': 24, 'right_edge': 24}),id=24), Annotation(60,266,Sentence,features=Features({}),id=338), Annotation(61,65,Token,features=Features({'_i': 15, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 27, 'left_edge': 25, 'right_edge': 25}),id=25), Annotation(65,66,SpaceToken,features=Features({'is_space': True}),id=26), Annotation(66,68,Token,features=Features({'_i': 16, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 68, 'left_edge': 24, 'right_edge': 65}),id=27), Annotation(68,69,SpaceToken,features=Features({'is_space': True}),id=28), Annotation(69,70,Token,features=Features({'_i': 17, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 31, 'left_edge': 29, 'right_edge': 29}),id=29), Annotation(69,79,NounChunk,features=Features({}),id=346), Annotation(70,71,SpaceToken,features=Features({'is_space': True}),id=30), Annotation(71,79,Token,features=Features({'_i': 18, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'sentence', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18108853898452662235, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nce', 'ent_type': '', 'dep': 'attr', 'head': 27, 'left_edge': 29, 'right_edge': 64}),id=31), Annotation(79,80,SpaceToken,features=Features({'is_space': True}),id=32), Annotation(80,84,Token,features=Features({'_i': 19, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'that', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4380130941430378203, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'hat', 'ent_type': '', 'dep': 'nsubj', 'head': 35, 'left_edge': 33, 'right_edge': 33}),id=33), Annotation(80,84,NounChunk,features=Features({}),id=347), Annotation(84,85,SpaceToken,features=Features({'is_space': True}),id=34), Annotation(85,93,Token,features=Features({'_i': 20, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'mention', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 834570530775529781, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'relcl', 'head': 31, 'left_edge': 33, 'right_edge': 64}),id=35), Annotation(93,94,SpaceToken,features=Features({'is_space': True}),id=36), Annotation(94,95,Token,features=Features({'_i': 21, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 43, 'left_edge': 37, 'right_edge': 37}),id=37), Annotation(94,114,NounChunk,features=Features({}),id=348), Annotation(95,96,SpaceToken,features=Features({'is_space': True}),id=38), Annotation(96,99,Token,features=Features({'_i': 22, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 39, 'right_edge': 39}),id=39), Annotation(99,100,SpaceToken,features=Features({'is_space': True}),id=40), Annotation(100,105,Token,features=Features({'_i': 23, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'name', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6719370519630147158, 'pos': 'VERB', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'med', 'ent_type': '', 'dep': 'amod', 'head': 43, 'left_edge': 41, 'right_edge': 41}),id=41), Annotation(105,106,SpaceToken,features=Features({'is_space': True}),id=42), Annotation(106,114,Token,features=Features({'_i': 24, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'entity', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7701268449352692754, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'dobj', 'head': 35, 'left_edge': 37, 'right_edge': 64}),id=43), Annotation(114,115,SpaceToken,features=Features({'is_space': True}),id=44), Annotation(115,119,Token,features=Features({'_i': 25, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 43, 'left_edge': 45, 'right_edge': 64}),id=45), Annotation(119,120,SpaceToken,features=Features({'is_space': True}),id=46), Annotation(120,121,SpaceToken,features=Features({'_i': 26, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 45, 'left_edge': 47, 'right_edge': 47}),id=47), Annotation(121,124,Token,features=Features({'_i': 27, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 50, 'left_edge': 48, 'right_edge': 48}),id=48), Annotation(121,132,NounChunk,features=Features({}),id=349), Annotation(124,125,SpaceToken,features=Features({'is_space': True}),id=49), Annotation(125,132,Token,features=Features({'_i': 28, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17520808660558581486, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'pobj', 'head': 45, 'left_edge': 48, 'right_edge': 64}),id=50), Annotation(132,133,SpaceToken,features=Features({'is_space': True}),id=51), Annotation(133,139,Token,features=Features({'_i': 29, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Barack', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15388493565120789335, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ack', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 54, 'left_edge': 52, 'right_edge': 52}),id=52), Annotation(133,145,PERSON,features=Features({'lemma': 'Barack Obama'}),id=318), Annotation(133,145,NounChunk,features=Features({}),id=350), Annotation(139,140,SpaceToken,features=Features({'is_space': True}),id=53), Annotation(140,145,Token,features=Features({'_i': 30, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Obama', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4857242187112322394, 'pos': 'PROPN', 'prefix': 'O', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ama', 'ent_type': 'PERSON', 'dep': 'appos', 'head': 50, 'left_edge': 52, 'right_edge': 64}),id=54), Annotation(145,146,SpaceToken,features=Features({'is_space': True}),id=55), Annotation(146,148,Token,features=Features({'_i': 31, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 54, 'left_edge': 56, 'right_edge': 56}),id=56), Annotation(148,149,SpaceToken,features=Features({'is_space': True}),id=57), Annotation(149,155,Token,features=Features({'_i': 32, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Ursula', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13791600832320669420, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ula', 'ent_type': 'GPE', 'dep': 'compound', 'head': 62, 'left_edge': 58, 'right_edge': 58}),id=58), Annotation(149,155,GPE,features=Features({'lemma': 'Ursula'}),id=319), Annotation(149,169,NounChunk,features=Features({}),id=351), Annotation(155,156,SpaceToken,features=Features({'is_space': True}),id=59), Annotation(156,159,Token,features=Features({'_i': 33, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'von', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7079615728476843702, 'pos': 'PROPN', 'prefix': 'v', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxx', 'suffix': 'von', 'ent_type': '', 'dep': 'compound', 'head': 62, 'left_edge': 60, 'right_edge': 60}),id=60), Annotation(159,160,SpaceToken,features=Features({'is_space': True}),id=61), Annotation(160,163,Token,features=Features({'_i': 34, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'der', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9250722957692387333, 'pos': 'NOUN', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'der', 'ent_type': '', 'dep': 'compound', 'head': 64, 'left_edge': 58, 'right_edge': 62}),id=62), Annotation(163,164,SpaceToken,features=Features({'is_space': True}),id=63), Annotation(164,169,Token,features=Features({'_i': 35, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Leyen', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13266048868047277473, 'pos': 'PROPN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'yen', 'ent_type': 'GPE', 'dep': 'conj', 'head': 54, 'left_edge': 58, 'right_edge': 64}),id=64), Annotation(164,169,GPE,features=Features({'lemma': 'Leyen'}),id=320), Annotation(169,170,Token,features=Features({'_i': 36, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 27, 'left_edge': 65, 'right_edge': 65}),id=65), Annotation(170,171,SpaceToken,features=Features({'is_space': True}),id=66), Annotation(171,180,Token,features=Features({'_i': 37, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'location', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13063876858105367867, 'pos': 'NOUN', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ons', 'ent_type': '', 'dep': 'nsubj', 'head': 68, 'left_edge': 67, 'right_edge': 67}),id=67), Annotation(171,180,NounChunk,features=Features({}),id=352), Annotation(180,181,SpaceToken,features=Features({'_i': 38, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 68, 'left_edge': 24, 'right_edge': 99}),id=68), Annotation(181,185,Token,features=Features({'_i': 39, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 68, 'left_edge': 69, 'right_edge': 98}),id=69), Annotation(185,186,SpaceToken,features=Features({'is_space': True}),id=70), Annotation(186,189,Token,features=Features({'_i': 40, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'New', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7503827727184870577, 'pos': 'PROPN', 'prefix': 'N', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxx', 'suffix': 'New', 'ent_type': 'GPE', 'dep': 'compound', 'head': 73, 'left_edge': 71, 'right_edge': 71}),id=71), Annotation(186,199,GPE,features=Features({'lemma': 'New York City'}),id=321), Annotation(186,199,NounChunk,features=Features({}),id=353), Annotation(189,190,SpaceToken,features=Features({'is_space': True}),id=72), Annotation(190,194,Token,features=Features({'_i': 41, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'York', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7898044819112200372, 'pos': 'PROPN', 'prefix': 'Y', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ork', 'ent_type': 'GPE', 'dep': 'compound', 'head': 75, 'left_edge': 71, 'right_edge': 73}),id=73), Annotation(194,195,SpaceToken,features=Features({'is_space': True}),id=74), Annotation(195,199,Token,features=Features({'_i': 42, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'City', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 728339108046626067, 'pos': 'PROPN', 'prefix': 'C', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ity', 'ent_type': 'GPE', 'dep': 'pobj', 'head': 69, 'left_edge': 71, 'right_edge': 98}),id=75), Annotation(199,200,Token,features=Features({'_i': 43, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 75, 'left_edge': 76, 'right_edge': 76}),id=76), Annotation(200,201,SpaceToken,features=Features({'is_space': True}),id=77), Annotation(201,207,Token,features=Features({'_i': 44, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Vienna', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10141676973022394345, 'pos': 'PROPN', 'prefix': 'V', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nna', 'ent_type': 'GPE', 'dep': 'conj', 'head': 75, 'left_edge': 78, 'right_edge': 98}),id=78), Annotation(201,207,GPE,features=Features({'lemma': 'Vienna'}),id=322), Annotation(201,207,NounChunk,features=Features({}),id=354), Annotation(207,208,SpaceToken,features=Features({'is_space': True}),id=79), Annotation(208,210,Token,features=Features({'_i': 45, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 80, 'right_edge': 80}),id=80), Annotation(210,211,SpaceToken,features=Features({'is_space': True}),id=81), Annotation(211,218,Token,features=Features({'_i': 46, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Beijing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2949147885855558195, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ing', 'ent_type': 'GPE', 'dep': 'conj', 'head': 78, 'left_edge': 82, 'right_edge': 82}),id=82), Annotation(211,218,GPE,features=Features({'lemma': 'Beijing'}),id=323), Annotation(211,218,NounChunk,features=Features({}),id=355), Annotation(218,219,SpaceToken,features=Features({'is_space': True}),id=83), Annotation(219,221,Token,features=Features({'_i': 47, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 78, 'left_edge': 84, 'right_edge': 84}),id=84), Annotation(221,222,SpaceToken,features=Features({'is_space': True}),id=85), Annotation(222,231,Token,features=Features({'_i': 48, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'company', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8026612326651866097, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ies', 'ent_type': '', 'dep': 'conj', 'head': 78, 'left_edge': 86, 'right_edge': 98}),id=86), Annotation(222,231,NounChunk,features=Features({}),id=356), Annotation(231,232,SpaceToken,features=Features({'is_space': True}),id=87), Annotation(232,236,Token,features=Features({'_i': 49, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 86, 'left_edge': 88, 'right_edge': 98}),id=88), Annotation(236,237,SpaceToken,features=Features({'is_space': True}),id=89), Annotation(237,238,SpaceToken,features=Features({'_i': 50, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 88, 'left_edge': 90, 'right_edge': 90}),id=90), Annotation(238,244,Token,features=Features({'_i': 51, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Google', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11578853341595296054, 'pos': 'PROPN', 'prefix': 'G', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gle', 'ent_type': '', 'dep': 'pobj', 'head': 88, 'left_edge': 91, 'right_edge': 98}),id=91), Annotation(238,244,NounChunk,features=Features({}),id=357), Annotation(244,245,Token,features=Features({'_i': 52, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 91, 'left_edge': 92, 'right_edge': 92}),id=92), Annotation(245,246,SpaceToken,features=Features({'is_space': True}),id=93), Annotation(246,255,Token,features=Features({'_i': 53, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'UniCredit', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7663443051765273511, 'pos': 'PROPN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxXxxxx', 'suffix': 'dit', 'ent_type': 'ORG', 'dep': 'conj', 'head': 91, 'left_edge': 94, 'right_edge': 98}),id=94), Annotation(246,255,ORG,features=Features({'lemma': 'UniCredit'}),id=324), Annotation(246,255,NounChunk,features=Features({}),id=358), Annotation(255,256,SpaceToken,features=Features({'is_space': True}),id=95), Annotation(256,258,Token,features=Features({'_i': 54, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 94, 'left_edge': 96, 'right_edge': 96}),id=96), Annotation(258,259,SpaceToken,features=Features({'is_space': True}),id=97), Annotation(259,265,Token,features=Features({'_i': 55, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Huawei', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4639919019609272215, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'wei', 'ent_type': 'ORG', 'dep': 'conj', 'head': 94, 'left_edge': 98, 'right_edge': 98}),id=98), Annotation(259,265,ORG,features=Features({'lemma': 'Huawei'}),id=325), Annotation(259,265,NounChunk,features=Features({}),id=359), Annotation(265,266,Token,features=Features({'_i': 56, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 68, 'left_edge': 99, 'right_edge': 99}),id=99), Annotation(266,267,SpaceToken,features=Features({'is_space': True}),id=100), Annotation(267,270,Token,features=Features({'_i': 57, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12172435438170721471, 'pos': 'CCONJ', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'Xxx', 'suffix': 'And', 'ent_type': '', 'dep': 'cc', 'head': 105, 'left_edge': 101, 'right_edge': 101}),id=101), Annotation(267,422,Sentence,features=Features({}),id=339), Annotation(270,271,SpaceToken,features=Features({'is_space': True}),id=102), Annotation(271,275,Token,features=Features({'_i': 58, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 411390626470654571, 'pos': 'ADV', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 105, 'left_edge': 103, 'right_edge': 103}),id=103), Annotation(275,276,SpaceToken,features=Features({'is_space': True}),id=104), Annotation(276,278,Token,features=Features({'_i': 59, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3411606890003347522, 'pos': 'AUX', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xx', 'suffix': 'is', 'ent_type': '', 'dep': 'ccomp', 'head': 118, 'left_edge': 101, 'right_edge': 109}),id=105), Annotation(278,279,SpaceToken,features=Features({'is_space': True}),id=106), Annotation(279,285,Token,features=Features({'_i': 60, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Donald', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16889399016836222064, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ald', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 109, 'left_edge': 107, 'right_edge': 107}),id=107), Annotation(279,291,PERSON,features=Features({'lemma': 'Donald Trump'}),id=326), Annotation(279,291,NounChunk,features=Features({}),id=360), Annotation(285,286,SpaceToken,features=Features({'is_space': True}),id=108), Annotation(286,291,Token,features=Features({'_i': 61, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Trump', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1134333841961332695, 'pos': 'PROPN', 'prefix': 'T', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ump', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 105, 'left_edge': 107, 'right_edge': 109}),id=109), Annotation(291,292,Token,features=Features({'_i': 62, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 118, 'left_edge': 110, 'right_edge': 110}),id=110), Annotation(292,293,SpaceToken,features=Features({'is_space': True}),id=111), Annotation(293,295,Token,features=Features({'_i': 63, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'it', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10239237003504588839, 'pos': 'PRON', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'it', 'ent_type': '', 'dep': 'nsubj', 'head': 118, 'left_edge': 112, 'right_edge': 112}),id=112), Annotation(293,295,NounChunk,features=Features({}),id=361), Annotation(295,296,SpaceToken,features=Features({'is_space': True}),id=113), Annotation(296,299,Token,features=Features({'_i': 64, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'may', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14378475389916013800, 'pos': 'AUX', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxx', 'suffix': 'may', 'ent_type': '', 'dep': 'aux', 'head': 118, 'left_edge': 114, 'right_edge': 114}),id=114), Annotation(299,300,SpaceToken,features=Features({'is_space': True}),id=115), Annotation(300,303,Token,features=Features({'_i': 65, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'not', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 447765159362469301, 'pos': 'PART', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'not', 'ent_type': '', 'dep': 'neg', 'head': 118, 'left_edge': 116, 'right_edge': 116}),id=116), Annotation(303,304,SpaceToken,features=Features({'is_space': True}),id=117), Annotation(304,306,Token,features=Features({'_i': 66, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ccomp', 'head': 130, 'left_edge': 101, 'right_edge': 127}),id=118), Annotation(306,307,SpaceToken,features=Features({'is_space': True}),id=119), Annotation(307,310,Token,features=Features({'_i': 67, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'the', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7425985699627899538, 'pos': 'DET', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxx', 'suffix': 'the', 'ent_type': '', 'dep': 'det', 'head': 124, 'left_edge': 120, 'right_edge': 120}),id=120), Annotation(310,311,SpaceToken,features=Features({'is_space': True}),id=121), Annotation(311,315,Token,features=Features({'_i': 68, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'real', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7050692189315705983, 'pos': 'ADJ', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'eal', 'ent_type': '', 'dep': 'amod', 'head': 124, 'left_edge': 122, 'right_edge': 122}),id=122), Annotation(315,316,SpaceToken,features=Features({'is_space': True}),id=123), Annotation(316,319,Token,features=Features({'_i': 69, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'one', 'like_email': False, 'like_num': True, 'like_url': False, 'orth': 17454115351911680600, 'pos': 'NUM', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xxx', 'suffix': 'one', 'ent_type': '', 'dep': 'attr', 'head': 118, 'left_edge': 120, 'right_edge': 127}),id=124), Annotation(319,320,SpaceToken,features=Features({'is_space': True}),id=125), Annotation(320,322,Token,features=Features({'_i': 70, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': True, 'lang': 'en', 'lemma': ':P', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11282459806373970386, 'pos': 'PUNCT', 'prefix': ':', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ':', 'shape': ':X', 'suffix': ':P', 'ent_type': '', 'dep': 'punct', 'head': 124, 'left_edge': 126, 'right_edge': 126}),id=126), Annotation(322,324,SpaceToken,features=Features({'_i': 71, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 124, 'left_edge': 127, 'right_edge': 127}),id=127), Annotation(324,328,Token,features=Features({'_i': 72, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'let', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8631549979455155821, 'pos': 'NOUN', 'prefix': 'L', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'Xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'nsubj', 'head': 130, 'left_edge': 128, 'right_edge': 128}),id=128), Annotation(324,328,NounChunk,features=Features({}),id=362), Annotation(328,329,SpaceToken,features=Features({'is_space': True}),id=129), Annotation(329,332,Token,features=Features({'_i': 73, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'say', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8685289367999165211, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxx', 'suffix': 'say', 'ent_type': '', 'dep': 'parataxis', 'head': 153, 'left_edge': 101, 'right_edge': 149}),id=130), Annotation(332,333,SpaceToken,features=Features({'is_space': True}),id=131), Annotation(333,338,Token,features=Features({'_i': 74, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Boris', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3926703778135926261, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'ris', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 134, 'left_edge': 132, 'right_edge': 132}),id=132), Annotation(333,346,PERSON,features=Features({'lemma': 'Boris Johnson'}),id=327), Annotation(333,346,NounChunk,features=Features({}),id=363), Annotation(338,339,SpaceToken,features=Features({'is_space': True}),id=133), Annotation(339,346,Token,features=Features({'_i': 75, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Johnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14020170261514038406, 'pos': 'PROPN', 'prefix': 'J', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'son', 'ent_type': 'PERSON', 'dep': 'nsubj', 'head': 130, 'left_edge': 132, 'right_edge': 148}),id=134), Annotation(346,347,SpaceToken,features=Features({'is_space': True}),id=135), Annotation(347,350,Token,features=Features({'_i': 76, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'aka', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6309978511773728404, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxx', 'suffix': 'aka', 'ent_type': '', 'dep': 'advmod', 'head': 140, 'left_edge': 136, 'right_edge': 136}),id=136), Annotation(347,362,NounChunk,features=Features({}),id=364), Annotation(350,351,SpaceToken,features=Features({'is_space': True}),id=137), Annotation(351,355,Token,features=Features({'_i': 77, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Bojo', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1381500864330977487, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxx', 'suffix': 'ojo', 'ent_type': '', 'dep': 'compound', 'head': 140, 'left_edge': 138, 'right_edge': 138}),id=138), Annotation(355,356,SpaceToken,features=Features({'is_space': True}),id=139), Annotation(356,362,Token,features=Features({'_i': 78, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'tweet', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9158814828460388352, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ets', 'ent_type': '', 'dep': 'appos', 'head': 134, 'left_edge': 136, 'right_edge': 148}),id=140), Annotation(362,363,SpaceToken,features=Features({'is_space': True}),id=141), Annotation(363,367,Token,features=Features({'_i': 79, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 140, 'left_edge': 142, 'right_edge': 148}),id=142), Annotation(367,368,SpaceToken,features=Features({'is_space': True}),id=143), Annotation(368,371,Token,features=Features({'_i': 80, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'his', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2661093235354845946, 'pos': 'PRON', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP$', 'shape': 'xxx', 'suffix': 'his', 'ent_type': '', 'dep': 'poss', 'head': 148, 'left_edge': 144, 'right_edge': 144}),id=144), Annotation(368,392,NounChunk,features=Features({}),id=365), Annotation(371,372,SpaceToken,features=Features({'is_space': True}),id=145), Annotation(372,384,Token,features=Features({'_i': 81, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'BorisJohnson', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6152028365162425637, 'pos': 'PROPN', 'prefix': 'B', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XxxxxXxxxx', 'suffix': 'son', 'ent_type': 'ORG', 'dep': 'compound', 'head': 148, 'left_edge': 146, 'right_edge': 146}),id=146), Annotation(372,384,ORG,features=Features({'lemma': 'BorisJohnson'}),id=328), Annotation(384,385,SpaceToken,features=Features({'is_space': True}),id=147), Annotation(385,392,Token,features=Features({'_i': 82, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'account', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16051117652154763083, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'unt', 'ent_type': '', 'dep': 'pobj', 'head': 142, 'left_edge': 144, 'right_edge': 148}),id=148), Annotation(392,393,Token,features=Features({'_i': 83, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 130, 'left_edge': 149, 'right_edge': 149}),id=149), Annotation(393,394,SpaceToken,features=Features({'is_space': True}),id=150), Annotation(394,399,Token,features=Features({'_i': 84, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'would', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6992604926141104606, 'pos': 'AUX', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'MD', 'shape': 'xxxx', 'suffix': 'uld', 'ent_type': '', 'dep': 'aux', 'head': 153, 'left_edge': 151, 'right_edge': 151}),id=151), Annotation(399,400,SpaceToken,features=Features({'is_space': True}),id=152), Annotation(400,402,Token,features=Features({'_i': 85, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'be', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10382539506755952630, 'pos': 'AUX', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xx', 'suffix': 'be', 'ent_type': '', 'dep': 'ROOT', 'head': 153, 'left_edge': 101, 'right_edge': 162}),id=153), Annotation(402,403,SpaceToken,features=Features({'is_space': True}),id=154), Annotation(403,407,Token,features=Features({'_i': 86, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'nice', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14121509715367036122, 'pos': 'ADJ', 'prefix': 'n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ice', 'ent_type': '', 'dep': 'acomp', 'head': 153, 'left_edge': 155, 'right_edge': 155}),id=155), Annotation(407,408,SpaceToken,features=Features({'is_space': True}),id=156), Annotation(408,410,Token,features=Features({'_i': 87, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'PART', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'TO', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 159, 'left_edge': 157, 'right_edge': 157}),id=157), Annotation(410,411,SpaceToken,features=Features({'is_space': True}),id=158), Annotation(411,416,Token,features=Features({'_i': 88, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'match', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16065818573247886523, 'pos': 'VERB', 'prefix': 'm', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VB', 'shape': 'xxxx', 'suffix': 'tch', 'ent_type': '', 'dep': 'xcomp', 'head': 153, 'left_edge': 157, 'right_edge': 161}),id=159), Annotation(416,417,SpaceToken,features=Features({'is_space': True}),id=160), Annotation(417,421,Token,features=Features({'_i': 89, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'they', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7976265300568155021, 'pos': 'PRON', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xxxx', 'suffix': 'hem', 'ent_type': '', 'dep': 'dobj', 'head': 159, 'left_edge': 161, 'right_edge': 161}),id=161), Annotation(417,421,NounChunk,features=Features({}),id=366), Annotation(421,422,Token,features=Features({'_i': 90, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '!', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17494803046312582752, 'pos': 'PUNCT', 'prefix': '!', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '!', 'suffix': '!', 'ent_type': '', 'dep': 'punct', 'head': 153, 'left_edge': 162, 'right_edge': 162}),id=162), Annotation(422,424,SpaceToken,features=Features({'_i': 91, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 163, 'right_edge': 163}),id=163), Annotation(422,829,Sentence,features=Features({}),id=340), Annotation(424,428,Token,features=Features({'_i': 92, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 164, 'right_edge': 164}),id=164), Annotation(428,429,SpaceToken,features=Features({'is_space': True}),id=165), Annotation(429,431,Token,features=Features({'_i': 93, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 168, 'left_edge': 166, 'right_edge': 166}),id=166), Annotation(429,431,NounChunk,features=Features({}),id=367), Annotation(431,432,SpaceToken,features=Features({'is_space': True}),id=167), Annotation(432,439,Token,features=Features({'_i': 94, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'include', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14049642289933595219, 'pos': 'VERB', 'prefix': 'i', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'ude', 'ent_type': '', 'dep': 'ccomp', 'head': 265, 'left_edge': 163, 'right_edge': 257}),id=168), Annotation(439,440,SpaceToken,features=Features({'is_space': True}),id=169), Annotation(440,441,Token,features=Features({'_i': 95, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 172, 'left_edge': 170, 'right_edge': 170}),id=170), Annotation(441,442,SpaceToken,features=Features({'is_space': True}),id=171), Annotation(442,445,Token,features=Features({'_i': 96, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'url', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2582013287274679728, 'pos': 'NOUN', 'prefix': 'U', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'XXX', 'suffix': 'URL', 'ent_type': '', 'dep': 'nmod', 'head': 176, 'left_edge': 170, 'right_edge': 174}),id=172), Annotation(445,446,SpaceToken,features=Features({'is_space': True}),id=173), Annotation(446,487,Token,features=Features({'_i': 97, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'like_email': False, 'like_num': False, 'like_url': True, 'orth': 616995933894448768, 'pos': 'X', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'FW', 'shape': 'xxxx://xxxx.xxxx.xx/xxxx-xxxx/', 'suffix': 'lp/', 'ent_type': '', 'dep': 'nummod', 'head': 172, 'left_edge': 174, 'right_edge': 174}),id=174), Annotation(487,488,SpaceToken,features=Features({'is_space': True}),id=175), Annotation(488,489,SpaceToken,features=Features({'_i': 98, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 170, 'right_edge': 187}),id=176), Annotation(489,492,Token,features=Features({'_i': 99, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 176, 'left_edge': 177, 'right_edge': 177}),id=177), Annotation(492,493,SpaceToken,features=Features({'is_space': True}),id=178), Annotation(493,494,Token,features=Features({'_i': 100, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 185, 'left_edge': 179, 'right_edge': 179}),id=179), Annotation(494,495,SpaceToken,features=Features({'is_space': True}),id=180), Annotation(495,499,Token,features=Features({'_i': 101, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'fake', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 513889413363281732, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ake', 'ent_type': '', 'dep': 'amod', 'head': 185, 'left_edge': 181, 'right_edge': 181}),id=181), Annotation(499,500,SpaceToken,features=Features({'is_space': True}),id=182), Annotation(500,505,Token,features=Features({'_i': 102, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'email', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7320900731437023467, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ail', 'ent_type': '', 'dep': 'compound', 'head': 185, 'left_edge': 183, 'right_edge': 183}),id=183), Annotation(505,506,SpaceToken,features=Features({'is_space': True}),id=184), Annotation(506,513,Token,features=Features({'_i': 103, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'address', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14349763737243715186, 'pos': 'NOUN', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ess', 'ent_type': '', 'dep': 'conj', 'head': 176, 'left_edge': 179, 'right_edge': 187}),id=185), Annotation(513,514,SpaceToken,features=Features({'is_space': True}),id=186), Annotation(514,540,Token,features=Features({'_i': 104, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'john.doe@hiscoolserver.com', 'like_email': True, 'like_num': False, 'like_url': False, 'orth': 8439314673651519470, 'pos': 'PROPN', 'prefix': 'j', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xxxx.xxx@xxxx.xxx', 'suffix': 'com', 'ent_type': '', 'dep': 'appos', 'head': 185, 'left_edge': 187, 'right_edge': 187}),id=187), Annotation(514,540,NounChunk,features=Features({}),id=368), Annotation(540,541,SpaceToken,features=Features({'is_space': True}),id=188), Annotation(541,543,Token,features=Features({'_i': 105, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADV', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'advmod', 'head': 191, 'left_edge': 189, 'right_edge': 189}),id=189), Annotation(543,544,SpaceToken,features=Features({'is_space': True}),id=190), Annotation(544,548,Token,features=Features({'_i': 106, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'well', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4525988469032889948, 'pos': 'ADV', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'xxxx', 'suffix': 'ell', 'ent_type': '', 'dep': 'advmod', 'head': 168, 'left_edge': 189, 'right_edge': 191}),id=191), Annotation(548,549,SpaceToken,features=Features({'is_space': True}),id=192), Annotation(549,550,SpaceToken,features=Features({'_i': 107, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 193, 'right_edge': 193}),id=193), Annotation(550,552,Token,features=Features({'_i': 108, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'as', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7437575085468336610, 'pos': 'ADP', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'as', 'ent_type': '', 'dep': 'prep', 'head': 168, 'left_edge': 194, 'right_edge': 203}),id=194), Annotation(552,553,SpaceToken,features=Features({'is_space': True}),id=195), Annotation(553,554,Token,features=Features({'_i': 109, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 199, 'left_edge': 196, 'right_edge': 196}),id=196), Annotation(553,560,MONEY,features=Features({'lemma': '#some #'}),id=329), Annotation(553,573,NounChunk,features=Features({}),id=369), Annotation(554,558,Token,features=Features({'_i': 110, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'some', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7000492816108906599, 'pos': 'DET', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'xxxx', 'suffix': 'ome', 'ent_type': 'MONEY', 'dep': 'det', 'head': 199, 'left_edge': 197, 'right_edge': 197}),id=197), Annotation(558,559,SpaceToken,features=Features({'is_space': True}),id=198), Annotation(559,560,Token,features=Features({'_i': 111, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'ADV', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RBR', 'shape': '#', 'suffix': '#', 'ent_type': 'MONEY', 'dep': 'quantmod', 'head': 203, 'left_edge': 196, 'right_edge': 200}),id=199), Annotation(560,564,Token,features=Features({'_i': 112, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cool', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16540862642162215401, 'pos': 'ADJ', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ool', 'ent_type': '', 'dep': 'amod', 'head': 199, 'left_edge': 200, 'right_edge': 200}),id=200), Annotation(564,565,SpaceToken,features=Features({'is_space': True}),id=201), Annotation(565,566,Token,features=Features({'_i': 113, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '#', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 8663801465970268676, 'pos': 'SYM', 'prefix': '#', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '$', 'shape': '#', 'suffix': '#', 'ent_type': '', 'dep': 'nmod', 'head': 203, 'left_edge': 202, 'right_edge': 202}),id=202), Annotation(566,573,Token,features=Features({'_i': 114, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'hastag', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12335920655888895378, 'pos': 'NOUN', 'prefix': 'h', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'ags', 'ent_type': '', 'dep': 'pobj', 'head': 194, 'left_edge': 196, 'right_edge': 203}),id=203), Annotation(573,574,SpaceToken,features=Features({'is_space': True}),id=204), Annotation(574,577,Token,features=Features({'_i': 115, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 168, 'left_edge': 205, 'right_edge': 205}),id=205), Annotation(577,578,SpaceToken,features=Features({'is_space': True}),id=206), Annotation(578,579,Token,features=Features({'_i': 116, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 209, 'left_edge': 207, 'right_edge': 207}),id=207), Annotation(579,580,SpaceToken,features=Features({'is_space': True}),id=208), Annotation(580,585,Token,features=Features({'_i': 117, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'bunch', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16097686030507304001, 'pos': 'NOUN', 'prefix': 'b', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'nch', 'ent_type': '', 'dep': 'conj', 'head': 168, 'left_edge': 207, 'right_edge': 225}),id=209), Annotation(585,586,SpaceToken,features=Features({'is_space': True}),id=210), Annotation(586,588,Token,features=Features({'_i': 118, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'of', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 886050111519832510, 'pos': 'ADP', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'of', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 211, 'right_edge': 213}),id=211), Annotation(588,589,SpaceToken,features=Features({'is_space': True}),id=212), Annotation(589,595,Token,features=Features({'_i': 119, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'emoji', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15308488336638733982, 'pos': 'NOUN', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'jis', 'ent_type': '', 'dep': 'pobj', 'head': 211, 'left_edge': 213, 'right_edge': 213}),id=213), Annotation(589,595,NounChunk,features=Features({}),id=370), Annotation(595,596,SpaceToken,features=Features({'is_space': True}),id=214), Annotation(596,600,Token,features=Features({'_i': 120, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'like', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18194338103975822726, 'pos': 'ADP', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'ike', 'ent_type': '', 'dep': 'prep', 'head': 209, 'left_edge': 215, 'right_edge': 224}),id=215), Annotation(600,601,SpaceToken,features=Features({'is_space': True}),id=216), Annotation(601,602,Token,features=Features({'_i': 121, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '😽', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13410134130775049117, 'pos': 'PROPN', 'prefix': '😽', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '😽', 'suffix': '😽', 'ent_type': '', 'dep': 'pobj', 'head': 215, 'left_edge': 217, 'right_edge': 224}),id=217), Annotation(601,602,NounChunk,features=Features({}),id=371), Annotation(602,603,SpaceToken,features=Features({'is_space': True}),id=218), Annotation(603,604,Token,features=Features({'_i': 122, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 217, 'left_edge': 219, 'right_edge': 219}),id=219), Annotation(604,605,Token,features=Features({'_i': 123, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 224, 'left_edge': 220, 'right_edge': 220}),id=220), Annotation(604,617,NounChunk,features=Features({}),id=372), Annotation(605,606,SpaceToken,features=Features({'is_space': True}),id=221), Annotation(606,613,Token,features=Features({'_i': 124, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'kissing', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17741139538205582276, 'pos': 'NOUN', 'prefix': 'k', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'amod', 'head': 224, 'left_edge': 222, 'right_edge': 222}),id=222), Annotation(613,614,SpaceToken,features=Features({'is_space': True}),id=223), Annotation(614,617,Token,features=Features({'_i': 125, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'cat', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5439657043933447811, 'pos': 'NOUN', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxx', 'suffix': 'cat', 'ent_type': '', 'dep': 'appos', 'head': 217, 'left_edge': 220, 'right_edge': 224}),id=224), Annotation(617,618,Token,features=Features({'_i': 126, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 209, 'left_edge': 225, 'right_edge': 225}),id=225), Annotation(618,619,Token,features=Features({'_i': 127, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 168, 'left_edge': 226, 'right_edge': 226}),id=226), Annotation(619,620,SpaceToken,features=Features({'_i': 128, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 168, 'left_edge': 227, 'right_edge': 227}),id=227), Annotation(620,621,Token,features=Features({'_i': 129, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '👩', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13533745690575315714, 'pos': 'X', 'prefix': '👩', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '👩', 'suffix': '👩', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 228, 'right_edge': 228}),id=228), Annotation(621,622,Token,features=Features({'_i': 130, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\u200d', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17442142028047717517, 'pos': 'X', 'prefix': '\\u200d', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'ADD', 'shape': '\\u200d', 'suffix': '\\u200d', 'ent_type': '', 'dep': 'dobj', 'head': 168, 'left_edge': 229, 'right_edge': 246}),id=229), Annotation(622,623,Token,features=Features({'_i': 131, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🏫', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7253895586487469997, 'pos': 'NOUN', 'prefix': '🏫', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': '🏫', 'suffix': '🏫', 'ent_type': '', 'dep': 'dobj', 'head': 229, 'left_edge': 230, 'right_edge': 246}),id=230), Annotation(622,623,NounChunk,features=Features({}),id=373), Annotation(623,624,SpaceToken,features=Features({'is_space': True}),id=231), Annotation(624,625,Token,features=Features({'_i': 132, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 232, 'right_edge': 232}),id=232), Annotation(625,626,Token,features=Features({'_i': 133, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 237, 'left_edge': 233, 'right_edge': 233}),id=233), Annotation(625,640,NounChunk,features=Features({}),id=374), Annotation(626,627,SpaceToken,features=Features({'is_space': True}),id=234), Annotation(627,632,Token,features=Features({'_i': 134, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'woman', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18308318389743014884, 'pos': 'NOUN', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'man', 'ent_type': '', 'dep': 'compound', 'head': 237, 'left_edge': 235, 'right_edge': 235}),id=235), Annotation(632,633,SpaceToken,features=Features({'is_space': True}),id=236), Annotation(633,640,Token,features=Features({'_i': 135, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'teacher', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 78820239457828971, 'pos': 'NOUN', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'her', 'ent_type': '', 'dep': 'appos', 'head': 230, 'left_edge': 233, 'right_edge': 237}),id=237), Annotation(640,641,Token,features=Features({'_i': 136, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 238, 'right_edge': 238}),id=238), Annotation(641,642,Token,features=Features({'_i': 137, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 239, 'right_edge': 239}),id=239), Annotation(642,643,SpaceToken,features=Features({'is_space': True}),id=240), Annotation(643,644,Token,features=Features({'_i': 138, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧬', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 9031423621242036097, 'pos': 'PROPN', 'prefix': '🧬', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧬', 'suffix': '🧬', 'ent_type': 'PERSON', 'dep': 'conj', 'head': 230, 'left_edge': 241, 'right_edge': 245}),id=241), Annotation(643,644,PERSON,features=Features({'lemma': '🧬'}),id=330), Annotation(643,644,NounChunk,features=Features({}),id=375), Annotation(644,645,SpaceToken,features=Features({'is_space': True}),id=242), Annotation(645,646,Token,features=Features({'_i': 139, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 243, 'right_edge': 243}),id=243), Annotation(646,649,Token,features=Features({'_i': 140, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': True, 'lang': 'en', 'lemma': 'DNA', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14386531691665947480, 'pos': 'PROPN', 'prefix': 'D', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'XXX', 'suffix': 'DNA', 'ent_type': '', 'dep': 'appos', 'head': 241, 'left_edge': 244, 'right_edge': 244}),id=244), Annotation(646,649,NounChunk,features=Features({}),id=376), Annotation(649,650,Token,features=Features({'_i': 141, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 241, 'left_edge': 245, 'right_edge': 245}),id=245), Annotation(650,651,Token,features=Features({'_i': 142, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 230, 'left_edge': 246, 'right_edge': 246}),id=246), Annotation(651,652,SpaceToken,features=Features({'is_space': True}),id=247), Annotation(652,653,SpaceToken,features=Features({'_i': 143, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 256, 'left_edge': 248, 'right_edge': 248}),id=248), Annotation(653,654,Token,features=Features({'_i': 144, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '🧗', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7724749230788735883, 'pos': 'PROPN', 'prefix': '🧗', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': '🧗', 'suffix': '🧗', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 249, 'right_edge': 249}),id=249), Annotation(653,654,NounChunk,features=Features({}),id=377), Annotation(654,655,SpaceToken,features=Features({'is_space': True}),id=250), Annotation(655,656,Token,features=Features({'_i': 145, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': True, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '(', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12638816674900267446, 'pos': 'PUNCT', 'prefix': '(', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-LRB-', 'shape': '(', 'suffix': '(', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 251, 'right_edge': 251}),id=251), Annotation(656,657,Token,features=Features({'_i': 146, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'det', 'head': 254, 'left_edge': 252, 'right_edge': 252}),id=252), Annotation(656,664,NounChunk,features=Features({}),id=378), Annotation(657,658,SpaceToken,features=Features({'is_space': True}),id=253), Annotation(658,664,Token,features=Features({'_i': 147, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'person', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14800503047316267216, 'pos': 'NOUN', 'prefix': 'p', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'son', 'ent_type': '', 'dep': 'nsubj', 'head': 256, 'left_edge': 252, 'right_edge': 254}),id=254), Annotation(664,665,SpaceToken,features=Features({'is_space': True}),id=255), Annotation(665,673,Token,features=Features({'_i': 148, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'climb', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 992864226358664410, 'pos': 'VERB', 'prefix': 'c', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBG', 'shape': 'xxxx', 'suffix': 'ing', 'ent_type': '', 'dep': 'ccomp', 'head': 168, 'left_edge': 248, 'right_edge': 257}),id=256), Annotation(673,674,Token,features=Features({'_i': 149, 'is_alpha': False, 'is_bracket': True, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': True, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ')', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3842344029291005339, 'pos': 'PUNCT', 'prefix': ')', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '-RRB-', 'shape': ')', 'suffix': ')', 'ent_type': '', 'dep': 'punct', 'head': 256, 'left_edge': 257, 'right_edge': 257}),id=257), Annotation(674,675,Token,features=Features({'_i': 150, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 258, 'right_edge': 258}),id=258), Annotation(675,676,SpaceToken,features=Features({'is_space': True}),id=259), Annotation(676,678,SpaceToken,features=Features({'_i': 151, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 265, 'left_edge': 260, 'right_edge': 260}),id=260), Annotation(678,682,Token,features=Features({'_i': 152, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'here', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13973858553651532596, 'pos': 'ADV', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'Xxxx', 'suffix': 'ere', 'ent_type': '', 'dep': 'advmod', 'head': 265, 'left_edge': 261, 'right_edge': 261}),id=261), Annotation(682,683,SpaceToken,features=Features({'is_space': True}),id=262), Annotation(683,685,Token,features=Features({'_i': 153, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'we', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16064069575701507746, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'PRP', 'shape': 'xx', 'suffix': 'we', 'ent_type': '', 'dep': 'nsubj', 'head': 265, 'left_edge': 263, 'right_edge': 263}),id=263), Annotation(683,685,NounChunk,features=Features({}),id=379), Annotation(685,686,SpaceToken,features=Features({'is_space': True}),id=264), Annotation(686,690,Token,features=Features({'_i': 154, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'test', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1618900948208871284, 'pos': 'VERB', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBP', 'shape': 'xxxx', 'suffix': 'est', 'ent_type': '', 'dep': 'ROOT', 'head': 265, 'left_edge': 163, 'right_edge': 316}),id=265), Annotation(690,691,SpaceToken,features=Features({'is_space': True}),id=266), Annotation(691,692,Token,features=Features({'_i': 155, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'a', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11901859001352538922, 'pos': 'DET', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'DT', 'shape': 'x', 'suffix': 'a', 'ent_type': '', 'dep': 'quantmod', 'head': 269, 'left_edge': 267, 'right_edge': 267}),id=267), Annotation(691,714,NounChunk,features=Features({}),id=380), Annotation(692,693,SpaceToken,features=Features({'is_space': True}),id=268), Annotation(693,696,Token,features=Features({'_i': 156, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'few', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 11866476999679706272, 'pos': 'ADJ', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxx', 'suffix': 'few', 'ent_type': '', 'dep': 'nummod', 'head': 273, 'left_edge': 267, 'right_edge': 269}),id=269), Annotation(696,697,SpaceToken,features=Features({'is_space': True}),id=270), Annotation(697,706,Token,features=Features({'_i': 157, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'different', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 17379251038783395147, 'pos': 'ADJ', 'prefix': 'd', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'JJ', 'shape': 'xxxx', 'suffix': 'ent', 'ent_type': '', 'dep': 'amod', 'head': 273, 'left_edge': 271, 'right_edge': 271}),id=271), Annotation(706,707,SpaceToken,features=Features({'is_space': True}),id=272), Annotation(707,714,Token,features=Features({'_i': 158, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'script', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5755375431925009742, 'pos': 'NOUN', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNS', 'shape': 'xxxx', 'suffix': 'pts', 'ent_type': '', 'dep': 'dobj', 'head': 265, 'left_edge': 267, 'right_edge': 315}),id=273), Annotation(714,715,Token,features=Features({'_i': 159, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 273, 'left_edge': 274, 'right_edge': 274}),id=274), Annotation(715,716,SpaceToken,features=Features({'is_space': True}),id=275), Annotation(716,720,Token,features=Features({'_i': 160, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'e.g.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2360781233418247603, 'pos': 'ADV', 'prefix': 'e', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'RB', 'shape': 'x.x.', 'suffix': '.g.', 'ent_type': '', 'dep': 'advmod', 'head': 280, 'left_edge': 276, 'right_edge': 276}),id=276), Annotation(716,730,NounChunk,features=Features({}),id=381), Annotation(720,721,SpaceToken,features=Features({'is_space': True}),id=277), Annotation(721,727,Token,features=Features({'_i': 161, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hangul', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 13901273419332601270, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'gul', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 280, 'left_edge': 278, 'right_edge': 278}),id=278), Annotation(721,727,PERSON,features=Features({'lemma': 'Hangul'}),id=331), Annotation(727,728,SpaceToken,features=Features({'is_space': True}),id=279), Annotation(728,730,Token,features=Features({'_i': 162, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '한글', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 14226613016943586999, 'pos': 'PROPN', 'prefix': '한', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '한글', 'ent_type': '', 'dep': 'appos', 'head': 273, 'left_edge': 276, 'right_edge': 282}),id=280), Annotation(730,731,SpaceToken,features=Features({'is_space': True}),id=281), Annotation(731,733,Token,features=Features({'_i': 163, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 280, 'left_edge': 282, 'right_edge': 282}),id=282), Annotation(733,734,SpaceToken,features=Features({'is_space': True}),id=283), Annotation(734,735,SpaceToken,features=Features({'_i': 164, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 962983613142996970, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n', 'suffix': '\\n', 'ent_type': '', 'dep': 'dep', 'head': 273, 'left_edge': 284, 'right_edge': 284}),id=284), Annotation(735,745,Token,features=Features({'_i': 165, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'simplify', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 15400283218552125021, 'pos': 'VERB', 'prefix': 's', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'ied', 'ent_type': '', 'dep': 'amod', 'head': 289, 'left_edge': 285, 'right_edge': 285}),id=285), Annotation(735,754,NounChunk,features=Features({}),id=382), Annotation(745,746,SpaceToken,features=Features({'is_space': True}),id=286), Annotation(746,751,Token,features=Features({'_i': 166, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Hanzi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7162399416189955260, 'pos': 'PROPN', 'prefix': 'H', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'nzi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 289, 'left_edge': 287, 'right_edge': 287}),id=287), Annotation(746,751,PERSON,features=Features({'lemma': 'Hanzi'}),id=332), Annotation(751,752,SpaceToken,features=Features({'is_space': True}),id=288), Annotation(752,754,Token,features=Features({'_i': 167, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '汉字', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2455570606503990670, 'pos': 'PROPN', 'prefix': '汉', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'xx', 'suffix': '汉字', 'ent_type': '', 'dep': 'conj', 'head': 273, 'left_edge': 285, 'right_edge': 315}),id=289), Annotation(754,755,SpaceToken,features=Features({'is_space': True}),id=290), Annotation(755,757,Token,features=Features({'_i': 168, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'or', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3740602843040177340, 'pos': 'CCONJ', 'prefix': 'o', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xx', 'suffix': 'or', 'ent_type': '', 'dep': 'cc', 'head': 289, 'left_edge': 291, 'right_edge': 291}),id=291), Annotation(757,758,SpaceToken,features=Features({'is_space': True}),id=292), Annotation(758,763,Token,features=Features({'_i': 169, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Farsi', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 1631855311088449743, 'pos': 'PROPN', 'prefix': 'F', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'rsi', 'ent_type': 'PERSON', 'dep': 'compound', 'head': 295, 'left_edge': 293, 'right_edge': 293}),id=293), Annotation(758,763,PERSON,features=Features({'lemma': 'Farsi'}),id=333), Annotation(758,769,NounChunk,features=Features({}),id=383), Annotation(763,764,SpaceToken,features=Features({'is_space': True}),id=294), Annotation(764,769,Token,features=Features({'_i': 170, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'فارسی', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 10192536850399601059, 'pos': 'NOUN', 'prefix': 'ف', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'رسی', 'ent_type': '', 'dep': 'conj', 'head': 289, 'left_edge': 293, 'right_edge': 299}),id=295), Annotation(769,770,SpaceToken,features=Features({'is_space': True}),id=296), Annotation(770,773,Token,features=Features({'_i': 171, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'and', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2283656566040971221, 'pos': 'CCONJ', 'prefix': 'a', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CC', 'shape': 'xxx', 'suffix': 'and', 'ent_type': '', 'dep': 'cc', 'head': 295, 'left_edge': 297, 'right_edge': 297}),id=297), Annotation(773,774,SpaceToken,features=Features({'is_space': True}),id=298), Annotation(774,780,Token,features=Features({'_i': 172, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': True, 'is_upper': False, 'lang': 'en', 'lemma': 'Arabic', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 6558098436421695236, 'pos': 'PROPN', 'prefix': 'A', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NNP', 'shape': 'Xxxxx', 'suffix': 'bic', 'ent_type': 'LANGUAGE', 'dep': 'conj', 'head': 295, 'left_edge': 299, 'right_edge': 299}),id=299), Annotation(774,780,LANGUAGE,features=Features({'lemma': 'Arabic'}),id=334), Annotation(774,780,NounChunk,features=Features({}),id=384), Annotation(780,781,SpaceToken,features=Features({'is_space': True}),id=300), Annotation(781,782,Token,features=Features({'_i': 173, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 301, 'right_edge': 301}),id=301), Annotation(782,797,Token,features=Features({'_i': 174, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'اَلْعَرَبِيَّةُ', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 16849633969024660305, 'pos': 'NUM', 'prefix': 'ا', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'CD', 'shape': 'xَxْxَxَxِxَّxُ', 'suffix': 'ّةُ', 'ent_type': 'GPE', 'dep': 'appos', 'head': 289, 'left_edge': 302, 'right_edge': 302}),id=302), Annotation(782,797,GPE,features=Features({'lemma': 'اَلْعَرَبِيَّةُ'}),id=335), Annotation(797,798,Token,features=Features({'_i': 175, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': ',', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 2593208677638477497, 'pos': 'PUNCT', 'prefix': ',', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': ',', 'shape': ',', 'suffix': ',', 'ent_type': '', 'dep': 'punct', 'head': 289, 'left_edge': 303, 'right_edge': 303}),id=303), Annotation(798,799,SpaceToken,features=Features({'is_space': True}),id=304), Annotation(799,804,Token,features=Features({'_i': 176, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'which', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7063653163634019529, 'pos': 'PRON', 'prefix': 'w', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'WDT', 'shape': 'xxxx', 'suffix': 'ich', 'ent_type': '', 'dep': 'nsubj', 'head': 307, 'left_edge': 305, 'right_edge': 305}),id=305), Annotation(799,804,NounChunk,features=Features({}),id=385), Annotation(804,805,SpaceToken,features=Features({'is_space': True}),id=306), Annotation(805,809,Token,features=Features({'_i': 177, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'go', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 18224404223277189725, 'pos': 'VERB', 'prefix': 'g', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBZ', 'shape': 'xxxx', 'suffix': 'oes', 'ent_type': '', 'dep': 'relcl', 'head': 289, 'left_edge': 305, 'right_edge': 315}),id=307), Annotation(809,810,SpaceToken,features=Features({'is_space': True}),id=308), Annotation(810,814,Token,features=Features({'_i': 178, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'from', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 7831658034963690409, 'pos': 'ADP', 'prefix': 'f', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xxxx', 'suffix': 'rom', 'ent_type': '', 'dep': 'prep', 'head': 307, 'left_edge': 309, 'right_edge': 315}),id=309), Annotation(814,815,SpaceToken,features=Features({'is_space': True}),id=310), Annotation(815,820,Token,features=Features({'_i': 179, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'right', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 5943797630011647483, 'pos': 'NOUN', 'prefix': 'r', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'NN', 'shape': 'xxxx', 'suffix': 'ght', 'ent_type': '', 'dep': 'advmod', 'head': 313, 'left_edge': 311, 'right_edge': 311}),id=311), Annotation(820,821,SpaceToken,features=Features({'is_space': True}),id=312), Annotation(821,823,Token,features=Features({'_i': 180, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': True, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'to', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 3791531372978436496, 'pos': 'AUX', 'prefix': 't', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'IN', 'shape': 'xx', 'suffix': 'to', 'ent_type': '', 'dep': 'aux', 'head': 315, 'left_edge': 311, 'right_edge': 313}),id=313), Annotation(823,824,SpaceToken,features=Features({'is_space': True}),id=314), Annotation(824,828,Token,features=Features({'_i': 181, 'is_alpha': True, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': True, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': 'leave', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 4003250512168697582, 'pos': 'VERB', 'prefix': 'l', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': 'VBN', 'shape': 'xxxx', 'suffix': 'eft', 'ent_type': '', 'dep': 'pcomp', 'head': 309, 'left_edge': 311, 'right_edge': 315}),id=315), Annotation(828,829,Token,features=Features({'_i': 182, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': True, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': False, 'is_space': False, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '.', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 12646065887601541794, 'pos': 'PUNCT', 'prefix': '.', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '.', 'shape': '.', 'suffix': '.', 'ent_type': '', 'dep': 'punct', 'head': 265, 'left_edge': 316, 'right_edge': 316}),id=316), Annotation(829,831,SpaceToken,features=Features({'_i': 183, 'is_alpha': False, 'is_bracket': False, 'is_currency': False, 'is_digit': False, 'is_left_punct': False, 'is_lower': False, 'is_oov': True, 'is_punct': False, 'is_quote': False, 'is_right_punct': False, 'is_sent_start': True, 'is_space': True, 'is_stop': False, 'is_title': False, 'is_upper': False, 'lang': 'en', 'lemma': '\\n\\n', 'like_email': False, 'like_num': False, 'like_url': False, 'orth': 908432558851201422, 'pos': 'SPACE', 'prefix': '\\n', 'prob': -20.0, 'rank': 18446744073709551615, 'sentiment': 0.0, 'tag': '_SP', 'shape': '\\n\\n', 'suffix': '\\n\\n', 'ent_type': '', 'dep': 'dep', 'head': 317, 'left_edge': 317, 'right_edge': 317}),id=317), Annotation(829,831,Sentence,features=Features({}),id=341)]), 'Stanza': AnnotationSet([Annotation(0,4,Token,features=Features({'text': 'This', 'lemma': 'this', 'upos': 'PRON', 'xpos': 'DT', 'Number': 'Sing', 'PronType': 'Dem', 'head': 4, 'deprel': 'nsubj', 'ner': 'O'}),id=0), Annotation(0,24,Sentence,features=Features({}),id=6), Annotation(5,7,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 4, 'deprel': 'cop', 'ner': 'O'}),id=1), Annotation(8,9,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 4, 'deprel': 'det', 'ner': 'O'}),id=2), Annotation(10,14,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 4, 'deprel': 'compound', 'ner': 'O'}),id=3), Annotation(15,23,Token,features=Features({'text': 'document', 'lemma': 'document', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 6, 'deprel': 'root', 'ner': 'O'}),id=4), Annotation(23,24,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 4, 'deprel': 'punct', 'ner': 'O'}),id=5), Annotation(26,28,Token,features=Features({'text': 'It', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 8, 'deprel': 'nsubj', 'ner': 'O'}),id=7), Annotation(26,59,Sentence,features=Features({}),id=14), Annotation(29,37,Token,features=Features({'text': 'contains', 'lemma': 'contain', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 14, 'deprel': 'root', 'ner': 'O'}),id=8), Annotation(38,42,Token,features=Features({'text': 'just', 'lemma': 'just', 'upos': 'ADV', 'xpos': 'RB', 'head': 12, 'deprel': 'advmod', 'ner': 'O'}),id=9), Annotation(43,44,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 12, 'deprel': 'det', 'ner': 'O'}),id=10), Annotation(45,48,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 12, 'deprel': 'amod', 'ner': 'O'}),id=11), Annotation(49,58,Token,features=Features({'text': 'sentences', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 8, 'deprel': 'obj', 'ner': 'O'}),id=12), Annotation(58,59,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 8, 'deprel': 'punct', 'ner': 'O'}),id=13), Annotation(61,65,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 54, 'deprel': 'root', 'ner': 'O'}),id=15), Annotation(61,266,Sentence,features=Features({}),id=54), Annotation(66,68,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 15, 'deprel': 'cop', 'ner': 'O'}),id=16), Annotation(69,70,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 18, 'deprel': 'det', 'ner': 'O'}),id=17), Annotation(71,79,Token,features=Features({'text': 'sentence', 'lemma': 'sentence', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 15, 'deprel': 'nsubj', 'ner': 'O'}),id=18), Annotation(80,84,Token,features=Features({'text': 'that', 'lemma': 'that', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 20, 'deprel': 'nsubj', 'ner': 'O'}),id=19), Annotation(85,93,Token,features=Features({'text': 'mentions', 'lemma': 'mention', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 18, 'deprel': 'acl:relcl', 'ner': 'O'}),id=20), Annotation(94,95,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 24, 'deprel': 'det', 'ner': 'O'}),id=21), Annotation(96,99,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=22), Annotation(100,105,Token,features=Features({'text': 'named', 'lemma': 'name', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 24, 'deprel': 'amod', 'ner': 'O'}),id=23), Annotation(106,114,Token,features=Features({'text': 'entities', 'lemma': 'entity', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 20, 'deprel': 'obj', 'ner': 'O'}),id=24), Annotation(115,119,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 27, 'deprel': 'case', 'ner': 'O'}),id=25), Annotation(121,124,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 27, 'deprel': 'det', 'ner': 'O'}),id=26), Annotation(125,132,Token,features=Features({'text': 'persons', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 24, 'deprel': 'nmod', 'ner': 'O'}),id=27), Annotation(133,139,Token,features=Features({'text': 'Barack', 'lemma': 'Barack', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 27, 'deprel': 'appos', 'ner': 'B-PERSON'}),id=28), Annotation(133,145,PERSON,features=Features({}),id=175), Annotation(140,145,Token,features=Features({'text': 'Obama', 'lemma': 'Obama', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=29), Annotation(146,148,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 31, 'deprel': 'cc', 'ner': 'O'}),id=30), Annotation(149,155,Token,features=Features({'text': 'Ursula', 'lemma': 'Ursula', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 28, 'deprel': 'conj', 'ner': 'B-PERSON'}),id=31), Annotation(149,169,PERSON,features=Features({}),id=176), Annotation(156,159,Token,features=Features({'text': 'von', 'lemma': 'von', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=32), Annotation(160,163,Token,features=Features({'text': 'der', 'lemma': 'der', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'I-PERSON'}),id=33), Annotation(164,169,Token,features=Features({'text': 'Leyen', 'lemma': 'Leyen', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 31, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=34), Annotation(169,170,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 36, 'deprel': 'punct', 'ner': 'O'}),id=35), Annotation(171,180,Token,features=Features({'text': 'locations', 'lemma': 'location', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 27, 'deprel': 'conj', 'ner': 'O'}),id=36), Annotation(181,185,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 40, 'deprel': 'case', 'ner': 'O'}),id=37), Annotation(186,189,Token,features=Features({'text': 'New', 'lemma': 'New', 'upos': 'ADJ', 'xpos': 'NNP', 'Degree': 'Pos', 'head': 40, 'deprel': 'amod', 'ner': 'B-GPE'}),id=38), Annotation(186,199,GPE,features=Features({}),id=177), Annotation(190,194,Token,features=Features({'text': 'York', 'lemma': 'York', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'compound', 'ner': 'I-GPE'}),id=39), Annotation(195,199,Token,features=Features({'text': 'City', 'lemma': 'City', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 36, 'deprel': 'nmod', 'ner': 'E-GPE'}),id=40), Annotation(199,200,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 42, 'deprel': 'punct', 'ner': 'O'}),id=41), Annotation(201,207,Token,features=Features({'text': 'Vienna', 'lemma': 'Vienna', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=42), Annotation(201,207,GPE,features=Features({}),id=178), Annotation(208,210,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 44, 'deprel': 'cc', 'ner': 'O'}),id=43), Annotation(211,218,Token,features=Features({'text': 'Beijing', 'lemma': 'Beijing', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 40, 'deprel': 'conj', 'ner': 'S-GPE'}),id=44), Annotation(211,218,GPE,features=Features({}),id=179), Annotation(219,221,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 46, 'deprel': 'cc', 'ner': 'O'}),id=45), Annotation(222,231,Token,features=Features({'text': 'companies', 'lemma': 'company', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 40, 'deprel': 'conj', 'ner': 'O'}),id=46), Annotation(232,236,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 48, 'deprel': 'case', 'ner': 'O'}),id=47), Annotation(238,244,Token,features=Features({'text': 'Google', 'lemma': 'Google', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 46, 'deprel': 'nmod', 'ner': 'S-ORG'}),id=48), Annotation(238,244,ORG,features=Features({}),id=180), Annotation(244,245,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 50, 'deprel': 'punct', 'ner': 'O'}),id=49), Annotation(246,255,Token,features=Features({'text': 'UniCredit', 'lemma': 'UniCredit', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=50), Annotation(246,255,ORG,features=Features({}),id=181), Annotation(256,258,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 52, 'deprel': 'cc', 'ner': 'O'}),id=51), Annotation(259,265,Token,features=Features({'text': 'Huawei', 'lemma': 'Huawei', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 48, 'deprel': 'conj', 'ner': 'S-ORG'}),id=52), Annotation(259,265,ORG,features=Features({}),id=182), Annotation(265,266,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 15, 'deprel': 'punct', 'ner': 'O'}),id=53), Annotation(267,270,Token,features=Features({'text': 'And', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 67, 'deprel': 'cc', 'ner': 'O'}),id=55), Annotation(267,322,Sentence,features=Features({}),id=69), Annotation(271,275,Token,features=Features({'text': 'here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 69, 'deprel': 'root', 'ner': 'O'}),id=56), Annotation(276,278,Token,features=Features({'text': 'is', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 56, 'deprel': 'cop', 'ner': 'O'}),id=57), Annotation(279,285,Token,features=Features({'text': 'Donald', 'lemma': 'Donald', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 56, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=58), Annotation(279,291,PERSON,features=Features({}),id=183), Annotation(286,291,Token,features=Features({'text': 'Trump', 'lemma': 'Trump', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 58, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=59), Annotation(291,292,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 56, 'deprel': 'punct', 'ner': 'O'}),id=60), Annotation(293,295,Token,features=Features({'text': 'it', 'lemma': 'it', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Gender': 'Neut', 'Number': 'Sing', 'Person': '3', 'PronType': 'Prs', 'head': 67, 'deprel': 'nsubj', 'ner': 'O'}),id=61), Annotation(296,299,Token,features=Features({'text': 'may', 'lemma': 'may', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 67, 'deprel': 'aux', 'ner': 'O'}),id=62), Annotation(300,303,Token,features=Features({'text': 'not', 'lemma': 'not', 'upos': 'PART', 'xpos': 'RB', 'head': 67, 'deprel': 'advmod', 'ner': 'O'}),id=63), Annotation(304,306,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 67, 'deprel': 'cop', 'ner': 'O'}),id=64), Annotation(307,310,Token,features=Features({'text': 'the', 'lemma': 'the', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Def', 'PronType': 'Art', 'head': 67, 'deprel': 'det', 'ner': 'O'}),id=65), Annotation(311,315,Token,features=Features({'text': 'real', 'lemma': 'real', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 67, 'deprel': 'amod', 'ner': 'O'}),id=66), Annotation(316,319,Token,features=Features({'text': 'one', 'lemma': 'one', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 56, 'deprel': 'parataxis', 'ner': 'O'}),id=67), Annotation(320,322,Token,features=Features({'text': ':P', 'lemma': ':p', 'upos': 'SYM', 'xpos': 'NFP', 'head': 56, 'deprel': 'discourse', 'ner': 'O'}),id=68), Annotation(324,328,Token,features=Features({'text': 'Lets', 'lemma': 'let', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 89, 'deprel': 'root', 'ner': 'O'}),id=70), Annotation(324,422,Sentence,features=Features({}),id=89), Annotation(329,332,Token,features=Features({'text': 'say', 'lemma': 'say', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 70, 'deprel': 'xcomp', 'ner': 'O'}),id=71), Annotation(333,338,Token,features=Features({'text': 'Boris', 'lemma': 'Boris', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 84, 'deprel': 'nsubj', 'ner': 'B-PERSON'}),id=72), Annotation(333,346,PERSON,features=Features({}),id=184), Annotation(339,346,Token,features=Features({'text': 'Johnson', 'lemma': 'Johnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 72, 'deprel': 'flat', 'ner': 'E-PERSON'}),id=73), Annotation(347,350,Token,features=Features({'text': 'aka', 'lemma': 'aka', 'upos': 'ADP', 'xpos': 'IN', 'head': 76, 'deprel': 'case', 'ner': 'O'}),id=74), Annotation(351,355,Token,features=Features({'text': 'Bojo', 'lemma': 'Bojo', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 76, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=75), Annotation(351,355,PERSON,features=Features({}),id=185), Annotation(356,362,Token,features=Features({'text': 'tweets', 'lemma': 'tweet', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 72, 'deprel': 'nmod', 'ner': 'O'}),id=76), Annotation(363,367,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 80, 'deprel': 'case', 'ner': 'O'}),id=77), Annotation(368,371,Token,features=Features({'text': 'his', 'lemma': 'he', 'upos': 'PRON', 'xpos': 'PRP$', 'Gender': 'Masc', 'Number': 'Sing', 'Person': '3', 'Poss': 'Yes', 'PronType': 'Prs', 'head': 80, 'deprel': 'nmod:poss', 'ner': 'O'}),id=78), Annotation(372,384,Token,features=Features({'text': 'BorisJohnson', 'lemma': 'BorisJohnson', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 80, 'deprel': 'compound', 'ner': 'S-PERSON'}),id=79), Annotation(372,384,PERSON,features=Features({}),id=186), Annotation(385,392,Token,features=Features({'text': 'account', 'lemma': 'account', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 76, 'deprel': 'nmod', 'ner': 'O'}),id=80), Annotation(392,393,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 84, 'deprel': 'punct', 'ner': 'O'}),id=81), Annotation(394,399,Token,features=Features({'text': 'would', 'lemma': 'would', 'upos': 'AUX', 'xpos': 'MD', 'VerbForm': 'Fin', 'head': 84, 'deprel': 'aux', 'ner': 'O'}),id=82), Annotation(400,402,Token,features=Features({'text': 'be', 'lemma': 'be', 'upos': 'AUX', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'cop', 'ner': 'O'}),id=83), Annotation(403,407,Token,features=Features({'text': 'nice', 'lemma': 'nice', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 70, 'deprel': 'parataxis', 'ner': 'O'}),id=84), Annotation(408,410,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'PART', 'xpos': 'TO', 'head': 86, 'deprel': 'mark', 'ner': 'O'}),id=85), Annotation(411,416,Token,features=Features({'text': 'match', 'lemma': 'match', 'upos': 'VERB', 'xpos': 'VB', 'VerbForm': 'Inf', 'head': 84, 'deprel': 'advcl', 'ner': 'O'}),id=86), Annotation(417,421,Token,features=Features({'text': 'them', 'lemma': 'they', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Acc', 'Number': 'Plur', 'Person': '3', 'PronType': 'Prs', 'head': 86, 'deprel': 'obj', 'ner': 'O'}),id=87), Annotation(421,422,Token,features=Features({'text': '!', 'lemma': '!', 'upos': 'PUNCT', 'xpos': '.', 'head': 70, 'deprel': 'punct', 'ner': 'O'}),id=88), Annotation(424,428,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 92, 'deprel': 'advmod', 'ner': 'O'}),id=90), Annotation(424,675,Sentence,features=Features({}),id=143), Annotation(429,431,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 92, 'deprel': 'nsubj', 'ner': 'O'}),id=91), Annotation(432,439,Token,features=Features({'text': 'include', 'lemma': 'include', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 143, 'deprel': 'root', 'ner': 'O'}),id=92), Annotation(440,441,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 94, 'deprel': 'det', 'ner': 'O'}),id=93), Annotation(442,445,Token,features=Features({'text': 'URL', 'lemma': 'URL', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 92, 'deprel': 'obj', 'ner': 'O'}),id=94), Annotation(446,487,Token,features=Features({'text': 'https://gatenlp.github.io/python-gatenlp/', 'lemma': 'https://gatenlp.github.io/python-gatenlp/', 'upos': 'X', 'xpos': 'ADD', 'head': 94, 'deprel': 'appos', 'ner': 'O'}),id=95), Annotation(489,492,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 100, 'deprel': 'cc', 'ner': 'O'}),id=96), Annotation(493,494,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 100, 'deprel': 'det', 'ner': 'O'}),id=97), Annotation(495,499,Token,features=Features({'text': 'fake', 'lemma': 'fake', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 100, 'deprel': 'amod', 'ner': 'O'}),id=98), Annotation(500,505,Token,features=Features({'text': 'email', 'lemma': 'email', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 100, 'deprel': 'compound', 'ner': 'O'}),id=99), Annotation(506,513,Token,features=Features({'text': 'address', 'lemma': 'address', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=100), Annotation(514,540,Token,features=Features({'text': 'john.doe@hiscoolserver.com', 'lemma': 'john.doe@hiscoolserver.com', 'upos': 'X', 'xpos': 'ADD', 'head': 100, 'deprel': 'appos', 'ner': 'O'}),id=101), Annotation(541,543,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADV', 'xpos': 'RB', 'head': 110, 'deprel': 'cc', 'ner': 'O'}),id=102), Annotation(544,548,Token,features=Features({'text': 'well', 'lemma': 'well', 'upos': 'ADV', 'xpos': 'RB', 'Degree': 'Pos', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=103), Annotation(550,552,Token,features=Features({'text': 'as', 'lemma': 'as', 'upos': 'ADP', 'xpos': 'IN', 'head': 102, 'deprel': 'fixed', 'ner': 'O'}),id=104), Annotation(553,554,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=105), Annotation(554,558,Token,features=Features({'text': 'some', 'lemma': 'some', 'upos': 'DET', 'xpos': 'DT', 'head': 110, 'deprel': 'det', 'ner': 'O'}),id=106), Annotation(559,560,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=107), Annotation(560,564,Token,features=Features({'text': 'cool', 'lemma': 'cool', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 110, 'deprel': 'amod', 'ner': 'O'}),id=108), Annotation(565,566,Token,features=Features({'text': '#', 'lemma': '#', 'upos': 'SYM', 'xpos': 'NN', 'Number': 'Sing', 'head': 110, 'deprel': 'compound', 'ner': 'O'}),id=109), Annotation(566,573,Token,features=Features({'text': 'hastags', 'lemma': 'hastag', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=110), Annotation(574,577,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 113, 'deprel': 'cc', 'ner': 'O'}),id=111), Annotation(578,579,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 113, 'deprel': 'det', 'ner': 'O'}),id=112), Annotation(580,585,Token,features=Features({'text': 'bunch', 'lemma': 'bunch', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 94, 'deprel': 'conj', 'ner': 'O'}),id=113), Annotation(586,588,Token,features=Features({'text': 'of', 'lemma': 'of', 'upos': 'ADP', 'xpos': 'IN', 'head': 115, 'deprel': 'case', 'ner': 'O'}),id=114), Annotation(589,595,Token,features=Features({'text': 'emojis', 'lemma': 'emojis', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 113, 'deprel': 'nmod', 'ner': 'O'}),id=115), Annotation(596,600,Token,features=Features({'text': 'like', 'lemma': 'like', 'upos': 'ADP', 'xpos': 'IN', 'head': 117, 'deprel': 'case', 'ner': 'O'}),id=116), Annotation(601,602,Token,features=Features({'text': '😽', 'lemma': '😽', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 115, 'deprel': 'nmod', 'ner': 'O'}),id=117), Annotation(603,604,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=118), Annotation(604,605,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 121, 'deprel': 'det', 'ner': 'O'}),id=119), Annotation(606,613,Token,features=Features({'text': 'kissing', 'lemma': 'kissing', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 121, 'deprel': 'compound', 'ner': 'O'}),id=120), Annotation(614,617,Token,features=Features({'text': 'cat', 'lemma': 'cat', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'appos', 'ner': 'O'}),id=121), Annotation(617,618,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 121, 'deprel': 'punct', 'ner': 'O'}),id=122), Annotation(618,619,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 124, 'deprel': 'punct', 'ner': 'O'}),id=123), Annotation(620,623,Token,features=Features({'text': '👩\\u200d🏫', 'lemma': '👩\\u200d🏫', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=124), Annotation(624,625,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=125), Annotation(625,626,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 128, 'deprel': 'det', 'ner': 'O'}),id=126), Annotation(627,632,Token,features=Features({'text': 'woman', 'lemma': 'woman', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 128, 'deprel': 'compound', 'ner': 'O'}),id=127), Annotation(633,640,Token,features=Features({'text': 'teacher', 'lemma': 'teacher', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 124, 'deprel': 'appos', 'ner': 'O'}),id=128), Annotation(640,641,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 128, 'deprel': 'punct', 'ner': 'O'}),id=129), Annotation(641,642,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 131, 'deprel': 'punct', 'ner': 'O'}),id=130), Annotation(643,644,Token,features=Features({'text': '🧬', 'lemma': '🧬', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=131), Annotation(645,646,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=132), Annotation(646,649,Token,features=Features({'text': 'DNA', 'lemma': 'dna', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 131, 'deprel': 'appos', 'ner': 'O'}),id=133), Annotation(649,650,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 133, 'deprel': 'punct', 'ner': 'O'}),id=134), Annotation(650,651,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 136, 'deprel': 'punct', 'ner': 'O'}),id=135), Annotation(653,654,Token,features=Features({'text': '🧗', 'lemma': '🧗', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 117, 'deprel': 'conj', 'ner': 'O'}),id=136), Annotation(655,656,Token,features=Features({'text': '(', 'lemma': '(', 'upos': 'PUNCT', 'xpos': '-LRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=137), Annotation(656,657,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 139, 'deprel': 'det', 'ner': 'O'}),id=138), Annotation(658,664,Token,features=Features({'text': 'person', 'lemma': 'person', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 140, 'deprel': 'nsubj', 'ner': 'O'}),id=139), Annotation(665,673,Token,features=Features({'text': 'climbing', 'lemma': 'climb', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 136, 'deprel': 'appos', 'ner': 'O'}),id=140), Annotation(673,674,Token,features=Features({'text': ')', 'lemma': ')', 'upos': 'PUNCT', 'xpos': '-RRB-', 'head': 140, 'deprel': 'punct', 'ner': 'O'}),id=141), Annotation(674,675,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 92, 'deprel': 'punct', 'ner': 'O'}),id=142), Annotation(678,682,Token,features=Features({'text': 'Here', 'lemma': 'here', 'upos': 'ADV', 'xpos': 'RB', 'PronType': 'Dem', 'head': 146, 'deprel': 'advmod', 'ner': 'O'}),id=144), Annotation(678,829,Sentence,features=Features({}),id=174), Annotation(683,685,Token,features=Features({'text': 'we', 'lemma': 'we', 'upos': 'PRON', 'xpos': 'PRP', 'Case': 'Nom', 'Number': 'Plur', 'Person': '1', 'PronType': 'Prs', 'head': 146, 'deprel': 'nsubj', 'ner': 'O'}),id=145), Annotation(686,690,Token,features=Features({'text': 'test', 'lemma': 'test', 'upos': 'VERB', 'xpos': 'VBP', 'Mood': 'Ind', 'Number': 'Plur', 'Person': '1', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 174, 'deprel': 'root', 'ner': 'O'}),id=146), Annotation(691,692,Token,features=Features({'text': 'a', 'lemma': 'a', 'upos': 'DET', 'xpos': 'DT', 'Definite': 'Ind', 'PronType': 'Art', 'head': 150, 'deprel': 'det', 'ner': 'O'}),id=147), Annotation(693,696,Token,features=Features({'text': 'few', 'lemma': 'few', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=148), Annotation(697,706,Token,features=Features({'text': 'different', 'lemma': 'different', 'upos': 'ADJ', 'xpos': 'JJ', 'Degree': 'Pos', 'head': 150, 'deprel': 'amod', 'ner': 'O'}),id=149), Annotation(707,714,Token,features=Features({'text': 'scripts', 'lemma': 'script', 'upos': 'NOUN', 'xpos': 'NNS', 'Number': 'Plur', 'head': 146, 'deprel': 'obj', 'ner': 'O'}),id=150), Annotation(714,715,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 153, 'deprel': 'punct', 'ner': 'O'}),id=151), Annotation(716,720,Token,features=Features({'text': 'e.g.', 'lemma': 'e.g.', 'upos': 'ADV', 'xpos': 'FW', 'Abbr': 'Yes', 'head': 153, 'deprel': 'advmod', 'ner': 'O'}),id=152), Annotation(721,727,Token,features=Features({'text': 'Hangul', 'lemma': 'Hangul', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'compound', 'ner': 'O'}),id=153), Annotation(728,730,Token,features=Features({'text': '한글', 'lemma': '한글', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=154), Annotation(731,733,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 158, 'deprel': 'cc', 'ner': 'O'}),id=155), Annotation(735,745,Token,features=Features({'text': 'simplified', 'lemma': 'simplify', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 158, 'deprel': 'amod', 'ner': 'O'}),id=156), Annotation(746,751,Token,features=Features({'text': 'Hanzi', 'lemma': 'Hanzi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 158, 'deprel': 'compound', 'ner': 'O'}),id=157), Annotation(752,754,Token,features=Features({'text': '汉字', 'lemma': '汉字', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=158), Annotation(755,757,Token,features=Features({'text': 'or', 'lemma': 'or', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 161, 'deprel': 'cc', 'ner': 'O'}),id=159), Annotation(758,763,Token,features=Features({'text': 'Farsi', 'lemma': 'Farsi', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'compound', 'ner': 'S-LANGUAGE'}),id=160), Annotation(758,763,LANGUAGE,features=Features({}),id=187), Annotation(764,769,Token,features=Features({'text': 'فارسی', 'lemma': 'فارسی', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 154, 'deprel': 'conj', 'ner': 'O'}),id=161), Annotation(770,773,Token,features=Features({'text': 'and', 'lemma': 'and', 'upos': 'CCONJ', 'xpos': 'CC', 'head': 163, 'deprel': 'cc', 'ner': 'O'}),id=162), Annotation(774,780,Token,features=Features({'text': 'Arabic', 'lemma': 'Arabic', 'upos': 'PROPN', 'xpos': 'NNP', 'Number': 'Sing', 'head': 161, 'deprel': 'conj', 'ner': 'S-LANGUAGE'}),id=163), Annotation(774,780,LANGUAGE,features=Features({}),id=188), Annotation(781,782,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 165, 'deprel': 'punct', 'ner': 'O'}),id=164), Annotation(782,797,Token,features=Features({'text': 'اَلْعَرَبِيَّةُ', 'lemma': 'gamommamgeramam', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 150, 'deprel': 'appos', 'ner': 'O'}),id=165), Annotation(797,798,Token,features=Features({'text': ',', 'lemma': ',', 'upos': 'PUNCT', 'xpos': ',', 'head': 168, 'deprel': 'punct', 'ner': 'O'}),id=166), Annotation(799,804,Token,features=Features({'text': 'which', 'lemma': 'which', 'upos': 'PRON', 'xpos': 'WDT', 'PronType': 'Rel', 'head': 168, 'deprel': 'nsubj', 'ner': 'O'}),id=167), Annotation(805,809,Token,features=Features({'text': 'goes', 'lemma': 'go', 'upos': 'VERB', 'xpos': 'VBZ', 'Mood': 'Ind', 'Number': 'Sing', 'Person': '3', 'Tense': 'Pres', 'VerbForm': 'Fin', 'head': 150, 'deprel': 'acl:relcl', 'ner': 'O'}),id=168), Annotation(810,814,Token,features=Features({'text': 'from', 'lemma': 'from', 'upos': 'ADP', 'xpos': 'IN', 'head': 170, 'deprel': 'case', 'ner': 'O'}),id=169), Annotation(815,820,Token,features=Features({'text': 'right', 'lemma': 'right', 'upos': 'NOUN', 'xpos': 'NN', 'Number': 'Sing', 'head': 168, 'deprel': 'obl', 'ner': 'O'}),id=170), Annotation(821,823,Token,features=Features({'text': 'to', 'lemma': 'to', 'upos': 'ADP', 'xpos': 'IN', 'head': 172, 'deprel': 'case', 'ner': 'O'}),id=171), Annotation(824,828,Token,features=Features({'text': 'left', 'lemma': 'leave', 'upos': 'VERB', 'xpos': 'VBN', 'Tense': 'Past', 'VerbForm': 'Part', 'head': 168, 'deprel': 'advcl', 'ner': 'O'}),id=172), Annotation(828,829,Token,features=Features({'text': '.', 'lemma': '.', 'upos': 'PUNCT', 'xpos': '.', 'head': 146, 'deprel': 'punct', 'ner': 'O'}),id=173)]), '': AnnotationSet([Annotation(0,4,Token,features=Features({}),id=0), Annotation(5,7,Token,features=Features({}),id=1), Annotation(8,9,Token,features=Features({}),id=2), Annotation(10,14,Token,features=Features({}),id=3), Annotation(15,24,Token,features=Features({}),id=4), Annotation(26,28,Token,features=Features({}),id=5), Annotation(29,37,Token,features=Features({}),id=6), Annotation(38,42,Token,features=Features({}),id=7), Annotation(43,44,Token,features=Features({}),id=8), Annotation(45,48,Token,features=Features({}),id=9), Annotation(49,59,Token,features=Features({}),id=10), Annotation(61,65,Token,features=Features({}),id=11), Annotation(66,68,Token,features=Features({}),id=12), Annotation(69,70,Token,features=Features({}),id=13), Annotation(71,79,Token,features=Features({}),id=14), Annotation(80,84,Token,features=Features({}),id=15), Annotation(85,93,Token,features=Features({}),id=16), Annotation(94,95,Token,features=Features({}),id=17), Annotation(96,99,Token,features=Features({}),id=18), Annotation(100,105,Token,features=Features({}),id=19), Annotation(106,114,Token,features=Features({}),id=20), Annotation(115,119,Token,features=Features({}),id=21), Annotation(121,124,Token,features=Features({}),id=22), Annotation(125,132,Token,features=Features({}),id=23), Annotation(133,139,Token,features=Features({}),id=24), Annotation(140,145,Token,features=Features({}),id=25), Annotation(146,148,Token,features=Features({}),id=26), Annotation(149,155,Token,features=Features({}),id=27), Annotation(156,159,Token,features=Features({}),id=28), Annotation(160,163,Token,features=Features({}),id=29), Annotation(164,169,Token,features=Features({}),id=30), Annotation(169,170,Token,features=Features({}),id=31), Annotation(171,180,Token,features=Features({}),id=32), Annotation(181,185,Token,features=Features({}),id=33), Annotation(186,189,Token,features=Features({}),id=34), Annotation(190,194,Token,features=Features({}),id=35), Annotation(195,199,Token,features=Features({}),id=36), Annotation(199,200,Token,features=Features({}),id=37), Annotation(201,207,Token,features=Features({}),id=38), Annotation(208,210,Token,features=Features({}),id=39), Annotation(211,218,Token,features=Features({}),id=40), Annotation(219,221,Token,features=Features({}),id=41), Annotation(222,231,Token,features=Features({}),id=42), Annotation(232,236,Token,features=Features({}),id=43), Annotation(238,244,Token,features=Features({}),id=44), Annotation(244,245,Token,features=Features({}),id=45), Annotation(246,255,Token,features=Features({}),id=46), Annotation(256,258,Token,features=Features({}),id=47), Annotation(259,266,Token,features=Features({}),id=48), Annotation(267,270,Token,features=Features({}),id=49), Annotation(271,275,Token,features=Features({}),id=50), Annotation(276,278,Token,features=Features({}),id=51), Annotation(279,285,Token,features=Features({}),id=52), Annotation(286,291,Token,features=Features({}),id=53), Annotation(291,292,Token,features=Features({}),id=54), Annotation(293,295,Token,features=Features({}),id=55), Annotation(296,299,Token,features=Features({}),id=56), Annotation(300,303,Token,features=Features({}),id=57), Annotation(304,306,Token,features=Features({}),id=58), Annotation(307,310,Token,features=Features({}),id=59), Annotation(311,315,Token,features=Features({}),id=60), Annotation(316,319,Token,features=Features({}),id=61), Annotation(320,321,Token,features=Features({}),id=62), Annotation(321,322,Token,features=Features({}),id=63), Annotation(324,328,Token,features=Features({}),id=64), Annotation(329,332,Token,features=Features({}),id=65), Annotation(333,338,Token,features=Features({}),id=66), Annotation(339,346,Token,features=Features({}),id=67), Annotation(347,350,Token,features=Features({}),id=68), Annotation(351,355,Token,features=Features({}),id=69), Annotation(356,362,Token,features=Features({}),id=70), Annotation(363,367,Token,features=Features({}),id=71), Annotation(368,371,Token,features=Features({}),id=72), Annotation(372,384,Token,features=Features({}),id=73), Annotation(385,392,Token,features=Features({}),id=74), Annotation(392,393,Token,features=Features({}),id=75), Annotation(394,399,Token,features=Features({}),id=76), Annotation(400,402,Token,features=Features({}),id=77), Annotation(403,407,Token,features=Features({}),id=78), Annotation(408,410,Token,features=Features({}),id=79), Annotation(411,416,Token,features=Features({}),id=80), Annotation(417,421,Token,features=Features({}),id=81), Annotation(421,422,Token,features=Features({}),id=82), Annotation(424,428,Token,features=Features({}),id=83), Annotation(429,431,Token,features=Features({}),id=84), Annotation(432,439,Token,features=Features({}),id=85), Annotation(440,441,Token,features=Features({}),id=86), Annotation(442,445,Token,features=Features({}),id=87), Annotation(446,451,Token,features=Features({}),id=88), Annotation(451,452,Token,features=Features({}),id=89), Annotation(452,487,Token,features=Features({}),id=90), Annotation(489,492,Token,features=Features({}),id=91), Annotation(493,494,Token,features=Features({}),id=92), Annotation(495,499,Token,features=Features({}),id=93), Annotation(500,505,Token,features=Features({}),id=94), Annotation(506,513,Token,features=Features({}),id=95), Annotation(514,522,Token,features=Features({}),id=96), Annotation(522,523,Token,features=Features({}),id=97), Annotation(523,540,Token,features=Features({}),id=98), Annotation(541,543,Token,features=Features({}),id=99), Annotation(544,548,Token,features=Features({}),id=100), Annotation(550,552,Token,features=Features({}),id=101), Annotation(553,554,Token,features=Features({}),id=102), Annotation(554,558,Token,features=Features({}),id=103), Annotation(559,560,Token,features=Features({}),id=104), Annotation(560,564,Token,features=Features({}),id=105), Annotation(565,566,Token,features=Features({}),id=106), Annotation(566,573,Token,features=Features({}),id=107), Annotation(574,577,Token,features=Features({}),id=108), Annotation(578,579,Token,features=Features({}),id=109), Annotation(580,585,Token,features=Features({}),id=110), Annotation(586,588,Token,features=Features({}),id=111), Annotation(589,595,Token,features=Features({}),id=112), Annotation(596,600,Token,features=Features({}),id=113), Annotation(601,602,Token,features=Features({}),id=114), Annotation(603,604,Token,features=Features({}),id=115), Annotation(604,605,Token,features=Features({}),id=116), Annotation(606,613,Token,features=Features({}),id=117), Annotation(614,617,Token,features=Features({}),id=118), Annotation(617,618,Token,features=Features({}),id=119), Annotation(618,619,Token,features=Features({}),id=120), Annotation(620,623,Token,features=Features({}),id=121), Annotation(624,625,Token,features=Features({}),id=122), Annotation(625,626,Token,features=Features({}),id=123), Annotation(627,632,Token,features=Features({}),id=124), Annotation(633,640,Token,features=Features({}),id=125), Annotation(640,641,Token,features=Features({}),id=126), Annotation(641,642,Token,features=Features({}),id=127), Annotation(643,644,Token,features=Features({}),id=128), Annotation(645,646,Token,features=Features({}),id=129), Annotation(646,649,Token,features=Features({}),id=130), Annotation(649,650,Token,features=Features({}),id=131), Annotation(650,651,Token,features=Features({}),id=132), Annotation(653,654,Token,features=Features({}),id=133), Annotation(655,656,Token,features=Features({}),id=134), Annotation(656,657,Token,features=Features({}),id=135), Annotation(658,664,Token,features=Features({}),id=136), Annotation(665,673,Token,features=Features({}),id=137), Annotation(673,674,Token,features=Features({}),id=138), Annotation(674,675,Token,features=Features({}),id=139), Annotation(678,682,Token,features=Features({}),id=140), Annotation(683,685,Token,features=Features({}),id=141), Annotation(686,690,Token,features=Features({}),id=142), Annotation(691,692,Token,features=Features({}),id=143), Annotation(693,696,Token,features=Features({}),id=144), Annotation(697,706,Token,features=Features({}),id=145), Annotation(707,714,Token,features=Features({}),id=146), Annotation(714,715,Token,features=Features({}),id=147), Annotation(716,720,Token,features=Features({}),id=148), Annotation(721,727,Token,features=Features({}),id=149), Annotation(728,730,Token,features=Features({}),id=150), Annotation(731,733,Token,features=Features({}),id=151), Annotation(735,745,Token,features=Features({}),id=152), Annotation(746,751,Token,features=Features({}),id=153), Annotation(752,754,Token,features=Features({}),id=154), Annotation(755,757,Token,features=Features({}),id=155), Annotation(758,763,Token,features=Features({}),id=156), Annotation(764,769,Token,features=Features({}),id=157), Annotation(770,773,Token,features=Features({}),id=158), Annotation(774,780,Token,features=Features({}),id=159), Annotation(781,782,Token,features=Features({}),id=160), Annotation(782,797,Token,features=Features({}),id=161), Annotation(797,798,Token,features=Features({}),id=162), Annotation(799,804,Token,features=Features({}),id=163), Annotation(805,809,Token,features=Features({}),id=164), Annotation(810,814,Token,features=Features({}),id=165), Annotation(815,820,Token,features=Features({}),id=166), Annotation(821,823,Token,features=Features({}),id=167), Annotation(824,828,Token,features=Features({}),id=168), Annotation(828,829,Token,features=Features({}),id=169)]), 'TGaz1': AnnotationSet([Annotation(238,244,Lookup,features=Features({'what': 'company', 'country': 'Everywhere, really!'}),id=3), Annotation(279,291,Lookup,features=Features({'what': 'person', 'country': 'US'}),id=4), Annotation(333,346,Lookup,features=Features({'what': 'person', 'country': 'UK'}),id=5)]), 'TGaz2': AnnotationSet([Annotation(238,244,Lookup,features=Features({'where': 'Everywhere, really!'}),id=0), Annotation(279,291,Lookup,features=Features({'what': 'person', 'country': 'US'}),id=1), Annotation(333,346,Lookup,features=Features({'what': 'person', 'country': 'UK'}),id=2)]), 'Pampac1': AnnotationSet([Annotation(133,139,PersOrOrg,features=Features({}),id=0), Annotation(140,145,PersOrOrg,features=Features({}),id=1), Annotation(149,155,PersOrOrg,features=Features({}),id=2), Annotation(156,159,PersOrOrg,features=Features({}),id=3), Annotation(160,163,PersOrOrg,features=Features({}),id=4), Annotation(164,169,PersOrOrg,features=Features({}),id=5), Annotation(238,244,PersOrOrg,features=Features({}),id=6), Annotation(246,255,PersOrOrg,features=Features({}),id=7), Annotation(259,265,PersOrOrg,features=Features({}),id=8), Annotation(279,285,PersOrOrg,features=Features({}),id=9), Annotation(286,291,PersOrOrg,features=Features({}),id=10), Annotation(333,338,PersOrOrg,features=Features({}),id=11), Annotation(339,346,PersOrOrg,features=Features({}),id=12), Annotation(351,355,PersOrOrg,features=Features({}),id=13), Annotation(372,384,PersOrOrg,features=Features({}),id=14)]), 'Pampac2': AnnotationSet([Annotation(133,169,PROPNSEQ,features=Features({}),id=0), Annotation(190,218,PROPNSEQ,features=Features({}),id=1), Annotation(238,265,PROPNSEQ,features=Features({}),id=2), Annotation(279,291,PROPNSEQ,features=Features({}),id=3), Annotation(333,355,PROPNSEQ,features=Features({}),id=4), Annotation(721,730,PROPNSEQ,features=Features({}),id=5), Annotation(746,780,PROPNSEQ,features=Features({}),id=6)])})" ] }, "execution_count": 62, "metadata": {}, "output_type": "execute_result" } ], "source": [ "doc2" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Processing\n", "\n", "* Instance of Annotator: process a document and return it (usually the same document)\n", " * optionally: return None or a list of documents\n", "* Pipeline:\n", " * run several Annotators in Sequence\n", " * handle None / list of returned documents\n", " * also handle per-corpus initialization, finishing, return values\n", "* Executor (work in progress!) \n", " * Run a pipeline on a corpus or a document source / destination pair\n", " \n", "* Runner: \n", " * run a pipeline on a corpus, source/destination pair from the command line\n", " * Multiprocessing capable\n", " * run several workers locally or on a cluster (using Ray infrastructure)\n", " * corpus / source / destination must be multiprocessing capable, duplicated over nodes\n", " * e.g. DirFilesCorpus, DirFilesSource init: `nparts=1, partnr=0`" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Machine Learning\n", "\n", "* Currently Work in Progress\n", "* Not part of the `gatenlp` package \n", "* Planned: for each ML/DL backend a separate package\n", " * upcoming: `gatenlp-ml-huggingface` \n", " * basic functionality working: `gatenlp-ml-tner` for chunking / NER\n", " * https://github.com/GateNLP/python-gatenlp-ml-tner/\n", " * Example: https://github.com/GateNLP/python-gatenlp-ml-tner/tree/main/examples\n", "* Install ML package into gatenlp environment or install gatenlp into ML package environment\n", "* Most modern ML systems require GPU / powerful computers" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## What about Ontologies?\n", "\n", "* Java GATE has Ontology-related plugins\n", " * but very outdated and limited (no support for OWL2)\n", "* (Currently) no direct support in GateNLP\n", "* But: Python packages [rdflib](https://rdflib.readthedocs.io/en/stable/), [Owlready2](https://owlready2.readthedocs.io/en/v0.37/) and maybe others\n", "* Finding terms:\n", " * Extract names, process names as docs, build gazetteer, use gazetteer to match and annotate with URI\n", "* match classes, subclasses, etc: directly use e.g. Owlready2 API from e.g. PAMPAC or in own annotators" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# GATE Worker\n", "\n", "* Allows running the Java GATE process from Python\n", "* API for exchanging document and performing frequent Java GATE tasks from Python\n", "* [Py4J](https://www.py4j.org/) API to run ANY Java from Python\n", "* Python connects to a Java process, communicates over sockets\n", " * Option 1: Start GATE GUI, load PythonWorkerLr, then connect a GateNLP GateWorker to it\n", " * Option 2: Start Java GATE worker using the `gatenlp-gate-worker` command\n", " * Option 3: directly start the Java GATE worker when creating the GateNLP GateWorker instance\n", "\n", "Let's try Option 3 first: `GATE_HOME` environment variable must be set, or must know GATE installation directory" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## GATE Worker" ] }, { "cell_type": "code", "execution_count": 63, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "Trying to start GATE Worker on port=25333 host=127.0.0.1 log=false keep=false\n", "Process id is 1298277\n" ] }, { "name": "stdout", "output_type": "stream", "text": [ "CREOLE plugin loaded: creole \n", "CREOLE plugin loaded: Format: Bdoc 1.10\n", "Plugin Python version: 3.0.7 commit: 9adf5ed dirty: false\n", "Lib interaction version: 4.1 commit: 7819f1c dirty: false\n", "Python gatenlp version: 1.0.7 commit: 8c15d82 dirty: false\n", "CREOLE plugin loaded: Python 3.0.7\n" ] }, { "name": "stderr", "output_type": "stream", "text": [ "PythonWorkerRunner.java: starting server with 25333/127.0.0.1/ZBpgCbe3cAR-ue3HRkV2a9qRXBE/false\n" ] } ], "source": [ "from gatenlp.gateworker import GateWorker\n", "\n", "gs = GateWorker()\n", "# if GATE_HOME not set use gs = GateWorker(gatehome=\"/where/Gate/is/Installed\")\n", "# if java is not on the PATH use gs = GateWorker(java=\"\"/path/to/the/java/binary\")\n", "# If port(s) in use e.g.: `ss -ltp -at dport=:25333` " ] }, { "cell_type": "code", "execution_count": 64, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "class gate.corpora.DocumentImpl\n", "GATE Document_00015\n", "set()\n" ] } ], "source": [ "# Create a GATE document on the JAVA GATE side and return a handle\n", "gdoc1 = gs.createDocument(\"An example document mentioning Barack Obama and New York\")\n", "# Can call Java API methods on that handle and get/convert the result\n", "print(gdoc1.getClass())\n", "print(gdoc1.getName())\n", "print(gdoc1.getAnnotationSetNames())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## GATE Worker" ] }, { "cell_type": "code", "execution_count": 65, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "CREOLE plugin loaded: ANNIE 9.0\n" ] } ], "source": [ "# lets load the prepared ANNIE pipeline on the Java side and process the GATE document with it\n", "gs.loadMavenPlugin(\"uk.ac.gate.plugins\", \"annie\", \"9.0\")\n", "gpipe = gs.loadPipelineFromPlugin(\"uk.ac.gate.plugins\", \"annie\", \"/resources/ANNIE_with_defaults.gapp\")\n", "gcorp = gs.newCorpus()\n", "gcorp.add(gdoc1)\n", "gpipe.setCorpus(gcorp)\n", "gpipe.execute()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## GATE Worker\n", "\n", "So far, everything happened on the Java side, use a GateWorker API method to convert the document into a Python GateNLP document:" ] }, { "cell_type": "code", "execution_count": 66, "metadata": { "scrolled": false, "slideshow": { "slide_type": "slide" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(An example document mentioning Barack Obama and New York,features=Features({'gate.SourceURL': 'created from String'}),anns={'': AnnotationSet([Annotation(0,2,Token,features=Features({'orth': 'upperInitial', 'string': 'An', 'kind': 'word', 'length': '2', 'category': 'DT'}),id=0), Annotation(0,56,Sentence,features=Features({}),id=21), Annotation(2,3,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=1), Annotation(3,10,Token,features=Features({'orth': 'lowercase', 'string': 'example', 'kind': 'word', 'length': '7', 'category': 'NN'}),id=2), Annotation(10,11,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=3), Annotation(11,19,Token,features=Features({'orth': 'lowercase', 'string': 'document', 'kind': 'word', 'length': '8', 'category': 'NN'}),id=4), Annotation(19,20,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=5), Annotation(20,30,Token,features=Features({'orth': 'lowercase', 'string': 'mentioning', 'kind': 'word', 'length': '10', 'category': 'VBG'}),id=6), Annotation(30,31,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=7), Annotation(31,37,Token,features=Features({'orth': 'upperInitial', 'string': 'Barack', 'kind': 'word', 'length': '6', 'category': 'NNP'}),id=8), Annotation(31,43,Lookup,features=Features({'majorType': 'person_full', 'gender': 'male'}),id=17), Annotation(31,43,Person,features=Features({'firstName': 'Barack', 'ruleFinal': 'PersonFinal', 'gender': 'male', 'surname': 'Obama', 'kind': 'fullName', 'rule': 'GazPerson'}),id=29), Annotation(37,38,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=9), Annotation(38,43,Token,features=Features({'orth': 'upperInitial', 'string': 'Obama', 'kind': 'word', 'length': '5', 'category': 'NNP'}),id=10), Annotation(38,43,Lookup,features=Features({'majorType': 'person_full', 'gender': 'male'}),id=18), Annotation(43,44,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=11), Annotation(44,47,Token,features=Features({'orth': 'lowercase', 'string': 'and', 'kind': 'word', 'length': '3', 'category': 'CC'}),id=12), Annotation(47,48,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=13), Annotation(48,51,Token,features=Features({'orth': 'upperInitial', 'string': 'New', 'kind': 'word', 'length': '3', 'category': 'NNP'}),id=14), Annotation(48,56,Lookup,features=Features({'majorType': 'location', 'minorType': 'city'}),id=19), Annotation(48,56,Location,features=Features({'ruleFinal': 'LocFinal', 'rule': 'Location1', 'locType': 'city'}),id=30), Annotation(51,52,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=15), Annotation(52,56,Token,features=Features({'orth': 'upperInitial', 'string': 'York', 'kind': 'word', 'length': '4', 'category': 'NNP'}),id=16), Annotation(52,56,Lookup,features=Features({'majorType': 'location', 'minorType': 'city'}),id=20)])})" ] }, "execution_count": 66, "metadata": {}, "output_type": "execute_result" } ], "source": [ "pdoc1 = gs.gdoc2pdoc(gdoc1)\n", "pdoc1" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## GateWorker\n", "\n", "* Stopping: the GateWorker (Java process) can get stopped using `gs.close()`\n", "* Will also automatically stop when the Python process ends" ] }, { "cell_type": "code", "execution_count": 67, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "Java GatenlpWorker ENDING: 1298277\n" ] } ], "source": [ "gs.close()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# GateWorker Annotator\n", "\n", "An annotator to process Python GateNLP documents with a Java GATE pipeline" ] }, { "cell_type": "code", "execution_count": 68, "metadata": { "scrolled": true, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "Trying to start GATE Worker on port=31313 host=127.0.0.1 log=false keep=false\n", "Process id is 1298331\n" ] }, { "name": "stdout", "output_type": "stream", "text": [ "CREOLE plugin loaded: creole \n", "CREOLE plugin loaded: Format: Bdoc 1.10\n", "Plugin Python version: 3.0.7 commit: 9adf5ed dirty: false\n", "Lib interaction version: 4.1 commit: 7819f1c dirty: false\n", "Python gatenlp version: 1.0.7 commit: 8c15d82 dirty: false\n", "CREOLE plugin loaded: Python 3.0.7\n" ] }, { "name": "stderr", "output_type": "stream", "text": [ "PythonWorkerRunner.java: starting server with 31313/127.0.0.1/9S-V4nw7ia2D_IFvcnC7WT-f_00/false\n" ] }, { "name": "stdout", "output_type": "stream", "text": [ "CREOLE plugin loaded: ANNIE 9.0\n" ] } ], "source": [ "from gatenlp.gateworker import GateWorkerAnnotator\n", "# Specify a prepared GATE pipeline file to get loaded into Java GATE\n", "# Specify a GateWorker\n", "gw = GateWorker(port=31313)\n", "gs_app = GateWorkerAnnotator(pipeline=\"data/annie.xgapp\", gateworker=gw)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## GateWorkerAnnotator\n", "\n", "Example, running on a directory corpus:" ] }, { "cell_type": "code", "execution_count": 69, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Features({'gate.SourceURL': 'created from String', '__idx_140538250499152': 2})\n" ] }, { "name": "stderr", "output_type": "stream", "text": [ "Java GatenlpWorker ENDING: 1298331\n" ] }, { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(A third document for testing,features=Features({'gate.SourceURL': 'created from String', '__idx_140538250499152': 2}),anns={'': AnnotationSet([Annotation(0,1,Token,features=Features({'orth': 'upperInitial', 'string': 'A', 'kind': 'word', 'length': '1', 'category': 'DT'}),id=0), Annotation(0,28,Sentence,features=Features({}),id=10), Annotation(1,2,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=1), Annotation(2,7,Token,features=Features({'orth': 'lowercase', 'string': 'third', 'kind': 'word', 'length': '5', 'category': 'JJ'}),id=2), Annotation(2,7,Lookup,features=Features({'majorType': 'date', 'minorType': 'ordinal'}),id=9), Annotation(7,8,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=3), Annotation(8,16,Token,features=Features({'orth': 'lowercase', 'string': 'document', 'kind': 'word', 'length': '8', 'category': 'NN'}),id=4), Annotation(16,17,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=5), Annotation(17,20,Token,features=Features({'orth': 'lowercase', 'string': 'for', 'kind': 'word', 'length': '3', 'category': 'IN'}),id=6), Annotation(20,21,SpaceToken,features=Features({'string': ' ', 'kind': 'space', 'length': '1'}),id=7), Annotation(21,28,Token,features=Features({'orth': 'lowercase', 'string': 'testing', 'kind': 'word', 'length': '7', 'category': 'NN'}),id=8)])})" ] }, "execution_count": 69, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from gatenlp.processing.executor import SerialCorpusExecutor\n", "dircorpus = DirFilesCorpus(\"data/dir1\", sort=True)\n", "exe = SerialCorpusExecutor(annotator=gs_app, corpus=dircorpus)\n", "exe()\n", "gw.close()\n", "\n", "tmpdoc = dircorpus[2]\n", "print(tmpdoc.features)\n", "tmpdoc" ] }, { "cell_type": "code", "execution_count": 70, "metadata": { "slideshow": { "slide_type": "skip" } }, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", "
\n", "\n", "
" ], "text/plain": [ "Document(This is another document for testing which mentions John Smith.,features=Features({'__idx_140538250499152': 1}),anns={})" ] }, "execution_count": 70, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from gatenlp.processing.annotator import AnnotatorFunction\n", "# NOT SHOWN: clean the corpus we have just processed\n", "def clearanns(doc):\n", " # instead of just removing annotations, return a completely new document\n", " docnew = Document(doc.text)\n", " return docnew\n", "annclear = AnnotatorFunction(clearanns)\n", "execlear = SerialCorpusExecutor(annotator=annclear, corpus=dircorpus)\n", "execlear()\n", "dircorpus[1]" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Java GATE Python Plugin\n", "\n", "* Java GATE Plugin\n", "* Documentation: http://gatenlp.github.io/gateplugin-Python/\n", "* provides a Processing Resource (PR) PythonPr \n", "* PythonPr: makes it possible to use a Python program to annotated GATE documents \n", "* Plugin comes with its own copy of the `gatenlp` package!\n", " * but can use `gatenlp` installed separately as well\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Requirements\n", "\n", "* Python 3.6 or higher installed\n", "* possibly own Python environment\n", " * `conda create -n gatenlp python=3.9`\n", "* `sortedcontainers`package\n", " * `pip install sortedcontainers`\n", "* Optionally other dependencies for using Spacy etc.\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Using the Plugin in GATE\n", "\n", "* Requires GATE 8.6.1 or later\n", "* Load from Plugin Manager \n", " * make sure to use latest version\n", " * make sure \"Load always\" was not checked, NOT already loaded!\n", " * make sure \"Save session\" is not enabled, if necessary restart!\n", "* Or: will get loaded with a pipeline that uses the plugin automatically" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Create new Pipeline\n", "\n", "1. Load the Python plugin: check \"Load Now\" for \"Python\", then click \"Apply All\"\n", "2. Create PR PythonPr: right click \"Processing Resource\", choose \"PythonPr\"\n", "3. Click file dialog button for the `pythonProgram` init parameter\n", "4. In the file selection dialog, choose directory and enter a non-existing name `test1.py`, click Open, then OK\n", "5. When the specified file does not exist, it is created with a template content\n", "6. Double click PR to show in GATE \n", "7. Save often!\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Edit PythonPr Program\n", "\n", "* from the GATE GUI: \n", " * Save: write content to the file (no warning if changed with other program!)\n", " * Save & Use: also check for syntax error\n", "* with external editor:\n", " * changes NOT automatically reflected in GATE Editor!\n", " * to use a changed file: right click PythonPr and select \"Reinitialise\"" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PythonPr Program Template" ] }, { "cell_type": "code", "execution_count": 71, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "from gatenlp import Document, AnnotationSet, GateNlpPr, interact\n", "\n", "@GateNlpPr\n", "class MyAnnotator:\n", " # the following method is run on every document, this method must exist:\n", " def __call__(self, doc, **kwargs):\n", " pass\n", "\n", " # the start and finish methods are optional, if they exist the start\n", " # method is called before the first document of a corpus and the finish \n", " # method is called after the last document.\n", " # def start(self, **kwargs):\n", " # pass\n", " # def finish(self, **kwargs):\n", " # pass\n", "\n", "# THE FOLLOWING MUST BE PRESENT SO THAT GATE CAN COMMUNICATE WITH THE PYTHON PROCESS!\n", "if __name__ == \"___main__\": # NOTE: changed from __main__ to ___main__ to prevent running in Notebook!\n", " interact()\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PythonPr Program Example\n" ] }, { "cell_type": "code", "execution_count": 72, "metadata": { "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "from gatenlp import Document, AnnotationSet, GateNlpPr, interact\n", "\n", "@GateNlpPr\n", "class MyAnnotator:\n", " def __init__(self):\n", " self.n_docs = 0\n", " def __call__(self, doc, **kwargs):\n", " self.n_docs += 1\n", " doc.annset().add(0,3,\"SomeType\")\n", " doc.features[\"docnr\"] = self.n_docs\n", " def start(self, **kwargs):\n", " print(\"Processing starting, we got kwargs:\", kwargs)\n", " self.n_docs = 0\n", " def finish(self, **kwargs):\n", " print(\"Processing finished, documents processed: \", self.n_docs)\n", "\n", "if __name__ == \"___main__\": # NOTE: changed from __main__ to ___main__ to prevent running in Notebook!\n", " interact()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Create PythonPr Pipeline\n", "\n", "* new Application (Pipeline)\n", "* add PythonPr to the pipeline\n", "* Review Runtime Parameters\n", "* Create/load a document\n", "* Create a corpus for the document\n", "* Double-click pipeline, choose corpus\n", "* Run Application\n", "* See example pipeline `pythonpr-example1.xgapp`" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PythonPr Runtime Parameters\n", "\n", "* `pythonBinary`/`pythonBinaryUrl`: if `python` is not on the path or you want to use a specific binary\n", " * use a specific environment by using the binary from that environment!\n", "* `usePluginGatenlpPackage`: the Python plugin contains its own version of Python `gatenlp`, if `false` use whatever is installed into the environment instead\n", "* `setsToUse`: replace `*` with a list of Annotation Set names to avoid transferring lots of existing anntoations\n", "* `programParams`: send arbitrary parameters to the Python program (as kwargs)\n", " * can get pre-set if a JSON file `pythonscript.py.parms` exists \n", "* `configFile`: select any file/dir to pass as `_config_file` parameter to the Python program (as kwarg)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## How it works\n", "\n", "- when pipeline/contoller runs, a separate Python process is started\n", "- the Python code is loaded\n", "- communication is done via pipes over stdin/stdout between Java and Python\n", "- `interact()` communicates with the Java PythonPr\n", "- when the pipeline starts, the `start` method is called, `programParams` passed on\n", "- for each document:\n", " * the document is converted to bdoc json, transferred, converted to GateNLP document and passed to `__call__`\n", " * the changes to the document done via the gatenlp API are recorded and sent back to Java\n", " * PythonPr applies the changes to the document\n", "- when the pipeline finishes, the `finish` method is called and any results returned to Java" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Multiprocessing\n", "\n", "* In GATE, multiprocessing is done via *duplication* of the pipeline and running pipelines in parallel\n", "* Done via [gcp](https://github.com/GateNLP/gcp), easily using `gcp-direct.sh`\n", "* PythonPr supports duplication and multiprocessing\n", "* One parallel Python process for each duplicate\n", "* number of duplicates and duplicate id passed to each process\n", "* if more than one duplicate, the `reduce(resultslist)` method is invoked\n", "* see example pipeline `pythonpr-count-words.xgapp`" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## PythonPr Prepared Pipelines\n", "\n", "* Two prepared pipelines: for running Spacy and Stanza\n", "* Right click Applications - Ready Made Applications - Python \n", " * python-spacy \n", " * python-stanford-stanza\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## What to use when?\n", "\n", "* Java GATE \"Classic\": \n", " * need existing plugins/pipelines, need GUI editing/annotation\n", " * within Java app; GCP / Mimir involved\n", "* Gate Worker from Python:\n", " * Need Java GATE plugins/pipelines\n", " * but Python context / application\n", " * consider running in Java GATE first, process result files with Python GateNLP\n", "* Python GateNLP:\n", " * Python context, ML/DNN (Pytorch, Tensorflow), numpy, ...\n", " * Want to use Stanza/Spacy; GateNLP-only functions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "More documentation:\n", "\n", "* Python GateNLP https://gatenlp.github.io/python-gatenlp/\n", "* Java GATE Python Plugin: http://gatenlp.github.io/gateplugin-Python/\n", "* Java GATE Format_Bdoc plugin: https://gatenlp.github.io/gateplugin-Format_Bdoc/\n", "\n", "\n", "## Thank You & Have Fun with GATE and GateNLP!\n", "\n" ] }, { "cell_type": "code", "execution_count": 73, "metadata": { "slideshow": { "slide_type": "skip" } }, "outputs": [ { "data": { "text/plain": [ "1656789312.4475234" ] }, "execution_count": 73, "metadata": {}, "output_type": "execute_result" } ], "source": [ "import time\n", "time.time()" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [] } ], "metadata": { "celltoolbar": "Slideshow", "execution": { "allow_errors": true }, "kernelspec": { "display_name": "gatenlp-37", "language": "python", "name": "gatenlp-37" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.9" }, "rise": { "auto_select": "none", "enable_chalkboard": false, "header": "", "scroll": true, "theme": "white", "transition": "none" }, "widgets": { "application/vnd.jupyter.widget-state+json": { "state": {}, "version_major": 2, "version_minor": 0 } } }, "nbformat": 4, "nbformat_minor": 4 }