References
 [Agatonovic et al. 08]   
M. Agatonovic, N. Aswani, K. Bontcheva, H. Cunningham, T. Heitz, Y. Li, I. Roberts,
       and V. Tablan.  Large-scale, parallel automatic patent annotation.  In Proceedings of the
       1st ACM workshop on Patent information retrieval (PaIR ’08, 30 October 2008, PaIR ’08,
       pages 1–8, New York, NY, USA, October 2008. ACM.
       
 [Ao & Takagi 05]   
H. Ao and T. Takagi.  ALICE: an algorithm to extract abbreviations from MEDLINE.  J
       Am Med Inform Assoc, 12(5):576–586, 2005.
       
 [Aronson & Lang 10]   
A. R. Aronson and F.-M. Lang. An overview of MetaMap: historical perspective and recent
       advances. Journal of the American Medical Informatics Association (JAMIA), 17:229–236,
       2010.
       
 [Aswani & Gaizauskas 09]   
N. Aswani and R. Gaizauskas.  Evolving a General Framework for Text Alignment: Case
       Studies with Two South Asian Languages.  In Proceedings of the International Conference
       on Machine Translation: Twenty-Five Years On, Cranfield, Bedfordshire, UK, November
       2009.
       
 [Aswani & Gaizauskas 10]   
N. Aswani  and  R. Gaizauskas.   Developing  Morphological  Analysers  for  South  Asian
                                                                                         
                                                                                         
       Languages:  Experimenting  with  the  Hindi  and  Gujarati  Languages.   In  7th  Language
       Resources and Evaluation Conference (LREC), La Valletta, Malta, May 2010. ELRA.
       
 [Aswani et al. 05]   
N. Aswani,  V. Tablan,  K. Bontcheva,  and  H. Cunningham.    Indexing  and  Querying
       Linguistic  Metadata  and  Document  Content.    In  Proceedings  of  Fifth  International
       Conference on Recent Advances in Natural Language Processing (RANLP2005), Borovets,
       Bulgaria, 2005.
       
 [Aswani et al. 06]   
N. Aswani,  K. Bontcheva,  and  H. Cunningham.     Mining  information  for  instance
       unification. In 5th International Semantic Web Conference (ISWC2006), Athens, Georgia,
       USA, 2006.
       
 [Azar 89]   
S. Azar. Understanding and Using English Grammar. Prentice Hall Regents, 1989.
       
 [Baker et al. 02]   
P. Baker,  A. Hardie,  T. McEnery,  H. Cunningham,  and  R. Gaizauskas.   EMILLE,  A
       67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
       In Proceedings of 3rd Language Resources and Evaluation Conference (LREC’2002), pages
       819–825, 2002.
       
 [Bird & Liberman 99]   
S. Bird and M. Liberman.   A Formal Framework for Linguistic Annotation.   Technical
       Report MS-CIS-99-01, Department of Computer and Information Science, University of
       Pennsylvania, Philadelphia, PA, 1999. http://xxx.lanl.gov/abs/cs.CL/9903003.
       
 [Bontcheva & Sabou 06]   
K. Bontcheva and M. Sabou. Learning Ontologies from Software Artifacts: Exploring and
       Combining Multiple Sources. In Workshop on Semantic Web Enabled Software Engineering
       (SWESE), Athens, G.A., USA, November 2006.
       
 [Bontcheva 04]   
K. Bontcheva.   Open-source  Tools  for  Creation,  Maintenance,  and  Storage  of  Lexical
       Resources  for  Language  Generation  from  Ontologies.   In  Proceedings  of  4th  Language
       Resources and Evaluation Conference (LREC’04), 2004.
       
 [Bontcheva 05]   
K. Bontcheva.    Generating  Tailored  Textual  Summaries  from  Ontologies.    In  Second
       European Semantic Web Conference (ESWC’2005), 2005.
       
 [Bontcheva et al. 00]   
K. Bontcheva,  H. Brugman,  A. Russel,  P. Wittenburg,  and  H. Cunningham.     An
       Experiment in Unifying Audio-Visual and Textual Infrastructures for Language Processing
       R&D.  In Proceedings of the Workshop on Using Toolsets and Architectures To Build NLP
       Systems at COLING-2000, Luxembourg, 2000. http://gate.ac.uk/.
       
 [Bontcheva et al. 02a]   
K. Bontcheva, H. Cunningham, V. Tablan, D. Maynard, and O. Hamza. Using GATE as
       an           Environment           for           Teaching           NLP.                              In
       Proceedings of the ACL Workshop on Effective Tools and Methodologies in Teaching NLP,
       2002. http://gate.ac.uk/sale/acl02/gate4teaching.pdf.
       
 [Bontcheva et al. 02b]   
K. Bontcheva,  H. Cunningham,  V. Tablan,  D. Maynard,  and  H. Saggion.   Developing
       Reusable and Robust Language Processing Components for Information Systems using
       GATE.    In  Proceedings  of  the  3rd  International  Workshop  on  Natural  Language  and
       Information Systems (NLIS’2002), Aix-en-Provence, France, 2002. IEEE Computer Society
       Press. http://gate.ac.uk/sale/nlis/nlis.ps.
       
 [Bontcheva et al. 02c]   
K. Bontcheva,                 M. Dimitrov,                 D. Maynard,                 V. Tablan,
       and  H. Cunningham.   Shallow  Methods  for  Named  Entity  Coreference  Resolution.   In
       Chaînes de références et résolveurs d’anaphores, workshop TALN 2002, Nancy, France,
       2002. http://gate.ac.uk/sale/taln02/taln-ws-coref.pdf.
       
 [Bontcheva et al. 03]   
K. Bontcheva,  A. Kiryakov,  H. Cunningham,  B. Popov,  and  M. Dimitrov.   Semantic
       web  enabled,  open  source  language  technology.     In  EACL  workshop  on  Language
       Technology  and  the  Semantic  Web:  NLP  and  XML,   Budapest,   Hungary,   2003.
       http://gate.ac.uk/sale/eacl03-semweb/bontcheva-etal-final.pdf.
       
 [Bontcheva et al. 04]   
K. Bontcheva,  V. Tablan,  D. Maynard,  and  H. Cunningham.     Evolving  GATE  to
       Meet  New  Challenges  in  Language  Engineering.      Natural  Language  Engineering,
       10(3/4):349—373, 2004.
       
 [Bontcheva et al. 06a]   
K. Bontcheva, H. Cunningham, A. Kiryakov, and V. Tablan.  Semantic Annotation and
       Human Language Technology. In J. Davies, R. Studer, and P. Warren, editors, Semantic
       Web Technology: Trends and Research. John Wiley and Sons, 2006.
       
 [Bontcheva et al. 06b]   
K. Bontcheva,  J. Davies,  A. Duke,  T. Glover,  N. Kings,  and  I. Thurlow.    Semantic
       Information  Access.   In  J. Davies,  R. Studer,  and  P. Warren,  editors,  Semantic  Web
       Technologies. John Wiley and Sons, 2006.
       
 [Bontcheva et al. 09]   
K. Bontcheva, B. Davis, A. Funk, Y. Li, and T. Wang.  Human Language Technologies.
       In J. Davies, M. Grobelnik, and D. Mladenic, editors, Semantic Knowledge Management,
       pages 37–49. 2009.
       
 [Bontcheva et al. 10]   
K. Bontcheva,  H. Cunningham,  I. Roberts,  and  V. Tablan.    Web-based  collaborative
       corpus annotation: Requirements and a framework implementation.  In Proceedings of the
       LREC 2010 Workshop on New Challenges for NLP Frameworks, 17–23 May 2010, pages
       20–27, Valletta, Malta, May 2010.
       
 [Bontcheva et al. 13]   
K. Bontcheva, L. Derczynski, A. Funk, M. A. Greenwood, D. Maynard, and N. Aswani.
                                                                                         
                                                                                         
       TwitIE:  An  Open-Source  Information  Extraction  Pipeline  for  Microblog  Text.     In
       Proceedings  of  the  International  Conference  on  Recent  Advances  in  Natural  Language
       Processing. Association for Computational Linguistics, 2013.
       
 [Booch 94]   
G. Booch. Object-Oriented Analysis and Design 2nd Edn. Benjamin/Cummings, 1994.
       
 [Bosma & Vossen 10]   
W. Bosma  and  P. Vossen.    Bootstrapping  language-neutral  term  extraction.    In  7th
       Language Resources and Evaluation Conference (LREC), Valletta, Malta, 2010.
       
 [Brugman et al. 99]   
H. Brugman, K. Bontcheva, P. Wittenburg, and H. Cunningham. Integrating Multimedia
       and  Textual  Software  Architectures  for  Language  Technology.      Technical  report
       MPI-TG-99-1, Max-Planck Institute for Psycholinguistics, Nijmegen, Netherlands, 1999.
       
 [Caporaso et al. 07]   
J. G.  Caporaso,  W. A. B.  Jr.,  D. A.  Randolph,  K. B.  Cohen,  ,  and  L. Hunter.
       MutationFinder: A high-performance system for extracting point mutation mentions from
       text. Bioinformatics, 23(14):1862–1865, 2007.
       
 [Carletta 96]   
J. Carletta. Assessing agreement on classification tasks: the Kappa statistic. Computational
       Linguistics, 22(2):249–254, 1996.
       
 [Chinchor 92]   
N. Chinchor.    MUC-4  Evaluation  Metrics.    In  Proceedings  of  the  Fourth  Message
       Understanding Conference, pages 22–29, 1992.
       
 [Cimiano et al. 03]   
P. Cimiano, S.Staab, and J. Tane. Automatic Acquisition of Taxonomies from Text: FCA
       meets NLP.  In Proceedings of the ECML/PKDD Workshop on Adaptive Text Extraction
       and Mining, pages 10–17, Cavtat-Dubrovnik, Croatia, 2003.
                                                                                         
                                                                                         
       
 [Cobuild 99]   
C. Cobuild, editor. English Grammar. Harper Collins, 1999.
       
 [Cunningham & Bontcheva 05]   
H. Cunningham  and  K. Bontcheva.   Computational  Language  Systems,  Architectures.
       Encyclopedia of Language and Linguistics, 2nd Edition, pages 733–752, 2005.
       
 [Cunningham & Scott 04a]   
H. Cunningham   and   D. Scott.      Introduction   to   the   Special   Issue   on   Software
       Architecture   for   Language   Engineering.       Natural   Language   Engineering,   2004.
       http://gate.ac.uk/sale/jnle-sale/intro/intro-main.pdf.
       
 [Cunningham & Scott 04b]   
H. Cunningham and D. Scott, editors.  Special Issue of Natural Language Engineering on
       Software Architecture for Language Engineering. Cambridge University Press, 2004.
       
 [Cunningham 94]   
H. Cunningham. Support Software for Language Engineering Research. Technical Report
       94/05, Centre for Computational Linguistics, UMIST, Manchester, 1994.
       
 [Cunningham 99a]   
H. Cunningham.  A Definition and Short History of Language Engineering.  Journal of
       Natural Language Engineering, 5(1):1–16, 1999.
       
 [Cunningham 99b]   
H. Cunningham.   JAPE: a Java Annotation Patterns Engine.   Research Memorandum
       CS–99–06, Department of Computer Science, University of Sheffield, May 1999.
       
 [Cunningham 00]   
H. Cunningham.   Software  Architecture  for  Language  Engineering.   Unpublished  PhD
       thesis,  Department  of  Computer  Science,  University  of  Sheffield,  Sheffield,  UK,  2000.
       http://gate.ac.uk/sale/thesis/.
                                                                                         
                                                                                         
       
 [Cunningham 02]   
H. Cunningham. GATE, a General Architecture for Text Engineering. Computers and the
       Humanities, 36:223–254, 2002.
       
 [Cunningham 05]   
H. Cunningham.   Information  Extraction,  Automatic.   Encyclopedia  of  Language  and
       Linguistics, 2nd Edition, pages 665–677, dec 2005.
       
 [Cunningham et al. 94]   
H. Cunningham,   M. Freeman,   and   W. Black.      Software   Reuse,   Object-Oriented
       Frameworks and Natural Language Processing.  In New Methods in Language Processing
       (NeMLaP-1), 14-16 September 1994, pages 357–367, Manchester, 1994. UCL Press.
       
 [Cunningham et al. 95]   
H. Cunningham,  R. Gaizauskas,  and  Y. Wilks.     A  General  Architecture  for  Text
       Engineering  (GATE)  –  a  new  approach  to  Language  Engineering  R&D.    Technical
       Report  CS–95–21,  Department  of  Computer  Science,  University  of  Sheffield,  1995.
       http://xxx.lanl.gov/abs/cs.CL/9601009.
       
 [Cunningham et al. 96a]   
H. Cunningham, K. Humphreys, R. Gaizauskas, and M. Stower.  CREOLE Developer’s
       Manual. Technical report, Department of Computer Science, University of Sheffield, 1996.
       http://www.dcs.shef.ac.uk/nlp/gate.
       
 [Cunningham et al. 96b]   
H. Cunningham, K. Humphreys, R. Gaizauskas, and Y. Wilks.   TIPSTER-Compatible
       Projects at Sheffield. In Advances in Text Processing, TIPSTER Program Phase II. DARPA,
       Morgan Kaufmann, California, 1996.
       
 [Cunningham et al. 96c]   
H. Cunningham,                                                                                   Y. Wilks,
       and R. Gaizauskas. GATE – a General Architecture for Text Engineering. In Proceedings
       of the 16th Conference on Computational Linguistics (COLING-96), Copenhagen, August
       1996. ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun96b.ps.
                                                                                         
                                                                                         
       
 [Cunningham et al. 96d]   
H. Cunningham,  Y. Wilks,  and  R. Gaizauskas.   Software  Infrastructure  for  Language
       Engineering. In Proceedings of the AISB Workshop on Language Engineering for Document
       Analysis and Recognition, Brighton, U.K., April 1996.
       
 [Cunningham et al. 96e]   
H. Cunningham,  Y. Wilks,  and  R. Gaizauskas.    New  Methods,  Current  Trends  and
       Software Infrastructure for NLP.  In Proceedings of the Conference on New Methods in
       Natural Language Processing (NeMLaP-2), Bilkent University, Turkey, September 1996.
       ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun96c.ps.
       
 [Cunningham et al. 97a]   
H. Cunningham,
       K. Humphreys,  R. Gaizauskas,  and  Y. Wilks.    GATE  –  a  TIPSTER-based  General
       Architecture  for  Text  Engineering.    In  Proceedings  of  the  TIPSTER  Text  Program
       (Phase  III)  6  Month  Workshop.  DARPA,  Morgan  Kaufmann,  California,  May  1997.
       ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun97e.ps.
       
 [Cunningham et al. 97b]   
H. Cunningham,                                                                            K. Humphreys,
       R. Gaizauskas, and Y. Wilks.  Software Infrastructure for Natural Language Processing.
       In Proceedings of the 5th Conference on Applied Natural Language Processing (ANLP-97),
       March 1997. ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun97a.ps.gz.
       
 [Cunningham et al. 98a]   
H. Cunningham, W. Peters, C. McCauley, K. Bontcheva, and Y. Wilks. A Level Playing
       Field  for  Language  Resource  Evaluation.   In  Workshop  on  Distributing  and  Accessing
       Lexical Resources at Conference on Language Resources Evaluation, Granada, Spain, 1998.
       http://www.dcs.shef.ac.uk/ hamish/dalr.
       
 [Cunningham et al. 98b]   
H. Cunningham, M. Stevenson, and Y. Wilks.   Implementing a Sense Tagger within a
       General Architecture for Language Engineering. In Proceedings of the Third Conference on
       New Methods in Language Engineering (NeMLaP-3), pages 59–72, Sydney, Australia, 1998.
       
 [Cunningham et al. 99]   
H. Cunningham,  R. Gaizauskas,  K. Humphreys,  and  Y. Wilks.    Experience  with  a
       Language Engineering Architecture: Three Years of GATE. In Proceedings of the AISB’99
       Workshop  on  Reference  Architectures  and  Data  Standards  for  NLP,  Edinburgh,  April
       1999. The Society for the Study of Artificial Intelligence and Simulation of Behaviour.
       http://www.dcs.shef.ac.uk/ hamish/GateAisb99.html.
       
 [Cunningham et al. 00a]   
H. Cunningham,                                                                             K. Bontcheva,
       W. Peters, and Y. Wilks. Uniform language resource access and distribution in the context
       of a General Architecture for Text Engineering (GATE). In Proceedings of the Workshop on
       Ontologies and Language Resources (OntoLex’2000), Sozopol, Bulgaria, September 2000.
       http://gate.ac.uk/sale/ontolex/ontolex.ps.
       
 [Cunningham et al. 00b]   
H. Cunningham, K. Bontcheva, V. Tablan, and Y. Wilks.   Software Infrastructure for
       Language Resources: a Taxonomy of Previous Work and a Requirements Analysis.   In
       Proceedings of the second International Conference on Language Resources and Evaluation
       (LREC 2000), 30 May – 2 Jun 2000, pages 815–824, Athens, Greece, 2000.
       
 [Cunningham et al. 00c]   
H. Cunningham,  D. Maynard,  K. Bontcheva,  V. Tablan,  and  Y. Wilks.    Experience
       of  using  GATE  for  NLP  R&D.    In  Proceedings  of  the  Workshop  on  Using  Toolsets
       and  Architectures  To  Build  NLP  Systems  at  COLING-2000,   Luxembourg,   2000.
       http://gate.ac.uk/.
       
 [Cunningham et al. 00d]   
H. Cunningham, D. Maynard, and V. Tablan. JAPE: a Java Annotation Patterns Engine
       (Second Edition).  Research Memorandum CS–00–10, Department of Computer Science,
       University of Sheffield, Sheffield, UK, November 2000.
       
 [Cunningham et al. 02]   
H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: an Architecture for
       Development of Robust HLT Applications.   In Proceedings of the 40th Annual Meeting
       on Association for Computational Linguistics, 7–12 July 2002, ACL ’02, pages 168–175,
       Stroudsburg, PA, USA, 2002. Association for Computational Linguistics.
                                                                                         
                                                                                         
       
 [Cunningham et al. 03]   
H. Cunningham, V. Tablan, K. Bontcheva, and M. Dimitrov. Language Engineering Tools
       for Collaborative Corpus Annotation. In Proceedings of Corpus Linguistics 2003, Lancaster,
       UK, 2003. http://gate.ac.uk/sale/cl03/distrib-ollie-cl03.doc.
       
 [Damljanovic & Bontcheva 08]   
D. Damljanovic and K. Bontcheva.  Enhanced Semantic Access to Software Artefacts.  In
       Workshop on Semantic Web Enabled Software Engineering (SWESE), Karlsruhe, Germany,
       October 2008.
       
 [Damljanovic 10]   
D. Damljanovic. Towards Portable Controlled Natural Languages for Querying Ontologies.
       In M. Rosner and N. Fuchs, editors, Second Workshop on Controlled Natural Languages,
       volume  622  of  CEUR  Workshop  Pre-Proceedings  ISSN  1613-0073.  http://ceur-ws.org,
       Marettimo Island, Italy, September 2010.
       
 [Damljanovic et al. 08]   
D. Damljanovic, V. Tablan, and K. Bontcheva.  A Text-based Query Interface to OWL
       Ontologies.  In 6th Language Resources and Evaluation Conference (LREC), Marrakech,
       Morocco, May 2008. ELRA.
       
 [Damljanovic et al. 09]   
D. Damljanovic, F. Amardeilh, and K. Bontcheva.   CA Manager Framework: Creating
       Customised Workflows for Ontology Population and Semantic Annotation. In Proceedings
       of The Fifth International Conference on Knowledge Capture (KCAP’09), California, USA,
       September 2009.
       
 [Davies & Fleiss 82]   
M. Davies  and  J. Fleiss.    Measuring  Agreement  for  Multinomial  Data.    Biometrics,
       38:1047–1051, 1982.
       
 [Davis et al. 06]   
B. Davis,  S. Handschuh,  H. Cunningham,  and  V. Tablan.   Further  use  of  Controlled
                                                                                         
                                                                                         
       Natural Language for Semantic Annotation of Wikis.  In Proceedings of the 1st Semantic
       Authoring and Annotation Workshop at ISWC2006, Athens, Georgia, USA, November 2006.
       
 [Day et al. 97]   
D. Day,   J. Aberdeen,   L. Hirschman,   R. Kozierok,   P. Robinson,   and   M. Vilain.
       Mixed-Initiative Development of Language Processing Systems.  In Proceedings of the 5th
       Conference on Applied Natural Language Processing (ANLP-97), 1997.
       
 [Della Valle et al. 08]   
E. Della Valle, D. Cerizza, I. Celino, A. Turati, H. Lausen, N. Steinmetz, M. Erdmann,
       and A. Funk.  Realizing Service-Finder: Web service discovery at web scale.  In European
       Semantic Technology Conference (ESTC), Vienna, September 2008.
       
 [Derczynski et al. 13]   
L. Derczynski, A. Ritter, S. Clark, and K. Bontcheva.  Twitter Part-of-Speech Tagging
       for All: Overcoming Sparse and Noisy Data. In Proceedings of Recent Advances in Natural
       Language Processing (RANLP). Association for Computational Linguistics, 2013.
       
 [Derczynski et al. 14]   
L. Derczynski,  C. Field,  and  K. Bøgh.    DKIE:  Open  source  information  extraction
       for  Danish.    In  S. Wintner,  M. Tadia,  and  B. Babych,  editors,  Proceedings  of  the
       Demonstrations at the 14th Conference of the European Chapter of the Association for
       Computational Linguistics, pages 61–64. Association for Computational Linguistics, 2014.
       
 [Dimitrov 02a]   
M. Dimitrov.                                                               A                   Light-weight
       Approach to Coreference Resolution for Named Entities in Text. MSc Thesis, University of
       Sofia, Bulgaria, 2002. http://www.ontotext.com/ie/thesis-m.pdf.
       
 [Dimitrov 02b]   
M. Dimitrov.                                                               A                   Light-weight
       Approach to Coreference Resolution for Named Entities in Text. MSc Thesis, University of
       Sofia, Bulgaria, 2002. http://www.ontotext.com/ie/thesis-m.pdf.
       
 [Dimitrov et al. 02]   
M. Dimitrov, K. Bontcheva, H. Cunningham, and D. Maynard. A Light-weight Approach
       to Coreference Resolution for Named Entities in Text. In Proceedings of the Fourth Discourse
       Anaphora and Anaphor Resolution Colloquium (DAARC), Lisbon, 2002.
       
 [Dimitrov et al. 04]   
M. Dimitrov, K. Bontcheva, H. Cunningham, and D. Maynard. A Light-weight Approach
       to  Coreference  Resolution  for  Named  Entities  in  Text.    In  A. Branco,  T. McEnery,
       and  R. Mitkov,  editors,  Anaphora Processing: Linguistic, Cognitive and Computational
       Modelling. John Benjamins, 2004.
       
 [Dowman et al. 05a]   
M. Dowman,     V. Tablan,     H. Cunningham,     and     B. Popov.              Content
       augmentation    for    mixed-mode    news    broadcasts.          In    Proceedings   of   the
       3rd    European    Conference    on    Interactive    Television:    User    Centred    ITV
       Systems,   Programmes   and   Applications,    Aalborg    University,    Denmark,    2005.
       http://gate.ac.uk/sale/euro-itv-2005/content-augmentation-for-mixed-mode-news-broadcast-consumption.pdf.
       
 [Dowman et al. 05b]   
M. Dowman,  V. Tablan,  H. Cunningham,  and  B. Popov.    Web-assisted  annotation,
       semantic indexing and search of television and radio news.   In Proceedings of the 14th
       International World Wide Web Conference, Chiba, Japan, 2005.
       
 [Dowman et al. 05c]   
M. Dowman, V. Tablan, H. Cunningham, C. Ursu, and B. Popov. Semantically enhanced
       television news through web and video integration.  In Second European Semantic Web
       Conference (ESWC’2005), 2005.
       
 [DUC 01]   
NIST. Proceedings of the Document Understanding Conference, September 13 2001.
       
 [Eugenio & Glass 04]   
B. D. Eugenio and M. Glass. The kappa statistic: a second look. Computational Linguistics,
       1(30), 2004. (squib).
                                                                                         
                                                                                         
       
 [Finkel et al. 05]   
J. Finkel,  T. Grenager,  and  C. Manning.    Incorporating  non-local  information  into
       information extraction systems by Gibbs sampling.   In Proceedings of the 43rd Annual
       Meeting of the Association for Computational Linguistics, pages 363–370. Association for
       Computational Linguistics, 2005.
       
 [Fleiss 75]   
J. L. Fleiss. Measuring agreement between two judges on the presence or absence of a trait.
       Biometrics, 31:651–659, 1975.
       
 [Frakes & Baeza-Yates 92]   
W. Frakes  and  R. Baeza-Yates,  editors.    Information  retrieval,  data  structures  and
       algorithms. Prentice Hall, New York, Englewood Cliffs, N.J., 1992.
       
 [Funk et al. 07a]   
A. Funk,  D. Maynard,  H. Saggion,  and  K. Bontcheva.     Ontological  integration  of
       information  extracted  from  multiple  sources.   In  Multi-source Multilingual Information
       Extraction and Summarization (MMIES) workshop at Recent Advances in Natural Language
       Processing (RANLP07), pages 9–15, Borovets, Bulgaria, September 2007.
       
 [Funk et al. 07b]   
A. Funk,  V. Tablan,  K. Bontcheva,  H. Cunningham,  B. Davis,  and  S. Handschuh.
       CLOnE: Controlled Language for Ontology Editing. In Proceedings of the 6th International
       Semantic Web Conference (ISWC 2007), Busan, Korea, November 2007.
       
 [Gaizauskas et al. 95]   
R. Gaizauskas, T. Wakao, K. Humphreys, H. Cunningham, and Y. Wilks. Description of
       the LaSIE system as used for MUC-6. In Proceedings of the Sixth Message Understanding
       Conference (MUC-6), 6–8 November 1995, pages 207–220. Morgan Kaufmann, California,
       1995.
       
 [Gaizauskas et al. 96a]   
R. Gaizauskas,  P. Rodgers,  H. Cunningham,  and  K. Humphreys.   GATE  User  Guide.
       http://www.dcs.shef.ac.uk/nlp/gate, 1996.
                                                                                         
                                                                                         
       
 [Gaizauskas et al. 96b]   
R. Gaizauskas, H. Cunningham, Y. Wilks, P. Rodgers, and K. Humphreys.  GATE – an
       Environment to Support Research and Development in Natural Language Engineering. In
       Proceedings                             of                             the                             8th
       IEEE International Conference on Tools with Artificial Intelligence (ICTAI-96), Toulouse,
       France, October 1996. ftp://ftp.dcs.shef.ac.uk/home/robertg/ictai96.ps.
       
 [Gaizauskas et al. 03]   
R. Gaizauskas, M. A. Greenwood, M. Hepple, I. Roberts, H. Saggion, and M. Sargaison.
       The University of Sheffield’s TREC 2003 Q&A Experiments. In In Proceedings of the 12th
       Text REtrieval Conference, 2003.
       
 [Gaizauskas et al. 04]   
R. Gaizauskas, M. A. Greenwood, M. Hepple, I. Roberts, H. Saggion, and M. Sargaison.
       The University of Sheffield’s TREC 2004 Q&A Experiments. In In Proceedings of the 13th
       Text REtrieval Conference, 2004.
       
 [Gaizauskas et al. 05]   
R. Gaizauskas, M. A. Greenwood, M. Hepple, H. Harkema, H. Saggion, and A. Sanka.
       The University of Sheffield’s TREC 2005 Q&A Experiments. In In Proceedings of the 11th
       Text REtrieval Conference, 2005.
       
 [Gambäck & Olsson 00]   
B. Gambäck and F. Olsson.  Experiences of Language Engineering Algorithm Reuse.  In
       Second International Conference on Language Resources and Evaluation (LREC), pages
       155–160, Athens, Greece, 2000.
       
 [Gazdar & Mellish 89]   
G. Gazdar  and  C. Mellish.   Natural  Language  Processing  in  Prolog.   Addison-Wesley,
       Reading, MA, 1989.
       
 [Gooch 12]   
P. Gooch.  Badrex: In situ expansion and coreference of biomedical abbreviations using
       dynamic regular expressions. Technical report, City University London, London, 2012.
                                                                                         
                                                                                         
       
 [Greenwood et al. 02]   
M. A. Greenwood, I. Roberts, and R. Gaizauskas.  The University of Sheffield’s TREC
       2002 Q&A Experiments. In In Proceedings of the 11th Text REtrieval Conference, 2002.
       
 [Grishman 97]   
R. Grishman.
       TIPSTER Architecture Design Document Version 2.3.  Technical report, DARPA, 1997.
       http://www.itl.nist.gov/div894/894.02/related_projects/tipster/.
       
 [Hepple 00]   
M. Hepple. Independence and commitment: Assumptions for rapid training and execution
       of rule-based POS taggers.  In Proceedings of the 38th Annual Meeting of the Association
       for Computational Linguistics (ACL-2000), Hong Kong, October 2000.
       
 [Hripcsak & Heitjan 02]   
G. Hripcsak  and  D. Heitjan.    Measuring  agreement  in  medical  informatics  reliability
       studies. Journal of Biomedical Informatics, 35:99–110, 2002.
       
 [Hripcsak & Rothschild 05]   
G. Hripcsak  and  A. S.  Rothschild.    Agreement,  the  F-measure,  and  Reliability  in
       Information  Retrieval.     Journal  of  the  American  Medical  Informatics  Association,
       12(3):296–298, 2005.
       
 [Humphreys et al. 96]   
K. Humphreys,  R. Gaizauskas,  H. Cunningham,  and  S. Azzam.    CREOLE  Module
       Specifications. http://www.dcs.shef.ac.uk/nlp/gate/, 1996.
       
 [Humphreys et al. 98]   
K. Humphreys, R. Gaizauskas, S. Azzam, C. Huyck, B. Mitchell, H. Cunningham, and
       Y. Wilks.                  Description       of       the       LaSIE       system       as       used
       for MUC-7.  In Proceedings of the Seventh Message Understanding Conference (MUC-7).
       http://www.itl.nist.gov/iaui/894.02/related_projects/muc/index.html, 1998.
       
 [Humphreys et al. 99]   
K. Humphreys, R. Gaizauskas, M. Hepple, and M. Sanderson. The University of Sheffield
       TREC-8 Q&A System. In In Proceedings of the 8th Text REtrieval Conference, 1999.
       
 [Ide et al. 00]   
N. Ide, P. Bonhomme, and L. Romary.  XCES: An XML-based Standard for Linguistic
       Corpora. In Proceedings of the second International Conference on Language Resources and
       Evaluation (LREC 2000), 30 May – 2 Jun 2000, pages 825–830, Athens, Greece, 2000.
       
 [Jackson 75]   
M. Jackson. Principles of Program Design. Academic Press, London, 1975.
       
 [Jin et al. 06]   
Y. Jin, R. T. McDonald, K. Lerman, M. A. Mandel, S. Carroll, M. Y. Liberman, F. C.
       Pereira, R. S. Winters, , and P. S. White. Automated recognition of malignancy mentions
       in biomedical literature. BMC Bioinformatics, 7:492–499, 2006.
       
 [Kiryakov 03]   
A. Kiryakov. Ontology and Reasoning in MUMIS: Towards the Semantic Web. Technical
       Report  CS–03–03,  Department  of  Computer  Science,  University  of  Sheffield,  2003.
       http://gate.ac.uk/gate/doc/papers.html.
       
 [Kohlschütter et al. 10]   
C. Kohlschütter, P. Fankhauser, and W. Nejdl. Boilerplate Detection using Shallow Text
       Features.  In Proceedings of the Third ACM International Conference on Web Search and
       Data Mining, 2010.
       
 [Laclavik & Maynard 09]   
M. Laclavik and D. Maynard.  Motivating intelligent email in business: an investigation
       into current trends for email processing and communication research.  In Proceedings of
       Workshop on Emails in e-Commerce and Enterprise Context, 11th IEEE Conference on
       Commerce and Enterprise Computing, Vienna, Austria, 2009.
       
 [Lal & Ruger 02]   
P. Lal   and   S. Ruger.       Extract-based   summarization   with   simplification.       In
       Proceedings of the ACL 2002 Automatic Summarization / DUC 2002 Workshop,  2002.
       http://www.doc.ic.ac.uk/ srueger/pr-p.lal-2002/duc02-final.pdf.
       
 [Lal 02]   
P. Lal. Text summarisation. Unpublished M.Sc. thesis, Imperial College, London, 2002.
       
 [Li & Bontcheva 08]   
Y. Li and K. Bontcheva. Adapting support vector machines for f-term-based classification
       of patents.  ACM Transactions on Asian Language Information Processing, 7(2):7:1–7:19,
       2008.
       
 [Li & Cunningham 08]   
Y. Li and H. Cunningham.  Geometric and Quantum Methods for Information Retrieval.
       SIGIR Forum, 42(2):22–32, 2008.
       
 [Li & Shawe-Taylor 06]   
Y. Li and J. Shawe-Taylor. Using KCCA for Japanese-English Cross-language Information
       Retrieval  and  Document  Classification.    Journal  of  Intelligent  Information  Systems,
       27(2):117–133, 2006.
       
 [Li & Shawe-Taylor 07]   
Y. Li and J. Shawe-Taylor.   Advanced Learning Algorithms for Cross-language Patent
       Retrieval and Classification.  Information Processing and Management, 43(5):1183–1199,
       2007.
       
 [Li et al. 04]   
Y. Li,          K. Bontcheva,          and          H. Cunningham.                             An
       SVM Based Learning Algorithm for Information Extraction. Machine Learning Workshop,
       Sheffield, 2004. http://gate.ac.uk/sale/ml-ws04/mlw2004.pdf.
       
 [Li et al. 05a]   
Y. Li, K. Bontcheva, and H. Cunningham.  Using Uneven Margins SVM and Perceptron
       for Information Extraction. In Proceedings of Ninth Conference on Computational Natural
       Language Learning (CoNLL-2005), 2005.
       
 [Li et al. 05b]   
Y. Li, C. Miao, K. Bontcheva, and H. Cunningham.   Perceptron Learning for Chinese
       Word Segmentation.  In Proceedings of Fourth SIGHAN Workshop on Chinese Language
       processing (Sighan-05), pages 154–157, Korea, 2005.
       
 [Li et al. 05c]   
Y. Li, K. Bontcheva, and H. Cunningham. SVM Based Learning System For Information
       Extraction.  In J. Winkler, M. Niranjan, and N. Lawerence, editors, Deterministic and
       Statistical Methods in Machine Learning: First International Workshop, 7–10 September,
       2004, volume 3635 of Lecture Notes in Computer Science, pages 319–339, Sheffield, UK,
       2005. Springer.
       
 [Li et al. 07a]   
Y. Li, K. Bontcheva, and H. Cunningham.    Hierarchical, Perceptron-like Learning for
       Ontology Based Information Extraction. In 16th International World Wide Web Conference
       (WWW2007), pages 777–786, May 2007.
       
 [Li et al. 07b]   
Y. Li, K. Bontcheva, and H. Cunningham. Cost Sensitive Evaluation Measures for F-term
       Patent  Classification.   In  The First International Workshop on Evaluating Information
       Access (EVIA 2007), 15 May 2007, pages 44–53, Tokyo, Japan, May 2007.
       
 [Li et al. 07c]   
Y. Li,  K. Bontcheva,  and  H. Cunningham.    Experiments  of  opinion  analysis  on  the
       corpora MPQA and NTCIR-6.  In Proceedings of the Sixth NTCIR Workshop Meeting on
       Evaluation of Information Access Technologies: Information Retrieval, Question Answering
       and Cross-Lingual Information Access, pages 323–329, May 2007.
       
 [Li et al. 07d]   
Y. Li,  K. Bontcheva,  and  H. Cunningham.   SVM  Based  Learning  System  for  F-term
       Patent Classification. In Proceedings of the Sixth NTCIR Workshop Meeting on Evaluation
       of  Information  Access  Technologies:  Information  Retrieval,  Question  Answering  and
       Cross-Lingual Information Access, pages 396–402, May 2007.
       
 [Li et al. 09]   
Y. Li,  K. Bontcheva,  and  H. Cunningham.   Adapting  SVM  for  Data  Sparseness  and
       Imbalance:  A  Case  Study  on  Information  Extraction.   Natural  Language  Engineering,
       15(2):241–271, 2009.
       
 [Lombard et al. 02]   
M. Lombard,  J. Snyder-Duch,  and  C. C.  Bracken.        Content  analysis  in  mass
       communication: Assessment and reporting of intercoder reliability. Human Communication
       Research, 28:587–604, 2002.
       
 [LREC-1 98]   
Conference on Language Resources Evaluation (LREC-1), Granada, Spain, 1998.
       
 [LREC-2 00]   
Second Conference on Language Resources Evaluation (LREC-2), Athens, 2000.
       
 [Maeda & Strassel 04]   
K. Maeda and S. Strassel.  Annotation Tools for Large-Scale Corpus Development: Using
       AGTK at the Linguistic Data Consortium. In Proceedings of 4th Language Resources and
       Evaluation Conference (LREC’2004), 2004.
       
 [Manning & Schütze 99]   
C. Manning     and     H. Schütze.               Foundations     of     Statistical     Natural
       Language Processing. MIT press, Cambridge, MA, 1999. Supporting materials available at
       http://www.sultry.arts.usyd.edu.au/fsnlp/ .
       
 [Manov et al. 03]   
D. Manov, A. Kiryakov, B. Popov, K. Bontcheva, and D. Maynard.  Experiments with
                                                                                         
                                                                                         
       geographic               knowledge               for               information               extraction.
       In Workshop on Analysis of Geographic References, HLT/NAACL’03, Edmonton, Canada,
       2003. http://gate.ac.uk/sale/hlt03/paper03.pdf.
       
 [Marsh & Perzanowski 98]   
E. Marsh  and  D. Perzanowski.     Muc-7  evaluation  of  ie  technology:  Overview  of
       results.    In  Proceedings  of  the  Seventh  Message  Understanding  Conference  (MUC-7).
       http://www.itl.nist.gov/iaui/894.02/related_projects/muc/index.html, 1998.
       
 [Maynard & Greenwood 14]   
D. Maynard and M. A. Greenwood.  Who cares about sarcastic tweets? Investigating the
       impact of sarcasm on sentiment analysis. In Proceedings of LREC 2014, Reykjavik, Iceland,
       2014.
       
 [Maynard 05]   
D. Maynard. Benchmarking ontology-based annotation tools for the semantic web. In UK
       e-Science Programme All Hands Meeting (AHM2005) Workshop on Text Mining, e-Research
       and Grid-enabled Language Technology, Nottingham, UK, 2005.
       
 [Maynard 08]   
D. Maynard.  Benchmarking textual annotation tools for the semantic web.  In Proc. of
       6th International Conference on Language Resources and Evaluation (LREC), Marrakech,
       Morocco, 2008.
       
 [Maynard et al. 00]   
D. Maynard, H. Cunningham, K. Bontcheva, R. Catizone, G. Demetriou, R. Gaizauskas,
       O. Hamza,   M. Hepple,   P. Herring,   B. Mitchell,   M. Oakes,   W. Peters,   A. Setzer,
       M. Stevenson, V. Tablan, C. Ursu, and Y. Wilks. A Survey of Uses of GATE. Technical
       Report CS–00–06, Department of Computer Science, University of Sheffield, 2000.
       
 [Maynard et al. 01]   
D. Maynard,  V. Tablan,  C. Ursu,  H. Cunningham,  and  Y. Wilks.     Named  Entity
       Recognition from Diverse Text Types. In Recent Advances in Natural Language Processing
       2001 Conference, pages 257–274, Tzigov Chark, Bulgaria, 2001.
       
 [Maynard et al. 02a]   
D. Maynard, K. Bontcheva, H. Saggion, H. Cunningham, and O. Hamza.  Using a Text
       Engineering  Framework  to  Build  an  Extendable  and  Portable  IE-based  Summarisation
       System.   In  Proceedings  of  the  ACL  Workshop  on  Text  Summarisation,  pages  19–26,
       Phildadelphia, Pennsylvania, 2002. ACM.
       
 [Maynard et al. 02b]   
D. Maynard,  H. Cunningham,  K. Bontcheva,  and  M. Dimitrov.    Adapting  a  robust
       multi-genre  NE  system  for  automatic  content  extraction.   In  Proceedings  of  the  10th
       International  Conference  on  Artificial  Intelligence:  Methodology,  Systems,  Applications
       (AIMSA’02), Varna, Bulgaria, Sep 2002.
       
 [Maynard et al. 02c]   
D. Maynard,  H. Cunningham,  K. Bontcheva,  and  M. Dimitrov.   Adapting  A  Robust
       Multi-Genre NE System for Automatic Content Extraction.  In Proceedings of the Tenth
       International  Conference  on  Artificial  Intelligence:  Methodology,  Systems,  Applications
       (AIMSA 2002), 2002.
       
 [Maynard et al. 02d]   
D. Maynard, H. Cunningham, and R. Gaizauskas.  Named entity recognition at sheffield
       university.    In  H. Holmboe,  editor,  Nordic  Language  Technology  –  Arbog  for  Nordisk
       Sprogtechnologisk  Forskningsprogram  2002-2004,  pages  141–145.  Museum  Tusculanums
       Forlag, 2002.
       
 [Maynard et al. 02e]   
D. Maynard,  V. Tablan,  H. Cunningham,  C. Ursu,  H. Saggion,  K. Bontcheva,  and
       Y. Wilks. Architectural Elements of Language Engineering Robustness. Journal of Natural
       Language Engineering – Special Issue on Robust Methods in Analysis of Natural Language
       Data, 8(2/3):257–274, 2002.
       
 [Maynard et al. 03a]   
D. Maynard, K. Bontcheva, and H. Cunningham. From information extraction to content
       extraction. Submitted to EACL’2003, 2003.
       
 [Maynard et al. 03b]   
D. Maynard,  K. Bontcheva,  and  H. Cunningham.   Towards  a  semantic  extraction  of
       named entities.  In G. Angelova, K. Bontcheva, R. Mitkov, N. Nicolov, and N. Nikolov,
       editors, Proceedings of Recent Advances in Natural Language Processing (RANLP’03), pages
       255–261, Borovets, Bulgaria, Sep 2003. http://gate.ac.uk/sale/ranlp03/ranlp03.pdf.
       
 [Maynard et al. 03c]   
D. Maynard, K. Bontcheva, and H. Cunningham. Towards a semantic extraction of Named
       Entities. In Recent Advances in Natural Language Processing, Bulgaria, 2003.
       
 [Maynard et al. 03d]   
D. Maynard, V. Tablan, K. Bontcheva, and H. Cunningham.  Rapid customisation of an
       Information Extraction system for surprise languages.  Special issue of ACM Transactions
       on Asian Language Information Processing: Rapid Development of Language Capabilities:
       The Surprise Languages, 2:295–300, 2003.
       
 [Maynard et al. 03e]   
D. Maynard, V. Tablan, and H. Cunningham. NE recognition without training data on a
       language you don’t speak.  In ACL Workshop on Multilingual and Mixed-language Named
       Entity Recognition: Combining Statistical and Symbolic Models, Sapporo, Japan, 2003.
       
 [Maynard et al. 04a]   
D. Maynard,  K. Bontcheva,  and  H. Cunningham.    Automatic  Language-Independent
       Induction of Gazetteer Lists.  In Proceedings of 4th Language Resources and Evaluation
       Conference (LREC’04), Lisbon, Portugal, 2004. ELRA.
       
 [Maynard et al. 04b]   
D. Maynard,   H. Cunningham,   A. Kourakis,   and   A. Kokossis.       Ontology-Based
       Information Extraction in hTechSight. In First European Semantic Web Symposium (ESWS
       2004), Heraklion, Crete, 2004.
       
 [Maynard et al. 04c]   
D. Maynard,  M. Yankova,  N. Aswani,  and  H. Cunningham.   Automatic  Creation  and
       Monitoring of Semantic Metadata in a Dynamic Knowledge Portal.  In Proceedings of the
                                                                                         
                                                                                         
       11th International Conference on Artificial Intelligence: Methodology, Systems, Applications
       (AIMSA 2004), Varna, Bulgaria, 2004.
       
 [Maynard et al. 06]   
D. Maynard, W. Peters, and Y. Li. Metrics for evaluation of ontology-based information
       extraction.  In WWW 2006 Workshop on Evaluation of Ontologies for the Web (EON),
       Edinburgh, Scotland, 2006.
       
 [Maynard et al. 07a]   
D. Maynard, W. Peters, M. d’Aquin, and M. Sabou.  Change management for metadata
       evolution.  In ESWC International Workshop on Ontology Dynamics (IWOD), Innsbruck,
       Austria, June 2007.
       
 [Maynard et al. 07b]   
D. Maynard, H. Saggion, M. Yankova, K. Bontcheva, and W. Peters. Natural Language
       Technology  for  Information  Integration  in  Business  Intelligence.   In  10th International
       Conference on Business Information Systems (BIS-07), Poznan, Poland, 25-27 April 2007.
       
 [Maynard et al. 08a]   
D. Maynard,  W. Peters,  and  Y. Li.   Evaluating  evaluation  metrics  for  ontology-based
       applications:  Infinite  reflection.   In  Proc. of 6th International Conference on Language
       Resources and Evaluation (LREC), Marrakech, Morocco, 2008.
       
 [Maynard et al. 08b]   
D. Maynard, Y. Li, and W. Peters.  NLP Techniques for Term Extraction and Ontology
       Population.  In P. Buitelaar and P. Cimiano, editors, Bridging the Gap between Text and
       Knowledge - Selected Contributions to Ontology Learning and Population from Text. IOS
       Press, 2008.
       
 [Maynard et al. 09]   
D. Maynard,  A. Funk,  and  W. Peters.     SPRAT:  a  tool  for  automatic  semantic
       pattern-based ontology population.  In International Conference for Digital Libraries and
       the Semantic Web, Trento, Italy, September 2009.
       
 [McDonald & Pereira 05]   
R. McDonald  and  F. Pereira.   Identifying  Gene  and  Protein  Mentions  in  Text  Using
       Conditional Random Fields. BMC Bioinformatics, 6(Suppl 1):S6, 2005.
       
 [McDonald et al. 04]   
R. T.  McDonald,  R. S.  Winters,  M. Mandel,  Y. Jin,  P. S.  White,  and  F. Pereira.
       An  entity  tagger  for  recognizing  acquired  genomic  variations  in  cancer  literature.
       Bioinformatics, 20(17):3249–3251, 2004.
       
 [McEnery et al. 00]   
A. McEnery, P. Baker, R. Gaizauskas, and H. Cunningham. EMILLE: Building a Corpus
       of South Asian Languages. Vivek, A Quarterly in Artificial Intelligence, 13(3):23–32, 2000.
       
 [Osenova & Simov 04]   
P. Osenova  and  K. Simov.    BulTreeBank  stylebook.    Technical  Report  BTB-TR05,
       BulTreeBank Project, May 2004.
       
 [Pastra et al. 02]   
K. Pastra, D. Maynard, H. Cunningham, O. Hamza, and Y. Wilks.  How feasible is the
       reuse                                                                                                              of
       grammars for named entity recognition? In Proceedings of the 3rd Language Resources and
       Evaluation Conference, 2002. http://gate.ac.uk/sale/lrec2002/reusability.ps.
       
 [Peters et al. 98]   
W. Peters,  H. Cunningham,  C. McCauley,  K. Bontcheva,  and  Y. Wilks.     Uniform
       Language Resource Access and Distribution.  In Workshop on Distributing and Accessing
       Lexical Resources at Conference on Language Resources Evaluation, Granada, Spain, 1998.
       
 [Polajnar et al. 05]   
T. Polajnar, V. Tablan, and H. Cunningham.  User-friendly ontology authoring using a
       controlled language.  Technical Report CS Report No. CS-05-10, University of Sheffield,
       Sheffield, UK, 2005.
       
 [Porter 80]   
M. Porter. An algorithm for suffix stripping. Program, 14(3):130–137, 1980.
       
 [Ramshaw & Marcus 95]   
L. Ramshaw and M. Marcus.  Text Chunking Using Transformation-Based Learning.  In
       Proceedings of the Third ACL Workshop on Very Large Corpora, 1995.
       
 [Saggion & Funk 09]   
H. Saggion and A. Funk.  Extracting opinions and facts for business intelligence.  RNTI
       Journal, E(17):119–146, November 2009.
       
 [Saggion & Gaizauskas 04a]   
H. Saggion  and  R. Gaizauskas.   Mining  on-line  sources  for  definition  knowledge.   In
       Proceedings of the 17th FLAIRS 2004, Miami Bearch, Florida, USA, May 17-19 2004. AAAI.
       
 [Saggion & Gaizauskas 04b]   
H. Saggion and R. Gaizauskas. Multi-document summarization by cluster/profile relevance
       and redundancy removal. In Proceedings of the Document Understanding Conference 2004.
       NIST, 2004.
       
 [Saggion & Gaizauskas 05]   
H. Saggion and R. Gaizauskas. Experiments on statistical and pattern-based biographical
       summarization. In Proceedings of EPIA 2005, pages 611–621, 2005.
       
 [Saggion 04]   
H. Saggion.   Identifying definitions in text collections for question answering. lrec.   In
       Proceedings of Language Resources and Evaluation Conference. ELDA, 2004.
       
 [Saggion 06]   
H. Saggion.    Multilingual  Multidocument  Summarization  Tools  and  Evaluation.    In
       Proceedings of LREC 2006, 2006.
       
 [Saggion 07]   
H. Saggion.      Shef:   Semantic   tagging   and   summarization   techniques   applied   to
       cross-document   coreference.      In   Proceedings  of  SemEval  2007,  Assocciation  for
       Computational Linguistics, pages 292–295, June 2007.
       
 [Saggion et al. 02a]   
H. Saggion,  H. Cunningham,  K. Bontcheva,  D. Maynard,  C. Ursu,  O. Hamza,  and
       Y. Wilks.   Access  to  Multimedia  Information  through  Multisource  and  Multilanguage
       Information Extraction.  In Proceedings of the 7th Workshop on Applications of Natural
       Language to Information Systems (NLDB 2002), Stockholm, Sweden, 2002.
       
 [Saggion et al. 02b]   
H. Saggion,  H. Cunningham,  D. Maynard,  K. Bontcheva,  O. Hamza,  C. Ursu,  and
       Y. Wilks.  Extracting Information for Information Indexing of Multimedia Material.  In
       Proceedings of 3rd Language Resources and Evaluation Conference (LREC’2002), 2002.
       http://gate.ac.uk/sale/lrec2002/mumis_lrec2002.ps.
       
 [Saggion et al. 03a]   
H. Saggion,  K. Bontcheva,  and  H. Cunningham.    Robust  Generic  and  Query-based
       Summarisation.   In  Proceedings of the European Chapter of Computational Linguistics
       (EACL), Research Notes and Demos, 2003.
       
 [Saggion et al. 03b]   
H. Saggion,  H. Cunningham,  K. Bontcheva,  D. Maynard,  O. Hamza,  and  Y. Wilks.
       Multimedia Indexing through Multisource and Multilingual Information Extraction; the
       MUMIS project. Data and Knowledge Engineering, 48:247–264, 2003.
       
 [Saggion et al. 03c]   
H. Saggion,                 J. Kuper,                 H. Cunningham,                 T. Declerck,
       P. Wittenburg, M. Puts, F. DeJong, and Y. Wilks.  Event-coreference across Multiple,
       Multi-lingual Sources in the Mumis Project.  In Proceedings of the European Chapter of
       Computational Linguistics (EACL), Research Notes and Demos, 2003.
       
 [Saggion et al. 07]   
H. Saggion,  A. Funk,  D. Maynard,  and  K. Bontcheva.    Ontology-based  information
       extraction for business applications. In Proceedings of the 6th International Semantic Web
       Conference (ISWC 2007), Busan, Korea, November 2007.
       
 [Schwartz & Hearst 03]   
A. S.  Schwartz  and  M. A.  Hearst.    A  simple  algorithm  for  identifying  abbreviation
       definitions in biomedical text. Pacific Symposium on Biocomputing. Pacific Symposium on
       Biocomputing, pages 451–462, 2003.
       
 [Scott & Gaizauskas. 00]   
S. Scott and R. Gaizauskas.  The University of Sheffield TREC-9 Q&A System.  In In
       Proceedings of the 9th Text REtrieval Conference, 2000.
       
 [Settles 05]   
B. Settles.  ABNER: An open source tool for automatically tagging genes, proteins, and
       other entity names in text. Bioinformatics, 21(14):3191–3192, 2005.
       
 [Shaw & Garlan 96]   
M. Shaw and D. Garlan. Software Architecture. Prentice Hall, New York, 1996.
       
 [Simov & Osenova 03]   
K. Simov and P. Osenova. Practical annotation scheme for an HPSG treebank of Bulgarian.
       In Proceedings of the 4th International Workshop on Linguistically Interpreteted Corpora
       (LINC-2003), Budapest, Hungary, 2003.
       
 [Simov et al. 02]   
K. Simov,  G. Popova,  and  P. Osenova.   HPSG-based  syntactic  treebank  of  Bulgarian
       (BulTreeBank). In A. Wilson, P. Rayson, and T. McEnery, editors, A Rainbow of Corpora:
       Corpus  Linguistics  and  the  Languages  of  the  World,  pages  135–142.  Lincom-Europa,
       Munich, 2002.
       
 [Simov et al. 04a]   
K. Simov, P. Osenova, A. Simov, and M. Kouylekov.  Design and implementation of the
                                                                                         
                                                                                         
       Bulgarian  HPSG-based  treebank.   Journal of Research on Language and Computation,
       2(4):495–522, December 2004.
       
 [Simov et al. 04b]   
K. Simov, P. Osenova, and M. Slavcheva. BulTreeBank morphosyntactic tagset. Technical
       Report BTB-TR03, BulTreeBank Project, March 2004.
       
 [Stevenson et al. 98]   
M. Stevenson, H. Cunningham, and Y. Wilks.  Sense tagging and language engineering.
       In Proceedings of the 13th European Conference on Artificial Intelligence (ECAI-98), pages
       185–189, Brighton, U.K., 1998.
       
 [Tablan et al. 02]   
V. Tablan,                                     C. Ursu,                                     K. Bontcheva,
       H. Cunningham,  D. Maynard,  O. Hamza,  T. McEnery,  P. Baker,  and  M. Leisher.   A
       Unicode-based Environment for Creation and Use of Language Resources. In 3rd Language
       Resources and Evaluation Conference, Las Palmas, Canary Islands – Spain, 2002. ELRA.
       http://gate.ac.uk/sale/iesl03/iesl03.pdf.
       
 [Tablan et al. 03]   
V. Tablan, K. Bontcheva, D. Maynard, and H. Cunningham.  Ollie: on-line learning for
       information extraction.  In SEALTS ’03: Proceedings of the HLT-NAACL 2003 workshop
       on  Software  engineering  and  architecture  of  language  technology  systems,  volume 8,
       pages  17–24,  Morristown,  NJ,  USA,  2003.  Association  for  Computational  Linguistics.
       http://gate.ac.uk/sale/hlt03/ollie-sealts.pdf.
       
 [Tablan et al. 06a]   
V. Tablan, W. Peters, D. Maynard, H. Cunningham, and K. Bontcheva.  Creating tools
       for  morphological  analysis  of  sumerian.    In  5th  Language  Resources  and  Evaluation
       Conference (LREC), Genoa, Italy, May 2006. ELRA.
       
 [Tablan et al. 06b]   
V. Tablan,  T. Polajnar,  H. Cunningham,  and  K. Bontcheva.    User-friendly  Ontology
       Authoring  Using  a  Controlled  Language.   In  5th  Language  Resources  and  Evaluation
       Conference (LREC), Genoa, Italy, May 2006. ELRA.
                                                                                         
                                                                                         
       
 [Tablan et al. 08]   
V. Tablan,  D. Damljanovic,  and  K. Bontcheva.   A  Natural  Language  Query  Interface
       to Structured Information.  In Proceedings of the 5h European Semantic Web Conference
       (ESWC, 1–5 June 2008), volume 5021 of Lecture Notes in Computer Science, pages 361–375,
       Tenerife, Spain, 1–5 June 2008. Springer-Verlag New York Inc.
       
 [Tanabe & Wilbur 02]   
L. Tanabe and W. J. Wilbur.  Tagging Gene and Protein Names in Full Text Articles.
       In Proceedings of the ACL-02 workshop on Natural Language Processing in the biomedical
       domain, 7–12 July 2002, volume 3, pages 9–13, Philadelphia, PA, 2002. Association for
       Computational Linguistics.
       
 [Toutanova et al. 03]   
K. Toutanova,  D. Klein,  C. D.  Manning,  and  Y. Singer.    Feature-rich  part-of-speech
       tagging  with  a  cyclic  dependency  network.   In  Proceedings  of  the  2003  Conference  of
       the North American Chapter of the Association for Computational Linguistics on Human
       Language Technology, NAACL ’03, pages 173–180, 2003.
       
 [Tsuruoka et al. 05]   
Y. Tsuruoka, Y. Tateishi, J.-D. Kim, T. Ohta, J. McNaught, S. Ananiadou, and J. Tsujii.
       Developing  a  robust  part-of-speech  tagger  for  biomedical  text.    In  P. Bozanis  and
       E. Houstis,  editors,  Advances  in  Informatics:  Proceedings  of  the  10th  Panhellenic
       Conference on Informatics (PCI 2005), 11–13 November 2005, volume 3746 of Lecture Notes
       in Computer Science, pages 382–392, Volas, Greece, 2005. Springer Berlin Heidelberg.
       
 [Ursu et al. 05]   
C. Ursu,  T. Tablan,  H. Cunningham,  and  B. Popav.   Digital  media  preservation  and
       access through semantically enhanced web-annotation. In Proceedings of the 2nd European
       Workshop  on  the  Integration  of  Knowledge,  Semantic  and  Digital  Media  Technologies
       (EWIMT 2005), London, UK, December 01 2005.
       
 [van Rijsbergen 79]   
C. van Rijsbergen. Information Retrieval. Butterworths, London, 1979.
       
 [Wang et al. 05]   
T. Wang,  D. Maynard,  W. Peters,  K. Bontcheva,  and  H. Cunningham.   Extracting  a
       domain ontology from linguistic resource based on relatedness measurements. In Proceedings
       of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2005),
       pages 345–351, Compiegne, France, Septmeber 2005.
       
 [Wang et al. 06]   
T. Wang, Y. Li, K. Bontcheva, H. Cunningham, and J. Wang.   Automatic Extraction
       of Hierarchical Relations from Text.  In Proceedings of the Third European Semantic Web
       Conference (ESWC 2006), Budva, Montenegro, 2006.
       
 [Wood et al. 03]   
M. M. Wood, S. J. Lydon, V. Tablan, D. Maynard, and H. Cunningham. Using parallel
       texts to improve recall in IE. In Recent Advances in Natural Language Processing, Bulgaria,
       2003.
       
 [Wood et al. 04]   
M. Wood,  S. Lydon,  V. Tablan,  D. Maynard,  and  H. Cunningham.     Populating  a
       Database from Parallel Texts using Ontology-based Information Extraction. In Proceedings
       of NLDB 2004, 2004. http://gate.ac.uk/sale/nldb2004/NLDB.pdf.
       
 [Yourdon 89]   
E. Yourdon. Modern Structured Analysis. Prentice Hall, New York, 1989.
       
 [Yourdon 96]   
E. Yourdon. The Rise and Resurrection of the American Programmer. Prentice Hall, New
       York, 1996.




