References
[Agatonovic et al. 08]
M. Agatonovic, N. Aswani, K. Bontcheva, H. Cunningham, T. Heitz, Y. Li,
I. Roberts, and V. Tablan. Large-scale, parallel automatic patent annotation.
In Proc. of 1st International CIKM Workshop on Patent Information Retrieval -
PaIR’08, Napa Valley, California, USA, October 30 2008.
[Aho et al. 86]
A. V. Aho, R. Sethi, and J. D. Ullman. Compilers Principles, Techniques, and
Tools. Addison-Wesley, Reading, Massachusetts, 1986.
[Aswani et al. 05]
N. Aswani, V. Tablan, K. Bontcheva, and H. Cunningham. Indexing and Querying
Linguistic Metadata and Document Content. In Proceedings of Fifth International
Conference on Recent Advances in Natural Language Processing (RANLP2005),
Borovets, Bulgaria, 2005.
[Aswani et al. 06]
N. Aswani, K. Bontcheva, and H. Cunningham. Mining information for instance
unification. In 5th International Semantic Web Conference (ISWC2006), Athens,
Georgia, USA, 2006.
[Azar 89]
S. Azar. Understanding and Using English Grammar. Prentice Hall Regents, 1989.
[Baker et al. 02]
P. Baker, A. Hardie, T. McEnery, H. Cunningham, and R. Gaizauskas. EMILLE,
A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and
Harmonisation. In Proceedings of 3rd Language Resources and Evaluation
Conference (LREC’2002), pages 819–825, 2002.
[Bird & Liberman 99]
S. Bird and M. Liberman. A Formal Framework for Linguistic Annotation.
Technical Report MS-CIS-99-01, Department of Computer and Information Science,
University of Pennsylvania, 1999. http://xxx.lanl.gov/abs/cs.CL/9903003.
[Bontcheva & Sabou 06]
K. Bontcheva and M. Sabou. Learning Ontologies from Software Artifacts:
Exploring and Combining Multiple Sources. In Workshop on Semantic Web Enabled
Software Engineering (SWESE), Athens, G.A., USA, November 2006.
[Bontcheva 04]
K. Bontcheva. Open-source Tools for Creation, Maintenance, and Storage of Lexical
Resources for Language Generation from Ontologies. In Proceedings of 4th Language
Resources and Evaluation Conference (LREC’04), 2004.
[Bontcheva 05]
K. Bontcheva. Generating Tailored Textual Summaries from Ontologies. In Second
European Semantic Web Conference (ESWC’2005), 2005.
[Bontcheva et al. 00]
K. Bontcheva, H. Brugman, A. Russel, P. Wittenburg, and H. Cunningham. An
Experiment in Unifying Audio-Visual and Textual Infrastructures for Language
Processing R&D. In Proceedings of the Workshop on Using Toolsets and
Architectures To Build NLP Systems at COLING-2000, Luxembourg, 2000.
http://gate.ac.uk/.
[Bontcheva et al. 02a]
K. Bontcheva, H. Cunningham, V. Tablan, D. Maynard, and O. Hamza. Using
GATE as an Environment for Teaching NLP. In Proceedings of the ACL
Workshop on Effective Tools and Methodologies in Teaching NLP, 2002.
http://gate.ac.uk/sale/acl02/gate4teaching.pdf.
[Bontcheva et al. 02b]
K. Bontcheva, H. Cunningham, V. Tablan, D. Maynard, and H. Saggion.
Developing Reusable and Robust Language Processing Components for Information
Systems using GATE. In Proceedings of the 3rd International Workshop on Natural
Language and Information Systems (NLIS’2002), Aix-en-Provence, France, 2002.
IEEE Computer Society Press. http://gate.ac.uk/sale/nlis/nlis.ps.
[Bontcheva et al. 02c]
K. Bontcheva, M. Dimitrov, D. Maynard, V. Tablan, and H. Cunningham.
Shallow Methods for Named Entity Coreference Resolution. In Chaînes de
références et résolveurs d’anaphores, workshop TALN 2002, Nancy, France, 2002.
http://gate.ac.uk/sale/taln02/taln-ws-coref.pdf.
[Bontcheva et al. 03]
K. Bontcheva, A. Kiryakov, H. Cunningham, B. Popov, and M. Dimitrov.
Semantic web enabled, open source language technology. In EACL workshop on
Language Technology and the Semantic Web: NLP and XML, Budapest, Hungary,
2003. http://gate.ac.uk/sale/eacl03-semweb/bontcheva-etal-final.pdf.
[Bontcheva et al. 04]
K. Bontcheva, V. Tablan, D. Maynard, and H. Cunningham. Evolving GATE to
Meet New Challenges in Language Engineering. Natural Language Engineering,
10(3/4):349—373, 2004.
[Bontcheva et al. 06a]
K. Bontcheva, H. Cunningham, A. Kiryakov, and V. Tablan. Semantic Annotation
and Human Language Technology. In J. Davies, R. Studer, and P. Warren, editors,
Semantic Web Technology: Trends and Research. John Wiley and Sons, 2006.
[Bontcheva et al. 06b]
K. Bontcheva, J. Davies, A. Duke, T. Glover, N. Kings, and I. Thurlow. Semantic
Information Access. In J. Davies, R. Studer, and P. Warren, editors, Semantic Web
Technologies. John Wiley and Sons, 2006.
[Bontcheva et al. 09]
K. Bontcheva, B. Davis, A. Funk, Y. Li, and T. Wang. Human Language
Technologies. In J. Davies, M. Grobelnik, and D. Mladenic, editors, Semantic
Knowledge Management, pages 37–49. 2009.
[Booch 94]
G. Booch. Object-Oriented Analysis and Design 2nd Edn. Benjamin/Cummings,
1994.
[Brugman et al. 99]
H. Brugman, K. Bontcheva, P. Wittenburg, and H. Cunningham. Integrating
Multimedia and Textual Software Architectures for Language Technology. Technical
report MPI-TG-99-1, Max-Planck Institute for Psycholinguistics, Nijmegen,
Netherlands, 1999.
[Carletta 96]
J. Carletta. Assessing agreement on classification tasks: the Kappa statistic.
Computational Linguistics, 22(2):249–254, 1996.
[CC001]
LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[Chinchor 92]
N. Chinchor. MUC-4 Evaluation Metrics. In Proceedings of the Fourth Message
Understanding Conference, pages 22–29, 1992.
[Cimiano et al. 03]
P. Cimiano, S.Staab, and J. Tane. Automatic Acquisition of Taxonomies from Text:
FCA meets NLP. In Proceedings of the ECML/PKDD Workshop on Adaptive Text
Extraction and Mining, pages 10–17, Cavtat-Dubrovnik, Croatia, 2003.
[Cobuild 99]
C. Cobuild, editor. English Grammar. Harper Collins, 1999.
[Cunningham & Bontcheva 05]
H. Cunningham and K. Bontcheva. Computational Language Systems,
Architectures. Encyclopedia of Language and Linguistics, 2nd Edition, pages
733–752, 2005.
[Cunningham & Scott 04a]
H. Cunningham and D. Scott. Introduction to the Special Issue on Software
Architecture for Language Engineering. Natural Language Engineering, 2004.
http://gate.ac.uk/sale/jnle-sale/intro/intro-main.pdf.
[Cunningham & Scott 04b]
H. Cunningham and D. Scott, editors. Special Issue of Natural Language
Engineering on Software Architecture for Language Engineering. Cambridge
University Press, 2004.
[Cunningham 94]
H. Cunningham. Support Software for Language Engineering Research. Technical
Report 94/05, Centre for Computational Linguistics, UMIST, Manchester, 1994.
[Cunningham 99a]
H. Cunningham. A Definition and Short History of Language Engineering. Journal
of Natural Language Engineering, 5(1):1–16, 1999.
[Cunningham 99b]
H. Cunningham. JAPE: a Java Annotation Patterns Engine. Research
Memorandum CS–99–06, Department of Computer Science, University of Sheffield,
May 1999.
[Cunningham 00]
H. Cunningham. Software Architecture for Language Engineering. Unpublished
PhD thesis, University of Sheffield, 2000. http://gate.ac.uk/sale/thesis/.
[Cunningham 02]
H. Cunningham. GATE, a General Architecture for Text Engineering. Computers
and the Humanities, 36:223–254, 2002.
[Cunningham 05]
H. Cunningham. Information Extraction, Automatic. Encyclopedia of Language
and Linguistics, 2nd Edition, pages 665–677, 2005.
[Cunningham et al. 94]
H. Cunningham, M. Freeman, and W. Black. Software Reuse, Object-Oriented
Frameworks and Natural Language Processing. In New Methods in Language
Processing (NeMLaP-1), September 1994, Manchester, 1994. (Re-published in book
form 1997 by UCL Press).
[Cunningham et al. 95]
H. Cunningham, R. Gaizauskas, and Y. Wilks. A General Architecture for Text
Engineering (GATE) – a new approach to Language Engineering R&D. Technical
Report CS–95–21, Department of Computer Science, University of Sheffield, 1995.
http://xxx.lanl.gov/abs/cs.CL/9601009.
[Cunningham et al. 96a]
H. Cunningham, K. Humphreys, R. Gaizauskas, and M. Stower. CREOLE
Developer’s Manual. Technical report, Department of Computer Science, University
of Sheffield, 1996. http://www.dcs.shef.ac.uk/nlp/gate.
[Cunningham et al. 96b]
H. Cunningham,
K. Humphreys, R. Gaizauskas, and Y. Wilks. TIPSTER-Compatible Projects at
Sheffield. In Advances in Text Processing, TIPSTER Program Phase II. DARPA,
Morgan Kaufmann, California, 1996.
[Cunningham et al. 96c]
H. Cunningham, Y. Wilks, and R. Gaizauskas. GATE – a General Architecture
for Text Engineering. In Proceedings of the
16th Conference on Computational Linguistics (COLING-96), Copenhagen, August
1996. ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun96b.ps.
[Cunningham et al. 96d]
H. Cunningham, Y. Wilks, and R. Gaizauskas. Software Infrastructure for
Language Engineering. In Proceedings of the AISB Workshop on Language
Engineering for Document Analysis and Recognition, Brighton, U.K., April 1996.
[Cunningham et al. 96e]
H. Cunningham, Y. Wilks, and R. Gaizauskas. New Methods, Current Trends and
Software Infrastructure for NLP. In Proceedings of the Conference on New Methods
in Natural Language Processing (NeMLaP-2), Bilkent University, Turkey, September
1996. ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun96c.ps.
[Cunningham et al. 97a]
H. Cunningham,
K. Humphreys, R. Gaizauskas, and Y. Wilks. GATE – a TIPSTER-based General
Architecture for Text Engineering. In Proceedings of the TIPSTER Text Program
(Phase III) 6 Month Workshop. DARPA, Morgan Kaufmann, California, May 1997.
ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun97e.ps.
[Cunningham et al. 97b]
H. Cunningham, K. Humphreys, R. Gaizauskas, and Y. Wilks. Software
Infrastructure for Natural Language Processing. In Proceedings of the 5th
Conference on Applied Natural Language Processing (ANLP-97), March 1997.
ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun97a.ps.gz.
[Cunningham et al. 98a]
H. Cunningham, W. Peters, C. McCauley, K. Bontcheva, and Y. Wilks. A Level
Playing Field for Language Resource Evaluation. In Workshop on Distributing
and Accessing Lexical Resources at Conference on Language Resources Evaluation,
Granada, Spain, 1998. http://www.dcs.shef.ac.uk/ hamish/dalr.
[Cunningham et al. 98b]
H. Cunningham, M. Stevenson, and Y. Wilks. Implementing a Sense Tagger within
a General Architecture for Language Engineering. In Proceedings of the Third
Conference on New Methods in Language Engineering (NeMLaP-3), pages 59–72,
Sydney, Australia, 1998.
[Cunningham et al. 99]
H. Cunningham, R. Gaizauskas, K. Humphreys, and Y. Wilks. Experience with
a Language Engineering Architecture: Three Years of GATE. In Proceedings of
the AISB’99 Workshop on Reference Architectures and Data Standards for NLP,
Edinburgh, April 1999. The Society for the Study of Artificial Intelligence and
Simulation of Behaviour. http://www.dcs.shef.ac.uk/ hamish/GateAisb99.html.
[Cunningham et al. 00a]
H. Cunningham, K. Bontcheva, W. Peters, and Y. Wilks. Uniform language
resource access and distribution in the context of a General Architecture for
Text Engineering (GATE). In Proceedings of the Workshop on Ontologies
and Language Resources (OntoLex’2000), Sozopol, Bulgaria, September 2000.
http://gate.ac.uk/sale/ontolex/ontolex.ps.
[Cunningham et al. 00b]
H. Cunningham, K. Bontcheva, V. Tablan, and Y. Wilks. Software Infrastructure
for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis.
In Proceedings of the 2nd International Conference on Language Resources and
Evaluation (LREC-2), Athens, 2000. http://gate.ac.uk/.
[Cunningham et al. 00c]
H. Cunningham,
D. Maynard, K. Bontcheva, V. Tablan, and Y. Wilks. Experience of using GATE
for NLP R&D. In Proceedings of the Workshop on Using Toolsets and Architectures
To Build NLP Systems at COLING-2000, Luxembourg, 2000. http://gate.ac.uk/.
[Cunningham et al. 00d]
H. Cunningham, D. Maynard, and V. Tablan. JAPE: a Java Annotation Patterns
Engine (Second Edition). Research Memorandum CS–00–10, Department of
Computer Science, University of Sheffield, November 2000.
[Cunningham et al. 02]
H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: A Framework
and Graphical Development Environment for Robust NLP Tools and Applications.
In Proceedings of the 40th Anniversary Meeting of the Association for Computational
Linguistics (ACL’02), 2002.
[Cunningham et al. 03]
H. Cunningham, V. Tablan,
K. Bontcheva, and M. Dimitrov. Language Engineering Tools for Collaborative
Corpus Annotation. In Proceedings of Corpus Linguistics 2003, Lancaster, UK, 2003.
http://gate.ac.uk/sale/cl03/distrib-ollie-cl03.doc.
[Damljanovic & Bontcheva 08]
D. Damljanovic and K. Bontcheva. Enhanced Semantic Access to Software
Artefacts. In Workshop on Semantic Web Enabled Software Engineering (SWESE),
Karlsruhe, Germany, October 2008.
[Damljanovic et al. 08]
D. Damljanovic, V. Tablan, and K. Bontcheva. A text-based query interface to
owl ontologies. In 6th Language Resources and Evaluation Conference (LREC),
Marrakech, Morocco, May 2008. ELRA.
[Damljanovic et al. 09]
D. Damljanovic, F. Amardeilh, and K. Bontcheva. CA Manager Framework:
Creating Customised Workflows for Ontology Population and Semantic Annotation.
In Proceedings of The Fifth International Conference on Knowledge Capture
(KCAP’09), California, USA, September 2009.
[Davies & Fleiss 82]
M. Davies and J. Fleiss. Measuring Agreement for Multinomial Data. Biometrics,
38:1047–1051, 1982.
[Davis et al. 06]
B. Davis, S. Handschuh, H. Cunningham, and V. Tablan. Further use of
Controlled Natural Language for Semantic Annotation of Wikis. In Proceedings
of the 1st Semantic Authoring and Annotation Workshop at ISWC2006, Athens,
Georgia, USA, November 2006.
[Della Valle et al. 08]
E. Della Valle, D. Cerizza, I. Celino, A. Turati, H. Lausen, N. Steinmetz,
M. Erdmann, and A. Funk. Realizing Service-Finder: Web service discovery at web
scale. In European Semantic Technology Conference (ESTC), Vienna, September
2008.
[Dimitrov 02a]
M. Dimitrov. A Light-weight Approach to Coreference Resolution for
Named Entities in Text. MSc Thesis, University of Sofia, Bulgaria, 2002.
http://www.ontotext.com/ie/thesis-m.pdf.
[Dimitrov 02b]
M. Dimitrov. A Light-weight Approach to Coreference Resolution for
Named Entities in Text. MSc Thesis, University of Sofia, Bulgaria, 2002.
http://www.ontotext.com/ie/thesis-m.pdf.
[Dimitrov et al. 02]
M. Dimitrov, K. Bontcheva, H. Cunningham, and D. Maynard. A Light-weight
Approach to Coreference Resolution for Named Entities in Text. In Proceedings
of the Fourth Discourse Anaphora and Anaphor Resolution Colloquium (DAARC),
Lisbon, 2002.
[Dimitrov et al. 04]
M. Dimitrov, K. Bontcheva, H. Cunningham, and D. Maynard. A Light-weight
Approach to Coreference Resolution for Named Entities in Text. In A. Branco,
T. McEnery, and R. Mitkov, editors, Anaphora Processing: Linguistic, Cognitive
and Computational Modelling. John Benjamins, 2004.
[Dowman et al. 05a]
M. Dowman, V. Tablan, H. Cunningham, and B. Popov. Content
augmentation for mixed-mode news broadcasts. In Proceedings of the
3rd European Conference on Interactive Television: User Centred ITV
Systems, Programmes and Applications, Aalborg University, Denmark, 2005.
http://gate.ac.uk/sale/euro-itv-2005/content-augmentation-for-mixed-mode-news-broadcast-consumption.pdf.
[Dowman et al. 05b]
M. Dowman, V. Tablan, H. Cunningham, and B. Popov. Web-assisted annotation,
semantic indexing and search of television and radio news. In Proceedings
of the 14th International World Wide Web Conference, Chiba, Japan, 2005.
http://gate.ac.uk/sale/www05/web-assisted-annotation.pdf.
[Dowman et al. 05c]
M. Dowman, V. Tablan, H. Cunningham, C. Ursu, and B. Popov. Semantically
enhanced television news through web and video integration. In Second European
Semantic Web Conference (ESWC’2005), 2005.
[Eugenio & Glass 04]
B. D. Eugenio and M. Glass. The kappa statistic: a second look. Computational
Linguistics, 1(30), 2004. (squib).
[Fleiss 75]
J. L. Fleiss. Measuring agreement between two judges on the presence or absence
of a trait. Biometrics, 31:651–659, 1975.
[Frakes & Baeza-Yates 92]
W. Frakes and R. Baeza-Yates, editors. Information retrieval, data structures and
algorithms. Prentice Hall, New York, Englewood Cliffs, N.J., 1992.
[Funk et al. 07a]
A. Funk, D. Maynard, H. Saggion, and K. Bontcheva. Ontological integration
of information extracted from multiple sources. In Multi-source Multilingual
Information Extraction and Summarization (MMIES) workshop at Recent Advances
in Natural Language Processing (RANLP07), Borovets, Bulgaria, September 2007.
[Funk et al. 07b]
A. Funk, V. Tablan, K. Bontcheva, H. Cunningham, B. Davis, and S. Handschuh.
CLOnE: Controlled Language for Ontology Editing. In Proceedings of the 6th
International Semantic Web Conference (ISWC 2007), Busan, Korea, November
2007.
[Gaizauskas et al. 95]
R. Gaizauskas, T. Wakao, K. Humphreys, H. Cunningham, and Y. Wilks.
Description of the LaSIE system as used for MUC-6. In Proceedings of the Sixth
Message Understanding Conference (MUC-6). Morgan Kaufmann, California, 1995.
[Gaizauskas et al. 96a]
R. Gaizauskas, P. Rodgers, H. Cunningham, and K. Humphreys. GATE User
Guide. http://www.dcs.shef.ac.uk/nlp/gate, 1996.
[Gaizauskas et al. 96b]
R. Gaizauskas, H. Cunningham, Y. Wilks, P. Rodgers, and K. Humphreys. GATE
– an Environment to Support Research and Development in Natural Language
Engineering. In Proceedings of the 8th IEEE International Conference on
Tools with Artificial Intelligence (ICTAI-96), Toulouse, France, October 1996.
ftp://ftp.dcs.shef.ac.uk/home/robertg/ictai96.ps.
[Gaizauskas et al. 03]
R. Gaizauskas, M. A. Greenwood, M. Hepple, I. Roberts, H. Saggion, and
M. Sargaison. The University of Sheffields TREC 2003 Q&A Experiments. In In
Proceedings of the 12th Text REtrieval Conference, 2003.
[Gaizauskas et al. 04]
R. Gaizauskas, M. A. Greenwood, M. Hepple, I. Roberts, H. Saggion, and
M. Sargaison. The University of Sheffields TREC 2004 Q&A Experiments. In In
Proceedings of the 13th Text REtrieval Conference, 2004.
[Gaizauskas et al. 05]
R. Gaizauskas, M. Greenwood, M. Hepple, H. Harkema, H. Saggion, and
A. Sanka. The University of Sheffields TREC 2005 Q&A Experiments. In In
Proceedings of the 11th Text REtrieval Conference, 2005.
[Gambäck & Olsson 00]
B. Gambäck and F. Olsson. Experiences of Language Engineering Algorithm Reuse.
In Second International Conference on Language Resources and Evaluation (LREC),
pages 155–160, Athens, Greece, 2000.
[Gazdar & Mellish 89]
G. Gazdar and C. Mellish. Natural Language Processing in Prolog. Addison-Wesley,
Reading, MA, 1989.
[Greenwood et al. 02]
M. Greenwood, I. Roberts, and R. Gaizauskas. The University of Sheffields TREC
2002 Q&A Experiments. In In Proceedings of the 11th Text REtrieval Conference,
2002.
[Grishman 97]
R. Grishman.
TIPSTER Architecture Design Document Version 2.3. Technical report, DARPA,
1997. http://www.itl.nist.gov/div894/894.02/related_projects/tipster/.
[Hepple 00]
M. Hepple. Independence and commitment: Assumptions for rapid training and
execution of rule-based POS taggers. In Proceedings of the 38th Annual Meeting
of the Association for Computational Linguistics (ACL-2000), Hong Kong, October
2000.
[Hripcsak & Heitjan 02]
G. Hripcsak and D. Heitjan. Measuring agreement in medical informatics reliability
studies. Journal of Biomedical Informatics, 35:99–110, 2002.
[Hripcsak & Rothschild 05]
G. Hripcsak and A. S. Rothschild. Agreement, the F-measure, and Reliability in
Information Retrieval. Journal of the American Medical Informatics Association,
12(3):296–298, 2005.
[Humphreys et al. 96]
K. Humphreys, R. Gaizauskas, H. Cunningham, and S. Azzam. CREOLE Module
Specifications. http://www.dcs.shef.ac.uk/nlp/gate/, 1996.
[Humphreys et al. 98]
K. Humphreys, R. Gaizauskas, S. Azzam, C. Huyck, B. Mitchell,
H. Cunningham, and Y. Wilks. Description of the LaSIE system as used for
MUC-7. In Proceedings of the Seventh Message Understanding Conference (MUC-7).
http://www.itl.nist.gov/iaui/894.02/related_projects/muc/index.html, 1998.
[Humphreys et al. 99]
K. Humphreys, R. Gaizauskas, M. Hepple, and M. Sanderson. The University
of Sheffield TREC-8 Q&A System. In In Proceedings of the 8th Text REtrieval
Conference, 1999.
[Jackson 75]
M. Jackson. Principles of Program Design. Academic Press, London, 1975.
[Kiryakov 03]
A. Kiryakov. Ontology and Reasoning in MUMIS: Towards the Semantic Web.
Technical Report CS–03–03, Department of Computer Science, University of
Sheffield, 2003. http://gate.ac.uk/gate/doc/papers.html.
[Laclavik & Maynard 09]
M. Laclavik and D. Maynard. Motivating intelligent email in business: an
investigation into current trends for email processing and communication research.
In Proceedings of Workshop on Emails in e-Commerce and Enterprise Context, 11th
IEEE Conference on Commerce and Enterprise Computing, Vienna, Austria, 2009.
[Lal & Ruger 02]
P. Lal and S. Ruger. Extract-based summarization with simplification. In
Proceedings of the ACL 2002 Automatic Summarization / DUC 2002 Workshop,
2002. http://www.doc.ic.ac.uk/ srueger/pr-p.lal-2002/duc02-final.pdf.
[Lal 02]
P. Lal. Text summarisation. Unpublished M.Sc. thesis, Imperial College, London,
2002.
[Li & Bontcheva 08]
Y. Li and K. Bontcheva. Adapting support vector machines for f-term-based
classification of patents. ACM Transactions on Asian Language Information
Processing, 7(2):7:1–7:19, 2008.
[Li & Cunningham 08]
Y. Li and H. Cunningham. Geometric and Quantum Methods for Information
Retrieval. SIGIR Forum, 42(2):22–32, 2008.
[Li & Shawe-Taylor 03]
Y. Li and J. Shawe-Taylor. The SVM with Uneven Margins and Chinese Document
Categorization. In Proceedings of The 17th Pacific Asia Conference on Language,
Information and Computation (PACLIC17), Singapore, Oct. 2003.
[Li & Shawe-Taylor 06]
Y. Li and J. Shawe-Taylor. Using KCCA for Japanese-English Cross-language
Information Retrieval and Document Classification. Journal of Intelligent
Information Systems, 27(2):117–133, 2006.
[Li & Shawe-Taylor 07]
Y. Li and J. Shawe-Taylor. Advanced Learning Algorithms for Cross-language
Patent Retrieval and Classification. Information Processing and Management,
43(5):1183–1199, 2007.
[Li et al. 02]
Y. Li, H. Zaragoza, R. Herbrich, J. Shawe-Taylor, and J. Kandola. The
Perceptron Algorithm with Uneven Margins. In Proceedings of the 9th International
Conference on Machine Learning (ICML-2002), pages 379–386, 2002.
[Li et al. 04]
Y. Li, K. Bontcheva, and H. Cunningham. An SVM Based Learning Algorithm
for Information Extraction. Machine Learning Workshop, Sheffield, 2004.
http://gate.ac.uk/sale/ml-ws04/mlw2004.pdf.
[Li et al. 05a]
Y. Li, K. Bontcheva, and H. Cunningham. SVM Based Learning System
For Information Extraction. In M. N. J. Winkler and N. Lawerence, editors,
Deterministic and Statistical Methods in Machine Learning, LNAI 3635, pages
319–339. Springer Verlag, 2005.
[Li et al. 05b]
Y. Li, K. Bontcheva, and H. Cunningham. Using Uneven Margins SVM and
Perceptron for Information Extraction. In Proceedings of Ninth Conference on
Computational Natural Language Learning (CoNLL-2005), 2005.
[Li et al. 05c]
Y. Li, C. Miao, K. Bontcheva, and H. Cunningham. Perceptron Learning for
Chinese Word Segmentation. In Proceedings of Fourth SIGHAN Workshop on
Chinese Language processing (Sighan-05), pages 154–157, Korea, 2005.
[Li et al. 07a]
Y. Li, K. Bontcheva, and H. Cunningham. Hierarchical, Perceptron-like Learning
for Ontology Based Information Extraction. In 16th International World Wide Web
Conference (WWW2007), pages 777–786, May 2007.
[Li et al. 07b]
Y. Li, K. Bontcheva, and H. Cunningham. Cost Sensitive Evaluation Measures for
F-term Patent Classification. In The First International Workshop on Evaluating
Information Access (EVIA 2007), pages 44–53, May 2007.
[Li et al. 07c]
Y. Li, K. Bontcheva, and H. Cunningham. Experiments of opinion analysis on the
corpora MPQA and NTCIR-6. In Proceedings of the Sixth NTCIR Workshop Meeting
on Evaluation of Information Access Technologies: Information Retrieval, Question
Answering and Cross-Lingual Information Access, pages 323–329, May 2007.
[Li et al. 07d]
Y. Li, K. Bontcheva, and H. Cunningham. SVM Based Learning System for F-term
Patent Classification. In Proceedings of the Sixth NTCIR Workshop Meeting on
Evaluation of Information Access Technologies: Information Retrieval, Question
Answering and Cross-Lingual Information Access, pages 396–402, May 2007.
[Li et al. 09]
Y. Li, K. Bontcheva, and H. Cunningham. Adapting SVM for Data Sparseness and
Imbalance: A Case Study on Information Extraction. Natural Language Engineering,
15(2):241–271, 2009.
[Lombard et al. 02]
M. Lombard, J. Snyder-Duch, and C. C. Bracken. Content analysis in mass
communication: Assessment and reporting of intercoder reliability. Human
Communication Research, 28:587–604, 2002.
[LREC-1 98]
Conference on Language Resources Evaluation (LREC-1), Granada, Spain, 1998.
[LREC-2 00]
Second Conference on Language Resources Evaluation (LREC-2), Athens, 2000.
[Manning & Schütze 99]
C. Manning and H. Schütze. Foundations of Statistical Natural Language
Processing. MIT press, Cambridge, MA, 1999. Supporting materials available at
http://www.sultry.arts.usyd.edu.au/fsnlp/ .
[Manov et al. 03]
D. Manov, A. Kiryakov, B. Popov, K. Bontcheva, and D. Maynard. Experiments
with geographic knowledge for information extraction. In Workshop on
Analysis of Geographic References, HLT/NAACL’03, Edmonton, Canada, 2003.
http://gate.ac.uk/sale/hlt03/paper03.pdf.
[Maynard 05]
D. Maynard. Benchmarking ontology-based annotation tools for the semantic web.
In UK e-Science Programme All Hands Meeting (AHM2005) Workshop on Text
Mining, e-Research and Grid-enabled Language Technology, Nottingham, UK, 2005.
[Maynard 08]
D. Maynard. Benchmarking textual annotation tools for the semantic web. In Proc.
of 6th International Conference on Language Resources and Evaluation (LREC),
Marrakech, Morocco, 2008.
[Maynard et al. 00]
D. Maynard, H. Cunningham, K. Bontcheva,
R. Catizone, G. Demetriou, R. Gaizauskas, O. Hamza, M. Hepple, P. Herring,
B. Mitchell, M. Oakes, W. Peters, A. Setzer, M. Stevenson, V. Tablan, C. Ursu,
and Y. Wilks. A Survey of Uses of GATE. Technical Report CS–00–06, Department
of Computer Science, University of Sheffield, 2000.
[Maynard et al. 01]
D. Maynard, V. Tablan, C. Ursu, H. Cunningham, and Y. Wilks. Named Entity
Recognition from Diverse Text Types. In Recent Advances in Natural Language
Processing 2001 Conference, pages 257–274, Tzigov Chark, Bulgaria, 2001.
[Maynard et al. 02a]
D. Maynard, K. Bontcheva, H. Saggion, H. Cunningham, and O. Hamza. Using
a Text Engineering Framework to Build an Extendable and Portable IE-based
Summarisation System. In Proceedings of the ACL Workshop on Text
Summarisation, pages 19–26, Phildadelphia, Pennsylvania, 2002. ACM.
[Maynard et al. 02b]
D. Maynard, H. Cunningham, K. Bontcheva, and M. Dimitrov. Adapting a robust
multi-genre NE system for automatic content extraction. In Proceedings of the
10th International Conference on Artificial Intelligence: Methodology, Systems,
Applications (AIMSA’02), Varna, Bulgaria, Sep 2002.
[Maynard et al. 02c]
D. Maynard, H. Cunningham, K. Bontcheva, and M. Dimitrov. Adapting A
Robust Multi-Genre NE System for Automatic Content Extraction. In Proceedings of
the Tenth International Conference on Artificial Intelligence: Methodology, Systems,
Applications (AIMSA 2002), 2002.
[Maynard et al. 02d]
D. Maynard, H. Cunningham, and R. Gaizauskas. Named entity recognition at
sheffield university. In H. Holmboe, editor, Nordic Language Technology – Arbog for
Nordisk Sprogtechnologisk Forskningsprogram 2002-2004, pages 141–145. Museum
Tusculanums Forlag, 2002.
[Maynard et al. 02e]
D. Maynard, V. Tablan, H. Cunningham, C. Ursu, H. Saggion, K. Bontcheva,
and Y. Wilks. Architectural Elements of Language Engineering Robustness. Journal
of Natural Language Engineering – Special Issue on Robust Methods in Analysis of
Natural Language Data, 8(2/3):257–274, 2002.
[Maynard et al. 03a]
D. Maynard, K. Bontcheva, and H. Cunningham. From information extraction to
content extraction. Submitted to EACL’2003, 2003.
[Maynard et al. 03b]
D. Maynard, K. Bontcheva, and H. Cunningham. Towards a semantic
extraction of named entities. In G. Angelova, K. Bontcheva, R. Mitkov,
N. Nicolov, and N. Nikolov, editors, Proceedings of Recent Advances in Natural
Language Processing (RANLP’03), pages 255–261, Borovets, Bulgaria, Sep 2003.
http://gate.ac.uk/sale/ranlp03/ranlp03.pdf.
[Maynard et al. 03c]
D. Maynard, K. Bontcheva, and H. Cunningham. Towards a semantic extraction
of Named Entities. In Recent Advances in Natural Language Processing, Bulgaria,
2003.
[Maynard et al. 03d]
D. Maynard, V. Tablan, K. Bontcheva, and H. Cunningham. Rapid customisation
of an Information Extraction system for surprise languages. Special issue of ACM
Transactions on Asian Language Information Processing: Rapid Development of
Language Capabilities: The Surprise Languages, 2:295–300, 2003.
[Maynard et al. 03e]
D. Maynard, V. Tablan, and H. Cunningham. NE recognition without training
data on a language you don’t speak. In ACL Workshop on Multilingual and
Mixed-language Named Entity Recognition: Combining Statistical and Symbolic
Models, Sapporo, Japan, 2003.
[Maynard et al. 04a]
D. Maynard, K. Bontcheva, and H. Cunningham. Automatic
Language-Independent Induction of Gazetteer Lists. In Proceedings of 4th Language
Resources and Evaluation Conference (LREC’04), Lisbon, Portugal, 2004. ELRA.
[Maynard et al. 04b]
D. Maynard, H. Cunningham, A. Kourakis, and A. Kokossis. Ontology-Based
Information Extraction in hTechSight. In First European Semantic Web Symposium
(ESWS 2004), Heraklion, Crete, 2004.
[Maynard et al. 04c]
D. Maynard, M. Yankova, N. Aswani, and H. Cunningham. Automatic Creation
and Monitoring of Semantic Metadata in a Dynamic Knowledge Portal. In
Proceedings of the 11th International Conference on Artificial Intelligence:
Methodology, Systems, Applications (AIMSA 2004), Varna, Bulgaria, 2004.
[Maynard et al. 06]
D. Maynard, W. Peters, and Y. Li. Metrics for evaluation of ontology-based
information extraction. In WWW 2006 Workshop on Evaluation of Ontologies for
the Web (EON), Edinburgh, Scotland, 2006.
[Maynard et al. 07a]
D. Maynard, W. Peters, M. d’Aquin, and M. Sabou. Change management for
metadata evolution. In ESWC International Workshop on Ontology Dynamics
(IWOD), Innsbruck, Austria, June 2007.
[Maynard et al. 07b]
D. Maynard, H. Saggion, M. Yankova, K. Bontcheva, and W. Peters. Natural
Language Technology for Information Integration in Business Intelligence. In
10th International Conference on Business Information Systems (BIS-07), Poznan,
Poland, 25-27 April 2007.
[Maynard et al. 08a]
D. Maynard, W. Peters, and Y. Li. Evaluating evaluation metrics for
ontology-based applications: Infinite reflection. In Proc. of 6th International
Conference on Language Resources and Evaluation (LREC), Marrakech, Morocco,
2008.
[Maynard et al. 08b]
D. Maynard, Y. Li, and W. Peters. NLP Techniques for Term Extraction and
Ontology Population. In P. Buitelaar and P. Cimiano, editors, Bridging the Gap
between Text and Knowledge - Selected Contributions to Ontology Learning and
Population from Text. IOS Press, 2008.
[McEnery et al. 00]
A. McEnery, P. Baker, R. Gaizauskas, and H. Cunningham. EMILLE: Building
a Corpus of South Asian Languages. Vivek, A Quarterly in Artificial Intelligence,
13(3):23–32, 2000.
[Pastra et al. 02]
K. Pastra, D. Maynard, H. Cunningham, O. Hamza, and Y. Wilks. How
feasible is the reuse of grammars for named entity recognition? In
Proceedings of the 3rd Language Resources and Evaluation Conference, 2002.
http://gate.ac.uk/sale/lrec2002/reusability.ps.
[Peters et al. 98]
W. Peters, H. Cunningham, C. McCauley, K. Bontcheva, and Y. Wilks. Uniform
Language Resource Access and Distribution. In Workshop on Distributing and
Accessing Lexical Resources at Conference on Language Resources Evaluation,
Granada, Spain, 1998.
[Polajnar et al. 05]
T. Polajnar, V. Tablan, and H. Cunningham. User-friendly ontology authoring
using a controlled language. Technical Report CS Report No. CS-05-10, University
of Sheffield, Sheffield, UK, 2005.
[Porter 80]
M. Porter. An algorithm for suffix stripping. Program, 14(3):130–137, 1980.
[Ramshaw & Marcus 95]
L. Ramshaw and M. Marcus. Text Chunking Using Transformation-Based
Learning. In Proceedings of the Third ACL Workshop on Very Large Corpora, 1995.
[Saggion & Gaizauskas 04a]
H. Saggion and R. Gaizauskas. Mining on-line sources for definition knowledge.
In Proceedings of the 17th FLAIRS 2004, Miami Bearch, Florida, USA, May 17-19
2004. AAAI.
[Saggion & Gaizauskas 04b]
H. Saggion and R. Gaizauskas. Multi-document summarization by cluster/profile
relevance and redundancy removal. In Proceedings of the Document Understanding
Conference 2004. NIST, 2004.
[Saggion & Gaizauskas 05]
H. Saggion and R. Gaizauskas. Experiments on statistical and pattern-based
biographical summarization. In Proceedings of EPIA 2005, pages 611–621, 2005.
[Saggion 04]
H. Saggion. Identifying definitions in text collections for question answering. lrec.
In Proceedings of Language Resources and Evaluation Conference. ELDA, 2004.
[Saggion 06]
H. Saggion. Multilingual Multidocument Summarization Tools and Evaluation. In
Proceedings of LREC 2006, 2006.
[Saggion 07]
H. Saggion. Shef: Semantic tagging and summarization techniques applied to
cross-document coreference. In Proceedings of SemEval 2007, Assocciation for
Computational Linguistics, pages 292–295, June 2007.
[Saggion et al. 02a]
H. Saggion, H. Cunningham, K. Bontcheva, D. Maynard, C. Ursu, O. Hamza,
and Y. Wilks. Access to Multimedia Information through Multisource and
Multilanguage Information Extraction. In Proceedings of the 7th Workshop on
Applications of Natural Language to Information Systems (NLDB 2002), Stockholm,
Sweden, 2002.
[Saggion et al. 02b]
H. Saggion, H. Cunningham, D. Maynard, K. Bontcheva, O. Hamza, C. Ursu,
and Y. Wilks. Extracting Information for Information Indexing of Multimedia
Material. In Proceedings of 3rd Language Resources and Evaluation Conference
(LREC’2002), 2002. http://gate.ac.uk/sale/lrec2002/mumis_lrec2002.ps.
[Saggion et al. 03a]
H. Saggion, K. Bontcheva, and H. Cunningham. Robust Generic and Query-based
Summarisation. In Proceedings of the European Chapter of Computational
Linguistics (EACL), Research Notes and Demos, 2003.
[Saggion et al. 03b]
H. Saggion, H. Cunningham, K. Bontcheva, D. Maynard, O. Hamza, and
Y. Wilks. Multimedia Indexing through Multisource and Multilingual Information
Extraction; the MUMIS project. Data and Knowledge Engineering, 48:247–264,
2003.
[Saggion et al. 03c]
H. Saggion, J. Kuper, H. Cunningham, T. Declerck, P. Wittenburg, M. Puts,
F. DeJong, and Y. Wilks. Event-coreference across Multiple, Multi-lingual Sources
in the Mumis Project. In Proceedings of the European Chapter of Computational
Linguistics (EACL), Research Notes and Demos, 2003.
[Saggion et al. 07]
H. Saggion, A. Funk, D. Maynard, and K. Bontcheva. Ontology-based information
extraction for business applications. In Proceedings of the 6th International Semantic
Web Conference (ISWC 2007), Busan, Korea, November 2007.
[Scott & Gaizauskas. 00]
S. Scott and R. Gaizauskas. The University of Sheffield TREC-9 Q&A System. In
In Proceedings of the 9th Text REtrieval Conference, 2000.
[Shaw & Garlan 96]
M. Shaw and D. Garlan. Software Architecture. Prentice Hall, New York, 1996.
[Stevenson et al. 98]
M. Stevenson, H. Cunningham, and Y. Wilks. Sense tagging and language
engineering. In Proceedings of the 13th European Conference on Artificial Intelligence
(ECAI-98), pages 185–189, Brighton, U.K., 1998.
[Tablan et al. 02]
V. Tablan, C. Ursu, K. Bontcheva, H. Cunningham, D. Maynard, O. Hamza,
T. McEnery, P. Baker, and M. Leisher. A Unicode-based Environment for
Creation and Use of Language Resources. In 3rd Language Resources and
Evaluation Conference, Las Palmas, Canary Islands – Spain, 2002. ELRA.
http://gate.ac.uk/sale/iesl03/iesl03.pdf.
[Tablan et al. 03]
V. Tablan, K. Bontcheva, D. Maynard, and H. Cunningham. OLLIE: On-Line
Learning for Information Extraction. In Proceedings of the HLT-NAACL Workshop
on Software Engineering and Architecture of Language Technology Systems,
Edmonton, Canada, 2003. http://gate.ac.uk/sale/hlt03/ollie-sealts.pdf.
[Tablan et al. 06a]
V. Tablan, T. Polajnar, H. Cunningham, and K. Bontcheva. User-friendly
ontology authoring using a controlled language. In 5th Language Resources and
Evaluation Conference (LREC), Genoa, Italy, May 2006. ELRA.
[Tablan et al. 06b]
V. Tablan, W. Peters, D. Maynard, H. Cunningham, and K. Bontcheva. Creating
tools for morphological analysis of sumerian. In 5th Language Resources and
Evaluation Conference (LREC), Genoa, Italy, May 2006. ELRA.
[Tablan et al. 08]
V. Tablan, D. Damljanovic, and K. Bontcheva. A natural language query interface
to structured information. In Proceedings of the 5h European Semantic Web
Conference (ESWC 2008), Tenerife, Spain, June 2008.
[Ursu et al. 05]
C. Ursu, T. Tablan, H. Cunningham, and B. Popav. Digital media preservation
and access through semantically enhanced web-annotation. In Proceedings of the 2nd
European Workshop on the Integration of Knowledge, Semantic and Digital Media
Technologies (EWIMT 2005), London, UK, December 01 2005.
[van Rijsbergen 79]
C. van Rijsbergen. Information Retrieval. Butterworths, London, 1979.
[Wang et al. 05]
T. Wang, D. Maynard, W. Peters, K. Bontcheva, and H. Cunningham. Extracting
a domain ontology from linguistic resource based on relatedness measurements.
In Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web
Intelligence (WI 2005), pages 345–351, Compiegne, France, Septmeber 2005.
[Wang et al. 06]
T. Wang, Y. Li, K. Bontcheva, H. Cunningham, and J. Wang. Automatic
Extraction of Hierarchical Relations from Text. In Proceedings of the Third European
Semantic Web Conference (ESWC 2006), Budva, Montenegro, 2006.
[Witten & Frank 99]
I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and
Techniques with Java Implementations. Morgan Kaufmann, 1999.
[Wood et al. 03]
M. M. Wood, S. J. Lydon, V. Tablan, D. Maynard, and H. Cunningham. Using
parallel texts to improve recall in IE. In Recent Advances in Natural Language
Processing, Bulgaria, 2003.
[Wood et al. 04]
M. Wood, S. Lydon, V. Tablan, D. Maynard, and H. Cunningham. Populating
a Database from Parallel Texts using Ontology-based Information Extraction. In
Proceedings of NLDB 2004, 2004. http://gate.ac.uk/sale/nldb2004/NLDB.pdf.
[Yourdon 89]
E. Yourdon. Modern Structured Analysis. Prentice Hall, New York, 1989.
[Yourdon 96]
E. Yourdon. The Rise and Resurrection of the American Programmer. Prentice
Hall, New York, 1996.