References
[Appelt 99]
D. E. Appelt. An Introduction to Information Extraction. Artificial Intelligence
Communications, 12(3):161–172, 1999.
[Aswani et al. 05]
N. Aswani, V. Tablan, K. Bontcheva, and H. Cunningham. Indexing and Querying
Linguistic Metadata and Document Content. In Proceedings of Fifth International
Conference on Recent Advances in Natural Language Processing (RANLP2005),
Borovets, Bulgaria, 2005.
[Aswani et al. 06]
N. Aswani, K. Bontcheva, and H. Cunningham. Mining information for instance
unification. In 5th International Semantic Web Conference (ISWC2006), Athens,
Georgia, USA, 2006.
[Azar 89]
S. Azar. Understanding and Using English Grammar. Prentice Hall Regents, 1989.
[Baker et al. 02]
P. Baker, A. Hardie, T. McEnery, H. Cunningham, and R. Gaizauskas. EMILLE,
A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and
Harmonisation. In Proceedings of 3rd Language Resources and Evaluation
Conference (LREC’2002), pages 819–825, 2002.
[Bird & Liberman 99]
S. Bird and M. Liberman. A Formal Framework for Linguistic Annotation.
Technical Report MS-CIS-99-01, Department of Computer and Information Science,
University of Pennsylvania, 1999. http://xxx.lanl.gov/abs/cs.CL/9903003.
[Bontcheva 04]
K. Bontcheva. Open-source Tools for Creation, Maintenance, and Storage of Lexical
Resources for Language Generation from Ontologies. In Proceedings of 4th Language
Resources and Evaluation Conference (LREC’04), 2004.
[Bontcheva et al. 00]
K. Bontcheva, H. Brugman, A. Russel, P. Wittenburg, and H. Cunningham. An
Experiment in Unifying Audio-Visual and Textual Infrastructures for Language
Processing R&D. In Proceedings of the Workshop on Using Toolsets and
Architectures To Build NLP Systems at COLING-2000, Luxembourg, 2000.
http://gate.ac.uk/.
[Bontcheva et al. 02a]
K. Bontcheva, H. Cunningham, V. Tablan, D. Maynard, and O. Hamza. Using
GATE as an Environment for Teaching NLP. In Proceedings of the ACL
Workshop on Effective Tools and Methodologies in Teaching NLP, 2002.
http://gate.ac.uk/sale/acl02/gate4teaching.pdf.
[Bontcheva et al. 02b]
K. Bontcheva, H. Cunningham, V. Tablan, D. Maynard, and H. Saggion.
Developing Reusable and Robust Language Processing Components for Information
Systems using GATE. In Proceedings of the 3rd International Workshop on Natural
Language and Information Systems (NLIS’2002), Aix-en-Provence, France, 2002.
IEEE Computer Society Press. http://gate.ac.uk/sale/nlis/nlis.ps.
[Bontcheva et al. 02c]
K. Bontcheva, M. Dimitrov, D. Maynard, V. Tablan, and H. Cunningham.
Shallow Methods for Named Entity Coreference Resolution. In Chaînes de
références et résolveurs d’anaphores, workshop TALN 2002, Nancy, France, 2002.
http://gate.ac.uk/sale/taln02/taln-ws-coref.pdf.
[Bontcheva et al. 03]
K. Bontcheva, A. Kiryakov, H. Cunningham, B. Popov, and M. Dimitrov.
Semantic web enabled, open source language technology. In EACL workshop on
Language Technology and the Semantic Web: NLP and XML, Budapest, Hungary,
2003. http://gate.ac.uk/sale/eacl03-semweb/bontcheva-etal-final.pdf.
[Bontcheva et al. 04]
K. Bontcheva, V. Tablan, D. Maynard, and H. Cunningham. Evolving GATE to
Meet New Challenges in Language Engineering. Natural Language Engineering,
10(3/4):349—373, 2004.
[Booch 94]
G. Booch. Object-Oriented Analysis and Design 2nd Edn. Benjamin/Cummings,
1994.
[Brugman et al. 99]
H. Brugman, K. Bontcheva, P. Wittenburg, and H. Cunningham. Integrating
Multimedia and Textual Software Architectures for Language Technology. Technical
report MPI-TG-99-1, Max-Planck Institute for Psycholinguistics, Nijmegen,
Netherlands, 1999.
[Campione et al. 98]
M. Campione, K. Walrath, A. Huml, and the Tutuorial Team. The Java Tutorial
Continued: The Rest of the JDK. Addison-Wesley, Reading, MA, 1998.
[Carletta 96]
J. Carletta. Assessing agreement on classification tasks: the Kappa statistic.
Computational Linguistics, 22(2):249–254, 1996.
[CC001]
LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[Chinchor 92]
N. Chinchor. Muc-4 evaluation metrics. In Proceedings of the Fourth Message
Understanding Conference, pages 22–29, 1992.
[Cimiano et al. 03]
P. Cimiano, S.Staab, and J. Tane. Automatic Acquisition of Taxonomies from Text:
FCA meets NLP. In Proceedings of the ECML/PKDD Workshop on Adaptive Text
Extraction and Mining, pages 10–17, Cavtat-Dubrovnik, Croatia, 2003.
[Cobuild 99]
C. Cobuild, editor. English Grammar. Harper Collins, 1999.
[Cowie & Lehnert 96]
J. Cowie and W. Lehnert. Information Extraction. Communications of the ACM,
39(1):80–91, 1996.
[Cunningham & Bontcheva 05]
H. Cunningham and K. Bontcheva. Computational Language Systems,
Architectures. Encyclopedia of Language and Linguistics, 2nd Edition, pages
733–752, 2005.
[Cunningham & Scott 04a]
H. Cunningham and D. Scott. Introduction to the Special Issue on Software
Architecture for Language Engineering. Natural Language Engineering, 2004.
http://gate.ac.uk/sale/jnle-sale/intro/intro-main.pdf.
[Cunningham & Scott 04b]
H. Cunningham and D. Scott, editors. Special Issue of Natural Language
Engineering on Software Architecture for Language Engineering. Cambridge
University Press, 2004.
[Cunningham 94]
H. Cunningham. Support Software for Language Engineering Research. Technical
Report 94/05, Centre for Computational Linguistics, UMIST, Manchester, 1994.
[Cunningham 99a]
H. Cunningham. A Definition and Short History of Language Engineering. Journal
of Natural Language Engineering, 5(1):1–16, 1999.
[Cunningham 99b]
H. Cunningham. Information Extraction: a User Guide (revised version). Research
Memorandum CS–99–07, Department of Computer Science, University of Sheffield,
May 1999.
[Cunningham 99c]
H. Cunningham. JAPE: a Java Annotation Patterns Engine. Research
Memorandum CS–99–06, Department of Computer Science, University of Sheffield,
May 1999.
[Cunningham 00]
H. Cunningham. Software Architecture for Language Engineering. Unpublished
PhD thesis, University of Sheffield, 2000. http://gate.ac.uk/sale/thesis/.
[Cunningham 02]
H. Cunningham. GATE, a General Architecture for Text Engineering. Computers
and the Humanities, 36:223–254, 2002.
[Cunningham 05]
H. Cunningham. Information Extraction, Automatic. Encyclopedia of Language
and Linguistics, 2nd Edition, pages 665–677, 2005.
[Cunningham et al. 94]
H. Cunningham, M. Freeman, and W. Black. Software Reuse, Object-Oriented
Frameworks and Natural Language Processing. In New Methods in Language
Processing (NeMLaP-1), September 1994, Manchester, 1994. (Re-published in book
form 1997 by UCL Press).
[Cunningham et al. 95]
H. Cunningham, R. Gaizauskas, and Y. Wilks. A General Architecture for Text
Engineering (GATE) – a new approach to Language Engineering R&D. Technical
Report CS–95–21, Department of Computer Science, University of Sheffield, 1995.
http://xxx.lanl.gov/abs/cs.CL/9601009.
[Cunningham et al. 96a]
H. Cunningham, K. Humphreys, R. Gaizauskas, and M. Stower. CREOLE
Developer’s Manual. Technical report, Department of Computer Science, University
of Sheffield, 1996. http://www.dcs.shef.ac.uk/nlp/gate.
[Cunningham et al. 96b]
H. Cunningham,
K. Humphreys, R. Gaizauskas, and Y. Wilks. TIPSTER-Compatible Projects at
Sheffield. In Advances in Text Processing, TIPSTER Program Phase II. DARPA,
Morgan Kaufmann, California, 1996.
[Cunningham et al. 96c]
H. Cunningham, Y. Wilks, and R. Gaizauskas. GATE – a General Architecture
for Text Engineering. In Proceedings of the
16th Conference on Computational Linguistics (COLING-96), Copenhagen, August
1996. ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun96b.ps.
[Cunningham et al. 96d]
H. Cunningham, Y. Wilks, and R. Gaizauskas. Software Infrastructure for
Language Engineering. In Proceedings of the AISB Workshop on Language
Engineering for Document Analysis and Recognition, Brighton, U.K., April 1996.
[Cunningham et al. 96e]
H. Cunningham, Y. Wilks, and R. Gaizauskas. New Methods, Current Trends and
Software Infrastructure for NLP. In Proceedings of the Conference on New Methods
in Natural Language Processing (NeMLaP-2), Bilkent University, Turkey, September
1996. ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun96c.ps.
[Cunningham et al. 97a]
H. Cunningham,
K. Humphreys, R. Gaizauskas, and Y. Wilks. GATE – a TIPSTER-based General
Architecture for Text Engineering. In Proceedings of the TIPSTER Text Program
(Phase III) 6 Month Workshop. DARPA, Morgan Kaufmann, California, May 1997.
ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun97e.ps.
[Cunningham et al. 97b]
H. Cunningham, K. Humphreys, R. Gaizauskas, and Y. Wilks. Software
Infrastructure for Natural Language Processing. In Proceedings of the 5th
Conference on Applied Natural Language Processing (ANLP-97), March 1997.
ftp://ftp.dcs.shef.ac.uk/home/hamish/auto_papers/Cun97a.ps.gz.
[Cunningham et al. 98a]
H. Cunningham, W. Peters, C. McCauley, K. Bontcheva, and Y. Wilks. A Level
Playing Field for Language Resource Evaluation. In Workshop on Distributing
and Accessing Lexical Resources at Conference on Language Resources Evaluation,
Granada, Spain, 1998. http://www.dcs.shef.ac.uk/ hamish/dalr.
[Cunningham et al. 98b]
H. Cunningham, M. Stevenson, and Y. Wilks. Implementing a Sense Tagger within
a General Architecture for Language Engineering. In Proceedings of the Third
Conference on New Methods in Language Engineering (NeMLaP-3), pages 59–72,
Sydney, Australia, 1998.
[Cunningham et al. 99]
H. Cunningham, R. Gaizauskas, K. Humphreys, and Y. Wilks. Experience with
a Language Engineering Architecture: Three Years of GATE. In Proceedings of
the AISB’99 Workshop on Reference Architectures and Data Standards for NLP,
Edinburgh, April 1999. The Society for the Study of Artificial Intelligence and
Simulation of Behaviour. http://www.dcs.shef.ac.uk/ hamish/GateAisb99.html.
[Cunningham et al. 00a]
H. Cunningham, K. Bontcheva, W. Peters, and Y. Wilks. Uniform language
resource access and distribution in the context of a General Architecture for
Text Engineering (GATE). In Proceedings of the Workshop on Ontologies
and Language Resources (OntoLex’2000), Sozopol, Bulgaria, September 2000.
http://gate.ac.uk/sale/ontolex/ontolex.ps.
[Cunningham et al. 00b]
H. Cunningham, K. Bontcheva, V. Tablan, and Y. Wilks. Software Infrastructure
for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis.
In Proceedings of the 2nd International Conference on Language Resources and
Evaluation (LREC-2), Athens, 2000. http://gate.ac.uk/.
[Cunningham et al. 00c]
H. Cunningham,
D. Maynard, K. Bontcheva, V. Tablan, and Y. Wilks. Experience of using GATE
for NLP R&D. In Proceedings of the Workshop on Using Toolsets and Architectures
To Build NLP Systems at COLING-2000, Luxembourg, 2000. http://gate.ac.uk/.
[Cunningham et al. 00d]
H. Cunningham, D. Maynard, and V. Tablan. JAPE: a Java Annotation Patterns
Engine (Second Edition). Research Memorandum CS–00–10, Department of
Computer Science, University of Sheffield, November 2000.
[Cunningham et al. 02]
H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: A Framework
and Graphical Development Environment for Robust NLP Tools and Applications.
In Proceedings of the 40th Anniversary Meeting of the Association for Computational
Linguistics (ACL’02), 2002.
[Cunningham et al. 03]
H. Cunningham, V. Tablan,
K. Bontcheva, and M. Dimitrov. Language Engineering Tools for Collaborative
Corpus Annotation. In Proceedings of Corpus Linguistics 2003, Lancaster, UK, 2003.
http://gate.ac.uk/sale/cl03/distrib-ollie-cl03.doc.
[Davies & Fleiss 82]
M. Davies and J. Fleiss. Measuring Agreement for Multinomial Data. Biometrics,
38:1047–1051, 1982.
[Dean et al. 04]
M. Dean, G. Schreiber, S. Bechhofer, F. van Harmelen, J. Hendler, I. Horrocks,
D. L. McGuinness, P. F. Patel-Schneider, and L. A. Stein. OWL web
ontology language reference. W3C recommendation, W3C, Feb 2004.
http://www.w3.org/TR/owl-ref/.
[Dimitrov 02a]
M. Dimitrov. A Light-weight Approach to Coreference Resolution for
Named Entities in Text. MSc Thesis, University of Sofia, Bulgaria, 2002.
http://www.ontotext.com/ie/thesis-m.pdf.
[Dimitrov 02b]
M. Dimitrov. A Light-weight Approach to Coreference Resolution for
Named Entities in Text. MSc Thesis, University of Sofia, Bulgaria, 2002.
http://www.ontotext.com/ie/thesis-m.pdf.
[Dimitrov et al. 02]
M. Dimitrov, K. Bontcheva, H. Cunningham, and D. Maynard. A Light-weight
Approach to Coreference Resolution for Named Entities in Text. In Proceedings
of the Fourth Discourse Anaphora and Anaphor Resolution Colloquium (DAARC),
Lisbon, 2002.
[Dimitrov et al. 04]
M. Dimitrov, K. Bontcheva, H. Cunningham, and D. Maynard. A Light-weight
Approach to Coreference Resolution for Named Entities in Text. In A. Branco,
T. McEnery, and R. Mitkov, editors, Anaphora Processing: Linguistic, Cognitive
and Computational Modelling. John Benjamins, 2004.
[Dowman et al. 05a]
M. Dowman, V. Tablan, H. Cunningham, and B. Popov. Content
augmentation for mixed-mode news broadcasts. In Proceedings of the
3rd European Conference on Interactive Television: User Centred ITV
Systems, Programmes and Applications, Aalborg University, Denmark, 2005.
http://gate.ac.uk/sale/euro-itv-2005/content-augmentation-for-mixed-mode-news-broadcast-consumption.pdf.
[Dowman et al. 05b]
M. Dowman, V. Tablan, H. Cunningham, and B. Popov. Web-assisted annotation,
semantic indexing and search of television and radio news. In Proceedings
of the 14th International World Wide Web Conference, Chiba, Japan, 2005.
http://gate.ac.uk/sale/www05/web-assisted-annotation.pdf.
[Dowman et al. 05c]
M. Dowman, V. Tablan, H. Cunningham, C. Ursu, and B. Popov. Semantically
enhanced television news through web and video integration. In Second European
Semantic Web Conference (ESWC’2005), 2005.
[Eugenio & Glass 04]
B. D. Eugenio and M. Glass. The kappa statistic: a second look. Computational
Linguistics, 1(30), 2004. (squib).
[Fleiss 75]
J. L. Fleiss. Measuring agreement between two judges on the presence or absence
of a trait. Biometrics, 31:651–659, 1975.
[Frakes & Baeza-Yates 92]
W. Frakes and R. Baeza-Yates, editors. Information retrieval, data structures and
algorithms. Prentice Hall, New York, Englewood Cliffs, N.J., 1992.
[Gaizauskas & Wilks 98]
R. Gaizauskas and Y. Wilks. Information Extraction: Beyond Document Retrieval.
Journal of Documentation, 54(1):70–105, 1998.
[Gaizauskas et al. 96a]
R. Gaizauskas, P. Rodgers, H. Cunningham, and K. Humphreys. GATE User
Guide. http://www.dcs.shef.ac.uk/nlp/gate, 1996.
[Gaizauskas et al. 96b]
R. Gaizauskas, H. Cunningham, Y. Wilks, P. Rodgers, and K. Humphreys. GATE
– an Environment to Support Research and Development in Natural Language
Engineering. In Proceedings of the 8th IEEE International Conference on
Tools with Artificial Intelligence (ICTAI-96), Toulouse, France, October 1996.
ftp://ftp.dcs.shef.ac.uk/home/robertg/ictai96.ps.
[Gambäck & Olsson 00]
B. Gambäck and F. Olsson. Experiences of Language Engineering Algorithm Reuse.
In Second International Conference on Language Resources and Evaluation (LREC),
pages 155–160, Athens, Greece, 2000.
[Gazdar & Mellish 89]
G. Gazdar and C. Mellish. Natural Language Processing in Prolog. Addison-Wesley,
Reading, MA, 1989.
[Grishman 97]
R. Grishman.
TIPSTER Architecture Design Document Version 2.3. Technical report, DARPA,
1997. http://www.itl.nist.gov/div894/894.02/related_projects/tipster/.
[Hepple 00]
M. Hepple. Independence and commitment: Assumptions for rapid training and
execution of rule-based POS taggers. In Proceedings of the 38th Annual Meeting
of the Association for Computational Linguistics (ACL-2000), Hong Kong, October
2000.
[Horrocks & vanHarmelen 01]
I. Horrocks and F. van Harmelen. Reference Description of
the DAML+OIL (March 2001) Ontology Markup Language. Technical report, 2001.
http://www.daml.org/2001/03/reference.html.
[Hripcsak & Heitjan 02]
G. Hripcsak and D. Heitjan. Measuring agreement in medical informatics reliability
studies. Journal of Biomedical Informatics, 35:99–110, 2002.
[Hripcsak & Rothschild 05]
G. Hripcsak and A. S. Rothschild. Agreement, the F-measure, and Reliability in
Information Retrieval. Journal of the American Medical Informatics Association,
12(3):296–298, 2005.
[Humphreys et al. 96]
K. Humphreys, R. Gaizauskas, H. Cunningham, and S. Azzam. CREOLE Module
Specifications. http://www.dcs.shef.ac.uk/nlp/gate/, 1996.
[Jackson 75]
M. Jackson. Principles of Program Design. Academic Press, London, 1975.
[Kiryakov 03]
A. Kiryakov. Ontology and Reasoning in MUMIS: Towards the Semantic Web.
Technical Report CS–03–03, Department of Computer Science, University of
Sheffield, 2003. http://gate.ac.uk/gate/doc/papers.html.
[Lal & Ruger 02]
P. Lal and S. Ruger. Extract-based summarization with simplification. In
Proceedings of the ACL 2002 Automatic Summarization / DUC 2002 Workshop,
2002. http://www.doc.ic.ac.uk/ srueger/pr-p.lal-2002/duc02-final.pdf.
[Lal 02]
P. Lal. Text summarisation. Unpublished M.Sc. thesis, Imperial College, London,
2002.
[Lassila & Swick 99]
O. Lassila and R. Swick. Resource Description Framework (RDF) Model
and Syntax Specification. Technical Report 19990222, W3C Consortium,
http://www.w3.org/TR/REC-rdf-syntax/, 1999.
[Li & Shawe-Taylor 03]
Y. Li and J. Shawe-Taylor. The SVM with Uneven Margins and Chinese Document
Categorization. In Proceedings of The 17th Pacific Asia Conference on Language,
Information and Computation (PACLIC17), Singapore, Oct. 2003.
[Li et al. 02]
Y. Li, H. Zaragoza, R. Herbrich, J. Shawe-Taylor, and J. Kandola. The
Perceptron Algorithm with Uneven Margins. In Proceedings of the 9th International
Conference on Machine Learning (ICML-2002), pages 379–386, 2002.
[Li et al. 04]
Y. Li, K. Bontcheva, and H. Cunningham. An SVM Based Learning Algorithm
for Information Extraction. Machine Learning Workshop, Sheffield, 2004.
http://gate.ac.uk/sale/ml-ws04/mlw2004.pdf.
[Li et al. 05a]
Y. Li, K. Bontcheva, and H. Cunningham. SVM Based Learning System
For Information Extraction. In M. N. J. Winkler and N. Lawerence, editors,
Deterministic and Statistical Methods in Machine Learning, LNAI 3635, pages
319–339. Springer Verlag, 2005.
[Li et al. 05b]
Y. Li, K. Bontcheva, and H. Cunningham. Using Uneven Margins SVM and
Perceptron for Information Extraction. In Proceedings of Ninth Conference on
Computational Natural Language Learning (CoNLL-2005), 2005.
[Li et al. 05c]
Y. Li, C. Miao, K. Bontcheva, and H. Cunningham. Perceptron Learning for
Chinese Word Segmentation. In Proceedings of Fourth SIGHAN Workshop on
Chinese Language processing (Sighan-05), pages 154–157, Korea, 2005.
[Li et al. 07a]
Y. Li, K. Bontcheva, and H. Cunningham. Cost Sensitive Evaluation Measures for
F-term Patent Classification. In The First International Workshop on Evaluating
Information Access (EVIA 2007), pages 44–53, May 2007.
[Li et al. 07b]
Y. Li, K. Bontcheva, and H. Cunningham. Experiments of opinion analysis on the
corpora MPQA and NTCIR-6. In Proceedings of the Sixth NTCIR Workshop Meeting
on Evaluation of Information Access Technologies: Information Retrieval, Question
Answering and Cross-Lingual Information Access, pages 323–329, May 2007.
[Li et al. 07c]
Y. Li, K. Bontcheva, and H. Cunningham. SVM Based Learning System for F-term
Patent Classification. In Proceedings of the Sixth NTCIR Workshop Meeting on
Evaluation of Information Access Technologies: Information Retrieval, Question
Answering and Cross-Lingual Information Access, pages 396–402, May 2007.
[Lombard et al. 02]
M. Lombard, J. Snyder-Duch, and C. C. Bracken. Content analysis in mass
communication: Assessment and reporting of intercoder reliability. Human
Communication Research, 28:587–604, 2002.
[LREC-1 98]
Conference on Language Resources Evaluation (LREC-1), Granada, Spain, 1998.
[LREC-2 00]
Second Conference on Language Resources Evaluation (LREC-2), Athens, 2000.
[Manning & Schütze 99]
C. Manning and H. Schütze. Foundations of Statistical Natural Language
Processing. MIT press, Cambridge, MA, 1999. Supporting materials available at
http://www.sultry.arts.usyd.edu.au/fsnlp/ .
[Manov et al. 03]
D. Manov, A. Kiryakov, B. Popov, K. Bontcheva, and D. Maynard. Experiments
with geographic knowledge for information extraction. In Workshop on
Analysis of Geographic References, HLT/NAACL’03, Edmonton, Canada, 2003.
http://gate.ac.uk/sale/hlt03/paper03.pdf.
[Maynard 05]
D. Maynard. Benchmarking ontology-based annotation tools for the semantic
web. In UK e-Science Programme All Hands Meeting (AHM2005) Workshop ”Text
Mining, e-Research and Grid-enabled Language Technology”, Nottingham, UK, 2005.
[Maynard et al. ]
D. Maynard, K. Bontcheva, and H. Cunningham. From information extraction to
content extraction. Submitted to EACL’2003.
[Maynard et al. 00]
D. Maynard, H. Cunningham, K. Bontcheva,
R. Catizone, G. Demetriou, R. Gaizauskas, O. Hamza, M. Hepple, P. Herring,
B. Mitchell, M. Oakes, W. Peters, A. Setzer, M. Stevenson, V. Tablan, C. Ursu,
and Y. Wilks. A Survey of Uses of GATE. Technical Report CS–00–06, Department
of Computer Science, University of Sheffield, 2000.
[Maynard et al. 01]
D. Maynard, V. Tablan, C. Ursu, H. Cunningham, and Y. Wilks. Named Entity
Recognition from Diverse Text Types. In Recent Advances in Natural Language
Processing 2001 Conference, pages 257–274, Tzigov Chark, Bulgaria, 2001.
[Maynard et al. 02a]
D. Maynard, K. Bontcheva, H. Saggion, H. Cunningham, and O. Hamza. Using
a Text Engineering Framework to Build an Extendable and Portable IE-based
Summarisation System. In Proceedings of the ACL Workshop on Text
Summarisation, pages 19–26, Phildadelphia, Pennsylvania, 2002. ACM.
[Maynard et al. 02b]
D. Maynard, H. Cunningham, K. Bontcheva, and M. Dimitrov. Adapting A
Robust Multi-Genre NE System for Automatic Content Extraction. In Proceedings of
the Tenth International Conference on Artificial Intelligence: Methodology, Systems,
Applications (AIMSA 2002), 2002.
[Maynard et al. 02c]
D. Maynard, H. Cunningham, and R. Gaizauskas. Named entity recognition at
sheffield university. In H. Holmboe, editor, Nordic Language Technology – Arbog for
Nordisk Sprogtechnologisk Forskningsprogram 2002-2004, pages 141–145. Museum
Tusculanums Forlag, 2002.
[Maynard et al. 02d]
D. Maynard, V. Tablan, H. Cunningham, C. Ursu, H. Saggion, K. Bontcheva,
and Y. Wilks. Architectural Elements of Language Engineering Robustness. Journal
of Natural Language Engineering – Special Issue on Robust Methods in Analysis of
Natural Language Data, 8(2/3):257–274, 2002.
[Maynard et al. 03a]
D. Maynard, K. Bontcheva, and H. Cunningham. Towards a semantic extraction
of Named Entities. In Recent Advances in Natural Language Processing, Bulgaria,
2003.
[Maynard et al. 03b]
D. Maynard, V. Tablan, and H. Cunningham. NE recognition without training
data on a language you don’t speak. In ACL Workshop on Multilingual and
Mixed-language Named Entity Recognition: Combining Statistical and Symbolic
Models, Sapporo, Japan, 2003.
[Maynard et al. 04a]
D. Maynard, K. Bontcheva, and H. Cunningham. Automatic
Language-Independent Induction of Gazetteer Lists. In Proceedings of 4th Language
Resources and Evaluation Conference (LREC’04), Lisbon, Portugal, 2004. ELRA.
[Maynard et al. 04b]
D. Maynard, H. Cunningham, A. Kourakis, and A. Kokossis. Ontology-Based
Information Extraction in hTechSight. In First European Semantic Web Symposium
( ESWS 2004), Heraklion, Crete, 2004.
[Maynard et al. 04c]
D. Maynard, M. Yankova, N. Aswani, and H. Cunningham. Automatic Creation
and Monitoring of Semantic Metadata in a Dynamic Knowledge Portal. In
Proceedings of the 11th International Conference on Artificial Intelligence:
Methodology, Systems, Applications (AIMSA 2004), Varna, Bulgaria, 2004.
[Maynard et al. 06]
D. Maynard, W. Peters, and Y. Li. Metrics for evaluation of ontology-based
information extraction. In WWW 2006 Workshop on ”Evaluation of Ontologies for
the Web” (EON), Edinburgh, Scotland, 2006.
[McEnery et al. 00]
A. McEnery, P. Baker, R. Gaizauskas, and H. Cunningham. EMILLE: Building
a Corpus of South Asian Languages. Vivek, A Quarterly in Artificial Intelligence,
13(3):23–32, 2000.
[Pastra et al. 02]
K. Pastra, D. Maynard, H. Cunningham, O. Hamza, and Y. Wilks. How
feasible is the reuse of grammars for named entity recognition? In
Proceedings of the 3rd Language Resources and Evaluation Conference, 2002.
http://gate.ac.uk/sale/lrec2002/reusability.ps.
[Peters et al. 98]
W. Peters, H. Cunningham, C. McCauley, K. Bontcheva, and Y. Wilks. Uniform
Language Resource Access and Distribution. In Workshop on Distributing and
Accessing Lexical Resources at Conference on Language Resources Evaluation,
Granada, Spain, 1998.
[Polajnar et al. 05]
T. Polajnar, V. Tablan, and H. Cunningham. User-friendly ontology authoring
using a controlled language. Technical Report CS Report No. CS-05-10, University
of Sheffield, Sheffield, UK, 2005.
[Porter 80]
M. Porter. An algorithm for suffix stripping. Program, 14(3):130–137, 1980.
[Ramshaw & Marcus 95]
L. Ramshaw and M. Marcus. Text Chunking Using Transformation-Based
Learning. In Proceedings of the Third ACL Workshop on Very Large Corpora, 1995.
[Saggion et al. 02a]
H. Saggion, H. Cunningham, K. Bontcheva, D. Maynard, C. Ursu, O. Hamza,
and Y. Wilks. Access to Multimedia Information through Multisource and
Multilanguage Information Extraction. In Proceedings of the 7th Workshop on
Applications of Natural Language to Information Systems (NLDB 2002), Stockholm,
Sweden, 2002.
[Saggion et al. 02b]
H. Saggion, H. Cunningham, D. Maynard, K. Bontcheva, O. Hamza, C. Ursu,
and Y. Wilks. Extracting Information for Information Indexing of Multimedia
Material. In Proceedings of 3rd Language Resources and Evaluation Conference
(LREC’2002), 2002. http://gate.ac.uk/sale/lrec2002/mumis_lrec2002.ps.
[Saggion et al. 03a]
H. Saggion, K. Bontcheva, and H. Cunningham. Robust Generic and Query-based
Summarisation. In Proceedings of the European Chapter of Computational
Linguistics (EACL), Research Notes and Demos, 2003.
[Saggion et al. 03b]
H. Saggion, H. Cunningham, K. Bontcheva, D. Maynard, O. Hamza, and
Y. Wilks. Multimedia Indexing through Multisource and Multilingual Information
Extraction; the MUMIS project. Data and Knowledge Engineering, 48:247–264,
2003.
[Saggion et al. 03c]
H. Saggion, J. Kuper, H. Cunningham, T. Declerck, P. Wittenburg, M. Puts,
F. DeJong, and Y. Wilks. Event-coreference across Multiple, Multi-lingual Sources
in the Mumis Project. In Proceedings of the European Chapter of Computational
Linguistics (EACL), Research Notes and Demos, 2003.
[Shaw & Garlan 96]
M. Shaw and D. Garlan. Software Architecture. Prentice Hall, New York, 1996.
[Stevenson et al. 98]
M. Stevenson, H. Cunningham, and Y. Wilks. Sense tagging and language
engineering. In Proceedings of the 13th European Conference on Artificial Intelligence
(ECAI-98), pages 185–189, Brighton, U.K., 1998.
[Tablan et al. 02]
V. Tablan, C. Ursu, K. Bontcheva, H. Cunningham, D. Maynard, O. Hamza,
T. McEnery, P. Baker, and M. Leisher. A Unicode-based Environment for
Creation and Use of Language Resources. In 3rd Language Resources and
Evaluation Conference, Las Palmas, Canary Islands – Spain, 2002. ELRA.
http://gate.ac.uk/sale/iesl03/iesl03.pdf.
[Tablan et al. 03]
V. Tablan, K. Bontcheva, D. Maynard, and H. Cunningham. OLLIE: On-Line
Learning for Information Extraction. In Proceedings of the HLT-NAACL Workshop
on Software Engineering and Architecture of Language Technology Systems,
Edmonton, Canada, 2003. http://gate.ac.uk/sale/hlt03/ollie-sealts.pdf.
[Unicode Consortium 96]
Unicode Consortium. The Unicode Standard, Version 2.0. Addison-Wesley, Reading,
MA, 1996.
[Ursu et al. 05]
C. Ursu, T. Tablan, H. Cunningham, and B. Popav. Digital media preservation
and access through semantically enhanced web-annotation. In Proceedings of the 2nd
European Workshop on the Integration of Knowledge, Semantic and Digital Media
Technologies (EWIMT 2005), London, UK, December 01 2005.
[van Rijsbergen 79]
C. van Rijsbergen. Information Retrieval. Butterworths, London, 1979.
[Wang et al. 05]
T. Wang, D. Maynard, W. Peters, K. Bontcheva, and H. Cunningham. Extracting
a domain ontology from linguistic resource based on relatedness measurements.
In Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web
Intelligence (WI 2005), pages 345–351, Compiegne, France, Septmeber 2005.
[Wang et al. 06]
T. Wang, Y. Li, K. Bontcheva, H. Cunningham, and J. Wang. Automatic
Extraction of Hierarchical Relations from Text. In Proceedings of the Third European
Semantic Web Conference (ESWC 2006), Budva, Montenegro, 2006.
[Witten & Frank 99]
I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and
Techniques with Java Implementations. Morgan Kaufmann, 1999.
[Wood et al. 03]
M. M. Wood, S. J. Lydon, V. Tablan, D. Maynard, and H. Cunningham. Using
parallel texts to improve recall in IE. In Recent Advances in Natural Language
Processing, Bulgaria, 2003.
[Wood et al. 04]
M. Wood, S. Lydon, V. Tablan, D. Maynard, and H. Cunningham. Populating
a Database from Parallel Texts using Ontology-based Information Extraction. In
Proceedings of NLDB 2004, 2004. http://gate.ac.uk/sale/nldb2004/NLDB.pdf.
[Yourdon 89]
E. Yourdon. Modern Structured Analysis. Prentice Hall, New York, 1989.
[Yourdon 96]
E. Yourdon. The Rise and Resurrection of the American Programmer. Prentice
Hall, New York, 1996.