SSLST 2011 Text Mining Course
Lecturer: Dr Diana Maynard
Schedule
- Module 1: Wed 31 August: 10.00 - 12.00: Introduction to Text Mining
- Module 2: Wed 31 August: 14.00 - 16.00: Semantic Annotation
- Module 3: Thur 1 Sep: 12.00 - 14.00: Opinion Mining
- Module 4: Fri 2 Sep: 12.00 - 14.00: Applications
Note: You will need to download and install the latest version of GATE if you are planning to attend this course.
You can download GATE here.
You will also need to download all the hands-on material.
You can now download the version of the news texts which have a Key annotation set, for use with the various evaluation tools in Module 2 here. Note: do NOT run ANNIE over this corpus before evaluating, otherwise you will lose the Key annotation set.
Module 1
- Introduction: goals and basic components of text mining
- NLP components of a text mining system
- GATE and text mining tools
- Hands-on: experiment with NLP components in GATE
- Module 1 slides
Module 2
- Semantic annotation: ontology-based information extraction, ontology population
- Performance evaluation: metrics and tools for evaluation, acceptability criteria
- Hands-on: evaluation tools in GATE
- Module 2 slides
Module 3
- Opinion mining: rule-based and machine learning techniques; analysis of social media
- Hands-on: use ANNIC to view sentiment annotations and look for patterns
- Module 3 slides
Module 4
- System development cycle, semantic annotation project
- Applications: business intelligence, multimedia annotation, news search
- Hands-on: MIMIR and semantic search
- Module 4 slides - part 1
- Module 4 slides - part 2




