gate.creole.tokeniser
Class DefaultTokeniser
java.lang.Object
gate.util.AbstractFeatureBearer
gate.creole.AbstractResource
gate.creole.AbstractProcessingResource
gate.creole.AbstractLanguageAnalyser
gate.creole.tokeniser.DefaultTokeniser
- All Implemented Interfaces:
- ANNIEConstants, Executable, FeatureBearer, LanguageAnalyser, NameBearer, ProcessingResource, Resource, Serializable
- public class DefaultTokeniser
- extends AbstractLanguageAnalyser
A composed tokeniser containing a SimpleTokeniser and a Transducer.
The simple tokeniser tokenises the document and the transducer processes its
output.
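The two-stage composition described above can be sketched in plain Java. This is only an illustration under stated assumptions: `ComposedTokeniserSketch`, `simpleTokenise` and `transduce` are hypothetical stand-ins, not GATE classes or methods, and the rule applied in the second stage (rejoining a word, an apostrophe and a word) is an arbitrary example rather than the behaviour of the real transducer grammar.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a composed tokeniser; none of this is GATE API.
final class ComposedTokeniserSketch {

    // Stage 1: split text into runs of letters/digits and single punctuation marks.
    static List<String> simpleTokenise(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                current.append(c);
            } else {
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
                if (!Character.isWhitespace(c)) {
                    tokens.add(String.valueOf(c));
                }
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // True if the token consists only of letters.
    static boolean isWord(String s) {
        if (s.isEmpty()) {
            return false;
        }
        for (int i = 0; i < s.length(); i++) {
            if (!Character.isLetter(s.charAt(i))) {
                return false;
            }
        }
        return true;
    }

    // Stage 2: post-process stage 1 output. The rule here (assumed for
    // illustration) rejoins word + ' + word into a single token.
    static List<String> transduce(List<String> tokens) {
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < tokens.size()) {
            boolean joinable = i + 2 < tokens.size()
                    && tokens.get(i + 1).equals("'")
                    && isWord(tokens.get(i))
                    && isWord(tokens.get(i + 2));
            if (joinable) {
                out.add(tokens.get(i) + "'" + tokens.get(i + 2));
                i += 3;
            } else {
                out.add(tokens.get(i));
                i++;
            }
        }
        return out;
    }

    // The composed tokeniser: stage 2 consumes the output of stage 1.
    static List<String> tokenise(String text) {
        return transduce(simpleTokenise(text));
    }
}
```

Keeping the stages separate means the splitting rules and the post-processing rules can be changed independently, which is consistent with this class exposing separate tokeniser-rules and transducer-grammar URLs.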
- See Also:
- Serialized Form
Fields inherited from interface gate.creole.ANNIEConstants:
ANNOTATION_COREF_FEATURE_NAME, DATE_ANNOTATION_TYPE, DOCUMENT_COREF_FEATURE_NAME, LOCATION_ANNOTATION_TYPE, LOOKUP_ANNOTATION_TYPE, LOOKUP_CLASS_FEATURE_NAME, LOOKUP_MAJOR_TYPE_FEATURE_NAME, LOOKUP_MINOR_TYPE_FEATURE_NAME, LOOKUP_ONTOLOGY_FEATURE_NAME, MONEY_ANNOTATION_TYPE, ORGANIZATION_ANNOTATION_TYPE, PERSON_ANNOTATION_TYPE, PERSON_GENDER_FEATURE_NAME, PR_NAMES, SENTENCE_ANNOTATION_TYPE, SPACE_TOKEN_ANNOTATION_TYPE, TOKEN_ANNOTATION_TYPE, TOKEN_CATEGORY_FEATURE_NAME, TOKEN_KIND_FEATURE_NAME, TOKEN_LENGTH_FEATURE_NAME, TOKEN_ORTH_FEATURE_NAME, TOKEN_STRING_FEATURE_NAME
Methods inherited from class gate.creole.AbstractResource:
checkParameterValues, getName, getParameterValue, getParameterValue, removeResourceListeners, setName, setParameterValue, setParameterValue, setParameterValues, setParameterValues, setResourceListeners
DEF_TOK_DOCUMENT_PARAMETER_NAME
public static final String DEF_TOK_DOCUMENT_PARAMETER_NAME
- See Also:
- Constant Field Values
DEF_TOK_ANNOT_SET_PARAMETER_NAME
public static final String DEF_TOK_ANNOT_SET_PARAMETER_NAME
- See Also:
- Constant Field Values
DEF_TOK_TOKRULES_URL_PARAMETER_NAME
public static final String DEF_TOK_TOKRULES_URL_PARAMETER_NAME
- See Also:
- Constant Field Values
DEF_TOK_GRAMRULES_URL_PARAMETER_NAME
public static final String DEF_TOK_GRAMRULES_URL_PARAMETER_NAME
- See Also:
- Constant Field Values
DEF_TOK_ENCODING_PARAMETER_NAME
public static final String DEF_TOK_ENCODING_PARAMETER_NAME
- See Also:
- Constant Field Values
DefaultTokeniser
public DefaultTokeniser()
init
public Resource init()
throws ResourceInstantiationException
- Initialise this resource, and return it.
- Specified by:
- init in interface Resource
- Overrides:
- init in class AbstractProcessingResource
- Throws:
- ResourceInstantiationException
execute
public void execute()
throws ExecutionException
- Description copied from interface: Executable
- Starts the execution of this executable.
- Specified by:
- execute in interface Executable
- Overrides:
- execute in class AbstractProcessingResource
- Throws:
- ExecutionException
interrupt
public void interrupt()
- Notifies the processing resources (PRs) composed by this tokeniser that they should stop their execution as soon as possible.
- Specified by:
- interrupt in interface Executable
- Overrides:
- interrupt in class AbstractProcessingResource
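A request to stop "as soon as possible" is usually cooperative: each stage checks a flag between units of work. The sketch below illustrates forwarding such a request from a composite to its parts. It is an assumption about the general pattern, not a copy of this class's implementation, and `InterruptSketch`, `Stage` and `Composite` are hypothetical names.

```java
// Hypothetical stand-in types; none of these are GATE classes.
final class InterruptSketch {

    // A stage that works in small steps and checks a flag before each step.
    static final class Stage {
        private volatile boolean interrupted = false;
        private int stepsDone = 0;

        // Ask the stage to stop at its next check.
        void interrupt() {
            interrupted = true;
        }

        // Runs up to maxSteps steps; returns false if it stopped early.
        boolean run(int maxSteps) {
            for (int i = 0; i < maxSteps; i++) {
                if (interrupted) {
                    return false;
                }
                stepsDone++;
            }
            return true;
        }

        int stepsDone() {
            return stepsDone;
        }
    }

    // A composite that owns two stages and forwards interrupt() to both.
    static final class Composite {
        final Stage first = new Stage();
        final Stage second = new Stage();

        void interrupt() {
            first.interrupt();
            second.interrupt();
        }

        // Runs the stages in order; the second runs only if the first completed.
        boolean execute(int steps) {
            return first.run(steps) && second.run(steps);
        }
    }
}
```

The flag is `volatile` so that a call to `interrupt()` from another thread becomes visible to the thread running `execute`. The sketch never resets the flag; a reusable resource would need to clear it before each run.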
setTokeniserRulesURL
public void setTokeniserRulesURL(URL tokeniserRulesURL)
getTokeniserRulesURL
public URL getTokeniserRulesURL()
setEncoding
public void setEncoding(String encoding)
getEncoding
public String getEncoding()
setTransducerGrammarURL
public void setTransducerGrammarURL(URL transducerGrammarURL)
getTransducerGrammarURL
public URL getTransducerGrammarURL()
setAnnotationSetName
public void setAnnotationSetName(String annotationSetName)
getAnnotationSetName
public String getAnnotationSetName()