Verwendungen von Package
opennlp.tools.tokenize
Packages, die opennlp.tools.tokenize verwenden
Package
Beschreibung
Experimental package related to converting various corpora to OpenNLP Format.
Experimental package related to the
Arvores Deitadas corpus
format.Experimental package related to the corpus format used by the "brat rapid annotation tool" (brat).
Experimental package related to the CoNNL-U format.
Experimental package related to the Irish Sentence Bank format.
Experimental package related to the
letsmt
corpus format.Experimental package related to the
MASC
corpus format.Experimental package related to the
MUC
corpus format.Package related to identifying sentence boundaries.
Contains classes related to finding token or words in a string.
This package contains classes for generating sequence features.
-
Von opennlp.tools.cmdline.parser verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.cmdline.tokenizer verwendete Klassen in opennlp.tools.tokenizeKlasseBeschreibungA marker interface for evaluating
tokenizers
.TheTokenizerModel
is the model used by a learnableTokenizer
.ATokenSample
is text with token spans. -
Von opennlp.tools.formats verwendete Klassen in opennlp.tools.tokenizeKlasseBeschreibungA
Detokenizer
merges tokens back to their detokenized representation.ATokenSample
is text with token spans. -
Von opennlp.tools.formats.ad verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.formats.brat verwendete Klassen in opennlp.tools.tokenizeKlasseBeschreibungThe interface for tokenizers, which segment a string into its tokens.The
TokenizerModel
is the model used by a learnableTokenizer
. -
Von opennlp.tools.formats.conllu verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.formats.convert verwendete Klassen in opennlp.tools.tokenizeKlasseBeschreibungA
Detokenizer
merges tokens back to their detokenized representation.ATokenSample
is text with token spans. -
Von opennlp.tools.formats.irishsentencebank verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.formats.letsmt verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.formats.masc verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.formats.muc verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.sentdetect verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.tokenize verwendete Klassen in opennlp.tools.tokenizeKlasseBeschreibungA
Detokenizer
merges tokens back to their detokenized representation.This enum contains an operation for every token to merge the tokens together to their detokenized form.A basicTokenizer
implementation which performs tokenization using character classes.Interface for context generators required forTokenizerME
.The interface for tokenizers, which segment a string into its tokens.A marker interface for evaluatingtokenizers
.The factory that providesTokenizer
default implementation and resources.TheTokenizerModel
is the model used by a learnableTokenizer
.ATokenSample
is text with token spans.A basicTokenizer
implementation which performs tokenization using white spaces. -
Von opennlp.tools.tokenize.lang verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.tokenize.lang.en verwendete Klassen in opennlp.tools.tokenize
-
Von opennlp.tools.util.featuregen verwendete Klassen in opennlp.tools.tokenize