| ⇢ | A | SanitizerFactory added | |
| ⇢ | A | JaTinySegmenterTokenizer added | |
| ⇢ | A | PunctuationRegExTokenizer added | |
| ⇢ | A | NGramTokenizerTest added | |
| ⇢ | A | CombinedNGramTokenizerTest added | |
| ⇢ | A | PunctuationRegExTokenizerTest added | |
| ⇢ | A | NullLanguageDetector added | |
| ⇢ | A | CombinedSanitizerTextStopwordTest added | |
| ⇢ | A | TextCatLanguageDetector added | |
| ⇢ | A | JaTokenizerTest added | |
| A | ↛ | TokenizerTest removed | |
| A | ↛ | StopwordAnalyzer removed | |
| A | ↛ | Tokenizer removed | |
| A | ↛ | SampleTextStopwordTest removed | |
| A | ↛ | StopwordAnalyzerTest removed | |
| ⇢ | D | JaTinySegmenterTokenizer::segment() added | |
| ⇢ | D | Sanitizer::sanitizeWith() added | |
| ⇢ | C | JaCompoundGroupTokenizer::tokenize() added | |
| ⇢ | C | NGramTokenizer::createNGrams() added | |
| ⇢ | B | NGramTokenizerTest::stringProvider() added | |
| ⇢ | B | JaTinySegmenterTokenizerTest::stringProvider() added | |
| ⇢ | B | IcuWordBoundaryTokenizer::createTokens() added | |
| ⇢ | B | CdbStopwordAnalyzer::createCdbByLanguage() added | |
| ⇢ | B | IcuWordBoundaryTokenizerTest::stringProvider() added | |
| ⇢ | B | NormalizerTest::testReduceLengthTo() added | |
| D | ↛ | Sanitizer::sanitizeBy() removed | |
| B | ↛ | StopwordAnalyzer::loadListFromCache() removed | |
| B | ↛ | TokenizerTest::stringProvider() removed | |
| B | ↛ | StopwordAnalyzerTest::defaultListStopWordsProvider... removed | |
| A | ↛ | SanitizerTest::stringProvider() removed | |
| A | ↛ | TokenizerTest::testUnknownOption() removed | |
| A | ↛ | StopwordAnalyzer::setCustomStopwordList() removed | |
| A | ↛ | TokenizerTest::testTokenize() removed | |
| A | ↛ | SampleTextStopwordTest::testByLanguage() removed | |
| A | ↛ | SanitizerTest::testSanitizeByStopwords() removed | |