mirror of
https://github.com/MarginaliaSearch/MarginaliaSearch.git
synced 2025-02-24 21:29:00 +00:00
14 lines
532 B
Markdown
14 lines
532 B
Markdown
# Language Processing
|
|
|
|
This library contains various tools used in language processing.
|
|
|
|
## Central Classes
|
|
|
|
* [SentenceExtractor](java/nu/marginalia/language/sentence/SentenceExtractor.java) -
|
|
Creates a [DocumentLanguageData](java/nu/marginalia/language/model/DocumentLanguageData.java) from a text, containing
|
|
its words, how they stem, POS tags, and so on.
|
|
|
|
## See Also
|
|
|
|
[converting-process/ft-keyword-extraction](../../processes/converting-process/ft-keyword-extraction) uses this code to identify which keywords
|
|
are important. |