MarginaliaSearch/code/libraries/language-processing/readme.md
2024-02-27 21:22:21 +01:00

16 lines
608 B
Markdown

# Language Processing
This library contains various tools used in language processing.
## Central Classes
* [SentenceExtractor](java/nu/marginalia/language/sentence/SentenceExtractor.java) -
Creates a [DocumentLanguageData](java/nu/marginalia/language/model/DocumentLanguageData.java) from a text, containing
its words, how they stem, POS tags, and so on.
## See Also
[features-convert/keyword-extraction](../../features-convert/keyword-extraction) uses this code to identify which keywords
are important.
[features-qs/query-parser](../../features-qs/query-parser) also does some language processing.