MarginaliaSearch/code/libraries/language-processing
Viktor Lofgren 7f7021ce64 (sentence-extractor) Fix resource leak in sentence extractor
The code would always re-initialize the static ngramLexicon and rdrposTagger fields with new instances even if they were already instantiated, leading to a ton of unnecessary RAM allocation.

The modified behavior checks for nullity before creating a new instance.
2024-04-05 18:52:58 +02:00
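
The fix described in the commit message above can be illustrated with a minimal sketch. This is not the code from the repository: `ExpensiveModel`, `Extractor`, and the `loads` counter are hypothetical stand-ins for the static `ngramLexicon` and `rdrposTagger` fields, and the `synchronized` block is an added assumption for thread safety that the commit message does not mention.

```java
import java.util.concurrent.atomic.AtomicInteger;

public class LazyStaticInitSketch {
    // Counts how many times the expensive resource is built (demonstration only).
    static final AtomicInteger loads = new AtomicInteger();

    // Hypothetical stand-in for a large shared resource such as a lexicon or tagger.
    static final class ExpensiveModel {
        ExpensiveModel() { loads.incrementAndGet(); }
    }

    // Shared by every Extractor instance; should be built at most once.
    private static ExpensiveModel model;

    static class Extractor {
        Extractor() {
            // Assumption: guard the check with a lock so concurrent constructors
            // cannot both observe null and build two copies.
            synchronized (LazyStaticInitSketch.class) {
                // The null check is the point of the fix: without it, every
                // constructor call would allocate a fresh model.
                if (model == null) {
                    model = new ExpensiveModel();
                }
            }
        }
    }

    public static void main(String[] args) {
        new Extractor();
        new Extractor();
        new Extractor();
        System.out.println("models built: " + loads.get()); // prints "models built: 1"
    }
}
```

Without the null check, the same three constructor calls would build three models, which corresponds to the unnecessary allocation the commit message describes.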
java/nu/marginalia/language (sentence-extractor) Fix resource leak in sentence extractor 2024-04-05 18:52:58 +02:00
resources/dictionary (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
test/nu/marginalia/language (qs) Retire NGramBloomFilter, integrate new segmentation model instead 2024-03-19 10:42:09 +01:00
test-resources/html (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
build.gradle (sys) Upgrade to JDK22 2024-03-21 14:33:27 +01:00
readme.md Clean up documentation and rename domain-links to link-graph 2024-02-28 11:40:39 +01:00

Language Processing

This library contains various tools used in language processing.

Central Classes

See Also

features-convert/keyword-extraction uses this code to identify which keywords are important.