MarginaliaSearch/code/libraries/language-processing
Viktor Lofgren a18edad04c (index) Remove stopword list from converter
We want to index all words in the document, stopword handling is moved to the index where we change the semantics to elide inclusion checks in query construction for a very short list of words tentatively hard-coded in SearchTerms.
2024-08-15 09:36:50 +02:00
..
java/nu/marginalia/language (index) Remove stopword list from converter 2024-08-15 09:36:50 +02:00
resources/dictionary (index) Remove stopword list from converter 2024-08-15 09:36:50 +02:00
test/nu/marginalia/language (dld) Refactor DocumentLanguageData 2024-07-19 12:24:55 +02:00
test-resources/html (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
build.gradle (*) Lift jetty and guava-dependencies 2024-05-23 14:20:01 +02:00
readme.md Clean up documentation and rename domain-links to link-graph 2024-02-28 11:40:39 +01:00

Language Processing

This library contains various tools used in language processing.

Central Classes

See Also

features-convert/keyword-extraction uses this code to identify which keywords are important.