MarginaliaSearch/code/features-convert/keyword-extraction
Viktor Lofgren d36055a2d0 (keyword-extractor) Retire TfIdfHigh WordFlag
This will bring the word flags count down to 8, and let us pack every value in a byte.
2024-07-17 13:54:39 +02:00
..
java/nu/marginalia/keyword (keyword-extractor) Retire TfIdfHigh WordFlag 2024-07-17 13:54:39 +02:00
test/nu/marginalia (coded-sequence) Replace GCS usage with an interface 2024-07-16 14:37:50 +02:00
test-resources/test-data (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
build.gradle (converter) Amend existing modifications to use gamma coded positions lists 2024-05-30 14:20:36 +02:00
readme.md (docs) Begin un-fucking the docs after refactoring 2024-02-27 21:22:21 +01:00

Keyword Extraction

This code deals with identifying keywords in a document, their positions in the document, their important based on TF-IDF and their grammatical functions based on POS tags.

Central Classes

See Also