MarginaliaSearch/code/features-convert/keyword-extraction
Viktor Lofgren e8ab1e14e0 (keyword-extraction) Update upper limit to number of positions per word
After real-world testing, it was determined that 256 was still a bit too low, but 512 seems like it will only truncate outlier cases like assembly code and certain tabulations.
2024-07-02 20:52:32 +02:00
..
java/nu/marginalia/keyword (keyword-extraction) Update upper limit to number of positions per word 2024-07-02 20:52:32 +02:00
test/nu/marginalia (keyword) Increase the work area for position encoding 2024-06-28 16:42:39 +02:00
test-resources/test-data (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
build.gradle (converter) Amend existing modifications to use gamma coded positions lists 2024-05-30 14:20:36 +02:00
readme.md (docs) Begin un-fucking the docs after refactoring 2024-02-27 21:22:21 +01:00

Keyword Extraction

This code deals with identifying keywords in a document, their positions in the document, their important based on TF-IDF and their grammatical functions based on POS tags.

Central Classes

See Also