MarginaliaSearch/code/features-convert/keyword-extraction
Viktor Lofgren 0894822b68 (converter) Add position information to serialized document data
This is not hooked in yet, and the term metadata is still left intact.  It should probably shrink to a smaller representation (byte?) with the upcoming removal of the position mask.
2024-05-28 14:18:03 +02:00
..
java/nu/marginalia/keyword (converter) Add position information to serialized document data 2024-05-28 14:18:03 +02:00
test/nu/marginalia (convert) Initial integration of segmentation data into the converter's keyword extraction logic 2024-03-19 14:28:42 +01:00
test-resources/test-data (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
build.gradle (converter) Add position information to serialized document data 2024-05-28 14:18:03 +02:00
readme.md (docs) Begin un-fucking the docs after refactoring 2024-02-27 21:22:21 +01:00

Keyword Extraction

This code deals with identifying keywords in a document, their positions in the document, their important based on TF-IDF and their grammatical functions based on POS tags.

Central Classes

See Also