MarginaliaSearch/code/processes/converting-process/ft-keyword-extraction
Viktor Lofgren e0c0ed27bc (keyword-extraction) Clean up code and add tests for position and spans calculation
This code has been a bit of a mess and historically significantly flaky, so some test coverage is more than overdue.
2024-12-08 14:14:52 +01:00
..
java/nu/marginalia/keyword (keyword-extraction) Clean up code and add tests for position and spans calculation 2024-12-08 14:14:52 +01:00
test/nu/marginalia (keyword-extraction) Clean up code and add tests for position and spans calculation 2024-12-08 14:14:52 +01:00
test-resources/test-data (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
build.gradle (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
readme.md (doc) Correct dead links and stale information in the docs 2024-09-13 11:01:05 +02:00

Keyword Extraction

This code deals with identifying keywords in a document, their positions in the document, their important based on TF-IDF and their grammatical functions based on POS tags.

Central Classes

See Also