MarginaliaSearch/code/processes/converting-process/ft-keyword-extraction
Viktor Lofgren fdc3efa250 (setup) Remove OpenNLP tokenization model
This update eliminates all occurrences of the OpenNLP token model from the setup script, configuration, and test files, as this model file is no longer used.
2024-11-28 16:03:05 +01:00
..
java/nu/marginalia/keyword (keyword-extractor) Fix bug where external link keywords weren't generating document spans as intended 2024-11-25 18:30:22 +01:00
test/nu/marginalia (setup) Remove OpenNLP tokenization model 2024-11-28 16:03:05 +01:00
test-resources/test-data (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
build.gradle (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
readme.md (doc) Correct dead links and stale information in the docs 2024-09-13 11:01:05 +02:00

Keyword Extraction

This code deals with identifying keywords in a document, their positions in the document, their important based on TF-IDF and their grammatical functions based on POS tags.

Central Classes

See Also