MarginaliaSearch/code/libraries/term-frequency-dict
Viktor Lofgren 8a81a480a1 (ngram) Only extract frequencies of title words, but use the body to increment the counters...
The sign of the counter is used to indicate whether a term has appeared as title.  Until it's seen in the title, it's provisionally saved as a negative count.
2024-04-12 18:08:31 +02:00
..
java/nu/marginalia (ngram) Only extract frequencies of title words, but use the body to increment the counters... 2024-04-12 18:08:31 +02:00
test/nu/marginalia/segmentation (ngram) Clean up ngram lexicon code 2024-04-12 17:45:06 +02:00
build.gradle (sys) Upgrade to JDK22 2024-03-21 14:33:27 +01:00
readme.md Clean up documentation and rename domain-links to link-graph 2024-02-28 11:40:39 +01:00

Term Frequency Dictionary

This dictionary is used by various parts of the system to evaluate for example the TF-IDF score of a keyword.

Central Classes