MarginaliaSearch/code/libraries/term-frequency-dict/java/nu/marginalia/segmentation
Viktor Lofgren 8a81a480a1 (ngram) Only extract frequencies of title words, but use the body to increment the counters...
The sign of the counter is used to indicate whether a term has appeared as title.  Until it's seen in the title, it's provisionally saved as a negative count.
2024-04-12 18:08:31 +02:00
..
BasicSentenceExtractor.java (convert) Initial integration of segmentation data into the converter's keyword extraction logic 2024-03-19 14:28:42 +01:00
HasherGroup.java (convert) Initial integration of segmentation data into the converter's keyword extraction logic 2024-03-19 14:28:42 +01:00
NgramExtractorMain.java (ngram) Only extract frequencies of title words, but use the body to increment the counters... 2024-04-12 18:08:31 +02:00
NgramLexicon.java (ngram) Only extract frequencies of title words, but use the body to increment the counters... 2024-04-12 18:08:31 +02:00