MarginaliaSearch/code/features-convert
Viktor Lofgren 7a1edc0880 (term-freq) Reduce the number of low-relevance words in the dictionary
Using a statistical trick to reduce the number of low-frequency words in the dictionary, as they are numerous and not very informative.
2024-07-19 12:23:28 +02:00
..
adblock (*) Lift jetty and guava-dependencies 2024-05-23 14:20:01 +02:00
anchor-keywords (sentence-extractor) Add tag information to document language data 2024-07-18 15:57:48 +02:00
data-extractors (term-freq) Reduce the number of low-relevance words in the dictionary 2024-07-19 12:23:28 +02:00
keyword-extraction (sentence-extractor) Add tag information to document language data 2024-07-18 15:57:48 +02:00
pubdate (*) Lift jetty and guava-dependencies 2024-05-23 14:20:01 +02:00
reddit-json (*) Lift jetty and guava-dependencies 2024-05-23 14:20:01 +02:00
stackexchange-xml (*) Lift jetty and guava-dependencies 2024-05-23 14:20:01 +02:00
summary-extraction (keywords) Add position information to keywords 2024-05-28 16:54:53 +02:00
topic-detection (*) Lift jetty and guava-dependencies 2024-05-23 14:20:01 +02:00
readme.md Update features-convert/readme.md 2023-03-25 12:43:58 +01:00

Converter Features

Major features

Smaller features: