MarginaliaSearch/code/functions/search-query
Viktor Lofgren 22b35d5d91 (sentence-extractor) Add tag information to document language data
Decorates DocumentSentences with information about which HTML tags they are nested in, and removes some redundant data on this rather memory hungry object.  Separator information is encoded as a bit set instead of an array of integers.

The change also cleans up the SentenceExtractor class a fair bit.  It no longer extracts ngrams, and a significant amount of redundant operations were removed as well.  This is still a pretty unpleasant class to work in, but this is the first step in making it a little bit better.
2024-07-18 15:57:48 +02:00
..
api (index) Reduce the number of abstractions around result ranking 2024-07-16 08:18:54 +02:00
java/nu/marginalia (sentence-extractor) Add tag information to document language data 2024-07-18 15:57:48 +02:00
test/nu/marginalia (test) Correcting flaky tests 2024-07-11 16:13:23 +02:00
build.gradle (*) Lift jetty and guava-dependencies 2024-05-23 14:20:01 +02:00
readme.md Clean up documentation and rename domain-links to link-graph 2024-02-28 11:40:39 +01:00

The search query subsystem is responsible for parsing a query, translating it to a request, and then dispatching it to the appropriate index nodes and translating the responses back again.