MarginaliaSearch/code/execution
Viktor Lofgren aebb2652e8 (wip) Extract and encode spans data
Refactoring keyword extraction to extract spans information.

Modifying the intermediate storage of converted data to use the new slop library, which is allows for easier storage of ad-hoc binary data like spans and positions.

This is a bit of a katamari damacy commit that ended up dragging along a bunch of other fairly tangentially related changes that are hard to break out into separate commits after the fact.  Will push as-is to get back to being able to do more isolated work.
2024-07-27 11:44:13 +02:00
..
api (crawl) Add new functionality for re-crawling a single domain 2024-07-05 15:31:55 +02:00
java/nu/marginalia (wip) Extract and encode spans data 2024-07-27 11:44:13 +02:00
test/nu/marginalia/svc (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
build.gradle (wip) Extract and encode spans data 2024-07-27 11:44:13 +02:00
readme.md Clean up documentation and rename domain-links to link-graph 2024-02-28 11:40:39 +01:00

The execution subsystem is responsible for the execution of long running tasks on each index node. It lives in the executor-service module.

It accomplishes this using the message queue and actor library, which permits program state to survive crashes and reboots.

The subsystem exposes four APIs:

  • Execution API - for starting and stopping tasks, also contains miscellaneous commands
  • Crawl API - for managing the crawl workflow
  • Sideload API - for sideloading data
  • Export API - for exporting data