MarginaliaSearch/code/processes/website-adjacencies-calculator

Website Adjacencies Calculator

This job updates the website similarity table based on the data in the domain and link tables in the URL database.

It performs a brute-force cosine similarity calculation across the entire link graph, comparing domains pairwise.
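
To illustrate the idea, here is a minimal sketch of a brute-force cosine similarity pass over a small in-memory link graph. It is not the code in this module. The class `AdjacencySketch`, the record `Adjacency`, the `threshold` parameter, and the choice to represent each domain by the set of domain ids linking to it are all assumptions made for the example. With binary link vectors, the cosine similarity of two domains reduces to the number of shared linking domains divided by the square root of the product of the two set sizes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class AdjacencySketch {
    /** One similarity edge between two domains (hypothetical type, for illustration only). */
    public record Adjacency(int domainA, int domainB, double similarity) {}

    /**
     * Cosine similarity of two binary vectors stored as sets:
     * |a ∩ b| / sqrt(|a| * |b|). Returns 0 if either set is empty.
     */
    public static double cosine(Set<Integer> a, Set<Integer> b) {
        if (a.isEmpty() || b.isEmpty()) {
            return 0.0;
        }
        // Iterate over the smaller set and probe the larger one.
        Set<Integer> smaller = a.size() <= b.size() ? a : b;
        Set<Integer> larger = (smaller == a) ? b : a;
        int shared = 0;
        for (int id : smaller) {
            if (larger.contains(id)) {
                shared++;
            }
        }
        return shared / Math.sqrt((double) a.size() * b.size());
    }

    /**
     * Compares every pair of domains once (O(n^2) pairs) and keeps
     * the pairs whose similarity is at least the given threshold.
     *
     * @param inboundLinks assumed input: domain id -> ids of domains linking to it
     */
    public static List<Adjacency> bruteForce(Map<Integer, Set<Integer>> inboundLinks,
                                             double threshold) {
        List<Integer> ids = new ArrayList<>(inboundLinks.keySet());
        List<Adjacency> result = new ArrayList<>();
        for (int i = 0; i < ids.size(); i++) {
            for (int j = i + 1; j < ids.size(); j++) {
                int a = ids.get(i);
                int b = ids.get(j);
                double similarity = cosine(inboundLinks.get(a), inboundLinks.get(b));
                if (similarity >= threshold) {
                    result.add(new Adjacency(a, b, similarity));
                }
            }
        }
        return result;
    }
}
```

The pairwise loop is what makes the approach brute force: the number of comparisons grows quadratically with the number of domains, so a real job over the full link graph would likely need additional pruning or parallelism that this sketch leaves out.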

These adjacencies power the explorer service and the random-websites functionality.