MarginaliaSearch/code/processes/crawling-process/java/nu/marginalia
Viktor Lofgren f4d79c203d (crawler) Adjust revisit logic
The revisit logic wasn't sufficiently dampening the recrawl rate for websites that largely have not changed.

The logic now reacts more strongly to how much the content has changed, and applies upper and lower limits that depend on the size of the crawl set.
2024-07-16 15:12:38 +02:00
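
The idea in the commit message above can be sketched roughly as follows. This is not the code from the commit: the class name, method name, constants, and the doubling factor are all hypothetical, chosen only to illustrate a recrawl budget that scales with the amount of changed content and is clamped by limits derived from the crawl set size.

```java
// Hypothetical sketch, not the MarginaliaSearch implementation.
class RevisitBudgetSketch {
    // Assumed floor: always recheck at least this many documents (if available).
    static final int MIN_RECRAWL = 100;

    /**
     * Returns how many documents to refetch on a revisit.
     *
     * @param crawlSetSize number of documents seen in the previous crawl
     * @param changedCount number of those documents found to have changed
     */
    static int recrawlBudget(int crawlSetSize, int changedCount) {
        if (crawlSetSize <= 0) {
            return 0;
        }
        // Guard against inconsistent input.
        int changed = Math.max(0, Math.min(changedCount, crawlSetSize));

        // Lower limit: a tenth of the crawl set or MIN_RECRAWL, whichever is
        // larger, but never more than the crawl set itself.
        int lower = Math.min(crawlSetSize, Math.max(MIN_RECRAWL, crawlSetSize / 10));
        // Upper limit: never refetch more than was crawled last time.
        int upper = crawlSetSize;

        // Reactive part: the more documents changed, the larger the budget.
        // The factor 2 is an arbitrary illustrative choice.
        long proposed = 2L * changed;

        return (int) Math.max(lower, Math.min(upper, proposed));
    }

    public static void main(String[] args) {
        System.out.println(recrawlBudget(1000, 0));   // unchanged site: floor applies
        System.out.println(recrawlBudget(1000, 300)); // moderate change
        System.out.println(recrawlBudget(1000, 900)); // heavy change: capped
        System.out.println(recrawlBudget(50, 0));     // small site: fully rechecked
    }
}
```

With these assumed constants, a site that has barely changed is damped down to the lower limit, while a heavily changed site is recrawled up to the full size of its previous crawl set.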
crawl (crawler) Adjust revisit logic 2024-07-16 15:12:38 +02:00