MarginaliaSearch/code/processes/live-crawling-process/test/nu/marginalia/livecrawler
Viktor Lofgren 0ca43f0c9c (live-crawler) Improve live crawler short-circuit logic
We should not wait until we've fetched robots.txt to decide whether we have any data to fetch! Waiting makes the live crawler very slow and leads to unnecessary requests.
2024-12-27 20:54:42 +01:00
LiveCrawlDataSetTest.java (live-crawler) Keep track of bad URLs 2024-11-22 00:55:46 +01:00
SimpleLinkScraperTest.java (live-crawler) Improve live crawler short-circuit logic 2024-12-27 20:54:42 +01:00
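
The short-circuit described in the commit message can be illustrated with a minimal Java sketch. It is not the code from this commit: the class `ShortCircuitSketch`, the method `selectUrlsToFetch`, and its parameters are hypothetical names chosen for illustration, and robots.txt handling is reduced to a lazily supplied predicate. The idea shown is only the ordering: check locally whether anything is left to fetch, and request robots.txt only if something is.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.function.Predicate;
import java.util.function.Supplier;

// Hypothetical sketch; none of these names come from the MarginaliaSearch code base.
final class ShortCircuitSketch {

    /**
     * Returns the URLs that should be fetched for one domain.
     *
     * @param candidates   URLs discovered for the domain
     * @param alreadyKnown URLs we already hold data for, or have marked as bad
     * @param robotsRules  lazily fetches robots.txt and returns an "is allowed" predicate
     */
    static List<String> selectUrlsToFetch(List<String> candidates,
                                          Set<String> alreadyKnown,
                                          Supplier<Predicate<String>> robotsRules) {
        // Step 1: cheap local check, no network access involved.
        List<String> fresh = new ArrayList<>();
        for (String url : candidates) {
            if (!alreadyKnown.contains(url)) {
                fresh.add(url);
            }
        }

        // Step 2: short-circuit before robots.txt is ever requested.
        if (fresh.isEmpty()) {
            return List.of();
        }

        // Step 3: only now pay for the robots.txt request.
        Predicate<String> allowed = robotsRules.get();
        List<String> result = new ArrayList<>();
        for (String url : fresh) {
            if (allowed.test(url)) {
                result.add(url);
            }
        }
        return result;
    }
}
```

Passing the robots.txt lookup as a `Supplier` keeps the network request out of the path where every candidate URL is already known, which is the case the commit message says was slow.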