MarginaliaSearch/code/processes/live-crawling-process/java/nu/marginalia
Viktor Lofgren 52eb5bc84f (live-crawler) Keep track of bad URLs
To avoid hammering the same invalid URLs for up to two months, URLs that fail to fetch correctly are on a dice roll added to a bad URLs table, that prevents further attempts at fetching them.
2024-11-22 00:55:46 +01:00
..
livecrawler (live-crawler) Keep track of bad URLs 2024-11-22 00:55:46 +01:00