MarginaliaSearch/code/processes/crawling-process/java/nu/marginalia/crawl/retreival
2024-10-05 17:49:39 +02:00
Name                            Last commit                                                               Date
revisit                         (crawler) Refactor                                                        2024-09-23 17:51:07 +02:00
sitemap                         (crawler) Refactor                                                        2024-09-23 17:51:07 +02:00
CrawlDataReference.java         (*) Remove the crawl spec abstraction                                     2024-10-03 13:41:17 +02:00
CrawlDelayTimer.java            (crawler) Refactor                                                        2024-09-23 17:51:07 +02:00
CrawlerRetreiver.java           (crawler) Properly enqueue links from the root document in the crawler    2024-10-05 17:49:39 +02:00
CrawlerWarcResynchronizer.java  (crawler) Refactor                                                        2024-09-23 17:51:07 +02:00
DomainCrawlFrontier.java        (crawler) Refactor                                                        2024-09-23 17:51:07 +02:00
DomainProber.java               (crawler) Refactor boundary between CrawlerRetreiver and HttpFetcherImpl  2024-09-24 15:08:22 +02:00