MarginaliaSearch/code/processes/crawling-process/java/nu/marginalia/crawl
2024-10-05 17:55:59 +02:00
..
fetcher (crawler, EXPERIMENT) Disable content type probing and use Accept header instead 2024-09-30 14:53:01 +02:00
logic (crawler) Refactor boundary between CrawlerRetreiver and HttpFetcherImpl 2024-09-24 15:08:22 +02:00
retreival (crawler) Properly enqueue links from the root document in the crawler 2024-10-05 17:49:39 +02:00
spec (*) Remove the crawl spec abstraction 2024-10-03 13:41:17 +02:00
warc (crawler) Code quality 2024-04-22 15:37:35 +02:00
AbortMonitor.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
CrawlerMain.java (crawler) Properly enqueue links from the root document in the crawler 2024-10-05 17:55:59 +02:00
CrawlerModule.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00