MarginaliaSearch/code/processes/crawling-process/model/java/nu/marginalia
2025-01-19 15:07:11 +01:00
..
io (live-crawler) Improve live crawler short-circuit logic 2024-12-27 20:54:42 +01:00
model (crawler) Migrate away from using OkHttp in the crawler, use Java's HttpClient instead. 2025-01-19 15:07:11 +01:00
parquet/crawldata (crawler) Migrate away from using OkHttp in the crawler, use Java's HttpClient instead. 2025-01-19 15:07:11 +01:00
ContentTypes.java (crawler) Reintroduce content type probing and clean out bad content type data from the existing crawl sets 2024-12-11 17:01:52 +01:00