MarginaliaSearch/code/processes/crawling-process/model/java/nu/marginalia
2025-01-26 13:18:14 +01:00
io (converter) Reduce lock contention in converter by separating the processing of full and simple-track domains 2025-01-26 13:18:14 +01:00
model (crawler) Add favicon data to domain state db in its own table 2025-01-22 11:41:20 +01:00
parquet/crawldata (crawler) Migrate away from using OkHttp in the crawler, use Java's HttpClient instead. 2025-01-19 15:07:11 +01:00
slop Merge branch 'master' into slop-crawl-data-spike 2025-01-21 13:32:58 +01:00
ContentTypes.java (crawler) Reintroduce content type probing and clean out bad content type data from the existing crawl sets 2024-12-11 17:01:52 +01:00