MarginaliaSearch/code/processes/crawling-process/model/java/nu/marginalia
2024-12-11 17:01:52 +01:00
..
io Merge branch 'master' into live-search 2024-11-21 16:00:20 +01:00
model (model) Remove deprecated fields from CrawledDocument and CrawledDomain 2024-11-20 15:27:05 +01:00
parquet/crawldata (crawler) Reintroduce content type probing and clean out bad content type data from the existing crawl sets 2024-12-11 17:01:52 +01:00
ContentTypes.java (crawler) Reintroduce content type probing and clean out bad content type data from the existing crawl sets 2024-12-11 17:01:52 +01:00