mirror of
https://github.com/MarginaliaSearch/MarginaliaSearch.git
synced 2025-02-24 13:19:02 +00:00
![]() Replaces Parquet output and processing with the new Slop-based format. Includes data migration functionality, updates to handling and writing of crawl data, and introduces support for SLOP in domain readers and converters. |
||
---|---|---|
.. | ||
crawldata/format | ||
CrawledDomainReader.java | ||
CrawlerOutputFile.java | ||
SerializableCrawlDataStream.java |