MarginaliaSearch/code/processes/crawling-process
Viktor Lofgren 667b0ca0b0 (converter, WIP) Refactor CrawledDomainReader to not return iterators.
Instead return a closable class SerializableCrawlDataStream.
2023-07-24 16:28:30 +02:00
src (converter, WIP) Refactor CrawledDomainReader to not return iterators. 2023-07-24 16:28:30 +02:00
build.gradle (crawler) WIP 2023-07-20 21:05:16 +02:00
readme.md More restructuring, big bug fixes in keyword extraction. 2023-03-13 17:39:53 +01:00

Crawling Process

The crawling process downloads HTML documents and saves them into per-domain snapshots.
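
As a rough illustration of what "per-domain snapshots" could look like, the sketch below groups already-fetched documents by host name and writes one file per domain. It is not the crawler's actual code or on-disk format: `SnapshotSketch`, `FetchedDocument`, `groupByDomain`, `writeSnapshot`, the `.snapshot` suffix, and the tab-separated Base64 line layout are all hypothetical names and choices made for this example, and it assumes every URL is absolute so that a host can be extracted.

```java
import java.io.IOException;
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Base64;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SnapshotSketch {
    /** Hypothetical holder for one downloaded page (not a class from the project). */
    record FetchedDocument(String url, String html) {}

    /** Groups documents by the host part of their URL, preserving first-seen order. */
    static Map<String, List<FetchedDocument>> groupByDomain(List<FetchedDocument> docs) {
        Map<String, List<FetchedDocument>> byDomain = new LinkedHashMap<>();
        for (FetchedDocument doc : docs) {
            // Assumption: URLs are absolute, so getHost() is non-null.
            String domain = URI.create(doc.url()).getHost();
            byDomain.computeIfAbsent(domain, d -> new ArrayList<>()).add(doc);
        }
        return byDomain;
    }

    /** Writes one snapshot file for a domain: one line per document, "url<TAB>base64(html)". */
    static Path writeSnapshot(Path outputDir, String domain, List<FetchedDocument> docs)
            throws IOException {
        Files.createDirectories(outputDir);
        StringBuilder body = new StringBuilder();
        for (FetchedDocument doc : docs) {
            // Base64 keeps each document on a single line regardless of its content.
            String encoded = Base64.getEncoder()
                    .encodeToString(doc.html().getBytes(StandardCharsets.UTF_8));
            body.append(doc.url()).append('\t').append(encoded).append('\n');
        }
        Path file = outputDir.resolve(domain + ".snapshot");
        Files.writeString(file, body.toString(), StandardCharsets.UTF_8);
        return file;
    }

    public static void main(String[] args) throws IOException {
        List<FetchedDocument> docs = List.of(
                new FetchedDocument("https://a.example/index.html", "<html>A1</html>"),
                new FetchedDocument("https://b.example/", "<html>B1</html>"),
                new FetchedDocument("https://a.example/about.html", "<html>A2</html>"));

        Path outputDir = Files.createTempDirectory("snapshots");
        for (Map.Entry<String, List<FetchedDocument>> entry : groupByDomain(docs).entrySet()) {
            Path written = writeSnapshot(outputDir, entry.getKey(), entry.getValue());
            System.out.println(entry.getKey() + " -> " + written);
        }
    }
}
```

Keeping one file per domain means a later stage can process a single domain by reading a single file, without scanning the output of the whole crawl.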

Central Classes

See Also