MarginaliaSearch/code/processes/live-crawler/java/nu/marginalia/livecrawler
Viktor Lofgren d6575dfee4 (live-crawler) Crude first-try process for live crawling #WIP
Some refactoring is still needed, but a dummy actor is in place, as is a process that crawls URLs from the livecapture service's RSS endpoints; the results make it all the way to being indexable.
2024-11-19 21:00:18 +01:00
LiveCrawlDataSet.java (live-crawler) Crude first-try process for live crawling #WIP 2024-11-19 21:00:18 +01:00
LiveCrawlerMain.java (live-crawler) Crude first-try process for live crawling #WIP 2024-11-19 21:00:18 +01:00
LiveCrawlerModule.java (live-crawler) Crude first-try process for live crawling #WIP 2024-11-19 19:35:01 +01:00
SimpleLinkScraper.java (live-crawler) Crude first-try process for live crawling #WIP 2024-11-19 19:35:01 +01:00