MarginaliaSearch/code/execution/java/nu/marginalia/actor/task
Viktor Lofgren a91ab4c203 (live-crawler) Crude first-try process for live crawling #WIP
Some refactoring is still needed, but an dummy actor is in place and a process that crawls URLs from the livecapture service's RSS endpoints; that makes it all the way to being indexable.
2024-11-19 19:35:01 +01:00
..
ActorProcessWatcher.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
ConvertActor.java (chore) Remove use of deprecated STR.-style string templates 2024-11-11 18:02:28 +01:00
ConvertAndLoadActor.java (chore) Remove lombok 2024-11-11 21:14:38 +01:00
CrawlActor.java (*) Remove the crawl spec abstraction 2024-10-03 13:41:17 +02:00
DownloadSampleActor.java (download-sample) Break apart actor for better error recovery 2024-10-04 13:39:43 +02:00
ExportAtagsActor.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
ExportDataActor.java Clean up documentation and rename domain-links to link-graph 2024-02-28 11:40:39 +01:00
ExportFeedsActor.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
ExportSampleDataActor.java (chore) Remove use of deprecated STR.-style string templates 2024-11-11 18:02:28 +01:00
ExportSegmentationModelActor.java (ngrams) Remove the vestigial logic for capturing permutations of n-grams 2024-04-11 18:12:01 +02:00
ExportTermFreqActor.java (actor) Add a feed scraping actor 2024-09-28 12:33:29 +02:00
LiveCrawlActor.java (live-crawler) Crude first-try process for live crawling #WIP 2024-11-19 19:35:01 +01:00
RecrawlSingleDomainActor.java (crawl) Add new functionality for re-crawling a single domain 2024-07-05 15:31:55 +02:00
RestoreBackupActor.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
TriggerAdjacencyCalculationActor.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00