mirror of
https://github.com/MarginaliaSearch/MarginaliaSearch.git
synced 2025-02-24 13:19:02 +00:00
To ease the pressure on domains with lots of subdomains, such as substack, medium, and neocities, a per-domain mutex is added that limits crawling of these domains to one thread at a time. |
||
---|---|---|
.. | ||
fetcher | ||
revisit | ||
sitemap | ||
Cookies.java | ||
CrawlDataReference.java | ||
CrawlDelayTimer.java | ||
CrawledDocumentFactory.java | ||
CrawlerRetreiver.java | ||
CrawlerWarcResynchronizer.java | ||
DomainCrawlFrontier.java | ||
DomainLocks.java | ||
DomainProber.java | ||
LinkFilterSelector.java | ||
RateLimitException.java |
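
The per-domain mutex described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the contents of `DomainLocks.java`: the class name `PerDomainLockSketch`, the `DomainLock` handle, and the `topDomain` helper are all hypothetical. Reducing a host name to its last two labels is an assumed simplification that mishandles suffixes such as `co.uk`.

```java
import java.util.Locale;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Semaphore;

// Illustrative sketch only; every name here is hypothetical.
class PerDomainLockSketch {

    /** Handle returned to the caller; close() frees the domain for the next thread. */
    interface DomainLock extends AutoCloseable {
        @Override
        void close(); // narrowed so callers need not handle a checked exception
    }

    // One single-permit semaphore per top domain. Entries are never evicted
    // in this sketch, which a long-running crawler would likely want to fix.
    private final Map<String, Semaphore> semaphores = new ConcurrentHashMap<>();

    /** Assumed simplification: keep only the last two labels of the host name. */
    static String topDomain(String host) {
        String h = host.toLowerCase(Locale.ROOT);
        String[] labels = h.split("\\.");
        if (labels.length <= 2) {
            return h;
        }
        return labels[labels.length - 2] + "." + labels[labels.length - 1];
    }

    private Semaphore semaphoreFor(String host) {
        // computeIfAbsent is atomic, so two threads never create two semaphores
        // for the same key.
        return semaphores.computeIfAbsent(topDomain(host), k -> new Semaphore(1));
    }

    /** Blocks until no other thread holds the lock for this host's top domain. */
    DomainLock lock(String host) {
        Semaphore s = semaphoreFor(host);
        s.acquireUninterruptibly();
        return s::release;
    }

    /** Non-blocking variant: returns null if another thread holds the domain. */
    DomainLock tryLock(String host) {
        Semaphore s = semaphoreFor(host);
        if (!s.tryAcquire()) {
            return null;
        }
        return s::release;
    }
}
```

A crawler thread would wrap its work in `try (var held = locks.lock(host)) { ... }`, so the permit is released even if the crawl throws. A semaphore is used here rather than a `ReentrantLock` because a semaphore permit may be released by a different thread than the one that acquired it. Whether the real implementation needs that property is not stated in the source.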