mirror of
https://github.com/MarginaliaSearch/MarginaliaSearch.git
synced 2025-02-24 13:19:02 +00:00
Turns out throttling to only 1 lock per domain means the crawler chokes hard on large hosting websites such as wordpress. Giving these a slightly larger allowance.

| Name |
|---|
| retreival |
| spec |
| warc |
| AbortMonitor.java |
| CrawlerMain.java |
| CrawlerModule.java |
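
The commit message above describes limiting concurrent crawl tasks per domain, with a somewhat larger allowance for large hosting sites. The following is a minimal sketch of that idea and not the code from this repository: the class name `DomainLocks`, the methods `tryLock` and `unlock`, the permit counts, and the contents of the large-host set are all assumptions made for illustration.

```java
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Semaphore;

// Hypothetical sketch of per-domain throttling; names and values are assumptions.
class DomainLocks {
    // Assumed allowances: the commit only says "slightly larger" for big hosts.
    private static final int DEFAULT_PERMITS = 1;
    private static final int LARGE_HOST_PERMITS = 2;

    // Illustrative list of large hosting domains; only wordpress is named in the commit.
    private static final Set<String> LARGE_HOSTS = Set.of("wordpress.com");

    // One semaphore per top-level domain, created lazily on first use.
    private final Map<String, Semaphore> locks = new ConcurrentHashMap<>();

    // How many crawl tasks may run at once against the given domain.
    static int permitsFor(String topDomain) {
        return LARGE_HOSTS.contains(topDomain.toLowerCase())
                ? LARGE_HOST_PERMITS
                : DEFAULT_PERMITS;
    }

    private Semaphore semaphoreFor(String topDomain) {
        return locks.computeIfAbsent(topDomain.toLowerCase(),
                d -> new Semaphore(permitsFor(d)));
    }

    // Try to claim a crawl slot without blocking; returns false if the domain is saturated.
    boolean tryLock(String topDomain) {
        return semaphoreFor(topDomain).tryAcquire();
    }

    // Give a slot back; call only after a successful tryLock for the same domain.
    void unlock(String topDomain) {
        semaphoreFor(topDomain).release();
    }
}
```

With a single permit, a second crawl task for the same domain is refused until the first calls `unlock`. A domain in the large-host set accepts a second task, so many unrelated sites sharing that domain do not queue behind one another.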