mirror of https://github.com/MarginaliaSearch/MarginaliaSearch.git
synced 2025-02-24 13:19:02 +00:00
We should not wait until we've fetched robots.txt to decide whether we have any data to fetch! This makes the live crawler very slow and leads to unnecessary requests.

LiveCrawlDataSetTest.java
SimpleLinkScraperTest.java
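
The ordering described in the commit message can be illustrated with a minimal sketch. This is not the project's actual code: the class `FetchPlanSketch`, the method `planFetches`, and its parameters are hypothetical names chosen for illustration. It assumes the "do we already have data?" check is a purely local lookup and that fetching robots.txt is the expensive network step, so the local check runs first and robots.txt is only requested when something is left to fetch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;
import java.util.function.Supplier;

// Hypothetical illustration only; names are assumptions, not the project's API.
class FetchPlanSketch {

    /**
     * Returns the URLs that should actually be fetched.
     *
     * @param candidateUrls      URLs discovered for one domain
     * @param alreadyHaveData    local check: true if the URL's data is already stored
     * @param robotsRulesFetcher expensive step: fetches robots.txt and returns
     *                           a predicate that is true for allowed URLs
     */
    static List<String> planFetches(List<String> candidateUrls,
                                    Predicate<String> alreadyHaveData,
                                    Supplier<Predicate<String>> robotsRulesFetcher) {
        // Step 1: filter locally, without touching the network.
        List<String> needed = new ArrayList<>();
        for (String url : candidateUrls) {
            if (!alreadyHaveData.test(url)) {
                needed.add(url);
            }
        }

        // Step 2: nothing left to fetch, so robots.txt is never requested.
        if (needed.isEmpty()) {
            return List.of();
        }

        // Step 3: only now pay for the robots.txt request, once per domain.
        Predicate<String> allowedByRobots = robotsRulesFetcher.get();
        List<String> toFetch = new ArrayList<>();
        for (String url : needed) {
            if (allowedByRobots.test(url)) {
                toFetch.add(url);
            }
        }
        return toFetch;
    }
}
```

Passing the robots.txt step as a `Supplier` keeps it lazy: when every candidate URL is already stored, the supplier is never invoked and no request is made.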