mirror of
https://github.com/MarginaliaSearch/MarginaliaSearch.git
synced 2025-02-23 21:18:58 +00:00
![]() Added an additional filter step to ensure URLs with binary suffixes are excluded during crawling. This prevents unnecessary processing of non-HTML content, improving the efficiency of the link parsing process. |
||
---|---|---|
.. | ||
java/nu/marginalia/link_parser | ||
build.gradle | ||
readme.md |
Link Parser
Deals with the various cases in link parsing, such as relative links, internal links, external links, pathological links, etc.