MarginaliaSearch/code/processes/converting-process/test-resources/html
2024-07-30 10:14:00 +02:00
..
summarization (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
work-set (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
monadnock.html (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
readme.md (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00
theregister.html (restructure) Clean up repo by moving stray features into converter-process and crawler-process 2024-07-30 10:14:00 +02:00

HTML samples

This directory and its subdirectories contains samples from real websites, used for testing language and HTML processing, including a wide span of edge cases.

Do not redistribute, base works on these files, or use for any other purpose than testing.