MarginaliaSearch/code/processes/converting-process/java/nu/marginalia/converting/model
Viktor Lofgren 8b8bf0748f (feature-extraction) Add new DocumentHeaders class encapsulating Html headers.
Also adds a few new html features for CDNs and S3 hosting for use in ranking and query refinement.
2024-11-11 13:26:15 +01:00
CrawlPlan.java (crawler/converter) Remove legacy junk from parquet migration 2024-04-22 12:34:28 +02:00
DisqualifiedException.java (wip) Extract and encode spans data 2024-07-27 11:44:13 +02:00
DocumentHeaders.java (feature-extraction) Add new DocumentHeaders class encapsulating Html headers. 2024-11-11 13:26:15 +01:00
GeneratorType.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
ProcessedDocument.java (btree) Clean up code 2024-05-18 18:03:17 +02:00
ProcessedDocumentDetails.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
ProcessedDomain.java (refac) Remove src/main from all source code paths. 2024-02-23 16:13:40 +01:00
WorkDir.java (crawler/converter) Remove legacy junk from parquet migration 2024-04-22 12:34:28 +02:00