MarginaliaSearch/code/libraries/term-frequency-dict/test/nu/marginalia/segmentation
Viktor Lofgren 6efc0f21fe (index) Clean up data model
The change set cleans up the data model for the term-level data.  This used to contain a bunch of fields with document-level metadata.  This data-duplication means a larger memory footprint and worse memory locality.

The ranking code is also modified to not accept SearchResultKeywordScores, but rather CompiledQueryLong and CqDataInts containing only the term metadata and the frequency information needed for ranking.  This is again an effort to improve memory locality.
2024-04-24 14:44:39 +02:00
..
HasherGroupTest.java (convert) Initial integration of segmentation data into the converter's keyword extraction logic 2024-04-24 14:44:17 +02:00
NgramLexiconTest.java (index) Clean up data model 2024-04-24 14:44:39 +02:00