Viktor Lofgren | 336d6fdd14 | (index-client) Fix error when zero results are found | 2024-09-25 20:23:13 +02:00
Viktor Lofgren | 73f973cc06 | (search-query) Add pagination to search query API and the direct query-service interface | 2024-09-25 14:20:59 +02:00
Viktor Lofgren | 3dec4b6b34 | (index) Fix bug where tcfFirstPosition lit up because one term was in the title and the other was missing from the document. This was because firstPosition calculation was not invalidated when positions were missing. | 2024-09-24 13:33:37 +02:00
Viktor Lofgren | 9c292a4f62 | (doc) Fix outdated links in documentation | 2024-09-22 13:56:17 +02:00
Viktor Lofgren | 8e78286068 | Merge branch 'master' into term-positions | 2024-09-17 15:20:46 +02:00
Viktor Lofgren | f4eeef145e | (index) Reduce fetch size to improve timeout characteristics | 2024-09-17 15:20:41 +02:00
Viktor Lofgren | 87aa869338 | (index) Correct positions mask to take into account offsets when overlapping | 2024-09-17 14:40:37 +02:00
Viktor Lofgren | a74df7f905 | (index) Increase buffer size for PrioDocIdsTransformer | 2024-09-17 13:52:52 +02:00
Viktor Lofgren | b95646625f | (index) Correct prio index construction with mmap. Accidentally snuck in behavior from full index | 2024-09-17 13:39:08 +02:00
Viktor Lofgren | 6e47eae903 | (index) Correct strange close handling of PositionsFileConstructor | 2024-09-13 16:34:14 +02:00
Viktor Lofgren | 934af0dd4b | (index) Correct units in log message when shrinking the documents file | 2024-09-13 16:33:19 +02:00
Viktor Lofgren | a8bec13ed9 | (index) Evaluate using mmap reads during index construction in favor of filechannel reads. It's likely that this will be faster, as the reads are on average small and sequential, and can't be buffered easily. | 2024-09-13 16:14:56 +02:00
Viktor Lofgren | 8047e77757 | (doc) Correct dead links and stale information in the docs | 2024-09-13 11:01:05 +02:00
Viktor Lofgren | 50ec922c2b | (index) Fix broken index tests. Also cleaned up the tests to be less fragile to ranking algorithm changes. | 2024-09-10 10:23:46 +02:00
Viktor Lofgren | cfbbeaa26e | (ranking) Clean up ranking test code | 2024-09-08 15:46:51 +02:00
Viktor Lofgren | bb5d946c26 | (index, EXPERIMENTAL) Clean up ranking code | 2024-08-29 11:34:23 +02:00
Viktor Lofgren | abab5bdc8a | (index, EXPERIMENTAL) Evaluate using Varint instead of GCS for position data | 2024-08-26 14:20:39 +02:00
Viktor Lofgren | 30bf845c81 | (index) Speed up minDist calculations by excluding large lists | 2024-08-26 13:04:15 +02:00
Viktor Lofgren | 67a98fb0b0 | (coded-sequence) Handle weird legacy HTML that puts everything in a heading | 2024-08-26 12:49:15 +02:00
Viktor Lofgren | f3182a9264 | (coded-sequence) Evaluate new minDist implementation | 2024-08-26 12:02:37 +02:00
Viktor Lofgren | fdf05cedae | (index) Optimize DocumentSpan.countIntersections | 2024-08-25 14:12:30 +02:00
Viktor Lofgren | 9c5f463775 | (index) Optimize DocumentSpan.countIntersections | 2024-08-25 13:59:11 +02:00
Viktor Lofgren | 893fae6d59 | (index) Optimize DocumentSpan.countIntersections | 2024-08-25 13:51:43 +02:00
Viktor Lofgren | 5660f291af | (index) Optimize DocumentSpan.countIntersections | 2024-08-25 13:43:29 +02:00
Viktor Lofgren | efd56efc63 | (index) Optimize SequenceOperations.minDistance | 2024-08-25 13:28:06 +02:00
Viktor Lofgren | d94373f4b1 | (index) Optimize calculatePositionsMask | 2024-08-25 13:24:37 +02:00
Viktor Lofgren | a5585110a6 | (index) Optimize SequenceOperations | 2024-08-25 13:16:31 +02:00
Viktor Lofgren | 965c89798e | (index) Optimize DocumentSpan | 2024-08-25 12:44:33 +02:00
Viktor Lofgren | 982b03382b | (index) Optimize DocumentSpan | 2024-08-25 12:31:15 +02:00
Viktor Lofgren | 24b805472a | (index) Evaluate performance implication of decoding gcs early | 2024-08-25 12:23:09 +02:00
Viktor Lofgren | 6ce029b317 | (index) Remove vestigial parameter | 2024-08-25 12:14:12 +02:00
Viktor Lofgren | 63e5b0ab18 | (index) Correct weightedCounts calculations | 2024-08-25 12:06:56 +02:00
Viktor Lofgren | 3fb3c0b92e | (index) Optimize ranking calculations | 2024-08-25 11:56:11 +02:00
Viktor Lofgren | aa2c960b74 | (index) Optimize ranking calculations | 2024-08-25 11:53:44 +02:00
Viktor Lofgren | 9aa8f13731 | (index) Remove tcfAvgDist ranking parameter. This is captured by tcfProximity already | 2024-08-25 11:20:19 +02:00
Viktor Lofgren | 65bee366dc | (index) Try harmonic mean for avgMinDist | 2024-08-25 11:11:52 +02:00
Viktor Lofgren | 53700e6667 | (index) Try harmonic mean for avgMinDist | 2024-08-25 11:08:41 +02:00
Viktor Lofgren | 7f498e10b7 | (index) Adjust proximity score | 2024-08-25 11:01:35 +02:00
Viktor Lofgren | 6eb0f13411 | (index) Adjust handling of full phrase matches to prioritize full query matches over large partial matches | 2024-08-25 10:54:04 +02:00
Viktor Lofgren | 773377fe84 | (index) Correct handling of full phrase match group | 2024-08-25 10:48:34 +02:00
Viktor Lofgren | 4372c8c835 | (index) Give ranking components more consistent names | 2024-08-25 10:44:27 +02:00
Viktor Lofgren | 099133bdbc | (index) Fix verbatim match score after moving full phrase group to a separate entity | 2024-08-25 10:43:35 +02:00
Viktor Lofgren | b09e2dbeb7 | (build) Fix dependency churn from testcontainers. Apparently you need to pull in commons-codec now in order to run testcontainers, through spooky action at a distance. | 2024-08-25 10:35:48 +02:00
Viktor Lofgren | 96bcf03ad5 | (index) Address broken tests. They are still broken, but less so. | 2024-08-25 10:34:36 +02:00
Viktor Lofgren | 0999f07320 | (search-query) Add new ranking parameters for proximity and verbatim matches | 2024-08-25 10:34:12 +02:00
Viktor Lofgren | 9eb1f120fc | (index) Repair positions bitmask for search result presentation | 2024-08-22 11:28:23 +02:00
Viktor Lofgren | b0a874a842 | (*) Upgrade slop library -> 0.0.5 | 2024-08-18 11:05:27 +02:00
Viktor Lofgren | 93652e0937 | (qdebug) Accurately display positions when intersecting with spans | 2024-08-15 11:55:48 +02:00
Viktor Lofgren | 0a383a712d | (qdebug) Accurately display positions when intersecting with spans | 2024-08-15 11:44:17 +02:00
Viktor Lofgren | 03d5dec24c | (*) Refactor termCoherences and rename them to phrase constraints. | 2024-08-15 11:02:19 +02:00