Commit Graph

  • 73f973cc06 (search-query) Add pagination to search query API and the direct query-service interface Viktor Lofgren 2024-09-25 12:56:38 +0200
  • e9e8580913 (converter) Fix NPE bugs in converter due to the reintroduction of CrawledDocument.headers Viktor Lofgren 2024-09-25 12:10:26 +0200
  • 8b85a58fea (search UX) Autofocus on the search form Viktor Lofgren 2024-09-24 15:56:03 +0200
  • 40512511af (crawler) Refactor boundary between CrawlerRetreiver and HttpFetcherImpl Viktor Lofgren 2024-09-24 15:08:15 +0200
  • 10d8fc4fe7 Update ROADMAP.md Viktor 2024-09-24 14:57:30 +0200
  • 9899d45ea8 Merge pull request #118 from MarginaliaSearch/vlofgren-patch-1 Viktor 2024-09-24 14:13:47 +0200
  • 3eea471ca6 Update ROADMAP.md Viktor 2024-09-24 14:13:32 +0200
  • 3dec4b6b34 (index) Fix bug where tcfFirstPosition lit up because one term was in the title and the other was missing from the document Viktor Lofgren 2024-09-24 13:33:37 +0200
  • 162fc25ebc (minor) Fix accidental commit errors Viktor Lofgren 2024-09-23 18:03:09 +0200
  • e9854f194c (crawler) Refactor Viktor Lofgren 2024-09-23 17:51:07 +0200
  • 9c292a4f62 (doc) Fix outdated links in documentation Viktor Lofgren 2024-09-22 13:56:17 +0200
  • edb42836da (vcs) Fix shared state issues with VarintCodedSequence's iterators. Viktor Lofgren 2024-09-21 16:09:15 +0200
  • 1ff88ff0bc (vcs) Stopgap fix for quoted queries with the same term appearing multiple times Viktor Lofgren 2024-09-21 14:07:59 +0200
  • 28e7c8e5e0 Increase temporal bias weight to give the recent results filter a bit more recency Viktor Lofgren 2024-09-17 18:11:40 +0200
  • 463b3ed0ce Merge pull request #99 from MarginaliaSearch/term-positions Viktor 2024-09-17 15:30:04 +0200
  • 8e78286068 Merge branch 'master' into term-positions Viktor Lofgren 2024-09-17 15:20:46 +0200
  • f4eeef145e (index) Reduce fetch size to improve timeout characteristics Viktor Lofgren 2024-09-17 15:20:41 +0200
  • 87aa869338 (index) Correct positions mask to take into account offsets when overlapping Viktor Lofgren 2024-09-17 14:40:37 +0200
  • 60ad4786bc (index) Use MemorySegment.copy for LongArray->LongArray transfers Viktor Lofgren 2024-09-17 13:56:31 +0200
  • a74df7f905 (index) Increase buffer size for PrioDocIdsTransformer Viktor Lofgren 2024-09-17 13:52:52 +0200
  • 9f9c6736ab (index) Use MemorySegment.copy for LongArray->LongArray transfers Viktor Lofgren 2024-09-17 13:49:02 +0200
  • b95646625f (index) Correct prio index construction with mmap Viktor Lofgren 2024-09-17 13:39:08 +0200
  • 6e47eae903 (index) Correct strange close handling of PositionsFileConstructor Viktor Lofgren 2024-09-13 16:34:14 +0200
  • 934af0dd4b (index) Correct units in log message when shrinking the documents file Viktor Lofgren 2024-09-13 16:33:19 +0200
  • a8bec13ed9 (index) Evaluate using mmap reads during index construction in favor of filechannel reads Viktor Lofgren 2024-09-13 16:14:56 +0200
  • 1cf62f5850 (doc) Correct dead links and stale information in the docs Viktor Lofgren 2024-09-13 11:02:13 +0200
  • 8047e77757 (doc) Correct dead links and stale information in the docs Viktor Lofgren 2024-09-13 11:01:05 +0200
  • 2a92de29ce (loader) Fix it so that the loader doesn't explode if it sees an invalid URL Viktor Lofgren 2024-09-12 11:36:00 +0200
  • 99523ca079 (query-parser) Remove test that is no longer relevant Viktor Lofgren 2024-09-10 10:35:56 +0200
  • 35f49bbb60 (coded-sequence) Add equals and hashCode to VCS Viktor Lofgren 2024-09-10 10:33:56 +0200
  • 50ec922c2b (index) Fix broken index tests Viktor Lofgren 2024-09-10 10:18:08 +0200
  • cfbbeaa26e (ranking) Clean up ranking test code Viktor Lofgren 2024-09-08 15:46:51 +0200
  • a3b0189934 Fix build errors after merge Viktor Lofgren 2024-09-08 10:22:32 +0200
  • 8f367d96f8 Merge branch 'master' into term-positions Viktor Lofgren 2024-09-08 10:12:53 +0200
  • f78ef36cd4 (slop) Upgrade to 0.0.8, add encodings to string columns. Viktor Lofgren 2024-09-04 15:19:00 +0200
  • dc67c81f99 (summary) Fix a few cases where noscript tags would sometimes be used for document summary Viktor Lofgren 2024-09-04 15:00:40 +0200
  • 50ba8fd099 (query-parsing) Correct handling of trailing parentheses Viktor Lofgren 2024-09-03 11:45:14 +0200
  • 99b3b00b68 (query-parsing) Merge QueryTokenizer into QueryParser and add escaping of query grammar Viktor Lofgren 2024-09-03 11:35:32 +0200
  • f6d981761d (query-parsing) Drop search term elements that aren't indexed by the search engine Viktor Lofgren 2024-09-03 11:24:05 +0200
  • 8290c19e24 (query-parsing) Drop search term elements that aren't indexed by the search engine Viktor Lofgren 2024-09-03 11:21:01 +0200
  • 7a69dff6cf (search) Correct handling of languages on fandom Viktor Lofgren 2024-09-01 13:46:01 +0200
  • bfb7ed2c99 (search) Translate cursed medium URLs to scribe.rip links via the search application Viktor Lofgren 2024-09-01 13:32:14 +0200
  • e19dc9b13e (search) Translate cursed fandom URLs to breezewiki links via the search application Viktor Lofgren 2024-09-01 13:23:35 +0200
  • 74148c790e (crawler) Pull additional new domains from node-affinity 0 Viktor Lofgren 2024-09-01 13:00:36 +0200
  • 3d77456110 (*) Add domain parking service to ip blocklist Viktor Lofgren 2024-09-01 12:53:22 +0200
  • ab6a4b1749 (control) Correct id value for domain addition tool Viktor Lofgren 2024-09-01 12:25:15 +0200
  • aeeb1d0cb7 (control) Add utility for adding domains from an external URL Viktor Lofgren 2024-09-01 12:14:21 +0200
  • 185b79f2a5 (converter) Fix bug where sideloaded reddit content was erroneously categorized as wiki-generated. Viktor Lofgren 2024-09-01 11:30:25 +0200
  • 8d0f9652c7 (crawler) Correct RSS-sitemap behavior Viktor Lofgren 2024-08-31 11:38:34 +0200
  • 5353805cc6 (crawler) Correct RSS-sitemap behavior Viktor Lofgren 2024-08-31 11:37:09 +0200
  • 5407da5650 (crawler) Grab favicons as part of root sniff Viktor Lofgren 2024-08-31 11:32:56 +0200
  • b1bfe6f76e (control) New view for domains Viktor Lofgren 2024-08-30 17:06:48 +0200
  • 74e25370ca (control) New view for domains Viktor Lofgren 2024-08-29 15:40:40 +0200
  • bb5d946c26 (index, EXPERIMENTAL) Clean up ranking code Viktor Lofgren 2024-08-29 11:34:23 +0200
  • abab5bdc8a (index, EXPERIMENTAL) Evaluate using Varint instead of GCS for position data Viktor Lofgren 2024-08-26 14:20:39 +0200
  • 30bf845c81 (index) Speed up minDist calculations by excluding large lists Viktor Lofgren 2024-08-26 13:04:15 +0200
  • 77efce0673 (paper-doll) Fix compilation Viktor Lofgren 2024-08-26 12:51:29 +0200
  • 67a98fb0b0 (coded-sequence) Handle weird legacy HTML that puts everything in a heading Viktor Lofgren 2024-08-26 12:49:15 +0200
  • 7d471ec30d (coded-sequence) Evaluate new minDist implementation Viktor Lofgren 2024-08-26 12:45:11 +0200
  • f3182a9264 (coded-sequence) Evaluate new minDist implementation Viktor Lofgren 2024-08-26 12:02:37 +0200
  • 805cb5ad58 (coded-sequence) Correct behavior of findIntersections Viktor Lofgren 2024-08-25 14:54:17 +0200
  • fdf05cedae (index) Optimize DocumentSpan.countIntersections Viktor Lofgren 2024-08-25 14:12:30 +0200
  • 9c5f463775 (index) Optimize DocumentSpan.countIntersections Viktor Lofgren 2024-08-25 13:59:11 +0200
  • 893fae6d59 (index) Optimize DocumentSpan.countIntersections Viktor Lofgren 2024-08-25 13:51:43 +0200
  • 5660f291af (index) Optimize DocumentSpan.countIntersections Viktor Lofgren 2024-08-25 13:43:29 +0200
  • efd56efc63 (index) Optimize SequenceOperations.minDistance Viktor Lofgren 2024-08-25 13:28:06 +0200
  • d94373f4b1 (index) Optimize calculatePositionsMask Viktor Lofgren 2024-08-25 13:24:37 +0200
  • 0d01a48260 (index) Optimize SequenceOperations Viktor Lofgren 2024-08-25 13:19:37 +0200
  • 00ab2684fa (index) Optimize SequenceOperations Viktor Lofgren 2024-08-25 13:17:38 +0200
  • a5585110a6 (index) Optimize SequenceOperations Viktor Lofgren 2024-08-25 13:16:31 +0200
  • 965c89798e (index) Optimize DocumentSpan Viktor Lofgren 2024-08-25 12:44:33 +0200
  • 982b03382b (index) Optimize DocumentSpan Viktor Lofgren 2024-08-25 12:31:15 +0200
  • 24b805472a (index) Evaluate performance implication of decoding gcs early Viktor Lofgren 2024-08-25 12:23:09 +0200
  • 6ce029b317 (index) Remove vestigial parameter Viktor Lofgren 2024-08-25 12:14:12 +0200
  • 63e5b0ab18 (index) Correct weightedCounts calculations Viktor Lofgren 2024-08-25 12:06:56 +0200
  • 6dda2c2d83 (coded-sequence) Reduce allocations in GCS.values() Viktor Lofgren 2024-08-25 12:06:31 +0200
  • 3fb3c0b92e (index) Optimize ranking calculations Viktor Lofgren 2024-08-25 11:56:11 +0200
  • aa2c960b74 (index) Optimize ranking calculations Viktor Lofgren 2024-08-25 11:53:44 +0200
  • 4fbcc02f96 (index) Adjust sensible defaults for ranking parameters Viktor Lofgren 2024-08-25 11:24:16 +0200
  • 9aa8f13731 (index) Remove tcfAvgDist ranking parameter Viktor Lofgren 2024-08-25 11:20:19 +0200
  • 65bee366dc (index) Try harmonic mean for avgMinDist Viktor Lofgren 2024-08-25 11:11:52 +0200
  • 53700e6667 (index) Try harmonic mean for avgMinDist Viktor Lofgren 2024-08-25 11:08:41 +0200
  • 7f498e10b7 (index) Adjust proximity score Viktor Lofgren 2024-08-25 11:01:35 +0200
  • 6eb0f13411 (index) Adjust handling of full phrase matches to prioritize full query matches over large partial matches Viktor Lofgren 2024-08-25 10:54:04 +0200
  • 773377fe84 (index) Correct handling of full phrase match group Viktor Lofgren 2024-08-25 10:48:34 +0200
  • 4372c8c835 (index) Give ranking components more consistent names Viktor Lofgren 2024-08-25 10:43:54 +0200
  • 099133bdbc (index) Fix verbatim match score after moving full phrase group to a separate entity Viktor Lofgren 2024-08-25 10:43:35 +0200
  • b09e2dbeb7 (build) Fix dependency churn from testcontainers Viktor Lofgren 2024-08-25 10:35:48 +0200
  • 96bcf03ad5 (index) Address broken tests Viktor Lofgren 2024-08-25 10:34:36 +0200
  • 0999f07320 (search-query) Add new ranking parameters for proximity and verbatim matches Viktor Lofgren 2024-08-25 10:34:12 +0200
  • 5d2b455572 (search) Clean up inconsistent usage of MathClient in SearchOperator Viktor Lofgren 2024-08-24 10:19:49 +0200
  • 1c4dff3899 (*) Experimental migration from Spark to Jooby Viktor Lofgren 2024-08-23 12:00:45 +0200
  • ea75ddc0e0 (search) Absorb SearchQueryIndexService into SearchOperator, and clean up SearchOperator Viktor Lofgren 2024-08-22 11:50:52 +0200
  • 2db0e446cb (search) Absorb SearchQueryIndexService into SearchOperator, and clean up SearchOperator Viktor Lofgren 2024-08-22 11:49:29 +0200
  • 557bdaa694 (search) Clean up SearchQueryIndexService and surrounding code Viktor Lofgren 2024-08-22 11:44:57 +0200
  • 9eb1f120fc (index) Repair positions bitmask for search result presentation Viktor Lofgren 2024-08-22 11:28:23 +0200
  • 266d6e4bea (slop) Replace SlopPageRef<T> with SlopTable.Ref<T> Viktor Lofgren 2024-08-21 10:13:49 +0200
  • e4c97a91d8 (*) Comment clarity Viktor Lofgren 2024-08-21 10:12:00 +0200
  • b0a874a842 (*) Upgrade slop library -> 0.0.5 Viktor Lofgren 2024-08-18 11:05:27 +0200
  • bca40de107 (*) Upgrade slop library Viktor Lofgren 2024-08-18 10:43:41 +0200