Commit Graph

  • f89274d1ea (minor) Fix broken test Viktor Lofgren 2024-02-06 12:12:26 +0100
  • 7286596fb4 (deps) Remove monkey patched GSON Viktor Lofgren 2024-02-06 12:11:39 +0100
  • a2fc83d94e (control) Add configurable border styling Viktor Lofgren 2024-02-06 12:05:02 +0100
  • 2161799cc3 (sideload) Fix filename error in dealing with stackoverflow files Viktor Lofgren 2024-02-06 11:18:00 +0100
  • c88f132057 (sideload) Fix filename error in dealing with stackoverflow files Viktor Lofgren 2024-02-06 11:10:03 +0100
  • c6313a5906 (sideload) Fix filename error in dealing with stackoverflow files Viktor Lofgren 2024-02-06 11:06:36 +0100
  • eadcdb5bed (minor) Improve error handling, naming logging in IndexResultDecorator Viktor Lofgren 2024-02-05 21:05:44 +0100
  • 6e7649b5f7 (loader) Mitigate fragile paging behavior Viktor Lofgren 2024-02-05 21:05:03 +0100
  • d986f90074 (index) Fix consistency between RandomFileAssembler implementations Viktor Lofgren 2024-02-05 21:01:32 +0100
  • 53c575db3f (index-construction) Make random-write file strategy configurable Viktor Lofgren 2024-02-05 12:31:15 +0100
  • 6dcc20038c (index-journal) Make index journal page size configurable Viktor Lofgren 2024-02-05 11:26:05 +0100
  • 885cd00aee Added implementation in wmsa home / setup.sh to grab suffix list. howdycat 2024-02-04 14:38:17 -0500
  • fa145f632b (sideload) Add special handling for sideloaded wiki documents Viktor Lofgren 2024-02-02 21:22:07 +0100
  • 785d8deadd (crawler) Improve meta-tag redirect handling, add tests for redirects. Viktor Lofgren 2024-02-01 20:30:43 +0100
  • 93a2d5afbf (*) Fix poorly named test Viktor Lofgren 2024-02-01 20:08:15 +0100
  • d60c6b18d4 (doc) Update the readme's the crawler, as they've grown stale. Viktor Lofgren 2024-02-01 18:10:55 +0100
  • d1e02569f4 (language-processing) Add a system property for configuring which language detection model to use Viktor Lofgren 2024-01-31 13:02:33 +0100
  • 9ce67029ca (language-processing) Add a system property for configuring which language detection model to use Viktor Lofgren 2024-01-31 13:02:16 +0100
  • 98f3382cea (minor) Fix test and improve error message Viktor Lofgren 2024-01-31 11:53:41 +0100
  • 52a0255814 (*) Add flag for disabling ASCII flattening Viktor Lofgren 2024-01-31 11:50:59 +0100
  • eb59ac8535 (index-ranking) Adjust the BM25P factors a bit Viktor Lofgren 2024-01-30 21:27:29 +0100
  • acc2b4e10f (*) Update the readme with a link to the demo video Viktor Lofgren 2024-01-26 13:49:41 +0100
  • 6f830f0e08 (*) Update the readme with a link to the demo video Viktor Lofgren 2024-01-26 13:48:47 +0100
  • 6edc318597 (control) Fix typo in URL linking to new-crawl-specs v24.01.0 Viktor Lofgren 2024-01-26 10:43:10 +0100
  • 182c0cf28e (control) Add warnings about domain data contamination Viktor Lofgren 2024-01-25 18:26:15 +0100
  • 0b105b5986 (converter) Update hyperlink text for new crawl spec creation. Viktor Lofgren 2024-01-25 18:05:11 +0100
  • e91d5dc339 Added getTld method howdycat 2024-01-25 11:36:04 -0500
  • 081c7d22bc Fix typo in install.sh Viktor Lofgren 2024-01-25 17:08:18 +0100
  • 6aee896657 (*) Add single-node barebones configuration Viktor Lofgren 2024-01-25 16:40:28 +0100
  • cae1bad274 (*) Add download-sample action, refactor file storage Viktor Lofgren 2024-01-25 13:36:30 +0100
  • 1b8b97b8ec (sample-exporter) Add some limits on sizes and lengths Viktor Lofgren 2024-01-25 11:51:53 +0100
  • 0846606b12 (doc) Add ide quick-start guide Viktor Lofgren 2024-01-24 14:39:33 +0100
  • 245ebcdfc6 (doc) Add ide quick-start guide Viktor Lofgren 2024-01-24 14:37:58 +0100
  • 1b1e711c93 (doc) Add ide quick-start guide Viktor Lofgren 2024-01-24 14:36:44 +0100
  • c088c25b09 (*) Fix broken test, clean up code Viktor Lofgren 2024-01-24 12:50:41 +0100
  • 958d64720e (control) Add a view for restarting aborted processes Viktor Lofgren 2024-01-24 12:47:10 +0100
  • 2f648d2bb7 initial tld parser howdycat 2024-01-23 21:21:07 -0500
  • 805afad4fe (control) New GUI for exporting crawl data samples Viktor Lofgren 2024-01-23 17:07:45 +0100
  • 400f4840ad (*) Fix broken code in jmh Viktor Lofgren 2024-01-23 17:07:57 +0100
  • ee7792596d (*) Fix broken test Viktor Lofgren 2024-01-23 12:03:47 +0100
  • 0081328aca (converter) Adjust which flags are set by anchor text keywords Viktor Lofgren 2024-01-23 11:54:00 +0100
  • 3fff7f6878 (converter) Fix issue where quality limits were no longer enforced Viktor Lofgren 2024-01-23 11:42:17 +0100
  • f15dd06473 (index) Delayed close() of SearchIndexReader Viktor Lofgren 2024-01-23 11:08:41 +0100
  • dd26819d66 (actor) Try to rare data race where a finished job is considered dead. Viktor Lofgren 2024-01-22 21:22:38 +0100
  • 562012fb22 (doc) Migrate documentation https://docs.marginalia.nu/ Viktor Lofgren 2024-01-22 19:40:08 +0100
  • a6d257df5b (converter) Update Stackexchange sideload instruction Viktor Lofgren 2024-01-22 18:29:20 +0100
  • 41d896ba3e (converter) Refactor content type check in PlainTextDocumentProcessorPlugin Viktor Lofgren 2024-01-22 17:52:14 +0100
  • 51cdf46645 (control) Improve accessibility in search-to-ban template Viktor Lofgren 2024-01-22 15:01:00 +0100
  • 1eb0adf6d3 (array) Add sun.misc.Unsafe variant of LongArray Viktor Lofgren 2024-01-22 13:38:42 +0100
  • 40c9d2050f (control) Fully automatic conversion Viktor Lofgren 2024-01-22 13:01:09 +0100
  • 3a325845c7 (mq) Add better error handling in fsm and mq Viktor Lofgren 2024-01-22 12:58:33 +0100
  • 6a1bfd6270 (array) Remove unused 'madvise' code and 3rd party dependency on 'uppend' Viktor Lofgren 2024-01-22 12:56:45 +0100
  • b91ea1d7ca (control) Re-add gui for sideloading dirtrees Viktor Lofgren 2024-01-20 18:09:40 +0100
  • c5760cd535 (test) Fix broken test Viktor Lofgren 2024-01-20 13:39:40 +0100
  • 91c7960800 (crawler) Extract additional configuration properties Viktor Lofgren 2024-01-20 10:36:04 +0100
  • 2079a5574b (control) Update heading in restore backup template Viktor Lofgren 2024-01-19 21:46:53 +0100
  • 27ffb8fa8a (converter) Integrate zim->db conversion into automatic encyclopedia processing workflow Viktor Lofgren 2024-01-19 13:59:03 +0100
  • 22c8fb3f59 (crawler) Fix a bug where reference copies of crawl data was written without etag and last-modified Viktor Lofgren 2024-01-18 16:02:27 +0100
  • 964419803a Fix broken test Viktor Lofgren 2024-01-18 15:42:01 +0100
  • 6271d5d544 (mq) Add relation tracking between MQ messages for easier tracking and debugging. Viktor Lofgren 2024-01-18 15:08:27 +0100
  • 175bd310f5 (control) Message queue UX improvements Viktor Lofgren 2024-01-18 13:05:50 +0100
  • 67ee6f4126 (control) Clean up filtering UX in Events table Viktor Lofgren 2024-01-18 12:35:39 +0100
  • 01b312f14c (*) Make new index nodes accept queries by default Viktor Lofgren 2024-01-18 12:05:37 +0100
  • 18638c62de (control) Rephrase text Viktor Lofgren 2024-01-18 11:53:10 +0100
  • 753d000788 (control) Add toggle for automatic loading of processed data Viktor Lofgren 2024-01-18 11:51:31 +0100
  • 19e781b104 (control) Add basic input validation to node actions Viktor Lofgren 2024-01-18 11:30:17 +0100
  • aa2df327db (index) Prevent index from attempting to answer queries when no index data is loaded Viktor Lofgren 2024-01-17 21:14:57 +0100
  • 321fa94b8f (crawler) Fix rare exception in content type handling due to improper length checking of a split() array Viktor Lofgren 2024-01-17 21:14:21 +0100
  • ca80957143
    Merge pull request #73 from MarginaliaSearch/configurable-search-sets Viktor 2024-01-17 21:12:20 +0100
  • 41cdb8f71b (control) Fix broken update button in the update-domain-ranking-set form Viktor Lofgren 2024-01-17 18:21:09 +0100
  • 304d4c9acf (control) Fix result ordering in the file storage listing view Viktor Lofgren 2024-01-17 10:56:16 +0100
  • 7fd4c092e3 (control) Clean up UX and accessibility for new domain ranking sets. Viktor Lofgren 2024-01-17 10:47:14 +0100
  • 2fe5705542 (control) GUI for ranking sets Viktor Lofgren 2024-01-16 17:10:09 +0100
  • e968365858 (index) Use new DomainRankingSets to configure ranking algos in index svc Viktor Lofgren 2024-01-16 12:42:51 +0100
  • 36ad4c7466 (db) Add a new configuration object 'domain ranking set' for storing ranking parameters Viktor Lofgren 2024-01-16 11:17:40 +0100
  • 5a62b3058f (query-api) Make the search set identifier a string value in the API Viktor Lofgren 2024-01-16 10:55:24 +0100
  • ec8fe9f031 (doc) Add screenshot to conversion step in crawling doc Viktor Lofgren 2024-01-15 16:31:33 +0100
  • a1df9e886a (control) Also clean up stale 'NEW' messages Viktor Lofgren 2024-01-15 16:14:02 +0100
  • ce5ae1931d (doc) Update Crawling Docs Viktor Lofgren 2024-01-15 16:08:01 +0100
  • b9445d4f62 (doc) Update Crawling Docs Viktor Lofgren 2024-01-15 16:06:59 +0100
  • fd1eec99b5 (cleanup) Fix broken tests Viktor Lofgren 2024-01-15 15:44:33 +0100
  • e162406d40 (control) New control-side actors for cleaning up stale service heartbeats and message queue entries Viktor Lofgren 2024-01-15 15:44:23 +0100
  • c41e68aaab (control) New export actions for RSS/Atom feeds and term frequency data Viktor Lofgren 2024-01-15 14:54:26 +0100
  • 4665af6c42 (control) Move export data endpoint to actions controller Viktor Lofgren 2024-01-15 11:06:22 +0100
  • c0b15427fe (control) New crawl view should use radio buttons as multiple specs aren't supported Viktor Lofgren 2024-01-15 11:03:47 +0100
  • f29a9d972d (control) Move 'new crawl spec' to /node/:id/actions, out of /node/:id/storage Viktor Lofgren 2024-01-15 11:02:00 +0100
  • b192373ae7 (control) Highlight unavailable items (creating, deleting) in node actions views Viktor Lofgren 2024-01-15 10:47:54 +0100
  • c042650382 (docs) Improve query service documentation Viktor Lofgren 2024-01-13 21:16:45 +0100
  • 07a916a720 (search) Give the swipe hint on mobile a nicer finish Viktor Lofgren 2024-01-13 18:51:54 +0100
  • 5134044530 (assistant) Make assistant client more robust to the service going down Viktor Lofgren 2024-01-13 18:29:30 +0100
  • 4c62065e74 (install) Add two separate templates for the install script Viktor Lofgren 2024-01-13 18:27:42 +0100
  • d28fc99119 (MainClass) ensure logging isn't loaded before service name is known Viktor Lofgren 2024-01-13 18:19:50 +0100
  • c9fb45c85f (search) Fix control.hideMarginaliaApp handling Viktor Lofgren 2024-01-13 17:24:15 +0100
  • 7c6e18f7a7 (*) Overhaul settings and properties Viktor Lofgren 2024-01-13 17:12:18 +0100
  • 176b9c9666 (convert) Add sizeHints to legacy serializable cawl data stream Viktor Lofgren 2024-01-13 15:50:36 +0100
  • ecd9c35233 (control) Clean up the event log Viktor Lofgren 2024-01-13 13:28:02 +0100
  • 71e32c57d9 (control) Add better timestamps for the events and message queue views Viktor Lofgren 2024-01-13 13:04:42 +0100
  • 2fefd0e4e3 (control) Add better timestamps for the events and message queue views Viktor Lofgren 2024-01-13 13:03:52 +0100
  • 81eaf79a25 (control) UX polish Viktor Lofgren 2024-01-13 12:31:13 +0100
  • 8dea7217a6 (control) UX fixes, node GUI doesn't break when an executor service goes offline. Viktor Lofgren 2024-01-13 12:17:30 +0100