MarginaliaSearch/code
Viktor Lofgren 1097fe6e25 Fix bugs related to search result selection in the case with multiple search terms.
* A deduplication filter step ran too early, and removed many good results on the basis that they partially, but did not fully fit another set of search terms.

* Altered the query creation process to prefer documents where multiple terms appear in the priority index.
2023-03-29 15:17:55 +02:00
..
api Permit search results that are all synthetic to pass relevancy check. 2023-03-27 17:27:35 +02:00
common Documentation for DB 2023-03-25 16:14:16 +01:00
features-convert Update features-convert/readme.md 2023-03-25 12:43:58 +01:00
features-crawl Yet more restructuring. Improved search result ranking. 2023-03-16 21:35:54 +01:00
features-index Fix bugs related to search result selection in the case with multiple search terms. 2023-03-29 15:17:55 +02:00
features-search Move database to a separate module 2023-03-25 15:26:17 +01:00
libraries Fix typeahead suggestions 2023-03-25 10:20:52 +01:00
process-models readme.md 2023-03-22 15:10:30 +01:00
processes Move database to a separate module 2023-03-25 15:26:17 +01:00
services-core Fix bugs related to search result selection in the case with multiple search terms. 2023-03-29 15:17:55 +02:00
services-satellite Move database to a separate module 2023-03-25 15:26:17 +01:00
tools Move database to a separate module 2023-03-25 15:26:17 +01:00
readme.md Fix broken diagram links after doc/ restructuring. 2023-03-25 16:32:10 +01:00

Code

This is a pretty large and diverse project with many moving parts.

You'll find a short description in each module of what it does and how it relates to other modules. The modules each have names like "library" or "process" or "feature". These have specific meanings. See doc/module-taxonomy.md.

Overview

A map of the most important components and how they relate can be found below.

image

Services

Processes

Processes are batch jobs that deal with data retrieval, processing and loading.

Tools

Features

Features are relatively stand-alone components that serve some part of the domain. They aren't domain-independent, but isolated.

Libraries and primitives

Libraries are stand-alone code that is independent of the domain logic.

  • common elements for creating a service, a client etc.
  • libraries containing non-search specific code.
    • array - large memory mapped area library
    • btree - static btree library