MarginaliaSearch/code
Viktor Lofgren e49b1dd155 Better handling of quote terms, fix bug in handling of longer queries.
... where some terms may previously have been ignored. The latter bug was due to the handling of QueryHeads with AnyOf-style predicates interacting poorly with alreadyConsideredTerms in SearchIndex.java
2023-04-10 13:20:40 +02:00
api Better handling of quote terms, fix bug in handling of longer queries. 2023-04-10 13:11:40 +02:00
common Increase search result relevance (#8) 2023-04-07 20:18:08 +02:00
features-convert Clean up artifact extractor. 2023-04-10 13:07:54 +02:00
features-crawl Yet more restructuring. Improved search result ranking. 2023-03-16 21:35:54 +01:00
features-index Better handling of quote terms, fix bug in handling of longer queries. 2023-04-10 13:20:40 +02:00
features-search Increase search result relevance (#8) 2023-04-07 20:18:08 +02:00
libraries Increase search result relevance (#8) 2023-04-07 20:18:08 +02:00
process-models Bugfix crawl plan, doesn't use rewrite() everywhere 2023-03-30 15:41:07 +02:00
processes Consider average sentence length when selecting search results. This promotes proses over code listings, tabular data, etc. 2023-03-30 15:46:15 +02:00
services-core Better handling of quote terms, fix bug in handling of longer queries. 2023-04-10 13:20:40 +02:00
services-satellite Move database to a separate module 2023-03-25 15:26:17 +01:00
tools Better handling of quote terms, fix bug in handling of longer queries. 2023-04-10 13:11:40 +02:00
readme.md Fix broken diagram links after doc/ restructuring. 2023-03-25 16:32:10 +01:00

Code

This is a pretty large and diverse project with many moving parts.

Each module contains a short description of what it does and how it relates to other modules. The modules have names such as "library", "process" or "feature". These terms have specific meanings, which are described in doc/module-taxonomy.md.

Overview

A map of the most important components and how they relate can be found below.

[Image: map of the most important components and how they relate]

Services

Processes

Processes are batch jobs that deal with data retrieval, processing and loading.

Tools

Features

Features are relatively stand-alone components that serve some part of the domain. They are not domain-independent, but they are isolated.

Libraries and primitives

Libraries are stand-alone code that is independent of the domain logic.

  • common elements for creating a service, a client etc.
  • libraries containing non-search specific code.
    • array - library for large memory-mapped areas
    • btree - static btree library
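
To give a rough idea of the kind of problem a memory-mapped array library deals with, here is a minimal sketch that uses only the JDK's `FileChannel.map`. It is not the API of the array or btree modules: the class `MappedLongArraySketch` and its methods `mapSorted` and `indexOf` are hypothetical names made up for this illustration, and the lookup is a plain binary search over sorted data rather than a btree.

```java
import java.io.IOException;
import java.nio.LongBuffer;
import java.nio.MappedByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical illustration, not code from this repository.
public class MappedLongArraySketch {

    /** Writes already-sorted values to a file through a memory mapping and returns a view of them. */
    public static LongBuffer mapSorted(Path file, long[] sortedValues) throws IOException {
        try (FileChannel channel = FileChannel.open(file,
                StandardOpenOption.CREATE,
                StandardOpenOption.READ,
                StandardOpenOption.WRITE)) {
            long sizeBytes = (long) sortedValues.length * Long.BYTES;
            MappedByteBuffer bytes = channel.map(FileChannel.MapMode.READ_WRITE, 0, sizeBytes);
            LongBuffer longs = bytes.asLongBuffer();
            longs.put(sortedValues);
            bytes.force(); // flush the mapped region to the file
            // The mapping stays valid after the channel is closed.
            return longs;
        }
    }

    /** Binary search using absolute reads; returns the index of key, or -1 if it is absent. */
    public static int indexOf(LongBuffer data, long key) {
        int lo = 0;
        int hi = data.limit() - 1;
        while (lo <= hi) {
            int mid = (lo + hi) >>> 1;
            long value = data.get(mid);
            if (value < key) {
                lo = mid + 1;
            } else if (value > key) {
                hi = mid - 1;
            } else {
                return mid;
            }
        }
        return -1;
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("mapped-array", ".dat");
        file.toFile().deleteOnExit();
        LongBuffer data = mapSorted(file, new long[] {3, 8, 21, 55, 144});
        System.out.println(indexOf(data, 21L)); // prints 2
        System.out.println(indexOf(data, 4L));  // prints -1
    }
}
```

A single `FileChannel.map` call cannot map more than `Integer.MAX_VALUE` bytes, so a sketch like this only covers small files. Handling larger areas means stitching several mappings together, which is presumably part of what a dedicated library is for.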