MarginaliaSearch/code
Viktor Lofgren 0152004c42 Initial Commit Anchor Tags
* Added new (optional) model file in $WMSA_HOME/data/atags.parquet
* Converter gets a component for creating a projection of its domains onto the full atags parquet file
* New WordFlag ExternalLink
* These terms are also for now flagged as title words
* Fixed a bug where Title words aliased with UrlDomain words
* Fixed a bug in the encyclopedia sideloader that gave everything too high topology ranking
2023-11-04 14:24:17 +01:00
api (control) GUI for atags export 2023-10-31 17:55:47 +01:00
common Initial Commit Anchor Tags 2023-11-04 14:24:17 +01:00
features-convert Initial Commit Anchor Tags 2023-11-04 14:24:17 +01:00
features-crawl (build) Move unit test configuration to root build.gradle 2023-10-04 12:46:22 +02:00
features-index Initial Commit Anchor Tags 2023-11-04 14:24:17 +01:00
features-qs (refactor) Remove features-search and update documentation 2023-10-09 15:12:30 +02:00
features-search (refactor) Remove features-search and update documentation 2023-10-09 15:12:30 +02:00
libraries (docs) Update documentation 2023-10-27 12:45:39 +02:00
process-models (converter) More robust filename resolution 2023-10-20 14:16:03 +02:00
processes Initial Commit Anchor Tags 2023-11-04 14:24:17 +01:00
services-application (control/search) Add ability to suggest removing a site from random exploration 2023-11-02 15:29:49 +01:00
services-core Initial Commit Anchor Tags 2023-11-04 14:24:17 +01:00
tools Initial Commit Anchor Tags 2023-11-04 14:24:17 +01:00
readme.md (docs) Update documentation 2023-10-27 12:45:39 +02:00

Code

This is a pretty large and diverse project with many moving parts.

Each module contains a short description of what it does and how it relates to other modules. Module names include terms like "library", "process" or "feature", and these terms have specific meanings. See doc/module-taxonomy.md.

Overview

A map of the most important components and how they relate can be found below.

[image: map of the most important components and how they relate]

Services

Processes

Processes are batch jobs that deal with data retrieval, processing and loading.

Tools

Features

Features are relatively stand-alone components that serve some part of the domain. They aren't domain-independent, but they are isolated.

Libraries and primitives

Libraries are stand-alone code that is independent of the domain logic.

  • common - elements for creating a service, a client, etc.
  • libraries - code that is not specific to search.
    • array - library for large memory-mapped areas
    • btree - static btree library
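
To give a rough idea of what a memory-mapped array means in practice, here is a minimal sketch that uses only the standard java.nio API. The class name MappedLongArray and its methods are hypothetical and made up for illustration; this is not the API of the array library, which may work quite differently.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.LongBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Hypothetical illustration only; not the project's array library. */
class MappedLongArray {
    private final LongBuffer buffer;

    private MappedLongArray(LongBuffer buffer) {
        this.buffer = buffer;
    }

    /** Maps `size` longs backed by `file`, creating or growing the file if needed. */
    static MappedLongArray map(Path file, int size) throws IOException {
        long bytes = (long) size * Long.BYTES;
        try (FileChannel channel = FileChannel.open(file,
                StandardOpenOption.CREATE,
                StandardOpenOption.READ,
                StandardOpenOption.WRITE)) {
            // Make sure the file is at least as long as the region we map.
            if (bytes > 0 && channel.size() < bytes) {
                channel.write(ByteBuffer.allocate(1), bytes - 1);
            }
            // A single mapping is limited to 2 GB; a genuinely large array
            // would have to stitch several mappings together.
            // The mapping stays valid after the channel is closed.
            return new MappedLongArray(
                    channel.map(FileChannel.MapMode.READ_WRITE, 0, bytes).asLongBuffer());
        }
    }

    long get(int index) {
        return buffer.get(index);
    }

    void set(int index, long value) {
        buffer.put(index, value);
    }

    int size() {
        return buffer.capacity();
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("mapped-array", ".dat");
        file.toFile().deleteOnExit();

        MappedLongArray array = MappedLongArray.map(file, 1024);
        array.set(10, 12345L);
        System.out.println(array.get(10));
    }
}
```

The data lives in the operating system's page cache rather than on the Java heap, which is why this style of storage tends to suit data sets that are larger than what is comfortable to keep in heap memory.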