mirror of
https://github.com/MarginaliaSearch/MarginaliaSearch.git
synced 2025-02-23 13:09:00 +00:00
(docs) Update the documentation up-to-date information
This commit is contained in:
parent
eaeb23d41e
commit
35996d0adb
11
code/common/linkdb/readme.md
Normal file
11
code/common/linkdb/readme.md
Normal file
@ -0,0 +1,11 @@
|
||||
The link database contains information about links,
|
||||
such as their ID, their URL, their title, their description,
|
||||
and so forth.
|
||||
|
||||
The link database is a sqlite file. The reason this information
|
||||
is not in the MariaDB database is that this would make updates to
|
||||
this information take effect in production immediately, even before
|
||||
the information was searchable.
|
||||
|
||||
It is constructed by the [loading-process](../../processes/loading-process), and consumed
|
||||
by the [search-service](../../services-core/search-service).
|
18
code/process-models/processed-data/readme.md
Normal file
18
code/process-models/processed-data/readme.md
Normal file
@ -0,0 +1,18 @@
|
||||
The processed-data package contains models and logic for
|
||||
reading and writing parquet files with the output from the
|
||||
[converting-process](../../processes/converting-process).
|
||||
|
||||
Main models:
|
||||
|
||||
* [DocumentRecord](src/main/java/nu/marginalia/model/processed/DocumentRecord.java)
|
||||
* * [DocumentRecordKeywordsProjection](src/main/java/nu/marginalia/model/processed/DocumentRecordKeywordsProjection.java)
|
||||
* * [DocumentRecordMetadataProjection](src/main/java/nu/marginalia/model/processed/DocumentRecordMetadataProjection.java)
|
||||
* [DomainLinkRecord](src/main/java/nu/marginalia/model/processed/DomainLinkRecord.java)
|
||||
* [DomainRecord](src/main/java/nu/marginalia/model/processed/DomainRecord.java)
|
||||
|
||||
Since parquet is a column based format, some of the readable models are projections
|
||||
that only read parts of the input file.
|
||||
|
||||
## See Also
|
||||
|
||||
[third-party/parquet-floor](../../../third-party/parquet-floor)
|
8
third-party/parquet-floor/readme.md
vendored
8
third-party/parquet-floor/readme.md
vendored
@ -6,3 +6,11 @@ Git: https://github.com/strategicblue/parquet-floor
|
||||
|
||||
It's basically an adaptor for Parquet I/O without
|
||||
needing to pull half of Hadoop into your project.
|
||||
|
||||
The library has been modified with support for reading
|
||||
and writing lists of values, and the default
|
||||
compression has been altered to zstd.
|
||||
|
||||
# Further reading:
|
||||
|
||||
https://parquet.apache.org/docs/
|
Loading…
Reference in New Issue
Block a user