2023-09-14 09:33:36 +00:00
|
|
|
The processed-data package contains models and logic for
|
|
|
|
reading and writing parquet files with the output from the
|
|
|
|
[converting-process](../../processes/converting-process).
|
|
|
|
|
|
|
|
Main models:
|
|
|
|
|
2024-02-27 20:15:49 +00:00
|
|
|
* [DocumentRecord](java/nu/marginalia/model/processed/DocumentRecord.java)
|
|
|
|
* * [DocumentRecordKeywordsProjection](java/nu/marginalia/model/processed/DocumentRecordKeywordsProjection.java)
|
|
|
|
* * [DocumentRecordMetadataProjection](java/nu/marginalia/model/processed/DocumentRecordMetadataProjection.java)
|
|
|
|
* [DomainLinkRecord](java/nu/marginalia/model/processed/DomainLinkRecord.java)
|
|
|
|
* [DomainRecord](java/nu/marginalia/model/processed/DomainRecord.java)
|
2023-09-14 09:33:36 +00:00
|
|
|
|
|
|
|
Since parquet is a column based format, some of the readable models are projections
|
|
|
|
that only read parts of the input file.
|
|
|
|
|
|
|
|
## See Also
|
|
|
|
|
|
|
|
[third-party/parquet-floor](../../../third-party/parquet-floor)
|