Possibility to use Parquet for easy S3 backup?

👋 

This is just a question, not a feature request or issue or anything of the sort :)

I'm just starting to learn DuckDB and I wonder if it's a lot of work to make PhoenixAnalytics work with Parquet files? The upside is that they can be easily backed up to S3 (immutable), are quite storage-efficient (40% smaller according to https://benchmark.clickhouse.com/) and almost as fast (60% slower according to https://benchmark.clickhouse.com/) as DuckDB's custom storage format, and have zero load time. The downside is that they might require compaction, but at the same time that enables TTL.

<img width="918" alt="Screenshot 2024-10-22 at 19 20 22" src="https://github.com/user-attachments/assets/beffd378-5ec3-49ac-a3ce-5fc5dde40028">

Or maybe there are other ways to stream / backup DuckDB to S3, like in https://motherduck.com/blog/differential-storage-building-block-for-data-warehouse/ or https://litestream.io?



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possibility to use Parquet for easy S3 backup? #27

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Possibility to use Parquet for easy S3 backup? #27

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions