data engineering

Parquet, SQL, DuckDB, arrow, dbplyr and R

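Unlike traditional row-oriented storage (e.g., CSV files or the tables of most transactional SQL databases), Parquet files (.parquet) are column-oriented: the values of each column are stored together rather than the values of each record. This layout compresses efficiently, which keeps files small on disk, and lets a query read only the columns it needs, which makes Parquet well suited to analytical work on large datasets.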
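To make the row-versus-column distinction concrete, here is a minimal, language-agnostic sketch in plain Python. It is not how Parquet is implemented: the toy table, the variable names, and the "values touched" counts are all assumptions made for illustration. It only shows why a columnar layout lets an aggregate read fewer values and why a column of repeated values compresses well under run-length encoding, one of the encodings Parquet supports.

```python
from itertools import groupby

# Hypothetical toy table (illustrative data only): (id, country, amount).
rows = [
    (1, "AU", 10.0),
    (2, "AU", 12.5),
    (3, "AU", 7.0),
    (4, "NZ", 3.0),
    (5, "NZ", 8.5),
    (6, "NZ", 1.0),
]

# Row-oriented layout: the fields of one record sit together, so summing
# "amount" means stepping over every field of every record.
values_touched_row = sum(len(record) for record in rows)
total_from_rows = sum(record[2] for record in rows)

# Column-oriented layout: the values of one column sit together, so the
# same sum only needs the "amount" column.
ids, countries, amounts = (list(column) for column in zip(*rows))
values_touched_col = len(amounts)
total_from_cols = sum(amounts)

# Run-length encoding of a column: consecutive repeats collapse into
# (value, count) pairs, which is why repetitive columns compress well.
country_rle = [(value, len(list(run))) for value, run in groupby(countries)]

print(values_touched_row, values_touched_col)  # 18 6
print(total_from_rows == total_from_cols)      # True
print(country_rle)                             # [('AU', 3), ('NZ', 3)]
```

Both layouts give the same total, but the columnar one reaches it by reading a third of the values, and the six country entries shrink to two pairs. Real Parquet files add column chunks, pages, dictionary encoding, and general-purpose compression on top of this basic idea.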