Matt Martin's Substack

Converting CSVs to Parquet

Matt Martin
Oct 22, 2024
∙ Paid

CSV files are a very common way to exchange and share data. They have their pluses and minuses, but from a cloud strategy perspective, Parquet files have become the gold standard. They compress better than CSV and carry a more formalized schema with them, whereas engines reading CSVs have to sample several hundred or thousand rows and guess the dat…
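
To make the sampling problem concrete, here is a minimal sketch of how a reader might guess column types from the first few rows of a CSV. It uses only Python's standard library and is not how any particular engine actually does it; the function names (`infer_type`, `infer_schema`), the `sample_rows` parameter, and the sample data are all made up for illustration.

```python
import csv
import io


def infer_type(values):
    """Guess the narrowest type that fits every non-empty sampled value."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "string"
    # Try the strictest type first, then fall back to looser ones.
    for name, cast in (("int", int), ("float", float)):
        try:
            for v in non_empty:
                cast(v)
            return name
        except ValueError:
            continue
    return "string"


def infer_schema(csv_text, sample_rows=1000):
    """Infer a column -> type mapping from the first `sample_rows` rows only."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = {name: [] for name in reader.fieldnames}
    for i, row in enumerate(reader):
        if i >= sample_rows:
            break  # rows past the sample never influence the guess
        for name, value in row.items():
            columns[name].append(value)
    return {name: infer_type(values) for name, values in columns.items()}


# Hypothetical data: the `zip` column looks numeric until the third row.
SAMPLE = "id,price,zip\n1,9.99,30301\n2,12.50,30302\n3,7.25,N/A\n"

small_sample = infer_schema(SAMPLE, sample_rows=2)   # never sees "N/A"
full_scan = infer_schema(SAMPLE, sample_rows=1000)   # sees every row

print(small_sample)
print(full_scan)
```

With a two-row sample, `zip` is guessed as an integer; scanning all three rows shows it has to be a string. A file format that stores its schema alongside the data avoids this guesswork, because the reader is told the types instead of inferring them.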
