Blog post: Converting CSV to parquet in Clojure

Following some conversations we had at the Conj, we wrote a short article about using Clojure to convert CSV files into parquet files.

Interest is certainly building around parquet as a file format, and we think it’s good tech. This is potentially exciting if you work with tabular/columnar data and want it to load fast.

Did you wrap the Java thing, or did you implement it from scratch?

Is the source code published?

Great questions!

The article is about a new single-jar, deps-only lib that simplifies the work it describes:

The heavy lifting is done by tech.ml.dataset (TMD), our library for tabular data processing.

hth - keep the good questions coming!

Thanks!

It looks like a lot of work went into it.

You’re welcome.

And indeed, this is just a small moment in a much larger undertaking of trying to understand “functional data science”.

Thanks for looking into it.
