sql
sql-workflow
io
meta
style guide
local-pyspark
performance
reference
Your technical destination for PySpark
Spark Memory Calculator - WORK IN PROGRESS - Get the max partition size your executors can handle
I'm slowly building this website to make getting up to speed with PySpark less painful.
Right now, finding anything about PySpark beyond the official documentation is painful and time-consuming.
This is why I'm working on this website: I just want one authoritative place. It started with me sending friends copies of my personal notes, and has now matured into a dedicated website.
I'm not a great technical writer. I'm learning as I go, so this website will improve over time.
And obviously, everything here is free. Just become a good engineer and build better solutions for the world.
Enjoy.