NOTE: This Wiki is obsolete as of November 2016 and is retained for reference only.
Overview
PySpark is built on top of Spark's Java API. Data is processed in Python and cached / shuffled in the JVM:
...
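The division of labor described above can be illustrated with a small pure-Python simulation. This is a sketch under stated assumptions, not PySpark's actual implementation: `OpaqueBlockStore` and `python_worker_map` are hypothetical names invented for this example, the store merely stands in for the JVM side, and `pickle` is assumed as the serialization format. The point it shows is that the caching side handles only opaque bytes, while the Python side deserializes, applies the user function, and serializes the result again.

```python
import pickle


class OpaqueBlockStore:
    """Hypothetical stand-in for the JVM side: it caches bytes and nothing else."""

    def __init__(self):
        self._blocks = {}

    def put(self, block_id, payload):
        # The store never inspects Python objects, so reject anything unserialized.
        if not isinstance(payload, bytes):
            raise TypeError("store accepts serialized bytes only")
        self._blocks[block_id] = payload

    def get(self, block_id):
        return self._blocks[block_id]


def python_worker_map(store, src_id, dst_id, func):
    """Hypothetical Python-side step: deserialize, process, reserialize."""
    records = pickle.loads(store.get(src_id))    # bytes -> Python objects
    result = [func(record) for record in records]  # processing happens in Python
    store.put(dst_id, pickle.dumps(result))      # Python objects -> bytes


def double(x):
    return x * 2


store = OpaqueBlockStore()
store.put("input", pickle.dumps([1, 2, 3]))
python_worker_map(store, "input", "doubled", double)
doubled = pickle.loads(store.get("doubled"))
```

In this sketch the store could be replaced by any byte-oriented cache without touching `double`, which mirrors why the caching and shuffling layer does not need to understand Python objects.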