Spark Installation
Follow the Spark standalone mode instructions at https://spark.apache.org/docs/latest/spark-standalone.html and make sure the following steps are completed:
- Install Spark (either download a pre-built Spark distribution or build the assembly from source). Note that Spark ships different distributions for different versions of Hadoop, so pick the one that matches your Hadoop version. Note the location of the spark-assembly-*.jar on the node Hive will run from.
- Start the Spark cluster (master and workers). Note the Spark master URL, which has the form spark://HOST:PORT.
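
The two values to record in the steps above (the assembly jar location and the master URL) can be collected with a small helper. The following is only a minimal sketch, not part of Spark or Hive: the function names, the SPARK_HOME environment variable lookup, and the /opt/spark fallback path are assumptions made for illustration. It searches the whole Spark directory because the jar location differs between pre-built and source-built installs, and it assumes the master listens on the standalone default port 7077.

```python
import os
import socket
from pathlib import Path

# Default port of a Spark standalone master; override if your master uses another.
DEFAULT_MASTER_PORT = 7077


def find_assembly_jars(spark_home):
    """Return sorted paths of all spark-assembly-*.jar files under spark_home.

    The search is recursive, so it does not assume a particular subdirectory.
    Returns an empty list if spark_home is not a directory.
    """
    root = Path(spark_home)
    if not root.is_dir():
        return []
    return sorted(str(p) for p in root.rglob("spark-assembly-*.jar"))


def master_url(host, port=DEFAULT_MASTER_PORT):
    """Build a standalone master URL of the form spark://HOST:PORT."""
    return f"spark://{host}:{port}"


if __name__ == "__main__":
    # Hypothetical install location; set SPARK_HOME to your real path.
    spark_home = os.environ.get("SPARK_HOME", "/opt/spark")
    jars = find_assembly_jars(spark_home)
    if jars:
        for jar in jars:
            print("assembly jar:", jar)
    else:
        print("no spark-assembly-*.jar found under", spark_home)
    # Assumes the master runs on this host with the default port.
    print("master URL:", master_url(socket.gethostname()))
```

Run it on the node Hive will run from. If the master runs on a different host or port, the URL shown on the master's web UI or in its startup log is the authoritative value.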