Pyspark Run on BDA Edge Nodes Fails in Distributed Mode Works but Works in Local Mode
(Doc ID 2068501.1)
Last updated on NOVEMBER 08, 2022
Applies to:
Big Data Appliance Integrated Software - Version 4.2.0 and laterLinux x86-64
Symptoms
Launching a Pyspark job from a BDA edge node in distributed mode is returning the following error:
Error from python worker:
python: module pyspark.daemon not found
PYTHONPATH was:
/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/jars/spark-assembly-1.3.0-cdh5.4.0-hadoop2.6.0-cdh5.4.0.jar
java.io.EOFException
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Cause |
Solution |
References |