Spark on YARN Frequently Asked Questions
(Doc ID 1920743.1)
Last updated on DECEMBER 27, 2022
Applies to:
Big Data Appliance Integrated Software - Version 3.0.1 and later
Linux x86-64
Purpose
To provide answers to frequently asked questions for Spark on YARN.
Questions and Answers
In this Document
Purpose
Questions and Answers
How to configure Spark in BDA 3.0.1 to run Spark on YARN?
What modes can Spark on YARN be run in?
How to run Spark on YARN in cluster mode so that it uses the same shared resources and allocations as YARN, as specified in the dynamic resource pool settings?
How long do YARN container logs remain for Spark? I assume the period is defined by one of the parameters in CM. Where in CM is it defined?
I don't see the folder /user/spark/applicationHistory. Where is it located?
How to find logs when running the Spark application?
I had to stop the Spark services to run Spark on YARN. When do I restart the service?
For YARN cluster mode, the argument in step 6 to run the SparkPi example in Doc ID 1916688.1 is --args yarn-standalone. Is it a typo to specify stand-alone for cluster mode?
Is upgrading Spark on YARN supported on the BDA?
Is it possible to upgrade Spark2 on the BDA?
How to change the Spark logging level from INFO to WARN?
On BDA V4.10/CDH 5.12.1, what do Spark warnings like "WARN metastore.ObjectStore: Version information not found in metastore" indicate?
In BDA 5.1 with CDH 6.x, is ORC a supported format for Spark applications?
References