My Oracle Support Banner

Cloudera Manager Frequently Asked Questions (FAQ) (Doc ID 1530717.1)

Last updated on OCTOBER 10, 2019

Applies to:

Big Data Appliance Integrated Software - Version 2.0.1 and later
Linux x86-64

Purpose

This document provides answers to frequently asked questions about Hadoop distributed by Cloudera for use on the Oracle Big Data Appliance(BDA).

NOTE: In the examples that follow, user details, table name, company name, email, hostnames, etc. represent a fictitious sample (and are used to provide an illustrative example only). Any similarity to actual persons, or entities, living or dead, is purely coincidental and not intended in any manner.

 

Questions and Answers

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Purpose
Questions and Answers
 What database is used by Cloudera Manager(CM) components?
 Where to check in CM for the database used for Cloudera Manager components and which node the database resides?
 Is it possible to set log rotation options on Cloudera Manager Agent log files?
 Is replication setup for MySQL(which manages CM) on BDA?
 Is automatic failover configured between primary MySQL and backup?
 Is automatic purging of MySQL binary logs enabled on BDA?
 Are there any cases where it is correct to update configuration files manually and not through Cloudera Manager (CM) for the services managed by CM: HDFS, Hue, MapReduce, Oozie, Zookeeper?
 What about managing other services in Cloudera Manager?
 What about configuring Hive?
 What are the /var/run/cloudera-scm-agent/process directories ?
 What is the difference between a "Service Safety Valve" and "Client Safety Valve ?
 Is there any way in Cloudera Manager to see the previously executed Hive queries for the past two weeks?
 Do we have any options to retrieve resource usage stats for jobs, meaning report of memory and CPU usage by day by job?
 On the CPU time, what components are included in the total? How do I, for example, interpret total CPU Usage and Duration in that report?
 Is there anyway to calculate how many containers (2 MB / 1 CPU) that particular user might consume when we migrate to CDH 5 / YARN?
 My understanding is that in MR2, one can determine how many concurrent tasks are launched per node by dividing the resources allocated to YARN by the resources allocated to each MapReduce task, and taking the minimum of the two types of resources (memory and CPU).  I read somewhere saying BDA 3.0 / CDH 5.0 does not support  CPU allocation yet, it supports only memory allocation. So the concurrent tasks per node = yarn.nodemanager.resource.memory-mb divided by mapreduce.[map|reduce].memory.mb. Can you confirm it?
 Does it mean, it will use both memory and cpu in the calculation if we use FIFO scheduler; but use memory only if we use Fair scheduler? Since we are using Fair scheduler, then I assume cores will be ignored in the calculation. Is that right?
 How can the cluster name be changed?
 Can a single Cloudera Manager manage two BDA clusters?
 If one node of a non-BDA CDH 5.0 cluster can not access HDFS i.e. "hadoop fs -ls /" returns the local file system how can this be resolved?
 On changing the email-IDs for alerting on Cloudera Manager, why still not getting any emails for the new email-IDs on any of the alerts?
 After shutting down (shutdown -h) Node 3 lots of CM configuration warnings are raised.  What could the reason be?
 Is setting up a CM alert to trigger when the hdfs /tmp directory gets to some free space amount possible in CDH 5.4/BDA V4.2?
 How to enable HA for CM for the secure Cluster with TLS/SSL? Is it supported on the BDA?
 What is the Cloudera Manager API and where is the documentation?
 Can the Cloudera Manager API be used to start / stop all Cloudera services in a Linux shell script?
 After an upgrade or maintenance task is there a way to automatically restart the Cloudera Manager Management (mgmt) Services?
 Need to export usage metrics as CSV/Excel from Cloudera Manager on BDA. Need advise on how I can do this, e.g. hdfs growth and usage by users?
 Does Cloudera Manager store LDAP details?
 Need to export this as either JSON or CSV file but without going through Cloudera Manager.  How make use of its API to do it?
 Is there a utility to compare settings in Cloudera Manager (CM) between different BDA/CDH environments? (for example compare Development BDA cluster to PROD BDA Cluster)
 How can CM cluster services be restarted outside of the CM GUI?
 Does a Cloudera Manager diagnostic bundle get uploaded directly to an Oracle SR?
References

My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.