Using Distcp to Copy Data Between Two CDH Clusters Located in different DMZ Fails (Doc ID 1627255.1)

Last updated on NOVEMBER 08, 2022

Applies to:

Big Data Appliance Integrated Software - Version 2.0.1 to 2.4.0 [Release 2.0 to 2.4]
Linux x86-64

Symptoms

The distcp command is used to copy data from a source cluster running CDH4.2 (not a BDA cluster) to a BDA cluster running CDH4.4. The source cluster is in a different DMZ from the BDA destination cluster, and the two clusters are connected through the admin network only.


The distcp command failed with the following error:


java.net.ConnectException: Connection timed out
  at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
  at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:599)
  at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:207)
  at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:525)
  at org.apache.hadoop.hdfs.DFSOutputStream.createSocketForPipeline(DFSOutputStream.java:1227)
  at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.createBlockOutputStream(DFSOutputStream.java:1053)
  at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1013)
  at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:463)
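
The trace shows the timeout occurring in DFSOutputStream.createSocketForPipeline, that is, while the HDFS client opens a data-transfer socket to a DataNode for the write pipeline. One way to narrow this down is to check, from a host that runs the distcp map tasks, whether each destination DataNode accepts TCP connections on its data transfer port. The following is a minimal sketch of such a probe, not a step taken from this article. It assumes the DataNodes listen on port 50010 (the usual default of dfs.datanode.address in this Hadoop generation, which your cluster may override); the helper names can_connect and probe are hypothetical.

```python
import socket


def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port opens within `timeout` seconds."""
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, unreachable networks and DNS failures.
        return False


def probe(hosts, port=50010, timeout=5.0):
    """Map each host to True/False depending on whether the port is reachable.

    The default port 50010 is an assumption (common DataNode data transfer
    port); confirm the real value in the destination cluster's hdfs-site.xml.
    """
    return {host: can_connect(host, port, timeout) for host in hosts}
```

Calling probe with the destination DataNode hostnames, for example probe(["bda-node01.example.com"]) with a placeholder hostname, returns a dictionary of reachability results. A False entry only indicates that the port could not be reached from that host within the timeout; it does not by itself identify why.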

 

Cause


