Cloudera Enterprise 6.0 Beta | Other versions

Spark 2 Known Issues

The following sections describe the current known issues and limitations in Cloudera Distribution of Apache Spark 2. In some cases, a feature from the upstream Apache Spark project is currently not considered reliable enough to be supported by Cloudera. For a number of integration features in CDH that rely on Spark, the feature does not work with Cloudera Distribution of Apache Spark 2 because CDH components are not introducing dependencies on Spark 2.

Continue reading:

Hive CBO not Supported with Spark

The Hive cost-based optimizer (CBO) is not supported for use with Spark.

Spark Standalone

Spark Standalone is not supported for Spark 2. It is not included with CDH 6. By default, there is only a single Spark service available, not separate services for YARN and Spark Standalone cluster managers as in CDH 5.

Structured Streaming is not supported

Cloudera does not support the Structured Streaming API.

SparkR is not Supported

SparkR is not supported for Spark 2. (SparkR is also not supported in CDH with Spark 1.6.)

GraphX is not Supported

GraphX is not supported for Spark 2. (GraphX is also not supported in CDH with Spark 1.6.)

Thrift Server

The Thrift JDBC/ODBC server is not supported for Spark 2. (The Thrift server is also not supported in CDH with Spark 1.6.)

Spark SQL CLI is not Supported

The Spark SQL CLI is not supported for Spark 2. (The Spark SQL CLI is also not supported in CDH with Spark 1.6.)

Page generated March 7, 2018.