⚙️
Setting Up Cloudera Data Platform(CDP)
  • CDP Overview
  • Why CDP?
  • CDP Services
  • Setting up Google Cloud Platform(GCP) for Cloudera
  • Creating User
  • Configuring Network Settings
  • Configuring Oracle Java
  • Installing Server
  • Configuring MySQL
  • Set Firewall rule on GCP
  • Cloudera Data Platform Installation
  • Working with Cloudera Manager
  • Set Up a Cluster
  • Testing Your Hadoop Installation
  • Installing Hive
  • Hive Validation
  • Deploying Spark 2.4
  • Running Job on Apache Spark2
  • Installing Kafka
  • Kafka Validation
  • Common Warnings and Errors
Powered by GitBook
On this page

CDP Overview

NextWhy CDP?

Last updated 3 years ago

After the merger of Cloudera and Hortonworks in Q4 2018, Cloudera has revamped its platform completely which is called Cloudera Data Platform (CDP). The management, deployment and using CDP is easy like never before. The Unified platform takes care of end to end data management from Edge to AI.

CDP retains components from Cloudera Distribution of Hadoop (e.g. Impala) as well as Hortonworks Data Platform (e.g. NiFi).

Unlike previous distributions of Hadoop from Cloudera/Hortonworks, Cloud is considered as first class citizen and hence it allows us to manage data in any environment, including public clouds like AWS, Azure and GCP (Google Cloud platform), private clouds, Hybrid clouds etc. This new runtime is Artificially intelligent to scale up/down workloads to optimize cost.