Setting Up Cloudera Data Platform (CDP)

Why CDP?


Last updated 3 years ago

CDP differs from other data platforms in four main ways:

  • Any cloud – CDP provides options to manage, analyze, and experiment with data on-premises and in hybrid, private cloud, and multiple public cloud environments.

  • Multi-function – CDP reduces the time and effort to deploy common application types with five new self-service experiences: flow & streaming, data engineering, data warehouse, operational database, and machine learning.

  • Secure and Governed – CDP simplifies security, privacy, and compliance for diverse enterprise data on any cloud through Shared Data Experience (SDX) technologies. SDX is a set of technologies that helps an enterprise deploy, manage, and share data securely. The services within SDX are Data Catalog, Data Lake, Replication Manager, and Workload Manager.

  • Open – CDP is 100% open source, with open compute and open storage.

Source:
https://www.cloudera.com/products/cloudera-data-platform.html