CDP Services
CDP is a virtualized platform that manages data and data workloads through the following services:
Management Console
Workload Manager
Data Catalog
Replication Manager
Data Hub
Data Warehouse
Machine Learning
Cloudera Runtime
Management Console is a service for administering CDP. As a CDP administrator, you can use it for managing environments, data lakes, environment resources, and users across all CDP services.
Workload Manager is a service that provides the tools you need to gain an in-depth understanding of your workloads. It can auto-generate workload views for you, or you can manually define views based on the information that is important to you, such as a specific database, statement type, or user.
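To make the idea of a manually defined view concrete, a view can be thought of as a named filter over executed statements. The following is a minimal conceptual sketch only, not the Workload Manager API: the `QueryRecord` and `WorkloadView` classes, their field names, and the sample data are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class QueryRecord:
    """One executed statement. All field names are hypothetical."""
    database: str
    statement_type: str
    user: str
    duration_ms: int


@dataclass(frozen=True)
class WorkloadView:
    """A named filter. A criterion left as None matches any value."""
    name: str
    database: Optional[str] = None
    statement_type: Optional[str] = None
    user: Optional[str] = None

    def matches(self, record: QueryRecord) -> bool:
        # A record belongs to the view when every criterion that is set
        # equals the corresponding field of the record.
        pairs = (
            (self.database, record.database),
            (self.statement_type, record.statement_type),
            (self.user, record.user),
        )
        return all(wanted is None or wanted == actual for wanted, actual in pairs)


def apply_view(view: WorkloadView, records: List[QueryRecord]) -> List[QueryRecord]:
    """Return the records that fall inside the view, in their original order."""
    return [r for r in records if view.matches(r)]


# Made-up sample data, for illustration only.
RECORDS = [
    QueryRecord("sales", "SELECT", "alice", 120),
    QueryRecord("sales", "INSERT", "bob", 45),
    QueryRecord("hr", "SELECT", "alice", 300),
]

sales_selects = WorkloadView("sales-selects", database="sales", statement_type="SELECT")
alice_queries = WorkloadView("alice-queries", user="alice")

for view in (sales_selects, alice_queries):
    print(view.name, len(apply_view(view, RECORDS)))
```

Leaving a criterion unset makes the view broader, so a view can be as narrow as one user's statements of one type against one database, or as wide as everything a single user ran.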
Data Catalog helps you understand data across multiple clusters. Using Data Catalog, you can see how data is interpreted for use, how it is created and modified, and how data access is secured and protected.
Replication Manager is a service for copying, restoring, and migrating data between environments within the enterprise data cloud. It is easy to use and provides rich data movement capabilities for moving existing data and metadata to the cloud.
Cloudera Machine Learning is a cloud-native machine learning platform built for CDP. It combines self-service data science and data engineering in a single, portable service as part of an enterprise data cloud for multi-function analytics on data anywhere.
Cloudera Runtime is the core open source software distribution within CDP. Cloudera maintains, supports, versions, and packages it as a single entity. It includes approximately 50 open source projects that make up the data management tools within CDP, including Cloudera Manager (which is used to configure and monitor clusters managed in CDP).
Data Hub is a CDP service that administrators use to create and manage clusters powered by Cloudera Runtime.
Data Warehouse is a CDP service for self-service creation of independent data warehouses and data marts that autoscale up and down to meet varying workload demands.
A Data Lake is a single logical store of data that provides a mechanism for storing, accessing, organizing, securing, and managing data within an enterprise cloud.
After you create a cluster using the Management Console, you use Cloudera Manager to manage, configure, and monitor the cluster and Cloudera Runtime services.