Cloudera Manager

From GM-RKB
Jump to navigation Jump to search

A Cloudera Manager is a Hadoop cluster manager that is released by Cloudera Inc..



References

2015

2014

  • http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise/cloudera-manager.html
    • Cloudera Manager is designed to make administration of your enterprise data hub simple and straightforward, at any scale. With Cloudera Manager, you can easily deploy and centrally operate the complete Big Data stack. The application automates the installation process, reducing deployment time from weeks to minutes; gives you a cluster-wide, real-time view of nodes and services running; provides a single, central console to enact configuration changes across your cluster; and incorporates a full range of reporting and diagnostic tools to help you optimize performance and utilization.


  • http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise/cloudera-manager/cloudera-manager-features.html
    • Deployment, Configuration & Management
      • Automated Deployment & Hadoop Readiness Checks
      • Install the complete CDH stack in minutes and ensure optimal settings
      • Service Management: Configure and manage all CDH services, including Impala and Search, from a central interface
      • Security Management: Configure and manage security across the cluster - including Kerberos authentication and role-based (administrator and read-only) administration
      • Resource Management: Allocate cluster resources by workload or by user/group/application to eliminate contention and ensure Quality-of-Service (QoS)
      • High Availability: Easily configure and manage High Availability for various services like HDFS, MapReduce, Oozie, YARN, HBase
      • Client Configuration Management: Centrally configure all client access to the cluster
      • Node Templating: Easily deploy and expand heterogeneous clusters by creating templates for node roles
      • Comprehensive Workflows: Perform end-to-end tasks such as start/stop/restart clusters, services and roles, add/delete hosts, decommission nodes etc.
      • Multi-Cluster Management: Manage multiple CDH clusters from a single instance of Cloudera Manager
    • Monitor
      • Service, Host & Activity Monitoring: Get a consolidated, real-time view of the state of all services, hosts and activities running in the cluster
      • Events & Alerts: Create, aggregate and receive alerts on relevant Hadoop events pertaining to system health, log messages, user actions and activities Set thresholds and create custom alerts for metrics collected by CM
    • Diagnose
      • Global Time Control: Correlate all views along a configurable timeline to simplify diagnosis
      • Proactive Health Checks: Monitor dozens of service performance metrics and get alerts you when you approach critical thresholds
      • Heatmaps: Visualize health status and metrics across the cluster to quickly identify problem nodes and take action
      • Customizable Charts: Report and visualize on key time-series metrics about services, roles and hosts
      • Intelligent Log Management: Gather, view and search Hadoop logs collected from across the cluster
    • Integrate
      • Comprehensive API: Easily integrate Cloudera Manager with your existing enterprise-wide management and monitoring tools
      • 3rd Party Application Management: Deploy, manage and monitor services for 3rd party applications running on the cluster (e.g. data integration tools, math/machine learning applications, non-CDH services etc.)
    • Advanced Management Features (Enabled by Subscription)
      • Operational Report & Quota Management: Visualize current and historical disk usage; set user and group-based quotas; and track MapReduce, Impala, YARN and HBase usage
      • Configuration History & Rollbacks
      • Maintain a trail of all actions and a complete record of configuration changes, including the ability to roll back to previous states
      • Rolling Updates: Stage service updates and restarts to portions of the cluster sequentially to minimize downtime when upgrading or updating your cluster
      • AD Kerberos Integration: Integrate directly with Active Directory to get started easily with Kerberos
      • Scheduled Diagnostics: Take a snapshot of the cluster state and automatically send it to Cloudera support to assist with optimization and issue resolution.
      • Automated Backup & Disaster Recovery: Centrally configure and manage snapshotting and replication workflows for HDFS, Hive and HBase