Hadoop Cluster Profile Templates
The Hadoop cluster profile template specifies the number of nodes in the cluster and handles provisioning and configuring the Hadoop cluster services. These services are developed by Apache Software Foundation projects for Hadoop deployment and integration. Some Hadoop distributions support only a subset of these services, or provide their own distribution-specific services.
Note: You cannot uncheck some of the services because they are required to create a Hadoop cluster. All mandatory services are checked by default.

Each of the following services supplies a dedicated function. The table shows which distributions support each service:
Hadoop Services | Cloudera | MapR | Hortonworks |
---|---|---|---|
HDFS | Yes | — | Yes |
CLDB | — | Yes | — |
YARN/MapReduce | Yes | Yes | Yes |
ZooKeeper | Yes | Yes | Yes |
HBase | Yes | Yes | Yes |
Hive | Yes | Yes | Yes |
Oozie | Yes | Yes | Yes |
Hue | Yes | — | — |
Spark | Yes | Yes | Yes |
Key-Value Store Indexer | Yes | — | — |
Solr | Yes | — | — |
Sqoop | Yes | Yes | Yes |
Impala | Yes | — | — |
Flume | Yes | Yes | — |
PIG | — | Yes | Yes |
MAHOUT | — | Yes | — |
Falcon | — | — | Yes |
Tez | — | — | Yes |
Storm | — | — | Yes |
Ganglia/Ambari Metrics | — | — | Yes |
Drill | — | Yes | — |
Kafka | Yes | — | Yes |
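
When planning a template, it can help to check a list of desired services against the table before choosing a distribution. The following is a minimal sketch of that lookup, not part of the product: the names `SUPPORT`, `services_for`, and `unsupported` are hypothetical, and the data is transcribed by hand from the table above.

```python
# Support matrix transcribed from the table above (illustrative only).
# Column order: (Cloudera, MapR, Hortonworks); True corresponds to "Yes".
DISTRIBUTIONS = ("Cloudera", "MapR", "Hortonworks")

SUPPORT = {
    "HDFS": (True, False, True),
    "CLDB": (False, True, False),
    "YARN/MapReduce": (True, True, True),
    "ZooKeeper": (True, True, True),
    "HBase": (True, True, True),
    "Hive": (True, True, True),
    "Oozie": (True, True, True),
    "Hue": (True, False, False),
    "Spark": (True, True, True),
    "Key-Value Store Indexer": (True, False, False),
    "Solr": (True, False, False),
    "Sqoop": (True, True, True),
    "Impala": (True, False, False),
    "Flume": (True, True, False),
    "PIG": (False, True, True),
    "MAHOUT": (False, True, False),
    "Falcon": (False, False, True),
    "Tez": (False, False, True),
    "Storm": (False, False, True),
    "Ganglia/Ambari Metrics": (False, False, True),
    "Drill": (False, True, False),
    "Kafka": (True, False, True),
}


def services_for(distribution):
    """Return the services marked "Yes" for one distribution, in table order."""
    col = DISTRIBUTIONS.index(distribution)  # raises ValueError if unknown
    return [name for name, row in SUPPORT.items() if row[col]]


def unsupported(distribution, requested):
    """Return the requested services that the distribution does not list."""
    available = set(services_for(distribution))
    return [name for name in requested if name not in available]


if __name__ == "__main__":
    # Example: which of these services are not listed for MapR?
    print(unsupported("MapR", ["HDFS", "Hive", "Impala"]))
```

Keeping the matrix as plain data means the lookup stays correct only as long as it is updated whenever the table changes.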