Overview

High availability of Hadoop

Hadoop high availability covers two parts: HDFS high availability and YARN high availability. The two implementations are similar. However, the HDFS NameNode has stricter requirements for data storage and consistency than the YARN ResourceManager, so its implementation is more complicated.
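
To make the difference concrete, here is a rough sketch of the settings each side typically needs. It assumes one common setup in which the NameNodes share their edit log through JournalNodes (the Quorum Journal Manager); the article itself does not say which mechanism is used. The nameservice ID mycluster, the IDs nn1, nn2, rm1, rm2, the cluster ID, and all host names are hypothetical placeholders. The property names are standard Hadoop ones, but the set is not a complete configuration (fencing and ZooKeeper settings, for example, are left out).

```python
from xml.etree import ElementTree as ET

# Hypothetical placeholders: replace with the IDs and hosts of a real cluster.
NAMESERVICE = "mycluster"
NAMENODES = {"nn1": "master1:8020", "nn2": "master2:8020"}
JOURNALNODES = ["node1:8485", "node2:8485", "node3:8485"]
RESOURCEMANAGERS = {"rm1": "master1", "rm2": "master2"}


def hdfs_ha_properties():
    """Core hdfs-site.xml properties for NameNode HA (assumes JournalNodes)."""
    props = {
        "dfs.nameservices": NAMESERVICE,
        f"dfs.ha.namenodes.{NAMESERVICE}": ",".join(NAMENODES),
        # Shared edit log: this is the extra storage/consistency piece
        # that the ResourceManager side does not need.
        "dfs.namenode.shared.edits.dir":
            "qjournal://" + ";".join(JOURNALNODES) + "/" + NAMESERVICE,
        f"dfs.client.failover.proxy.provider.{NAMESERVICE}":
            "org.apache.hadoop.hdfs.server.namenode.ha."
            "ConfiguredFailoverProxyProvider",
        "dfs.ha.automatic-failover.enabled": "true",
    }
    for nn_id, address in NAMENODES.items():
        props[f"dfs.namenode.rpc-address.{NAMESERVICE}.{nn_id}"] = address
    return props


def yarn_ha_properties():
    """Core yarn-site.xml properties for ResourceManager HA."""
    props = {
        "yarn.resourcemanager.ha.enabled": "true",
        "yarn.resourcemanager.cluster-id": "yarn-cluster",  # hypothetical ID
        "yarn.resourcemanager.ha.rm-ids": ",".join(RESOURCEMANAGERS),
    }
    for rm_id, host in RESOURCEMANAGERS.items():
        props[f"yarn.resourcemanager.hostname.{rm_id}"] = host
    return props


def to_configuration_xml(props):
    """Render a property dict in the <configuration> format Hadoop reads."""
    root = ET.Element("configuration")
    for name, value in props.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(to_configuration_xml(hdfs_ha_properties()))
    print(to_configuration_xml(yarn_ha_properties()))
```

In this sketch the HDFS side carries the shared edit log location and per-NameNode addresses, while the YARN side mostly just names the ResourceManagers, which reflects why the NameNode setup is the more involved of the two.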

HDFS high availability architecture

Cluster Architecture Design

Environment preparation

Cluster structure

Reference

How to Set Up Hadoop Cluster with HDFS High Availability