overview
High availability of Hadoop
Hadoop high availability is classified into HDFS high availability and YARN high availability. The implementation of HDFS and YARN High availability are similar. However, HDFS NameNode has higher requirements on data storage and consistency than YARN ResourceManger, so its implementation is more complicated.
HDFS high availability architecture
Cluster Architecture Design
Environment to prepare
The cluster structures,
reference
How to Set Up Hadoop Cluster with HDFS High Availability