This is the 14th day of my participation in the August More Text Challenge. For details, see:August is more challenging
directory
zookeeper
- Zookeeper profile
- The core concept
- The principle of CAP
- Conformance protocol
- Set up the ZooKeeper environment
- Single building
- The cluster structures,
- Zookeeper is used
- The basic use
- Command line operation
- The API operations
- Zookeeper source analysis
- Zk startup process
- Watcher core mechanics
- Leader election
- Application scenarios of ZooKeeper
- Distributed Configuration center
- Load balancing
- Uniform Naming Service
- DNS service
- Cluster management
- A distributed lock
- Distributed queue
dubbo
- RPC core
- Common RPC framework
- agreement
- serialization
- IO model
- Write a set of simple RPC principles
- Dubbo high availability
- Dubbo fault tolerance
- Dubbo Load balancing policy and custom implementation
- Dubbo service degraded
- Services available
- Multiversion control
- Multiple registry
- Dubbo principle
- agreement
- serialization
- Underlying communication Principle
- Dubbo source
- Dubbo of spi
- Dubbo configuration file loading and parsing
- Provider Exposes services
- Consumer services
1. A brief introduction to zookeeper
1.1 Core Concepts
Zookeeper is a distributed coordination service that provides distributed data consistency solutions. Distributed applications can publish and subscribe data, load balancing, naming services, cluster management of distributed locks, distributed queues, and more. Distributed data consistency: strong consistency and final consistency Strong consistency: If the data is inconsistent and the data service is not provided externally, the final consistency can be achieved through locking: the final synchronization is required without real-time requirements
1.2 the principle of CAP
CAP: C consistency, A availability,p partition fault tolerance
- Consistency generally refers to strong consistency
- Availability means being available all the time, so it conflicts with C
- N Partition fault tolerance: provides external consistency and availability even when the network is interrupted
Partition fault-tolerant must be designed so that either AP or CP is guaranteed
1.3 Consistency Protocol
When a transaction needs to span multiple nodes, in order to ensure the ACID (atomicity, Consistency, Isolation, persistence) properties of the transaction (which all nodes must meet), a coordinator needs to be selected to coordinate the distributed node scheduling. Two-phase commit (as the name implies, the commit process is divided into two phases
- Phase 1 commits the transaction request
- At present, if there is a transaction operation (adding, deleting and modifying), the coordinator is first sent to the coordinator, who sends the SQL to all participants at the same time. The coordinator waits for the participants’ feedback on whether the execution is successful, and then the participants themselves execute the SQL statement to know whether the execution is successful
- Phase 1 preexecutes the SQL but does not commit the transaction
- Phase 2 Transaction commit:
- Transaction commit
- The coordinator sends commit requests to each participant node
- After receiving the COMMIT request, the participant node performs the commit operation of the transaction
- After each participant node completes the transaction commit, it sends back the commit confirmation to the coordinator
- After accepting the ACK of each participant node, the coordinator completes the transaction commit
- Interrupt the transaction
- Send a rollback request,
- Each participant node rolls back the transaction
- Feedback the rollback result
- The coordinator accepts the ACK of each participant node and rolls back the transaction
- There is a problem
- Synchronous blocking: The time after the coordinator sends the message can only wait, unable to do other operations
- Single point problem: Once the coordinator has a problem, the nodes cannot guarantee consistency
- Split brain: When the participant loses contact with the coordinator, the participant re-selects the coordinator. The participant who receives the COMMIT commits the transaction. The participant who loses contact does not commit, and the consistency of the transaction cannot be guaranteed
- Transaction commit
Three phase commit (as the name suggests is the submission process is divided into three stages, the upgrade version of two-phase commit Actually is to commit the transaction request into two step, formed the CanCommit, PreCommit, doCommit three stages of transactional consistency
Phase 1: CanCommit
- 1. Transaction inquiry
- 2. Each participant node feedback the response of the transaction query to the coordinator
Phase 2: PreCommit (based on the feedback from step 1, there are two cases: transaction PreCommit and transaction interrupt)
- 1. Perform transaction precommit
- 1) Sending preCommit requests: The coordinator sends preCommit requests to all participant nodes and enters the Prepared phase
- 2) Transaction preCommit: After receiving the preCommit request, each participant node performs the transaction operation
- 3) Each participant and node feedback transaction execution to the coordinator
- 2. Interrupt transaction When any participant node responds no to the coordinator or after the timeout, the coordinator does not receive the feedback from the participant, the transaction will be interrupted. The transaction interruption is divided into two steps:
- 1) The coordinator sends abort requests to each participant node
- 2) The participant receives an abort request or interrupts the transaction after a timeout
Phase 3: doCommit commits the transaction
- 1. Submit the file
- 1) Send submit request The coordinator sends doCommit requests to all nodes
- 2) Transaction commit After each participant node receives the doCommit request, the transaction commit operation is performed
- 3) Feedback the result of transaction commit after each participant node completes the transaction commit, it will send ACK to the coordinator
- 4) Transaction completion After receiving the ACK feedback from each participant, the coordinator completes the transaction
- 2. Interrupt the transaction
- 1) After the participant accepts the abort request, the transaction is rolled back
- 2) After the rollback is complete, the participant sends an ACK to the coordinator
- 3) After the coordinator accepts the rollback ACK, the transaction is rolled back
Fixed synchronous blocking (introduced timeout), and single point issues (coordinator does not receive messages for a long time and commits) but failed to resolve network partition issues (split brain)
Network partition problem solution Paxos algorithm
Paxos algorithm is based on message passing and has high fault tolerance. It is also the most effective algorithm to solve distributed consistency problems. In a distributed system, if there is a breakdown or network exception, the value of a certain data can be quickly and correctly agreed within the cluster, and no matter what happens, the consistency of the entire system will not be broken
Four characters in PAxOS: 1. Client: proposer 2. Propose person: proposer 3. Acceptor: decision maker 4
The algorithm is divided into two phases: 1. Parpare phase: preparation phase 2. Accept phase: consent phase
Preparation stage
- (a) A proposer submits a proposal numbered N and sends it to any acceptors
- When receiving a prepare(N) request, it compares the values of N and maxN. If the value is less than maxN, the proposal is obsolete and rejected. If N is greater than or equal to maxN, the proposal is accepted. Then, the Proposal(myid,maxN,value) with the largest number ever accepted by the proposer will be fed back to the proposer: MaxN indicates the maximum number of an acceptor proposal that has been accepted. MaxN indicates the content of the proposal. If the current voter does not accpet any proposal, the proposer will send the proposal(myID,null,null) back to the proposer.
The accept stage
- If the proposer sends a proposal prepare(N) and receives feedback from more than half of the acceptors, the proposer sends the actual proposal(N,value) to all the proposers.
- After an acceptor accepts a proposal (N, value) sent by an acceptor, the acceptor sends the maximum number of proposal (N, value) it has ever accepted and the maximum number of prepare it has received. If N is greater than the two numbers, the current vote accepts the proposal and sends the feedback to the proposer. Otherwise reject the offer.
- If the proposer does not receive more than half of the vote accept feedback, it will enter the prepare stage again, increase the proposal number, and make a new prepare request. If more than half of the votes are accepted, the other voters who do not give feedback to the proposer are called learner, and they actively synchronize the proposer’s proposal.
Zookeeper uses ZAB protocol instead of PAxOS because paxOS algorithm is difficult to implement, and there are problems of live lock and full order (there is no guarantee for the order of two final submissions).
Leader election algorithm
Leader Election Mechanism
- A ZooKeeper cluster works only when more than half of the servers are started
- Until the cluster is working properly, the servers with the smaller myID will vote for the servers with the larger myID until the cluster is working properly to select the Leader
- After the Leader is selected, the previous server changes from looking to following, and the later servers are followers