The transformation of ethereum consensus mechanism is not only an epitome of the general trend of PoS becoming more and more popular, but also the first mainstream public chain project transforming from PoW to PoS, which will surely inject fresh blood into the PoS camp and promote the development of PoS consensus. The next part of this article will discuss what to think about the trend of the public chain consensus shifting from PoW to PoS.
And PoS in consensus mechanism
First of all, it needs to be clarified that proof of work, proof of equity and any other proof (PoX) are themselves used to resist witch attacks and are not equivalent to consensus mechanism. Figuratively speaking, they are used only to determine who gets to vote, and the final consensus is determined by counting the number of votes cast.
In the case of bitcoin, the most familiar example, proof of work only proves that a packaged block is formally legal, and ultimately determines whether the transaction contained in the block is valid, depending on whether the block is on the longest chain. Here generating a block with proof of work is like casting a vote, and the chain that gets the most votes (the longest chain) is the valid ledger.
Ethereum, also based on proof of work, uses a different way of counting votes. Instead of choosing the longest chain rule when forking, it greedily chooses the largest barysubtree that can be observed. This consensus mechanism is also known as GHOST (Greedy busiest Observed Subtree).
On the other hand, how voting rights are decided is a very central part of any consensus. Consensus protocols based on the same anti-witch attack mechanism often have a lot in common, and replacing that mechanism often means the entire system needs to be redesigned. As a result, a lot of times we simply use “PoW consensus” and “PoS consensus” to categorize consensus mechanisms.
Features of PoW and PoS mechanisms
The biggest advantage of PoW consensus based on proof of work is its high security.
This security comes from two aspects: first, the marginal cost of voting itself is relatively high, because each successful vote requires a large number of calculations and consumes a lot of electricity in the process; Second, the votes cast and voting rights are strongly bound, because the computational problem to be solved by the proof of workload is calculated according to the block to be cast, and the proof of workload done for one block A cannot be used to vote for another block B.
One second for security is particularly important because it ensures the workload by choices made are unable to cancel, can’t modify, even want to change my past not vote for out – unless the other option to make effort to prove that competition, which need to consume a lot of time and power and other resources, Being a former voter doesn’t give me any advantage over the attacker.
Another advantage of PoW consensus mechanism is its simple structure, easy to analyze and implement. For example, the longest chain mechanism used in Bitcoin is easy to analyze the behavior of ordinary miners from the perspective of game theory, so as to have an objective judgment on its safety. In addition, the logic of determining the longest chain of Bitcoin is also quite simple, and it is not easy to make mistakes.
However, the disadvantages of PoW are also obvious: first, it is high energy consumption, which is also the point that PoW is most criticized and criticized. The National Development and Reform Commission, for example, plans to eliminate virtual currency mining. Although some argue that the current pot-digging industry uses a lot of cheap electricity that, for various reasons, is hard to plug into the grid and would otherwise be wasted, and that the PoW consensus is worth the money, it does little to reverse the prevailing view that PoW mining is a waste of energy.
The second disadvantage is the high latency, slow validation, and low throughput associated with a design that controls the lone block rate by PoW consensus to ensure security. In fact, this shortcoming can be alleviated to some extent by other consensus protocols based on PoW (such as GHOST protocol based on DAG and Conflux protocol based on tree graph, etc.), but related technologies still need to pass the test of practice and time.
Finally, there are governance challenges posed by the PoW consensus mechanism, especially when the interests of miners conflict with those of developers and users. Existing governance is difficult to balance between the three, a little improper treatment may lead to community split, hard fork chain.
The greatest advantage of PoS consensus based on proof of equity is that it almost perfectly solves all the shortcomings of PoW consensus: first, PoS has a very low demand for energy consumption; Secondly, the main bottleneck of PoS confirmation delay and throughput is network communication. It is not necessary to deliberately reduce the block output speed and throughput like the PoW consensus that adopts the longest chain rule. Through reasonable design, the transaction confirmation speed and system throughput can be improved to a relatively ideal degree. Finally, because PoS consensus (with the exception of DPoS) does not require miners dedicated to proof of work, only developers and users, community governance is relatively simple.
On the other hand, the PoS consensus also loses almost all of the advantages of the PoW consensus mentioned above.
In terms of security, the very nature of the PoS consensus requires that the right to vote and the exercise of the right to vote are separated from each other, so that the same vote can be used to produce two votes of different content, and both votes appear to be legitimate separately.
This leads directly to “nothing-at-stake attacks” and “long-range attacks”, which are not available in the PoW consensus.
Disinterested attack refers to a problem in which a voter can vote for both branches of a fork at the same time, thereby gaining votes on both branches. “Long-range attack” means that an attacker can obtain the right to vote in a certain period of time ago by buying other users’ private keys. If the assets corresponding to the right to vote have been transferred, the cost of buying the corresponding private keys will be far lower than the cost of actually holding the same share of assets.
In addition, because always make sure of the right to vote in PoS consensus before deciding how to vote, and is limited by traffic, only part of the user selected to participate in each round, so the attacker can also be directed to buy or blocked by DDoS impending voting rights, and in particular rounds is far higher than its share of the assets held in the voting rights, This is known as a Bribing attack.
It can be said that every PoS consensus protocol design process is unavoidable to fight against several attacks mentioned above. Therefore, PoS consensus protocol has to design a variety of complex rules to detect or resist these attacks against PoS mechanism, which directly leads to PoS consensus mechanism is often more complex, analysis and implementation are much more difficult than PoW consensus. Even so, both the existing PoS and DPoS consensus (compared to the PoW consensus) require some compromise on security and/or decentralization.
Is it right or wrong to switch from PoW to PoS?
From the comparison in the previous section, we can see that PoS consensus has obvious improvement over PoW consensus in efficiency, especially in the speed of reaching consensus. Its disadvantages mainly include more complex protocol design and analysis, as well as potential security risks (PoS of proxy, or DPoS, In effect, some decentralization is abandoned to reduce system complexity and further improve the performance of consensus protocols.
In an ideal world, PoS consensus can confirm a transaction with just two rounds of broadcasts, a speed that PoW consensus can never achieve. Since confirming a trade by PoW consensus must wait for the block containing the transaction to gain majority support, this means waiting for other miners to generate enough blocks, the time and number of rounds broadcast must be much longer than the ideal PoS consensus. Although actual PoS consensus may require more than two broadcasts, and the latency of each broadcast may be significantly higher than that of a PoW consensus network made up of professional miners, this does not undermine the already demonstrated advantage of PoS consensus in terms of confirmation speed and its higher upper limit.
In terms of throughput, the PoS consensus is not significantly better than the PoW consensus, and may even be slightly lower. However, the extra cost of PoS consensus to obtain high throughput is significantly lower than that of existing PoW consensus protocol because of its fast validation. For example, the Conflux consensus protocol based on PoW needs to be improved in many aspects from theory to algorithm, so as to achieve high throughput under the same security as PoW with the longest chain rule. The most time-consuming voting process, which in most PoS consensus actually takes time regardless of block size, can be scaled up to the maximum throughput of nodes by simply expanding blocks — at the expense of increasing validation time and not overusing it.
As for security, the disadvantages of PoS consensus system mainly lie in more potential risks and the possibility of being attacked. The security of bitcoin’s consensus mechanism has been proven for a long time, and the security and reliability of mainstream PoW chains such as Ethereum have been proven over time, but no PoS public chain has yet been tested to the same level. The existence of multiple types of attacks, such as disinterested attacks, long-range attacks and bribery attacks, also makes it harder to trust the security of PoS consensus — even if they can resist these types of attacks, who can guarantee that there are not other attacks that are not well known?
The non-binding nature of voting rights and voting behavior in PoS consensus allows consensus participants to take a much richer set of actions than miners in PoW, which means greater freedom for attackers and makes it harder to analyze how other participants will behave.
The security of much of the existing PoS consensus depends on the “honest majority”, but in practice it is hard to find so-called “honest participants” who will unconditionally implement the agreement. In the real world, people are closer to what economics and game theory talk about as “rational people” — a rational good person who will do nothing to harm others and not benefit himself. But it is also possible to do bad things that violate consensus agreements when there is a big enough revenue drive.
Therefore, in addition to guarantee a safe consensus PoS system in the case of most of the participants are honest is safe, still must through the design of incentive mechanism, ensure that every rational participants will abide by the consensus agreement, rather than trying to greater revenues from the violation of the agreement, this is known as the “incentive compatibility in the game theory”. The existing mainstream PoW consensus is incentive compatible. For example, according to the rules of Bitcoin, only the producer of the block on the longest chain can get the mining reward, so the miners will dig behind the longest chain according to the rules in order to get the reward. As for PoS consensus system, its own rules are very complex and participants have a large space for action, so it is more difficult to analyze PoS consensus system in depth and prove similar incentive compatibility.
To sum up, the security shortcomings of PoS are actually more of a hidden danger rather than an unfixable vulnerability, which can be partially or even completely eliminated through perfect design and precise analysis. In the right circumstances, we can even make sacrifices in other ways to get security that is usable. Therefore, the author believes that using PoS consensus to replace PoW consensus is more beneficial than harmful on the premise that the security of PoS consensus can meet the requirements of application scenarios.
In addition, the chain from PoW to PoS actually has an additional advantage, that is, it avoids the problem of too much concentration of tokens in the initial stage of the PoS common chain. Often when a public chain project is launched all tokens are in the hands of the development team and early investors, which creates the risk that the revenue generated on the chain and even the consensus of the entire chain will be monopolized by a few people. Of course, ethereum has been operating under the PoW consensus for many years, and its tokens have been dispersed to a large number of users. There is no problem that a few users who control a large number of tokens control the whole chain through the PoS consensus.
Finally, the author still thinks that the existing mainstream public chain from PoW to PoS needs to be cautious. After all, there is no recognized PoS consensus that can safely replace the existing PoW, and the rush to introduce a PoS consensus mechanism that has not been proven to be secure is likely to bring unnecessary risks and even disastrous consequences to the entire consensus system. The Ethereum community is clearly aware of this, so since Vitalik Buterin first proposed the switch to PoS consensus in December 2015, the PoS protocol designed for the Ethereum upgrade has gone through multiple iterations and iterations. In addition, the upgrade route of Casper FFG, which adopts PoW/PoS hybrid consensus first, and then Casper CBC, which adopts pure PoS consensus, is planned.
We wish ethereum the best of luck with this “eat the crab” upgrade, which serves as an example of successful transformation and provides the entire blockchain community with more experience in using PoS consensus. Of course, even if Ethereum has some bumps in the road during this upgrade, it will certainly contribute to a better understanding and use of the PoS consensus in another form.