The original address: www.xilidou.com/2018/02/01/…
CAS is a modern operating system, an important means to solve the concurrency problem, recently while looking at eureka source code. Encountered a lot of CAS operations. Today is a systematic review of CAS in Java.
Read this article and you will learn:
- What is the CAS
- What is the CAS implementation principle?
- Application of CAS in reality
- spinlocks
- Atom type
- Current limiter
- The disadvantage of the CAS
What is the CAS
CAS: full name Compare and swap, literally: “Compare and swap”, a CAS involves the following operations:
We assume that the old data V in memory, the old expected value A, and the new value B need to be modified.
- Compare whether A and V are equal. (compare)
- If it’s equal, write B to V. (exchange)
- Returns whether the operation succeeded.
When multiple threads perform CAS operations on a resource at the same time, only one thread succeeds in the operation, but the other threads do not block, and the other threads receive a failure signal. So CAS is actually an optimistic lock.
How is CAS implemented
Following the AtomInteger code, we can see that the sum.misc.Unsafe class was called. Looking Unsafe, the name signifies an Unsafe class that exploits a well-placed flaw in Java’s visibility rules for classes and packages. Unsafe is a class that compromises Java’s security standards for speed.
Looking even further down the pike, we discovered the Native way to compareAndSwapInt:
public final native boolean compareAndSwapInt(Object var1, long var2, int var4, int var5);
Copy the code
In other words, these CAS methods should use local methods. So the specific implementation of these methods need to be searched in the JDK source code.
So I download a source to continue its downward exploration, we found that in/jdk9u/hotspot/SRC/share/vm/unsafe. The CPP in this code:
{CC "compareAndSetInt", CC "(" OBJ "J""I""I"")Z", FN_PTR(Unsafe_CompareAndSetInt)},
Copy the code
This involves calling the JNI, so you can learn on your own if you’re interested. We searched Unsafe_CompareAndSetInt and found:
UNSAFE_ENTRY(jboolean, Unsafe_CompareAndSetInt(JNIEnv *env, jobject unsafe, jobject obj, jlong offset, jint e, jint x)) {
oop p = JNIHandles::resolve(obj);
jint* addr = (jint *)index_oop_from_field_offset_long(p, offset);
return (jint)(Atomic::cmpxchg(x, addr, e)) == e;
} UNSAFE_END
Copy the code
Finally, we see the core code, Atomic:: CMPXCHG.
Continue to explore the underlying, on the file Java/jdk9u/hotspot/SRC/os_cpu/linux_x86 / vm/atomic_linux_x86 HPP have this code:
inline jint Atomic::cmpxchg (jint exchange_value, volatile jint* dest, jint compare_value, cmpxchg_memory_order order) {
int mp = os::is_MP();
__asm__ volatile (LOCK_IF_MP(%4) "cmpxchgl %1,(%3)"
: "=a" (exchange_value)
: "r" (exchange_value), "a" (compare_value), "r" (dest), "r" (mp)
: "cc", "memory");
return exchange_value;
}
Copy the code
As you can see from the file name, the JVM should have different implementations of Atomic:: CMPXCHG for different operating systems. Since most of our services use 64-bit Linux, we’ll look at the implementation of Linux_x86.
Let’s continue with the code:
__asm__
This is a piece of embedded assembly code. That is, using assembly code in C.- Here,
volatile
A bit like JAVA, but not for memory visibility, but to tell the compiler to stop optimizing code that accesses this variable. LOCK_IF_MP(%4)
If the operating system is multithreaded, add a LOCK.cmpxchgl
It’s the compact-and-swap version. But we know that there are three steps in comparison and exchange that are not atomic. So add a LOCK in the case of multiple cores, and the CPU hardware guarantees its atomicity.- How does LOCK work? We go to the official website of Intel, it can be seen that the early implementation of LOCK is directly blocking the BUS of CUP, such implementation efficiency is very low. It was later optimized for the X86 CPU to have the ability to lock a particular memory address, and when that particular memory address is locked, it prevents other system buses from reading or modifying that memory address.
So much for the low-level exploration of CAS. Let’s summarize how cas is implemented in JAVA:
- Java’s CAS takes advantage of the CAS operations provided by the Unsafe class.
- Unsafe’s CAS relies on the JVM’s Atomic:: CMPXCHG for different operating system implementations
- The implementation of Atomic:: CMPXCHG uses assembler CAS operations and ensures atomicity using the LOCK signal provided by the CPU hardware
The application of CAS
Knowing the principle of CAS, let’s continue to look at the application of CAS:
spinlocks
public class SpinLock {
private AtomicReference<Thread> sign =new AtomicReference<>();
public void lock(a){
Thread current = Thread.currentThread();
while(! sign .compareAndSet(null, current)){
}
}
public void unlock (a){
Thread current = Thread.currentThread();
sign .compareAndSet(current, null); }}Copy the code
The spin lock, which I think is a pretty good name, keeps the while() loop locked () until the CAS operation succeeds.
AtomicInteger incrementAndGet ()
public final int getAndAddInt(Object var1, long var2, int var4) {
int var5;
do {
var5 = this.getIntVolatile(var1, var2);
} while(!this.compareAndSwapInt(var1, var2, var5, var5 + var4));
return var5;
}
Copy the code
It is similar to spin lock, that is, keep while until the operation succeeds.
Token bucket current limiter
The token bucket limiter is a system that adds tokens to the bucket at a constant rate. Get the token from the token bucket before each request. Access is only possible if a token is obtained. Refuse service when there is no token in the token bucket. Let’s look at how Eureka’s stream limiter uses CAS to maintain the addition and distribution of tokens in a multi-threaded environment.
public class RateLimiter {
private final long rateToMsConversion;
private final AtomicInteger consumedTokens = new AtomicInteger();
private final AtomicLong lastRefillTime = new AtomicLong(0);
@Deprecated
public RateLimiter(a) {
this(TimeUnit.SECONDS);
}
public RateLimiter(TimeUnit averageRateUnit) {
switch (averageRateUnit) {
case SECONDS:
rateToMsConversion = 1000;
break;
case MINUTES:
rateToMsConversion = 60 * 1000;
break;
default:
throw new IllegalArgumentException("TimeUnit of " + averageRateUnit + " is not supported"); }}// Provide a method for the outside world to obtain the token
public boolean acquire(int burstSize, long averageRate) {
return acquire(burstSize, averageRate, System.currentTimeMillis());
}
public boolean acquire(int burstSize, long averageRate, long currentTimeMillis) {
if (burstSize <= 0 || averageRate <= 0) { // Instead of throwing exception, we just let all the traffic go
return true;
}
/ / add a token
refillToken(burstSize, averageRate, currentTimeMillis);
/ / consumer token
return consumeToken(burstSize);
}
private void refillToken(int burstSize, long averageRate, long currentTimeMillis) {
long refillTime = lastRefillTime.get();
long timeDelta = currentTimeMillis - refillTime;
// Calculate how many tokens need to be added based on frequency
long newTokens = timeDelta * averageRate / rateToMsConversion;
if (newTokens > 0) {
long newRefillTime = refillTime == 0
? currentTimeMillis
: refillTime + newTokens * rateToMsConversion / averageRate;
// CAS ensures that only one thread enters the population
if (lastRefillTime.compareAndSet(refillTime, newRefillTime)) {
while (true) {
int currentLevel = consumedTokens.get();
int adjustedLevel = Math.min(currentLevel, burstSize); // In case burstSize decreased
int newLevel = (int) Math.max(0, adjustedLevel - newTokens);
// While true until the update succeeds
if (consumedTokens.compareAndSet(currentLevel, newLevel)) {
return;
}
}
}
}
}
private boolean consumeToken(int burstSize) {
while (true) {
int currentLevel = consumedTokens.get();
if (currentLevel >= burstSize) {
return false;
}
// While true until there is no token or until the token is obtained
if (consumedTokens.compareAndSet(currentLevel, currentLevel + 1)) {
return true; }}}public void reset(a) {
consumedTokens.set(0);
lastRefillTime.set(0); }}Copy the code
So sort out the role of CAS in token bucket flow limiter. Is to ensure that in the case of multi-threading, the thread does not block the filling token and consumption token.
inductive
Through the above three applications, we can summarize the application scenarios of CAS:
- The use of CAS prevents threads from blocking.
- Most of the time we use while true until success.
CAS shortcomings
- The ABA problem is that A value changes from Ato B and then to A, and the CAS operation does not detect that value change. You can use A version of AtomicStampedReference that carries A similar timestamp
- Performance issues. Most of the time, we use while true to modify the data until it succeeds. The advantage is that it is correspondingly fast, but when the number of threads keeps increasing, performance degrades significantly because each thread needs to execute, consuming CPU time.
conclusion
CAS is one of the important ideas of the whole programming. CAS is present in the entire computer implementation. Microscopically assembled CAS are the building blocks for implementing atomic operations at the operating system level. From a programming language perspective, CAS is the cornerstone of multi-threaded non-blocking operations. Macroscopically, in distributed systems, we can use the idea of CAS to implement a distributed lock using external storage like Redis.
In some ways, architecture magnifies the micro implementation, or the underlying idea is to miniaturize the macro architecture. The mind of a computer is impenetrable, so understanding the underlying implementation improves the capability of the architecture, and improving the capability of the architecture also deepens the understanding of the underlying implementation. Computer knowledge is vast, but the skills are limited. Grasp the basic several sets of breakthrough, from the perspective of thought and thinking to learn computer knowledge. Don’t waste your energy chasing new technology. Follow the ‘start guide line’ and write a demo, and the result is a demo.
Stopping and looking back at the basics and classics may be more technical.
I hope this article has been helpful.
Free hand lift frame series article address:
Barehanded framework – Request merging in high concurrency environments
Hands-free framework – Implementing IoC
Hands-free framework – Implementing Aop
Welcome to follow my wechat official account