The introduction
If you’re familiar with Go, you’ve probably heard of the concept of the Go coroutine, the Goroutine, and most of the articles that talk about goroutine are that it’s very lightweight compared to threads, it has the same amount of memory, You can run more Goroutines. But few articles have explained how goroutine is able to take up fewer resources. How much memory does a goroutine take up? These questions will be explained in this paper.
Some basic conclusions
- The memory occupied by goroutine is managed in the stack
- The amount of stack space occupied by goroutine is allocated by the Runtime as needed
- A JVM in a 64-bit environment, for example, allocates a fixed 1MB stack space per thread by default, which can cause stack overflows if the size is misallocated
You are smart enough to see from these conclusions that Goroutines are lighter than threads, and the key is the dynamic allocation of stack space to maximize memory resources. Since this is dynamic allocation, it’s a bit nitpicking to say how much memory a single Goroutine takes up out of context. So let’s take a look at how Goroutine dynamically allocates stack space.
Segmented stack
Earlier versions of Go used a segmented stack for memory management, and when a Goroutine was created, the Runtime allocated 8KB of memory for the coroutine. So the question is, what do I do when I run out of 8KB?
To solve this problem, Go inserts a small piece of pre-code at the entrance to each function, which checks to see if stack space is running out and, if so, calls moreStack () to expand space.
Morestack () allocates a new area of memory to the stack space. The structure at the bottom of the new stack is then filled with various data about the stack, including the address of the old stack that you just came from. When a new stack segment is obtained, Goroutine is restarted by re-executing the function that caused the stack to run out. This is called stack splitting
+---------------+
| |
| unused |
| stack |
| space |
+---------------+
| test |
| |
+---------------+
| |
| lessstack |
+---------------+
| Stack info |
| |-----+
+---------------+ |
|
|
+---------------+ |
| test | |
| | <---+
+---------------+
| rest of stack |
| |
Copy the code
Piecewise stack backtracking mechanism: As shown above, the new stack inserts a stack entry for LessStack (). This function is not actually called explicitly. It is set when the function that exhausted the old stack returns, such as test() in the figure. When test() returns, it is returned to LessStack (), which queries the structure at the bottom of the stack and adjusts the stack pointer (SP) so that it can trace back to the previous stack segment. Then, the new stack segment space can be freed.
Problems with segmented stacks
The segmented stack mechanism allows the stack to expand and shrink as needed. The programmer doesn’t need to care about stack size.
But the segmented stack has its flaws. Shrinking the stack is a relatively expensive operation. This is especially true if you are splitting the stack in a loop. The function grows the stack, splits the stack, returns the stack, and frees stack segments. If you do this in a loop, it can be very expensive. For example, if the loop goes through all of this once, the next time the stack is exhausted, the stack segments have to be redistributed, and then released again, and so on and so on, the overhead is huge.
This is known as a hot split problem. This is the main reason the Golang development team switched to a new stack management approach called stack copying.
Continuous stack
Since GO1.4, the continuous stack mechanism has been used in earnest.
Stack copy starts out a lot like segmented stack. The coroutine runs, uses stack space, and triggers the same stack overflow detection when the stack is about to run out.
However, unlike a backtracking link in a fragment stack, the stack copy creates a new fragment that is twice the size of the old stack and copies the old stack in full. So when the stack shrinks to the old stack size, the Runtime does nothing. Shrink becomes a no OP free operation. In addition, when the stack grows again, the Runtime doesn’t need to do anything but reuse the space that was just expanded.
It’s not as easy as it sounds, but copying stacks is a daunting task. Since a variable in the stack can get its address in Golang, it ends up with a pointer to the stack. If the moving stack is easily copied, any pointer to the old stack will be invalid.
Golang’s memory-safe mechanism dictates that any pointer that can point to the stack must exist in the stack.
So you can use the garbage collector to help with stack copying, because the garbage collector needs to know which Pointers can be collected, so it can find out what part of the stack is a pointer, and when it does stack copying, it updates the pointer information to point to the new target, and all the Pointers associated with it.
However, many of the core scheduling functions and GC cores in the Runtime are written in C, and these functions do not get pointer information, so they cannot be copied. This is done in a special stack, and the stack size is defined by the Runtime developer.