This is the first day of my participation in the August More text Challenge. For details, see: August More Text Challenge

Today we are going to talk about the garbage collector in Java. The previous article talked about the garbage collection algorithm. This time we are going to talk about the garbage collector in a deeper way.

If you don’t already know anything about the JVM, check out these articles and follow the [high-frequency interview Questions] column to build your knowledge together.

Talk about the MEMORY structure of the JVM!

Talk about the difference between heap and stack in the JVM!

Do you know what algorithms are involved in garbage collection?

According to the book Understanding the Java Virtual Machine, if the collection algorithm is the methodology of memory collection, then the garbage collector is the implementation of memory collection.

Different garbage collectors have been used in different versions of the Java Virtual Machine in different times.

Let’s start by drawing a diagram showing the specific garbage collectors.

Be insane. The upper part (Serial, ParNew, Parallel Scavenge) belongs to the Cenozoic collector. The lower part (CMS, Serial Old, ParNew Old) belongs to the Old collector. G1 is a special collector because it has neither the new generation nor the old.

Collector iteration

Prior to JDK1.3.1, the Serial collector was the only new generation of collectors that had no choice but to say the least.

After JDK1.3.1, Parallel, CMS, G1, and so on appeared.

After JDK1.7 Update14, the COLLECTOR in its virtual machine is officially changed to the G1 collector.

A couple of very important concepts

Stop The World: When The garbage collector is in The process of garbage collection, it must pause all other threads. This pause state will continue until The end of The garbage collection task. This process is called Stop The World, which is a garbage collection pause mechanism, and it is a mechanism that all garbage collectors have, after all, stopping threads every time a collection is done, which causes a certain amount of delay to The system.

There is no way to solve the problem of having to stop all threads, but over the years, and with different versions of the garbage collector, this process has gotten shorter and shorter.

Throughput: Time to run user code/(time to run user code + garbage collection pause). The higher the Throughput, the better the collector performance.

Concurrency and parallelism in the collector:

(1) Concurrency. The garbage collection task executes concurrently with the worker thread. This does not mean that pause times do not exist, but they are greatly reduced.

(2) Parallelization: During The collection process, multiple collection tasks are performed in parallel to reduce The pause time caused by Stop The World and improve The performance of garbage collection.

Minor GC and Full GC:

Minor GC: Garbage collection in the memory of the new generation. It is frequently performed and the collection speed is fast.

Full GC: Garbage collection in memory in the old age. The old age GC is also called the Major GC. It may execute a Minor GC, but this Minor GC may not be executed. Of course, a Full GC is several times slower than a Minor GC.

Serial collector

Collection area: Cenozoic

Use algorithm: copy algorithm

Parallelism supported or not: Not supported, single threaded collector.

Concurrency: Not supported

Known as the oldest and most basic collector, as a single-threaded collector, simple and efficient is its advantage over other single-threaded collectors; It is also because of the single thread, no interaction overhead between threads, to a certain extent, such as lightweight application level systems, resulting in a simple and efficient application mode.

Here’s a rough idea of how it would go about garbage collection:

(1) Single-thread GC is performed and all worker threads are suspended.

(2) Using the replication algorithm for collection.

(3) After the collection is completed, the worker thread will be started. It is possible to continue Full GC here.

ParNew

Collection area: Cenozoic

Use algorithm: copy algorithm

Parallelism or not: Multi-threaded collectors are supported.

Concurrency: Not supported

The ParNew collector is a multithreaded version of the Serial collector, having the same characteristics as the Serial collector except that it processes the collection in parallel.

In a single-CPU environment, the Serial collector is always the best choice because of the multiple thread interactions, so the ParNew collector performs better than the Serial collector in a multi-CPU server environment.

Here’s how ParNew works:

(1) Multiple threads perform GC and pause all worker threads.

(2) Using the replication algorithm for collection.

(3) After the collection is completed, the worker thread will be started. It is possible to continue Full GC here.

Parallel Scavenge

Collection area: Cenozoic

Use algorithm: copy algorithm

Parallelism or not: Multi-threaded collectors are supported.

Concurrency: Not supported

Parallel Scavenge This collector is different from the exploiture collector in that it is designed to control throughput. The exploiture collector is designed to maximise throughput by reducing the amount of time the collection takes to be exploiture.

To control throughput, the Parallel Scavenge collector specifically provides two parameters to achieve throughput.

-xx :MaxGCPauseMillis(maximum garbage collection pause time) : Set a value in milliseconds. The collector tries to keep the collection time as short as possible without making the collection faster.

-xx :GCTimeRatio(throughput size) : an integer ranging from 0 to 100. This value represents the ratio of garbage collection time to the total collection time.

The Parallel Scavenge collector is also called a throughput collector. You can set the ratio of Eden and Survivor zones by using the -xx :SurvivorRatio parameter.

Serial Old

Collection area: Old age

Use an algorithm: mark-collate algorithm

Parallelism supported or not: Not supported, single threaded collector.

Concurrency: Not supported

Slightly different from the Serial collector, but the collection process is the same.

Parallel Old

Collection area: Old age

Use an algorithm: mark-collate algorithm

Parallelism or not: Multi-threaded collectors are supported.

Concurrency: Not supported

Parallel Old has been available since JDK1.6, and it is recommended to consider this collector in situations where throughput is high or CPU resource sensitive.

CMS

Collection area: Old age

Use an algorithm: Mark the algorithm clearly

Parallelism or not: Multi-threaded collectors are supported.

Concurrency Supported or not: Yes

CMS, full name Concurrent Mark Sweep, is a collector with the goal of obtaining the shortest collection pause time. CMS is relatively classic. To achieve its goal, it uses four steps to implement the collection process.

(1) initial marking (2) concurrent marking (3) re-marking (4) concurrent clearing

Does that eliminate Stop The World? That’s not possible, Stop The World will still appear at The initial markup, concurrent markup.

What these four steps mainly do will come back to us in another article.

The other thing to note here is that CMS uses a tag-clear algorithm, which is bound to create a lot of memory fragmentation, which is often asked by interviewers.

G1

Collection area: all

Use algorithm: copy algorithm

Parallelism or not: Multi-threaded collectors are supported.

Concurrency Supported or not: Yes

The G1 could have been written in its own article and explained in more detail later, but today we’ll just look at a few features of the G1.

1. Parallel and concurrent coexistence, under The multi-CPU and multi-core resources, can more effectively shorten The Stop The World pause time.

2. Although only one collector is used to clean up all the memory, it still carries out generational collection, but it takes a different way to process.

3. Since G1 manages all the memory, it divides the memory into multiple Region Spaces in spatial dimension to solve the problem of using replication algorithm to collect.

4. The G1 also has the advantage of being able to predict pause times.

Combination strategy

How do the seven garbage collectors fit together, except G1, which is responsible for the new generation and the old?

Be insane and Serial. Be insane and Parallel Old. Be insane.

Then from a realistic point of view, Serial can be combined with Serial Old and CMS, because ParNew is equivalent to the multi-threaded version of Serial, so ParNew and Serial can be combined with Serial Old and CMS.

The Parallel Scavenge collector can be used in addition to the Parallel Old collector. It can be combined with the Serial Old collector.

Here, the picture above, to make it clearer,

Here, the seven collectors are all said, only after understanding the essence, we can gradually tune the knowledge reserve, the foundation is the most important, everyone treasure.