For Internet companies, the problem of online CPU and memory surge is very common (such as sudden surge of traffic), while for programmers, it is basically the following steps as troubleshooting

Normal operation

1. Run the top command to check which process occupies too many cpus

The PID of the Java process is 11391

2. Check the CPU usage of all Java process threads

top -Hp < PID >

PID is the ID of the thread

3. Convert the thread ID to hexadecimal. The stack information is displayed in hexadecimal, so you need to convert the thread ID to hexadecimal

printf %x 11405

4. View the stack information

Jstack process < number > | grep < hexadecimal thread Id >

If VM Thread is os_PRIO =0 TID = 0x00007F871806e000 nID = 0xA Runnable, VM Thread is reclaimed by GC

5. Check the process GC status

Jstat -gcutil < process number > < count interval milliseconds > < count times >

See if the GC of a process continues to change, and if the FGC returned is large and keeps increasing

Confirm the Full GC!

Or use

Jmap-heap < process ID >

Check to see if the heap is overflowing, especially if the heap usage reaches the threshold (see the garbage collector and the threshold configured at startup) and the process is Full GC

6. Output the dump file and check the number of program instances using a tool

Jmap-dump :format=b,file=filename < PID > Export the memory heap of a process to a file. Use the VISUalVM or MAT tool in the JDK to check the objects in the memory

Cause analysis,

1. Excessive memory consumption leads to excessive GC times of FULL

Perform operations 1 to 5

See which threads are responsible for garbage collection

Monitoring the GC with the jstat command, you can see that the number of FULL GC sessions is very high

2. There is a lot of CPU consumption in the code

Perform Steps 1-4

Stack information can be used to locate which line of code is consuming CPU

3. Deadlock occurs due to improper lock usage

Perform Steps 1-4

If there is a deadlock, it will be directly prompted. Keyword: deadlock. Step 4, the location of the business deadlock is printed.

Deadlock causes: Typically, two threads are waiting for each other to hold a lock.

4. A large number of threads randomly access the interface slowly

There is an obstructing operation in a certain position of the code, resulting in the overall time consuming of the function call, but the occurrence is relatively random; Usually the CPU consumption is not much, and the memory occupation is not high.

Ideas:

The interface is first found, and a large number of threads will block at the choke point as the pressure gauge continues to increase access.

Perform Steps 1-4.

Check thread blocking status, TIMED_WAITING is blocked

5. A thread enters the WAITING state for some reason, and the function is unavailable but cannot be reappeared.

Perform Steps 1-4.

Jstack queries several times, each time with an interval of 30 seconds, to compare the thread that stays in the WAITING state caused by parking. For example, CountDownLatch backs the timer so that the relevant threads wait ->AQS-> locksupport.park ().

To learn more about computer programming techniques, watch the instructional video