The JVM? Isn’t this one enough
Writing in the front
As we programmers spend more and more time in the hole, we inevitably run into problems with the JVM, such as OOM exceptions, excessive GC counts, long GC sessions, etc. At this time, do not have the JVM knowledge, encountered GC problems, a face confused ah, how to tune! Also, knowledge of the JVM is almost a must-ask for a job interview, so if you don’t know it, you’ll feel confused when asked. This chapter is dedicated to providing a detailed introduction to the JVM, as simple and efficient as possible.
Content abstract
A JAVA memory model introduction two JVM internal composition three JAVA object memory layout introduction and creation process four JVM parameters tuning five garbage collection mechanism details six JVM performance analysis commandsCopy the code
Java memory model
Definition 1:
Describes a set of specifications that define how variables can be accessed.Copy the code
2 features
-
Visibility: When a thread makes changes to a shared variable, other threads can immediately sense the changeCopy the code
-
Atomicity: It is indivisible. When a thread is doing a certain business, it either succeeds or fails at the same time. It cannot be added or split.Copy the code
-
Orderliness: When a computer executes a program, the compiler and processor often rearrange instructions to improve performance.Copy the code
3 Volatile keyword
3.1 features:
- Guaranteed visibility
- Atomicity is not guaranteed
- Disallow instruction reordering optimization. (Ensures order in JMM)
If a shared variable is volatile, a memory barrier is inserted into the instruction sequence at compile time to prevent a particular type of handler from reordering.
At the virtual machine level, the order of code execution is modified to minimize the impact of CPU vacancy when memory operations are much slower than THE CPU speed.
3.2 the principle
- When a volatile variable is written, the shared variable in the thread’s local memory is flushed to main memory.
- When a volatile variable is read, the thread next reads the shared variable from main memory.
3.3 How to solve the atomicity problem?
- Use atomic classes under JUC packages, such as AtomicInteger
- Lock synchronized
Internal components of the JVM
Class loader subsystem
The main classes of class loaders are:
Bootstrap ClassLoader: a core Java class library written in C/C++ that loads Java and does not inherit ClassLoader and has no parent ClassLoader. Extended ClassLoader: loads the class library in the jre/lib/ext directory. Application loader: default loader for loading user-defined classes. Why custom class loaders? Isolating loading classes to avoid class conflicts Extending loading sources to prevent source leakage Custom steps: Inherit from ClassLoader you can also inherit from URLClassLoaderCopy the code
Class loading process:
1 load
Obtaining the binary byte stream defined by a class's fully qualified name converts the static storage structure represented by the byte stream into the runtime data structure of the method areaCopy the code
2 links
File format verification: Verifies that the byte stream complies with Class file format specifications. For example, if the byte stream starts with 0xCAFEBABE metadata verification: Bytecode verification: Through the analysis of data flow and control flow, the program semantics are legitimate and logical. Symbol reference validation: Ensures that the parse action is performed correctly. Static: allocates memory for class variables and sets the default initial value of the class variable to zero. Note: This allocates memory only for class variables (static), but not for instance variables, which are resolved in the Java heap along with the object allocation at instantiation: The procedural resolution action of replacing symbolic references in the constant pool with direct references (memory addresses) is mainly for class or interface, field, class method, interface method, method type, method handle, and call point qualifier 7 class symbolic references.Copy the code
Symbolic reference: a set of symbols that describe an object, which can be any literal. A direct reference is a pointer to a target directly, a relative offset, or a handle to the target indirectly.
3 the initialization
Only active use of a class can lead to class initialization, which is the procedure that executes the class constructor method <clinit>. Active use of a class includes the following six types: -- Create an instance of a Class, that is, new -- access or assign a value to a static variable of a Class or interface -- call a static method of a Class -- reflect (e.g. Class.forname (" com.shengsiyuan.test ") -- initialize a subclass of a Class, The parent class is also initialized - a class (Java Test) that is marked as the startup class when the Java VIRTUAL machine starts, and a main class is run directly using the java.exe commandCopy the code
2 Run time data area
The stack:
Life cycle: The master program runs, while the thread is created, follows the life cycle of the thread, and the thread ends the stack to release memory. There is no garbage collection and the thread is private. Operation principle: The data in the Stack is in the format of Stack Frame, which is a memory block, a data set, and a data set related to method and runtime data. When method A is called, A stack frame F1 is generated and pushed onto the stack. Method A calls method B and F2 is pushed onto the stack. Method B calls method C and F3 is pushed onto the stack... After the operations are complete,...... is displayed F3 stack frame, F2 stack frame, F1 stack frame.Copy the code
Internal structure of stack frame:
Local variables to preserve local variables (8 kinds of basic data type/object reference address) dynamically link to the runtime constant pool method reference operand stack Is mainly used to store the results of calculation process middle, at the same time as a variable in the process of computing a temporary storage space The bottom is the return address array implementation method Some additional informationCopy the code
Pile:
The internal structure
Survivor zone: YGC/MinorGC will occur when the zone is full. GC will only occur when the Zone is full. Survivor zone: 0 survivor zone: 1/10 of the entire Heap. When an object is still available after 15 GCS, it enters this region. The GCS that occur in this region are called MajorGCCopy the code
Method area (jdk1.8 called meta space)
Storing this data internally:
Runtime constant pool method information Class Class information Domain information (Class attributes) Notice Using local memory The VM specification does not specify whether to reclaim or not. There may also be Class unload thread sharingCopy the code
Local method stack
Manages calls to local methodsCopy the code
Program counter
Each thread has a program calculator, which is a pointer to the method bytecode (the code for the next instruction to be executed) in the method area, which is read by the execution engine. No GC, no OOM because the CPU has to keep switching between threads and needs to know where to start after switching backCopy the code
3 Execution Engine
Task: Interpret/compile bytecode instructions to local machine instructions on the corresponding platformCopy the code
4 Local method interface
Native modification method, implemented by C/C++Copy the code
JAVA object memory layout and creation process
1 Memory Layout
An object header contains two parts: 1 Runtime data Hash value GC generational age lock status flag thread held lock bias thread ID 2 Type pointer: pointing to class metadata 2 Instance data includes fields inherited from the parent class and its ownCopy the code
2 Creation Process:
1 Load the class metadata and check whether the corresponding class is loaded, linked, or initialized. 2 Allocate memory for an object. If the memory is continuously allocated in pointer collision mode, if the memory is not continuously allocated, record the free memory in the free list Default initialization of properties (zero value initialization) 5 Sets object header information 6 Display initialization of properties/code block initialization /Copy the code
Four JVM parameter tuning
1 Parameter Classification
1 Standard parameters such as Java -version 2 XX parameter Boolean Type formula: -xx: + or - An attribute. + indicates enabled, and - indicates disabled. For example, -xx :+PrintGCDetails KV type formula: -xx :key=value For example, -xx :MetaspaceSize=21mCopy the code
2 Common Parameters
-xms: initial memory size, 1/64 of the default physical memory is equivalent to -xx :InititalHeapSize -xmx: maximum allocated memory, 1/4 of the default physical memory is equivalent to -xx :MaxHeapSize -xss: -xx :MetaspaceSize: set the size of the meta-space, the local memory used by the meta-space. -xx :+PrintGCDetails: prints the collection information: -xx :SurvivorRatio: sets the ratio of the new generation Eden to the S0 / S1 space. The default value is 8:1:1. -xx :NewRatio: Sets the ratio between the young generation and the old generation. The default value is 2, that is, new generation: old age = 1:2-xx :MaxTenuringThreshold: Set the maximum age for entering the old age. The default value is 15, that is, the object needs to pass 15 GC to enter the old age.Copy the code
-jar jar/war package name
5. Recycling
1 How do I check whether an object is reclaimed?
Reference counting algorithm: adds a reference counter to an object, incrementing it by 1 each time a reference is made to it; When a reference is invalid, the counter value is reduced by 1; The reference counting algorithm is not used in the Java language because it is difficult to solve the problem of circular references between objects. When A and B are not referenced by other objects, but A and B refer to each other, the counter cannot be 0, so the two objects cannot be reclaimed. Root search algorithm: through a series of objects named "GC Roots" as the starting point, search down from these nodes, the search path is called the Reference Chain. When an object is not connected to GC Roots by any Reference Chain, it is proved that the object is not available. What can be used as GC Roots objects? 1. Objects referenced in the stack. 2. The object referenced by the class static attribute in the method area. The object referenced by the constant in the method area. For example, objects decorated with static final. Objects referenced in the local method stack. 5. Sync lock the object held by syncCopy the code
2 Reference Relationship
Strong references: Soft references are never reclaimed by the JVM as long as strong references exist: Weak references are reclaimed by the JVM only when memory is low: The reference object is reclaimed by the JVM whenever GC exists. Must be used in conjunction with the reference queue, not alone. The referenced object is placed in the ReferenceQueue ReferenceQueue after GC. You can do some business after the object is destroyed. The sole purpose: to receive a system notification when the object is reclaimed by the collectorCopy the code
3 the GC algorithm
1 Replication algorithm: the memory space is divided into two parts and only one part is used each time. The surviving objects in the used memory are copied to the unused memory block. After that, all objects in the used memory block are cleared and the roles of the two memory blocks are swapped. Advantages: Efficient operation can ensure the continuity of space, and there will be no memory fragmentation disadvantages: need twice the memory space Scenario: used in the new generation. The fewer live objects copied, the better the performance 2 Mark/clear mark phase: traverses from the reference root node (GC Roots) and marks all referenced objects, generally marked as reachable objects in the object header. Cleanup phase: Linear traversal of all objects in the heap. If an object is not marked as reachable in its Header, it is recycled. Disadvantages: memory fragmentation, inefficient maintenance of a free list, need to stop the entire application 3 mark/compress mark phase for GC: start with GC Roots, mark all referenced objects, generally mark reachable objects in the object header. Compression phase: Compress all living objects to one end of memory, discharge them in order, and then clean up the space outside the boundary. Conclusion: The efficiency of memory consolidation after the execution of the mark/clear algorithm is lower than that of suspending the user thread during the move of the replication algorithm. STW scenario: the old 4-partition algorithm divides the whole heap space into successive different cells, and each cell is used and reclaimed independentlyCopy the code
JVM uses generational collection algorithm reason: Because the life cycle of different objects is different
New generation: The GC replication algorithm is used, because new generation is generally new and objects have low survival rate.
Old age: standard clear or standard whole. Because in the old days because of the high survival rate.
4 Possible errors during GC:
There are two major categories:
1, StackOverflowError: OutOfMemoryError:GC overhead Limit exceeded the number of times the server resources are used for GC, and the GC effect is very poor. OutOfMemoryError: The Direct buffer memory allocated out of the heap is insufficient, that is, exceeds the value of -xx :MaxDirectMemorySize. This exception is typically reported in NIO programs. OutOfMemoryError: Unable to create new native thread The cause is that too many threads are created in an application process. You can set the maximum number of threads Linux allows a single process to run. 6, OutOfMemoryError: Metaspace: solution: Java virtual machine heap memory setting is not enough, can adjust by parameter - Xms, -xmx. Memory leak: The object is no longer used by the program, but the GC cannot reclaim it. It is possible that objects cannot be reclaimed in singleton mode, database connection, or network connection.Copy the code
STW: (stop-the-world) is when The garbage collection algorithm is executed and all other threads of The application are suspended (except for The garbage collection helper). A global pause in Java, in which all Java code stops and native code can execute but cannot interact with the JVM; Most of these phenomena are caused by GC. This event is present in all GC’s. The JVM is automatically initiated and completed in the background.
The difference between Minor and Full GC
1. Minor garbage collection, also known as Cenozoic garbage collection. Because Java objects are mostly ephemeral, Minor GC is frequent and generally fast; Full GC: Also known as Major GC or old GC, refers to GC occurring in an old era; Major GC is typically 10 times slower than Minor GC.Copy the code
5 Garbage collector
Classification:
1 Serial: a Serial garbage collector that has only one thread for garbage collection and suspends the user thread until it completes. Enable the command: -xx :+UseSerialGC Application scenario: This command is used in Client mode. The Server insane collector is used with the Parallel Scavenge collector. The Server Insane collector is used with the Parallel Old collector. 2 as the backup plan of CMS collector, specific classification is used when Concurrent Mode Failure occurs in Concurrent collection: Serial GC: occurs in the new generation and adopts replication algorithm. Serial Old GC: For older generations, using the tag/compression algorithmCopy the code
2 Parallel: -xx :+UseParrallelGC: 1 ParNew: used for the new generation, the old generation can use CMS to start the command: "-xx :+UseConcMarkSweepGC" : When CMS is specified, ParNew is used as the Cenozoic collector by default. "-xx :+UseParNewGC" : specifies ParNew forcibly. Features: With the exception of multi-threading, the behavior and characteristics of the Serial collector are the same; Insane. The Parallel Scavenge collector has some characteristics similar to the ParNew collector. Copy algorithm is adopted; Multithreaded collection; Application scenario: High throughput is the goal, that is, reduce garbage collection time, so that user code can get longer running time; When the application runs on multiple cpus and does not have a particularly high pause time requirement, that is, the program mainly performs calculations in the background without much interaction with the user; For example, applications that perform batch processing, order processing, payroll, scientific calculations;Copy the code
3 CMS: concurrent garbage collector Purpose: Reduce STW time enable command: -xx :+UseConMarkSweepGC Features: for the old era based on the "mark-clean" algorithm (no compression operation, memory fragmentation); To obtain the shortest recovery pause time as the goal; Concurrent collection, low pause; Application scenario: A scenario in which there is a lot of user interaction. Expect the system to stop the shortest time, pay attention to service response speed; To give users a better experience; Recovery steps: 1 the initial mark (CMS-initial-mark) will cause STW; The initial tagging phase marks GC ROOT objects and objects associated with GC ROOT objects in the old generation. 2 Concurrent mark (CMs-concurrent-mark) traverses from the directly associated object of GC Roots. This process takes longer, but does not require suspending the user thread 3 re-mark (CMs-mark) In order to correct the concurrent marking period, the marking record of the portion of objects that caused the marking to change because the user thread continued to operate also causes THE STW 4 concurrent sweep of objects that are judged to be garbage during the marking phase and can be performed concurrently with the user threadCopy the code
Why not use a tag compression algorithm? Because when concurrent cleanup occurs, the user thread is still running while the memory is being defiled, the object's memory address cannot be changed. Advantages: Low latency of Concurrent collection Disadvantages: 1) Memory fragmentation is generated. 2) Floating Garbage cannot be processed and "Concurrent Mode Failure" may occur. Floating Garbage: Garbage generated by user threads during Concurrent collection is called Floating Garbage. This makes it necessary to reserve a certain amount of memory space for concurrent cleanup, unlike other collectors, which can almost fill up the old years and then collect; A "Concurrent Mode Failure" occurs if the CMS does not have enough memory reserved for the program. 3 is very sensitive to CPU resources. The default number of collection threads in CMS is =(number of cpus +3)/4. If the number of cpus is less than four, the system is adversely affected. G1: garbage-first collector: a collector that is commercially available in JDk7-U4; Macroscopically, it does not distinguish between young and old generations and divides memory into 2048 independent sub-regions. Make a small distinction between the young and the old. The biggest benefit is that full memory scans are avoided. Design targets replace CMS collectors. Features: 1 Parallelism and concurrency can make full use of the hardware advantages of multi-CPU and multi-core environment; Can be parallel to shorten The "Stop The World" pause time; You can also have garbage collection run concurrently with the user program; 2 generation collection, collection scope includes new generation and old generation can independently manage the entire GC heap (new generation and old generation), without the need to match with other collectors; Being able to deal with objects of different eras in different ways; 3. Combined with a variety of garbage collection algorithms, spatial integration, no debris generated from the overall view, is based on mark-collation algorithm; Locally (between two regions), it is based on the replication algorithm. 4 predictable pauses: High throughput with low pauses G1 not only pursues low pauses, but also builds predictable pause time models; You can explicitly specify that within a time slice of M milliseconds, garbage collection takes no more than N milliseconds. Algorithm used: Replication algorithm between regions Command: -xx :+UseG1GC: enable "-xx :MaxGCPauseMillis" : set the pause time target for G1. The default value is 200 ms. Advantages: The user can specify the pause time: use -xx :MaxGCPauseMills maximum GC pause time. The JVM does what it can. Parallel and concurrent applications do not generate memory fragments. Scenarios are as follows: Server oriented applications apply to machines with large memory and multiple processors. The primary application is to provide a solution for applications that require low GC latency and have a large heap; On a large memory, multi-processor machine, G1 may be better than CMS for: more than 50% of the Java heap is occupied by active data; The frequency of object assignment or chronological lifting varies greatly; GC pauses are too long (longer than 0.5 to 1 second). Do you have to use G1? Not necessarily: If there are no problems with the current collector, don't rush to G1; If your application is looking for low pauses, try G1; Whether or not to replace the CMS requires actual scenario testing.Copy the code
Description of throughput and collector concerns
Throughput
The ratio of CPU time spent running user code to total CPU consumption;
Throughput = run user code time/(run user code time + garbage collection time);
High throughput means less garbage collection time and longer runtime for user code;
Desired goals (concerns) of the garbage collector
1 Pause Time
Shorter pauses are suitable for applications that need to interact with the user;
Good response speed can improve user experience;
2 the throughput
High throughput can efficiently use THE CPU time to complete the computation task as soon as possible;
Mainly suitable for tasks that do not require much interaction in the background;
3. Footprint
Minimize the memory space of the heap while achieving the first two goals;
Better spatial locality can be obtained;
Vi Performance Analysis
Common commands:
Jps: Lists all Java applications in the s-system. Jps -l: Outputs the full path of the main function. 2 Jinfo: View or modify VM parameters. Syntax: Jinfo <option> < PID > For example, jinfo-flag PrintGCDetails pid Displays whether GC log printing is enabled. You can also change the parameters dynamically: jinfo-flag +PrintGCDetails PID 3 Jmap: memory mapping tool it can generate heap dump files of Java programs, and also view the statistics of object instances in the heap. Syntax: jmap-dump :format=b,file= Dump file address <javaPid> After dump files are generated, you can use tools such as Jhat, Visual VM, and MAT to analyze them. Jmap-histo pid > s.testt Collects object statistics of Java programs: Outputs object statistics of pid Java programs to s.testt. 4 Jstat: Used to view Java application runtime information and heap details. Example: Output GC-related heap information: jstat -GC or -gcutil PID 5 Jstack: Used to export the thread stack of Java applications. Syntax: Jstack [-l] <pid> can help developers find deadlock problems. 6 Jvisualvm: Java Visual VM visualization tool 7 Jconsole: monitoring heap information, class loading, thread monitoring, can also detect deadlocks. 8 Jcmd: command line tool added after JDK1.7. It can export the heap, view Java processes, export thread information, perform GC, and so on. Jcmd <pid> Thread.print: prints Thread stack information Jcmd <pid> gc. heap_dump Dump file address: exports heap information for Mat analysis Jcmd <pid> help: Check which commands are supportedCopy the code
To be continued…