directory
- 1. The description of the JVM
- 2. Class loading subsystem
- 3. Run time data area
- 4. Vm stack
- 5. The heap area
- 6. Method of area
- 7. Object instantiation and memory access
- Overview of garbage collection
- 9. Classic garbage collector
- 10. The G1 and ZGC
The JVM memory layout defines the JAVA memory allocation, allocation, and management strategies during the running process, ensuring efficient and stable running of the JVM.
-
The gray ones are private to individual threads: separate program counters, local method stacks, and virtual machine stacks
-
The red color is shared by multiple threads: interthread sharing, method area, heap area
In general, JVM optimizations are 95% heap optimizations, 5% method optimizations, and less stack optimizations
1.PC register (program counter)
The PC register is used to store the address pointing to the next instruction, and the instruction code to be executed. The execution engine reads the next instruction address according to the program counter, and then executes according to the instruction
In an active thread, only the stack frame at the top of the stack is valid, called the current stack frame, and the method associated with this stack frame is called the current method.
1.1 Function of program counter
-
1. It’s a small chunk of memory, almost negligible. It is also the fastest storage area
-
2. In the JVM specification, each thread has its own program counter, which is thread-private and whose lifetime is consistent with that of the thread
-
3. There is only one method executing on a thread at any one time, which is called the current method. The program counter stores the JVM instruction address of the Java method being executed by the current thread; Or, if a native method is executed, it does not specify a value (undefined) because the program counter is not responsible for the local method stack.
-
4. It is an indicator of program control flow. Branches, loops, jumps, exception handling, thread recovery, and other basic functions rely on this counter
-
5. The bytecode interpreter works by changing the value of this counter to select bytecode instructions to surprise
-
6. It is the only area where the Java Virtual Machine specification does not specify any OOM (Out Of Memery) situations and there is no garbage collection
Decompile bytecode files using Javap -v xxx.class to view instructions and other information
According to the contents of the PC register, the execution engine reads the corresponding instructions in the method area, and then reads the data in the local variable, or modifies the operation stack under the thread
1.2 Program counters often ask questions
- 1. What is the use of using a PC register to store byte code instruction addresses?
- (1) Multithreading is macroscopically parallel (multiple events occurring simultaneously at the same time), but in fact it is executed concurrently and alternately
- (2) Because the CPU needs to constantly switch between threads, it needs to know where to start to continue execution after switching back
- (3) The JVM bytecode interpreter needs to change the value of the PC register to determine what bytecode instruction should be executed next
So, at any given moment in the concurrent execution of multiple threads, one processor, or one core in a multi-core processor, will execute only one instruction in a particular thread. This will inevitably lead to frequent interruptions or recovery, how to ensure that there is no difference? After each thread is created, it will generate its own program counter and stack frame. Program counter does not affect each thread.
- 2. Why is the PC register set to thread private?
- (1) We all know the so-called multi-threading in a specific period of time will only execute one of the methods of a thread, THE CPU will keep dropping to do task switching, so it is bound to lead to frequent interruption or recovery, how to ensure that there is no difference?
- (2) In order to accurately record the address of the current bytecode instructions being executed by each thread, it is naturally best to assign each thread a PC register, so that each thread can perform calculations independently without interfering with each other.
2. Java VM stack
Java instructions are designed on a stack basis, and different CPUS have different architectures, so they cannot be register-based
2.1 Stacks and heaps in memory
The stack is the unit of runtime and the heap is the unit of storage
-
1. The stack solves the problem of how programs run, that is, how programs execute, or how they process data. The heap solves the problem of data storage, how and where data is stored.
-
2. Generally speaking, objects are mostly stored in the heap space, which is a relatively large part of the runtime data area
-
3. Stack space holds local variables of the base data type and references to objects that reference the data type
2.2 Internal structure of the VM Stack
What a virtual stack does: When each thread is created, a virtual stack is created. The stack consists of stack frames, each representing a method. Saves local variables and partial results of methods, and participates in method calls and returns.
The stack frame content
- 1. Local Variables
- 2. Operand Stack (or expression Stack)
- 3. Dynamic Linking (or method reference for executing “runtime constant pooling”)—-
- 4. Method Return address (or definition of method normal exit or abnormal exit)
- 5. Some additional information
3. Local variation scale
The main function is to store the variables used in the operation process, and the execution engine adds or extracts data to it according to the instructions
- 1. It is defined as a numeric array, which is mainly used to store method parameters and local variables defined in the method body. These data types include various basic data types, object references, and returnAddressleixing
- 2. As the local variable table is built on the stack of the thread, it is the thread’s private data, so there is no data security problem
- 3. The capacity required by the local variables table is determined at compile time and stored in the maximum Local Variables data item of the Code attribute of the method. The size of the local variable scale does not change during method execution
- 4. Variables in the local variable table are only valid in the current method call. During method execution, the virtual machine passes the parameter values to the parameter variable list using the local variable table. When the method call ends, the local variable table is destroyed along with the method stack frame.
3.1 View the local variation scale
Use javap command to parse bytecode file and check the local variation table of stack frame corresponding to main() method, as shown in the figure:
3.2 Slot Slot
- 1. Locally, a variable table is equivalent to an array. The basic storage unit is Slot.
- 2. Local variable table stores various basic data types (8 kinds), reference types and variables of returnAddress type known at compilation time.
- 3. In the local variable table, types up to 32 bits occupy only one slot (including the returnAddress type), and 64-bit types (long and double) occupy two slots.
- Byte, short, char, and float are converted to int before storage. Boolean is also converted to int, with 0 indicating false and non-0 indicating true.
- Long and double occupy both slots.
3.2.1Slot Features
-
1. When an instance method is called, its method parameters and local variables defined inside the method body are copied to each slot in the local variable table in the declared order
-
2. To access the value of a 64bit local variable in the local variable table, only use the previous index. (e.g., accessing a long or double variable)
-
3. If the current frame is created by a constructor or instance method, the reference to this will be placed in slot with index 0, and the rest of the arguments will be sorted in the argument list. (Non-static method local variable table index=0 store this pointer)
-
4. Static methods cannot refer to this, because this does not exist in the local variable table of the stack frame corresponding to the static method
3.2.2Slot Reuse
The slots in the local variable table in the stack frame can be reused. If a local variable goes out of its scope, the new local variable declared after its scope is likely to reuse the slots of the expired local variable, so as to achieve the purpose of saving resources.
private void test2(a) {
int a = 0;
{
int b = 0;
b = a+1;
}
// The slot position occupied by the destroyed variable B before variable C was used
int c = a+1;
}
Copy the code
In this case, three slots are used. This occupies slot 0, A occupies slot 1, and C duplicates slot B
4. Classification of variables
By position declared in the class:
- Member variables:Default initialization assignment is used before it is used
- During the preparation phase, class variables are assigned a default value of 0. During the initialization phase, explicit assignments of class variables are made to static code blocks.
- Non-static variables: Instance variables: As the object is created, instance variable space is allocated in the heap space, with default assignments.
- Local variables: must be explicitly assigned before they can be used! Otherwise, the compilation fails
5. Operand stack
Is a first-in, last-out stack structure, as long as the purpose of the program is to store the required values or temporary results
During the execution of a method, data is written to or extracted from the stack according to the bytecode instructions, i.e., push or pop. Some bytecode instructions push values onto the operand stack, while others push operands off the stack and use them to push the results onto the stack. (such as bytecode instructions bipush operations) for example: perform copy, swap, sum, etc
5.1 Code Examples
0: pushes the variable 15 onto the operand stack
1: Assigns variables from the operand stack to local variables
2: pushes variable 8 onto the operand stack
3: Assigns variables from the operand stack to local variables
4. Remove 15 from the local variable and add it to the operand stack
5. Remove 8 from the local variable and add it to the operand stack
The iADD operation adds the variables in the operand stack and stores them in the operand stack
Returns the result in the operand stack to the local variable table
5.2 Characteristics of operand stack
-
Operand stack, mainly used to store the intermediate results of the calculation process, and as a temporary storage space for variables during the calculation process.
-
The operand stack is a workspace of the JVM’s execution engine. When a method starts executing, a new stack frame is created. The operand stack of this method is empty
-
Each stack of operands has an explicit stack depth for storing values. The maximum depth required is defined at compile time and stored in the method’s code property as the value of max_stack.
-
Any element in a stack can be an arbitrary Java data type. 32-bit types occupy one stack unit of depth, while 64-bit types occupy two stack units of depth
-
Instead of accessing an index, the operand stack can only access data once through the standard push and pop operations
-
If the called method has a return value, the return value is pushed into the operand stack of the current stack frame and updates the NEXT bytecode instruction to be executed in the PC register.
-
The data types of the elements in the operand stack must exactly match the sequence of bytecode instructions, which is verified by the compiler at compile time and again during the data flow analysis phase of the class validation phase during class loading.
6. Dynamic linking
1. Runtime constant pools are in the methods area. (note: JVMS in JDK1.7 and later have removed runtime constant pools from the methods area, creating an area in the Java Heap for runtime constant pools.)
2. Each stack frame contains an internal reference to the runtime Constant pool or the method to which the stack frame belongs. The purpose of including this reference is to enable dynamic linking in code that supports the current method. Such as invokedynamic instructions
3. When a Java source file is compiled into a bytecode file, all variable and method references are saved as symbolic Refenrence in the constant pool of the Class bytecode file (viewed by JavAP decompression). For example, describing a method that calls another method is represented by symbolic references to the method in the constant pool. The purpose of dynamic linking is to convert these symbolic references (#) into direct references to the calling method.
6.1 Method Invocation
- 1. Static links
When a bytecode file is loaded into the JVM, if the target method being called is known at compile time and the runtime remains the same. The process of converting a symbolic reference to a calling method in this case into a direct reference is called static chaining.
- 2. Dynamic links
If the invoked method cannot be determined at compile time, that is to say, the symbolic reference of the invoked method can only be converted into a direct reference during the program run time. Because this reference conversion process is dynamic, it is also called dynamic linking.
The Binding mechanism of the corresponding method is: Early Binding and Late Bingding. Binding is the process by which a symbolic reference to a field, method, or class is replaced with a direct reference, which happens only once.
- 1. Early binding
Early binding is invoked if the target method at compile time, and the run-time remains the same, this method can be bound with subordinate type, as a result, due to clearly define the target method is called which one on earth is, therefore, you can use the static link way of converting symbols refer to reference directly.
- 2. Late binding
If the method to be called cannot be determined at compile time, only the related method can be bound according to the actual type at run time, which is called late binding.
6.2 Method invocation instructions
Ordinary call instructions:
- 1.
invokestatic
: Call a static method, and determine the unique method version in the parsing stage; - 2.
invokespecial
: Call methods, private and parent methods, and determine the unique method version in the parsing stage; - 3.
invokevirtual
Call all virtual methods; - 4.
invokeinterface
: calls interface methods;
Dynamic call instruction (new in Java7) :
- 5.
invokedynamic
: Dynamically resolves the method to be called and executes it
The first four instructions are fixed inside the virtual machine, and the method invocation is performed without human intervention, whereas the InvokeDynamic instruction allows the user to determine the method version.
The invokestatic and Invokespecial commands call non-virtual methods
Invokevirtual (unless final, the JVM will classify final method calls as Invokevirtual instructions, but note that final method calls are not virtual methods) and invokeInterface instructions are called virtual methods.
/** * Parse calls to non-virtual methods, virtual method tests */
class Father {
public Father(a){
System.out.println("Father default constructor");
}
public static void showStatic(String s){
System.out.println("Father show static"+s);
}
public final void showFinal(a){
System.out.println("Father show final");
}
public void showCommon(a){
System.out.println("Father show common"); }}public class Son extends Father{
public Son(a){
super(a); }public Son(int age){
this(a); }public static void main(String[] args) {
Son son = new Son();
son.show();
}
// Not overridden superclass methods, because static methods cannot be overridden
public static void showStatic(String s){
System.out.println("Son show static"+s);
}
private void showPrivate(String s){
System.out.println("Son show private"+s);
}
public void show(a){
//invokestatic
showStatic(Big Head boy);
//invokestatic
super.showStatic(Big Head boy);
//invokespecial
showPrivate(" hello!");
//invokespecial
super.showCommon();
// Invokevirtual considers this method non-virtual because it declares that final cannot be overridden by subclasses
showFinal();
// The virtual method is as follows
//invokevirtual
showCommon();// Not explicitly adding super is considered virtual because subclasses may override showCommon
info();
MethodInterface in = null;
// InvokeInterface is not sure which interface implementation class needs to be overridden
in.methodA();
}
public void info(a){}}interface MethodInterface {
void methodA(a);
}
Copy the code
6.3 The nature of method rewriting
When each method is called, the reference address of the method needs to be looked up in the constant pool. If there is no corresponding method, the reference address of the method is looked up in the parent class method. If there is no corresponding method, an error is reported.
Dynamic dispatch is frequently used in object-oriented programming, and it can affect execution efficiency if you have to re-search through the method metadata for the appropriate target during each dynamic dispatch. Therefore, to improve performance, the JVM implements this by creating a virtual Method table in the method section of the class (non-virtual methods do not appear in the table). Use index tables instead of lookups.
- Each class has a virtual method table that holds the actual entry to each method.
- So when is the virtual method table created? The virtual method table is created and initialized during the linking phase of the class load, and the JVM initializes the virtual method table for that class after the class’s variable initializers are ready.
Example: We define three classes and a Friendly interface
/ / friendly interface
interface Friendly{void sayHello(a);void sayGoodbye(a);
}
/ / Dog class
class Dog{public void sayHello(a){}public String toString(a){return "Dog"; }}/ / cocker spaniel
class CockerSpaniel extends Dog implements Friendly{public void sayHello(a){superThe sayHello (); }public void sayGoodbye(a){}}Copy the code
The virtual method table of the Dog class
Cocker spaniel virtual method table: Cocker spaniel virtual method table: Cocker spaniel virtual method table: Cocker spaniel virtual method table: Cocker spaniel virtual method table: Cocker spaniel virtual method table: Cocker spaniel virtual method table It’s an efficiency boost
7. Method returns the address
The value of the PC register that holds the method that called the method
-
1. The return address is the value of the PC counter of the caller (the caller of a method may also be a method), that is, the address of the next instruction that calls the method. In the case of an exception exit, the return address is determined by the exception table, which is generally not stored in the stack frame.
-
2. In the process of method execution had abnormal (Exception), and the Exception not processed within the method, which is as long as there is no search in this method the Exception table to match the Exception handler, will cause the method exits, referred to as “abnormal complete export methods when an Exception is thrown during the execution of Exception handling, handle exceptions are stored in a table, Easy to find the code to handle exceptions when they occur.