Maybe many friends will feel a little difficult to learn NIO, and many concepts in it are not so clear. Before we dive into Java NIO programming, let’s talk about some of the basics today: the I/O model. The following article starts with the concepts of synchronization and asynchrony, then explains the difference between blocking and non-blocking, then introduces the difference between blocking and non-blocking IO, and then introduces the difference between synchronous and asynchronous IO, and then introduces the five IO models. Finally, two design patterns (Reactor and Proactor) related to high performance IO design are introduced.

The following is an outline of the table of contents:

What is synchronization? What is asynchrony?

Two. What is blocking? What is non-blocking?

What is blocking IO? What is non-blocking IO?

What is synchronous IO? What is asynchronous IO?

Five. Five IO models

Two high-performance IO design modes

If there is anything wrong, please understand and welcome criticism.

Please respect the author’s labor achievements, please mark the link of the original text:

www.cnblogs.com/dolphin0520…

What is synchronization? What is asynchrony?

The concept of synchronization and asynchrony has been around for a long time, and there are many expressions about it on the Internet. Here is my personal understanding:

Synchronization is: if there are multiple tasks or events to occur, these tasks or events must be carried out one by one. The execution of one event or task will cause the whole process to wait temporarily, and these events cannot be executed concurrently.

Asynchrony means that if multiple tasks or events occur, these events can be executed concurrently, without the execution of one event or task causing the entire process to wait.

These are synchronous and asynchronous. For example, if A task includes two sub-tasks A and B, for synchronization, when A is in the execution process, B can only wait until A is finished. For asynchrony, A and B can be executed concurrently. B does not have to wait for the completion of A to execute the task. In this way, the execution of A will not lead to the temporary wait of the entire task.

If you still don’t understand, you can look at the following two pieces of code:

void fun1(a) {}void fun2(a) {}void function(a){ fun1(); fun2() ..... . }Copy the code

This code is a typical synchronization. In function, fun1 cannot execute while it is executing, and fun2 must wait for fun1 to complete.

void fun1(a) {}void fun2(a) {}void function(a){

    new Thread(){

        public void run(a) {

            fun1();

        }

    }.start();

     

    new Thread(){

        public void run(a) { fun2(); } }.start(); . . }Copy the code

In fact, synchronization and asynchrony are very broad concepts that focus on whether the occurrence or execution of an event will result in a temporary wait for the entire process when multiple tasks and events occur. I think an analogy can be drawn between synchronization and asynchrony and the Synchronized keyword in Java. When multiple threads access a variable at the same time, each thread accessing the variable is an event. For synchronization, each thread must access the variable one by one. While one thread accesses the variable, other threads must wait. For asynchrony, multiple threads do not have to access the variable individually, but can access it simultaneously. This code is typically asynchronous in that the execution of fun1 does not affect the execution of fun2, and the execution of fun1 and fun2 does not result in a temporary wait for their subsequent execution.

Therefore, I personally feel that synchronous and asynchronous can be expressed in many ways, but keep in mind that when multiple tasks and events occur, the occurrence or execution of one event will result in a temporary wait for the entire process. Generally speaking, the way to achieve asynchronous through multi-threading, but do not remember the multi-threading and asynchronous painting equal sign, asynchronous is only a macro mode, using multi-threading to achieve asynchronous is just a means, and through the way of multi-process can also achieve asynchronous.

Two. What is blocking? What is non-blocking?

Having introduced the difference between synchronous and asynchronous, this section looks at the difference between blocking and non-blocking.

Blocking is: when an event or task is in the process of execution, it sends a request operation, but because the condition required by the request operation is not met, it will wait until the condition is met.

Non-blocking: when an event or task is in the process of executing, it sends a request action, and if the condition required for the request action is not met, it immediately returns a flag indicating that the condition is not met and will not wait forever.

This is the difference between blocking and non-blocking. That is, the difference between blocking and non-blocking is whether an operation is requested and if the condition is not met, whether it waits or returns a flag message.

Here’s a simple example:

So if I want to read something from a file, if there’s nothing in the file to read, for synchronization, I’m going to wait there until there’s something in the file to read; For non-blocking, a flag message is returned telling the file that there is nothing to read.

Some friends on the Internet equate synchronous and asynchronous with blocking and non-blocking respectively, but in fact, they are two completely different sets of concepts. Note that it is important to understand the difference between these two sets of concepts for subsequent understanding of the IO model.

Synchronization and asynchrony focus on whether the execution of a task will lead to a temporary wait of the whole process in the execution of multiple tasks.

Blocking and non-blocking focus on whether a request operation will return a flag indicating that the condition is not met if the condition is not met.

Understanding blocking and non-blocking can be understood in the same way as thread blocking. When a thread performs a request operation, it blocks if the condition is not met, that is, it waits for the condition to be met.

What is blocking IO? What is non-blocking IO?

Before we look at blocking and non-blocking I/OS, let’s look at the next specific I/O operation.

Generally speaking, I/O operations include read and write operations to hard disks, sockets, and peripherals.

When a user thread initiates an IO request operation (this article uses the read request operation as an example), the kernel checks to see if the data to be read is ready. In the case of blocking IO, if the data is not ready, it will wait until the data is ready. For non-blocking IO, if the data is not ready, a flag message is returned telling the user that the data the thread is currently trying to read is not ready. When the data is ready, the data is copied to the user’s thread and a complete IO read operation is completed. In other words, a complete IO read operation consists of two phases:

1) Check whether the data is ready;

2) Copy data (the kernel copies data to the user thread).

So the difference between blocking IO and non-blocking IO is in the first phase, if the data is not ready, whether to wait while checking whether the data is ready or just return a flag message.

In Java, traditional IO is blocking IO, such as reading data through a socket. If the read() method is not ready, the current thread will block in the read method until it returns data. In non-blocking IO, when data is not ready, the read() method should return a flag telling the current thread that the data is not ready, rather than waiting.

What is synchronous IO? What is asynchronous IO?

Synchronous IO and asynchronous IO are defined as follows in Unix Network Programming:

A synchronous I/O operation causes the requesting process to be blocked until that I/O operation completes. An asynchronous I/O operation does not cause the requesting process to be blocked.

Synchronous I/O: If a thread requests an I/O operation, the thread will block until the IO operation is complete.

Asynchronous I/O means that if a thread requests AN I/O operation, the I/O operation will not cause the requesting thread to block.

In fact, the synchronous and asynchronous IO models are for user thread/kernel interactions:

For synchronous I/O: After the user sends AN I/O request, if the data is not ready, the user thread or the kernel constantly polls whether the data is ready. When the data is ready, the data is copied from the kernel to the user thread.

Asynchronous I/O: Only the sending of THE I/O request is done by the user thread. Both phases of the I/O operation are automatically completed by the kernel, which then sends a notification to the user thread that the I/O operation has completed. That is, in asynchronous IO, there is no blocking on the user thread.

This is the key difference between synchronous I/O and asynchronous I/O. The key difference between synchronous I/O and asynchronous I/O is whether the data copy is done by the user thread or the kernel. Therefore, asynchronous IO must be supported by the underlying operating system.

Note that synchronous AND asynchronous I/OS are different from blocking and non-blocking I/OS.

Blocking and non-blocking IO are reflected in whether the user thread is waiting for the data to be ready or receives a flag message when the user requests an I/O operation if the data is not ready. That is, blocking and non-blocking IO are reflected in how data is processed during the first phase of AN IO operation to see if it is ready.

Five. Five IO models

There are five IO models mentioned in Unix Network Programming: blocking IO, non-blocking IO, multiplexing IO, signal-driven IO, and asynchronous IO.

Here are the similarities and differences between these five IO models.

1. Block the IO model

In the most traditional IO model, data is blocked during reading and writing.

When the user thread makes an I/O request, the kernel checks to see if the data is ready. If not, the kernel waits for the data to be ready, and the user thread blocks and hands over the CPU. When the data is ready, the kernel copies the data to the user thread and returns the result to the user thread before the user thread unblocks.

Typical examples of blocking IO models are:

data = socket.read();
Copy the code

2. The non-blocking IO model blocks the read method until the data is ready.

When the user thread initiates a read operation, it does not wait, but gets a result immediately. If the result is an error, it knows that the data is not ready, and it can send the read operation again. Once the data in the kernel is ready and the user thread requests it again, it copies the data to the user thread and returns.

So, in fact, in the non-blocking IO model, the user thread needs to constantly ask if the kernel data is ready, which means that non-blocking IO does not surrender CPU, but always occupies CPU.

Typical non-blocking IO models are as follows:

while(true){

    data = socket.read();

    if(data! = error){process databreak; }}Copy the code

There is a serious problem with non-blocking IO, however. In a while loop, you need to constantly ask the kernel if the data is ready, which can lead to very high CPU usage. Therefore, the while loop is rarely used to read data.

The multiplexing IO model is widely used at present. Java NIO is essentially multiplexing IO.

In the multiplexing I/O model, there is a thread polling the state of multiple sockets, and the actual I/O operation is called only when the socket actually has read/write events. Because in the multiplexing IO model, only one thread can be used to manage multiple sockets, the system does not need to create new processes or threads, and do not need to maintain these threads and processes, and only when there are socket read and write events, the IO resources will be used, so it greatly reduces resource consumption. (Selector. Select () does not mean that there is no blocking, but that it is reflected in multiple blocks or one block, with a thread polling multiple channels continuously, using the event driver to inform the kernel thread to read and write. The IO model of Java NIO is synchronous non-blocking, where synchronous asynchrony refers to whether the actual IO operation (user-kernel copy of data) requires process participation. By saying that Java NIO provides asynchronous processing, I mean asynchrony in the programming model. Event-driven based on the Reactor pattern, event handler registration and handler execution are asynchronous.

In Java NIO, each channel is queried by selector. Select () to see if there is an arrival event. If there is no event, the block will always be there, so this will cause the user thread to block.

May have friends will say, I can use multithreading + blocking IO to achieve a similar effect, but because in a multithreaded + blocking IO, each socket corresponding to a single thread, such can cause a lot of resource usage, and especially for long connection, the resources of the thread has not released, if there are a lot of connections in succession behind, It creates a performance bottleneck.

Multiplexing IO mode, through a thread can manage multiple sockets, only when the socket really have read and write events will occupy resources to carry out the actual read and write operations. Therefore, multiplexing IO is suitable for a large number of connections.

In addition, multiplexing IO is more efficient than the non-blocking IO model because in non-blocking IO, the constant query of socket state is done through the user thread, whereas in multiplexing IO, polling for each socket state is done by the kernel, which is much more efficient than user threads.

Note, however, that the multiplexing IO model detects the arrival of events in a polling manner, and responds to the arrival of events one by one. Therefore, for the multiplexing IO model, once the event response body is large, the subsequent events will not be processed and the new event polling will be affected.

4. Signal-driven IO model

In signal driven IO model, when a user thread launched an IO request operation, gives the corresponding socket function to register a signal, then the user thread will continue to perform, when the kernel data ready thread sends a signal to the user, the user thread receives the signal, then signal function invokes the IO read and write operations to requests for actual IO operation.

5. Asynchronous I/O model

The asynchronous IO model is the ideal one. In the asynchronous IO model, when a user thread initiates a read operation, it can immediately start doing other things. On the other hand, from the kernel’s point of view, when it receives an asynchronous read, it will immediately return indicating that the read request has been successfully initiated and therefore no blocks will be generated for the user thread. The kernel then waits for the data to be ready, then copies the data to the user thread, and when all is done, the kernel sends a signal to the user thread that the read operation is complete. In other words, the user thread does not need to know how the entire IO operation is actually going on. It only needs to make a request first. When receiving the success signal from the kernel, it indicates that the IO operation has been completed and the data can be used directly.

In the asynchronous IO model, neither phase of the IO operation blocks the user thread. Both phases are automatically completed by the kernel, which then sends a signal to the user that the thread operation has completed. The user thread does not need to call the IO function again for specific reads and writes. This is different from the signal-driven model. In the signal-driven model, when the user thread receives the signal indicating that the data is ready, the user thread needs to call the IO function for the actual read and write operation. However, in the asynchronous I/O model, receiving a signal indicates that the I/O operation has been completed, and there is no need to call the IO function in the user thread for the actual read and write operation.

Note that Asynchronous IO requires low-level support from the operating system, and in Java 7, Asynchronous IO is provided.

The first four IO models are actually synchronous IO, but the last one is truly asynchronous IO, because in both multiplexed IO and signal-driven IO, phase 2 of the IO operation causes user threads to block, that is, when the kernel copies data.

Two high-performance IO design modes

In the traditional network service design mode, there are two classic modes:

One is multithreading, and one is thread pooling.

In multithreaded mode, called client, the server creates a new thread to handle the client’s read and write events, as shown in the following figure:

This mode is easy to handle, but because the server uses a thread for each client connection to process, so that the resource consumption is very large. Therefore, when the number of connections reaches the upper limit, there will be a resource bottleneck, or even a server crash.

Therefore, in order to solve this one thread corresponding to a client mode problems, put forward with the method of the thread pool, is said to create a fixed-size thread pool, to a client, to come up with a free thread from the thread pool to process, when the client after processing the read and write operations, will hand over to the thread take up. This avoids the resource waste of creating threads for each client and makes them reusable.

However, thread pools also have their drawbacks. If the connections are mostly long connections, the threads in the thread pool may be occupied for a period of time. When another user requests a connection, the client connection will fail because there are no free threads available to process the connection, thus affecting the user experience. Therefore, thread pools are suitable for a large number of short-connected applications.

As a result, there are two high-performance IO design patterns: Reactor and Proactor.

In the Reactor model, each client registers the events they are interested in first, and then a thread polls each client to see if there is an event. When there is an event, each event is processed in sequence, and when all events are processed, polling is continued, as shown in the following figure:

It can be seen from the above five IO models that the multiplexing IO adopts the Reactor model. Note that each event is processed sequentially in the figure above, although you can use multiple threads or thread pools to speed up event processing.

In Proactor mode, when an event is detected, a new asynchronous operation will be initiated and then handed over to the kernel thread for processing. When the kernel thread completes the IO operation, it sends a notification to inform that the operation is complete. It can be known that the asynchronous IO model adopts the Proactor mode.

References:

Unix Network Programming

Blog.csdn.net/goldensuny/…

My.oschina.net/XYleung/blo…

xmuzyq.iteye.com/blog/783218

www.cnblogs.com/ccdev/p/354…

alicsd.iteye.com/blog/868702

www.smithfox.com/?e=191

www.cnblogs.com/Anker/p/325…

Blog.csdn.net/hguisu/arti…

www.cnblogs.com/dawen/archi…

  

Author: Haizi

    

Source: www.cnblogs.com/dolphin0520…

    

This blog does not indicate the reprint of the article belongs to the author haizi and blog garden, welcome to reprint, but without the consent of the author must retain this statement, and give the original link in the obvious position of the article page, otherwise reserve the right to pursue legal responsibility.