Before writing this article, I thought this knowledge point involves knowledge point too much and too miscellaneous, looking for a big cow to write a look, but after reading about seven or eight articles, my heart is more entangled — ๐๐๐ really good miscellaneous… Memory points can not find, make the heart of chaos, probably is entangled disease made. So start to sum up, so that they can have a grasp of the memory point, but also let their knowledge of this knowledge point has a superficial cognition. The purpose of this article is to describe the process, if you want to delve into it, you can search for it by yourself. I will put the knowledge points and concepts interspersed in a small box or paste a reference link.
Let’s start with a process overview:
- DNS resolution: Resolves a domain name into an IP address
- TCP connection: TCP three-way handshake
- Sending an HTTP request
- The server processes the request and returns HTTP packets
- The browser parses the rendered page
- End of connection: TCP wave four times
1. DNS resolution
After entering the URL in the browser, the domain name is resolved first. The browser sends the domain name to the DNS server. The DNS server searches for the IP address corresponding to the domain name and returns the IP address to the browser. The browser injects the IP address into the protocol and sends the request parameters to the corresponding server.
1. What is URL? Uniform Resource Locator (URL) is a Uniform Resource Locator (URL) used to locate resources on the Internet. Such as http://www.w3school.com.cn/ht. Grammar rules, comply with the following: ` scheme: / / host. Domain: port/path/filename ` each part of the explanation is as follows: scheme: define the type of Internet service. Common protocols include HTTP, HTTPS, FTP, and File. The most common type is HTTP, and HTTPS is used for encrypted network transmission. Host: defines the domain host (WWW is the default HTTP host) domain: defines the Internet domain name, for example, w3school.com.cn port: defines the port number on the host (the default HTTP port number is 80) path: Define the path on the server (if omitted, the document must be at the root of the web site). Filename: specifies the name of a document or resource. 2. What is DNS? DNS is a mapping table between domain names and IP addresses. Domain names (such as www.google.com) are easier for us to remember, but machines are not good at this expression. Therefore, domain names need to be converted into IP addresses for machine recognition, which is the result of DNS. Root DNS server The root server is a necessary facility for setting up the Internet and manages the main directory of the Internet. There are 13 sets of root DNS servers in the world. Recursive query The client host queries the local DNS server recursively. Recursive query: The domain name queried by the client host cannot be found in the local DNS server. Therefore, the local DNS server sends requests to other root DNS servers as DNS clients instead of letting the client host query all the time. The result of recursive query is either the RETURNED IP address or an error message indicating that the IP address cannot be queried. Iterative query The local DNS server sends query requests to the root server, TOP-LEVEL DOMAIN name server, and host domain name server in an iterative process. For example, when the local DNS server sends a query request to the root server, the root server tells the local DNS server: "I don't have what you are looking for, go to the TOP-LEVEL domain name server to find it", and returns the address of the top-level domain name server to the local domain name server. After receiving the address, the local domain name server continues to send requests to the top-level domain name server. The TOP-LEVEL DNS server either returns the IP address or tells the local DNS server which permission DNS server to send a request to next until the IP address is found or an error message is returned when the IP address cannot be found, and then the information is returned to the client host. The picture below shows the difference between the two queries Recursive process: the host to local DNS server, other DNS servers (such as: I'm looking for an apple to eat, found A, ask A have, A said I help you to find B, B may have, if B has, then gave the apple to A, B A, then the apple to me, this is A recursive) iterative process: Local DNS server โ root server, local DNS server โ top-level DNS server, local DNS server โ Permission DNS server; (for example: I want to look for an apple, find A, A said I don't have either, B may have, you go to B; I went to B, AND B said I didn't have one either. You go to C, and I went to C again, and finally found an apple. This is the iterative process.)Copy the code
2. TCP connection: TCP three-way handshake
Before sending data, the client initiates a TCP three-way handshake to synchronize the serial number and confirmation number of the client and server, and exchange TCP window size information.
Note: Ack: replies Fin: the end; Seq: the first sequence number of a data segment SYN: synchronization. Indicates the start session requestCopy the code
- First handshake: client A sets the flag bit SYN to 1 and randomly generates A packet with the value of SEq =X (the value of X is in the range of =1234567) to the server. Client A enters the SYN_SENT state and waits for server B to confirm (the first handshake is initiated by the browser and tells the server to send the request).
- Second handshake: After receiving the packet, server B knows that client A requests to establish A connection by flag bit SYN=1. Server B sets flag bit SYN and ACK to 1, ACK =X+1, randomly generates A value seq=Y, and sends the packet to client A to confirm the connection request. Server B enters the SYN_RCVD state (second handshake, From the server, tell the browser I’m ready to accept, you send).
- Third handshake: After receiving the confirmation, client A checks whether the ACK is X+1 and ACK is 1. If yes, it sets the flag ACK bit to 1, ACK =Y+1, and sends the packet to server B. Server B checks whether the ACK is Y+1 and ACK is 1. Client A and server B enter the ESTABLISHED state, and A three-way handshake is completed, and data can be transmitted between client A and server B. (The third handshake, sent by the browser, tells the server, “I’m sending soon, get ready to accept.”)
Why do you need three handshakes? According to the fourth edition of Computer Networks, the purpose of the “three-way handshake” is “to prevent the failure of the connection request message segment suddenly sent to the server, resulting in an error.” The example in the book is as follows: “Invalid connection request segment” is generated when the first connection request segment sent by the client is not lost, but is delayed at a network node for a long time. As a result, it does not reach the server until a certain time after the connection is released. Originally, this is an invalid packet segment. However, after the server receives the invalid connection request packet segment, it mistakenly thinks it is a new connection request sent by the client. Then the client sends a confirmation message to agree to establish a connection. Assuming that the “three-way handshake” is not used, a new connection is established as soon as the server sends an acknowledgement. Since the client does not send a connection request, it ignores the server’s confirmation and does not send data to the server. However, the server assumes that the new transport connection has been established and waits for data from the client. As a result, many of the server’s resources are wasted. The three-way handshake prevents this from happening. For example, the client does not issue an acknowledgement to the server’s acknowledgement. When the server receives no acknowledgement, it knows that the client has not requested a connection.” . The main purpose is to prevent the server from wasting resources by waiting.
3. The browser sends an HTTP request to the Web server
After the TCP three-way handshake, the HTTP request message is sent to the server. (I have written a separate thread describing HTTP request messages.
HTTP request packet format: request line + request header + Blank line + message body. The request line contains the request mode (GET, POST, DELETE, or PUT), request resource path (URL), and HTTP version.
4. The server processes the request and returns the HTTP packet
When the server receives the request, it sends a reply, which is the response data. An HTTP response packet is similar to an HTTP request. The format of an HTTP response packet is as follows: status line + response header + blank line + message body. The status line contains the HTTP version number, status code, and status description.
5. The browser parses the rendered page
After the browser gets the response text, it parses the HTML code, requests JS, CSS and other resources, and finally renders the page to the user. Page rendering is generally divided into the following steps:
(1) Parse the DOM Tree according to the HTML file
(2) Parse the CSSOM Tree(CSS rule Tree)
(3) Combine DOM Tree and CSSOM Tree to construct Render Tree
(4) Reflow: Calculate node information according to Render Tree
Repaint: Paint the entire page based on the calculated information.
6. TCP wave four times.
When the data transfer is complete, you need to disconnect the TCP connection and initiate the TCP wave four times
2. The server sends Ack and Seq packets to the client, indicating that it agrees to close the request. The host initiator enters the FIN_WAIT_2 state. (The server tells the browser, “I’ve received my request message. I’m ready to close it. You’re ready, too.”)
3. The server sends a Fin, Ack, or Seq packet to the client to close the connection. And enter the LAST_ACK state. (The server tells the browser that I’m done sending a response message and you’re ready to close it.)
4. The client sends a packet segment, Ack and Seq, to the server. Then enter the wait TIME_WAIT state. The passive party closes the connection after receiving the packet segment from the initiator. If the initiator waits for a certain period of time and does not receive a reply, the system shuts down normally. (The browser tells the server that I have received the response message and I am ready to close, so are you)
To put it simply:
1, A -- >B: A tells B: "I'm finished";
2, B -- >A: B tells A: "Ok, I know you are finished."
3, A -- >A: B tells A: "I'm finished."
4, A -- >B: A tells B: "Ok, I know you have sent and received."
The latter
The above is basically in the address bar from entering URL to page loading display of the whole process, of course, there are a lot of details are not written in, but these knowledge is enough for the front end, interested partners can delve into. Finally, small is the front small white one, the most original intention of writing an article is to let oneself have a more profound impression and understanding of the knowledge point, write the thing is also very small white, if there is wrong, welcome to correct ~ then is the hope to see a friend can point like, can also pay attention to a wave ~ I will continue to output!
Personal Blog links
GitHub
CSDN homepage
Nuggets personal homepage
Jane book personal home page