The cache
1. Cache understanding
- Cache definition:
- The browser stores the data requested by the user on the local disk. When the user needs to change the data again, the user does not need to send the request again and directly obtains the data from the browser
- Benefits of caching:
- Reduce the number of requests
- Save bandwidth and avoid wasting unnecessary network resources
- Reduce server stress
- Improve web page loading speed and improve user experience
2. Cache classification
-
Strong cache
-
Instead of sending a request to the server, the data is fetched directly from the local cache
-
200 OK (from memory cache)
-
Negotiate the cache
-
Send a request to the server, which determines whether the negotiated cache is hit based on the resources in the request header
-
If it hits, a 304 status code is returned to inform the browser to read the resource from the cache
-
Strong cache & negotiated cache common ground
-
Both read resources from the browser side
-
Strong cache vs. negotiated cache
- Strong caching does not send requests to the server
- The negotiation cache sends a request to the server and decides whether to use the cache based on the information returned by the server
3. Schematic diagram of cache usage
4. Header parameters in the cache
1. Default browser cache time
Chrome and Firefox cache time for JS, CSS and other files in memory, is: (access time – the last modification time of the file) ÷ 10 Example: Assuming that index. HTML is modified at 5:0 when accessed at 7:0, then the cache time is 26060 ÷ 10 = 720 seconds and the page cache time is 720 seconds
1. Strong cache header parameters
- Expires.
- This is the http1.0 specification; Its value is an absolute time gmT-formatted time string, as in
Mon, 10 Jun 2015 21:31:12 GMT
If the request is sent before Expires, the local cache is always valid; otherwise, the request is sent to the server to retrieve the resource - Cache-control: Max – age = number
- This is the http1.1 header information, mainly using the field max-age value to determine, it is a relative value; If the request time is earlier than the current request time, the Cache will be hit. If the request time is earlier than the current request time, the Cache will not be hit.
- Cache-control specifies the common values of cache-control:
- No-cache: negotiation cache is required instead of local cache. Verify with the server whether the returned response has been changed. If there is an Etag in the previous response, the amount of the request is verified with the server. If the resource has been changed, the cache is used.
- No-store: directly forbids the browser to cache data. Each time the user requests the resource, a request will be sent to the server, and the complete resource will be downloaded each time.
- Public: can be cached by all users, including end users and intermediate proxy servers such as CDN.
- Private: the device can be cached only by the browser of the terminal user and cannot be cached by a trunk cache server such as the CDN.
- Note: Cache-control takes precedence when cache-control and Expires coexist
From the memory cache with the from disk cache, rounding blog.csdn.net/garrettzxd/…
They work with HTTP caching. Memory cache hits are the fastest, but it has a short cycle. Base64 images, smaller JS and CSS have a greater chance of being written into memory. Other large JS, CSS and images are written directly to the hard disk for caching.
Files stored in disk space are stored by entering Chrome ://version/ in the browser
2. Negotiate header parameters for the cache
Important: The negotiated cache is always used by the server to determine whether the cache resource is available, so the client and server communicate with each other through some kind of identifier, so that the server can determine whether the requested resource is cache accessible
Last-modified: indicates that the server is generated to the client if-modified-since: indicates that the client is generated to the server with a changed name
- Last-modified/if-modified-since: Both values are time strings in GMT format
- The first time a browser requests a resource from the server, the server returns the resource with a last-Modified header added to the respone header, which indicates when the resource was Last Modified on the server
- When the browser requests the resource from the server again, it adds an if-modified-since header to the request’s header, whose value is the last-modified value returned from the previous request
- When the server receives a resource request again, it determines whether the resource has changed according to if-modified-since and the time when the resource was last Modified on the server. If the resource has Not changed, 304 Not Modified is returned, but the resource content is Not returned. If there are changes, the resource content is returned as normal. When the server returns a 304 Not Modified response, the last-Modified header is Not added to the response header, because since the resource has Not changed, the last-Modified header will Not change. This is the response header when the server returns 304
- When the browser receives the response from 304, it loads the resource from the cache
- If the negotiated cache is not hit and the browser loads the resource directly from the server, the last-Modified Header will be updated when it is reloaded, and if-modified-since will enable the last-modified value returned the previous time on the next request
If-none-match: indicates that the client assigns a new name to the server
-
Etag/If-None-Match
- These two values are unique identifying strings for each resource generated by the server and change whenever the resource changes
- The judgment process is similar to last-modified/if-modified-since
-
Last-modified does not produce Etag
- Etag was introduced in HTTP1.1 to solve several last-Modified problems
- Some files may change periodically, but their contents do not change (just change the modification time), at which point we do not want the client to think that the file has been modified and GET again
- Some files are Modified very frequently, such as If they are Modified less than seconds (say N times in 1s), and if-modified-since the granularity that can be checked is s-level, and such changes cannot be determined (or the UNIX record MTIME is only accurate to seconds).
- Some servers do not know exactly when a file was last modified.
-
Summary:
- Caches can be more accurately controlled using eTags because etags are unique identifiers on the server side that are automatically generated by the server or generated by the developer.
- Last-modified and ETag can be used together. The server will verify the ETag first. If the ETag is consistent, the server will continue to compare last-Modified, and finally decide whether to return 304.
5. How does strong caching reload new resources
-
By updating the resource path referenced in the page, let the browser voluntarily abandon the load cache to load the new resource
-
Example: https://www.baidu.com/s?t=7aec0h3KB3Ba8lAbuyPg0AC0eDa59IvtDSmtMQBc6eW
-
Benefits:
- The query value is modified each time the file is changed, and when the query value is different, the page references different resource paths. The browser automatically loads new resources.