Render pages: How browsers work
Fast loading of page content and smooth interaction are what users want from the Web experience, so developers should strive for both. Understanding how to improve performance and perceived performance helps you understand how a browser works.
An overview of the
Responsive sites provide a better user experience. Users expect a Web experience with fast content loading and smooth interaction
Waiting times for resources to load and single-threaded browser execution in most cases are the two main causes of Web performance impact.
Wait times are a major threat that needs to be overcome to get the browser to load resources quickly. To achieve fast loading, the goal is to send the requested information as quickly as possible, or at least as fast as it looks. The network wait time is the link transmission time consumed when the binary is transferred over the link to the computer. Web performance optimization is all about getting pages to load as quickly as possible.
For the most part, browsers execute single-threaded. In order to have smooth interaction, the developer’s goal is to ensure the interactive performance of the site from smooth page scrolling to click-to-response. Rendering time is the key factor, ensuring that the main thread can do all the tasks it’s given and still handle user interactions all the time. By understanding the single-threaded nature of the browser and minimizing the responsibilities of the main thread, you can optimize Web performance to ensure smooth rendering and timely interaction responses.
Build a DOM tree
The first step is to process the HTML tags and construct the DOM tree. HTML parsing involves tokenization and tree construction. HTML tags include start and end tags, as well as attribute names and values. If the document is well-formed, parsing it is simple and quick. The parser parses the tokenized input into the document, building the document tree.
The DOM tree describes the content of the document. The element is the first tag and the root of the document tree. Trees reflect the relationships and hierarchies between different tags. Tags nested within other tags are child nodes. The more DOM nodes there are, the longer it takes to build the DOM tree.
When the parser finds a non-blocking resource, such as an image, the browser requests the resource and continues parsing. Parsing can also continue when a CSS file is encountered, but for
Preload the scanner
This process takes up the main thread when the browser builds the DOM tree. When this happens, the preload scanner will parse the available content and request high-priority resources, such as CSS, JavaScript, and Web fonts. Thanks to the preloaded scanner, we don’t have to wait until the parser finds a reference to an external resource to request it. It retrieves the resources in the background so that by the time the main HTML parser reaches the requested resources, they may already be running or have been downloaded. The optimizations provided by preloading scanners reduce congestion.
Build CSSOM tree
The second step is to process the CSS and build the CSSOM tree. The CSS object model is similar to the DOM. DOM and CSSOM are two trees. They are independent data structures. Browsers translate CSS rules into style maps that you can understand and use. The browser iterates through each rule set in the CSS, creating a tree of nodes with parent, child, and sibling relationships based on the CSS selector.
As with HTML, browsers need to convert the CSS rules they receive into something they can use. Therefore, it repeats the HTML to object process, but for CSS.
The CSSOM tree includes styles from the user agent style sheet. Browsers start with the most general rules that apply to nodes and recursively optimize the style of computation by applying more specific rules. In other words, it cascades property values.
CSSOM is built very, very quickly and is not shown in a unique color in current development tools. Instead, recalculate styles in the developer tools shows the total time required to parse the CSS, construct the CSSOM tree, and recursively compute the computed style. In terms of Web performance optimization, it is easy to achieve because the total time to create a CSSOM is usually less than the time required for a DNS lookup.
JavaScript
While the CSS is parsed and the CSSOM is created, other resources, including JavaScript files, are being downloaded (thanks to the Preload scanner). JavaScript is interpreted, compiled, parsed, and executed. Scripts are parsed into abstract syntax trees. Some browser engines use the “Abstract Syntax Tree” and pass it to the interpreter to print out the bytecode executed on the main thread. This is called JavaScript compilation.
The browser also builds the accessibility tree that assistive devices use to analyze and interpret content. The Accessibility Object Model (AOM) is similar to a semantic version of DOM. When the DOM is updated, the browser updates the accessibility tree. Assistive technologies alone cannot modify the accessibility tree.
Before AOM is built, screen readers cannot access the content.
Apply colours to a drawing
The rendering steps include styling, layout, drawing, and in some cases compositing. The CSSOM tree and DOM tree created in the parsing step are combined into a Render tree, which is then used to calculate the layout of each visible element and then draw it to the screen. In some cases, content can be promoted to their own layers and composited, improving performance by drawing part of the screen on the GPU instead of the CPU, thereby freeing up the main thread.