preface
With the increasing complexity of front-end application scenarios, the processing of big data is inevitable. So today, let’s take a real application scenario as an example to talk about how the front end handles big data through child threads.
Currently, the refresh rate of the mainstream monitor is 60Hz, that is, 16ms for a frame. Therefore, it is recommended that the refresh rate be less than 16ms for animation playback, less than 100ms for user operation response, and less than 1000ms for page display.
— According to RAIL, the User perception performance model proposed by the Chrome team.
The above application is the user optimal experience model proposed by Google team. From the perspective of JS operation, it roughly means to ensure that every JS task is executed in the shortest possible time.
Case scenario
In the modern Web program, the requirement of data and report export has been very common. As the volume of exported data increases and the complexity of the data increases, the most common time fields may also require front-end conversion in most cases, so the traversal of the source data is always unavoidable. Now take exporting the monitoring data report of various factors of a site as an example:
Report Format Requirements
- Each data contains a number of factor data, and each factor data contains the monitoring data of the change factor and the corresponding evaluation grade;
- It is required to export hourly data of 90 days in the last quarter, and the data source is about 2100 items (the condition of paging query);
- The report time format is
YYYY Year MM month DD Day HH
(for example: 23:00 on December 25, 2020), each factor content is factor data + factor grade (for example: 2.36 (I)).
The data source,
The data returned from the back end is in the following format
{
"dateTime": "2021-06-05 14:00:00." "."name": "Site 1"."factorDatas": [{"code": "w01010"."grade": 1."value": 26.93},
{"code": "w666666"."grade": 1."value": 1.26}}]Copy the code
Basic processing of data sources
Corresponding to the report export requirements, the traversal of the more than 2000 pieces of data is always unavoidable, and there will even be large cycles nested with small cycles.
- Large loops need to handle dateTime fields;
- In the mini-loop, you need to loop the factorDatas field, query the grade name corresponding to grade, and finally concatenate the required report format.
The topic
Simple implementation
The following code is just mock code; by default the front end has finished loading all the data
The normal development process, of course, is to use the for loop to continuously call the paging interface to continuously query the data until the data is queried, and then do a unified loop to process each row of data. To facilitate data processing, separate some public methods from a utility class:
class UtilsSerice {
/** * Get water quality information *@param waterType
* @param keyValue
* @param keyName? * /
static async getGradeInfo(waterType: WaterTypeStringEnum, keyValue: string | number, keyName? :string) :Promise<WaterGrade | null | undefined> {
// The key of the cache data
const flagId: string = waterType + keyValue;
// If there is a value in the cache, return it directly
if (TEMP_WATER_GRADE_MAP.get(flagId)) {
return TEMP_WATER_GRADE_MAP.get(flagId);
}
// Get the list of levels
const gradeList: WaterGrade[] = await this.getEnvData(waterType);
// Query the level of a level
const gradeInfo: WaterGrade = gradeList.find((item: WaterGrade) = > {
const valueName: string | number | undefined = keyName === 'id' ? 'id' : item.hasOwnProperty('value')?'value' : 'level';
return item[valueName] === keyValue;
}) as WaterGrade;
// Cache the level information to facilitate the next query of the level
if (gradeInfo) {
TEMP_WATER_GRADE_MAP.set(flagId, gradeInfo);
}
returngradeInfo; }}Copy the code
The data export logic is as follows:
// Assume that allList is already 2100 data sets
const allList = [{"dateTime": "2021-06-05 14:00:00." "."code": "sssss"."name": "Site 1"."factorDatas": [{"code": "w01010"."grade": 1."value": 26.93}, {"code": "w666666"."grade": 1."value": 1.26}]}]
const table: ObjectUnknown[] = [];
for (let i = 0; i < allList.length; i ++) {
constrows = {... allList[i]}// Process the time format as required
rows['tiemStr'] = moment(allList[i].dateTime).format('YYYY ')
for (let j = 0; j < allList[i].factorDatas.length; j ++) {
const code = allList[i].factorDatas[j].code
const value = allList[i].factorDatas[j].value
const grade = allList[i].factorDatas[j].grade
// Get the level data asynchronously here on demand ---- This method has been optimized for performance as much as possible
const gradeStr = await UtilsSerice.getGradeInfo('surface', grade, 'value')
rows[code] = `${value}(${gradeStr}) `
}
table.push(rows)
}
const downConfig: ExcelDownLoadConfig = {
tHeader: ['Point name'.'Access code'.'Monitoring time'.'factor 1'.'factor 2'.'factor 2'].bookType: 'xlsx'.autoWidth: 80.filename: 'Data query'.filterVal: ['name'.'code'.'tiemStr'.'w01010'.'w01011'.'w01012'].multiHeader: [].merges: []};// This method is generic Excel data processing logic
const res: any = await ExcelService.downLoadExcelFileOfMain(table, downConfig);
const file = new Blob([res.data], { type: 'application/octet-stream' });
// Save the file
saveAs(file, res.filename);
Copy the code
Because the JS engine thread is a single thread and is mutually exclusive with the GUI rendering thread, when performing complex JS calculation tasks, the user’s intuitive feeling is that the system is stuck, such as input box cannot be input, animation stops, buttons are invalid, etc. The above code can achieve data export, you can see that the main thread export data image rotation has stopped, input box has been unable to enter.
I believe that no matter how friendly party A is, it is unacceptable to estimate such a system.
thinking
Slightly programming experience for the development of more or less will understand, because of big data for loop traverse blocked other script execution, based on the idea that there is a performance optimization probability will experience the development engineering of normal university this big traversal split into multiple small tasks to less caton, this scheme can also be a certain extent, solve the problem of caton. However, such optimization scheme of time fragmentation and task splitting is not suitable for all big data processing, especially those with strong dependence on the data before and after, so this optimization scheme will not be discussed in this article. This article is about Webworkers:
It allows multiple JavaScript scripts to be executed concurrently in a Web application, with each stream of script execution called a thread, independent of each other and managed by a JavaScript engine in the browser. This will make thread-level message communication a reality. Makes it possible to do multithreaded programming in Web pages.
– IMWeb community
WebWorker has several features:
- Ability to run for a long time (response)
- Fast start and ideal memory consumption
- Natural sandbox environment
WebWorker use
create
// Create a Worker object and pass it the URL of the script to be executed in the new thread
const worker = new Worker('worker.js');
Copy the code
communication
// Send a message
worker.postMessage({first:1.second:2});
// Listen for messages
worker.onmessage = function(event){
console.log(event)
};
Copy the code
The destruction
The worker is terminated in the main thread and cannot be reused for messaging thereafter. Note: Once Terminate is established, you cannot re-enable it, only create it separately.
worker.terminate();
Copy the code
Export Function Migration
Next, we will talk about how to migrate the data export part of the code to webWorker. Before the function migration, we need to comb out the prerequisites of data export:
1: webWorker needs to be able to call Ajax to get interface data; 2. Scripts that can load Excel. Js in webWorker; 3: saveAs in file-saver can be called normally;
Based on the above conditions, let’s discuss one by one. The first one is fortunate that webWorker supports making Ajax requests for data; Second, the importScripts() interface is provided in webWorker, so instances of Excel can be generated in webWorker. The third point is a bit of a pity. The DOM object cannot be used in the webWorker, but file-Saver uses the DOM, so the child thread can only process the data and pass it to the main thread, which will perform the file save operation (there is a small optimization here, more on later).
Scheme comparison
At present, there are many solutions to integrate webWorker in the industry. The following is a simple comparison (from Tencent’s front end team) :
project | Introduction to the | Build a package | Low-level API encapsulation | Call declarations across threads | Availability monitoring | Excelstor malleable |
---|---|---|---|---|---|---|
worker-loader | Webpack official, source packaging ability | ✔ ️ | ✘ | ✘ | ✘ | ✘ |
promise-worker | Encapsulate the base API as promise-based communication | ✘ | ✔ ️ | ✘ | ✘ | ✘ |
comlink | Chrome team, communication RPC encapsulation | ✘ | ✔ ️ | Function of the same name (proxy-based) | ✘ | ✘ |
workerize-loader | The community is currently a relatively complete program | ✔ ️ | ✔ ️ | Function of the same name (generated based on AST) | ✘ | ✘ |
alloy-worker | Transaction-oriented highly available Worker communication framework | Provide build scripts | Communication ️ controller | Function of the same name (based on convention), TS declaration | Complete monitoring indicators, full cycle error monitoring | Namespace, transaction generation script |
webpack5 | Webpack5 is used to replace the worker-loader | Provide build scripts | ✘ | ✘ | ✘ | ✘ |
Based on the above comparison and my preference for TS, this case uses Alloy worker for webWorker integration. Due to the problem of the official NPM package, it cannot be integrated in a timely manner, so it can only be manually integrated.
The worker integration
Official integration document
First, copy the core basic worker communication source code to the project directory SRC /worker.
Declare the transaction
The first step is to add the transaction for the data export in SRC /worker/common/action-type.ts.
export const enum TestActionType {
MessageLog = 'MessageLog'.// Declare the transaction for the data export
ExportStationReportData = 'ExportStationReportData'
}
Copy the code
Request and response data type declarations
In the SRC/worker/common/content – the ts file declaration request and response data types.
Request data type declarations that communicate transactions across threads
export declare namespace WorkerPayload {
namespace ExcelWorker {
// These two parameters are required to export data when calling ExportStationReportData
type ExportStationData = {
factorList: SelectOptions[];
accessCodes: string[]; } & Transfer; }}Copy the code
Response data type declarations that communicate each transaction across threads
export declare namespace WorkerReponse {
namespace ExcelWorker {
type ExportStationData = {
data: any; } & Transfer; }}Copy the code
Main thread logic
SRC /worker/main-thread Create excel. Ts file to write data transaction code.
/** * step 4: Declare the main thread business logic code * TODO */
export default class Excel extends BaseAction {
protected threadAction: IMainThreadAction;
/** * Export monitoring point data *@param payload* /
public asyncexportStationReportData(payload? : WorkerPayload.ExcelWorker.ExportStationData):Promise<WorkerReponse.ExcelWorker.ExportStationData> {
return this.controller.requestPromise(TestActionType.ExportStationReportData, payload);
}
protected addActionHandler(): void{}}Copy the code
Main thread logical instantiation
SRC /worker/main-thread/index
The main thread declares the transaction namespace
// Declare only the transaction namespace for transactions that call other namespaces within a transaction
export interface IMainThreadAction {
/ /...
excel: Excel;
}
Copy the code
The main thread declares transaction instantiation
export default class MainThreadWorker implements IMainThreadAction {
/ /...
public excel: Excel;
public constructor(options: IAlloyWorkerOptions) {
/ /...
this.excel = new Excel(this.controller, this);
}
/ /... Omit code
}
Copy the code
Child thread logic
SRC /worker/worker-thread Create excel. Ts file to write data transaction code. This file contains the core data export function.
Data request, data processing
export default class Test extends BaseAction {
protected threadAction: IWorkerThreadAction;
protected addActionHandler(): void {
this.controller.addActionHandler(TestActionType.ExportStationReportData, this.exportStationReportData.bind(this));
}
/** * get data query *@protected* /
@HttpGet('/list')
protected async getDataList(@HttpParams() queryDataParams: QueryDataParams, factors? : SelectOptions[],@HttpRes() res? :any) :PromiseThe < {total: number; list: TableRow[] }> {
return {list: res.rows}
}
/** * Test export data *@private* /
private asyncexportExcel(payload? : WorkerPayload.ExcelWorker.ExportExcel):Promise<any> {
try {
// Add XLSX to the worker
importScripts('https://cdnjs.cloudflare.com/ajax/libs/xlsx/0.16.9/xlsx.core.min.js');
const table: ObjectUnknown[] = [];
for (let i = 0; i < allList.length; i ++) {
constrows = {... allList[i]}// Process the time format as required
rows['tiemStr'] = moment(allList[i].dateTime).format('YYYY ')
for (let j = 0; j < allList[i].factorDatas.length; j ++) {
const code = allList[i].factorDatas[j].code
const value = allList[i].factorDatas[j].value
const grade = allList[i].factorDatas[j].grade
// Get the level data asynchronously here on demand ---- This method has been optimized for performance as much as possible
const gradeStr = await UtilsSerice.getGradeInfo('surface', grade, 'value')
rows[code] = `${value}(${gradeStr}) `
}
table.push(rows)
}
const downConfig: ExcelDownLoadConfig = {
tHeader: ['Point name'.'Access code'.'Monitoring time'.'factor 1'.'factor 2'.'factor 2'].bookType: 'xlsx'.autoWidth: 80.filename: 'Data query'.filterVal: ['name'.'code'.'tiemStr'.'w01010'.'w01011'.'w01012'].multiHeader: [].merges: []};const res = await ExcelService.downLoadExcelFile(table, downConfig, (self as any).XLSX);
// Due to the worker limitations mentioned earlier (DOM access is not possible), the child thread processes the objects needed by Excel and passes the data to the main thread, which then exports the data
// Normal postMessage will clone the tree, but the data processed here may be very large, it is estimated that the data will be directly transferred
return {
transferProps: ['data'].data: res.data,
filename: res.filename,
}
} catch (e) {
console.log(e); }}}Copy the code
Child thread logical instantiation
SRC /worker/worker-thread/index add excel;
The main thread declares the transaction namespace
// Declare only the transaction namespace for transactions that call other namespaces within a transaction
export interface IWorkerThreadAction {
/ /...
excel: Excel;
}
Copy the code
Child threads declare transaction instantiation
class WorkerThreadWorker implements IWorkerThreadAction {
public excel: Excel
/ /... Omit code
public constructor() {
this.controller = new Controller();
this.excel = new Excel(this.controller, this);
/ /... Omit code}}Copy the code
At this point, the export function has been fully migrated to child threads.
Main thread call
The main thread also calls the data export function simply by instantiating a child thread and then happily dumping the complex computational logic on the child thread, something like this.
class HomPage extends VueComponent {
public created() {
try {
// Instantiate a child thread and mount it on the window
const alloyWorker = createAlloyWorker({
workerName: 'alloyWorker--test'.isDebugMode: true
});
}catch (e) {
console.log(e); }}/** * Child thread data export *@private* /
private async exportExcelFile() {
// Just call the declared method
(window as any).alloyWorker.excel.exportStationReportData({
factorList: factors,
accessCodes: [{ accessCode: 'sss'.name: 'Test site' }]
}).then((res: any) = > {
// Big data export effect, data returned by the child thread
console.log(res);
// Convert binary data returned by child thread to Blob for file saving
const file = new Blob([res.data], { type: 'application/octet-stream' });
// Save the filesaveAs(file, res.filename); }); }}Copy the code
The results are as follows, and you can definitely feel that there is no lag in the page during the data export process.
conclusion
The above code uses a real demand case to verify that webWorker greatly improves user experience. This requirement is probably rare in most developments, but it does occur occasionally. Of course, webWorker is not the only solution. In the case of the same amount of calculation, the calculation in the child thread is not much faster than the main thread, or even slower than the main thread. Therefore, some calculations that do not require high timely feedback can only be put into the child thread for calculation. If you want to improve the efficiency of the calculation, you can only improve the efficiency of the algorithm or use WebAssembly, which will be discussed in the future.
reference
- Web_Workers_API
- The worker information
- alloy-worker