Background of 0.
The company’s system is a partial background system, and the display effect is mostly list display. The display of large amount of data in list display is mostly queried by paging.
Elasticsearch version: 6.3.2
Java Client version: REST-high-level Client 6.3.2
Problem: After 1.9W+ data paging query, there is no problem in the total number of query results (1.9W+). When viewing the data in the next few pages, there is no data and an error is reported in the background.
Screenshot below:
The background error message is as follows:
{"error": {"root_cause": [{"type":"query_phase_execution_exception"."reason":"Result window is
too large, from + size must be less than or equal to: [10000] but was [19530]. See the scroll
api for a more efficient way to request large data sets. This limit can be set by changing the
[index.max_result_window] index level setting."}]."type":"search_phase_execution_exception"."reason":"all shards
failed"."phase":"query"."grouped":true."failed_shards": [{"shard": 0."index":"dragon-dev"."node":"ln1Y4mW3S2mHDpqMBrcURQ"."reason": {"type":"query_phase_execution_exception"."reason":"Result window is too large, from + size must be less than or equal to: [10000] but was [19530]. See the scroll api for a more efficient way to request large data sets. This limit can be set by changing the [index.max_result_window] index level setting."}}},"status": 500}Copy the code
1. Solutions
In the background error message, you can clearly see the error cause:
Result window is too large, from + size must be less than or equal to: [10000] but was [19530] : the total number of results was too large to exceed the limit of 10000.
See the scroll api for a more efficient way to request large data sets. This limit can be set by changing the [index.max_result_window] index level setting: provides a solution, which means to solve the large data request by viewing the scroll method, and can also change the limit size by modifying the index.max_result_window parameter.
This paper is mainly modifiedindex.max_result_window
Parameters quickly solve problems.
2. Just do it
As mentioned in the ElasticSearch blog, ES paginates by loading from+size into memory, i.e. I click 1953 page and show 10 pages per page, ES puts 19530 pages into memory. The downside is clear: too much data can consume too much memory, so ES limits the maximum number of paging queries to 10000 by default.
How do I change the maximum number of paging queries (index.max_result_window) parameter?
In fact, it is very simple to configure in setting. (This can be set at index creation time or after index creation.)
PUT/index name /_settings {"index" : { "max_result_window" : 10000000}}
Copy the code
Run the above script to change the maximum number of displays to 1000W.
3. Summary
If data cannot be displayed due to the limitation of ES paging query, you can modify configuration parameters or use the Scroll API (recommended) to solve the problem. This article will not discuss how to use scroll API to improve, you can refer to the official website for examples. Good luck.