I’m Little Xia Lufei. Learning shapes our lives. Technology changes the world. With you!
The introduction
Redis is a favorite for backend development in e-commerce environments because of its full memory based, high performance, and rich data types. Among them, ordered set Zset is one of the basic data types, and each member has score (which can be used for sorting), so it is very suitable for use in the scene of daily list of rewards and earnings in the last week.
Preliminary Study on data Structure
The encoding of ordered collection objects can be Ziplist or Skiplist. Ziplist encoding is used when the following conditions are met:
- The number of elements is less than 128
- All members are less than 64 bytes long
The upper limit of the above two conditions can be changed by using zset-max-ziplist-entries and zset-max-ziplist-value.
An ordered collection of Ziplist codes is stored with compressed list nodes next to each other, the first node holding member and the second holding score. Set elements in Ziplist are sorted in ascending order of score, with smaller score placed at the top of the table.
Underlying the ordered collection of skiplist codes is a structure named zset, and a zset structure contains both a dictionary and a skiplist. The skip list holds all collection elements from small to large by score. The dictionary keeps the mapping from member to score, so that the complexity of O(1) can be used to find the score value corresponding to member. Although both structures are used together, they share members and scores of the same elements through Pointers, so no extra memory is wasted.
Dive into skiplist
Before introducing jumpers, let’s review the basic linked list and think about how the data structures evolved into jumpers step by step.
In such a linked list, if we are looking for some data, we need to start from scratch by comparing it until we find the node that contains the data, or until we find the first node that is larger than the given data (none). In other words, the time complexity is order n. Similarly, when we want to insert new data, we go through the same lookup process to determine where to insert it.
Suppose we add a pointer to each node to point to the next node, as shown below:
Now when we want to find the data, we can start by looking along this new list. When it encounters a node larger than the data to be checked, it goes back to the original list for lookup. For example, if we want to find 23, the path we want to find is in the direction indicated by the pointer marked in red below:
- 23 is first compared to 7, then to 19, bigger than both of them, and then going backwards.
- But 23 is smaller than 26 when compared to 26, so go back to the following linked list (the original linked list) and compare to 22.
- Twenty-three is larger than twenty-two, so go back to twenty-six along the pointer below. 23 is smaller than 26, indicating that the data 23 does not exist in the original list, and its insertion position should be between 22 and 26.
During this lookup, the time complexity is no longer O(N) due to the addition of Pointers, which means we no longer need to compare each node in the list individually. There are about half as many nodes to compare.
In the same way, we can extend the pointer to a new list at the upper level to create a third list. The diagram below:
On this new three-tier list structure, if we still look for 23, we first compare 19 along the topmost list and find that 23 is larger than 19. Then we know that we only need to go after 19 and skip all the nodes before 19. As you can imagine, when the list is long enough, this kind of multi-layer list lookup allows us to skip many of the lower nodes, greatly speeding up the lookup.
Skiplist is inspired by this idea of multi-tiered linked lists. In fact, the way the list is generated above, the number of nodes in each layer of the list is half of the number of nodes in the lower layer, so that the search process is very similar to a binary search, so that the search time can be reduced to order log n. However, this method has major problems when inserting data. When a new node is inserted, the strict 2:1 relationship between the number of nodes in the next two linked lists is disrupted. If this correspondence is to be maintained, all nodes after the newly inserted node (including the newly inserted node) must be re-adjusted, which causes the time complexity to degenerate back into O(n). Deleting data has the same problem.
Skiplist avoids this problem by not requiring a strict relationship between the number of nodes in the next two levels of the linked list. Instead, it assigns a random level to each node. Attached to the source code:
#define ZSKIPLIST_MAXLEVEL 32 #define ZSKIPLIST_P 0.25 int zslRandomLevel(void) {int level = 1; while ((random()&0xFFFF) < (ZSKIPLIST_P * 0xFFFF)) level += 1; return (level<ZSKIPLIST_MAXLEVEL) ? level : ZSKIPLIST_MAXLEVEL; }Copy the code
The calculation of random numbers when performing inserts is a critical process that has a significant impact on skiplist’s statistical features. This is not an ordinary random number that follows a uniform distribution, but follows certain rules:
- First, each node must have a level 1 pointer (each node is in the level 1 list).
- If a node has a pointer to level I (I >=1) (that is, the node is already in the list from level 1 to level I), then the probability that it has a pointer to level (I +1) is p.
- The maximum number of layers of a node cannot exceed a maximum value, denoted as MaxLevel (32 in Redis).
For example, if a node has a random number of layers 3, then it is linked to the list of layers 1 through 3. For clarity, the following figure shows how to create a Skiplist through step-by-step insertion operations:
As you can see from the skiplist creation and insertion process above, the level of each node is randomly determined, and the insertion of a new node does not affect the level of other nodes. Therefore, the insert operation only needs to change the pointer before and after the insert node, and does not need to adjust many nodes. This reduces the complexity of the insert operation. In fact, this is an important feature of Skiplist, which makes it significantly better at insert performance than balanced tree solutions. We’ll come back to that later.
Skiplist refers to the fact that, in addition to the lowest level of the list, it produces several sparse lists in which Pointers intentionally skip nodes (and the higher the list, the more nodes it skips). This allows us to search for data in the upper echelon of the list, and then move down to the first level of the list to precisely locate the data. In the process, we skipped some nodes, thus speeding up the search.
The skiplist we just created contains a total of four linked lists. Now assume we are still looking for 23 in it, as shown below:
It should be noted that the insertion process of each node demonstrated above actually requires a similar search process before insertion, and the insertion operation is completed after the insertion position is determined.
In practice, each skiplist node should contain member and score parts. In the previous description, we did not specifically distinguish member and score, but in fact, the list is sorted according to score, and the search process is also compared according to score.
Why skip tables instead of hash tables or balanced trees
- The elements of Skiplist and various balanced trees (AVL, red-black, etc.) are ordered, while hash tables are not. Therefore, the hash table can only do a single key lookup, not a range lookup. Range lookup refers to finding all nodes whose size is between two specified values.
- Balanced trees are more complex than skiplist operations when it comes to range lookups. In the equilibrium tree, after we find the low value in the specified range, we need to continue to find other nodes that do not exceed the high value in a middle-order traversal order. The middle-order traversal here is not easy to achieve without some modification of the equilibrium tree. Range lookups on Skiplist are as simple as a few steps through the level 1 list after finding small values.
- Balanced tree insertions and deletions can trigger complex logic adjustments to subtrees, whereas Skiplist insertions and deletions require simple and fast changes to adjacent nodes’ Pointers.
- Skiplist is more flexible than balanced trees in terms of memory footprint. In general, a balanced tree contains 2 Pointers per node (one to the left and right subtrees), whereas skiplist contains an average of 1/(1-p) per node, depending on the size of the parameter p. If p=1/4, as in Redis, then on average each node contains 1.33 Pointers, which is better than a balanced tree.