GitHub 21.5K Star Java engineers become god’s path, not to learn about it!

GitHub 21.5K Star Java engineers become god’s way, really not to learn about it!

In order to solve the concurrency problem, many locks are introduced in MySQL database. Most of the time, database locks are automatically added during database operations.

As a result, many programmers often ignore the real mechanism of database locking. For example, I often ask candidates in interviews, do you know what the lock of MySQL Innodb is?

I have heard many different answers to this question, but few people can answer it perfectly. To answer this question, you need to know the isolation level of the database, indexes, and so on.

MySQL Innodb lock related introduction, in MySQL’s official documents (dev.mysql.com/doc/refman/…). The introduction of this article is also based on this official document.

Record Lock

**Record Lock is a Lock placed on an indexed Record. SELECT c1 FROM t WHERE c1 = 10 For UPDATE; The record c1=10 is locked to prevent any other transactions from inserting, updating, or deleting rows with c1 value 10.

It is important to note that record locks lock index records. Even if the table does not have an index defined, InnoDB creates a hidden clustered index and uses this index to lock records.

Gap Lock

A Gap Lock is a Lock on the Gap between index records, or on the Gap before the first index record or after the last index record.

So, how to understand the so-called Gap here?

Gap refers to the place in InnoDB’s index data structure where new values can be inserted.

When you use the phrase SELECT… FOR UPDATE locks a set of rows. InnoDB can create locks that apply to the actual values in the index and the gaps between them. For example, if all values greater than 10 are selected for update, a gap lock will prevent another transaction from inserting new values greater than 10.

Since this is a lock, the concurrency of the database can be affected, so gap locks are only effective at the isolation level of Repeatable Reads.

With isolation of Repeatable Reads, lock read operations (SELECT… For update, lock in share mode), update, delete

  • For unique indexes with unique search criteria, InnoDB locks only found index records, not gaps.

  • For other search criteria, InnoDB locks the index range scanned, using a gap lock or a next-key lock to block gaps covered by other transaction insert ranges.

That is, FOR SELECT FOR UPDATE, LOCK IN SHARE MODE, UPDATE, and DELETE statements, a gap or next-key LOCK is acquired except FOR a unique search of a unique index, that is, the scope of its scan is locked.

Next-Key Lock

A next-key lock is a combination of a record lock on an index record and a gap lock on the gap before the index record.

Suppose an index contains the values 10, 11, 13, and 20. Possible next-key locks for this index include the following ranges:

(-∞, 10] (10, 11] (11, 13] (13, 20] (20, ∞)Copy the code

For the last gap, ∞ is not a true index record, so in effect, this next-key lock locks only the gap after the maximum index value.

As a result, next-key locks are left open and right closed.

Next-key Lock, like Gap Lock, only works in InnoDB’s RR isolation level.

Repeatable Reads can solve phantom Reads

Repeatable Reads in MySQL can solve the problem of unrepeatable Reads, but not phantom Reads. Only Serializable can solve the problem. But that’s not true.

Because MySQL is different from standard RR, standard Repeatable Reads do have phantom Reads problem, but Repeatable Reads in InnoDB can solve phantom Reads problem by next-key lock.

Because we know that with next-key locks, when row locks are needed, gaps in the index are locked at the same time. This prevents other transactions from inserting records into these gaps, which solves the phantom read problem.

Regarding this issue, there has been a wide discussion, you can refer to: github.com/Yhzhtk/note… There are many great gods who have expressed their opinions.

MySQL lock principle

Record Lock, Gap Lock, and next-key Lock were introduced, but locking rules were not explained. As for the locking rule, I understand it after reading the article in “MySQL Actual Combat 45” by Dinqi big guy. The locking rule he summarizes contains two “principles”, two “optimizations” and a “bug” :

Principle 1: The basic unit of locking is next-key lock. It’s a front open and back closed interval. Principle 2: Only objects accessed during lookup are locked. Optimization 1: equivalent query on index, next-key lock degenerates to row lock when unique index is locked. Optimization 2: the next-key lock degrades to a gap lock when the last value does not meet the equivalence condition. One bug: Range queries on unique indexes access up to the first value that does not meet the criteria.

Suppose that the database table currently has the following records:

Update t set d=d+1 where id=7 update t set d=d+1 where id=7

  • According to principle 1, the lock unit is next-key lock, session A lock range is (5,10);
  • According to optimization 2, this is an equivalent query (id=7), and id=10 does not meet the query conditions, so the next-key lock degenerates into a gap lock, so the final lock range is (5,10).

Select * from t where id>=10 and id<11 for update

  • Next key lock (5,10) next key lock (5,10) next key lock (15
  • According to optimization 1, the equivalent condition on the primary key ID degenerates into a row lock, and only the row lock with id=10 is added.
  • According to principle 2, everything accessed must be locked, so next-key lock(10,15) is required. So the row lock id=10 and next-key lock(10,15) are finally added.

Select * from t where id>10 and id<=15 for update * according to principle 1, the unit of the lock is next-key lock, the next-key lock is assigned to (10,15), and since id is the only key, the loop should stop when id=15. * However, InnoDB will scan forward to the first behavior that does not meet the criteria, that is, id=20. And since this is a range scan, the next-key lock on the index id (15,20) is also locked.

Suppose that the database table currently has the following records:

Select id from t where c=5 lock in share mode

  • According to principle 1, the lock unit is next-key lock, so (0,5) is assigned a next-key lock. Note that c is a normal index, so only access to c=5 cannot be stopped immediately. You need to traverse right until c=10.
  • According to principle 2, everything accessed should be locked, so (5,10) should be next-key locked.
  • According to optimization 2: equivalence judgment, we traverse to the right, and the last value does not meet the equivalence condition of C =5, so it degenerates into a gap lock (5,10).
  • According to principle 2, only accessed objects are locked. This query uses overridden indexes and does not require access to primary key indexes, so no locks are placed on primary key indexes.

Select * from t where c>=10 and c<11 for update

  • Next key lock (5,10) next key lock (5,10) next key lock (15
  • According to principle 2, everything accessed must be locked, so next-key lock(10,15) is required.
  • Because c is A non-unique index, there are no optimization rules, that is, it does not degenerate into A row lock. Therefore, sesion A adds (5,10] and (10,15] next-key locks on c.

conclusion

Above, we introduced the Lock mechanism in InnoDB, a total of three types of Lock, respectively Record Lock, Gap Lock and next-key Lock.

A Record Lock is used to Lock index records. A Gap Lock is a Gap Lock between index records. Next-key Lock is a combination of Record Lock and Gap Lock that locks both index records and gaps. His range is left open and right closed.

InnoDB RR level, the basic unit of locking is next-key lock, as long as the scanned data will be locked. A range query on a unique index accesses up to the first value that does not satisfy the condition.

In order to improve performance and concurrency, there are also two optimization points:

  • A next-key lock is degraded to a row lock when a unique index is locked.
  • The next-key lock is degraded to a gap lock when the last value does not meet the equivalence condition.

So much for locks. However, the isolation level of RR can solve many problems such as phantom reading to some extent, but it also has some side effects, such as decreased concurrency and easy to cause deadlock.

InnoDB uses RR as its default level. InnoDB uses RC as its default level. InnoDB uses RR as its default level.

About the author: Hollis, a person with a unique pursuit of Coding, is a technical expert of Alibaba, co-author of “Three Courses for Programmers”, and author of a series of articles “Java Engineers becoming Gods”.

Follow the public account [Hollis], the background reply “into god map” can be downloaded to receive the Java engineer advanced mind map.