MySQL concurrency control is a trade-off between data security and concurrency processing capability, with different locking strategies to determine the impact on system overhead and performance.
Basic Knowledge To make subsequent explanations easier to understand, here are some basic concepts explained.
Pessimistic locking Pessimistic locking refers to the conservative attitude towards data being modified by other current transactions on the system, as well as transactions from external systems, so that data is locked during the entire data processing process.
select… For Update is MySQL’s way of implementing pessimistic locking. In the case of pessimistic locking, other transactions cannot modify the data in order to maintain transaction isolation.
Now in the highly concurrent architecture of the Internet, under the influence of the Fail-fast idea, pessimistic locking has been relatively rare.
Compared with pessimistic lock, optimistic lock adopts a looser locking mechanism. Pessimistic locking relies on the locking mechanism of the database to ensure maximum exclusivity of operations. However, the resulting database performance costs, especially for long transactions, are often prohibitive.
Optimistic locking mechanism solves this problem to a certain extent. Optimistic locking is mostly implemented based on the Version recording mechanism. By adding a Version field of numeric type to the database table, the Version field value will be read together when the data is read. Each time the data is updated, the Version value will be +1. When we submit the update, compare the current version information of the database table with the version value extracted for the first time. If the current version number of the database table is equal to the version value extracted for the first time, it will be updated; otherwise, it is considered as expired data and the update fails.
Lock granularity MySQL defines two types of lock granularity: table level and row level.
Table locks are controlled by MySQL Server and are divided into read locks and write locks. Advantage is small overhead, fast lock; No deadlocks occur; The disadvantage is that the lock granularity is large, the probability of lock conflict is highest, and the concurrency is lowest. Table locks are suitable for the scenario with many queries and few updates.
When a read lock is added to a table, the session can only read the currently locked table. Other sessions can still read but cannot write to the table.
When a write lock is added to a table, the session can read or write to the locked table. Other sessions cannot read or write to the locked table.
Row locking is implemented by the storage engine and InnoDB supports it but MyISAM does not. Advantage is large overhead, slow lock; Deadlocks occur; The disadvantages are the minimum locking granularity, the lowest probability of lock conflict, and the highest concurrency. InnoDB engine uses row-level locking by default. Row-level locking is suitable for scenarios where updates by index are frequent.
InnoDB is the most commonly used storage engine of MySQL. This article explains the locking mechanism of MySQL from this perspective. For more information about the MySQL storage engine and b-tree, check out the blog “Understanding MySQL’s Index Once and for all.”
Classification of row locks
- Record locks
Record Lock Locks unique index column or primary key column records, and the value exists, otherwise the lock type is gap lock. FOR example, SELECT a FROM T WHERE a = 12 FOR UPDATE, lock the index whose value is 12 to prevent other transactions FROM inserting, deleting, or updating the row whose value is 12.
- Clearance lock
A Gap Lock is only available at the repeatable read, serialization isolation level. A Gap Lock is placed between index records, or before or after an index, excluding the index itself.
FOR example, if SELECT a FROM t WHERE a > 15 and a < 20 FOR UPDATE and a exists 1,2, 5, 10, 15,20, then the gap between (15,20) is locked.
Intermittent lock range:
When a primary key or unique index is read, the where condition is all exactly hit (= or in), and this scenario itself does not have phantom reads, so only record locks are added. When a column without an index is read, a gap lock will be added to the entire table. (all primary key x locks, all primary key gap locks) non-unique index column, if the WHERE condition is partially matched (>, <, like, etc.) or not matched at all, the nearby gap lock will be added. For example, a table has the following non-unique indexes: 2,6,9,9,11,15. Delete from table where another_id = 9 Delete from table where another_id = 9 Delete from table where another_id = 9 delete from table where another_id = 9 delete from table where another_id = 9 delete from table where another_id = 9 See the picture summary at the end of the article for more information. When records are retrieved and locked using range criteria, the intermittent locking mechanism blocks concurrent inserts of key values within the qualifying range, often resulting in severe lock waits. Therefore, in practical application development, especially for applications with a large number of concurrent inserts, business logic should be optimized as far as possible, equality conditions should be used to access updated data as far as possible, and scope conditions should be avoided.
The gap lock and the gap lock are non-conflicting, the only function of the gap lock is to prevent the insertion of other transactions, in RR (repeatable read) level to solve the phantom read problem.
For example, if the id is 3,4, and 5, the gap lock will lock the data whose ID is greater than 3. In this case, adding new data with id=6 will block, thus avoiding phantom reading.
Snapshot reads and current reads will be explained in detail in the next blog post, Understanding MySQL transaction Principles and transaction isolation
- Key in the lock
Next-key Lock is a combination of record Lock and gap Lock. Only at the repeatable read, serialized isolation level.
An index has four values: 10,11,13,20. InnoDB can use record locks to hold 10, 11,13, and 20 as needed. It can also use gap locks to hold (-∞,10), (10,11), (11,13), (13,20), and (20,+∞) ranges. The key lock is a combination of record lock and gap lock.
- Insert intent lock
Insert Intention Locks are a special type of gap Locks that are locked only when an Insert is performed. There are no conflicts between Insert Intention Locks. Multiple rows can be inserted into a gap at the same time. It is this feature that solves the illusion problem.
Suppose there is a record index with key values 4 and 7, and different transactions insert 5 and 6, respectively. Each transaction generates an insert intent lock between 4 and 7, acquiring an exclusive lock on the insert row, but not locking each other, because rows do not conflict.
Intentlock InnoDB intentlock main user multi-granularity lock coexistence situation. For example, if transaction A wants to lock S on A table, if A row in the table has been locked X by transaction B, the lock application should also be blocked. If there is a lot of data in the table, checking lock flags row by row can be expensive and system performance can suffer. To solve this problem, a new lock type can be introduced at the table level to represent the locking of the row to which it belongs, which leads to the concept of “intentional locking.”
For example, if there are 100 million records in the table, transaction A locks some of them, then transaction B needs to add table level lock to the table, if there is no intention to lock, then we need to check whether the 100 million records in the table are locked. If there is an intent lock, then if transaction A adds an intent lock and then X lock before updating A record, transaction B checks whether there is an intent lock on the table and whether the existing intent lock conflicts with the lock it intends to add. If there is A conflict, transaction B will wait until transaction A releases, without detecting each record. When transaction B updates the table, it doesn’t need to know which row is locked, it just needs to know that one row is locked anyway.
The main purpose of intent locks is to resolve the conflict between row locks and table locks, showing that “a transaction is holding a lock on a row or is about to hold a lock.”
Both shared and exclusive locks are row-level locks. Intentional shared locks and intentional exclusive locks are table locks. Intentional shared locks and intentional exclusive locks are automatically added and released by the system without manual intervention.
- A Shared lock
S Lock (Shared Lock) : Also known as read Lock, allows one transaction to read a data set, preventing other transactions from acquiring an exclusive Lock on the data set. A shared lock and a shared lock can be used together. For example, if transaction T holds S lock on data object A, then transaction T can read A but cannot modify A. Other transactions can only hold S lock on A but cannot hold X lock on A until T releases S lock on A. This guarantees that other transactions can read A, but cannot make any changes to A until T releases the S lock on A.
- Exclusive lock
An Exclusive Lock (also known as a write Lock) allows a transaction to acquire update data for an Exclusive Lock, preventing other transactions from acquiring shared and Exclusive locks for the same data set. Exclusive locks are incompatible with exclusive locks and shared locks. For example, if transaction T locks data object A with X, transaction T can either read A or modify A. Other transactions cannot lock A again until T releases the lock on A.
- Intent shared lock
Intention Shared Lock: a transaction intends to assign a Shared Lock to a row. Before assigning a Shared Lock to a row, the transaction must acquire the IS Lock for that table.
- Intent exclusive lock
Intention Exclusive Lock: a transaction that intends to place an Exclusive Lock on a row must acquire an IX Lock on that table before placing an Exclusive Lock on that row.
IS lock and IX lock are proposed only to quickly determine whether the records in the table are locked when the table level S lock and X lock are added later, so as to avoid traversing to check whether the records in the table are locked. In other words, IS lock and IX lock are compatible, IX lock and IX lock are compatible.
There is no conflict between intent locks, but there is conflict between shared locks, exclusive locks, and intent locks. The compatibility of table level locks is shown in the following table.
compatibility
X
IX
S
IS
X
Are not compatible
Are not compatible
Are not compatible
Are not compatible
IX
Are not compatible
Compatible with
Are not compatible
Compatible with
S
Are not compatible
Are not compatible
Compatible with
Compatible with
IS
Are not compatible
Compatible with
Compatible with
Compatible with
- On the lock
Auto-inc Locks, auto-increment Locks, are a special type of table lock. If the table has auto_INCREMENT columns, the increment lock will be acquired before data is inserted into the table, and the lock will be released after the increment column is +1. If the transaction is rolled back, the increment will not be rolled back, so the increment column is not necessarily continuous increment. MySQL has introduced an optional lightweight lock (mutex) mechanism to replace AUTOINC locks since version 5.1.22. See Reference 3)
- Metadata lock
Metadata locks are controlled by MySQL Server to maintain data consistency of table metadata. Metadata locks ensure that metadata cannot be written to a table when there are active transactions (explicit or implicit) on the table. MDL locks were introduced from MySQL5.5 to protect table metadata information and to resolve or ensure consistency between DDL operations and DML operations.
For the introduction of MDL, it mainly solves two problems:
For example, in the repeatable isolation level, session A modifies the table structure during the two query sessions. Therefore, the two query results are inconsistent and cannot meet the repeatable read requirements. For example, when session A executes multiple update statements, another session B changes the table structure and commits it first. As A result, replication errors will occur when the slave reperforms ALTER first and then update. Each performing a DML, DDL statements apply to MDL lock, DML operations need to MDL read lock, DDL operations require MDL write lock (MDL locking process is the automatic control system, can’t direct intervention, read sharing, read and write the mutex, write a mutex), application of MDL lock operation will form a queue, the queue to write lock acquisition priority than read lock.
Once an MDL write lock wait occurs, not only the current operation will be blocked, but all subsequent operations in this table will also be blocked (although online DDL was introduced in MySQL5.6 to degrade queued MDL write locks to prevent blocking of MDL read locks).
Lock time SELECT XXX Query statement In normal cases, snapshot data is read and locked only until the transaction ends. SELECT XXX LOCK IN SHARE MODE statement for current read, S LOCK and metadata read LOCK until end of transaction. The SELECT XXX FOR UPDATE statement locks the current read with X and metadata read locks until the end of the transaction. DML statements (INSERT, DELETE, UPDATE) are the current read, and X locks and metadata read locks are added until the end of the transaction. DDL statements (ALTER, CREATE, etc.) write locks to Canadian data and commit implicitly and cannot be rolled back until the end of the transaction. Why are DDL statements committed implicitly? Because DDL is a data definition language, it is responsible for creating, deleting, and modifying databases. When things go wrong, the consequences are likely to be incalculable. Commit at the end of each execution to ensure fluency, prevent data from blocking, and improve overall database performance.
Example: Because DDL statements are implicitly committed, if session A starts A transaction, DML operations are performed, DDL operations are performed, and then session A rolls back the transaction. The transaction rolled back by session A is an empty transaction because there is an implicit commit during the DDL operation
SQL statements that do not use indexes lock the entire table. SQL > alter table update; Use the prefix like; The field is not indexed; Database optimization moves index queries to full table scans, and so on. Mysql prior to version 5.6 locks tables when directly modifying table structures. “Query each table index and use the best index unless the optimizer decides that a table scan is more efficient. A single use scan is based on whether the best index spans more than 30% of the table, but a fixed percentage no longer determines whether to use an index or scan. Now the optimizer is more complex and estimates based on additional factors such as table size, number of rows, and I/O block size.” See Reference 1.
The following conclusions are based on MySQL5.6 and InnoDB’s default RR level. They are only used to understand the locking mechanism mentioned in this article. Sorry for any mistake.
INFORMATION_SCHEMA Provides access to database metadata and information about the MySQL server, such as database or table names, column data types, and access permissions. There is a collection of tables for the InnoDB database engine that record database transactions and locks.
MySQL > alter table transaction and lock;
SELECT * FROM information_schema.INNODB_TRX; SELECT * FROM information_schema.INNODB_LOCKs; SELECT * FROM information_schema.INNODB_LOCK_waits; show engine innodb status \G; You can see if there are table locks waiting or deadlocks in the query result. If a deadlock occurs, you can KILL the currently running transaction by killing trx_mysQL_thread_id.
The command line deadlock of query transactions and locks is a common problem in concurrent systems, and also appears in the scenario of concurrent read and write requests of MySQL database. A “deadlock” occurs when two or more transactions are waiting for each other to release a lock they already hold or are waiting for resources to be locked in a loop because they are not locked in the same order. A common error message is “no Deadlock found when trying to get lock…”
MySQL > select * from ‘MySQL’;
Deadlocks are reproduced by simulating concurrent transactions at multiple terminals. Use the above four commands to view transaction and lock information. You can view the execution plan through Explain. After deadlock exception occurs, InnoDB monitoring mechanism is enabled to obtain real-time deadlock information, which periodically (every 15 seconds) prints InnoDB running status to the error log file of mysqld service.
The Standard InnoDB Monitor and InnoDB Lock Monitor are important monitoring methods. They can be enabled by corresponding system parameters.
set GLOBAL innodb_status_output=ON; Enable standard monitoring set GLOBAL Innodb_STATus_output_LOCKS; In addition, MySQL provides a system parameter innodb_print_all_deadlocks specifically for logging deadlocks. When a deadlock occurs, the deadlock log is logged to the MySQL error log file.
In addition, MySQL provides a system parameter innodb_print_all_deadlocks specifically for logging deadlocks. When a deadlock occurs, the deadlock log is logged to MySQL’s error log file.
How to avoid deadlock as much as possible Reasonable index design, high differentiation of columns in front of the composite index, so that business SQL through the index to locate fewer rows, reduce lock contention. Try to find records by primary key/index, range lookup increases the possibility of lock conflicts, and do not use the database to do extra quota calculations. For example, some programs use select… The where… order by rand(); Statements like this do not use indexes and will result in the entire table being locked. Great things are divided into small things. Large transactions tend to be deadlocked, and if business permits, large transactions are broken down. Access tables and rows in a fixed order. For example, if two transactions update data, transaction A updates data in the order of 1,2; Transaction B updates data in the order of 2,1. This is more likely to cause deadlocks. Lower the isolation level. If services allow, it is also a good choice to lower the isolation level, for example, to change the isolation level from RR to RC to avoid many deadlocks caused by gap locks. Reference Documents:
www.searchdoc.cn/rdbms/mysql… Dev.mysql.com/doc/refman/… Dev.mysql.com/doc/refman/…