The SQL standard defines four classes of isolation levels, including specific rules that define which changes are visible and which are visible within and outside a transaction. Low-level isolation levels generally support higher concurrency and have lower system overhead.
Basic Elements of Transaction (ACID)
1. Atomicity: After a transaction starts, all operations are either finished or not done. It is impossible to stop in the middle. If an error occurs during the execution of a transaction, it will be rolled back to the state before the transaction began, and all operations will be as if they did not happen. That is to say, the transaction is an indivisible whole.
2. Consistency: The integrity constraint of the database is not broken before and after the transaction. For example, if A transfers money to B, it is impossible for A to withhold money and B does not receive it.
3. Isolation: Only one transaction is allowed to request the same data at the same time, and there is no interference between different transactions. For example, if A is withdrawing money from A bank card, B cannot transfer money to the card until the withdrawal process is complete.
4, Durability: When a transaction completes, all updates made by a transaction to the database are saved to the database and cannot be rolled back.
Summary: Atomicity is the basis of transaction isolation, isolation and persistence are the means, the ultimate goal is to maintain data consistency.
Second, the concurrency of transactions
1, dirty reads: transaction reads the transaction B updated data, then A RollBack operation, B so dirty data is read into A data (A transaction has been updated to A data, at the same time, the another transaction reads the same data, for some reason, before A RollBack operation, is A firm after read data is not correct)
2, non-repeatable reads: transaction reads the same data for many times, A transaction B in the process of transaction A read many times, to make the data update and submit, lead to A multiple times while reading the same data, inconsistent results (in A single transaction data inconsistency of the two queries, it may be two query process into the middle of A transaction updating original data).
3, magic reading: System administrator A changed the score of all the students in the database from the specific score to ABCDE, but system administrator B inserted A record of the specific score at this time. When system administrator A finished the change, he found that there was still A record that had not been changed, just as if he had hallucinated. This is called magic reading (if the number of pens in a transaction is inconsistent between the two queries, for example, if one transaction queries several rows and another transaction inserts new columns at the same time, the previous transaction will find several columns that it did not have before in subsequent queries).
Summary: It’s easy to confuse unrepeatable reading with magic reading. Unrepeatable reading focuses on modification, while magic reading focuses on addition or deletion. To solve the problem of unrepeatable reads, you only need to lock the rows that meet the condition. To solve phantom reads, you need to lock the table
In short:
- Dirty read: A transaction in one thread reads uncommitted data from another thread.
- Non-repeatable read (virtual read) : a transaction in one thread reads an Update submitted by another thread.
- Phantom read: when a transaction in one thread reads data from an INSERT submitted by another thread.
Three and four isolation levels are introduced
At this isolation level, all transactions can see the results of other Uncommitted transactions. This isolation level is rarely used in real-world applications because its performance is not much better than other levels. Reading uncommitted data is also known as Dirty reads. When formatting binary logs, the RECOMMENDED format is The ROW format, which satisfies a simple definition of isolation: a transaction can only see the changes made by Committed transactions. This isolation level also supports so-called NonrepeatableRead, because other instances of the same transaction may have new commits in the process of that instance, so the same SELECT may return different results. Repeatable Read This is MySQL’s default transaction isolation level and ensures that multiple instances of the same transaction will see the same rows of data when reading data concurrently. In theory, though, this leads to another thorny problem: Phantom Read. Simply put, phantom reading refers to when a user reads a row in a certain range, another transaction inserts a new row in that range, and when the user reads a row in that range, a new phantom row is found. InnoDB and Falcon storage engines address this problem through the Multiversion Concurrency Control (MVCC) mechanism. Serializable This is the highest isolation level and addresses the phantom read problem by forcing transactions to be ordered so that they cannot conflict with each other. In short, it places a shared lock on each read row. At this level, a lot of timeouts and lock contention can result.
4. Operation demonstration
Check out the Mysql version
Check InnoDB storage engine’s system-level isolation level and session-level isolation level. The default isolation level is REPEATable -read
select @@global.tx_isolation,@@tx_isolation; Copy the code
1. Read unsubmitted:
(1) open client A, set the current transaction mode to read uncommitted, and query the value of user:
(2) Open another client B and update table user before client A commits transaction:
(3) At this time, although the transaction of client B has not been committed, client A can query the updated data of B:
(4) Once client B’s transaction is rolled back for some reason, all operations will be cancelled, and the data queried by client A is actually dirty data:
You can solve this problem by reading the committed isolation level
2, Read committed
(1) open client A, set current transaction mode to Read COMMITTED, and query the value of user:
(2) Open another client B and update table user before client A commits transaction:
(3) At this time, the transaction of client B has not been committed, and client A cannot query the updated data of client B, which solves the dirty read problem
(4) After the transaction of client B is committed with the COMMIT command, check the data of client A
(5) Client A performs the same query as the previous step, but the result is inconsistent with the previous step, that is, the problem of unrepeatable reads is caused. In the application, let’s assume that we are in the session of client A, and the amount value of 1 is 5000, but other transactions change the amount value of 1 to 4900, we do not know. If you use the value 4900 to do something else, it’s a problem, but it’s really rare. To avoid this problem, you can use a repeatable read isolation level.
3. Repeat
(1) Open client A, set transaction mode to REPEATable read, query the initial value of table user:
(2) on the client side before A transaction is committed, open another client B, user and submit update tables, client B transactions actually can modify the client to the line of A transaction query, namely mysql repeatable read queries to the line, don’t lock the affairs in the SQL standard transaction isolation level for repeatable read, read and write operations to the lock, Mysql does not have a lock. Be careful to lock rows in your application, or you will use the amount of ID 1 in step (1) as an intermediate value of 5000 to do something else.
(3) Perform the query in Step (1) on client A:
(4) After performing step (1), the amount value of STEP 1 is 50000, which is the same as the query result of Step (1). There is no unrepeatable read problem. Update user set amount = amount-100 where id = 1 where amount = 5000-100 = 4900 So the data is 4800, and the consistency of the data is not broken!
(5) Start the transaction on client A and query the initial value of the user table
(6) Start the transaction on client B, add a new data, the amount field value is 8000, and commit
(7) The sum of amount calculated on client A is 4800+3500+6700=15000, and the value of client B is not taken into account. After client A submits, the sum of amount calculated on client A is 23000, which is because 8000 of client B is included. From the perspective of the client, Client B will not be visible to the client, it will feel like the world is falling, 8000 dollars more, this is unreal, from the developer’s point of view, data consistency is not broken. But in an application, our code might commit 15000 to the user. If you want to avoid this low probability scenario, then serialize the transaction isolation level described below.
serialization
(1) open client A and set transaction mode to SERIalIZABLE;
(2) open a client B and set the transaction mode to serialIZABLE, insert a record error, the table is locked, insert failure, mysql transaction isolation level is serializable when the table lock, so there is no magic read situation, this isolation level is very low concurrency, often a transaction occupied a table. There are thousands of other transactions that just sit around and wait until they’re finished committing and are rarely used in development.
Supplement:
The default transaction isolation level in mysql is that repeatable reads do not lock read rows
When the transaction isolation level is serialized, reading data locks the entire table
3, the time you read this, if you stand in the Angle of the developer, perhaps will feel not repeatable read and illusions, logically and do not have what problem, the final data is consistent, still but stand in the user’s point of view, they usually can only see A transaction (can only see the client A, don’t know the client B) to the existence of this undercover, without considering the transaction the phenomenon of concurrent execution, Once the same data is read multiple times with different results, or a new record appears out of thin air, they may be suspicious. This is a user experience problem.
4. Transaction in mysql executes, the end result will not appear the consistency of the data, because in a transaction, mysql perform an operation may not be the middle results of an operation before use, it will according to the actual situation of other concurrent transaction processing, seems illogical, but ensures the consistency of data; But when a transaction is executed within an application, the results of one operation are used by the next operation and other calculations are performed. This is we have to be careful, repeatable read should lock the row, serialization should lock the table, otherwise it will break the consistency of data.
5, when the transaction is executed in mysql, mysql will comprehensively deal with the actual situation of each transaction, resulting in the consistency of data is not destroyed, but the application program according to the logical routine to play cards, and not as smart as mysql, there will inevitably be data consistency problems.
The higher the isolation level, the greater the data integrity and consistency, but the greater the impact on concurrency performance. You can’t have it both ways. For most applications, the priority is to set the isolation level of the database system to Read Committed, which avoids dirty reads and has good concurrency performance. Although it can cause concurrent problems such as unrepeatable reads and phantom reads, it can be controlled by applications using pessimistic or optimistic locking in the rare cases where such problems may occur.
I hope the above content can help you. Many PHPer will encounter some problems and bottlenecks when they are advanced, and they have no sense of direction when writing too many business codes. I have sorted out some information, including but not limited to: Distributed architecture, high scalability, high performance, high concurrency, server performance tuning, TP6, Laravel, YII2, Redis, Swoft, Kafka, Mysql optimization, shell scripting, Docker, microservices, Nginx, etc. Many knowledge points are free to share with you