MVCC transaction versioning
1. Problems that may occur when multiple transactions operate on the same data
Dirty writes, dirty reads, unrepeatable reads, and phantom reads can occur when multiple transactions perform CRUD operations on the same data
1. Write dirty
So dirty write is if you have two transactions and transaction A updates the value of A and a generates an undo log and then transaction B updates the value of B and THEN B generates an undo log and at that point, a’s thread rolled back
This value was empty before a was updated and then all of a sudden A rolls back and then A changes that value to NULL based on undo log, and then A rolls back all of B to null and then when transaction B queries it becomes null and that’s dirty write
2. Dirty reads
So the problem with dirty reads is the same two transactions where transaction A updates a row but it’s not committed and then transaction B reads it and it reads the value of transaction A while transaction B is working on it and then transaction A rolls back all of a sudden and then transaction B reads the value of transaction A It turns out to be empty and this is dirty reading
Summary: Either dirty read or dirty write is a transaction that reads or modifs data from another transaction that has not yet been committed. It may change its behavior at any time, causing your data to be lost or the data that was previously checked to be lost. This is a dirty read or dirty write
3. Do not repeat the read
We assume that there is now A transaction opened A And this time the transaction b to update the data In transaction affairs before submitting A is b read less than the transaction data of b So you can avoid the dirty read and write dirty asked But at this time of a transaction at this time we opened a transaction for a data read many times But this time, Transaction B has changed and committed the transaction and the value that transaction A is reading has changed because transaction B has committed the transaction and the value that transaction A is reading is not going to be able to read if a is trying to read something that hasn’t changed during this transaction it’s not going to be able to read it, and that’s called unrepeatable read
4. The magic to read
Select * from xx where id>1 select * from xx where id>1; select * from xx where id>1; select * from xx where id>1 At this point, transaction A goes to the woC and finds out if I have an illusion or not, I had 10 data before and now I have 12 data suddenly, which is the illusion
These problems are all concurrent transaction problems, so the database design transaction isolation mechanism, MVCC multi-version isolation mechanism, locking mechanism
2. Database isolation mechanism
1. Four isolation mechanisms of the SQL standard
Read Uncommitted, Read committed, REPEATable Read, serialIZABLE Different isolation levels can solve different problems
Isolation level | Write * * * * is dirty | Dirty read | Phantom read | Phantom read |
---|---|---|---|---|
read uncommitted | false | true | true | true |
read committed | false | false | true | true |
repeatable read | false | false | false | true |
serializable | false | false | false | false |
This is a standard transaction isolation level and you don’t want to serialize it because serialization doesn’t allow you to serialize a transaction but Mysql has an isolation level of RR that allows you to avoid phantom reads and this is a standard transaction isolation level that allows you to avoid phantom reads and this is a standard transaction isolation level that allows you to serialize transactions RR transaction isolation solves the illusion problem
3. The MVCC mechanism prefixes the undo log version chain
Trx_id is the transaction ID of the last update of this transaction. Roll_pointer refers to the undo log generated before you update this transaction. We’ve already talked about undo log, so we don’t need to say more here.
For example
So let’s say we have a transaction a with id=50 that inserts a data and the hidden field is pointing to an empty undolog and the inserted value is a rool_pointer pointing to an empty undolog so there was no value before.
If a transaction b changes its value to B, a new undo log is generated and roll_pointer points to the value that was actually rolled back.
Now there’s another transaction C and that’s when transaction C will point to the version chain of the transaction B that you just modified
This is the concept of txr_id and ROLL_pinter hidden fields, including the concept of multiple versions of the undo log chain, when multiple transactions serially update a row of data!
What exactly is a ReadView
ReadView: when you perform a transaction, a ReadView is generated for you. There are four key items in the ReadView
MySQL > select * from ‘m_IDS’; MySQL > select * from ‘m_ids’;
2. One is min_txr_id, which is the smallest value in m_IDS
3. Max_txr_id = max_txr_id = max_txr_id
4. Creator_txr_id is the id of your current transaction
(id=45); (id= 59); (id= 59); (id= 59)
Now transaction A will open a ReadView that contains transaction IDS 45 and 60 and then transaction A will query this row of data to determine whether the transaction ID txr_id is less than the minimum id in the ReadView At this time, txr_id =32 is less than the ID in ReadView, indicating that the data is submitted before the transaction starts, so the query data is the data with ID 32
- If transaction B changes this value and commits this query to transaction B’s ID59, then m_IDS will be removed if the value is greater than min_txr_id and less than max_txr_id If transaction id59 exists in m_ids, then check the undo log and find the value
- If transaction A updates this value on its own and the transaction ID is 45, then it saves a snapshot of transaction B and when transaction A looks at it and finds that transaction ID is 45 then it can read it because it changed the value itself
-
And then all of a sudden in the middle of transaction A it starts transaction C and it has id 78 and then it updates that value to C and commits the transaction
The value of trx_id=45 is the same as the value of creator_trx_id in ReadView. The value of trx_id=45 is the same as that of creator_trx_id in ReadView. Therefore, I directly read the version I modified before, as shown in the figure below.
ReadView is a way to make sure that you can only read the value that was committed by other transactions before you started the transaction and you can’t read the value that was committed after you started the transaction or you can’t read the value that was committed after you started the transaction
5. How is the RC level implemented according to readView
Suppose we have A row in our database that was inserted before A transaction with transaction ID =50, and now two transactions are active, transaction A (ID =60) and transaction B (ID =70), as shown in the figure below.
Then transaction B does an update and the transaction ID of the data changes to 70 and generates an undo log that roll_pointer points to
So transaction A comes in and finds that the transaction ID of this data is 70, that is, between the transaction IDS of the ReadView, which means that there was an active transaction before the ReadView was generated, and the transaction changed the value of this data, but transaction B hasn’t committed yet, ReadView’s m_IDS active transaction list contains two ids [60, 70]. Therefore, according to ReadView mechanism, transaction A cannot find the value B changed by transaction B.
And then when transaction B commits when transaction A queries transaction A it’s going to generate a new ReadView because transaction B commits when transaction B is not in m_IDS and the value that transaction A reads is the value of transaction B with ID =70
This is the implementation of Rc +readView
6. How is RR level implemented according to readView
Suppose we have A row in our database that was inserted before A transaction with transaction ID =50, and now two transactions are active, transaction A (ID =60) and transaction B (ID =70), as shown in the figure below.
Then transaction B does an update and the transaction ID of the data changes to 70 and generates an undo log that roll_pointer points to
So transaction A comes in and finds that the transaction ID of this data is 70, that is, between the transaction IDS of the ReadView, which means that there was an active transaction before the ReadView was generated, and the transaction changed the value of this data, but transaction B hasn’t committed yet, ReadView’s m_IDS active transaction list contains two ids [60, 70]. Therefore, according to ReadView mechanism, transaction A cannot find the value B changed by transaction B.
And then at that point transaction B commits at that point transaction A queries the RR level ReadView and once it’s generated it doesn’t change it started with 60 and 70 so when transaction A queries again it’s still only going to have two transactions in it so it’s going to go down the undolog chain The value of transaction B is not read. This is the implementation of undolog version chain +ReadView at RR level