A database transaction is a sequence of database operations that access and may operate on various data items, all of which are executed or none of which are an indivisible unit of work. A transaction consists of all database operations performed between the beginning of a transaction and the end of a transaction. Four features of transactions:
- Atomicity: Transactions are either completely completed or completely cancelled. If the transaction crashes, the state goes back to before the transaction (transaction rollback).
- Isolation: If two transactions T1 and T2 are running at the same time, the final result of transaction T1 and T2 is the same regardless of which one finishes first.
- Durability: Once a transaction commits, data is saved in the database regardless of what happens (crashes or errors).
- Consistency: Only valid data (according to relational and function constraints) can be written to the database.
atomic
Several unknown affairs operate the same data at the same time, all operations are completed before the unified submission of the results, any operation failure to cancel the previous operation, to ensure that the operation will not be submitted to the database for preservation, to ensure that life and death together!
Isolation,
- Serializable (Serializable, SQLite default mode) : Highest level of isolation. Two simultaneous transactions are 100% isolated, each with its own “world” and executed sequentially.
- Repeatable read (MySQL default mode) : If a transaction executes successfully and new data is added (transaction commit), this data is visible to other ongoing transactions. However, if a transaction successfully modifies a piece of data, the result is not visible to the running transaction. Therefore, the isolation between transactions is broken only for new data and remains isolated for existing data.
- Read Committed (Oracle, PostgreSQL, SQL Server default mode) : Repeatable Read + new isolation break. If transaction A reads data D, and data D is then modified (or deleted) and committed by transaction B, the change (or deletion) to the data is visible when transaction A reads data D again. This is called non-repeatable read.
- Read Uncommitted: The lowest level of isolation, Read Committed + new isolation breaks. If transaction A reads data D and then data D is modified by transaction B (but not committed and transaction B is still running), the modification is visible when transaction A reads data D again. If transaction B rolls back, then the second read of data D by transaction A is meaningless because it is A modification made by transaction B that never happened (rollback already happened). This is called dirty read.
If the consistency requirements are not high, the general use of read submitted, because repeated read is not considered, in the lock is generally only record lock, not the use of gap lock, concurrency is better, it is said to use the most. A persistent transaction guarantees that data will not be lost, and that when a database is restarted after a crash due to force majeure, it guarantees that:
- In successful transactions, data is saved to disk
- For uncommitted transactions, data is rolled back
consistency
When a database recovers from a crash, the following steps occur:
- Parsing the existing transaction logs and analyzing which transactions need to be rolled back and which need to be written to disk (the database crashes before you can write to disk).
- Redo, write to disk. Check the LSN of the data page where the data resides. If the LSN of the data page is greater than or equal to the LSN of the transaction operation, the disk has been written. Otherwise, the disk is written.
- Undo: Rolls back the LSN in reverse order
After these phases, the database can be restored to the state before the crash and data consistency is ensured.