This is the 14th day of my participation in Gwen Challenge.
A previous article successfully implemented gTID-based Mysql master master synchronization. But the fundamental things are not well understood, such as what is GTID? What is the master/slave synchronization process? What are the pros and cons of GTID? So let’s plug in these problems and see how it works.
1. Master/slave synchronization process
1.1 Related concepts
InnoDB uses undo and redo logs to ensure atomicity, consistency, and persistence of transactions. It also uses write-ahead Logging to change random writes into sequential appending writes to improve transaction performance.
Binglog Binary logs, also called archive logs
The binlog log is used to record all updates and commit data or all data that has been potentially updated and committed data. Used for master-slave synchronization and time-based node restore.
Redolog redolog
Record the state of the transaction after it is about to change. When a transaction commits, the redo log is persisted and the data can be changed in memory. When the system crashes, data does not fall to the disk, but the redo log is persisted. The system restores all data to the latest state according to the redo log.
Undolog Rollback logs
Record the status of the transaction before it changes. Before operating on the data, the data is backed up to the Undo log and then modified. If an error occurs or the user performs a ROLLBACK statement, the system can use the historical version of the Undo log to restore the state before the transaction began.
1.2 Process Analysis
Master/standby synchronization flow chart (refer to the pictures on the Internet for further replacement)
After receiving the update request from the client, the master library performs the update logic of the internal transaction and writes the binlog at the same time
Standby database B maintains A long connection with primary database A. Primary library A has an internal thread dedicated to servicing this long connection for standby library B. The complete process of a transaction log synchronization looks like this:
Run the change master command on standby library B to set the IP, port, username, password of primary library A, and the location from which the binlog request should start. This location contains the file name and log offset. Run the start slave command on slave library B. In this case, the slave library starts two threads, namely io_thread and SQL_thread, as shown in the figure. Io_thread is responsible for establishing connections with the primary library. After verifying the user name and password, primary database A reads the binlog from the local database to the location sent by secondary database B and sends the log to secondary database B. Standby database B obtains the binlog and writes it to a local file, which is called the relay log. Sql_thread reads transfer logs, parses commands in logs, and executes them. Introduction of multi-threaded replication where SQL_threads evolve into multiple threads
2. GTID explanation
2.1 What is GTID
The Global Transaction Identifier (GTID) is the unique Identifier that is generated when a Transaction is committed. It consists of two parts in the format:
GTID=source_id:transaction_id
Copy the code
Among them:
Server_uuid is automatically generated when a database instance is first started and is a globally unique value; Transaction_id is an integer with an initial value of 1 that is assigned to the transaction each time a transaction is committed and incremented by 1. The transaction_ID is not the transaction ID we normally use. The transaction ID is assigned during the execution of the transaction. If the transaction is rolled back, the transaction ID will be incremented. The transaction_ID is assigned when the transaction commits, so the GTID tends to be sequential.
2.2 Generation mode of GTID
GTID can be generated in two ways, depending on the value of the session variable gtid_next.
In GTID mode, each transaction corresponds to a GTID. The GTID can be generated in two ways, depending on the value of the session variable gtid_next.
If gtid_next=automatic, the default value is used. At this point, MySQL assigns server_uuid:gno to the transaction.
- A. SET @@session. GTID_NEXT= ‘server_uuid:gno’;
- B. Add the GTID to the GTID collection of this instance.
If gtid_next is the value of a specified GTID, such as current_gTID specified by set gtid_next= ‘current_GTID’, then there are two possibilities:
- A. If the current_GTID already exists in the instance’s GTID set, the subsequent transaction will be ignored by the system.
- B. If the current_GTID does not exist in the instance’s GTID collection, the current_GTID will be assigned to the next transaction, meaning that the system does not need to generate a new GTID for the transaction and therefore does not need to increment gNO.
A CURRENT_GTID can only be used by one transaction. After this transaction commits, if the next transaction is to be executed, the set command is executed to set gtid_next to another GTID or automatic.
Each MySQL instance maintains a collection of GtiDs that correspond to “all transactions performed by the instance
2.3 Enabling GTID
When starting the database instance, set the following parameters
enforce_gtid_consistency=on
gtid_mode=on
Copy the code
Note that none of the above parameters is necessary
2.4 Advantages and disadvantages of GTID
advantages
- Set up convenient master/slave synchronization
To perform synchronization Settings on the slave, you only need to set change master to master_auto_position=1. You do not need to run the show master status command on the master to find the binlog log name and POSITION point. MySQL uses an internal mechanism called GTID to automatically find some synchronization.
- Support for multi-threaded replication (library-based)
Prior to Mysql5.6, Salve replication was single-threaded, event-by-event reading applied, but when the client concurrently wrote to the Master, there was a synchronization delay from the slave library. In MySQL 5.6, we can put multiple tables in multiple libraries, so that we can use multi-threaded replication, when there is only one library, multi-threaded replication is not useful
- Powerful failover capability
When the Master crashes, Salve must switch to Master. GTID provides powerful fault recovery capabilities
It is convenient to handle master-slave data conflicts and skip them by committing an empty transaction from the slave library
disadvantages
- Non-transactional engines are not supported
The CREATE TABLE… SELECT, temporary table, and other statements are not supported
The statement is split into createTable and INSERT transactions, and the two transactions are assigned the same GTID, causing the insert to be ignored by the standby database.
- There are Errant transaction
That is, transactions that do not normally execute from the master, but directly execute from the slave. Avoid such transactions. If data must be written on the slave, disable log bin. If such a situation occurs, you can resolve the problem by either executing an empty transaction (repeating the key value) or redoing the Slave (the Slave performs more transactions), depending on the case.