The data collected by crawler can be stored in data sets besides text files and Excel, such as Mysql, Redis, mongodb, etc. Today, Brother Chen will teach you how to use Python to connect to Mysql, and explain to you with crawler.
Prerequisite: You have installed mysql by default.
01
Introduction of Mysql
Mysql is a relational database that supports large databases and can handle large databases with tens of millions of records. After the data set collected by crawler is stored in mysql, related data can be taken out in one step with the help of mysql’s associated query. The specific functions are not described here, and the actual operation begins below.
1. Install pymysql
Install with the following command
pip install pymysql
Copy the code
Pymysql library: Python3 links to mysql
Remark:
Ps :MYSQLdb only works with python2.x
Python3 does not support MYSQLdb, but pymysql instead
ImportError: No module named ‘MYSQLdb’
2. The python mysql connection
import pymysql as pmq
Copy the code
Localhost is the local IP address. In this case, localhost indicates the current host. Otherwise, change localhost to the corresponding database IP address.
Root is the database user name, 123456 is the database password, and PYTHon_chenge is the database name.
Python_chenge mysql > create table python_chenge mysql > create table python_chenge mysql > create table python_chenge mysql > create table python_chenge mysql > create table python_chenge
02
Build table
Before storing, create a table in Python with four fields (a primary key + movie name, link, rating)
Create the movie tableCopy the code
Create table movie with fields (ID,title, URL,rate), CHARACTER SET UTF8 COLLATE UTF8_general_ci is a string encoding SET to UTF8 format
Id is the primary key whose type is int, AUTO_INCREMENT, and not null
Title, url is a string of type varchar(100), also non-empty
The rate is a number with a decimal, so it is a float
03
Insert data
The crawler has collected the data, Python has built the table, and you can then insert the collected data into the database in two ways
Insert dataCopy the code
The ID is incremented, so you don’t need to pass it in.
Once you have defined the insert method, start storing to the database
for i in json_data['subjects']:
Copy the code
04
The query
1. Query all information
Query all data in the table
# queryCopy the code
2. Query the specified data
For example, query all fields of the data titled: Chinatown 3
# query a single entryCopy the code
05
Update the change
Update the data, again using the example above: Chinatown 3 with id 7, change the score of Chinatown 3 from 5.5 to 6
Update # # #Copy the code
Also look at the database
06
delete
Similarly, take Chinatown as an example. Its ID is 7. If it is deleted, we can update the ID to delete it
def delete(Id):
Copy the code
After the deletion, there is no item 7 data, indicating that the deletion is successful
07
summary
This is the end of today’s technical explanation article, mainly will be how to use Python to connect to mysql, and to create tables, insert data, query, update modify and delete. (Dry articles, recommended collection)
Recommended reading
Photos blurry? Let’s use Python to make photos in HIGH definition
Sampling + map visualization | taught you how to star character and graph display
Seven national censuses from 1953 to 2021 and the population changes of each province