Database knowledge is a course that every computer student has learned, but for me, after the course, most of the database knowledge I have learned is returned to the teacher, unless I use it in the project, otherwise I don’t have much chance to contact with it. By chance, I saw this “SQL will know will know” on the desk of Java students, and I took it to flip over, thinking that the content is good, suitable for reviewing the standard grammar of DBMS, and also suitable for entry, very clear. Of course, the purpose of this book is just to brush up on the grammar, the more advanced or lower level of database knowledge is not mentioned in this book, you can only find another book or take out the textbook to read.

The structure of this note is the same as that of the book. It is arranged according to the table of contents of the book. However, the content and expression are not necessarily according to the original text of the book, and may be modified according to their own feelings.

Chapter 1: Learn about SQL

1.1 database

A database is a container that holds organized data. The word “organized” here is interesting, which is the data organization structure at the bottom of the database, which will not be involved in this book.

You can simply think of a database as a file cabinet, where you store the different data in the file cabinet.

Note that there are two concepts that should be distinguished: database (DB) and database management system (DBMS) :

  1. Database: a container for storing data
  2. Database Management System: A System that manages the data in a Database. For example, adding, deleting, modifying, and querying the data in a Database are all operated through this System.

1.1.2 table

Tables are equivalent to cabinets in file cabinets. Each cabinet stores different files and has a different organizational structure for the files, that is, each cabinet has its own data structure.

More specific examples are as follows: the student table is one cabinet, and the class table is another cabinet. The files in the cabinet of the student table contain the student ID, name, gender and other fields; Class table contains class code, class name, number of students and other fields. These different fields can be thought of as the data structure of the files in each cabinet.

Table names in the same database should be unique. Table names in different databases can be the same.

1.1.3 Columns and data types

A column is a field in a table, such as student name, student NUMBER, gender, and so on.

Columns that hold different data can be of different types, such as name as a string type, age as a numeric type, date of enrollment as a date type, and so on.

Note: Different DBMSS may have different support for data types, and even different names for the same data types in different DBMSS.

1.1.4 line

A row is a record in a table, that is, a stored piece of data.

1.1.5 the primary key

Each table should have a column to uniquely represent each row of data, for example, each person’s ID card can uniquely represent one person, each student’s student number can uniquely represent each student, this type of column can be used as the primary key.

Note:

  1. Each table should have a primary key defined
  2. Not only one column can be used as a primary key, but a combination of multiple columns can also be used as a primary key, as long as the combination of the columns is different.

1.2 What is SQL

SQL, pronounced as S-Q-L or SEQUEL, is a Structured Query Language used to perform complex operations on data in a database by combining simple keywords, such as adding, deleting, modifying, and querying.

The above concepts may be a little abstract for beginners, but after learning and operating, you will have a more concrete feeling. So practice more.

Chapter 2 data Retrieval

2.1 SELECT Keyword

To query data from a database, use the SELECT keyword. You must also know what to look up and where to look up, specifically which column (field) data from which table. For example, to query the saved product name from the product table:

SELECT prod_name FROM Product
Copy the code

You can print all the values of prod_NAME saved in the Product table.

Note:

  1. SQL statements are case-insensitive, but the keyword is capitalized, column names are capitalized, and table names are capitalized.
  2. SQL can be written in multiple lines, one keyword per line, for more clarity, as in the example above.
SELECT prod_name 
FROM Product
Copy the code

2.3 Check the data of multiple columns

In the example in the previous section, only the prod_NAME column for the item name was retrieved. If you want to look up other item attributes, such as item price prod_price, you can separate the column names with commas:

SELECT prod_name, prod_price
FROM Product
Copy the code

You can find out the name and price of all the goods.

2.4 Retrieve all columns

The wildcard * character is used to query data for all columns in a table

SELECT *
FROM Product
Copy the code

However, it should be noted that such operation performance is not good.

2.5 Retrieving different values

If you want the output to have no duplicate values, you can use the DISTINCT keyword to modify the SELECT, for example, if you want to query for different commodity prices in the table:

SELECT DISTINCT prod_price
FROM Product
Copy the code

Each value in the output is different, that is, it does not contain the same price.

2.6 Limiting query results

If you want to query the first row or a few rows, you can limit, but different DBMS implementation is different, for example, in SQL Server with the TOP keyword to specify the first n rows of data query, for example:

SELECT TOP 5 prod_name
FROM Product
Copy the code

In this way, the first five trade names can be found.

2.7 Writing of notes

There are two ways to write comments in SQL:

  1. Inline comments, starting with two consecutive horizontal bars, for example:
SELECT TOP 5 prod_name -- This is the comment, query the first 5 item names
FROM Product
Copy the code
  1. Newline comments, like typical high-level programming languages, are used/ * * /To enclose comments, for example:
/* This is a comment on the first 5 items */
SELECT TOP 5 prod_name
FROM Product
Copy the code

Chapter 3 sorts the output of the search results

When the display order of the queried data is not specified, the retrieved data is unordered and is not necessarily determined by the time the data was inserted. When you specify the ORDER of output, use the ORDER BY keyword and specify that the letters are ASCENDING or descending, with ASCENDING indicated BY ASC (ASCENDING) and descending indicated BY DESC(DESENDING).

For example, specify sorting by item name:

SELECT prod_name 
FROM Product
ORDER BY prod_price DESC
Copy the code

The output commodity name is in order of the commodity price from largest to smallest.

3.2 Sort by multiple columns (fields)

Take the above example as an example. If there are multiple commodities with the same price, how should the commodities with the same price be sorted? Can you specify it? The answer is yes, by specifying the second column name after the first. That is, if the first column name is identical, the second column name is sorted. Similarly, multiple column names are sorted in sequence. For example, the same item name is sorted by item ID in descending order.

SELECT prod_name 
FROM Product
ORDER BY prod_price, prod_id DESC
Copy the code

Note: The sort statement ORDER BY should be placed at the end of the query, otherwise an error will be reported.

Chapter 4 filters data and specifies search criteria

Described above retrieval statement is seldom used, because it will be a column to retrieve data from the if you want to retrieve data according to a certain condition, for example, the query cost less than 100 what is the commodity name, will be carried out in accordance with the price terms query, conditions to use the WHERE keyword to specify, such as the above example use SQL language is expressed as:

SELECT prod_name 
FROM Product
WHERE prod_price < 100
Copy the code

4.1 the operator

The above example only shows the meaning of less than, but also greater than, equal to, not equal to, within a numeric segment, not NULL, and so on. Many DBMSS have operators that are unique to them.

More examples:

  • BETWEEN ANDOperator to query data within a numeric range, such as the name of an item with a price between 50 and 100
SELECT prod_name 
FROM Product
WHERE prod_price BETWEEN 50 AND 100
Copy the code
  • Query the price reduction record asNULLThe field of
SELECT prod_name 
FROM Product
WHERE prod_price_reduction IS NULL
Copy the code

See books for more information.

Chapter 5 Advanced data filtering

This chapter focuses on the combination of multiple conditions, mainly involving the following operators.

  1. ANDOperator to join multiple query criteria, take the intersection of multiple criteria, such as the name of the toy car and the price of less than 100 goods ID:
SELECT prod_id 
FROM Product
WHERE prod_name = 'Toy car' AND prod_price < 100
Copy the code
  1. OROperator to join multiple query conditions, for example, the name of the query is toy car or toy plane product ID:
SELECT prod_id 
FROM Product
WHERE prod_name = 'Toy car' OR prod_name = 'Toy Plane'
Copy the code

Note: When combining multiple AND AND AND OR operators, pay attention to the order of the conditions, because AND is treated more preferentially than OR. Sometimes the result is not what we expect, so we should explicitly group multiple operators with parentheses, for example:

SELECT prod_id 
FROM Product
WHERE prod_name = 'Toy car' OR prod_name = 'Toy Plane' AND prod_price < 100
Copy the code

We expect to use the above code to get product ids for toy cars and toy planes that cost less than 100.

However, the above code actually yields a combination of the product ID with the product name toy car and the product ID with the product name toy Plane and the price < 100.

If you want to get the expected result, you group the above conditions with parentheses as follows:

SELECT prod_id 
FROM Product
WHERE (prod_name = 'Toy car' OR prod_name = 'Toy Plane') AND prod_price < 100
Copy the code
  1. INOperator to query data that satisfies one of several criteria, equivalent toORShort for operator, such as find the name of the toy car or toy plane item ID in addition to useORIn addition to doing it, you can use itINLet’s do it as follows:
SELECT prod_id 
FROM Product
WHERE prod_name IN ('Toy car'.'Toy Plane')
Copy the code
  1. NOTThe operator can also be used to query data that is not a condition! =For example, if the name of the item is not the id of the toy car:
SELECT prod_id 
FROM Product
WHERE NOTProd_name = toy carCopy the code

Chapter 6 wildcards

Previously, query targets were explicitly conditional, and wildcards were used when you wanted to find an item name that contained some string in its name. Use the LIKE keyword with the wildcard characters %,_,[], etc. Where % can be used for any number of characters and _ is used for one character.

The content here is similar to regular expressions, but not exactly the same.

Specific examples are as follows:

  1. use%Wildcard to match multiple characters

Find an item name beginning with ‘Fish’ :

SELECT prod_name 
FROM Product
WHERE NOT prod_name LIKE 'Fish%'
Copy the code

Find the product name with ‘Fish’ in the middle of the name:

SELECT prod_name 
FROM Product
WHERE NOT prod_name LIKE '%Fish%'
Copy the code

Find the product name beginning with F and ending with H in the name:

SELECT prod_name 
FROM Product
WHERE NOT prod_name LIKE 'F%h'
Copy the code
  1. use_Wildcard to match a character
SELECT prod_name 
FROM Product
WHERE NOT prod_name LIKE 'F__h'
Copy the code
  1. use[]To match one of the characters in parentheses, search for strings starting with a, B, or C
SELECT prod_name 
FROM Product
WHERE NOT prod_name LIKE '[abc]%'
Copy the code

Note: Wildcards can cause performance problems, so don’t abuse them. If you do use wildcards, be careful not to place them at the beginning of the query.

Wildcards for the same function may differ in different DBMSS. For example, % is * in Access.

Chapter 7 uses computed attributes

In the example above, we directly output the queried data, which is at best sorted, but we can do more with the data, such as joining the two queried fields, for example, the name of the product and the price of the output, to form the following example: toy plane -199. You can also sum up the numeric data from the query and output it.

  • Connect different fields with+This syntax is similar to Java and JavaScript string concatenation operations, for example:
SELECT prod_name + The '-' + prod_price
FROM Product
Copy the code

The output looks like the following:

Toy plane -199Toy car -99
Copy the code
  • You can also use a separate name for the result of the join to facilitate subsequent operationsASFor example, if the result above is named prod_shot, it can be written like this:
SELECT prod_name + The '-' + prod_price AS prod_shot
FROM Product
Copy the code
prod_shot
-------------Toy plane - 199 toy car - 99Copy the code
  • Perform an arithmetic operation on a number, such as unit price * quantity to calculate the total price:

    SELECT prod_count, prod_price, prod_count * prod_price AS expend_price
    FROM Product
    Copy the code

    The following output is displayed:

    prod_count	prod_price	expend_price
    ----------	----------	------------10, 15, 150, 2, 20, 40Copy the code
  • Use functions on columns

    Columns can be used to manipulate the results of a query, such as TRIM() if left and right whitespace are removed:

    SELECT TRIM(prod_name) + The '-' + prod_price AS prod_shot
    FROM Product
    Copy the code

    Prod_name is concatenated with whitespace removed from both sides.

More functions include RTRIM to remove Spaces on the right, LTRIM to remove Spaces on the left, and so on.

Chapter 8 uses functions to process data

The three functions for removing whitespace were mentioned in the previous chapter, but there are many more functions available, including text handlers, date and time handlers, and numeric handlers. Typical examples are as follows:

  1. Text processing function

    String manipulation functions include UPPER to convert a string to uppercase, LENGTH to return the LENGTH of a string, SOUNDEX to find similar-sounding strings, and more.

    The third function mentioned above, SOUNDEX, is interesting. If you want to find ‘Michaelle’, but write ‘Michael’ in the search, the above method will not be found. But the two words sound similar and can be found using the SOUNDEX keyword as follows:

    SELECT prod_name
    FROM Product
    WHERE SOUNDEX(prod_name) = SOUNDEX('Michael')
    Copy the code

    You can successfully find ‘Michaelle’ by using the code above.

  2. Date handler function

    Example: Find the name of an item with a production date of 2012, and use the DATEPART function to extract a specific part of the date:

    SELECT prod_name
    FROM Product
    WHERE DATEPART('yyyy', prod_date) = 2012
    Copy the code
  3. Numerical processing function

    I’m going to do some manipulation on numbers, ABS, COS, EXP, PI, SIN, TAN, SQRT.

Chapter 9 summary functions

  • Sometimes the need is not to know what the specific data are, but just need to know the statistics of the data, such as average, total, maximum, minimum and other data, can be solved by the function. Using the average function as an example, query the average of all commodity prices:
SELECT AVG(prod_price) AS avg_price
FROM Product
Copy the code

The output result is the average of all commodity prices.

  • And the same thing as aboveCOUNT MAX MIN SUM.
  • Can be combined withDISTINCTThe keyword specifies that only all different data will be counted, for example:
SELECT AVG(DISTINCT prod_price) AS avg_price
FROM Product
Copy the code

Instead of averaging all the data, you average only the different values.

  • You can use more than one statistical function simultaneously

    SELECT AVG(DISTINCT prod_price) AS avg_price,
    		MAX(DISTINCT prod_price) AS max_price,
    		SUM(DISTINCT prod_price) AS sum_price
    FROM Product
    Copy the code

Note: Using this function to get results is faster than reading all the data and then counting.

Chapter 10 Grouping data

If you want to GROUP a column or field, you can use the GROUP BY statement. If you want to filter groups, you can use the HAVING keyword.

Here the concept of a group is abstract, and an example makes it easy to understand. Select * from XXX, select * from XXX, select * from XXX, select * from XXX

SELECT prod_id, COUNT(*) as num_prods
FROM Product
GROUP BY prod_id
Copy the code

Output:

prod_id		num_prods
--------	----------1, 5, 3, 100, 2, 55Copy the code

This will count for each ID instead of counting for the entire table

When you want to display only ids whose count is greater than 50 and their count, you can filter by using the HAVING keyword:

SELECT prod_id, COUNT(*) as num_prods
FROM Product
GROUP BY prod_id
HAVING COUNT(*) > 50
Copy the code

Output:

prod_id		num_prods
--------	----------
3			100
2			55
Copy the code

Only the result if count > 50 is displayed.

Remember that you can still use ORDER BY to sort each group, for example:

SELECT prod_id, COUNT(*) as num_prods
FROM Product
GROUP BY prod_id
HAVING COUNT(*) > 50
ORDER BY num_prods
Copy the code

Output:

prod_id		num_prods
--------	----------2 55 3, 100Copy the code

Chapter 11 uses subqueries

Subqueries are simply nested queries within a query, using the results of one query as criteria for another query.

SELECT cust_name, cust_contact
FROM Customers
WHERE cust_id IN (SELECT cust_id
                 FROM Orders
                 WHERE order_num IN(SELECT order_num
                                   FROM OrderItems
                                   WHERE prod_id = 'RGAN01'))Copy the code

You can see that there are three SELECT statements above, and the three SELECT statements are not independent, but nested with each other, so it can be considered as 3 layers.

The search results are evaluated from the inside out, that is, the order in which the innermost SELECT returns the results of the next inner WHERE keyword, and so forth, and finally returns the results of the outermost query.

Note that all but the outermost query statements can only query a single column, meaning that the return value can only be a single column.