Author | Basic Innovation – Zhang Jian
Introduction: The IEMS energy management system independently developed by Hangzhou Basic Innovation Technology Co., Ltd. is a big data platform for the Internet of Things to collect and process the classified energy consumption data of buildings, such as water energy and electric energy, and analyze the building energy consumption status to realize the application of building energy conservation and other functions. Before, the platform used MySQL to store meter reading data, and when the data amount reached 15 million pieces, there would be a long query delay or even stuck. Therefore, TDengine was adopted, and the performance was nearly 50 times higher than MySQL by comparison.
background
The largest amount of data in the energy management system is the data reported by each intelligent meter reading and sensor. In July 2017, hangzhou began to launch smart meter reading for natural gas. By 2018, 370,000 users had access, and the number of users exceeded 770,000. In addition, smart water meters, electricity meters and so on are constantly reported data.
These intelligent meter reading data to be recorded for a long time, and regular query, statistics generated reports; In the system, it is also necessary to display the real-time reading and historical change curve of the device when the user clicks to view the specific device.
With the continuous expansion of the company’s business, we also need a general platform to monitor and store all customers’ data. The traditional relational database is no longer enough to support the business volume, so it is very important to choose a suitable database for the structural characteristics of the time sequence of meter reading data. In this paper, we will introduce the use of TDengine, the most popular temporal data analysis engine, in hangzhou basic Innovation IEMS system.
IEMS energy management system consists of the master station, NB Bluetooth acquisition terminal (not needed for wireless communication), metering equipment, etc. The main station of the system is composed of communication service, business application, database, interface and background service. Database service consists of data storage, computing service and data presentation. After user opens the metering function, through network prepaid watt-hour meter and the system software, can realize the prepaid management and utilization of online data acquisition meter reading, cost statistics, the basic function such as prepaid prepaid phone sell electricity business management, realize the lack of power, can send after top-up, facilitate the management of remote energy management mode. The whole system architecture is relatively large, including the hierarchical structure as shown in the figure below.
In the development iteration of IEMS energy management system, the company began to use TDengine timing database, which is mainly used for the storage and management of data reported by various intelligent meter reading devices of industrial and commercial hydropower energy management and monitoring system, and built a real-time and historical data storage capable of bearing millions of meter reading numbers (as shown in the figure below) : Intelligent meter reading data is sent to the system through the concentrator, and the system writes the data into TDengine database in real time. Then based on TDengine data platform, further do a variety of business related application layer development, such as equipment data real-time query, historical record AD hoc query and some periodic report work.
Why TDengine in IEMS system?
In previous versions of the system, we used MySQL database to store device reporting data. However, with the increasing number of devices connected to the system and users’ higher requirements for real-time data feedback, MySQL has been unable to meet business requirements. We found it a better solution to store the device data in the timing database TDengine.
Query efficiency improvement
When storing meter reading data in MySQL, the company’s smart electricity meter, water meter and other devices are stored separately, and the data of similar devices are stored in a large table. At present, the reporting frequency of meter reading data is generally at the level of minutes (5-30 minutes). The simultaneous writing pressure of 2000 smart meters in the first pilot district is not too great, and MySQL can carry it. However, as the number of access tables increases and the storage time range increases, the query efficiency will be caused by the excessive accumulation of data records in each type of device. Still smart meters pilot, for example, when we import the data from 1700 meters only 30 days after the meter to the total number of records to reach close to 15 million, we again for some users monthly electricity consumption statistics report, query in larger query delay even jammed, query a device’s latest reading speed is greatly decreased. And that’s the number of devices in just one pilot area.
So we had to consider partitioning, libraries, and tables to optimize, with each type of device being written into a series of tables over a 7-day time horizon; Different cells are dealt with by sub-library. This ensures that there are not too many entries in each table and that querying data for a given day is not too slow and slow.
However, the operation of database and table has brought a lot of trouble to the business layer, often encountered some queries need to deal with a variety of cross-table, cross-library queries, logic complex error prone. After getting into the timing database TDengine, we found that it had solved the problem of data partitioning in this temporal dimension at its storage level. TDengine stores data files according to the time range. The default time range is 10 days. The data file name corresponds to a time range.
Therefore, when querying the data of a certain day long ago, the time filtering range entered in the retrieval can be used to quickly find the file where the data is located and search this file. In essence, the purpose of MySQL database partition is the same as before, which is to reduce the size of files to be retrieved and improve the query speed. After completing the database and table, a table in MySQL can have more than 300,000 pieces of data. It takes about 0.5 seconds to query the latest records. TDengine, on the other hand, stores all the data of 2000 electricity meters for 6 months (more than 800 million in total). It takes less than 0.01 second to query the latest data of one electricity meter, which is nearly 50 times better performance!
At the same time, we tried to query the full data of a device for 30 days (15 million), which can complete the whole query in 5 seconds, much faster than the relational library.
Table structure design is simplified
The problems mentioned above have been solved by TDengine’s storage strategy while improving query efficiency. When using TDengine, the classification and group retrieval of terminal devices can also be realized through its super table label filtering, which further simplifies the logical processing of the business layer. At the bottom of our platform we deal with a number of different projects, each with its own device grouping.
In TDengine, the core idea of table design is that one terminal device has one table, and the table structure of the same device is defined by a super table. The data fields reported by devices of the same type are the same, and the structures of sub-tables in the super table are the same.
However, TDengine allows each device to be tagged with static information such as the device ID, project ID, device type number, and group number of the device in the project.
For our query scenario, the above information is often needed to be filtered and filtered in the super table labels. When querying the super table, you can add filtering conditions for labels such as item IDS in the WHERE statement to query device data by item. The same query in a relational library inevitably involves the association of multiple tables, which is more complex and less efficient.
TDengine table design and query examples
In electricity meter data collection, data is mainly divided into task data, alarm data, command data, original message. TDengine is a table per device design idea. Therefore, for the above data of each type of device, we build the table according to the device.
– Task data
Task data is the load data periodically sent by the electricity meter, which is composed of the time stamp of data collection and the specific electricity meter reading. This idea is very simple, is to first build a super table, define the collection field, and then create a table for each device to store its reported records.
– Alarm data
The alarm data is a set of data sent when the electricity meter is abnormal. It consists of alarm time and load data. The fields and task data are basically the same. We can define some basic attributes of the device that do not change over time as tags, such as device ID, project, type, superior device, etc. Indicators that change over time are defined as data columns in the table, such as operating status, event alarm, relay status, ABC three-phase voltage, ABC three-phase current, active power, reactive power, etc. The structure of the alarm supertable in a project is as follows. The labels are defined as follows: meter_no meter reading number, project_id project ID, gateway_NO gateway ID, meter_type meter reading type.
Field | Type | Length | Note |========================================================================sjsj |TIMESTAMP | 8| |jssj |TIMESTAMP | 8| |yxzt |BINARY | 10| |sjzt |BINARY | 128| |jdqzt |BINARY | 10| |axdy |FLOAT | 4| |bxdy |FLOAT | 4| |cxdy |FLOAT | 4| |axdl |FLOAT | 4| |bxdl |FLOAT | 4| |cxdl |FLOAT | 4| |sydl |FLOAT | 4| |zyggl |FLOAT | 4| |axyggl |FLOAT | 4| |bxyggl |FLOAT | 4| |cxyggl |FLOAT | 4| |zwggl |FLOAT | 4| |axwggl |FLOAT | 4| |bxwggl |FLOAT | 4| |cxwggl |FLOAT | 4| |zglys |FLOAT | 4| |axglys |FLOAT | 4| |bxglys |FLOAT | 4| |cxglys |FLOAT | 4| |zxygzdl |FLOAT | 4| |zxygzdl1 |FLOAT | 4| |zxygzdl2 |FLOAT | 4| |zxygzdl3 |FLOAT | 4| |zxygzdl4 |FLOAT | 4| |axwd |FLOAT | 4| |bxwd |FLOAT | 4| |cxwd |FLOAT | 4| |lxwd |FLOAT | 4| |hjwd |FLOAT | 4| |meter_no |BINARY | 12|tag |project_id |BINARY | 20|tag |gateway_no |BINARY | 12|tag |meter_type |BINARY | 32|tag |Copy the code
When querying the latest status of devices, we can directly query all devices that meet the filtering conditions from TDengine’s super table based on different dimensions such as device ID, project, or device type. For example, query the latest data records of all devices and group them according to meter_NO, gateway_NO, project_id, and meter_type. Use only one SQL statement as follows:
select last_row(*) from db.sb_fhsj group by meter_no,gateway_no,project_id,meter_type;Copy the code
Labels can also be filtered by value, for example, the latest data of a certain type of device in a project can be filtered according to business needs:
select last_row(*) from db.sb_fhsjwhere project_id=$condition1 and meter_type=$condition2group by meter_no,gateway_no,project_id,meter_type;Copy the code
Query the data of a single device in the past day and directly filter meter_NO:
select * from db.sb_fhsjwhere meter_no=$condition1 and _c0 >= now -1dCopy the code
These queries are straightforward, and the business code is easy to develop as long as the table structure is well designed.
– Command data
Command data is the operation result returned by the system. TDengine’s single-table queries are much faster than MySQL’s, so we also use it to store log data that doesn’t require reprocessing.
Field | Type | Length | Note |=======================================================================sjsj |TIMESTAMP | 8| |cmd |BINARY | 2| | frame |BINARY | 1000| |upordown |TINYINT | 1| |body |BINARY | 2048| |project_id |BINARY | 20|tag |gateway_no |BINARY | 12|tag |Copy the code
– Original packet
Raw packets are upstream unparsed data sent by the device. Queries are retrieved by time range, so we store them directly in TDengine as well. Several long binary string fields are used. Note For historical missing data, use IMPORT instead of INSERT to complete INSERT.
Field | Type | Length | Note |=======================================================================sjsj |TIMESTAMP | 8| |task_id |BINARY | 64| |cmd |BINARY | 4| |message |BINARY | 1024| |data |BINARY | 2048| |meter_no |BINARY | 12|tag |project_id |BINARY | 20|tag |gateway_no |BINARY | 12|tag |Copy the code
This section describes the UI interfaces of related systems
Factory project equipment import — When the iot devices are connected to the platform, the data collected by the devices will be written into TDengine in real time. On this page, the real-time value of each device is refreshed periodically, that is, a timed task is maintained to execute the last_row() query.
O&m Platform Overview – Meter reading, settlement, and recent alarms. Alarm information is stored in the alarm table. The background polls the alarm table to check whether the alarm status is displayed on the device detection page in a timely manner.
Operation and Maintenance Project Overview – Real-time meter reading status statistics Display real-time query and statistics on the operating status reported by meter reading.
Load data analysis – Query and analyze historical load indicator data. This involves the user’s choice. The user needs to see the historical data of the table, directly select the time range and device ID on the front page, record and draw the change curve through TDengine second feedback historical data.
conclusion
In use, TDengine, as a sequential database designed for the Internet of Things, does show the advantages of high performance and simple architecture in the case of multiple devices and high acquisition frequency. TDengine is nearly 50 times faster than MySQL in the same real-time device data query scenario. The design of its super table saves a lot of joint table query logic and greatly simplifies the development of business layer. Our current system is already fully capable of data caching and sequential indexing, and we will continue to explore the use of TDengine’s streaming computing and subscription capabilities to fully leverage the underlying database and further optimize the system architecture of the platform.
About the author:
Jian Zhang, BASIC Innovation UI designer, joined the R&D department of Hangzhou Basic Innovation Technology Co., LTD in 2017 and has been working till now. Currently, he is responsible for the overall visual design of the company’s system software, including the product interface design of WEB terminal and mobile terminal. From 2019 to 2020, he led the UI development and design of IEMS energy management system.
Company Profile:
The IEMS energy management system independently developed by Hangzhou Basic Innovation Technology Co., Ltd. is an Internet of Things big data platform that collects and processes the classified energy consumption data of buildings such as water energy and electric energy, analyzes the building energy consumption, and realizes the application of building energy conservation. System to achieve the metering automation management of water, electricity, etc, including the automatic meter reading data acquisition, query, statistics, settlement, remote control, data analysis, monitoring equipment, energy consumption and output of various forms of graphic, curve, report forms, etc., achieve finally let enterprise energy management more intelligent, more energy efficient, the purpose of security. Through centralized collection of energy consumption data of meters, energy metering equipment management and other means, users can also experience classified itemized query, statistics, settlement, recharge, refund, statements and other functions, so that managers have an accurate grasp of the proportion of energy cost and development trend of enterprises.