Abstract: In this paper, the development of graph neural network in academia and industry is introduced, and the basic concept and representation of graph neural network are given, and the variants of graph neural network are summarized. Finally, the framework of Huawei Cloud graph neural network is introduced.
This article is shared from huawei cloud community “dry rice people, dry rice soul, understand the map neural network steady rice basin”, the original author: agile xiaozhi.
Rice! Don’t you understand the graph neural network (GNN) that will be used by big factories in 2021? A article with your GNN from entry to take-off, do a bowl of the most stable GNN rice people!
What is a graph of a neural network?
To understand the graph neural network, first of all, you need to understand what a graph is! Graph is a common structure in human social life. For example, social networks between people form graph, subway lines and high-speed railway lines form graph, Internet users buy commodities form “Netizen-commodity” graph, Internet web pages link to each other form graph, and mutual citation of papers form graph. Many tasks can be accomplished based on the information in these graphs, such as predicting whether a user will buy an item or be interested in it based on their historical interaction with the item; Another example is to predict whether a friend relationship is formed between a user and a user according to the friend relationship or communication record between users.
Since graphs are everywhere, how do we analyze graphs? At present, graph embedding technology is usually used to solve graph analysis tasks. Through the graph embedding technology, the structure and content of the graph can be represented by a low-dimensional vector, which can be used as input in the downstream learning task. In addition, graph embedding can be combined with deep learning techniques. For example, based on the assumptions of local connection and translation invariability, Graph embedding can be combined with Convolutional Neural Network (CNN) to obtain a Graph Neural Network (GNN).
The learning tasks on the map include:
-
Graph node classification task: Each node in the graph has corresponding characteristics. When we know the categories of some nodes, we can classify nodes of unknown types.
-
Edge structure prediction task: There are also many types of edge relationships between nodes in the graph, and this task is to predict the relationship between nodes.
-
Graph classification: This task is to classify the whole graph. The basic idea is to aggregate the features of nodes in the graph as the features of the graph, and then classify them.
What has become of the academic world?
Development of academia
In recent years, graph neural network has ushered in a rapid eruption period. In the aspect of theoretical research, the principle interpretation, variant model and the extension and adaptation of various graph data of graph neural network are studied. According to the related papers in the top conferences in the past year, it can be found that graph neural network has become the biggest research hotspot.
Figure 2.1 Development of academia
Industry Development
In terms of application practice, the graph neural network shows unprecedented permeability, from visual reasoning to open reading comprehension problems, from the research and development of drug molecules to the design of 5G chips, from traffic flow prediction to 3D point cloud data learning, it can be seen that the graph neural network is extremely broad application prospects.
Ant Financial used the graph neural network model to mine the relationship pattern between “normal users” and “insurance fraud groups” in the equipment sharing graph, so as to realize the identification of malicious accounts. Each node in the figure has its own characteristic information, which can be used to mine the device node information associated with a certain user node. When a user is associated with many devices, it can be considered that the user has high risk. At the same time, based on the association relationship in the figure, nodes connected to the malicious user and associated device may also have high risks.
Figure 2.2 Ant Financial: High-risk account identification
Didi Chuxing studies an online ride-hailing demand prediction model based on spatio-temporal multi-graph convolutional neural network. Through the analysis of the complex spatiotemporal dependence between regions, the demand for online ride-hailing is accurately predicted to guide the scheduling of vehicles, improve the utilization rate of vehicles, reduce the waiting time, and alleviate the traffic congestion to a certain extent.
Figure 2.3 Didi Chuxing: Vehicle regulation and management
Ali Mother uses graph neural network to mine various relationships of Query, Item and Ad from different dimensions such as user behavior log and content attribute. For the online request scenario, the distance between the user query word vector, the node vector in the pre-action and the advertisement node vector is calculated to carry out efficient vectoquantization nearest neighbor retrieval, so as to quickly match the advertisement that meets the user’s intention and recommend it to the user.
Figure 2.4 Mama Ali: Search AD matching
Netease Music uses graph neural network to mine the characteristics of users, songs and users’ behavior characteristics of songs, so as to realize accurate music recommendation. Each node in the figure has structural information. If a user frequently subscribing to music of a certain category or has a high score for music of a certain category, the system can determine that the user is more interested in music of this category and recommend more music of this category to the user.
Figure 2.5 netease: Music recommendation
Graph neural network
Basic concepts of graph theory
To solve the problem of non-Euclidean structured data representation, researchers introduce Graph in the abstract sense of Graph theory to represent non-Euclidean structured data.
A Graph G consists of a set of vertices and a set of edges, and can usually be defined in the following form:
The set of Vertex V can be expressed as
The set of edges (Edge) E can be expressed as
A representation of a graph
The degree matrix, adjacence matrix and Laplace matrix of vertices are often used to characterize graphs.
Degree matrix of vertices D: the number of edges associated with the vertices
Adjacency matrix A: A common representation of graph structures
Laplacian matrix L: a representation of graph structure
The following figure shows an example of a connected graph and its corresponding degree matrix, adjacence matrix, and Laplace matrix.
Figure 3.1 Basic concepts of graph theory
Figure neural network model
Graph embedded model
Graph Embedding means that nodes, edges or subgraphs in the Graph are represented by low-dimensional continuous vectors. In order to obtain the embedded representation of the graph, the message propagation mechanism in the graph can be utilized. The message propagation mechanism in the figure consists of two steps: message aggregation/combine and node update. Message aggregation refers to the embedded representation of nodes in the learning center according to the characteristics of neighboring nodes. The message propagation mechanism in the figure can be represented by the following formula.
Where, □ represents differentiable functions that are independent of the input order, such as summation, mean or maximum function, etc. γ and φ represent differentiable functions, such as multilayer perceptrons.
Figure 3.2 Figure embedding model
Graph convolutional neural networks
In graph convolutional neural network, the propagation mode between layers is as follows:
The following figure is A schematic diagram of FIG. Convolutional neural network. The input of FIG. Convolutional neural network is A graph. After passing through several layers, the node feature changes from X to Z, sharing the parameter A in the middle multiple hidden layers.
FIG. 3.3 FIG. Convolutional neural networks
A two-layer graph convolutional neural network is constructed, and the activation functions are ReLU and Softmax respectively. Then the overall forward propagation formula is:
Finally, according to feature Z, downstream tasks can be done, such as node classification task, graph classification task, node connection prediction task, etc.
Graph attention network
The attention mechanism can be understood as a weighted summation process: for a given query, there are a series of values and keys corresponding to one of them. How do you compute the result of the query? Simply calculate the similarity between the query and all the keys, and then sum all the values weighted according to the similarity. This similarity is called attention coefficients, which can be calculated as follows:
Type, a is the weight coefficient of feedforward neural networks, | | concatenation operation.
Figure 3.4 Attention network
Using the attention mechanism, the features of each node in the figure can be updated:
By using multi-head attention mechanism, K weight coefficients can be used to update node features respectively:
FIG. 3.5 Multi-head mechanism in the attention network
The advantages of the graph attention network include: parallel computing can be carried out on different nodes, nodes with different degrees can be processed at the same time, and graph structures that have never been seen can be processed and used to solve inductive learning problems.
Heterogeneous graph attention network
GCN operates directly on the homograph and induces fusion according to the attributes of its neighborhood to obtain the embedded representation of the current node. In the homogeneity diagram, the propagation rules of each layer are shown as follows
In heterogeneous networks, there are many types of nodes T={τ1, τ2, τ3… }, GCN cannot be directly applied to heterogeneous networks. To solve this problem, heterogeneous graph convolution can be adopted to consider the heterogeneity of various types of information, and the type-dependent transformation matrix can be used to project them into a common implicit space.
When given a particular node, different types of adjacent nodes may have different effects on it. For example, adjacent nodes of the same type may carry more useful information, while different adjacent nodes of the same type may have different importance. Therefore, a two-layer attention mechanism of heterogeneous networks can be designed.
Figure neural network platform
The difficulty of the current graph neural network platform development lies in the lack of a unified algorithm framework and the need to improve the efficiency of data processing. The traversal of graph data and its interaction with deep learning will greatly reduce the computation efficiency of graph, which is also one of the bottlenecks of deep learning. If you want to make a breakthrough in performance, you need to redesign a new graph deep learning framework. Huawei Cloud Graph neural network framework is described below.
FIG. 4.1 Multi-head mechanism in the attention network
(1) New GNN framework based on Graph engine: Based on the efficient neural network training operator in ModelArts, combined with the existing high-performance Graph Engine Service (GES) framework platform capability, and made use of the features of Graph Engine with high concurrency and low delay, the training process of GNN was highly parallel. For example, skip probability estimation of edge, vertex neighborhood sampling and negative sample construction are all solved into local operations of each vertex. The system provides dynamic schedulers that allow these local operations to be executed in a highly parallel manner, which greatly improves the overall throughput of the system.
(2) Unification of various GNN algorithm frameworks: The unified architecture is used to implement unsupervised large-scale graph embedding (such as DeepWalk, Node2Vec) and semi-supervised graph convolution (such as GCN, GraphSAGE) and other GNN algorithms, reducing the maintenance cost of the system.
(3) INTEGRATION of GNN and Graph data management: Enterprise-level GNN applications are usually not one-time computations, and the data scale is large, so the data must be maintained and managed. However, the existing GNN usually does not have this capability, so users can only build a separate database for maintenance, and then export the whole data during calculation. It not only consumes a lot of resources, but also introduces many problems such as data consistency. GES uses Property Graph and ecologically compatible factual standard Gremlin Graph query language to manage and maintain distributed Graph data. When training is needed, all kinds of operators are called locally in Graph engine and executed concurrently, which reduces the end-to-end performance loss.
With the help of efficient neural network training advantages of ModelArts and high-performance graph computing advantages of GES, Huawei Yuntu neural network greatly improves the overall computing efficiency of GNN. Taking Node2VEc algorithm as an example, in PPI data set, Huawei Yuntu neural network from sampling to training can be completed in 2min, which is 20 times more than the traditional open source implementation.
Graph neural networks have a future!
With the increasing heat of graph neural network research, different varieties of graph neural network have been emerging. In addition, because the graph neural network has a good expression ability for non-European spatial data, it has a broad application prospect in the cross fields with a large amount of data accumulation, such as e-commerce, finance, transportation and social science. This paper introduces the development of graph neural network in academia and industry, gives the basic concept and representation of graph neural network, summarizes the variants of graph neural network, and finally introduces the framework of Huawei Cloud graph neural network. Hope this article can provide some reference for you on the GNN road!
Click follow to learn about the fresh technologies of Huawei Cloud