After developing the user tag, how to apply the tag to the reality is a very important problem. Only the product design can make the label play the real value, this article will introduce the process of user portrait productization.
First, label display
The first is the label display function, which is mainly used by business personnel and research and development personnel, in order to see the whole user label system more intuitively.
Different tag systems will have different levels, so the design of this page needs us to display a tree structure, convenient for future expansion.
At the last level, such as natural gender, you can design a statistics page, and when you enter the page, you can display the corresponding statistics.
You can intuitively see the value ratio of labels and provide good suggestions for businesses. In addition, you can display the specific description of labels to illustrate the fluctuation of labels on a daily basis and observe the changes of labels.
What are the data sources for this part? As mentioned earlier, the metadata information of these tags is stored in mysql for easy query.
Therefore, the tree chart and label description information need to be obtained from mysql, while the proportion chart data can be obtained from Hbase, Hive, and ES directly. However, the historical fluctuations of daily tags are still displayed as historical records in mysql after running tags every day.
Second, label query
This feature can be used by r&d personnel and business personnel.
The label query function is actually a process of global portrait of the user. We need to display the full amount of label information of a user.
After you enter a user ID, you can view the attributes, behaviors, and risk control attributes of the user. Learn a specific user profile from multiple perspectives.
The tag information is specific. Searching for a single ID from Hive slows down the query speed. Therefore, you are advised to obtain the tag information from Hbase or ES to greatly improve the query efficiency and real-time performance.
Label management
This feature is intended for use by developers.
For labels, you can’t make big changes every time you add a label. This is very labor-intensive, so you must have functions that can manage labels.
Here defines the basic information of the label, development methods, developers and so on. After completing the development of the label, you can input the label directly on this page, and then you can complete the on-line work of the label, so that the business personnel can use the label.
Add and edit TAB pages. You can add drop-down boxes or input boxes to input information.
As mentioned earlier, the metadata information for these tags is stored in Mysql, as long as they are added and modified.
Iv. User group
As the core function of user portrait, user group function. It is the bridge between user portrait and business system, and also the value of user portrait.
This function is mainly used by business people.
This feature allows users to define a group of people, and the rules for the delineation are the constraints on the tag.
After delineating the crowd, we can provide interaction between them and the outbound call system, customer service system, advertising system and Push system of the business system, so as to achieve the purpose of real fine operation.
For the judgment of tag rules, recorded rules need to be stored in Mysql, and rules need to be parsed into computable logic in crowd calculation. Parsing into Sql or any other query language is very difficult, which is a big challenge for development.
In this function, the function of crowd comparison can also be added to delineate and compare different labels of different groups. This is also a huge test of query performance.
However, the user group function as the core of the user portrait is what we must achieve. For the technical architecture, Hbase is better at KV query, but the performance of multi-dimensional query is poor. Therefore, you can use ES index to query the Hbase Rowkey in ES and then query Hbase. There are also many companies that choose to migrate to ES as a whole to do this. So is ES up to the job?
In the next chapter, we will talk about how to use ES to achieve user clustering, which will be continued
reference
User Profiling: Methodology and Engineering Solutions
For more blog posts and scientific news about real-time data analysis, please visit real-time Streaming Computing for more information about user portraits.