Comprehensive Demand Map for Local Living (GENE: Lifestyle GEneral NEeds Net (” LIFESTYLE GEneral NEeds Net “), from the perspective of user demand, deeply excavates the diverse NEeds of users in local life scenes, and forms a knowledge map associated with multi-industries and multi-types of supplies, aiming to improve the efficiency of supply and demand matching of the platform and help business growth. This paper introduces the background, system design and algorithm practice of local life comprehensive demand map, and shows its application in many business lines of Meituan, hoping to bring some help or inspiration to you.
The background,
1.1 Business Status
With the mission of “helping people to eat better and live better”, Meituan covers hundreds of industries such as takeout, catering, hotel, home stay, tourism, ticket, movie/performance, leisure/fun, beauty, medical, parent-child, education, marriage, life service and so on, meeting the diversified life service needs of hundreds of millions of users. In order to continue to increase the value of the platform, in addition to driving users and merchants to continuously improve the quantity and quality, more efficient matching of user demand and merchant supply is also an important part of the platform.
To improve the efficiency of matching, we need to understand user demand and merchant supply more fully and deeply, and try to organize and manage supply from the user’s perspective. At present, “industry – category – merchant – commodity” is a common way of supply organization and management. However, with the rapid development of business and industry, the pain points brought by this way of organization become increasingly prominent. For example:
- It is difficult to get a suitable matching result for some users whose directivity is unclear. For example, “Where to play with your baby this weekend?” Because the matching results of the platform are not ideal, users can often only make category decisions offline, decide to take the baby to nongjiale barbecue, and then search for the corresponding Nongjiale group purchase on the platform.
- Some requirements span multiple categories, and the matching process is not smooth enough. For example, “Where do you go to relax with your friends this weekend?” After the user makes the category decision offline, the optional categories include KTV, bar, secret room, board game, etc. However, the bearing pages of all kinds of purposes are independent of each other, and the user needs to switch back and forth between the bearing pages.
- Users still struggle to find supplies to meet their needs in a partially targeted category. For example, in the medical and beauty category, due to the lack of relevant knowledge, users often do not understand the functions of various service items provided by merchants, which parts are suitable for them, and what materials should be used, so they cannot find the suitable service supply efficiently.
The essential reason for the above problems is that the supply is mainly organized from an industry perspective, without fully considering the user perspective. In the current market environment where meeting user needs is the primary goal, we need to iterate and improve the existing supply organization.
1.2 Problem Analysis
In order to solve the above problems, we try to analyze them from an external perspective, combining first principles. In the whole human society, according to Maslow’s hierarchy of needs [1], human needs can be summarized and stratified. If you think of human society as a system, in a subsystem called the market, people trade to meet their needs.
Starting from the level of needs, human beings complete transactions in the market, and the final needs are met. Then the process of trading in the market can be broken down into “thinking -> considering -> selecting evaluation -> trading purchase -> performing a contract/service”. In addition, through the first three stages, the user’s requirements gradually evolve from coarse grained to fine grained, from abstract to concrete. The following will be interpreted in combination with specific examples:
- Hierarchy of needs: There is an emotional hierarchy in Maslow’s hierarchy of needs, which corresponds to the rich and diverse emotional needs of human beings, including family love, friendship, love and so on. As a mother, there is often a strong mother-child relationship between her and her baby, and she wants to continue to strengthen this emotion.
- Thoughts: For this reason, mothers often try to spend more time with their babies. By increasing the dimension of companionship, it becomes “playing with the baby”; By adding dimensions such as time and space, it becomes “where to play with your baby on the weekend?”
- Consider: There are a variety of options that moms have for the problem, from outdoor barbecues to farm pickings to theme parks. When she decides to go for an outdoor barbecue, it translates into specific merchandise needs, such as buying a grill.
- Evaluating options: The mother then chooses between the available supplies. As people often say, shopping around, the choice will be based on different factors, such as price, quality, reputation and so on.
- Transactional purchase: When the choice is made, the mother makes a transaction in exchange for a good or service.
- Performance service: namely the arrival of goods, completion of services, etc.
The market is a subsystem of human society, and the e-commerce platform is a subsystem of the market. At the same time, the e-commerce platform is an online subsystem, which provides search, recommendation and other forms of supply and retrieval capabilities. The current situation is that users often complete the process from “thinking” to “considering” offline, which is transformed into specific demand for goods/services, and then enter the e-commerce platform to complete the follow-up process from “selection and evaluation” to “performance service” (as shown in Figure 1 below). However, e-commerce platforms tend to focus on the latter three stages of capacity improvement, and tend to ignore the first two stages.
Therefore, it is difficult for users to form the mind of “thinking” and “considering” on the platform, while most e-commerce platforms organize and manage the supply in the manner of “industry-category-merchant — commodity”. In the end, the relationship between the user and the e-commerce platform forms a mutual constraint.
In fact, compared to the clear commodity demand of “grill”, there are a lot of abstract, vague, unclear needs of users, which still stay in the first two stages. For example, where to play with your baby on the weekend? Where do you go to relax with your friends on the weekend? How can I make myself more beautiful before marriage? How to cultivate children’s hands-on ability in summer vacation? At the same time, such requirements often span multiple categories, or have multiple options within the same category.
Only by breaking the existing constraint relationship and providing users with the ability of the first two stages (thoughts and considerations) can e-commerce platforms further meet the needs of users. The decision-making cost of users is further reduced, the decision-making process is more coherent, and the user experience can be improved accordingly. At the same time, the transaction process of users in the market can be further realized online.
With “industry – category – merchant – commodity” as a reference, if the e-commerce platform can identify the needs of users in the first two stages, and establish a new association relationship between them and the supply, supplemented by search, recommendation and other retrieval capabilities, it is possible for users to complete the first two stages online. As a semantic network to reveal the relationship between entities, knowledge graph is particularly suitable to solve the above problems.
Second, solutions
2.1 Solution
Following the example in 1.2, the mother will “Where will she play with her baby this weekend?” “Has translated into specific” outdoor barbecue “needs, which extend to more specific” grills “and” rural group buying “needs. At this time, my mother will go to various physical e-commerce platforms and life service e-commerce platforms represented by Meituan for “selection evaluation”. The two e-commerce platforms use search and recommendation technology to locate specific goods/services in the physical supply pool or service supply pool respectively, and then give feedback to the mother.
For the purpose that the technical team expects to achieve as mentioned in 1.2, a representative reference case at present is Alibaba’s e-commerce cognitive map AliCoCo[2]. Its basic construction idea is to start from the user’s perspective, firstly to mine various types of atomic words, and then to combine atomic words and mine relevant candidate phrases, and then to identify the real user needs, and finally to the corresponding supply. Its hierarchy is shown in Figure 2:
- Classification layer: Construct a complete classification system, which includes all kinds of classifications in the vast world, including general categories such as space and time, as well as colors, functions and the most important categories involved in e-commerce.
- Atomic concept layer: expanded on the basis of various categories in the classification layer, including atomic concepts under various categories (such as space -> outdoor, event -> barbecue, time -> Christmas, color -> red, function -> warmth, category -> dress) and the relationship between atomic concepts.
- E-commerce concept layer: Above the atomic concept layer, it contains users’ shopping needs composed of atomic concepts or directly mined phrases, that is, e-commerce concepts (such as outdoor barbecue), so that users’ shopping needs are explicitly expressed in a phrase that conforms to natural language.
- Commodity layer: contains the relationship between commodity and various atomic concepts and e-commerce concepts (such as outdoor barbecue -> grill, butter, tinfoil).
Based on the map, the mother can directly express her needs for “outdoor barbecue” on Tmall, rather than the more specific “grill”; Tmall will also feed back to the mother other important items related to outdoor barbecues other than grills. From the perspective of correspondence, AliCoCo’s e-commerce concept layer corresponds to the “consideration” stage, and the commodity layer corresponds to the “selection and evaluation” stage. Obviously, thanks to AliCoCo, Tmall can intervene in users’ transaction process from the “think” stage.
It follows that we should be able to build a more complete map to cover the “mind” stage. At this stage, human needs are gradually crystallized by adding one or more dimensional constraints, according to maslow’s hierarchy of needs. For such dimensional constraints, we collectively call them “scene constraints”. Therefore, the requirements corresponding to the “consideration” stage are called “representational requirements”; The requirements corresponding to the “mind start” phase are called “scenario requirements”. To this end, we hope to build a GENE: Lifestyle GEneral NEeds Net, as shown in Figure 3 below. For the hundreds of comprehensive industries involved in the local life scene, we believe that the new supply organization is more close to the needs of users, and it is also to solve the problem of supply and demand matching from the perspective of users.
2.2 Specific Plan
Continue in 2.1, we try to build a multi-level graph structure, and will be “representational demand” and “scene demand” split into separate layers, both to avoid the two types of demand caused by the confusion, and at the same level of and can carry out more detailed requirements of the user scenarios apart, more rich. Local life comprehensive demand map (GENE) is mainly composed of six parts, including scene demand layer, scene element layer, concrete demand layer, demand object layer, industry system layer and supply layer, as shown in Figure 4 below:
At the scene requirement layer, we use the phrase human-readable to represent the user needs of the scene, such as “where to play with the three-year-old baby on National Day”, “Make yourself more beautiful before marriage”, “pupils improve their thinking ability” and so on. In the expression of a scene demand, usually includes characters, purpose, time, space, way and other elements, take “National Day to accompany the 3-year-old baby to play” as an example, “3-year-old baby” is the character, “play with the baby” is the purpose, “National Day” is the time.
At the scene element level, in order to better express the scene requirements, we disassemble these short sentences and refine them into several fine-grained vocabularies. These vocabularies are used to complete the coverage and systematic organization of the characters, purpose, time, space, mode and other elements in the scene requirements, which are called “scene elements”.
In the concrete demand layer, the description of scenario demand often does not explicitly point to a specific service/supply, but implies a batch of potential services/supply suitable for this scenario. For example, in the case of “Where to play with a three-year-old baby on National Day”, outdoor barbecue, feeding alpacas, playing on slides and riding ponies are all specific services suitable for this scene. Therefore, we need to show all these specific services explicitly in the form of phrases. These phrases directly reflect users’ specific service requirements, which are called “concrete requirements”.
In the requirement object layer, in order to further understand the representational requirements, the representational requirements are divided into objects corresponding to the specific service requirements, which are called “requirement objects”, and the interaction between users and objects in the service. For example, feeding alpaca with concrete needs can be divided into alpaca (demand object) and feeding (service interaction). Due to the diversity of local life services, besides feeding alpaca, a variety of concrete needs such as touching alpaca, riding alpaca and watching alpaca performance can be generated around alpaca as a demand object. In addition to the requirement object nodes, this layer also contains the attribute information of the requirement object to describe the requirement object in more detail. For example, for the concrete demand outdoor barbecue, it can be divided into barbecue (demand object), outdoor (demand object attribute) and experience (implicit service interaction).
At the industry system level, as the scenario requirements and concrete requirements of users often span multiple traditional service categories, in order to determine a specific business scope for user needs, we also need to build the category system involved in various industries, as the business foundation for the construction of the above layers.
In the supply layer, the virtual supply of contents and the physical supply of merchants and commodities will be associated with concrete demand and scene demand, so as to provide the corresponding supply support for user demand. For example, a supply of outdoor barbecue will be associated with concrete demand “outdoor barbecue”, and further associated with the scene demand “Where to play with the three-year-old baby on National Day”.
To sum up, in the comprehensive demand map of local life, users’ scenario-oriented needs and specific service needs are respectively expressed as short-sentence level scenario needs and phrase-level concrete needs. These two requirements are expressed through scene elements and requirement objects respectively. Finally, different types of supply will be associated with scene demand and concrete demand, so as to enhance the matching efficiency between supply and users with user demand as the link.
Three, the implementation method
At present, the comprehensive map of local life needs has preliminarily covered the diversified needs of users in three industries related to local life: play, medical beauty and education. In the process of constructing the graph, we build the graph layer by layer from bottom to top according to the order of industry system layer, demand object layer, concrete demand layer, scene element layer and scene demand layer, and establish the association relationship between the nodes of each layer and various types of supplies. The following will take the play industry as an example to introduce the construction details of each layer of the map and the algorithms involved.
3.1 Industry system layer
3.1.1 Construction of industry category tree
In the play industry, the industry system layer contains categories that can provide play services, and category information is represented by a tree structure. Because the construction of play industry system requires very high expert knowledge, and the design of this part is crucial to the knowledge mining of subsequent layers, we do not directly define artificial, but based on our current mature industry category tree, we build it by pruning and splitting it.
First, select the first-level category nodes related to play from the category tree, including “leisure and entertainment”, “parent-child”, “tourism”, “catering”, etc. For each primary category, we further filtered the next level of play-related categories down to the leaf category and cut out the non-play-related categories. In addition, we also divided the playing-related leaf categories that could be subdivided in the category tree after pruning, for example, “bath” was subdivided into “private soup”, “bath center”, etc., finally, a complete category tree of play industry was obtained.
3.1.2 Class supply association
After determining the category tree, we also need to obtain the dependency relationship between physical offerings (merchants and goods) and virtual offerings (content, such as UGC) and the category to provide data support for a series of subsequent mining. Since both merchandise and content can be linked to the merchant, we only need to obtain the merchant’s affiliation with the class. The play category tree is obtained by pruning and splitting existing categories. Except for the new categories, the relationships between other categories and merchants can directly inherit the original results. For the newly split category, we need to rebuild the business and its dependencies.
To determine which category a merchant belongs to, the most intuitive basis is the merchant name, commodity name and commodity details. However, the merchant name and commodity of many merchants often contain less information, which increases the difficulty of category discrimination. In order to ensure the accuracy of discriminant of merchant categories, we introduce more merchant information, including merchant UGC and merchant portrait, and design a discriminant model of multi-source heterogeneous data fusion. The overall model structure is shown in Figure 5 below:
Among them, feature extraction and processing methods of data from different sources are as follows:
- Merchant name, commodity name and commodity details: all are text data, which can be extracted directly through BERT[3] and output.
- Merchant UGC: Since the number of MERCHANT UGC is often very large, in order to make effective use of its information, firstly, UGC features are obtained through Encode in the way of Doc2Vec[4], and then output after feature processing through a self-attention [5] module.
- Merchant portrait: After being transformed into one-hot feature, it is output after nonlinear mapping through the full connection layer.
The above three features are connected and integrated, and the final category discrimination is realized through the full connection layer and softMax layer. Based on multi-source data fusion modeling, the merchant information has been fully utilized. For example, using only merchant name, commodity name and commodity details data, the accuracy rate of bath subclassification is 92% based on BERT discrimination, and the accuracy rate increases to 98% based on multi-source fusion model discrimination.
3.2 Requirement Object layer
In the requirement object layer, we hope to dig out the play object words involved in various categories in the system of the play industry as nodes of this layer. These words can describe the interaction objects in the actual process of users playing, which is the basis for the composition of concrete play requirements. To ensure the comprehensiveness of play object mining, we use a multi-source, multi-method approach.
In terms of data, we use relevant texts from merchants and users as mining corpus. Methodologically, we use two ways to mine play object words:
- The first is unsupervised expansion. Before mining, the operation will first provide some play object words as seed input according to experience. We use corpus in advance to build Word2Vec model of unsupervised Skip-gram structure, extract word vectors from seed words of business input, and quickly expand relevant object words by combining cosine similarity.
- The second is supervised tagging, which is defined as sequence tagging problem, and the BERT+CRF model is adopted to automatically identify new object words in the corpus.
In practice, in order to make mining more efficient, we carry out text matching for the expanded and inspected object words in the unsupervised link in the corpus, and transform the matching results into training samples in the supervised annotation link. At the same time, after operation quality inspection, the results with supervised annotation will also be taken as the input of unsupervised expansion. By combining the two links, we complete the mining of play objects. The complete process is shown in Figure 6 below. In addition, the operating manual audit to play in the process of the object word, for some of the core of the business is known to play object, will also enter the business side is associated with precipitation characteristics as its properties, further perfect play object information, for example, “kill” the play object, to increase the corresponding “live” and “desktop” and other types of properties.
After obtaining the object words, we also need to know which category the object words belong to, so as to further concrete demand mining and supply association. Therefore, we construct the relationship between the object words and the category. It is the most intuitive and accurate method to measure the relationship between them by the number of mentions of object words in various corpus texts. Therefore, we directly use object words to carry out text matching in the corpus under each category, and determine the relationship by word frequency. At the same time, we further build the upper and lower position and synonymy relations between object words. Currently, there are common supervised methods such as relation discrimination through projection and classification (such as BERT relational model). In the actual process, we use rules to assist the manual to quickly complete the construction based on the statistical characteristics of the object words and the results of the co-occurrence of Pattern.
3.3 Concrete demand layer
3.3.1 Concrete demand mining
Representational demand layer may be regarded as the user to play in the industry specific set of service requirements, each representational play demand is the layer of a node, the play object superposition pluralistic interaction between the user and the object and object description information, it is in the form of a phrase expresses the essence of the user to play service supply demands. The process of specific play demand mining can be divided into two steps:
- Candidate phrase generation: Around the play object words, a large number of phrases containing play objects are generated as the candidate set of concrete play needs.
- Phrase quality discrimination: A semantic discrimination model is established to extract real representational play requirements from the candidate set.
Candidate phrase generation
In Step 1, we first take the requirement object word as the core and use the same corpus as the play object to mine for the generation of candidate phrases. Common phrase mining algorithms, such as AutoPhrase[6], use Ngram to combine phrases, and this form is too redundant for phrases with required objects, so we consider phrase mining based on syntactic structure.
In order to make the generated phrases conform to the requirements of syntax, we use the preset syntax relation as the template for mining. In order to more efficiently dig out the syntactic relation in large-scale corpus, we obtain the Embedding of sentence components based on ELECTRA[7], which is more convenient, and then use BiAffine[8] to predict its syntactic relation. Through dependency parsing, we dig out the phrases that contain the corresponding play objects and accord with the syntactic relation in each class target corpus. In addition, attributes of objects in the requirement object layer are also used as object descriptions for phrase generation. Finally, all mined phrases, after coarse screening of statistical characteristics such as word frequency, will be used as the candidate set of representational play needs. Mining examples are shown in FIG. 7(a).
Phrase quality discrimination
In Step 2, although the phrases in the candidate set obtained through Step 1 conform to the preset syntactic relationship, there are still a large number of expressions that are not consistent with the actual needs of users semantically. Through sampling analysis, we find that less than 10% of the phrases meet the requirements. How to select a phrase that reflects the real user’s need to play from a large number of candidate phrases has become an urgent problem to solve.
AutoPhrase scores phrases through a discriminant model based on the statistical features of phrases. However, it is difficult to identify phrases with low semantic quality only through the statistical features. Therefore, we further build a discriminant model of Wide&Deep[9] structure based on the joint modeling of statistical and semantic features. To determine whether the phrases in the candidate set are concrete play requirements, we hope that the discriminant model can filter out a large number of low-quality phrases, so as to save a lot of labor costs for operation. The overall structure of the discriminant model is shown in FIG. 7(b), where:
- In the Wide section, the statistical characteristics of the global and context of the candidate phrase are extracted and output after nonlinear mapping through the full connection layer.
- In the Deep part, the Deep semantic features of candidate phrases are extracted, and the corresponding features are extracted and output by BERT.
The above Wide and Deep partial output features are connected and integrated, complementary advantages, through the full connection layer and SoftMax layer to achieve the final phrase discrimination. Accumulated in the process of practice, in addition to the direct use of the phrase tags as is sample, we still have some common sense through the preset Pattern constructed from a candidate set is samples, such as ornamental [plants], touch/animal, and sample for sampling structure of candidates, complete) model training, after combining with the active learning, after several rounds of iteration, The model eventually achieved a 92% recall rate and 85% accuracy. The remaining phrases that pass the quality test are reviewed and refined by the operators to become the final figurative play requirements.
3.3.2 Representation of supply correlation of demand
In the concrete demand layer, because the concrete play demand is obtained by the play object, there is a natural correspondence between the two. As for the relationship between the upper and lower position and synonymous relationship between the concrete play needs, it can be constructed in the process of manual audit based on the relationship between the objects and the syntactic relationship. In addition, it is more important to correlate the physical play demand with physical offerings (merchants and goods) and virtual offerings (content, such as UGC).
This problem is abstracted as a semantic matching problem, which is realized by matching the concrete play demand with the text information provided by the corresponding class. Among them, the merchant uses the text information of merchant name, the commodity uses the text information of commodity name and commodity details, and UGC uses its own text information. Since UGC and goods are part of the merchant, the relationship between representational play needs and UGC/ goods will also be added to the construction of their relationship with the merchant. The overall matching process is shown in Figure 8 below. Firstly, we matched the concrete play demand with UGC/ commodity, and on this basis, combined with the matching results of merchant name text, they were associated to merchants through rule aggregation.
Due to the large number of demands for figurative play and the text information supplied at the same time usually contains multiple clauses, for the balance of efficiency and effect, we divided the matching process into two stages: recall and sorting.
During the recall phase, we roughly screen out clauses that may be potentially related to representational play needs. For the figurative play requirement, we extend the synonymous label of the figurative play requirement based on the constructed synonymy relation, and match it with the clause text for coarser grain Pattern. For the matching clause, we enter the sorting stage to calculate the refined association relation.
In the sorting stage, we build a semantic matching model based on the relationship classification between BERT sentences, and add the full connection layer and Softmax layer after BERT to realize the classification. The model can identify the semantic matching relationship (association/disassociation) by predicting the coarse screen samples obtained in the recall stage. The average recall rate and accuracy of the final supply correlation were 90% and 95% respectively.
3.4 Scene element layer
3.4.1 Disassembling scene elements
The scenario element layer contains the scenario elements that constitute the user scenario requirements. As mentioned above, to describe a scene, it is necessary to explain the specific characters, time, space, purpose and other elements. For example, for “Where to play with a 3-year-old baby on National Day?” This scenario needs, we can do as follows: time – National Day, figure – 3-year-old baby, purpose – family companionship (play with the baby). Therefore, we disassemble the scene elements according to the above methods, so as to make the excavation and combing of scene elements as comprehensive and systematic as possible.
3.4.2 Scene element mining
After the disassembly of scene elements, the next step is to excavate scene elements in each disassembly category. Scene elements, as scene-based information of concrete needs, often come from users’ intuitive feelings. Therefore, we choose UGC context corpus associated with concrete play needs as the mining corpus. Similar to the method of requirement object mining, we take the extracted and summarized scene elements of each category as the seed words, and complete the scene elements mining by means of relevant element expansion and sequence annotation.
After determining the scene elements, the next key is to complete the construction of the relationship between the scene elements and the representational play needs, that is, for each scene element, find out its suitable representational play needs, such as cherry blossoms in spring and animals for children. Through the analysis of UGC text, we found that when users in UGC spoke about a specific figurative play need, they often explained some relevant scene elements. Therefore, we continued to select the CONTEXT corpus of UGC associated with figurative play need as the data source of relationship construction.
At first, we adopted the method based on Pattern. By summarizing the Pattern that can be used to judge the relationship between scene elements and representational play needs, we directly extracted the texts containing the two from the corpus. However, due to the diversity of user expression, not only the accuracy cannot be guaranteed, but also the limited Pattern affects recall. Therefore, we further try to use the model-based discrimination method to improve generalization and improve the construction of the relationship.
Since the representational play needs in the corpus we use are known, if scene elements are regarded as attributes of representational play needs, then the problem can be regarded as an aspect-based classification problem. Referring to the practice of attribution-level emotion classification [10], we constructed auxiliary sentences by presetting sentence templates, combining scene elements and concrete play needs, and transformed attribution-level classification into a QA-like sentence pair classification problem. For example, for a certain corpus that has been linked to the symbolic need to feed alpacas: “This Saturday we went to the farmhouse to feed alpacas”, one of the auxiliary sentences reads: “Weekends are good for feeding alpacas”.
BERT inter-sentence relationship classification model is adopted to realize sentence pair classification, as shown in Figure 9. The auxiliary sentence and corpus text are connected by [SEP] and input to the model for discrimination, and the model outputs the discrimination results (suitable/unsuitable). Finally, we extracted the results according to the relationship in all corpus, and then voted, calculated and scored the relationship between each scene element and representational requirement to determine the relationship between them.
3.5 Scenario Requirement Layer
3.5.1 Assembling scenarios
In the scenario requirement layer, we will assemble the information of the scenario element layer and the concrete requirement layer, so as to generate a large number of scenario requirements. The assembly of the scene needs, may only contain scene elements, such as “National Day with the 3-year-old baby where to play?” It does not contain any concrete demand, but can contain both scene elements and concrete demand. For example, in “Going to the countryside to pick strawberries on weekends”, weekend and outskirts are scene elements, while picking strawberries is concrete demand.
3.5.2 Scenario Requirement Identification
For the assembled scenes, the most important thing is to ensure that they are reasonable. For example, “weekend” and “parents” are reasonable play scenes, while “girlfriends” and “parents” are contradictory play scenes. To do this, we first need to calculate the relationship score between the scenario elements to guide the assembly of the scenario requirements. Scene elements are meaningful only when they rely on concrete requirements, match appropriate gameplay, and participate in the assembly of scene requirements. Therefore, for the construction of a reasonable relationship between scene elements, we try to evaluate the correlation between two scene elements through relationship transfer based on the relationship score between scene elements and representational requirements.
In section 3.4.2, we have quantified the relationship score between scene elements and concrete requirements. One of the most intuitive ideas is to calculate the relationship between scene elements by passing the relationship between scene elements – concrete requirements – scene elements. As shown in Figure 10(a), the relationship scores of the two scene elements of “parent-child” and “bestie” can be obtained by using the concrete demand of “feeding alpaca” as the link.
We first construct the scene elements and representational demand relationship score matrix, considering the style number to meet the long tail distribution situation, demand representational dimensions of matrix column normalized processing, at the same time in order to keep the scene elements – scene autocorrelation coefficients of the matrix is 1, the normalized after scene elements – representational demand matrix line L2 norm normalized processing, Therefore, the new matrix multiplied by the normalized matrix and its transpose matrix can be used as the relationship scoring matrix of scene elements – scene elements.
The relationship score between scene elements can be quickly obtained through the above method. However, this method only calculates the direct co-reference strength of scene elements in the concrete demand by the relationship transfer mode of scene elements – concrete demand – scene elements, resulting in insufficient coverage of the relationship between scene elements. To this end, we extend the transitive pattern to a longer chain of node relationships, which are markov compliant, as shown in Figure 10(b). But as the path of transmission grows, the cost of computing increases exponentially. Therefore, we adopted the one-step sequential difference method in reinforcement learning [11] to solve the problem, and took the concept of “maximizing the expectation of cumulative return” as the value of nodes, the set of scene element nodes as the state space in the reinforcement learning concept, and the set of concrete demand nodes as the action space.
For example, when we are in the state of “parent-child” scene elements, we can jump to the next state “girlfriends” or “outdoor” by choosing “Feed alpaca” or “role-play”. According to the decision function of the state jump process, a concrete demand node is randomly selected as the decision behavior according to all the concrete demands associated with the state of the current scene elements, and the probability of extraction is positively correlated with the score. The probability of state transition is a random jump to the relevant scene elements under the decision of the representational demand node, and the probability of jump is positively correlated with the score.
At the same time, for the specific mutually exclusive relationship, we formulated the reward matrix according to the actual business application needs to realize the diversified scene factor relationship scoring model. In this way, we transformed the node relation transfer model into markov decision model, and deduced the value iteration expression and the node relation score prediction formula by combining the Behrman optimal principle, as shown in FIG. 10(c). Formula according to the pie chart, on the premise of keep unchanged strategy by adopting the idea of bootstrap iterative calculating the value of the node, and further calculate the relationship between the scene elements, which can ensure to make better use of existing network information, improve the relationship between coverage, but also lower the constraint relation through reward matrix mutex relationship, the influence of the flexibility to adapt to different business needs.
Finally, based on the relationship score between the scene elements, we selected the scene requirements with high scores from the assembled scene requirements set, and generated the final scene demand expression according to the preset template, such as “relaxing with friends on weekends”, “playing with girlfriends”, and “A good place for outdoor barbecue with children on National Day”. These scene requirements can be linked to the corresponding concrete play requirements through the scene elements/concrete play requirements contained in them, and then related to the supply, so as to provide users with scenario-based play solutions.
Iv. Application practice
Local life needs comprehensive atlas, covers the user scene demand and representational demand, on the one hand, more lead to participate in the user decision, in “who have decided to”, “consideration”, “choosing assessment” and so on the multiple stages influence users, reduce its decision-making cost, on the other hand, provide a more diversified supply, supply and demand matching efficiently. In terms of application mode, it applies to various business forms such as search and recommendation.
After nearly a year of construction, the current comprehensive demand spectrum contains hundreds of thousands of core concrete requirements and scene node, and the relationship between tens of millions of, and in Meituan parent-child, leisure entertainment, medical beauty, education and training, and other business carried on the preliminary practice of application of an example by introducing the following specific application mode and application effect.
4.1 parents
Parent-child original user channel page matching efficiency between supply and demand, the ICON according to the traditional parent-child category, unable to meet the needs of users of different types (figure 11 (a) left), and guess or supply at the bottom of the form of a single, and reflect the high quality supply of user requirements, lack of decision-making information (figure 11 (b) left), thus to redesign the parent-child channel page. In order to fit the business characteristics of parent-child play, we apply the demand nodes and relationships related to parent-child play to the multiple traffic bits after the channel revision to provide labels and supply data support.
Among them, for ICONS, demand ICONS are generated across categories based on high-frequency scenes and representational requirements, such as “close to animals”, “baby bubble”, etc. (in FIG. 11(a)) and corresponding secondary pages (on the right of FIG. 11(a)). These ICONS contain similar requirements in original multiple categories and provide decision information for users in the “consideration” stage.
Recommended for the bottom and we supply needs of parent-child play representational, optimization, as the supply of high quality for the contents of its associated recommendations, and for each supply extract contains corresponding text representational demand, recommended as a reason for the leakage, these sentences from the perspective of the user’s actual demand presents information, greatly attracted users (FIG. 11 (b) to the right). In addition, according to browsing and transaction behavior, the relationship between the concrete demand and the user is further established by using the supply as the medium, which is applied to the recall and ranking optimization of personalized recommendation. The revised parent-child channel page meets users’ diversified recommendation needs and greatly improves user experience.
4.2 Leisure and Entertainment
In the leisure and entertainment channel page, we have a series of applications around the requirements of scene and visualization. On the one hand, based on organization play scene demand new scene ICON, such as meet user outdoor play “walk in the garden”, meet the users in the tide of indoor play “play indoor tide”, meet like the nighttime users “nightlife” have fun, meet friends and colleagues party party “group”, the ICON from the user scene of the play, It breaks the restrictions of traditional categories and makes the matching between users and supplies more smooth. The secondary page of each ICON will show the concrete gameplay requirements of each scene and the associated businesses and content.
On the other hand, in the scene navigation module of the channel page, it tries to use the scene requirements to further display the scene-oriented fun information, including more than a dozen play scene themes such as “one person’s joy”, “family warmth” and “birthday”, and recommends the businesses related to the concrete needs of these scenes. These scenario-based applications (FIG. 12(a)) act on users in the “mental arousal” stage and improve the decision-making efficiency of users.
In addition, some representational needs through rewriting can be used directly to the corresponding class purpose merchant list page quick sieve, live-action script to kill/desktop killed, for example, change/hanfu experience/lu pet will go/flight simulator (figure 12) in (b) left and, while we in the industry system layer class purpose segmentation result, also can be quick screening of merchants, For example, bath category segmentation (figure 12(b) right), the application of these quick screen, more convenient for users to choose the store.
Fifth, summarize the outlook
In local life services, how to continuously improve the matching efficiency between supply and users is a difficult problem in front of us. We try to start with the object of user concern, by digging into user demand and using it as a link to connect the supply and user. In order to fully explore and understand user needs, we strive to explore and try to build a comprehensive demand map of local life, which is constructed layer by layer in order of industry system layer, demand object layer, concrete demand layer, scene element layer and scene demand layer, and establish correlation relationship for various types of supply.
At present, the results of the comprehensive demand map can be applied to various business forms such as search and recommendation, and has achieved practical results in many business scenarios of Meituan. However, we are still in the early stages of exploration, and there are many iterations ahead, so here are some thoughts and prospects for the future:
- Wider industry coverage: on the one hand, deepen the construction of the existing entertainment, medical beauty and education industries, dig more nodes and relationships, and better understand the needs of users; On the one hand to beauty, marriage and other more industries horizontal coverage; In addition, the service experience map will be further extended to the whole link of user decision-making, covering the performance service link, analyzing the user needs and feedback, and better enabling businesses to improve user experience.
- More data introduction: The current map construction is mainly based on the text corpus of users and merchants of the platform. In the next step, more modal data such as images will be used, and external knowledge will be tried to be introduced to improve and supplement the current nodes and relationships.
- Deeper mapping applications: current map in search and recommended practices are mainly concentrated on the label and its associated supply direct application of the follow-up to consider further deepen the map application, make full use of the scene and scene elements of information demand, provide more accurate recommend side user intention recognition with support, so as to increase supply and users’ matching efficiency, make knowledge map more value.
reference
- [1] Maslow A H. A theory of human motivation[J]. Psychological review, 1943, 50(4): 370.
- [2] Luo X, Liu L, Yang Y, et al. AliCoCo: Alibaba e-commerce cognitive concept net[C]. Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 2020: 313-327.
- [3] Devlin J, Chang M W, Lee K, et al. Bert: A study on the performance of a deep bidirectional transformers machine [J]. ArXiv Preprint arXiv:1810.04805, 2018.
- [4] Le Q, Mikolov T. Distributed representations of sentences and documents[C]. International conference on machine learning. PMLR, 2014: 1188-1196.
- [5] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. arXiv preprint arXiv:1706.03762, 2017.
- [6] Shang J, Liu J, Jiang M, et al. Automated phrase mining from massive text corpora[J]. IEEE Transactions on Knowledge and Data Engineering, 2018, 30 (10) : 1825-1837.
- [7] Clark K, Luong M T, Le Q V, et al. Electra: Ieee Transactions on Geoscience and Remote Sensing [J]. Ieee Transactions on Geoscience and Remote Sensing: Ieee, 2005, 23 (3) : 235-237.
- [8] Dozat T, Manning C D. Deep biaffine attention for neural dependency parsing[J]. arXiv preprint arXiv:1611.01734, 2016.
- [9] Cheng H T, Koc L, Harmsen J, et al. Wide & deep learning for recommender systems[C]. Proceedings of the 1st workshop on deep learning for recommender systems. 2016: 7-10.
- [10] Sun C, Huang L, Qiu X. Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentence[J]. arXiv preprint ArXiv: 1903.09588, 2019.
- [11] Sutton R S, Barto A G. Reinforcement learning: An introduction[J]. 2011.
Author’s brief introduction
Li Xiang, Chen Huan, Zhi Wei, Xiaoyang, Yan Ting, Xu Le, Cao Zhen, etc., all came from the platform technology department of Meituan To the comprehensive business data team.
Recruitment information
Meituandian platform Technology Department – To comprehensive business data team, long-term recruitment algorithm (natural language processing/recommendation algorithm), data warehouse, data science, system development and other positions students, coordinates in Shanghai. Interested students are welcome to send their resume to: licong.yu@meituan.com.
Read more technical articles from the Meituan technical team
Front end | | algorithm back-end | | | data security operations | iOS | Android | test
| in the public bar menu dialog reply goodies for [2020], [2019] special purchases, goodies for [2018], [2017] special purchases such as keywords, to view Meituan technology team calendar year essay collection.
| this paper Meituan produced by the technical team, the copyright ownership Meituan. You are welcome to reprint or use the content of this article for non-commercial purposes such as sharing and communication. Please note “Reprinted from Meituan technical team”. This article may not be reproduced or used commercially without permission. For any commercial activities, please send an email to tech@meituan.com to apply for authorization.