Everyone who works with data knows the story: Alibaba invented the data middle platform, and the concept of the “middle platform” immediately became the hottest trend among domestic enterprises, whether as a business middle platform or a data middle platform. Then companies found that these projects, much like big data lakes, delivered poor results in practice, and many organizations are now busy dismantling their middle platforms. The middle platform has not yet reached the point where everybody is tired of it, but in general it has become less popular.
There are many articles on the Internet analyzing this, but most of them are long and technical, so I will explain it in the plainest terms.
First, the concept of the middle platform
Whether it is a data middle platform or a business middle platform, it is a kind of middle platform. The job of a middle platform is to abstract what is common into general-purpose service capabilities.
The data middle platform abstracts data processing into general-purpose data service capabilities. It focuses on data, including data storage, data computation, and data analysis, and these capabilities are shared across the business. For example, with the user profiling capability provided by the data middle platform, we can use the same solution in every business area.
As shown in the figure above, the business middle platform and the data middle platform are connected. The business middle platform generates data; the data middle platform processes that data, mines its value, and feeds the results back to the business middle platform, forming a closed data loop.
The infrastructure layer provides lower-level service capabilities such as observability, CI/CD, containers, and service governance, and supports the various middle platforms. Besides the data and business middle platforms, there should also be an AI middle platform. Together they serve the front-end applications.
Is the middle platform structure reasonable?
To be honest, this structure is quite reasonable. The middle platform sits between the front end and the back end, shielding the back end’s data storage and absorbing the front end’s endlessly changing requirements.
The front end follows the user interface and is inherently unstable, so it will always produce a variety of data requests. The back end should mainly be responsible for data storage, organizing data of different forms and sizes in appropriate ways. With big data, moving the data around is too disruptive, so the back end needs a certain stability. If front-end requests are served directly by the back end, the back end ends up managing too much.
Handling flexible requests and maintaining disciplined data storage are, to some extent, two different optimization goals, and a single team pursuing both on the same set of hardware is easily pulled in two directions.
Moreover, the back end is shared by many front ends. If the back end provides flexible data services directly to each front end, the coupling between the front ends may become high and maintenance costs will rise accordingly.
Similarly, it is not appropriate to put all this data processing in the front end: it is not secure, and the front-end team is too busy making the interface look better and run more smoothly to think about the data.
With a middle platform, things are much better: the back end concentrates on storage, the front end concentrates on the interface, and the middle platform closes the gap between them. With a clear division of labor and each part doing its own job, efficiency naturally improves.
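The division of labor described above can be sketched in a few lines of Java. This is only an illustration under assumed names: `OrderStore`, `OrderDataService`, `render`, and the sample orders are all hypothetical and stand in for the back end, the middle platform, and the front end respectively.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical three-layer sketch; every name here is invented for illustration.
class LayeringSketch {
    // A stored order row.
    record Order(String userId, String region, double amount) {}

    // Back end: only stores and returns rows; it knows nothing about screens.
    static class OrderStore {
        private final List<Order> rows = new ArrayList<>();

        void save(Order o) { rows.add(o); }

        List<Order> all() { return List.copyOf(rows); }
    }

    // Middle platform: turns stored rows into the data a front end asks for.
    static class OrderDataService {
        private final OrderStore store;

        OrderDataService(OrderStore store) { this.store = store; }

        // Total order amount per region, sorted by region name.
        Map<String, Double> totalByRegion() {
            Map<String, Double> totals = new TreeMap<>();
            for (Order o : store.all()) {
                totals.merge(o.region(), o.amount(), Double::sum);
            }
            return totals;
        }
    }

    // Front end: formats what the middle platform returns; it never touches storage.
    static String render(Map<String, Double> totals) {
        StringBuilder sb = new StringBuilder();
        totals.forEach((region, total) ->
            sb.append(region).append(": ").append(total).append('\n'));
        return sb.toString();
    }

    public static void main(String[] args) {
        OrderStore store = new OrderStore();
        store.save(new Order("u1", "east", 100.0));
        store.save(new Order("u2", "west", 40.0));
        store.save(new Order("u1", "east", 60.0));
        System.out.print(render(new OrderDataService(store).totalByRegion()));
    }
}
```

In this sketch a new front-end requirement would mean adding a method to `OrderDataService`, while `OrderStore` stays unchanged, which is the stability the back end is supposed to keep.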
If the structure makes sense, why can’t it work?
There are a lot of explanations, but most of them miss the point, because most of the people offering them don’t write code, and most of the people who write code don’t speak up.
The fundamental reason is that the industry does not yet have the technology to make the data middle platform work in practice!
The middle platform provides data services to the front end. What is a data service? It returns appropriate data after receiving a request. In other words, it is computation: the middle platform takes over what the database used to do in the back end.
So, what technology am I supposed to use to write this computation code?
Java? Are you kidding? A grouped summary takes hundreds of lines, and you expect me to improve efficiency? To respond quickly to changes at the front end? I’ll be working on this code for days, so I’ll see you next week.
The work of the middle platform, which is what the database used to do, is mostly computation on structured data. Java, although a high-level language, has few useful libraries for structured data computation, and it can take hundreds or thousands of lines of code to do what SQL does in a few statements. Long code is not only difficult to write but also prone to error. What’s more, Java programmers cost a lot of money, yet their productivity doesn’t increase. Why bother?
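To make the comparison concrete, here is a minimal sketch of one small grouped summary written by hand in Java, with the equivalent SQL in a comment. The `Sale` record, the `sales` table, the column names, and the sample data are all assumptions made for illustration. Even this simplest case needs an explicit accumulator, two loops, and a sort step, where SQL needs a single statement.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical example: a grouped summary written by hand.
// Equivalent SQL (table and column names are assumptions):
//   SELECT region, COUNT(*) AS orders, SUM(amount) AS total, AVG(amount) AS average
//   FROM sales GROUP BY region ORDER BY total DESC;
class GroupSummarySketch {
    // One input row.
    record Sale(String region, double amount) {}

    // One output row of the summary.
    record Summary(String region, long orders, double total, double average) {}

    // Mutable accumulator for one group.
    private static final class Acc {
        long count;
        double sum;
    }

    static List<Summary> summarize(List<Sale> sales) {
        // GROUP BY region: one accumulator per distinct region.
        Map<String, Acc> groups = new LinkedHashMap<>();
        for (Sale s : sales) {
            Acc acc = groups.computeIfAbsent(s.region(), k -> new Acc());
            acc.count++;
            acc.sum += s.amount();
        }
        // Build the result rows: COUNT, SUM, AVG.
        List<Summary> result = new ArrayList<>();
        for (Map.Entry<String, Acc> e : groups.entrySet()) {
            Acc acc = e.getValue();
            result.add(new Summary(e.getKey(), acc.count, acc.sum, acc.sum / acc.count));
        }
        // ORDER BY total DESC.
        result.sort((a, b) -> Double.compare(b.total(), a.total()));
        return result;
    }

    public static void main(String[] args) {
        List<Sale> sales = List.of(
            new Sale("east", 100.0), new Sale("west", 40.0), new Sale("east", 60.0));
        for (Summary row : summarize(sales)) {
            System.out.println(row);
        }
    }
}
```

Adding a join, a second grouping level, or a filter on the aggregated values would each extend this code further, while the SQL version would grow by only a clause or two.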
However, some big tech companies seem to have implemented the middle platform well. How do we explain that?
It may be that big companies have plenty of talent and a rich accumulation of Java code, which makes these computations easier. And, whisper it quietly, the business of these Internet giants, big as they are, is nowhere near as complex as that of traditional industries. What works for a big tech company may not work for you.
Instead of Java, can we continue to use SQL?
That would mean putting a database in the middle platform and copying a batch of data from the back end into it. How much data? Any of the data might be needed for computation, so you would have to move the entire back end’s data over. But is that still a middle platform? It is just the back end relocated, which is a pointless exercise.
Without structured data computing capability that is database-independent, integrable and embeddable, able to support diverse data sources, simple and convenient to use, and rich and powerful, the data middle platform is a fantasy: a good-looking architecture that cannot be implemented. Unless your business is simple enough, forcing a middle platform into place will only bring higher development costs and lower efficiency, with no gain in flexibility and a lot more hassle.
The data middle platform is constrained by computing capability. Only once computing engines with the above characteristics are available can this reasonable architecture really play its role.