At present, the concept of "zhongtai" is very popular, including data zhongtai, AI Zhongtai, business Zhongtai, technology zhongtai and so on. In the first technical...
Station B chooses Flink + Hudi's data lake technical scheme and the optimization made for it. The paper introduces the pain points of traditional off-line...
In 2012, Toutiao was launched, opening the chapter of intelligent recommendation in the content media industry. Since then, a large number of information products have...
In the process of front-end development, you may have thought about such a question: what is front-end development developing? In my opinion, the essence of...
I believe that you will not be unfamiliar with Flink. As the world's most active Apache open source project for three consecutive years, Flink's popularity...
SparkSql is a distributed Sql engine based on Spark computing framework. It uses DataFrame and DataSet to carry structured and semi-structured data to realize complex...
This article is compiled from the topic "Flink application and Practice in 58.com" shared by Feng Haitao, head of 58.com real-time computing platform, in Flink...
Introduction: How to accelerate Data science with Distributed Python on the cloud. If you are familiar with data science stacks such as NUMpy, PANDAS, or...
This article explains the architecture differences between the old and new versions of Hadoop and MapReduce, describes the Rpc architecture design of Yarn, and uses...
Introduction: This paper mainly introduces how to process labels of massive crowds by MaxCompute, analyze and model by Hologres, so as to support interactive experience...