Big data refers to the collection of data that cannot be captured, managed and processed by conventional software tools within a certain period of time....
SF Technology's big data cluster needs to collect massive monitoring data every day to ensure the stable operation of the cluster. Although OpenSDB +HBase was...
Hologres (Chinese name) interactive analysis is ali cloud from the research on the number of one-stop real-time warehouse, the cloud native system combines real-time service...
However, it is difficult to build a perfect database if we are limited to using Hive without considering performance issues, so Hive performance tuning is...
Without further ado, Cloudera Manager has installed CDH5.x. (image-1c6abb-1513138023093)] HDFS management interface and HBase Web UI [image-1c6abb-1513138023093]] (image-82c66f-1513138023093) Hive HiveServer2 Web UI...(image-82c66f-1513138023093)
ETL (Extract-Transform-Load) is an abbreviation of Extract-Transform-Load (Extract-Transform-Load). For data developers, we often encounter a variety of data processing, transformation, and migration, so it is...
In "The Mystifying Parameters for Clustering TdEngine", we show you how to distribute data evenly among the nodes. Next, we will continue to explore with...
Tip ambari: sslhandshakeException: Client Requested Protocol TLSV1 is not enabled or not supported Ambari due to a JRE configuration problem. Modified the JRE configuration of...
This article will discuss how to use the Nested structure in ElasticSearch for data storage, query, and aggregation, and discuss ElasticSearch's solution to the limitation...
The highlighted data, itself a field in the document, is returned to you separately as highlight. ES provides a highlight attribute at the same level...
Most of the following operation and maintenance operations can be visualized on the platform using Logi-Kafka-Manager; @[TOC] 1.topicCommand1.1.Topic create bin/kafka-topics. Sh --create --bootstrap-server localhost:9092 --replication-factor...
Hadoop has been developed for more than 10 years, and the versions have gone through numerous updates and iterations. At present, the major versions of...
Hologres (Chinese name) interactive analysis is ali cloud from the research on the number of one-stop real-time warehouse, the cloud native system combines real-time service...
Reference: Elasticsearch Reference [7.10] » Term-level Queries » Term Query Term = 'Compliable' and 'Compliable' It will not be divided into hand and machine; Then...
Summary: This article translates a series of technical articles by Databricks on the data Lake Delta Lake. It is well known that Databricks dominates many...
SQOOP is an open source big data component that is used to transfer data between Hadoop(Hive, HBase, etc.) and traditional databases (MySQL, PostgreSQL, Oracle, etc.).
TalkingData, A leading independent third-party mobile data service platform in China, has announced that it has completed A Series A funding round led by Northern...
After BosonNLP fully opened the word segmentation and part of speech tacking engine in early September, many friends, especially those engaged in data processing and...
The Client provides commands for managing and accessing the hdfsnameNode, which is a master. The Client provides commands for managing and accessing the hdfsnameNode, which...
OLAP databases with MPP architectures such as Doris typically handle large amounts of data by increasing concurrency. Essentially, Doris's data is stored in a data...
Under the background of digitalization and intelligent transformation, data, as the core means of production of enterprises, is expected to play a greater value. From...