The function of the ElasticSearch
Distributed search engines and data analytics
-
Search: such as Baidu search, Taobao commodity search, some app internal search, IT system site search.
-
Data analysis: for example, the number of visits to e-commerce websites, the number of new users every day, and which city has more users.
The full text retrieval
-
Structured search: for example, you want to search for Java related books in a bookstore.
-
Data analysis: Analyze the data you search for, such as which books sell the most and which have the highest reviews.
-
Real-time processing: If it takes you an hour to process data, it’s not real-time processing, it’s offline batch processing. Real-time processing only takes seconds.
-
For huge amounts of data in real-time processing: because ES is distributed, automatically spread huge amounts of data to multiple servers, storage and retrieval of nature to reach into the real-time data processing, for example, for example Lucence, it is a standalone application, so can only deal with single server can handle the amount of data.
What scenarios does ElasticSearch work in
-
Wikipedia: Full text search, highlighting, search recommendations and more
-
GitHub: Hundreds of billions of lines of code
-
E-commerce sites: Search for goods
-
Log data analysis: Logstash data collection, ES data analysis and processing
-
Commodity price monitoring site: the user sets the price threshold of a commodity, when the monitored commodity amount is lower than this threshold, it will send a message to inform the user
The characteristics of ElasticSearch
-
It can be used as a large distributed cluster (hundreds of servers) technology, processing petabyte level data, serving large companies, or running on a single machine, serving small companies.
-
ElasticSearch is not a new technology, it is a combination of full text search, data analysis, and distributed technology, which is unique to ES. The implementation of This technology is very old, such as Lucence’s full text search, myCAT, and commercial data analysis software. That’s all there is.
-
Out of the box for the user, very simple, and why is explained in a later section. As small and medium-sized enterprises, directly 3 minutes of deployment can be used in the production environment.
-
As a complement to traditional databases, ElasticSearch provides a lot of functionality that a database can’t provide, such as full text search, synonym processing, relevance ranking, complex data analysis, and near real-time processing of massive data.