Hello, I’m a side dish. A man who wants to be a man who talks about architecture! If you also want to be the person I want to be, otherwise point a attention to be a companion, let the side dish no longer lonely!
This article focuses on dynamic link tracing in SpringCloud
If necessary, you can refer to it
If there is help, do not forget to like the shadow
Wechat public account has been opened, xiao CAI Liang remember, did not follow the students remember to follow oh!
About the concept of micro service we have made the corresponding opinions in the previous two articles, did not see the small partner can parachute to check some, different opinions welcome background messages!
-
Understanding Microservices – Gateway to Service Gateways
-
Understanding Microservices – The Fault-tolerant Sentinel of Services
So in this post we continue our topic of Understanding Microservices, and continue to learn about a solution for link tracing in distributed systems, Sleuth. Being well-informed can certainly be a gimmick, but it can also be a goal!
We no longer dispute the quality of micro-services, recently saw a lot of negative publicity of the characteristics of micro-services, do not know whether the big data accurate feeding ~ ps: estimated backstage portrait has been labeled as an anti-micro molecule. This is of course a joke, end the joke, we come to micro-service learning brick!
The key to microservices is splitting services, but the key is not the micro, but the right size. The moment we step out of a split service, we are responsible for our split service, which is a programmer’s responsibility! When there is a problem with the service, whether you are still struggling to troubleshoot, but do not know where to locate. We can’t debug the code on the line. When we find the problem, we often have the following dialogue:
In addition to the helpless wry smile, can only blindly groping, finally one day programmer side dish no longer compromise, and strive to solve the problem. Can we have a similar monitoring function, when the request came in, the distribution of an identity token to it, when it is in the service of each service will leave traces of it, when it is a problem, we only need to check the travel to which services, equivalent to draw a trajectory, behavior through the behavior of trajectory tracking service call the question! It’s easy to say, but it’s hard to implement. As the retreat is about to strike, I wonder how it is possible that no one in microservices has thought of this problem, so I walk around microservice components, trying to find the implementation to solve the problem. Finally, Sleuth shows up.
Sleuth
1. Know Sleuth
All this stuff was meant to bring out Sleuth. After all that talk, let’s just describe what it does in one sentence: provide traceability solutions in distributed systems. It borrows a lot from Google Dapper, so is it another shell changer?
Let’s take a look at the terminology and related concepts used in Sleuth first, just to get us more professional and talk architecture
- Trace
A tree structure is formed by concatenating a set of spans with the same Trace Id. In order to implement request tracing, the service tracing framework only needs to create a unique identifier (Trace Id) for the request when it arrives at the entry endpoint of the distributed system, and keep passing this unique value until the entire request is returned when it is not circulating within the system. We can then use this unique identifier to concatenate all requests into a complete request link.
- Span
Represents a set of basic units of work. In order to measure the latency of each processing unit, when a request arrives at each service component, a unique identifier (SpanId) is used to mark its start, process, and end. With the start and end timestamps of the SpanId, we can count the time that the Span was called. In addition, we can get metadata such as the name of the event, request information, and so on
- Annotation
Records events over a period of time. Important comments for internal use are as follows:
-
Client Send (CS) : A Client sends a request to start a request
-
Server Received (SR) : The Server receives the request and begins to process it. Sr-cs = Network latency (time of service invocation)
-
Server Send (SS) : The Server is ready to Send the process to the client. Ss-sr = Request processing time on the server
-
Client Reveived (CR) : The Client receives a response from the server and the request ends. Cr-sr = Total time of requests
After understanding the three concepts, we will know directly below!
2. Master Sleuth
As always, we present our micro service projects:
Simply introduce yourself to a few services
- Store-gateway: indicates the service gateway
- Store-order: Order service
- Store-product: Products and services
Then we need to introduce Sleuth dependencies into the parent project
<dependency>
<groupId>org.springframework.cloud</groupId>
<artifactId>spring-cloud-starter-sleuth</artifactId>
</dependency>
Copy the code
Introduce the dependencies directly in dependencies of the parent project, so that each child service can repeat the dependencies differently
Then we start with our simple code pattern:
Store – order service
OrderController:
ProductService:
Store – the product service
ProductController:
We use Feign to remotely invoke the interface in the product service in the order service, then start the service call and watch the console print:
Notice what I’ve circled. Do you notice anything different? Yes! This string contains information such as the service name/TraceId/SpanId. The sequential calls have a global TraceId that links the calls. By analyzing the microservice log, it is not difficult to see the specific process of the request. One drawback, however, is that as the number of services increases, or the number of logs increases, it is not easy to extract the invocation process of a request from the log file. Is there a plug-in that can be used for full-text retrieval and visual presentation to help solve this problem? Having raised the problem, the side dish has naturally found a way to solve it! That is ZipKin
Use ZipKin
ZipKin is a Twitter open source project based on Google Dapper, which solves the problem we mentioned above: collecting service timing data to solve the latency problem in the microservice architecture, including data collection, storage, search and presentation
We can use it to collect trace data of request links on various servers, and through its REST API interface to assist us to query trace data to implement monitoring program of distributed system, so as to find the problem of delay increase in the system and find the root cause of system performance bottleneck in time!
In addition to our development-oriented API, ZipKin also provides a user-friendly UI component to help us intuitively search for trace information and analyze request link details, such as the processing time of each user request in a given period of time.
Do you see a light in the eye when you hear the UI component, indicating that we can better manage link tracking through the console
Not only that, ZipKin also provides plugable data storage, such as in-Memory, MySQL, Cassandra, and Elasticsearch. There are not too many ways to support, there is always one you like!
Take a look at ZipKin’s infrastructure diagram:
As you can see, ZipKin’s structure is not very complex, consisting of four core components:
- Collector: Collector component. It is used to process trace information discovered by external systems and then convert it to the Span format processed internally by ZipKin for subsequent storage, analysis, presentation, and so on
- Storage: Storage component. This is primarily used to process trace information received by the collector, which is stored in memory by default, and we can modify the storage policy to store it in our other repositories
- RestFul API: INDICATES an API component. An interface used to provide external access, such as tracing information for clients, or access information for external systems
- WebUI: UI components. Based on API components, users can query and analyze trace information conveniently and intuitively through UI components
So ZiPKin is a little bit like Sentinel, which we talked about last time, which is a server and a client.
The client is the application in the microservice. The URL address of the server is configured on the client. Once the invocation between services occurs, it will be listened by the Sleuth listener configured in the microservice, and generate the corresponding Trace and Span information and send it to the server.
1. ZipKin server
The server is a ready-made SpringBoot service, we just need to download the JAR package, can run directly! Download address
Then we can start it from the command line:
Java jar zipkin - server - 2.12.9 - exec. JarCopy the code
After successful startup, visit http://localhost:9411 to view the following page:
At this point, our server has been successfully installed. Then go back to our client integration
2. ZipKin client
We need to introduce ZipKin dependencies into every service
<dependency>
<groupId>org.springframework.cloud</groupId>
<artifactId>spring-cloud-starter-zipkin</artifactId>
</dependency>
Copy the code
Then configure it in the configuration file:
spring:
zipkin:
base-url: http://127.0.0.1:9411/ # Zipkin server address
discoveryClientEnabled: false # Tell nacos to treat it as a URL, not as a service name
sleuth:
sampler:
probability: 1.0 # Percentage of samples
Copy the code
Then we go back to the project, start the project and access the microservice by accessing the interface we’ve defined above, and then take a look at the ZipKin UI
You can see that the request link that we just accessed the microservice has appeared. Click on one of the records to see the specific access route.
By tracing the request link, you can determine which module of the service is taking more time, and you can optimize or Bug it.
3. ZipKin persistence
In ZipKin, link tracing data is stored in memory by default, but this is obviously not suitable for production. Hence the support in ZipKin for persisting link-tracing data to a mysql database or elasticSearch.
MySQL
- First we need to configure the database information
CREATE TABLE IF NOT EXISTS zipkin_spans (
`trace_id_high` BIGINT NOT NULL DEFAULT 0 COMMENT 'If non zero, this means the trace uses 128 bit traceIds instead of 64 bit',
`trace_id` BIGINT NOT NULL,
`id` BIGINT NOT NULL,
`name` VARCHAR(255) NOT NULL,
`parent_id` BIGINT,
`debug` BIT(1),
`start_ts` BIGINT COMMENT 'Span.timestamp(): epoch micros used for endTs query and to implement TTL',
`duration` BIGINT COMMENT 'Span.duration(): micros used for minDuration and maxDuration query'
) ENGINE=InnoDB ROW_FORMAT=COMPRESSED CHARACTER SET=utf8 COLLATE utf8_general_ci;
ALTER TABLE zipkin_spans ADD UNIQUE KEY(`trace_id_high`, `trace_id`, `id`) COMMENT 'ignore insert on duplicate';
ALTER TABLE zipkin_spans ADD INDEX(`trace_id_high`, `trace_id`, `id`) COMMENT 'for joining with zipkin_annotations';
ALTER TABLE zipkin_spans ADD INDEX(`trace_id_high`, `trace_id`) COMMENT 'for getTracesByIds';
ALTER TABLE zipkin_spans ADD INDEX(`name`) COMMENT 'for getTraces and getSpanNames';
ALTER TABLE zipkin_spans ADD INDEX(`start_ts`) COMMENT 'for getTraces ordering and range';
Copy the code
CREATE TABLE IF NOT EXISTS zipkin_annotations (
`trace_id_high` BIGINT NOT NULL DEFAULT 0 COMMENT 'If non zero, this means the trace uses 128 bit traceIds instead of 64 bit',
`trace_id` BIGINT NOT NULL COMMENT 'coincides with zipkin_spans.trace_id',
`span_id` BIGINT NOT NULL COMMENT 'coincides with zipkin_spans.id',
`a_key` VARCHAR(255) NOT NULL COMMENT 'BinaryAnnotation.key or Annotation.value if type == -1',
`a_value` BLOB COMMENT 'BinaryAnnotation.value(), which must be smaller than 64KB',
`a_type` INT NOT NULL COMMENT 'BinaryAnnotation.type() or -1 if Annotation',
`a_timestamp` BIGINT COMMENT 'Used to implement TTL; Annotation.timestamp or zipkin_spans.timestamp',
`endpoint_ipv4` INT COMMENT 'Null when Binary/Annotation.endpoint is null',
`endpoint_ipv6` BINARY(16) COMMENT 'Null when Binary/Annotation.endpoint is null, or no IPv6 address',
`endpoint_port` SMALLINT COMMENT 'Null when Binary/Annotation.endpoint is null',
`endpoint_service_name` VARCHAR(255) COMMENT 'Null when Binary/Annotation.endpoint is null'
) ENGINE=InnoDB ROW_FORMAT=COMPRESSED CHARACTER SET=utf8 COLLATE utf8_general_ci;
ALTER TABLE zipkin_annotations ADD UNIQUE KEY(`trace_id_high`, `trace_id`, `span_id`, `a_key`, `a_timestamp`) COMMENT 'Ignore insert on duplicate';
ALTER TABLE zipkin_annotations ADD INDEX(`trace_id_high`, `trace_id`, `span_id`) COMMENT 'for joining with zipkin_spans';
ALTER TABLE zipkin_annotations ADD INDEX(`trace_id_high`, `trace_id`) COMMENT 'for getTraces/ByIds';
ALTER TABLE zipkin_annotations ADD INDEX(`endpoint_service_name`) COMMENT 'for getTraces and getServiceNames';
ALTER TABLE zipkin_annotations ADD INDEX(`a_type`) COMMENT 'for getTraces';
ALTER TABLE zipkin_annotations ADD INDEX(`a_key`) COMMENT 'for getTraces';
ALTER TABLE zipkin_annotations ADD INDEX(`trace_id`, `span_id`, `a_key`) COMMENT 'for dependencies job';
Copy the code
CREATE TABLE IF NOT EXISTS zipkin_dependencies (
`day` DATE NOT NULL,
`parent` VARCHAR(255) NOT NULL,
`child` VARCHAR(255) NOT NULL,
`call_count` BIGINT
) ENGINE=InnoDB ROW_FORMAT=COMPRESSED CHARACTER SET=utf8 COLLATE utf8_general_ci;
ALTER TABLE zipkin_dependencies ADD UNIQUE KEY(`day`, `parent`, `child`);
Copy the code
- Specify mysql information at startup
Java -dstorage_type =mysql -dmysql_host =127.0.0.1 -dmysql_tcp_port =3306 -dmysql_db =zipkin -dmysql_user =root - DMYSQL_PASS = root - jar zipkin - server - 2.12.9 - exec. JarCopy the code
ElasticSearch
- Start the ES
- Specify ES information at startup
Java -jar zip-server-2.12.9-exec. jar --STORAGE_TYPE=elasticsearch -- es-host =localhost:9200Copy the code
END
Here we have completed the solution of service link tracking in distributed system! What does that solve the problem for us?
- An EXCEPTION occurs on an API call across a microservice, quickly locating the problem.
- Performance bottlenecks occur in API calls across microservices, and the performance bottlenecks are quickly identified.
Don’t talk, don’t be greedy and lazy, and side dish together to do a blowing cow X to do the framework of the program ape ~ point a concern to do a companion, let side dish no longer lonely. See you later!
Today you work a little harder, tomorrow you can say less to beg! I’m a side dish, a man who grew strong with you. 💋 wechat public account has been opened, xiao CAI Liang, did not pay attention to the students remember to pay attention to oh!