The premise
Spring-actuator collects measurement statistics, uses Prometheus to collect data, and Grafana to display data, which monitors the performance and business data of machines in the generation environment. In general, we call this operation “burying point”. The measurement API in SpringBoot relies on Spring – Actuator integration using Micrometer. The official website is Micrometer.io.
In practice, business developers have been found to abuse Micrometer’s measurement type Counter, resulting in the use of only counting statistics in any case. This article analyzes the role and application scenarios of other metrics apis based on Micrometer.
A measurement library provided by Micrometer
Meter is a set of interfaces used to collect measurements from applications. The word Meter can be translated as “Meter” or “micrometer,” but neither of these sounds quite right, so we’ll call it Meter instead. Meter is created and stored by MeterRegistry, which is understandably the factory and cache center for Meter, and generally every JVM application must create a concrete implementation of MeterRegistry when using Micrometer.
The specific types of Micrometer include Timer, Counter, Gauge, DistributionSummary, LongTaskTimer, FunctionCounter, FunctionTimer, and TimeGauge.
The following sections describe the use methods and actual scenarios of these types in detail. A specific Meter type needs to be uniquely identified by its name and Tag(here referring to the Tag interface provided by Micrometer). The advantage of this method is that it can be marked by name and different tags can be used to distinguish various dimensions for data statistics.
MeterRegistry
MeterRegistry is an abstract class in Micrometer. Its main implementations include:
SimpleMeterRegistry: The latest data for each Meter can be collected into SimpleMeterRegistry instances, but this data is not published to other systems, meaning that the data is stored in the application’s memory. CompositeMeterRegistry: Multiple MeterRegistry aggregates that internally maintain a list of MeterRegistry. Global MeterRegistry: factory class IO. Micrometer. Core. Instrument. The Metrics of holding a static final globalRegistry CompositeMeterRegistry instance.
Of course, consumers can also inherit MeterRegistry themselves to implement a custom MeterRegistry. SimpleMeterRegistry is suitable for debugging. It can be used as follows: MeterRegistry = new SimpleMeterRegistry(); Counter counter = registry.counter(“counter”); counter.increment();
The internally held MeterRegistry list is empty when the CompositeMeterRegistry instance is initialized. If a Meter instance is added with it, The Meter instance operation is invalid CompositeMeterRegistry Composite = new CompositeMeterRegistry();
Counter compositeCounter = composite.counter(“counter”); compositeCounter.increment(); // <- Actually this step is invalid, but no error is reported
SimpleMeterRegistry simple = new SimpleMeterRegistry(); composite.add(simple); // <- Add the SimpleMeterRegistry instance to the CompositeMeterRegistry instance
compositeCounter.increment(); // <- count success
The global MeterRegistry is much easier to use because all you need to do is manipulate the static methods of the Metrics factory class: metrics.addregistry (new SimpleMeterRegistry()); Counter counter = Metrics.counter(“counter”, “tag-1”, “tag-2”); counter.increment();
Tag and Meter naming
In Micrometer, the Meter naming convention uses the English comma (dot). ) Separate words. However, different monitoring systems may have different naming conventions. If the naming conventions are inconsistent, the new monitoring system may be damaged during system migration or switchover.
The naming rules of words separated by commas in Micrometer can be converted through NamingConvention, which is the underlying NamingConvention interface. The naming rules can be adapted to different monitoring systems, and the names and marks with special characters that are not allowed by the monitoring system can be eliminated. Developers can also override NamingConvention to implement custom naming conversions: registry.config().namingConvention(myCustomNamingConvention); .
In Micrometer, there is a default conversion for some of the main monitoring or storage system naming conventions, such as: MeterRegistry registry =… registry.timer(“http.server.requests”);
For different monitoring systems or storage systems, the name is automatically changed as follows:
Prometheus – http_server_requests_duration_seconds. The Atlas – httpServerRequests. Graphite – HTTP. Server requests. InfluxDB – http_server_requests.
The NamingConvention already provides five default conversion rules: dot, snakeCase, camelCase, upperCamelCase, and Slashes.
In addition, Tag is an important function of Micrometer. Strictly speaking, only when a measurement framework realizes the function of Tag can it truly collect measurement data in multiple dimensions. The name of the Tag generally needs to be meaningful, which means that the dimension or type of measurement can be inferred from the name of the Tag.
Assuming that we need to monitor database calls and Http request call statistics, the general recommendation is: MeterRegistry registry =… registry.counter(“database.calls”, “db”, “users”) registry.counter(“http.requests”, “uri”, “/api/users”)
Thus, when we select a counter named “database.calls”, we can further select groups “DB” or “users” to count the contribution or composition of different groups to the total number of calls. An inverse example is: MeterRegistry registry =… registry.counter(“calls”, “class”, “database”, “db”, “users”);
registry.counter(“calls”, “class”, “http”, “uri”, “/api/users”);
The counter obtained by naming “calls” is basically unable to be grouped for statistical analysis due to the confusion of tags. At this time, it can be considered that the statistical data of the time series obtained is meaningless. It is possible to define a global Tag that will be attached to all meters (as long as the same MeterRegistry is used). The global Tag can be defined as: MeterRegistry =… registry.counter(“calls”, “class”, “database”, “db”, “users”);
registry.counter(“calls”, “class”, “http”, “uri”, “/api/users”);
MeterRegistry registry = … registry.config().commonTags(“stack”, “prod”, “region”, “us-east-1”); Registry.config ().commontags (Arrays. AsList (tag.of (“stack”, “prod”), tag.of (“region”, “US-east-1 “))); Used as above, you can do multi-dimensional in-depth analysis through the host, instance, region, stack, and other operating environments.
Two more things to note:
1. The value of Tag must not be null. 2, Micrometer, the Tag must come in pairs, namely the Tag must be set to an even number of, in fact they exist in the form of Key = Value, specific can see IO. Micrometer. Core. The instrument. The Tag interface: public interface Tag extends Comparable { String getKey();
String getValue();
static Tag of(String key, String value) {
return new ImmutableTag(key, value);
}
default int compareTo(Tag o) {
return this.getKey().compareTo(o.getKey());
}
Copy the code
}
Of course, MeterFilter can be used when you need to filter necessary tags or names for statistics or whitelist Meter names. MeterFilter itself provides a series of static methods, multiple MeterFilters can be stacked or formed into a chain to achieve the user’s final filtering strategy. For example: MeterRegistry registry =… registry.config() .meterFilter(MeterFilter.ignoreTags(“http”)) .meterFilter(MeterFilter.denyNameStartsWith(“jvm”));
Means to ignore the “HTTP” tag and reject Meter names that begin with a string of “JVM”. See the MeterFilter class for more usage.
The combination of Meter naming and Meter Tag, with the name as the axis and Tag as the multi-dimensional element, can enrich the dimension of measurement data and facilitate statistics and analysis.
Meters
The meters mentioned above include Timer, Counter, Gauge, DistributionSummary, LongTaskTimer, FunctionCounter, FunctionTimer, and TimeGauge. Here’s a look at each of them and how I understand them to be used in real life scenarios (production environments, I should say).
Counter
Counter is a simpler Meter that is a single-valued measure type, or a single-valued Counter. The Counter interface allows the user to count with a fixed value that must be positive. To be precise: a Counter is a single-valued Counter with positive increments. Here’s a very simple use example:
Usage Scenarios:
The function of Counter is to record the total amount or value of XXX, which is suitable for some increasing types of statistics, such as placing orders, times of payment, total AMOUNT of Http requests, etc. Different scenarios can be distinguished by Tag. For placing orders, different tags can be used to mark different business sources or divide them by date. For total Http request records, you can use tags to distinguish between different urls. // entity @data public class Order {
private String orderId;
private Integer amount;
private String channel;
private LocalDateTime createTime;
Copy the code
}
public class CounterMain {
private static final DateTimeFormatter FORMATTER = DateTimeFormatter.ofPattern("yyyy-MM-dd"); static { Metrics.addRegistry(new SimpleMeterRegistry()); } public static void main(String[] args) throws Exception { Order order1 = new Order(); order1.setOrderId("ORDER_ID_1"); order1.setAmount(100); order1.setChannel("CHANNEL_A"); order1.setCreateTime(LocalDateTime.now()); createOrder(order1); Order order2 = new Order(); order2.setOrderId("ORDER_ID_2"); order2.setAmount(200); order2.setChannel("CHANNEL_B"); order2.setCreateTime(LocalDateTime.now()); createOrder(order2); Search.in(Metrics.globalRegistry).meters().forEach(each -> { StringBuilder builder = new StringBuilder(); builder.append("name:") .append(each.getId().getName()) .append(",tags:") .append(each.getId().getTags()) .append(",type:").append(each.getId().getType()) .append(",value:").append(each.measure()); System.out.println(builder.toString()); }); } private static void createOrder(Order Order) {Metrics. Counter ("order.create", "channel", order.getChannel(), "createTime", FORMATTER.format(order.getCreateTime())).increment(); }Copy the code
}
Console output name:order.create,tags:[tag(channel=CHANNEL_A), tag(createTime=2018-11-10)],type:COUNTER,value:[Measurement{statistic=’COUNT’, Name: value = 1.0}] order. Create, tags: [tag (channel = CHANNEL_B), tag(createTime=2018-11-10)],type:COUNTER,value:[Measurement{statistic=’COUNT’, Value =1.0}] The above example uses the global static method factory class Metrics to construct Counter instances. In fact, IO. Micrometer. Core. Instrument. Provides an internal Counter interface Builder class Counter. The Builder to instantiate the Counter, Counter. The Builder use is as follows: public class CounterBuilderMain {
Public static void main(String[] args) throws Exception{Counter Counter = Counter. Builder ("name") // Name .baseUnit("unit") // Base unit.description ("desc") // description. Tag ("tagKey", "tagValue") // tag. Register (new SimpleMeterRegistry()); // bind MeterRegistry counter.increment(); }Copy the code
}
FunctionCounter
FunctionCounter is a specialized type of Counter that abstracts Counter increments to the interface type ToDoubleFunction, which is the JDK1.8 specialized type interface for Function.
FunctionCounter is used in the same way as Counter. Here is how it is used: public class FunctionCounterMain {
public static void main(String[] args) throws Exception { MeterRegistry registry = new SimpleMeterRegistry(); AtomicInteger n = new AtomicInteger(0); AtomicInteger::get FunctionCounter. Builder (" FunctionCounter ", n, new ToDoubleFunction<AtomicInteger>() { @Override public double applyAsDouble(AtomicInteger value) { return value.get(); } }).baseUnit("function") .description("functionCounter") .tag("createOrder", "CHANNEL-A") .register(registry); // n.ncrementAndGet (); n.incrementAndGet(); n.incrementAndGet(); }Copy the code
}
One of the obvious benefits of using FunctionCounter is that we don’t need to be aware of the existence of an instance of FunctionCounter, we actually just need to manipulate the AtomicInteger instance as one of the FunctionCounter instance building elements, This approach to interface design can be seen in many frameworks.
Timer
Timer is used to record the execution time of short events and display the sequence and occurrence frequency of events through time distribution. All Timer implementations record at least the number of events that occurred and the total time of those events to generate a time series.
The base unit of the Timer depends on the server metrics, but in practice we don’t need to worry too much about the base unit of the Timer because Micrometer automatically selects the appropriate base unit when storing the generated time series. The common methods of a Timer interface are as follows: Public Interface Timer extends Meter {… void record(long var1, TimeUnit var3);
default void record(Duration duration) { this.record(duration.toNanos(), TimeUnit.NANOSECONDS); } <T> T record(Supplier<T> var1); <T> T recordCallable(Callable<T> var1) throws Exception; void record(Runnable var1); default Runnable wrap(Runnable f) { return () -> { this.record(f); }; } default <T> Callable<T> wrap(Callable<T> f) { return () -> { return this.recordCallable(f); }; } long count(); double totalTime(TimeUnit var1); default double mean(TimeUnit unit) { return this.count() == 0L ? 0.0d: this.totaltime (unit)/(double)this.count(); } double max(TimeUnit var1); .Copy the code
}
In fact, the more common and convenient methods are several functional interface input methods: Timer Timer =… timer.record(() -> dontCareAboutReturnValue()); timer.recordCallable(() -> returnValue());
Runnable r = timer.wrap(() -> dontCareAboutReturnValue()); Callable c = timer.wrap(() -> returnValue());
Usage Scenarios:
Based on personal experience and practice, the conclusions are as follows:
Records the execution time of the specified method for display. The execution time of some tasks is recorded to determine the rate of some data sources, such as the consumption rate of message queue messages.
Public class TimerMain {public class TimerMain {public class TimerMain {
private static final Random R = new Random(); static { Metrics.addRegistry(new SimpleMeterRegistry()); } public static void main(String[] args) throws Exception { Order order1 = new Order(); order1.setOrderId("ORDER_ID_1"); order1.setAmount(100); order1.setChannel("CHANNEL_A"); order1.setCreateTime(LocalDateTime.now()); Timer timer = Metrics.timer("timer", "createOrder", "cost"); timer.record(() -> createOrder(order1)); } private static void createOrder(Order order) { try { TimeUnit.SECONDS.sleep(R.nextInt(5)); Catch (InterruptedException e) {//no-op}}Copy the code
}
In a real production environment, spring-AOP can be used to abstract the logic of logging method time consumption into an aspect, thus reducing unnecessary redundant template code. The example above is to construct a Timer instance through Mertics, but you can actually construct it using Builder: MeterRegistry registry =… Timer Timer = timer.builder (“my.timer”).description(“a description of what this Timer does”) // Optional. Tags (“region”, “Test “) // Optional. Register (registry);
In addition, the use of Timer can also be based on its internal class timer.sample, through the start and stop methods to record the execution time between the two logic. Example: timer.sample Sample = timer.start (registry);
Response Response =…
sample.stop(registry.timer(“my.timer”, “response”, response.status()));
FunctionTimer
FunctionTimer is a specialized type of Timer that provides two monotonically increasing functions (which are not monotonically increasing, but generally need to remain the same or not decrease over time) : a function for counting and a function for recording the total call time. Its constructor takes the following inputs: public interface FunctionTimer extends Meter { static Builder builder(String name, T obj, ToLongFunction countFunction, ToDoubleFunction totalTimeFunction, TimeUnit totalTimeFunctionUnit) { return new Builder<>(name, obj, countFunction, totalTimeFunction, totalTimeFunctionUnit); }… }
Examples in the official documentation are as follows: IMap cache =… ; More ().timer(“cache.gets.latency”, Tags. Of (“name”, cache.getname ()), cache, C -> c.getLocalmapStats ().getGetOperationCount(), // is actually a cache method, C -> c.getLocalMapStats().getTotalGetLatency(), timeUnit.nanoseconds);
ToDoubleFunction ToDoubleFunction ToDoubleFunction ToDoubleFunction ToDoubleFunction ToDoubleFunction ToDoubleFunction ToDoubleFunction ToDoubleFunction ToDoubleFunction Public class FunctionTimerMain {public class FunctionTimerMain {
Public static void main(String[] args) throws Exception {public static void main(String[] args) throws Exception { AtomicLong totalTimeNanos = new AtomicLong(0); AtomicLong totalCount = new AtomicLong(0); FunctionTimer.builder("functionTimer", holder, p -> totalCount.get(), p -> totalTimeNanos.get(), TimeUnit.NANOSECONDS) .register(new SimpleMeterRegistry()); totalTimeNanos.addAndGet(10000000); totalCount.incrementAndGet(); }Copy the code
}
LongTaskTimer
LongTaskTimer is also a specialized type of Timer, which is mainly used to record the duration of a task that has been executed for a long time. The monitored events or tasks are still running before the task is completed, and the total elapsed time of the task execution will be recorded when the task is completed.
LongTaskTimer is suitable for recording the duration of events that run for a long time, such as relatively time-consuming scheduled tasks. In Spring applications, you can simply use the @scheduled and @timed annotations to record the total time of a Scheduled task based on Spring-AOP: Timed(value = “aws. Scrape “, longTask = true) @timed (fixedDelay = 360000) void scrapeResources() {Timed(value = “aws. Scrape “, longTask = true) @timed (fixedDelay = 360000) void scrapeResources() {
Of course, it is also convenient to use LongTaskTimer in non-Spring architectures: public class LongTaskTimerMain {
public static void main(String[] args) throws Exception{ MeterRegistry meterRegistry = new SimpleMeterRegistry(); LongTaskTimer longTaskTimer = meterRegistry.more().longTaskTimer("longTaskTimer"); Longtasktimer.record (() -> {// here write Task logic}); Metrics.more().longtaskTimer ("longTaskTimer").record(()-> {}); }Copy the code
}