One, foreword

In the microservice technology system, the technologies needed to build the microservice architecture have been listed.

This article is just a list of the technologies needed to build microservices, and is an overview of the technologies needed for microservices. But it doesn’t explain exactly how to build a microservice. What needs to be done? This article will address these issues.

Second, build microservices

2.1 Basic Process

The basic flow

Typically when you build a simple microservice, you have one client, one server, and one registry.

  1. Write server-side procedures, when the service is started, it will register its services to the registry
  2. Write a client program, when the client starts, will go to the registry to find the services they need
  3. When the client finds the service, it invokes the service provided by the server

These three steps build a simple microservice.

In the above three steps, what should be paid attention to? Define the service.

The service definition

How do YOU define a service? And all kinds of languages are common, all kinds of development can use a language. This is IDL.

One of the simplest request models in microservices is the request <-> reply. So how do you define the data format of the request part and the data format of the return part?

The giants are ready for us, protocol buffers and thrift. These are the two most commonly used IDL languages.

Select Protocol Buffers here. Protocol Buffers is a lightweight structured data storage format, language – and platform-independent, extensible serialization format. Its performance and efficiency are much better than JSON or XML. Because it is binary storage, it is less readable.

For example, define a message format for a service, request, or response:

// Define a service
service Simple{
    rpc SimpleService (SimpleRequest) returns (SimpleResponse){};
}

// Define send request information
message SimpleRequest{
    string data = 1;
}

// Define response information
message SimpleResponse{
    int32 code = 1;
    string value = 2;
}
Copy the code

Then using the protoc command can generate the corresponding language program. Serialize the data.

I use Go a lot, so I’m going to generate Go.

Service Transport protocol

Once the service is defined and the command is serialized, how to transfer the data out?

Whether to use TCP directly or application layer protocol. The application layer protocol includes HTTP and can be customized. Without special circumstances, HTTP is the preferred protocol, and now HTTP2 is available. For example, THE RPC framework gRPC uses the HTTP2 protocol.

2.2 Service Registration/Discovery

In the basic 3 process above, there is a service registry. What does this do?

A corporate business is developed with a microservices architecture. So there are multiple microservices to provide services, so many microservices, how to store? Are clients better able to discover and use these services?

  1. The service is easily searched by the client
  2. The service changes and the client can be notified

Service registries were created to provide these services for both clients and servers. The server can register the service in the service registry, the client can subscribe to the service, and the client can be notified of changes.

You can cache the list of services locally and provide services if you are disconnected from the service center network.

Common registry software:

  1. etcd
  2. consul
  3. zookeeper

I usually use ETCD, which is also used by K8S.

2.3 Service Monitoring

Why monitoring? When a running service fails, IT needs to be the first to know that the service is broken and fix the failure, rather than wait until users can no longer use the service. In that case, the business may suffer, the product experience will decline, and users will turn off. Therefore, service monitoring is a necessary measure to proactively notify the relevant developers when encountering AN IT failure, so that the developers can proactively repair the failure.

In fact, monitoring should be the infrastructure of IT, whether in the past monolithic applications, or later microservices architecture, monitoring is an indispensable part.

The most commonly used monitoring tool is Prometheus, a software product from CNCF.

Of course, there are many other monitoring software, you can look at, compare, and then choose the monitoring software suitable for enterprise business.

2.4 Service Link Tracing

With the above service monitoring, why service link tracing?

If you are a monolithic architecture, when you encounter a failure, service monitoring can solve the problem. The architecture is simple with fewer service invocation layers.

If you are a service architecture, a micro service may call several of the other services, and could you call the service also calls the other services, such a complex call link relations, when the service is something wrong with the screen error to screen more than one service, perhaps this message service on other services also need screen, As well as the entire call link relationship of this service, all need to be checked.

This is where link tracing comes in handy.

Now the distributed link tracking system, are from The Google Dapper paper and come.

Check out this Dapper paper: Dapper. PDF

2.5 Service Governance

Why service governance?

Because a series of problems will occur during the operation of the microservice API, which are as follows:

  1. The caller and the called, the called is temporarily unavailable because of the network, what to do?
  2. What if a microservice suddenly gets a surge in traffic that affects the health of the microservice associated with its invocation, for example slowing down other service invocations?
  3. What if the traffic continues to increase so that microservice A becomes unavailable, and the B and C services associated with the call become unavailable?
  4. What if a server for a service is down, delayed, or unavailable?

. .

And so on some of the microservices will run in the process of problems. To address these issues, several approaches to service governance have emerged.

Service timeout/retry

Service timeout: If there is no response within a specified period of time, the request is considered timeout. Identify a timeout and proceed further. Retry, for example, or log the error directly.

Service retry: Service invocation fails or times out due to network instability. In this case, you can retry the request again to increase the success rate of invocation.

Typically, service timeout and service retry are used together.

Service current limiting

First, if the traffic of an API becomes heavy and affects other services, you can limit the traffic. This can solve the problem of large access.

Second: the service has a limited number of requests for a particular request, such as 100 requests per second.

Service fusing

A circuit breaker is the shutting down of one service so that other services are not affected.

If the traffic is very large, or the system is abnormal or the Load is too high and temporarily unavailable, the microservice API request will fail. If the failure rate exceeds the specified threshold, the circuit breaker process can be carried out to protect other microservice apis.

This also has a graphic name: circuit fuse. Daily circuit switch, is to use this mode of operation, the current is too large, will trip, and then can be manually restored.

Service degradation

When the volume of access becomes large and exceeds the threshold set, some non-critical, non-core services can be downgraded and their external services suspended. Protect core API access from being affected.

Depend on the isolation

Why quarantine? It is also to ensure that one service is unavailable without affecting other services.

For example, service A calls service B, and then service B calls service C. If the access quantity of C becomes large or the machine is down, and SERVICE B cannot access or hang by calling service C, then the resource consumption of SERVER B will increase. If it continues to increase to unusable, then this API service may become unserviceable. To avoid this, the C service can be isolated. This prevents cascading faults.

This Isolation mode also has a special mode: Bulkhead Isolation.

In the shipbuilding industry, this model is often used to insulate cabins, using bulkheads to insulate different cabins, so that if a cabin breaks and floods, only one cabin can be lost and the others can not be affected. This model is also used in the software industry, based on shipbuilding experience.

Bulkhead mode:

Isolate critical resources for each workload or service, such as connection pools, memory, and CPU. The use of bulkheads avoids a scenario where a single workload (or service) consumes all resources, causing other services to fail. This pattern increases the resiliency of the system by preventing cascading failures caused by one service.

fallback

Emergency mode, backup mode, fallback mode, are not very good.

When service timeout, retry, fusing, or traffic limiting occurs, you can provide a rollback mechanism to recover the service in a timely manner and not affect user experience. Common rollback policies are as follows:

  1. Custom processing:

In this scenario, default data and locally cached data can be used for temporary support, or backup services can be used to obtain data. This applies to key service processes and scenarios that seriously affect user experience, such as core services such as merchant and product information

  1. fail-silent

Returns a null value or default value. This value is suitable for degraded functions, such as product recommendations. A null value does not affect user experience

  1. Fail-fast Indicates a rapid failure

Directly throws an exception, which is applicable to scenarios where data is not strongly dependent, such as the processing of non-core service timeouts

  1. failsafe

Failure security: Directly ignores exceptions. This mode is usually used in log inhalation operations

  1. failback

The system automatically recovers after a failure. The background records the failed request data and periodically resends the request. Usually used for message notification scenarios

  1. failover

Failure to switch automatically, when failure occurs, retry other servers

2.6 the gateway gateway

Why is a gateway needed and what does it do?

The microservices Gateway provides an entry point to the entire microservices API request. The back-end API services can be isolated and aggregated to uniformly process user requests.

Gateway implements some of the above flow limiting, fusing, monitoring, and other functions:

  1. Blacklisting mechanism
  2. Unified login authentication/authentication
  3. Cross-domain problem
  4. Log to intercept
  5. Access control
  6. Service governance some functions: current limiting, fusing, etc
  7. Load balancing
  8. Gray released

Gateway software:

  • kong
  • apisix
  • nginx+lua

The above are all gateways built on nginx + Lua.

Traefik Sping Cloud Gateway, etc.

2.7 Microservices framework

Another important framework is one for writing microservices. The dominant languages for building microservices today are Go and Java.

Go’s microservices framework:

  • go-kit
  • go-chassis
  • go-zero

Java microservices framework:

  • dubbo
  • Spring Cloud

2.8 Configuration Center

Finally, as the number of microservices increases, configuration files become more and more, such as database configuration, server configuration, distribution rules configuration, etc., at this time you may need to set up a configuration center.

Common configuration centers:

  • Apollo
  • Nacos

Iii. Reference:

  • Service fault tolerance pattern
  • Recovery mode
  • azure microservices
  • azure gateway