Sometimes, interviewers don’t have any preset answers to their questions, and as long as the candidate’s thinking is clear and the logic is consistent, even if the solution may not be optimal, it can still stand out. Do technical solutions, technology selection, must be based on business needs to compromise.
Selection considerations:
- Do you need complex data structure choices? If so, please use Redis as storage, otherwise both Redis and MC can be considered. For simple GET/SET requests with high performance requirements, use MC instead of Redis.
- Does data need to be stored persistently to prevent data loss? If so, please use Redis for data storage, and indicate when applying for the service that it needs to be used as storage rather than cache, and that persistent storage and regular data backup should be enabled.
- Is the Master/Slave mechanism required to ensure service HA? If yes, use Redis to store data. The platform deploys Slave Slave libraries for all Redis services by default to ensure the Master/Slave mechanism.
- Is it a counting service? If Redis is used as the storage, the performance is guaranteed and AOF is enabled to ensure that data will not be lost.
- Do you need multiple sharding for distributed deployment? If so, use Twemproxy for service deployment. When using Redis as the storage, you need to consider the support level of commands. Some Redis commands are not supported by Twemproxy. Otherwise, use Redis or MC on a standalone server as storage, or perform sharding operations on the client. When the platform uses Twemproxy+MC to deploy, it will automatically eliminate abnormal service instances to ensure the quality of the overall service. After eliminating abnormal service instances, some requests will appear MISS. When Twemproxy+Redis is deployed, the Master/Slave mechanism of Redis will be used. If the Master is abnormal, the Slave will be automatically promoted to ensure the service quality.
Typical usage scenarios:
Memcached:
- Session storage: the whole site Session service, mobile WAP Session
- Mobile MAPI services
Redis:
- Counter service
- Queue service: message queue
- Event list data (hashset, which stores all items under a particular event)
- Collection de-duplication: Redis, used by PMS to send SMS services, uses multiple collection de-duplication to prevent the same user from receiving multiple messages and causing harassment.
Memcached
The advantages of Memcached
- Memcached can take advantage of multiple cores, with very high single-instance throughput of hundreds of thousands of QPS (QPS peaks at 4-6W in a daily environment, depending on the size of the key and value bytes and the performance of the server hardware). Suitable for maximum load.
- The session Handle can be directly configured.
- Pit.
Limitations of Memcached:
- Only simple key/value data structures are supported, unlike Redis, which supports rich data types.
- The data cannot be persisted and cannot be backed up. The data can only be used for cache. After the restart, all data is lost.
- Data synchronization cannot occur and data cannot be migrated from an MC to another MC instance.
- Memcached memory Allocation Slab Allocation is used to manage memory. If the value size distribution is different, the memory usage will decrease and the memory will be kicked out even when the memory usage is low. Users need to pay attention to value design.
Redis
Advantages of Redis:
- Support for a variety of data structures, such as String, list, dict, set, zset, hyperloglog
- Supports persistent operations. Aof and RDB data can be persisted to disks for data backup or recovery, which is a good way to prevent data loss.
- Data can be replicated by Replication. The master-slave mechanism enables real-time synchronous Replication of data, including multi-level Replication and incremental Replication. The master-slave mechanism is an important means for Redis to implement HA.
- In single-threaded requests, all commands are executed sequentially, and data consistency is not a concern in concurrent cases.
- Support pub/ SUB message subscription mechanism, can be used for message subscription and notification.
- Simple transaction requirements are supported, but industry usage scenarios are rare and immature.
Limitations of Redis:
- Redis can only use single threads, and the performance is limited by CPU performance, so the maximum CPU of a single instance can reach 5-6Wqps per second (depending on the data structure, data size and server hardware performance, the QPS peak in daily environment is about 1-2W).
- Support for simple transaction requirements, but industry usage scenarios are rare and immature, which is both a strength and a weakness.
- Redis consumes a lot of memory for strings. You can compress storage using dict (hash table) to reduce memory consumption.
Twemproxy
Advantages of Twemproxy distributed scheme:
- Simple distributed solution, the business side can only use an IP/PORT mapping (online deployment using LVS+VIP+VPORT mode), all fragmented data access through Twemproxy internal.
- Both Redis and Memcached can use Twemproxy as their own distributed solution.
- Supports multiple hash algorithms, reducing performance loss.
- Low single point performance and HA problems in the middle tier can be solved by extending Twemproxy (online business usually uses 5-10 TWEMProxies for load balancing and failover through LVS)
- Twemproxy internally supports simple back-end storage failover schemes. For example, if the back-end MC instance is Down, requests routed to this MC instance can be transferred to other MC instances. The internal customized version of Twemproxy, combined with Sentinel cluster, supports failover of the Redis master-slave scheme. If the Redis master is down, the request is routed to the slave.
- You can easily migrate and expand back-end services without relying on DNS or service configuration. All configuration changes can be completed on LVS and Twemproxy, which is transparent to services and convenient for operation and maintenance.
Limitations of Twemproxy distributed scheme:
- When using Redis as back-end storage, many native Redis command requests are limited and Twemproxy itself does not support them.
- Twemproxy is a solution for the middle layer. Adding a layer will increase the delay of service requests. In addition, as the middle layer, Twemproxy itself may become the bottleneck of service, such as CPU or traffic problems.
- As service deployment becomes more difficult, management costs and complexity increase, rapid and simple automatic service deployment and management solutions are required, which increases requirements on O&M personnel.
- Twemproxy distributed middle-layer multi-nodes need the support of LVS. The performance limitation of LVS itself may cause service bottlenecks. The problem of LVS network card packet loss occurred once before, and large-scale optimization and disassembly have been carried out since then.