This article is from netease Cloud community

Author: netease Qiyu Android development team

preface

As an enterprise-level intelligent customer service system, netease Qiyu has high requirements for system stability, but it is difficult to ensure that users will not have problems in use. Android SDK is installed on users’ mobile phones, and due to the fragmentation of Android, it is particularly difficult to check the problems of Android SDK. Therefore, it is very important to record user operation logs.

Statement: netease Qiyu only records operation logs to restore problems, and does not record user privacy information.

The initial plan

At the beginning, netease Qiyu recorded logs by writing files directly. When a log was to be written, the file was first opened, then the log was written, and finally the file was closed. The problem with this is that frequent IO operations affect the performance of the program, and qiyu maintains a background process to ensure the timeliness of messages. When one process logs, the other process is locked out and waiting, which makes the problem worse. Using this solution may not seem to have much impact on the program at the moment, but as the log volume increases, more I/O operations will definitely cause performance bottlenecks.

Let’s examine the process of writing directly to a file:

  1. The user initiates a write operation

  2. The operating system searches the page cache a. If no match is found, a page missing exception is generated, and the page cache is created to write the content passed in by the user to the page cache B. If it does, the user’s incoming content is written directly to the page cache

  3. The call of user write is complete

  4. After a page is modified, it becomes a dirty page. The operating system uses two mechanisms to write the dirty page back to disk a. The user manually calls fsync() b. The pdflush process periodically writes dirty pages back to disk

It can be seen that the process of writing data from the program to disk actually involves two copies of data: one is the copy of user space memory to the cache of kernel space, and one is the copy of the cache of kernel space to the disk when writing back. It also involves frequent switching between kernel space and user space when writing back occurs.

SSD storage also has a “write up” problem compared to mechanical hard drives. The problem has to do with the physical structure of SSD storage. After an SSD has been written, the data to be written cannot be directly updated but can be overwritten. Data must be erased before being overwritten. However, the smallest unit of write is Page, and the smallest unit of erase is Block, and Block is much larger than Page. Therefore, when writing new data, you need to first read the data from the Block and the data to be written together, then erase the Block, and finally write the read data to the storage. As a result, the data actually written may be much larger than the data originally written.

I didn’t expect so many operations to be involved in simple file writing, but only transparent to the application layer.

Since there are so many operations per write, can we cache the logs and write them to disk once they reach a certain number?

While this can significantly reduce the number of I/OS, it can cause another, more serious problem — lost logs

The log is cached in memory. When the program crashes or the process is killed, the integrity of the log cannot be guaranteed. Moreover, the order of the log under the multi-process cannot be guaranteed because there are multiple processes in qiru.

A complete log solution needs to be met

  • High efficiency, can not affect the system performance, not because of the introduction of the log module caused by the application lag

  • Ensure log integrity. If log integrity is not guaranteed, log collection is meaningless

  • For multi-process applications, ensure that the final log order is accurate

High performance solution

Since we can’t reduce the number of writes, can we optimize while writing files?

The answer is yes, use MMAP

Mmap is a method of memory-mapping files. It maps a file or other object to the address space of a process to achieve the mapping between the file disk address and a segment of virtual address in the process virtual address space. The function prototype is as follows

void *mmap(void *addr, size_t length, int prot, int flags, int fd, off_t offset);Copy the code

Mmap operations provide a mechanism for user programs to access device memory directly, which is more efficient than copying data to each other in user and kernel space. It is commonly used in applications that require high performance.

In addition, MMAP ensures log integrity. Mmap write back time:

  • Out of memory

  • Process exits

  • Call msync or munmap

  • 30s-60s without MAP_NOSYNC (FreeBSD only)

When a file is mapped, the program will apply for a space of the same size in native memory. Therefore, it is recommended to map a small section of content each time, such as 64K, and then remap the content behind the file when it is full.

With log write performance and integrity issues resolved, how do you ensure log order across multiple processes?

Since Mmap writes data to shared memory, if two processes map a file at the same time, log overwriting is a problem.

Since the order cannot be guaranteed directly, we have to settle for the next best thing: the two processes map to different files, merge once a day, and sort the logs when merging.

Continue to optimize

According to the above scheme, JNI interface was designed, SO was packaged, and SDK was introduced. There seemed to be no problem, but as an SDK, it was not friendly to include SO, which would increase the difficulty of access to a certain extent.

So can we not use so?

In fact, Java already provides an implementation of memory mapping – MappedByteBuffer

MappedByteBuffer is located in the Java NIO package and is used to map file contents to buffers using the MMAP technology. Buffers are created using the FileChannel map method

MappedByteBuffer raf = new RandomAccessFile(file, "rw"); MappedByteBuffer buffer = raf.getChannel().map(FileChannel.MapMode.READ_WRITE, position, size);Copy the code

To test the efficiency of MappedByteBuffer, we wrote 64byte data to memory, MappedByteBuffer and disk files 500,000 times, and counted the time consuming

methods Time consuming
memory 384ms
MappedByteBuffer 700ms
Disk file 16805ms

It can be seen that although MappedByteBuffer is not as good as writing to memory, it has a qualitative improvement compared with writing to disk files.

conclusion

This paper mainly analyzes the problems existing in the way of directly writing files to record logs, and extends the high performance file writing scheme MMAP, which gives consideration to the writing performance and integrity, and ensures the log sequence in multi-process through compensation scheme. Finally, the implementation of memory mapping in the Java layer was discovered, avoiding the introduction of SO.


Netease Cloud Free experience pavilion, 0 cost experience 20+ cloud products!

For more information about NETEASE’s r&d, product and operation experience, please visit netease Cloud Community.


The Ministry of Industry and Information Technology publicized network security demonstration project netease Cloud E-Shield “adaptive DDoS attack depth detection and Defense system” was selected