This article is published by the Cloud + community

Picture theme color in the picture accounted for a large proportion of the page, can cooperate with the picture to play a very good visual effect, give a person a harmonious, consistent feeling. It can also be used in image classification, search recognition and so on. Usually, the extraction of theme color is completed at the back end. The front end provides the images to be processed in the form of links or IDS to the back end, and the back end extracts theme color by running corresponding algorithms, and then returns the corresponding results.

This can satisfy most display class scenarios, but for images that need to be “customized” and “generated” according to the user, this way has a upload image —-> back-end calculation —-> return result time, the waiting time may be relatively long. Therefore, I try to use canvas to extract the theme color of the picture in the front.

First, theme color algorithm

At present, the most commonly used theme color extraction algorithms are: minimum difference method, median segmentation method, octree algorithm, clustering, color modeling and so on. Among them, clustering and color modeling require tuning and regression calculation of extraction functions and samples, feature variables, etc., using Python’s numpy library and scikit-learn machine learning library, it is relatively easy to implement in Python, but there are no mature JS libraries for these two methods. And JS itself is not good at regression calculation of this kind of more complex calculation. I did not do in-depth research, but mainly focused on the previous several color quantization algorithms.

The least difference method is to find the color with the smallest difference in the case of a given color palette. The scene used is relatively small, so I mainly looked at the median segmentation method and octree algorithm, and carried out the practice.

Median segmentation

Median segmentation is usually an algorithm to reduce the bit depth of the image in image processing. It can be used to convert the high level image to the low level image, such as converting the 24bit image to the 8bit image. We can also use it to extract the theme color of the image. The principle is to regard the color of each pixel of the image as a point in a three-dimensional space with R, G and B as the coordinate axis. Since the value range of the three colors is 0~255, the colors in the image are distributed in the color cube, as shown in the following figure.

Then, the longest side in RGB is divided into two from the median of color statistics, so that the two cuboids contain the same number of pixels, as shown in the figure below

Repeat this process until the number of cuboids equals the number of theme colors, and finally take the midpoints of each cuboid.

In practical use, if the cuboids are only cut according to the midpoint, some cuboids will have a large volume but a small number of pixels. The solution is to prioritize cuboids before cutting, and the sorting coefficient is volume * number of pixels. This can basically solve such problems.

Octree algorithm

Octree algorithm is also common in color quantization. The main idea is to put down the values of R, G and B channels row by row after binary conversion to obtain eight columns of numbers. For example, #FF7880 is converted to

R: 1111 1111
G: 0111 1000
B: 0000 0000
Copy the code

After gluing RGB channel column by column, 8 numbers can be obtained, which is the position of the color in the octree, as shown in the figure.

After inserting all the colors, merge them until you get the number of colors you want.

In actual operation, the image pixels need to be traversed and then inserted into the octree, and there are more recursive operations in the insertion process, so it takes longer time than median segmentation.

2. Practice of median segmentation

According to the previous introduction and related materials on the Internet, I pasted the median segmentation code I understood and realized here, and compared the results with the existing magic color correlation algorithm of QQ Music by finding several pictures. Figure 1 is the median segmentation result, and Figure 2 is the result returned by background CGI

FIG. 1

Figure 2

It can be seen that there is a certain difference, but the difference is relatively small, and the processing speed is relatively fast on PC. The three pictures are about 70ms,100ms and 130ms respectively. The code is pasted here and will be analyzed after the subsequent batch processing is compared.

(function () {

    @param {Array} colorRange [[rMin, rMax],[gMin, gMax], [bMin, bMax]] colorRange * @param {any} total number of pixels, ImageData / 4 * @param {any} data Pixel data set */
    function ColorBox(colorRange, total, data) {
        this.colorRange = colorRange;
        this.total = total;
        this.data = data;
        this.volume = (colorRange[0] [1] - colorRange[0] [0]) * (colorRange[1] [1] - colorRange[1] [0]) * (colorRange[2] [1] - colorRange[2] [0]);
        this.rank = this.total * (this.volume);
    }

    ColorBox.prototype.getColor = function () {
        var total = this.total;
        var data = this.data;

        var redCount = 0,
            greenCount = 0,
            blueCount = 0;

        for (var i = 0; i < total; i++) {
            redCount += data[i * 4];
            greenCount += data[i * 4 + 1];
            blueCount += data[i * 4 + 2];
        }

        return [parseInt(redCount / total), parseInt(greenCount / total), parseInt(blueCount / total)];
    }

    // Get the cut edge
    function getCutSide(colorRange) {   // r:0,g:1,b:2
        var arr = [];
        for (var i = 0; i < 3; i++) {
            arr.push(colorRange[i][1] - colorRange[i][0]);
        }
        return arr.indexOf(Math.max(arr[0], arr[1], arr[2]));
    }

    // Cut the color range
    function cutRange(colorRange, colorSide, cutValue) {
        var arr1 = [];
        var arr2 = [];
        colorRange.forEach(function (item) {
            arr1.push(item.slice());
            arr2.push(item.slice());
        })
        arr1[colorSide][1] = cutValue;
        arr2[colorSide][0] = cutValue;
        return [arr1, arr2];
    }

    // Find the color with the median occurrence
    function getMedianColor(colorCountMap, total) {
        var arr = [];
        for (var key in colorCountMap) {
            arr.push({
                color: parseInt(key),
                count: colorCountMap[key]
            })
        }

        var sortArr = __quickSort(arr);
        var medianCount = 0;
        var medianColor = 0;
        var medianIndex = Math.floor(sortArr.length / 2)

        for (var i = 0; i <= medianIndex; i++) {
            medianCount += sortArr[i].count;
        }

        return {
            color: parseInt(sortArr[medianIndex].color),
            count: medianCount
        }

        // Another cut color judgment method, according to the product of the number and difference judgment, after my own test found that the effect is not as good as the median method, but less sorting, performance should be improved
        // var count = 0;
        // var colorMin = arr[0].color;
        // var colorMax = arr[arr.length - 1].color
        // for (var i = 0; i < arr.length; i++) {
        // count += arr[i].count;

        // var item = arr[i];

        // if (count * (item.color - colorMin) > (total - count) * (colorMax - item.color)) {
        // return {
        // color: item.color,
        // count: count
        / /}
        / /}
        // }

        return {
            color: colorMax,
            count: count
        }



        function __quickSort(arr) {
            if (arr.length <= 1) {
                return arr;
            }
            var pivotIndex = Math.floor(arr.length / 2),
                pivot = arr.splice(pivotIndex, 1) [0];

            var left = [],
                right = [];
            for (var i = 0; i < arr.length; i++) {
                if (arr[i].count <= pivot.count) {
                    left.push(arr[i]);
                }
                else{ right.push(arr[i]); }}return__quickSort(left).concat([pivot], __quickSort(right)); }}// Cut the color box
    function cutBox(colorBox) {
        var colorRange = colorBox.colorRange,
            cutSide = getCutSide(colorRange),
            colorCountMap = {},
            total = colorBox.total,
            data = colorBox.data;

        // Count the number of each value
        for (var i = 0; i < total; i++) {
            var color = data[i * 4 + cutSide];

            if (colorCountMap[color]) {
                colorCountMap[color] += 1;
            }
            else {
                colorCountMap[color] = 1; }}var medianColor = getMedianColor(colorCountMap, total);
        var cutValue = medianColor.color;
        var cutCount = medianColor.count;
        var newRange = cutRange(colorRange, cutSide, cutValue);
        var box1 = new ColorBox(newRange[0], cutCount, data.slice(0, cutCount * 4)),
            box2 = new ColorBox(newRange[1], total - cutCount, data.slice(cutCount * 4))
        return [box1, box2];
    }

    // Queue cutting
    function queueCut(queue, num) {

        while (queue.length < num) {

            queue.sort(function (a, b) {
                return a.rank - b.rank
            });
            var colorBox = queue.pop();
            var result = cutBox(colorBox);
            queue = queue.concat(result);
        }

        return queue.slice(0.8)}function themeColor(img, callback) {

        var canvas = document.createElement('canvas'),
            ctx = canvas.getContext('2d'),
            width = 0,
            height = 0,
            imageData = null,
            length = 0,
            blockSize = 1,
            cubeArr = [];

        width = canvas.width = img.width;
        height = canvas.height = img.height;

        ctx.drawImage(img, 0.0, width, height);

        imageData = ctx.getImageData(0.0, width, height).data;

        var total = imageData.length / 4;

        var rMin = 255,
            rMax = 0,
            gMin = 255,
            gMax = 0,
            bMin = 255,
            bMax = 0;

        // Get the range
        for (var i = 0; i < total; i++) {
            var red = imageData[i * 4],
                green = imageData[i * 4 + 1],
                blue = imageData[i * 4 + 2];

            if (red < rMin) {
                rMin = red;
            }

            if (red > rMax) {
                rMax = red;
            }

            if (green < gMin) {
                gMin = green;
            }

            if (green > gMax) {
                gMax = green;
            }

            if (blue < bMin) {
                bMin = blue;
            }

            if(blue > bMax) { bMax = blue; }}var colorRange = [[rMin, rMax], [gMin, gMax], [bMin, bMax]];
        var colorBox = new ColorBox(colorRange, total, imageData);

        var colorBoxArr = queueCut([colorBox], 8);

        var colorArr = [];
        for (var j = 0; j < colorBoxArr.length; j++) {
            colorBoxArr[j].total && colorArr.push(colorBoxArr[j].getColor())
        }

        callback(colorArr);
    }

    window.themeColor = themeColor

})()
Copy the code

Octree algorithm practice

Maybe it is the problem of my algorithm implementation, the final result obtained by the octree algorithm is not ideal, and the time consumed is also much longer than the median segmentation method, the average time is 160ms,250ms,400ms or mainly look at the octree algorithm… I’ll also paste the code

(function () {

    var OctreeNode = function () {
        this.isLeaf = false;
        this.pixelCount = 0;
        this.red = 0;
        this.green = 0;
        this.blue = 0;
        this.children = [null.null.null.null.null.null.null.null];
        this.next = null;
    }

    var root = null,
        leafNum = 0,
        colorMap = null,
        reducible = null;

    function createNode(index, level) {
        var node = new OctreeNode();
        if (level === 7) {
            node.isLeaf = true;
            leafNum++;
        } else {
            // Drop it into the reducible linked list at level 1
            node.next = reducible[level];
            reducible[level] = node;
        }

        return node;
    }

    function addColor(node, color, level) {
        if (node.isLeaf) {
            node.pixelCount += 1;
            node.red += color.r;
            node.green += color.g;
            node.bllue += color.b;
        }
        else {
            var str = "";
            var r = color.r.toString(2);
            var g = color.g.toString(2);
            var b = color.b.toString(2);
            while (r.length < 8) r = '0' + r;
            while (g.length < 8) g = '0' + g;
            while (b.length < 8) b = '0' + b;

            str += r[level];
            str += g[level];
            str += b[level];

            var index = parseInt(str, 2);

            if (null === node.children[index]) {
                node.children[index] = createNode(index, level + 1);
            }

            if (undefined === node.children[index]) {
                console.log(index, level, color.r.toString(2));
            }

            addColor(node.children[index], color, level + 1); }}function reduceTree() {

        // Find the deepest linked list with joinable nodes
        var level = 6;
        while (null == reducible[level]) {
            level -= 1;
        }

        // Take the header and remove it from the list
        var node = reducible[level];
        reducible[level] = node.next;

        // Merge child nodes
        var r = 0;
        var g = 0;
        var b = 0;
        var count = 0;
        for (var i = 0; i < 8; i++) {
            if (null === node.children[i]) continue;
            r += node.children[i].red;
            g += node.children[i].green;
            b += node.children[i].blue;
            count += node.children[i].pixelCount;
            leafNum--;
        }

        / / assignment
        node.isLeaf = true;
        node.red = r;
        node.green = g;
        node.blue = b;
        node.pixelCount = count;
        leafNum++;
    }

    function buidOctree(imageData, maxColors) {
        var total = imageData.length / 4;
        for (var i = 0; i < total; i++) {
            // Add color
            addColor(root, {
                r: imageData[i * 4].g: imageData[i * 4 + 1].b: imageData[i * 4 + 2]},0);

            // Merge leaf nodes
            while(leafNum > maxColors) reduceTree(); }}function colorsStats(node, object) {
        if (node.isLeaf) {
            var r = parseInt(node.red / node.pixelCount);
            var g = parseInt(node.green / node.pixelCount);
            var b = parseInt(node.blue / node.pixelCount);

            var color = r + ', ' + g + ', ' + b;
            if (object[color]) object[color] += node.pixelCount;
            else object[color] = node.pixelCount;
            return;
        }

        for (var i = 0; i < 8; i++) {
            if (null! == node.children[i]) { colorsStats(node.children[i], object); }}}window.themeColor = function (img, callback) {
        var canvas = document.createElement('canvas'),
            ctx = canvas.getContext('2d'),
            width = 0,
            height = 0,
            imageData = null,
            length = 0,
            blockSize = 1;

        width = canvas.width = img.width;
        height = canvas.height = img.height;

        ctx.drawImage(img, 0.0, width, height);

        imageData = ctx.getImageData(0.0, width, height).data;

        root = new OctreeNode();
        colorMap = {};
        reducible = {};
        leafNum = 0;

        buidOctree(imageData, 8)

        colorsStats(root, colorMap)

        var arr = [];
        for (var key in colorMap) {
            arr.push(key);
        }
        arr.sort(function (a, b) {
            return colorMap[a] - colorMap[b];
        })
        arr.forEach(function (item, index) {
            arr[index] = item.split(', ')
        })
        callback(arr)
    }
})()
Copy the code

4. Comparison of results

After running 10,000 images in batches, we got the following results

Average Time comparison (JS-CGI)

Can see pictures without considering load time, with a median segmentation method to extract the relatively short time consuming, and load time of the picture is insuperable barrier (slow down for 450 ms), but the current code has good optimization space, such as sampling interval, reduce image size drawing to the canvas, optimized cutting point searching, etc., You need to explore it a little bit more.

The color deviation

So it seems that the accuracy is still ok, about 76% of the color and CGI extraction results are similar, in more than 100 spot check found that some of the images extracted by the two theme colors have their own characteristics, or equal colors, such as

Five, the summary

In conclusion, the results extracted by the median segmentation method of canvas and CGI are relatively similar, and there are also many pictures with great differences, which need to be continuously optimized in subsequent practice. At the same time, the image loading time is also an insurmountable obstacle, but the current code still has good room for optimization, such as interval sampling, reducing the image size when drawing to canvas, optimizing the cutting point search, etc., which requires further exploration in the future.

Refer to the article

Acm.nudt.edu.cn/~twcourse/C…

Xcoder. In / 2014/09/17 /…

Blog. Rainy. Im / 2015/11/24 /…

Xinyo.org/archives/66…

Xinyo.org/archives/66…

Github.com/lokesh/colo…

Y.qq.com/m/demo/2018…

This article has been published by Tencent Cloud + community authorized by the author

For more fresh technology dry goods, you can follow usTencent Cloud technology community – Cloud Plus community official number and Zhihu organization number