Do not engage in any illegal crawler behavior, the following is only for front-end technology research and learning.

A small friend in the front group said that he crawler climb fast hand, climb to the Korean…. Browser display is normal, but F12 source code is Korean… I wonder how…

As shown in the figure below, there are characters similar to Korean in DOM, but normal data are displayed on the page. As a result, when crawler crawls sensitive data on the page, it gets “Korean” instead of the data we want, so as to protect sensitive data.

Take a look, as shown below, and it’s amazing

But a closer look, hey, is not their own custom font library, fooling who.

Our “Korean” is copied to the web site tool.chinaz.com/tools/unico… To convert the code online,

Hey, hey, I know what’s going on three.

  1. ꯎ껾껾뷝 (First step)
  2. [b ‘\ \ uabce’, b ‘\ \ uaefe’, ‘\ \ uaefe b, b’ \ \ ubddd] (step 2)
  3. [‘4’, ‘0’, ‘0’, ‘1’] (step 3)

Anyway, you have 10 numbers, you go through them once, and then you write your own set of mappings. Every time the “Hangeul” caught by the comparison of conversion and then put into the database, it is done ~