When it comes to poetry, the first thing that comes to mind is tang and Song poems.
Anyone can recite the verse, “Moonlight before my bed, I suspect frost on the ground.”
Poetry, as the classical carrier of ancient literature, concentrates too much of the author’s feelings and thoughts, many of which are worth our taste and learning.
Today, however, instead of ancient poetry, we will talk about modern poetry that is not well known.
Modern poetry, as the name implies, is written by modern people, according to the level and oblique rhyme of poetry.
Because the ancient and modern characters are different, the level and oblique rhyme used has also changed.
Ancient poetry generally uses the rhyme of flat water and the rhyme of ci Forest.
Modern poetry can also be written with ancient rhymes or with Chinese new rhymes (modern rhymes).
If there is no poem written according to the level and oblique, it is called ancient poetry or doggerel. Before the Tang Dynasty, the rules of level and oblique poetry had not yet been formed, so they were not in accordance with level and oblique poetry.
For example, the book of Songs, known to all, is ancient poetry.
As a programmer who loves poetry (and often writes a few crappy poems in song), I wondered if I could do something interesting with code.
As it happens, I saw a Git project on the Internet that day, and a programmer made statistics of the high-frequency words in poems of tang and Song dynasties, which aroused my keen interest.
Git project address:
Data and statistics
The ancient ones are counted. What about the modern ones?
So I spent a little time collecting 160,000 modern poems and 50,000 modern words for data analysis.
Of course, the works are also good and bad, but can ensure that most of the level and oblique patterns are correct, after verification.
Because I am a worker using PHP, the best language in the world, the code is all done in PHP.
For Chinese word segmentation, see my blog post about how to install the SCWS Chinese word segmentation PHP extension
As for the data and interface, I am sorry that it is not convenient to provide, because after all, we need to protect the copyright of authors and corresponding websites.
Then, I made the following website to show you some of the statistics in detail:
Shici. Lovefc. cn (it is best to use computer side, Google core browser access, I did not adapt to the mobile phone side of the access page)
It’s a random modern word from the data. I like it!
And mutter under your breath: If you’re lucky enough to pick up a random poem by someone named Linfeng, it’s a baby.
Although modern poetry is not popular, I think we should not regard it as dross.
Many writers’ poetry is very good, and indeed a way of expressing one’s thoughts, rather than whining.
Look well!