When people transcend their current state, imagination tends to break free of its equilibrium.
In 1905, Einstein rejected absolute space and time, triggering three revolutions in physics. Yang once said, “Einstein didn’t miss the point because he had a freer view of time and space. And to have a free eye, one must be able to look at the same subject both near and far.”
In 2021, the Alibaba Cloud Video Cloud Panorama Innovation Summit tries to take both the close-up and the distant view, observing the topics of this hypervideo era in panorama.
What kind of times is this?
This is the age of hypervideo.
Video turns flowing words and images into the language of the times. Video encapsulates emotions, positions, horizons and thinking in three dimensions. Video breaks through and extends across both time and space.
Video is a natural science, encompassing words, images, audio, space, gravity, humanity and emotion. It presents a picture of a world without boundaries; it expresses freedom and creates new freedom.
In the era of hypervideo, video has given rise to new forms and built a brand-new content chain: the so-called hypercontent. Videoization has gradually evolved into human-centered interaction that carries multi-dimensional sensory experience and even transcends time and space: the so-called hyper-interaction. Video turns everything into media, linking people with people, people with things, and people with nature through sensing, and produces a super-social capability and phenomenon: the so-called hyperlink.
Video has become the new language of the times and a new cultural movement of the new century. At the far end of this future, the perceptual boundary between the physical world and the virtual world will blur, and a digital twin of every scene will eventually be realized.
5G, of course, is the catalyst for the evolution of this era, connecting everything. And “cloud + video” is the catalyst of scene innovation, fusing the virtual and the real.
As a result, all content and interactions will fuse in this age.
Where is the end point of content and interaction?
First, the content.
Technology, in all its forms, is first and foremost presenting a world of meaning.
Technology makes meaning and conveys emotion. When bandwidth is limited, people focus on transmitting information; when bandwidth is plentiful, what people transmit through multidimensional information is emotion. In a 2001 interview, Jobs was already hoping for more emotional delivery over the Internet, which the video cloud makes possible today.
If technology empowers content to convey emotion, then a look back at the evolution of content shows a clear thread: from a line of text and a painting to moving images, then to today's ubiquitous live streaming and short video, then to the video presentation of information and knowledge, until content in every scene gradually becomes video and eventually takes forms dominated by 3D, interactive and immersive content. Throughout this evolution, the driving force has been greater density, more dimensions, more senses and topological space.
Today, the field of immersive learning is already within sight. By fully integrating 5G, XR, holographic projection, digital twins and cloud-based network technologies, abstract knowledge can be made visual and concrete, creating a borderless classroom that spans online and offline. Reading the news can become experiencing “spatial news”: virtual, ultra-high-definition, 3D and 360-degree panoramic technologies give people a sense of place and participation, and the news industry faces great disruption as a result. More common is the immersive cultural expo, which combines virtual and augmented reality, holographic projection and intelligent interaction with cultural-tourism IP, forming an emerging industry of immersive, interactive storytelling about everything.
Abroad, immersive concerts are coming to the stage, with Sony partnering with Verizon to launch the Madison Beer immersive VR concert this winter. The experience is said to combine 3D motion capture, volumetric capture and 3D reconstruction technology, and to be built with a game engine. Panasonic has also announced a partnership with Illuminarium Experiences to create a massive immersive entertainment center featuring 46 4K projectors, interactive lidar sensors, and highly customizable spatial audio.
On closer inspection, the forms of immersive content leave endless room for imagination. In terms of content form, we can trace a linear growth path from physical immersion to virtual immersion, to mixed virtual and real immersion, and then to ubiquitous intelligent immersion. The content form at the end of that path will reconstruct experience through interaction across every scene, bringing each of thousands of people content that is unique to them.
Next, interaction.
In The Course of Science, it is said that “a revolutionary change in modern thought lies in the shift from a finite and closed world to an infinite universe.” The same holds when we look closely at the evolution of interaction.
From offline to online, every scene is trying to open up space and remove boundaries. Driven by technology and business, human interaction is gradually moving online in every scene, and its final form will also be immersive. It is not hard to see the trend of this evolution: multi-device connection, multi-person sharing, the breaking of spatial limits, and the seamless combination of the virtual and the real. At the end point now visible, human-computer interaction and brain-computer interfaces are the focus of exploration.
Looking back over 60 years of interaction research, the field can be divided into three major periods, and the next decade will focus on human-computer interaction, sensors, online social communication, brain-computer interfaces, and feature recognition.
Source: “Mapping Human-Computer Interaction Research Themes and Trends from Its Existence to Today: A Topic Modeling-Based Review of the Past 60 Years,” International Journal of Human-Computer Interaction.
From the perspective of interaction, information will pass naturally from one interaction object to another, and the digital will coexist with and enhance the physical. Academically, interaction can be divided into physical-digital continuum interaction, implicit interaction, sensing-environment and perceptual interaction, public-space interaction, and virtual and augmented reality interaction. This ultimate immersive interaction aims at a more natural way of interacting, one that frees people's stereoscopic vision, touch and proprioception, so that interaction is no longer limited to two-dimensional visual channels and visual feedback.
Among the new forms of interaction, CES 2021 showcased remote VR manipulation from Pollen Robotics, CareOS's smart mirror for AR beauty and hair styling, and a holographic accessory unveiled by IKIN, a holographic technology firm, which can turn a smartphone or computer screen into a glasses-free 3D display. And of course there is the VR social network that Facebook has been building, an attempt at another kind of life in the virtual world.
According to the 6G vision report recently released by Samsung, highly immersive XR and high-quality mobile holographic experiences will be commonplace in 10 years.
The end point of content and interaction is probably a synthesis of immersive fields, while intelligence has gradually “immersed” us in a pan-immersive era where the virtual and the real fuse. This is not the future; it is happening right now.
Ecosystem supply and the balance of AI
Pulling our gaze back from the future and the evolution of the times, let us look at the existing content ecosystem and the technology that supports it.
Across the full spectrum of video content, the industry chain covers content production, marketing and communication, distribution platforms, playback terminals and technical support, and cloud computing and audio and video technologies strongly support the development of that entire chain.
Driven by the new culture of video consumption, new technologies keep evolving and being applied, and new production modes and content forms keep emerging.
The expansion of this new cultural consumption of video requires, on the one hand, a supply system for digital short video and, on the other, production capacity for ultra-high-definition video, so as to bring the public into the wave of digital content and into a true 8K era.
Ultra-high-definition (UHD) video is a new generational step in video technology, following analog, standard definition and high definition. Alongside 5G and artificial intelligence, it is an important direction for today's new generation of information technology. At present, content production is the weakest link in UHD. The promotion and development of the content service layer plays a decisive role in the commercial adoption of UHD.
AI can play a key role in this. We can think of vision in terms of biology and physics. The biological side is human visual perception, while the physical side covers the various responses to light, including brightness, the detailed description of light, and time-related information.
Here the role of AI falls mainly into two parts. The first and most basic is understanding video or images, including the familiar classification, tagging, detection, segmentation and so on. This is tied to people, because people first have to understand the world. The second relates to production, such as creating, editing, processing and erasing. It is also tied to low-level vision, that is, to enhancement, and how to use AI to empower video at the level of low-level vision is likewise key.
For vision, a very important result of the UHD capabilities that AI provides is a new audio-visual experience, and experience involves many things. The first is richer detail: for example, how to enrich the detail of an object seen at very low resolution or with poor information, especially now that 8K is arriving. The second is more vivid color, in terms of color depth, gamut and brightness, all of which matter greatly to the experience. The third is a more immersive experience: a large field of view, panoramic video and surround sound. It also includes wider application across industries.
As AI drives high definition forward, intelligence is the foundation, and it must adapt to different scenes. AI technology has no so-called universal ability, so scenes as different as cartoons, news figures and biographies call for a well-designed system rather than a single universal model. It is therefore very important to match the best algorithm to each scenario. Intelligent AI technology that is adaptive, high quality and self-evaluating is the focus of DAMO Academy's efforts.
Beyond ultra HD, AI is also greatly improving efficiency in the consumption of hypercontent.
At present, users are spending more and more fragmented time on consumption. Short video has more than 773 million users, and the short video market has exceeded 200 billion yuan. On the content supply side, however, making a high-quality video runs into difficulties in both creative production and tooling, and efficient large-scale output is harder still. Here, Alibaba Digital Media and Entertainment's AI platform delivers five functions through AI research and development: dynamic material extraction, template-based video production, intelligent editing, intelligent material processing, and interactive special effects.
Given its own business characteristics, Alibaba Digital Media and Entertainment hopes, on the platform side, to improve efficiency and promote distribution, creating more and better products and tools for the industry; on the consumption side, to offer users new consumption patterns and new interactive video experiences; and on the industry side, to cooperate with more business-side PGC producers and MCNs.
Now, building on this linkage of technology and ecosystem, Alibaba Cloud Video Cloud is also moving the whole model of media production into a new era: a cloud-integrated intelligent production architecture. This architecture covers four core links (content creation, material management, editing and packaging, and rendering and synthesis) and offers rich functions such as cloud broadcast directing, cloud editing, and AI processing and production. With a cloud-integrated architecture and AI capabilities, content production in the media industry gains more possibilities. This mode of production will greatly reshape the content industry, freeing real content creators from complicated, repetitive labor so that they can create richer content, forms and modes.
The power of video has changed the logic of business
The evolution of the times, the support of technology and the linkage of the ecosystem must ultimately land in business.
In the past, the overall value of the Internet was conventionally measured by traffic. On mobile, the simplest measure was how many devices were reached each month and each week. Now we have to measure by time spent. In just three years, the time users spent across the entire video segment went from 1.6 trillion minutes to 4.8 trillion minutes, a threefold increase. The numbers are staggering.
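To put the time-spent figures in perspective, the short sketch below works out the overall multiple and the implied annual growth rate. The steady-compounding assumption and the function name `growth_summary` are illustrative only; the source gives just the two endpoints and the three-year span.

```python
def growth_summary(start: float, end: float, years: int) -> tuple[float, float]:
    """Return (overall multiple, compound annual growth rate).

    Assumes growth compounds evenly each year, which the source
    does not state; only the endpoints come from the article.
    """
    multiple = end / start
    cagr = multiple ** (1 / years) - 1
    return multiple, cagr


# Figures from the article: 1.6 trillion -> 4.8 trillion minutes in three years.
multiple, cagr = growth_summary(1.6e12, 4.8e12, 3)
print(f"{multiple:.1f}x overall, roughly {cagr:.0%} per year if compounded evenly")
```

Under that assumption, tripling in three years corresponds to growth of a little over 40 percent a year.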
Facing the huge commercial space behind this phenomenon, we must think about how to drive and innovate more.
Video transmission is, at its origin, a carrier for transmitting information. If information transmission is to be classified, one dimension divides it into one-to-one, one-to-many and many-to-many communication, and another divides it into delayed and real-time.
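The two-dimensional classification above can be sketched as a small data model. Everything in this block is illustrative: the type names (`Fanout`, `Timing`, `Scenario`) and the four example scenarios are assumptions chosen for the sketch, not categories or products named in the article.

```python
from dataclasses import dataclass
from enum import Enum


class Fanout(Enum):
    """First dimension: who talks to whom."""
    ONE_TO_ONE = "one-to-one"
    ONE_TO_MANY = "one-to-many"
    MANY_TO_MANY = "many-to-many"


class Timing(Enum):
    """Second dimension: delayed or real-time delivery."""
    DELAYED = "delayed"
    REAL_TIME = "real-time"


@dataclass(frozen=True)
class Scenario:
    name: str
    fanout: Fanout
    timing: Timing


# Hypothetical examples, placed in the grid for illustration only.
SCENARIOS = [
    Scenario("video call", Fanout.ONE_TO_ONE, Timing.REAL_TIME),
    Scenario("live broadcast", Fanout.ONE_TO_MANY, Timing.REAL_TIME),
    Scenario("video on demand", Fanout.ONE_TO_MANY, Timing.DELAYED),
    Scenario("video conference", Fanout.MANY_TO_MANY, Timing.REAL_TIME),
]


def group_by_cell(scenarios):
    """Group scenario names by their (fanout, timing) cell."""
    grid = {}
    for s in scenarios:
        grid.setdefault((s.fanout, s.timing), []).append(s.name)
    return grid


grid = group_by_cell(SCENARIOS)
for (fanout, timing), names in grid.items():
    print(f"{fanout.value:>12} / {timing.value:<9}: {', '.join(names)}")
```

Three fan-out types and two timing types give six cells; any video application can be placed in one of them before deciding which transmission capabilities it needs.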
The carrying capacity of video lets it combine with many industries. We used to speak of the video industry or the video track; at this stage, we believe every field will combine with video. Like cloud computing, video is no longer so much an industry concept as a basic underlying capability of the new Internet economy. With this capability, every industry can innovate on the cloud, on video, and on the video cloud.
The video cloud will become a must-have for industries adopting video, and the technology base of the broader video industry.
As digital and intelligent infrastructure, the video cloud not only significantly lowers the entry threshold for video applications but also promotes the prosperity of the broader video industry by improving industrial efficiency.
From the demand side, the video cloud can give enterprises video capabilities or videoize their products, and lets them use more value-added capabilities in production, processing, transmission and consumption. Live-streaming e-commerce was among the first to feel this profoundly, and the makeup of e-commerce players has changed. There used to be only a few large e-commerce platforms, but video has given content platforms the ability to become e-commerce platforms. Many content platforms, and even startups, now have very large traffic hubs of their own, and a streamer can be a center of e-commerce, which was not the case in the past.
In online education, likewise, many years of exploration failed to fully realize the promise of learning online. The later emergence of live streaming solved some of the industry's immersion problems: students can interact more with teachers, which addresses some problems of learning efficiency. In essence, it is because video solves part of the immersion and effectiveness problem that online education has finally found its way to monetize in the last few years. This analysis of e-commerce and education was given by Xu Fanlei, vice general manager of iResearch Institute.
Beyond e-commerce and education, which currently have the highest video penetration, Internet entertainment in the broad sense, the digital and intelligent transformation of the media industry, and mobile collaborative work in enterprises are also key fields for video cloud technology. Built on video cloud technology, new business scenarios keep opening up, including new e-commerce, new education, new social networking, new finance, new healthcare, and many more industries.
The evolution of the times, the penetration of video and the change in interaction are reshaping the industry's monetization logic, traffic flows and organizational forms.
Alibaba Cloud Video Cloud and iResearch conducted a joint study and released the “2021 China Video Cloud Application Scenario Insight White Paper.” From the perspective of cloud innovation, it presents the full range of video application scenarios and the full chain, analyzes in depth the space, blind spots, opportunities and cases, and offers important practical value for the commercial market of the video cloud track.
Competitions and open source are amplifiers of the social imagination
In the era of hypervideo, the imagination of the video cloud is not limited to business scenarios; it also aims to benefit everyone and create diverse social value.
In February this year, the Global Video Cloud Innovation Challenge kicked off, co-organized by Alibaba Cloud and Intel with Youku as strategic technical partner. It is the industry's first global competition focused on the application and innovation of video cloud technology, hosted by the Alibaba Cloud Tianchi platform and Alibaba Cloud Video Cloud. The preliminary round attracted 4,600 team entries from colleges and universities around the world. Throughout the competition, innovative projects full of social value and new vitality kept emerging, such as a safe-parking project built on visual algorithms and an elderly-care project.
It is worth mentioning that, through cooperation with the Youku platform, the competition provided competitors with a large-scale, high-precision video segmentation data set for training their models, which was eventually refined into an authoritative data set in the field of video segmentation, something very rare. The data set is substantial in volume, covering 180,000 frames and as many as 300,000 annotated video targets, and leads the industry in both annotation accuracy and content breadth. Its content also closely matches real scenes and spans diverse scenarios, which makes it highly valuable for exploration in the video industry.
As an important factor of production in the information age, data is known as a new power source and an important basis for the development of artificial intelligence technology.
Through cooperation with Alibaba Group business teams such as Taobao, Tmall, Aliyun, Youku and AE, and with authoritative external research institutions such as Tsinghua University, Shanghai Jiao Tong University, National Astronomical Observatories of Chinese Academy of Sciences, China Computer Society, Chinese Information Society, Union Medical College Hospital and Ruijin Hospital, the Tianchi competition platform has opened more than 60 rare industry data sets drawn from real business scenarios, covering e-commerce, finance, logistics, medical care, energy and more. This has made an outstanding contribution to cultivating computer vision talent worldwide and has created wider space for more developers.
Technology innovation competitions that unleash surging energy, together with large-scale, authoritative open data sets, enable a more multidimensional social imagination, and the technology that blossoms on this foundation is very exciting.
If you, too, have the leisure to immerse yourself in imagination
In the final analysis, technology, commerce, ecology, resources, everything is for human emotion and experience.
Technology is constantly interpenetrating with many fields, and art is probably the one we most want to touch; it is also the nerve closest to the tenderest feelings in the human heart.
The July 10 “Imagine” Alibaba Cloud Video Cloud Panorama Innovation Summit, from the organizers' point of view, truly starts from imagination, trying to close the distance between people and space through the immersion of the visual channel.
Of course, where science and technology cross over into art, what concerns us most is how aesthetic creation is realized in the digital era.
We find that contemporary artists, drawing on their imagination and interdisciplinary ability, are also working constantly to integrate technology and art. In the era of digital interaction, the artistic acts of creation and communication are being renewed across the board, and profound changes are taking place in the perception, experience and thinking of artistic aesthetics. Aesthetics drives technology, and technology feeds aesthetics.
In the era of digital interaction, the ultimate aesthetic pursuit is the pursuit of professionalism, and behind professionalism lies the creative efficiency and creative ability. Technology is undoubtedly an important tool to facilitate the multi-sensory and multi-dimensional realization of creativity, and AI tools based on deep learning are assisting in this process, giving wings to the creative brain.
Digital intelligence, the reconstruction of visual interaction, and the evolution of experience are also very important. With “cross-border wisdom” as its core, the summit attempts to present new installations for content and interactive experience, such as cartoon drawing based on generative adversarial networks and transfer learning, virtual shooting with real-time rendering, and virtual idols driven by facial and motion capture. All of them are searching for new technological experiences grounded in art and in people.
The above is the limited vision of Alibaba Cloud Video Cloud in the new era; the infinite remainder is still to be imagined.
In the era of hypervideo, the video cloud is everywhere
The video cloud is a new area of interdisciplinary research
It is a cloud integrated digital intelligence capability
The video cloud is the future of human imagination
It is opening up a whole new, unlimited, free world
Where there is imagination, there is the video cloud.
All talks from this Video Cloud Panorama Innovation Summit will be published on the “Video Cloud Technology” official account.