In fact, to try to understand it fully, this is one of the most complex processes we’ve ever seen, let alone how to reproduce it. Creating a machine that can see around us like we do is extremely difficult, not only because computers are hard to imitate, but also because we don’t fully understand how humans do it ourselves.
That happens roughly like this: the image of the ball passes through the eyeball and lands on the retina: some basic analysis is done and sent to the brain (where the visual cortex thoroughly analyzes the image). It then sends it to the rest of the cortex, compares it to everything else it knows, categorizes it by object and dimension, and responds: Raise your hand, grab the ball (having predicted its path). The whole process takes less than a second, with almost no conscious involvement and no mistakes. Thus, reconstructing human vision is not a single problem, but a set of problems, each of which is related to the other.
Of course, no one said it would be easy. Except for that A.I. pioneer: Marvin Minsky, who in 1966 instructed a graduate student to connect a camera to a computer and describe what it saw. Poor kid: Fifty years later, we’re still doing it.
see
An image sensor in a digital camera
In other words, without software, the capabilities of hardware are very limited, and that’s the biggest problem. But modern photography does offer an alternative direction.
describe
Neurons fire with each other when they move quickly at a certain Angle or in a certain direction. Advanced networks aggregate these into meta-patterns: a circle, moving upwards. The other network is made up of circles that are white with red lines. Another: It’s getting bigger. An image is thus assembled from these crude but complementary descriptions.
The visual areas of the brain use a pattern called a histogram of directional gradients to find edges and other features
It’s hard to come up with a definition that explains how the brain works, let alone models it.
The image shown above (from Purdue University’s Electronics Lab) shows that:
understand
Artificial intelligence and control
The future of computer vision lies in integrating the concrete and powerful systems that have been created with broader systems.
This is where the cutting edge of computer science meets artificial intelligence more generally, and it’s an area we’re working on. Computer scientists, engineers, psychologists, neurologists and philosophers have failed to find any definition of how the brain works, and simulation has been left out of the discussion.
But that doesn’t mean we’re desperate. The future of computer vision lies in integrating the powerful but concrete systems we create with broader systems that are more focused on conceptual understanding: context, attention, intention, etc.
The author | Devin ColdeweyThe original address