10.10.2013

Better robot vision

A neglected statistical tool could help robots better understand the objects in the world around them.

Object recognition is one of the most widely studied problems in computer vision. But a robot that manipulates objects in the world needs to do more than just recognize them; it also needs to understand their orientation. Is that mug right-side up or upside-down? And which direction is its handle facing?

To improve robots’ ability to gauge object orientation, Jared Glover, a graduate student in MIT’s Department of Electrical Engineering and Computer Science, is exploiting a statistical construct called the Bingham distribution. In a paper they’re presenting in November at the International Conference on Intelligent Robots and Systems, Glover and MIT alumna Sanja Popovic ’12, MEng ’13, who is now at Google, describe a new robot-vision algorithm, based on the Bingham distribution, that is 15 percent better than its best competitor at identifying familiar objects in cluttered scenes.

That algorithm, however, is for analyzing high-quality visual data in familiar settings. Because the Bingham distribution is a tool for reasoning probabilistically, it promises even greater advantages in contexts where information is patchy or unreliable. In ongoing work, Glover is using Bingham distributions to analyze the orientation of pingpong balls in flight, as part of a broader project to teach robots to play pingpong. In cases where visual information is particularly poor, his algorithm offers an improvement of more than 50 percent over the best alternatives.

“Alignment is key to many problems in robotics, from object-detection and tracking to mapping,” Glover says. “And ambiguity is really the central challenge to getting good alignments in highly cluttered scenes, like inside a refrigerator or in a drawer. That’s why the Bingham distribution seems to be a useful tool, because it allows the algorithm to get more information out of each ambiguous, local feature.”

Because Bingham distributions are so central to his work, Glover has also developed a suite of software tools that greatly speed up calculations involving them. The software is freely available online for other researchers to use.

In the rotation

One reason the Bingham distribution is so useful for robot vision is that it provides a way to combine information from different sources. Generally, determining an object’s orientation entails trying to superimpose a geometric model of the object over visual data captured by a camera — in the case of Glover’s work, a Microsoft Kinect camera, which captures a 2-D color image together with information about the distance of the color patches.

For simplicity’s sake, imagine that the object is a tetrahedron, and the geometric model consists of four points marking the tetrahedron’s four corners. Imagine, too, that software has identified four locations in an image where color or depth values change abruptly — likely to be the corners of an object. Is it a tetrahedron?

The problem, then, boils down to taking two sets of points — the model and the object — and determining whether one can be superimposed on the other. Most algorithms, Glover’s included, will take a first stab at aligning the points. In the case of the tetrahedron, assume that, after that provisional alignment, every point in the model is near a point in the object, but not perfectly coincident with it.
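To make the superimposition problem concrete, here is a minimal sketch of one standard way to fit one point set onto another when the point-to-point matches are already known: the Kabsch algorithm, which finds the least-squares rotation with a singular value decomposition. It is a textbook technique swapped in for illustration, not the method from Glover and Popovic's paper, and the tetrahedron coordinates, the 30-degree test rotation and the noise level are all assumed values.

```python
import numpy as np

def kabsch_rotation(model, observed):
    """Least-squares rotation taking model points onto observed points.

    Both arguments are (N, 3) arrays in which row i of one corresponds to
    row i of the other. Each set is centered on its own centroid first.
    """
    p = model - model.mean(axis=0)
    q = observed - observed.mean(axis=0)
    u, _, vt = np.linalg.svd(p.T @ q)          # SVD of the 3x3 cross-covariance
    # Flip the last axis if needed so the result is a rotation, not a reflection.
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    return vt.T @ np.diag([1.0, 1.0, d]) @ u.T

# Four corners of a regular tetrahedron centered on the origin (the "model").
model = np.array([[1, 1, 1], [1, -1, -1], [-1, 1, -1], [-1, -1, 1]], dtype=float)

# Hypothetical "object": the model turned 30 degrees about z, plus small noise.
angle = np.radians(30.0)
true_rotation = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                          [np.sin(angle),  np.cos(angle), 0.0],
                          [0.0, 0.0, 1.0]])
rng = np.random.default_rng(0)
observed = model @ true_rotation.T + rng.normal(scale=0.01, size=model.shape)

rotation = kabsch_rotation(model, observed)
aligned = (model - model.mean(axis=0)) @ rotation.T + observed.mean(axis=0)
rmsd = np.sqrt(np.mean(np.sum((aligned - observed) ** 2, axis=1)))
print("residual after alignment:", rmsd)
```

A small residual suggests the two point sets could describe the same shape. In a real cluttered scene the matches between points are themselves uncertain, which is the situation the article goes on to describe.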

If both sets of points in fact describe the same object, then they can be aligned by rotating one of them around the right axis. For any given pair of points — one from the model and one from the object — it’s possible to calculate the probability that rotating one point by a particular angle around a particular axis will align it with the other. The problem is that the same rotation might move other pairs of points farther away from each other.
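The conflict between pairs of points can be seen in a small sketch. It uses Rodrigues' rotation formula to turn points about an axis and compares two candidate rotations that both align the first pair. The two model points, the observations and the candidate rotations are hypothetical values chosen for illustration.

```python
import numpy as np

def rotate(point, axis, angle):
    """Rotate a 3-D point about an axis by `angle` radians (Rodrigues' formula)."""
    k = np.asarray(axis, dtype=float)
    k = k / np.linalg.norm(k)
    v = np.asarray(point, dtype=float)
    return (v * np.cos(angle)
            + np.cross(k, v) * np.sin(angle)
            + k * np.dot(k, v) * (1.0 - np.cos(angle)))

model = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
# Hypothetical observations: the model turned 90 degrees about the z axis.
observed = np.array([[0.0, 1.0, 0.0], [-1.0, 0.0, 0.0]])

candidates = {
    "90 deg about z": ([0, 0, 1], np.pi / 2),
    "180 deg about x+y": ([1, 1, 0], np.pi),
}

# Distance left between each model point and its observation after rotating.
errors = {}
for name, (axis, angle) in candidates.items():
    moved = np.array([rotate(p, axis, angle) for p in model])
    errors[name] = np.linalg.norm(moved - observed, axis=1)
    print(name, np.round(errors[name], 3))
```

Both candidates bring the first model point onto its observation, but only the first also aligns the second pair. The half-turn about the diagonal axis leaves the second pair two units apart, so evidence from a single pair cannot settle the rotation on its own.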

Glover was able to show, however, that the rotation probabilities for any given pair of points can be described as a Bingham distribution, which means that they can be combined into a single, cumulative Bingham distribution. That allows Glover and Popovic’s algorithm to explore possible rotations in a principled way, quickly converging on the one that provides the best fit between points.
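Why Bingham distributions combine so easily can be sketched in a few lines. On unit quaternions q, a Bingham density is proportional to exp(qᵀCq) for a symmetric 4x4 matrix C, so multiplying two densities adds their matrices, and the most likely rotation is the eigenvector of C with the largest eigenvalue. The helper names, the simple one-peak construction of each matrix and the concentration values below are assumptions for illustration. They are not taken from the paper or from Glover's software.

```python
import numpy as np

def axis_angle_quaternion(axis, degrees):
    """Unit quaternion (w, x, y, z) for a rotation about `axis`."""
    axis = np.asarray(axis, dtype=float)
    axis = axis / np.linalg.norm(axis)
    half = np.radians(degrees) / 2.0
    return np.concatenate(([np.cos(half)], np.sin(half) * axis))

def bingham_matrix(mode, concentration):
    """Matrix C of an unnormalized density exp(q^T C q) peaked at `mode`."""
    return concentration * np.outer(mode, mode)

def bingham_mode(c):
    """Most likely quaternion: the eigenvector of C with the largest eigenvalue."""
    _, vectors = np.linalg.eigh(c)      # eigenvalues come back in ascending order
    q = vectors[:, -1]
    return q if q[0] >= 0 else -q       # q and -q are the same rotation

# Two hypothetical pieces of evidence about a rotation around the z axis.
c_weak = bingham_matrix(axis_angle_quaternion([0, 0, 1], 20.0), 50.0)
c_strong = bingham_matrix(axis_angle_quaternion([0, 0, 1], 30.0), 150.0)

# Multiplying the two densities adds the exponents, so the matrices add.
c_combined = c_weak + c_strong

mode = bingham_mode(c_combined)
combined_degrees = np.degrees(2.0 * np.arctan2(np.linalg.norm(mode[1:]), mode[0]))
print("combined estimate (degrees about z):", combined_degrees)
```

The combined estimate falls between the two inputs and sits closer to the more concentrated one, which is the behavior one would want when weighing a confident clue against a vague one.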

Big umbrella

Moreover, in the same way that the Bingham distribution can combine the probabilities for each pair of points into a single probability, it can also incorporate probabilities from other sources of information — such as estimates of the curvature of objects’ surfaces. The current version of Glover and Popovic’s algorithm integrates point-rotation probabilities with several other such probabilities.

In experiments involving visual data about particularly cluttered scenes — depicting the kinds of environments in which a household robot would operate — Glover’s algorithm had about the same false-positive rate as the best existing algorithm: About 84 percent of its object identifications were correct, versus 83 percent for the competition. But it was able to identify a significantly higher percentage of the objects in the scenes — 73 percent versus 64 percent. Glover argues that that difference is because of his algorithm’s better ability to determine object orientations.
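As a rough check on these figures, the short calculation below works out the absolute and relative gain in the share of objects found. It assumes the "15 percent better" figure quoted earlier refers to this relative gain, which the article does not state explicitly. The variable names are illustrative.

```python
# Share of objects in the scenes that each algorithm identified (from the article).
new_found, best_found = 0.73, 0.64

absolute_gain = new_found - best_found
relative_gain = absolute_gain / best_found

print(f"absolute gain: {absolute_gain * 100:.0f} percentage points")
print(f"relative gain: {relative_gain:.1%}")
```

On the rounded percentages given here, the relative gain comes out a little under 15 percent, which is in line with the headline figure if the underlying numbers were rounded.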

He also believes that additional sources of information could improve the algorithm’s performance even further. For instance, the Bingham distribution could also incorporate statistical information about particular objects — that, say, a coffee cup may be upside-down or right-side up, but it will very rarely be found at a diagonal angle.

Indeed, it’s because of the Bingham distribution’s flexibility that Glover considers it such a promising tool for robotics research. “You can spend your whole PhD programming a robot to find tables and chairs and cups and things like that, but there aren’t really a lot of general-purpose tools,” Glover says. “With bigger problems, like estimating relationships between objects and their attributes and dealing with things that are somewhat ambiguous, we’re really not anywhere near where we need to be. And until we can do that, I really think that robots are going to be very limited.”

Gary Bradski, vice president of computer vision and machine learning at Magic Leap and president and CEO of OpenCV, the nonprofit that oversees the most widely used open-source computer-vision software library, believes that the Bingham distribution will eventually become the standard way in which roboticists represent object orientation. “The Bingham distribution lives on a hypersphere,” Bradski says — the higher-dimensional equivalent of a circle or sphere. “We’re trying to represent 3-D objects, and the spherical representation fits naturally with the 3-D space. It’s just kind of a recoding of the features that has more natural properties.”

“It isn’t really as hard as the math looks,” Bradski adds. “It’s a better representation, so I think once it’s understood, this’ll just kind of become one of the things that is built in when you’re doing the 3-D fits. [Glover] found something that was obscure, but once people are familiar with it, it will just be a no-brainer.”
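The hypersphere can be made concrete with unit quaternions, a common way to write 3-D rotations as points on a sphere in four dimensions. The sketch below converts a quaternion to a rotation matrix and checks that a point and its opposite on the sphere give the same rotation, a symmetry the Bingham distribution shares because its density depends only on a quadratic form in q. The function name and the example rotation are assumptions for illustration.

```python
import numpy as np

def quaternion_to_matrix(q):
    """Rotation matrix for a unit quaternion q = (w, x, y, z)."""
    w, x, y, z = q
    return np.array([
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z),     2 * (x * z + w * y)],
        [2 * (x * y + w * z),     1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y),     2 * (y * z + w * x),     1 - 2 * (x * x + y * y)],
    ])

# A 90-degree turn about the z axis, written as a point on the unit hypersphere.
q = np.array([np.cos(np.pi / 4), 0.0, 0.0, np.sin(np.pi / 4)])

rotation = quaternion_to_matrix(q)
opposite = quaternion_to_matrix(-q)   # the antipodal point on the hypersphere

same_rotation = np.allclose(rotation, opposite)
print("q and -q give the same rotation:", same_rotation)
```

Every entry of the matrix is a product of two quaternion components, so flipping the sign of all four components changes nothing.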

Source: http://web.mit.edu/newsoffice/2013/better-robot-vision-1007.html



