If AI’s So Smart, Why Can’t It Grasp Cause and Effect?
Deep-learning models can spot patterns that humans can’t. But software still can’t explain, say, what caused one object to collide with another.
Here’s a troubling fact. A self-driving car hurtling along the highway and weaving through traffic has less understanding of what might cause an accident than a child who’s just learning to walk.
A new experiment shows how difficult it is for even the best artificial intelligence systems to grasp rudimentary physics and cause and effect. It also offers a path for building AI systems that can learn why things happen.
The experiment was designed “to push beyond just pattern recognition,” says Josh Tenenbaum, a professor at MIT’s Center for Brains, Minds & Machines, who led the work. “Big tech companies would love to have systems that can do this kind of thing.”
The most popular cutting-edge AI technique, deep learning, has delivered some stunning advances in recent years, fueling excitement about the potential of AI. It involves feeding copious amounts of training data to a large artificial neural network.
Deep-learning algorithms can often spot patterns in data beautifully, enabling impressive feats of image and voice recognition. But they lack other capabilities that are trivial for humans.
To demonstrate the shortcoming, Tenenbaum and his collaborators built a kind of intelligence test for AI systems. It involves showing an AI program a simple virtual world filled with a few moving objects, together with questions and answers about the scene and what’s going on. The questions and answers are labeled, similar to how an AI system learns to recognize a cat by being shown hundreds of images labeled “cat.”
Systems that use advanced machine learning exhibited a big blind spot. Asked a descriptive question such as “What color is this object?” a cutting-edge AI algorithm will get it right more than 90 percent of the time. But when posed more complex questions about the scene, such as “What caused the ball to collide with the cube?” or “What would have happened if the objects had not collided?” the same system answers correctly only about 10 percent of the time.
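To make the comparison concrete, here is a minimal sketch of how accuracy could be tallied separately for each kind of question. The function name, the category labels, and the records are all hypothetical and invented for illustration; they are not data or code from the experiment.

```python
from collections import defaultdict

def accuracy_by_type(records):
    """Compute accuracy per question type.

    records: iterable of (question_type, predicted_answer, expected_answer).
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for qtype, predicted, expected in records:
        total[qtype] += 1
        if predicted == expected:
            correct[qtype] += 1
    return {qtype: correct[qtype] / total[qtype] for qtype in total}

# Made-up records for illustration only, not results from the experiment.
toy_records = [
    ("descriptive", "red", "red"),
    ("descriptive", "cube", "cube"),
    ("causal", "the sphere", "the cylinder"),
    ("counterfactual", "yes", "no"),
    ("counterfactual", "no", "no"),
]

scores = accuracy_by_type(toy_records)
for qtype, score in scores.items():
    print(f"{qtype}: {score:.0%}")
```

Splitting the score by question type is what exposes the blind spot: a single overall accuracy figure would blend the easy descriptive questions with the hard causal ones.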
David Cox, director of the MIT-IBM Watson AI Lab, which was involved with the work, says understanding causality is fundamentally important for AI. “We as humans have the ability to reason about cause and effect, and we need to have AI systems that can do the same.”
A lack of causal understanding can have real consequences, too. Industrial robots can increasingly sense nearby objects, in order to grasp or move them. But they don’t know that hitting something will cause it to fall over or break unless they’ve been specifically programmed—and it’s impossible to predict every possible scenario.
If a robot could reason causally, however, it might be able to avoid problems it hasn’t been programmed to understand. The same is true for a self-driving car. It could instinctively know that if a truck were to swerve and hit a barrier, its load could spill onto the road.
Causal reasoning would be useful for just about any AI system. Systems trained on medical information rather than 3-D scenes need to understand the cause of disease and the likely result of possible interventions.
Causal reasoning is of growing interest to many prominent figures in AI. “All of this is driving towards AI systems that can not only learn but also reason,” Cox says.
The test devised by Tenenbaum is important, says Kun Zhang, an assistant professor who works on causal inference and machine learning at Carnegie Mellon University, because it provides a good way to measure causal understanding, albeit in a very limited setting. “The development of more-general-purpose AI systems will greatly benefit from methods for causal inference and representation learning,” he says.
Besides exposing weaknesses in existing AI programs, Tenenbaum and his colleagues built a new kind of AI system that can learn about cause and effect and that scores much higher on their intelligence test. Their approach combines several AI techniques. The system uses deep learning to recognize objects in a scene. That output is then fed to software that builds a 3-D model of the scene and of how the objects interact.
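The idea of answering "what if" questions from a model of the scene can be illustrated with a toy sketch. This is not the researchers' system. It assumes a perception step has already reduced the scene to a few objects with positions and velocities, and it swaps the 3-D world for a one-dimensional line of equal-mass objects that bounce elastically. The names `Body`, `simulate`, and `would_collide_without` are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Body:
    name: str
    x: float  # position along a line
    v: float  # velocity

def simulate(bodies, steps=200, dt=0.1, radius=0.5):
    """Step a 1-D world forward and return collisions as (left, right) name pairs."""
    pos = {b.name: b.x for b in bodies}
    vel = {b.name: b.v for b in bodies}
    # Left-to-right order never changes here, because bodies bounce
    # instead of passing through one another.
    names = sorted(pos, key=pos.get)
    collisions = []
    for _ in range(steps):
        for n in names:
            pos[n] += vel[n] * dt
        for left, right in zip(names, names[1:]):
            touching = pos[right] - pos[left] <= 2 * radius
            approaching = vel[left] > vel[right]
            if touching and approaching:
                # Equal-mass elastic collision: the two bodies swap velocities.
                vel[left], vel[right] = vel[right], vel[left]
                collisions.append((left, right))
    return collisions

def would_collide_without(bodies, removed, pair):
    """Counterfactual query: rerun the world with one body removed."""
    remaining = [b for b in bodies if b.name != removed]
    return pair in simulate(remaining)

# Toy scene: A rolls into B, which then rolls into C.
scene = [Body("A", 0.0, 1.0), Body("B", 5.0, 0.0), Body("C", 10.0, 0.0)]

print(simulate(scene))                                 # what happened
print(would_collide_without(scene, "A", ("B", "C")))   # what if A were absent?
```

Because the answer comes from rerunning a model of the scene rather than from matching patterns in pixels, the same code handles a counterfactual it was never shown an example of.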
This article originally appeared on wired.com.