Robo Brain - a large-scale computational system that learns from publicly available Internet resources - is currently downloading and processing about 1 billion images, 120,000 YouTube videos, and 100 million how-to documents and appliance manuals.
The information is being translated and stored in a robot-friendly format that robots will be able to draw on when they need it.
To serve as helpers in our homes, offices and factories, robots will need to understand how the world works and how the humans around them behave.
"Our laptops and cell phones have access to all the information we want. If a robot encounters a situation it hasn't seen before it can query Robo Brain in the cloud," said Ashutosh Saxena, assistant professor of computer science at Cornell University.
More From This Section
Saxena and colleagues at Cornell, Stanford and Brown universities and the University of California, Berkeley, say Robo Brain will process images to pick out the objects in them, and by connecting images and video with text, it will learn to recognise objects and how they are used, along with human language and behaviour.
The system employs what computer scientists call "structured deep learning," where information is stored in many levels of abstraction.
A robot's computer brain stores what it has learned in a form mathematicians call a Markov model, which can be represented graphically as a set of points connected by lines (formally called nodes and edges).
In searching for knowledge, a robot's brain makes its own chain and looks for one in the knowledge base that matches within those limits.
"The Robo Brain will look like a gigantic, branching graph with abilities for multi-dimensional queries," said Aditya Jami, a visiting researcher art Cornell, who designed the large-scale database for the brain.