AI Robot Achieves Superhuman Skill in Labyrinth Game, Switzerland

Date:

It has long been recognized that AI can achieve a higher level of performance than humans in various games – but until now, physical skill remained the ultimate human prerogative. This is no longer the case. An AI technique known as deep reinforcement learning has pushed back the limits of what can be achieved with autonomous systems and AI, achieving superhuman performance in a variety of different games such as chess and Go, video games, and navigating virtual mazes.
Today, artificial intelligence is beginning to push back the boundaries and gain ground on man’s prerogative: physical skill.

Researchers at ETH Zurich have created an AI robot named CyberRunner whose task is to learn how to play the popular and widely accessible labyrinth marble game. The labyrinth is a game of physical skill whose goal is to steer a marble from a given start point to the endpoint. In doing so, the player must prevent the ball from falling into any of the holes that are present on the labyrinth board.

CyberRunner applies recent advances in model-based reinforcement learning to the physical world and exploits its ability to make informed decisions about potentially successful behaviors by planning real-world decisions and actions into the future.

Just like us humans, the robot learns through experience. While playing the game, it captures observations and receives rewards based on its performance, all through the eyes of a camera looking down at the labyrinth. A memory is kept of the collected experience. Using this memory, the model-based reinforcement learning algorithm learns how the system behaves, and based on its understanding of the game, it recognizes which strategies and behaviors are more promising (the critic). Consequently, the way the robot uses the two motors — its hands — to play the game is continuously improved (the actor). Importantly, the robot does not stop playing to learn; the algorithm runs concurrently with the robot playing the game. As a result, the robot keeps getting better, run after run.

See also  MIT Researchers Develop AI Tool to Protect Images from Manipulation

The learning on the real-world labyrinth is conducted in 6.06 hours, comprising 1.2 million time steps at a control rate of 55 samples per second. The AI robot outperforms the previously fastest recorded time, achieved by an extremely skilled human player, by over 6%.

Interestingly, during the learning process, CyberRunner naturally discovered shortcuts. It found ways to ‘cheat’ by skipping certain parts of the maze. The lead researchers, Thomas Bi and Prof. Raffaello D’Andrea, had to step in and explicitly instruct it not to take any of those shortcuts.

In addition, Bi and D’Andrea will open source the project and make it available on the website. Prof. Raffaello D’Andrea commented: We believe that this is the ideal testbed for research in real-world machine learning and AI. Prior to CyberRunner, only organizations with large budgets and custom-made experimental infrastructure could perform research in this area. Now, for less than 200 dollars, anyone can engage in cutting-edge AI research. Furthermore, once thousands of CyberRunners are out in the real world, it will be possible to engage in large-scale experiments, where learning happens in parallel, on a global scale. The ultimate in Citizen Science!

Artificial intelligence continues to redefine what was once considered solely within the domain of human capability. With the creation of CyberRunner, a robot capable of excelling at a physical game like the labyrinth marble game, researchers at ETH Zurich have achieved a landmark accomplishment. By utilizing advanced AI techniques such as deep reinforcement learning, the researchers have pushed the boundaries of autonomous systems to surpass human performance in physical skills.

See also  Interest Rates Surge Amidst Accelerating Innovation: What It Means for the Future, US

The labyrinth marble game requires fine motor skills and spatial reasoning abilities, making it a challenging game for humans to master. However, CyberRunner demonstrates how AI can excel in this task. Using model-based reinforcement learning, the robot learns through experience, capturing observations and receiving rewards based on its performance. The algorithm continuously improves the robot’s strategy and behavior, leading to impressive results.

In a remarkable feat, CyberRunner outperforms the fastest recorded time achieved by a skilled human player by over 6%. This achievement showcases the potential of AI to surpass human capabilities in physical endeavors. However, it is essential to note that during the learning process, the robot discovered shortcuts that could be considered cheating. To maintain fairness, the researchers intervened and instructed CyberRunner not to take advantage of these shortcuts.

Moreover, the researchers plan to make their project open source, allowing others to engage in cutting-edge AI research. This move aims to democratize access to real-world machine learning and AI experiments, eliminating the need for large budgets and specialized infrastructure. With the availability of CyberRunner’s design and code, individuals can contribute to the advancement of AI research on a global scale, fostering the spirit of citizen science.

The rise of AI beyond physical boundaries marks a significant milestone in the field of artificial intelligence. While AI has already proven its potential in various games, its ability to excel in physical skills opens up new possibilities for innovation and technological advancement. As researchers continue to push the boundaries of AI, we can expect further breakthroughs that redefine our understanding of what machines are capable of achieving.

Frequently Asked Questions (FAQs) Related to the Above News

Please note that the FAQs provided on this page are based on the news article published. While we strive to provide accurate and up-to-date information, it is always recommended to consult relevant authorities or professionals before making any decisions or taking action based on the FAQs or the news article.

Share post:

Subscribe

Popular

More like this
Related

Obama’s Techno-Optimism Shifts as Democrats Navigate Changing Tech Landscape

Explore the evolution of tech policy from Obama's optimism to Harris's vision at the Democratic National Convention. What's next for Democrats in tech?

Tech Evolution: From Obama’s Optimism to Harris’s Vision

Explore the evolution of tech policy from Obama's optimism to Harris's vision at the Democratic National Convention. What's next for Democrats in tech?

Tonix Pharmaceuticals TNXP Shares Fall 14.61% After Q2 Earnings Report

Tonix Pharmaceuticals TNXP shares decline 14.61% post-Q2 earnings report. Evaluate investment strategy based on company updates and market dynamics.

The Future of Good Jobs: Why College Degrees are Essential through 2031

Discover the future of good jobs through 2031 and why college degrees are essential. Learn more about job projections and AI's influence.