Robot Technology News
ROBO SPACE
Inner workings of AI an enigma - even to its creators
By Thomas URBAIN
New York (AFP) May 13, 2025

Even the leading minds building generative artificial intelligence, a technology poised to change the world, admit they do not comprehend how digital minds think.

"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.

"This lack of understanding is essentially unprecedented in the history of technology."

Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.

In a recent podcast, Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.

Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.

This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.

"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.

It was "somewhat analogous to trying to fully understand the human brain," Nanda told AFP, noting neuroscientists have yet to succeed on that front.

Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.

"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.

The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.

- Keeping AI honest -

Mechanistic interpretability involves not just studying the results served up by gen AI but also scrutinizing the calculations performed while the technology mulls queries, according to Crovella.

"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.

Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.

The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.

"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.

In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.

"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.

According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.

"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models," the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."

Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.

For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much as DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that none of the grandmasters had ever thought about.

A gen AI model that is properly understood and carries a stamp of reliability would gain a competitive advantage in the market.

Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.

"Powerful AI will shape humanity's destiny," Amodei wrote.

"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."
