Looking inside the black box

Looking into the code.
Looking into the code.
DPA via Reuters
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that — they learn. And when they learn they change. They take in enormous troves of data, detect patterns, and spit something out: How a sentence should read, what an image should look like, how a voice should sound.

But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, with certain patterns that pop up when a user inputs something it recognizes. They’ve been able to map features that are close to one another: One for the Golden Gate Bridge, for example, is close to another for Alcatraz Island, the Golden State Warrior, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo — set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.

This development offers hope that the companies behind powerful generative AI models will soon have much more control over their creations, as MIT professor Jacob Andreas told theNew York Times. “In the same way that understanding basic things about how people work has helped us cure diseases,” Andreas said, “understanding how these models work will both let us recognize when things are about to go wrong and let us build better tools for controlling them.”

More from GZERO Media

A 3D-printed miniature model depicting US President Donald Trump, the Chinese flag, and the word "tariffs" in this illustration taken on April 17, 2025.

REUTERS/Dado Ruvic

The US economy contracted 0.3% at an annualized rate in the first quarter of 2025, while China’s manufacturing plants saw their sharpest monthly slowdown in over a year. Behind the scenes, the world’s two largest economies are backing away from their extraordinary trade war.

A photovoltaic power station with a capacity of 0.8 MW covers an area of more than 3,000 square metres at the industrial site of the Chernobyl Nuclear Power Plant, Kyiv region, Ukraine, on April 12, 2025.
Volodymyr Tarasov/Ukrinform/ABACAPRESS.COM

Two months after their infamous White House fight, the US and Ukraine announced on Wednesday that they had finally struck a long-awaited minerals deal.

Indian paramilitary soldiers patrol along a road in Srinagar, Jammu and Kashmir, on April 29, 2025.
Firdous Nazir via Reuters Connect

Nerves are fraught throughout Pakistan after authorities said Wednesday they have “credible intelligence” that India plans to launch military strikes on its soil by Friday.

Palestinian Hamas and Islamic Jihad fighters form a human chain in front of the crowd gathered near the family home of slain Hamas leader Yahya Sinwar, where the Hamas militant group prepares to hand over Israeli and Thai hostages to a Red Cross team in Khan Yunis, on January 30, 2025, as part of their third hostage-prisoner exchange..
Photo by Majdi Fathi/NurPhot

Israel hunted Yahya Sinwar — the Hamas leader and mastermind of the Oct. 7 attack — for over a year. He was hidden deep within Gaza’s shadowy tunnel networks.

A gunman stands as Syrian security forces check vehicles entering Druze town of Jaramana, following deadly clashes sparked by a purported recording of a Druze man cursing the Prophet Mohammad which angered Sunni gunmen, as rescuers and security sources say, in southeast of Damascus, Syria April 29, 2025.
REUTERS/Yamam Al Shaar

Israel said the deadly drone strike was carried out on behalf of Syria's Druze community.

Britain's King Charles holds an audience with the Prime Minister of Canada Mark Carney at Buckingham Palace, on March 17, 2025.

Aaron Chown/Pool via REUTERS

King Charles is rumored to have been invited to Canada to deliver the speech from the throne, likely in late May, although whether he attends may depend on sensitivities in the office of UK Prime Minister Keir Starmer.

Getting access to energy, whether it's renewables, oil and gas, or other sources, is increasingly challenging because of long lead times to get things built in the US and elsewhere, says Greg Ebel, Enbridge's CEO, on the latest "Energized: The Future of Energy" podcast episode. And it's not just problems with access. “There is an energy emergency, if we're not careful, when it comes to price,” says Ebel. “There's definitely an energy emergency when it comes to having a resilient grid, whether it's a pipeline grid, an electric grid. That's something I think people have to take seriously.” Ebel believes that finding "the intersection of rhetoric, policy, and capital" can lead to affordability and profitability for the energy transition. His discussion with host JJ Ramberg and Arjun Murti, founder of the energy transition newsletter Super-Spiked, addresses where North America stands in the global energy transition, the implication of the revised energy policies by President Trump, and the potential consequences of tariffs and trade tension on the energy sector. “Energized: The Future of Energy” is a podcast series produced by GZERO Media's Blue Circle Studios in partnership with Enbridge. Listen to this episode at gzeromedia.com/energized, or on Apple, Spotify,Goodpods, or wherever you get your podcasts.