

Debugging AI Like Software

Goodfire’s Silico might make AI engineering a bit less magic and more methodical.

San Francisco-based Goodfire has released Silico, a tool that lets researchers inspect and adjust the parameters of large language models (LLMs), such as those behind ChatGPT and Gemini, during training. The aim is to give model makers greater control over these complex systems.


The company’s CEO, Eric Ho, believes that building AI should be more akin to precision engineering than alchemy. ‘We want to remove the trial and error and turn training models into precision engineering,’ he says. Silico allows users to zoom in on specific parts of a model, such as individual neurons or groups of neurons, run experiments on them, and adjust their parameters during training.


In one case study, Goodfire adjusted a neuron associated with the trolley problem in an open-source model called Qwen 3. The change caused the model to frame its outputs as explicit moral dilemmas. Goodfire also found that boosting certain ethical-reasoning circuits could influence how the model responds to commercial risk assessments, making the model more transparent.
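To make the idea of boosting a single neuron concrete, here is a minimal sketch in plain Python. It is a toy two-layer network with hand-picked weights, not Qwen 3 and not Silico's interface; the `forward` function, the `gains` argument, and all the numbers are assumptions made up for illustration.

```python
# Toy illustration only: a tiny hand-written network, not Qwen 3 and not Silico's API.

def relu(x):
    return max(0.0, x)

def forward(inputs, w_hidden, w_out, gains=None):
    """Run a two-layer network; `gains` maps a hidden-neuron index to a multiplier."""
    gains = gains or {}
    hidden = []
    for i, weights in enumerate(w_hidden):
        act = relu(sum(w * x for w, x in zip(weights, inputs)))
        hidden.append(act * gains.get(i, 1.0))  # intervene only on the chosen neurons
    return [sum(w * h for w, h in zip(weights, hidden)) for weights in w_out]

W_HIDDEN = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # 3 hidden neurons, 2 inputs
W_OUT = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]       # 2 outputs

baseline = forward([1.0, 2.0], W_HIDDEN, W_OUT)
boosted = forward([1.0, 2.0], W_HIDDEN, W_OUT, gains={1: 3.0})  # triple neuron 1

print(baseline, boosted)
```

Only the second output moves here, because only that output reads from the boosted neuron. In a real LLM a single neuron typically feeds many downstream computations, so the effects are far harder to predict.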


Silico also supports training-data filtering: developers can remove data points that might otherwise lead to unwanted behaviors. For instance, models can be retrained to avoid using neurons associated with religious texts or code repositories when performing numerical tasks.
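The filtering step can be pictured roughly as follows. This is a sketch under stated assumptions, not Goodfire's method: `toy_score` is a hypothetical stand-in for whatever signal an interpretability tool might provide, and the flagged words, corpus, and threshold are invented for the example.

```python
def filter_examples(examples, feature_score, threshold):
    """Keep examples whose unwanted-feature score is at or below the threshold."""
    return [ex for ex in examples if feature_score(ex) <= threshold]

def toy_score(text):
    # Hypothetical score: the fraction of words that appear in a flagged set.
    flagged = {"verse", "psalm"}
    words = text.lower().split()
    return sum(w in flagged for w in words) / len(words) if words else 0.0

corpus = ["2 plus 2 equals 4", "psalm verse 23", "add 3 and 5"]
kept = filter_examples(corpus, toy_score, threshold=0.2)

print(kept)
```

A model retrained on `kept` would see fewer examples that trigger the unwanted feature. Whether that changes its behavior in practice is an empirical question the sketch does not answer.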


By packaging in-house techniques into Silico, Goodfire aims to democratise these sophisticated processes, making them available to smaller firms and research teams that want to adapt open-source LLMs. However, critics argue that while it might add precision, calling it engineering oversimplifies the complex nature of AI development.

Original source:  https://www.technologyreview.com/2026/04/30/1136721/this-startups-new-mechanistic-interpretability-tool-lets-you-debug-llms/
