Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Android phones looking to mimic Apple’s new iPhone release strategy

    Tough-as-nails Galaxy Watch Ultra (2025) drops in price even further on Amazon

    This is the magnetic secondary screen I hope iPhones and Galaxies won’t pick up

    Facebook Twitter Instagram
    • Tech
    • Gadgets
    • Spotlight
    • Gaming
    Facebook Twitter Instagram
    circuitthoughtscircuitthoughts
    Subscribe
    • Home
    • Gadgets
    • Insights
    • Apps

      Google Uses AI Searches To Detect If Someone Is In Crisis

      Gboard Magic Wand Button Will Covert Your Text To Emojis

      Android 10 & Older Devices Now Getting Automatic App Permissions Reset

      Spotify Blend Update Increases Group Sizes, Adds Celebrity Blends

      Samsung May Improve Battery Significantly With Galaxy Watch 5

    • Gear
    • Mobiles
      1. Tech
      2. Gadgets
      3. Insights
      4. View All

      Android phones looking to mimic Apple’s new iPhone release strategy

      Tough-as-nails Galaxy Watch Ultra (2025) drops in price even further on Amazon

      This is the magnetic secondary screen I hope iPhones and Galaxies won’t pick up

      Galaxy Z Fold 8 release date: should you wait for Apple’s foldable iPhone or the Pixel 11 Pro Fold?

      March Update May Have Weakened The Haptics For Pixel 6 Users

      Project 'Diamond' Is The Galaxy S23, Not A Rollable Smartphone

      The At A Glance Widget Is More Useful After March Update

      Pre-Order The OnePlus 10 Pro For Just $1 In The US

      Motorola Edge+ Review: It Checks A Lot Of Boxes

      This Smartphone Concept Design Is Different… In A Good Way

      Twitter Just Made Searching Your Direct Messages Better

      That Netflix Price Hike Is Starting To Take Place

      Latest Huawei Mobiles P50 and P50 Pro Feature Kirin Chips

      Samsung Galaxy M62 Benchmarked with Galaxy Note10’s Chipset

      9.1

      Review: T-Mobile Winning 5G Race Around the World

      8.9

      Samsung Galaxy S21 Ultra Review: the New King of Android Phones

    • Computing
    circuitthoughtscircuitthoughts
    Home»Tech»This startup’s new mechanistic interpretability tool lets you debug LLMs
    Tech

    This startup’s new mechanistic interpretability tool lets you debug LLMs

    adminBy No Comments3 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Mapping models

    Silico lets you zoom in on specific parts of a trained model, such as individual neurons or groups of neurons, and run experiments to see what those neurons do. (Assuming you have access to the model’s inner workings. Most people won’t be able to use Silico to poke around inside ChatGPT or Gemini, but you can use it to look at the parameters inside many open-source models.) You can then check what inputs make different neurons fire, and trace pathways upstream and downstream of a neuron to see how other neurons affect it and how it affects other neurons in turn.

    For example, Goodfire found one neuron inside the open-source model Qwen 3 that was associated with the so-called trolley problem. Activating this neuron changed the model’s responses, making it frame its outputs as explicit moral dilemmas. “When this neuron’s active, all sorts of weird things happen,” says Ho.

    Pinpointing the source of odd behavior like this is now pretty standard practice. But Goodfire wants to make it easier to adjust that behavior. Using Silico, developers can now adjust the parameters connected to individual neurons to boost or suppress certain behaviors.

    In another example, Goodfire researchers asked a model whether a company should disclose that its AI behaves deceptively in 0.3% of cases, affecting 200 million users. The model said no, citing the negative business impact of such a disclosure.

    By looking inside the model, the researchers found that boosting neurons that were found to be associated with transparency and disclosure flipped the answer from no to yes nine out of 10 times. “The model already had the ethical reasoning circuitry, but it was being outweighed by the commercial risk assessment,” says Ho.

    Tweaking the values of a model in this way is just one approach. Silico can also help steer the training process by filtering out certain training data to avoid setting unwanted values for certain parameters in the first place.   

    For example, many models will tell you that 9.11 is greater than 9.9. Looking inside a model to see what’s going on might reveal that it is being influenced by neurons associated with the Bible, in which verse 9.9 comes before 9.11, or by code repositories where consecutive updates are numbered 9.9, 9.10, 9.11 and so on. Using this information, the model can be retrained to make it avoid its “Bible” neurons when doing math.

    By releasing Silico, Goodfire wants to put techniques previously available to a few top labs into the hands of smaller firms and research teams that want to build their own model or adapt an open-source one. The tool will be available for a fee determined on a case-by-case basis according to customers’ requirements (Goodfire declined to give specific pricing details).

    #startups #mechanistic #interpretability #tool #lets #debug #LLMs

    debug interpretability lets LLMs mechanistic startups tool
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    Android phones looking to mimic Apple’s new iPhone release strategy

    Tough-as-nails Galaxy Watch Ultra (2025) drops in price even further on Amazon

    This is the magnetic secondary screen I hope iPhones and Galaxies won’t pick up

    Add A Comment

    Leave A Reply Cancel Reply

    Editors Picks
    8.5

    Apple Planning Big Mac Redesign and Half-Sized Old Mac

    Autonomous Driving Startup Attracts Chinese Investor

    Onboard Cameras Allow Disabled Quadcopters to Fly

    Top Reviews
    9.1

    Review: T-Mobile Winning 5G Race Around the World

    By
    8.9

    Samsung Galaxy S21 Ultra Review: the New King of Android Phones

    By
    8.9

    Xiaomi Mi 10: New Variant with Snapdragon 870 Review

    By
    circuitthoughts
    Facebook Twitter Instagram Pinterest Vimeo YouTube
    • Home
    • Tech
    • Gadgets
    • Mobiles
    • Our Authors
    © 2026 ThemeSphere. Designed by WPfastworld.

    Type above and press Enter to search. Press Esc to cancel.