<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>20e650c2-d9c</externalid>
      <Title>Research Scientist, Interpretability</Title>
      <Description><![CDATA[<p><strong>About the role:</strong></p>
<p>When you see what modern language models are capable of, do you wonder, &#39;How do these things work? How can we trust them?&#39;</p>
<p>The Interpretability team at Anthropic is working to reverse-engineer how trained models work because we believe that a mechanistic understanding is the most robust way to make advanced systems safe. We’re looking for researchers and engineers to join our efforts.</p>
<p>People mean many different things by &#39;interpretability&#39;. We&#39;re focused on mechanistic interpretability, which aims to discover how neural network parameters map to meaningful algorithms. Some useful analogies might be to think of us as trying to do &#39;biology&#39; or &#39;neuroscience&#39; of neural networks using “microscopes” we build, or as treating neural networks as binary computer programs we&#39;re trying to &#39;reverse engineer&#39;.</p>
<p><strong>Responsibilities:</strong></p>
<ul>
<li>Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights</li>
</ul>
<ul>
<li>Design and run robust experiments, both quickly in toy scenarios and at scale in large models</li>
</ul>
<ul>
<li>Create and analyse new interpretability features and circuits to better understand how models work.</li>
</ul>
<ul>
<li>Build infrastructure for running experiments and visualising results</li>
</ul>
<ul>
<li>Work with colleagues to communicate results internally and publicly</li>
</ul>
<p><strong>You may be a good fit if you:</strong></p>
<ul>
<li>Have a strong track record of scientific research (in any field), and have done _some_ work on Interpretability</li>
</ul>
<ul>
<li>Enjoy team science – working collaboratively to make big discoveries</li>
</ul>
<ul>
<li>Are comfortable with messy experimental science. We&#39;re inventing the field as we work, and the first textbook is years away</li>
</ul>
<ul>
<li>You view research and engineering as two sides of the same coin. Every team member writes code, designs and runs experiments, and interprets results</li>
</ul>
<ul>
<li>You can clearly articulate and discuss the motivations behind your work, and teach us about what you&#39;ve learned. You like writing up and communicating your results, even when they&#39;re null</li>
</ul>
<p><strong>Role Specific Location Policy:</strong></p>
<ul>
<li>This role is based in San Francisco office; however, we are open to considering exceptional candidates for remote work on a case-by-case basis.</li>
</ul>
<p><strong>Logistics</strong></p>
<p><strong>Education requirements:</strong> We require at least a Bachelor&#39;s degree in a related field or equivalent experience. <strong>Location-based hybrid policy:</strong> Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</p>
<p><strong>Visa sponsorship:</strong> We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</p>
<p><strong>We encourage you to apply even if you do not believe you meet every single qualification.</strong> Not all strong candidates will meet every single qualification as listed.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$350,000 - $850,000USD</Salaryrange>
      <Skills>Python, Mechanistic Interpretability, Neural Networks, Reverse Engineering, Experimental Science, Research, Engineering, Team Science, Communication</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a company that aims to create reliable, interpretable, and steerable AI systems. It has a quickly growing team of researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</Employerdescription>
      <Employerwebsite>https://www.anthropic.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/4980427008</Applyto>
      <Location>San Francisco</Location>
      <Country></Country>
      <Postedate>2026-03-08</Postedate>
    </job>
  </jobs>
</source>