<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>3b359ef2-6f8</externalid>
      <Title>Machine Learning Systems Engineer, Research Tools</Title>
      <Description><![CDATA[<p>We are seeking an experienced Machine Learning Systems Engineer to join our Encodings and Tokenization team at Anthropic. This cross-functional role will be instrumental in developing and optimizing the encodings and tokenization systems used throughout our Finetuning workflows. As a bridge between our Pretraining and Finetuning teams, you&#39;ll build critical infrastructure that directly impacts how our models learn from and interpret data.</p>
<p>Responsibilities:</p>
<ul>
<li>Design, develop, and maintain tokenization systems used across Pretraining and Finetuning workflows</li>
<li>Optimize encoding techniques to improve model training efficiency and performance</li>
<li>Collaborate closely with research teams to understand their evolving needs around data representation</li>
<li>Build infrastructure that enables researchers to experiment with novel tokenization approaches</li>
<li>Implement systems for monitoring and debugging tokenization-related issues in the model training pipeline</li>
<li>Create robust testing frameworks to validate tokenization systems across diverse languages and data types</li>
<li>Identify and address bottlenecks in data processing pipelines related to tokenization</li>
<li>Document systems thoroughly and communicate technical decisions clearly to stakeholders across teams</li>
</ul>
<p>You May Be a Good Fit If You:</p>
<ul>
<li>Have significant software engineering experience with demonstrated machine learning expertise</li>
<li>Are comfortable navigating ambiguity and developing solutions in rapidly evolving research environments</li>
<li>Can work independently while maintaining strong collaboration with cross-functional teams</li>
<li>Are results-oriented, with a bias towards flexibility and impact</li>
<li>Have experience with machine learning systems, data pipelines, or ML infrastructure</li>
<li>Are proficient in Python and familiar with modern ML development practices</li>
<li>Have strong analytical skills and can evaluate the impact of engineering changes on research outcomes</li>
<li>Pick up slack, even if it goes outside your job description</li>
<li>Enjoy pair programming (we love to pair!)</li>
<li>Care about the societal impacts of your work and are committed to developing AI responsibly</li>
</ul>
<p>Strong Candidates May Also Have Experience With:</p>
<ul>
<li>Working with machine learning data processing pipelines</li>
<li>Building or optimizing data encodings for ML applications</li>
<li>Implementing or working with BPE, WordPiece, or other tokenization algorithms</li>
<li>Performance optimization of ML data processing systems</li>
<li>Multi-language tokenization challenges and solutions</li>
<li>Research environments where engineering directly enables scientific progress</li>
<li>Distributed systems and parallel computing for ML workflows</li>
<li>Large language models or other transformer-based architectures (not required)</li>
</ul>
<p>The annual compensation range for this role is $320,000-$405,000 USD.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$320,000-$405,000 USD</Salaryrange>
      <Skills>Machine Learning, Software Engineering, Python, Data Pipelines, ML Infrastructure, BPE, WordPiece, Tokenization Algorithms, Performance Optimization, Distributed Systems</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a public benefit corporation that focuses on creating reliable, interpretable, and steerable AI systems.</Employerdescription>
      <Employerwebsite>https://www.anthropic.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/4952079008</Applyto>
      <Location>San Francisco, CA | New York City, NY | Seattle, WA</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>b376a3b3-b21</externalid>
      <Title>Machine Learning Systems Engineer, Research Tools</Title>
      <Description><![CDATA[<p><strong>About the Role:</strong></p>
<p>We are seeking an experienced Machine Learning Systems Engineer to join our Encodings and Tokenization team at Anthropic. This cross-functional role will be instrumental in developing and optimising the encodings and tokenization systems used throughout our Finetuning workflows. As a bridge between our Pretraining and Finetuning teams, you&#39;ll build critical infrastructure that directly impacts how our models learn from and interpret data. Your work will be foundational to Anthropic&#39;s research progress, enabling more efficient and effective training of our AI systems while ensuring they remain reliable, interpretable, and steerable.</p>
<p><strong>Responsibilities:</strong></p>
<ul>
<li>Design, develop, and maintain tokenization systems used across Pretraining and Finetuning workflows</li>
<li>Optimise encoding techniques to improve model training efficiency and performance</li>
<li>Collaborate closely with research teams to understand their evolving needs around data representation</li>
<li>Build infrastructure that enables researchers to experiment with novel tokenization approaches</li>
<li>Implement systems for monitoring and debugging tokenization-related issues in the model training pipeline</li>
<li>Create robust testing frameworks to validate tokenization systems across diverse languages and data types</li>
<li>Identify and address bottlenecks in data processing pipelines related to tokenization</li>
<li>Document systems thoroughly and communicate technical decisions clearly to stakeholders across teams</li>
</ul>
<p><strong>You May Be a Good Fit If You:</strong></p>
<ul>
<li>Have significant software engineering experience with demonstrated machine learning expertise</li>
<li>Are comfortable navigating ambiguity and developing solutions in rapidly evolving research environments</li>
<li>Can work independently while maintaining strong collaboration with cross-functional teams</li>
<li>Are results-oriented, with a bias towards flexibility and impact</li>
<li>Have experience with machine learning systems, data pipelines, or ML infrastructure</li>
<li>Are proficient in Python and familiar with modern ML development practices</li>
<li>Have strong analytical skills and can evaluate the impact of engineering changes on research outcomes</li>
<li>Pick up slack, even if it goes outside your job description</li>
<li>Enjoy pair programming (we love to pair!)</li>
<li>Care about the societal impacts of your work and are committed to developing AI responsibly</li>
</ul>
<p><strong>Strong Candidates May Also Have Experience With:</strong></p>
<ul>
<li>Working with machine learning data processing pipelines</li>
<li>Building or optimising data encodings for ML applications</li>
<li>Implementing or working with BPE, WordPiece, or other tokenization algorithms</li>
<li>Performance optimisation of ML data processing systems</li>
<li>Multi-language tokenisation challenges and solutions</li>
<li>Research environments where engineering directly enables scientific progress</li>
<li>Distributed systems and parallel computing for ML workflows</li>
<li>Large language models or other transformer-based architectures (not required)</li>
</ul>
<p><strong>Logistics</strong></p>
<ul>
<li>Education requirements: We require at least a Bachelor&#39;s degree in a related field or equivalent experience.</li>
<li>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</li>
<li>Visa sponsorship: We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</li>
</ul>
<p><strong>How we&#39;re different</strong></p>
<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$320,000 - $405,000 USD</Salaryrange>
      <Skills>Machine Learning, Software Engineering, Python, Data Pipelines, ML Infrastructure, Tokenization, Encoding, BPE, WordPiece, Distributed Systems, Parallel Computing, Large Language Models, Transformer-Based Architectures</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a quickly growing organisation with a mission to create reliable, interpretable, and steerable AI systems. The company&apos;s team includes researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</Employerdescription>
      <Employerwebsite>https://job-boards.greenhouse.io</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/4952079008</Applyto>
      <Location>San Francisco, CA | New York City, NY | Seattle, WA</Location>
      <Country></Country>
      <Postedate>2026-03-08</Postedate>
    </job>
  </jobs>
</source>