<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>fb1f459e-b3a</externalid>
      <Title>Machine Learning Research Scientist / Engineer, Reasoning</Title>
      <Description><![CDATA[<p>About Scale</p>
<p>At Scale, our mission is to accelerate the development of AI applications. We&#39;re looking for a Machine Learning Research Scientist/Engineer to join our team and help us shape the future of AI.</p>
<p>This role operates at the forefront of AI research and real-world implementation, with a strong focus on reasoning within large language models (LLMs). You will study the data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents. You will play a key role in shaping Scale&#39;s data strategy by identifying the most effective data sources and methodologies for improving LLM reasoning.</p>
<p>Success in this role requires a deep understanding of LLMs, planning algorithms, and novel approaches to agentic reasoning, as well as creativity in tackling challenges related to data generation, model interaction, and evaluation. You will contribute to impactful research on language model reasoning, collaborate with external researchers, and work closely with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions.</p>
<p>Responsibilities</p>
<ul>
<li>Study the data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents</li>
<li>Shape Scale&#39;s data strategy by identifying the most effective data sources and methodologies for improving LLM reasoning</li>
<li>Contribute to impactful research on language model reasoning</li>
<li>Collaborate with external researchers</li>
<li>Work closely with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions</li>
</ul>
<p>Requirements</p>
<ul>
<li>Practical experience working with LLMs, with proficiency in frameworks like PyTorch, JAX, or TensorFlow</li>
<li>A track record of published research in top ML and NLP venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, CoLLM, etc.)</li>
<li>At least three years of experience solving complex ML challenges, either in a research setting or product development, particularly in areas related to LLM capabilities and reasoning</li>
<li>Strong written and verbal communication skills, along with the ability to work effectively across teams</li>
</ul>
<p>Nice to Have</p>
<ul>
<li>Hands-on experience fine-tuning open-source LLMs or leading bespoke LLM fine-tuning projects using PyTorch/JAX</li>
<li>Research and practical experience in building applications and evaluations related to LLM-based agents, including tool-use, text-to-SQL, browser agents, coding agents, and GUI agents</li>
<li>Experience with agent frameworks such as OpenHands, Swarm, LangGraph, or similar</li>
<li>Familiarity with advanced agentic reasoning techniques such as STaR and PLANSEARCH</li>
<li>Proficiency in cloud-based ML development, with experience in AWS or GCP environments</li>
</ul>
<p>Benefits</p>
<ul>
<li>Comprehensive health, dental and vision coverage</li>
<li>Retirement benefits</li>
<li>A learning and development stipend</li>
<li>Generous PTO</li>
<li>Commuter stipend</li>
</ul>
<p>Salary Range</p>
<p>$252,000-$315,000 USD</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>$252,000-$315,000 USD</Salaryrange>
      <Skills>PyTorch, JAX, TensorFlow, Large Language Models (LLMs), Planning Algorithms, Agentic Reasoning, Data Generation, Model Interaction, Evaluation, Agent Frameworks, Cloud-Based ML Development, AWS, GCP, STaR, PLANSEARCH</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Scale AI</Employername>
      <Employerlogo>https://logos.yubhub.co/scale.com.png</Employerlogo>
      <Employerdescription>Scale AI is a leading AI data foundry that provides high-quality data to drive progress toward Artificial General Intelligence (AGI). It was founded 8 years ago and has since become a major player in the AI industry.</Employerdescription>
      <Employerwebsite>https://scale.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/scaleai/jobs/4605596005</Applyto>
      <Location>San Francisco, CA; Seattle, WA; New York, NY</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>7e3331e3-3f3</externalid>
      <Title>Software Engineer, Research - Human Data</Title>
      <Description><![CDATA[<p><strong>Software Engineer, Research - Human Data</strong></p>
<p><strong>About the Team</strong></p>
<p>OpenAI’s mission is to ensure that artificial general intelligence (AGI) benefits all of humanity. A key part of achieving that mission is training models that deeply understand and reflect human preferences — the <strong>Human Data</strong> team is at the heart of that effort.</p>
<p>The Human Data engineering team creates the systems that enable scalable, high-quality human feedback. These systems are essential to how OpenAI trains and improves its most advanced models. Engineers on this team collaborate closely with world-class researchers to bring alignment techniques to life — from experimental ideas to production-ready feedback loops.</p>
<p><strong>About the Role</strong></p>
<p>We’re looking for software engineers to join the Human Data team and build the platforms, prototypes, tools, and infrastructure that power how our AI models are trained, aligned, and evaluated. You’ll partner with researchers and cross-functional teams to bring alignment ideas to life, influence future model training, and shape how models interact with the real world.</p>
<p>We’re looking for people who are excited by technical ownership, enjoy working across the stack, and are eager to solve ambiguous problems in a high-impact, fast-paced environment.</p>
<p>This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.</p>
<p><strong>In this role, you will:</strong></p>
<ul>
<li>Build and maintain robust full-stack systems for feedback collection, data labeling, and evaluation pipelines, while maintaining high levels of security.</li>
</ul>
<ul>
<li>Translate experimental alignment research into scalable production infrastructure, including inference and model training stacks.</li>
</ul>
<ul>
<li>Design and iterate on user-facing tools and backend services to support high-quality data workflows</li>
</ul>
<ul>
<li>Partner with researchers, engineers, and program leads to shape feedback loops and model interaction paradigms</li>
</ul>
<ul>
<li>Drive infrastructure improvements that enable faster iteration and scaling across OpenAI’s frontier models, from internal research tooling all the way to production ChatGPT.</li>
</ul>
<p><strong>You might thrive in this role if you:</strong></p>
<ul>
<li>Have strong software engineering fundamentals and experience building production systems at scale</li>
</ul>
<ul>
<li>Enjoy full-stack development with end-to-end ownership — from backend pipelines to user interfaces</li>
</ul>
<ul>
<li>Are motivated by high-impact collaboration with research teams and solving novel, ambiguous problems</li>
</ul>
<ul>
<li>Are excited to shape how AI systems learn from human preferences and reflect a broad range of human values</li>
</ul>
<ul>
<li>Care deeply about inclusive tooling and building systems that enhance model safety, reliability, and usefulness</li>
</ul>
<p><strong>About OpenAI</strong></p>
<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>mid</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>US$230K – $385K • Offers Equity
London£131K – £245K • Offers Equity</Salaryrange>
      <Skills>software engineering, full-stack development, data labeling, evaluation pipelines, security, inference and model training stacks, user-facing tools, backend services, data workflows, research collaboration, model interaction paradigms, infrastructure improvements, AI systems, human preferences, inclusive tooling, model safety, reliability, usefulness, strong software engineering fundamentals, experience building production systems at scale, full-stack development with end-to-end ownership, high-impact collaboration with research teams, solving novel, ambiguous problems, shaping how AI systems learn from human preferences</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>OpenAI</Employername>
      <Employerlogo>https://logos.yubhub.co/openai.com.png</Employerlogo>
      <Employerdescription>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products.</Employerdescription>
      <Employerwebsite>https://jobs.ashbyhq.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.ashbyhq.com/openai/4d6a5951-9838-434c-830a-22cb938ea228</Applyto>
      <Location>San Francisco; London, UK</Location>
      <Country></Country>
      <Postedate>2026-03-06</Postedate>
    </job>
  </jobs>
</source>