<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>fb1f459e-b3a</externalid>
      <Title>Machine Learning Research Scientist / Engineer, Reasoning</Title>
      <Description><![CDATA[<p>About Scale</p>
<p>At Scale, our mission is to accelerate the development of AI applications. We&#39;re looking for a Machine Learning Research Scientist/Engineer to join our team and help us shape the future of AI.</p>
<p>This role operates at the forefront of AI research and real-world implementation, with a strong focus on reasoning within large language models (LLMs). You will study the data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents. You will play a key role in shaping Scale&#39;s data strategy by identifying the most effective data sources and methodologies for improving LLM reasoning.</p>
<p>Success in this role requires a deep understanding of LLMs, planning algorithms, and novel approaches to agentic reasoning, as well as creativity in tackling challenges related to data generation, model interaction, and evaluation. You will contribute to impactful research on language model reasoning, collaborate with external researchers, and work closely with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions.</p>
<p>Responsibilities</p>
<ul>
<li>Study the data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents</li>
<li>Shape Scale&#39;s data strategy by identifying the most effective data sources and methodologies for improving LLM reasoning</li>
<li>Contribute to impactful research on language model reasoning</li>
<li>Collaborate with external researchers</li>
<li>Work closely with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions</li>
</ul>
<p>Requirements</p>
<ul>
<li>Practical experience working with LLMs, with proficiency in frameworks like PyTorch, JAX, or TensorFlow</li>
<li>A track record of published research in top ML and NLP venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, CoLLM, etc.)</li>
<li>At least three years of experience solving complex ML challenges, either in a research setting or product development, particularly in areas related to LLM capabilities and reasoning</li>
<li>Strong written and verbal communication skills, along with the ability to work effectively across teams</li>
</ul>
<p>Nice to Have</p>
<ul>
<li>Hands-on experience fine-tuning open-source LLMs or leading bespoke LLM fine-tuning projects using PyTorch/JAX</li>
<li>Research and practical experience in building applications and evaluations related to LLM-based agents, including tool-use, text-to-SQL, browser agents, coding agents, and GUI agents</li>
<li>Experience with agent frameworks such as OpenHands, Swarm, LangGraph, or similar</li>
<li>Familiarity with advanced agentic reasoning techniques such as STaR and PLANSEARCH</li>
<li>Proficiency in cloud-based ML development, with experience in AWS or GCP environments</li>
</ul>
<p>Benefits</p>
<ul>
<li>Comprehensive health, dental and vision coverage</li>
<li>Retirement benefits</li>
<li>A learning and development stipend</li>
<li>Generous PTO</li>
<li>Commuter stipend</li>
</ul>
<p>Salary Range</p>
<p>$252,000-$315,000 USD</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>$252,000-$315,000 USD</Salaryrange>
      <Skills>PyTorch, JAX, TensorFlow, Large Language Models (LLMs), Planning Algorithms, Agentic Reasoning, Data Generation, Model Interaction, Evaluation, Agent Frameworks, Cloud-Based ML Development, AWS, GCP, STaR, PLANSEARCH</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Scale AI</Employername>
      <Employerlogo>https://logos.yubhub.co/scale.com.png</Employerlogo>
      <Employerdescription>Scale AI is a leading AI data foundry that provides high-quality data to drive progress toward Artificial General Intelligence (AGI). It was founded 8 years ago and has since become a major player in the AI industry.</Employerdescription>
      <Employerwebsite>https://scale.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/scaleai/jobs/4605596005</Applyto>
      <Location>San Francisco, CA; Seattle, WA; New York, NY</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>307c2f1c-d78</externalid>
      <Title>Senior SDET - Tooling Engineer</Title>
      <Description><![CDATA[<p>We are looking for a highly skilled Senior Software Quality Engineer (SDET) to lead our end-to-end quality engineering initiatives across mobile, web, backend, and data platforms. This role combines deep technical expertise with a forward-thinking, AI-first mindset, driving innovation, scalability, and reliability through advanced automation and intelligent testing strategies.</p>
<p>As a senior member of the team, you will champion modern, AI-enhanced quality practices and help build a culture where continuous improvement, automation-first thinking, and data-driven decisions are embedded at every stage of product development. This is a hybrid position in Mountain View (Headquarters) and will require in-office work 2 days a week.</p>
<p>The base salary range for this full-time position is $210,000 to $257,000, plus equity and benefits. Our salary ranges are determined by role, level, and location. EarnIn provides excellent benefits for our employees, including healthcare, internet/cell phone reimbursement, and a learning and development stipend.</p>
<p><strong>Quality Engineering &amp; Test</strong></p>
<p>Own end-to-end quality across iOS and Android applications and their supporting backend services, ensuring high confidence in weekly (or faster) releases. Design and implement comprehensive test strategies covering:</p>
<ul>
<li>Native mobile applications (iOS &amp; Android)</li>
<li>Mobile-to-backend integrations (REST APIs, auth flows, event-driven systems)</li>
<li>Microservices and distributed systems</li>
<li>Critical web workflows that intersect with mobile journeys</li>
<li>Device, OS, browser, and network variability</li>
<li>App lifecycle events, offline behavior, retries, and edge cases</li>
</ul>
<p>Ensure critical user journeys are validated across mobile UI → API → backend → web touchpoints, preventing production escapes in high-impact flows. Partner with engineering teams to embed quality gates into the mobile release lifecycle, including pre-merge validation, release candidate verification, and post-deploy smoke testing.</p>
<p>Drive improvements in testability by introducing better logging, API contracts, observability hooks, feature flags, and deterministic state management. Establish meaningful quality metrics (crash analytics, defect trends, flaky tests, API reliability, release risk scoring) and surface actionable insights to engineering stakeholders.</p>
<p>Champion shift-left quality by influencing design reviews, API schema discussions, and acceptance criteria early in development.</p>
<p><strong>AI-Driven Quality and Automation</strong></p>
<p>Leverage AI to enhance mobile, backend, and web testing effectiveness, including:</p>
<ul>
<li>AI-assisted test case and test data generation</li>
<li>Intelligent regression suite prioritization based on code changes</li>
<li>Predictive defect detection and risk-based testing</li>
<li>Flaky test detection and automated stabilization insights</li>
</ul>
<p>Integrate AI-powered log intelligence, crash clustering, and anomaly detection into quality workflows. Continuously evaluate and experiment with AI-driven QA tools to increase coverage, reduce maintenance overhead, and accelerate release cycles.</p>
<p>Contribute to building an AI-augmented quality ecosystem that improves speed without compromising reliability.</p>
<p><strong>Automation Excellence</strong></p>
<p>Design, build, and scale robust automation frameworks using:</p>
<ul>
<li>XCUITest, Espresso, Appium (mobile automation)</li>
<li>Playwright (web and mobile web validation)</li>
<li>REST Assured or similar tools for API and service validation</li>
</ul>
<p>Ensure frameworks are modular, maintainable, and optimized for scale across multiple teams. Integrate automated validation into CI/CD pipelines (Jenkins, GitHub Actions, etc.) to enable:</p>
<ul>
<li>Pre-merge quality gates</li>
<li>Parallelized execution</li>
<li>Environment-aware test runs</li>
<li>Post-deployment smoke and regression coverage</li>
</ul>
<p>Build developer-friendly tooling that enables:</p>
<ul>
<li>Self-service test execution</li>
<li>Real-time reporting and dashboards</li>
<li>Faster debugging and failure triage</li>
<li>Scalable test data and environment management</li>
</ul>
<p>Continuously reduce flakiness, improve signal quality, and optimize execution time across mobile and backend suites.</p>
<p><strong>Performance, Scalability &amp; Reliability</strong></p>
<p>Design and execute performance validation across:</p>
<ul>
<li>Mobile app startup time and responsiveness</li>
<li>API latency, throughput, and reliability</li>
<li>Backend load and stress conditions</li>
<li>Web performance for critical flows</li>
</ul>
<p>Partner with engineering teams to analyze production logs, crash reports, browser telemetry, and service metrics. Lead root-cause analysis of complex cross-layer defects spanning mobile UI, APIs, backend services, and web surfaces.</p>
<p>Ensure reliability validation is embedded directly into release workflows.</p>
<p><strong>Cross-Functional Collaboration and Leadership</strong></p>
<p>Collaborate closely with mobile engineers, backend developers, web engineers, product managers, DevOps teams, and release managers to define clear, testable requirements and release criteria. Actively participate in sprint grooming, planning, stand-ups, and retrospectives.</p>
<p>Influence best practices around mobile-first design, API contracts, and release readiness. Support mobile app release activities, including release candidate validation, go/no-go recommendations, and post-release monitoring.</p>
<p>Mentor junior QA engineers and contribute to raising the technical bar in automation and cross-platform validation. Work effectively with globally distributed teams to coordinate testing across time zones.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$210,000 to $257,000.Minus equity and benefits</Salaryrange>
      <Skills>XCUITest, Espresso, Appium, Playwright, REST Assured, API contracts, Feature flags, Deterministic state management, AI-assisted test case and test data generation, Intelligent regression suite prioritization, Predictive defect detection, Risk-based testing, Flaky test detection, Automated stabilization insights, Log intelligence, Crash clustering, Anomaly detection, CI/CD pipelines, Pre-merge quality gates, Parallelized execution, Environment-aware test runs, Post-deployment smoke and regression coverage, Self-service test execution, Real-time reporting and dashboards, Faster debugging and failure triage, Scalable test data and environment management</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>EarnIn</Employername>
      <Employerlogo>https://logos.yubhub.co/earnin.com.png</Employerlogo>
      <Employerdescription>EarnIn is a financial technology company that provides earned wage access to individuals.</Employerdescription>
      <Employerwebsite>https://www.earnin.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/earnin/jobs/7403324</Applyto>
      <Location>Mountain View, US</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>cdd2fbec-490</externalid>
      <Title>Member of Technical Staff - Mid-training</Title>
      <Description><![CDATA[<p><strong>About xAI</strong></p>
<p>xAI&#39;s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.</p>
<p>Our team is small and highly motivated, focused on engineering excellence. We operate with a flat organisational structure, where all employees are expected to be hands-on and contribute directly to the company&#39;s mission.</p>
<p><strong>Responsibilities</strong></p>
<p>We&#39;re looking for a Member of Technical Staff to join our team. Key responsibilities include:</p>
<p>Scaling synthetic coding data to trillions of tokens with large-scale Docker verification. Distilling the intelligence of flagship models into flash models through synthetic data generation. Optimising mid-training data mixtures to boost the ceiling for RL. Engineering long-context data recipes. Developing robust and diverse evaluation for mid-training checkpoints.</p>
<p><strong>Basic Qualifications</strong></p>
<p>To be successful in this role, you&#39;ll need:</p>
<p>Expertise in ML and large model scaling, with familiarity across all kinds of scaling laws. Strong ability to design ML experiments. Familiarity with state-of-the-art techniques for curating AI training data for text, image, audio, and video modalities. Strong engineering abilities in Spark, Ray, and other frameworks for large-scale data processing.</p>
<p><strong>Compensation and Benefits</strong></p>
<p>The base salary for this role is $180,000 - $440,000 USD. Our total rewards package includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short &amp; long-term disability insurance, life insurance, and various other discounts and perks.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>mid</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange>$180,000 - $440,000 USD</Salaryrange>
      <Skills>ML, large model scaling, Docker verification, synthetic data generation, Spark, Ray</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>xAI</Employername>
      <Employerlogo>https://logos.yubhub.co/xai.com.png</Employerlogo>
      <Employerdescription>xAI creates AI systems to understand the universe and aid humanity in its pursuit of knowledge.</Employerdescription>
      <Employerwebsite>https://www.xai.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/xai/jobs/4965893007</Applyto>
      <Location>Palo Alto, CA</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>9e926934-312</externalid>
      <Title>Applied Scientist / Research Engineer (Internship)</Title>
      <Description><![CDATA[<p>About Mistral AI</p>
<p>At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.</p>
<p>We are a global company with teams distributed between France, USA, UK, Germany, and Singapore. We offer a comprehensive AI platform that meets enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.</p>
<p>Role Summary</p>
<p>Mistral AI is seeking Applied Scientists Interns and Research Engineers Interns to drive innovative research and collaborate with clients on complex research projects. You will develop SOTA models across different modalities such as text, image, and speech. By developing novel methods and research ideas, you will apply these models across a diverse set of use cases and domains.</p>
<p>Responsibilities</p>
<p>• Run pre-training, post-training, and deploy state-of-the-art models on clusters with thousands of GPUs.
• Generate and curate data for pre-training and post-training, working on evaluations and making sure the model&#39;s performance beats expectations.
• Develop the necessary tools and frameworks to facilitate data generation, model training, evaluation, and deployment.
• Collaborate with cross-functional teams to tackle complex use cases using agents and RAG pipelines.
• Manage research projects and communications with client research teams.</p>
<p>About You</p>
<p>• You are fluent in English, and have excellent communication skills. You are at ease explaining complex technical concepts to both technical and non-technical audiences.
• You&#39;re an expert with PyTorch or JAX.
• You&#39;re not afraid of contributing to a big codebase and can find yourself around independently with little guidance.
• You write clean, readable, high-performance, fault-tolerant Python code.
• You don&#39;t need roadmaps: you just do. You don&#39;t need a manager: you just ship.
• Low-ego, collaborative, and eager to learn.
• You have a track record of success through personal projects, professional projects, or in academia.</p>
<p>Benefits</p>
<p>• Competitive salary
• Food: Daily lunch vouchers
• Sport: Monthly contribution to a Gympass subscription
• Transportation: Monthly contribution to a mobility pass</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>internship</Jobtype>
      <Experiencelevel>entry</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange></Salaryrange>
      <Skills>PyTorch, JAX, Python, GPU, data generation, model training, evaluation, deployment, agents, multi-modality, robotics, diffusion models, time-series analysis</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Mistral AI</Employername>
      <Employerlogo>https://logos.yubhub.co/mistral.ai.png</Employerlogo>
      <Employerdescription>Mistral AI develops high-performance, open-source, and cutting-edge AI models, products, and solutions for enterprise needs.</Employerdescription>
      <Employerwebsite>https://mistral.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.lever.co/mistral/426ef8c0-eb26-4004-a690-f33c62b445a7</Applyto>
      <Location>Paris</Location>
      <Country></Country>
      <Postedate>2026-04-17</Postedate>
    </job>
    <job>
      <externalid>d9383bcf-242</externalid>
      <Title>Model Behavior Architect</Title>
      <Description><![CDATA[<p>About this role</p>
<p>As a Model Behavior Architect at Mistral AI, you will be at the forefront of defining and measuring Large Language Model (LLM) behavior. You will work closely with our Science team to define what &#39;good&#39; looks like for various tasks, including Reasoning, Audio, Alignment, Tools, and Frontier bets.</p>
<p>Responsibilities</p>
<ul>
<li>Interact with models to identify areas for improvement in model behavior</li>
<li>Gather internal and external feedback on model behavior to scope areas for improvement</li>
<li>Design and implement evaluation pipelines, data guidelines, data generation, and synthetic testing environments</li>
<li>Identify and fix edge case behaviors through rigorous testing</li>
<li>Develop robust evaluation pipelines for model candidates</li>
<li>Collaborate with AI Scientists</li>
</ul>
<p>About you</p>
<ul>
<li>You have a deep understanding of linguistics, language, and translation, engineering and code behavior, or LLM agents at work, including reasoning and tool use</li>
<li>You have prior knowledge in training and optimizing model behavior</li>
<li>You are an expert at building robust evaluations</li>
<li>You thrive in dynamic and technically complex environments</li>
<li>You have a track record of delivering innovative, out-of-the-box solutions to address real-world constraints</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange></Salaryrange>
      <Skills>Large Language Models, Model Evaluation, Policy Writing, Evaluation Pipelines, Data Generation, Synthetic Testing Environments</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Mistral AI</Employername>
      <Employerlogo>https://logos.yubhub.co/mistral.ai.png</Employerlogo>
      <Employerdescription>Mistral AI develops high-performance, open-source, and cutting-edge AI models, products, and solutions for enterprise and personal use.</Employerdescription>
      <Employerwebsite>https://mistral.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.lever.co/mistral/4337cebc-b951-4528-98f8-ebcb45db5645</Applyto>
      <Location>Paris</Location>
      <Country></Country>
      <Postedate>2026-04-17</Postedate>
    </job>
  </jobs>
</source>