<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>3b01c809-8ef</externalid>
      <Title>Staff Machine Learning Systems Engineer</Title>
      <Description><![CDATA[<p>As a Staff Machine Learning Systems Engineer at Reddit, you will lead the development of a platform for large-scale ML models. Your responsibilities will include designing end-to-end model lifecycle patterns (MLOps) to boost velocity of development for ML engineers, zero-to-one development and support of a graph ML codebase and platform, collaborating with ML engineers on performance tuning, optimizing batch data processing, and architecting pipelines to build and maintain massive graph data structures.</p>
<p>We are looking for an experienced engineer with 8+ years of experience in ML infrastructure, including model training and model deployments. You should have hands-on experience with ML optimization, cloud-based technologies, and MLOps tools, and proficiency with common ML programming languages and frameworks. A strong focus on scalability, reliability, performance, and ease of use is essential.</p>
<p>In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>$230,000-$322,000 USD</Salaryrange>
      <Skills>ML infrastructure, model training, model deployments, ML optimization, cloud-based technologies, MLOps tools, Python, PyTorch, TensorFlow, graph ML codebase and platform, Apache Beam, Apache Spark, Ray Data</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Reddit</Employername>
      <Employerlogo>https://logos.yubhub.co/redditinc.com.png</Employerlogo>
      <Employerdescription>Reddit is a community-driven platform with over 121 million daily active unique visitors and 100,000+ active communities.</Employerdescription>
      <Employerwebsite>https://www.redditinc.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/reddit/jobs/7731788</Applyto>
      <Location>Remote - United States</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>980a6242-1cf</externalid>
      <Title>Member of Technical Staff - Quantitative Research</Title>
      <Description><![CDATA[<p>We&#39;re looking for a full-stack scientist to pioneer quantitative research efforts at Udio. You will build at the intersection of research, engineering and product, bridging disciplines by drawing on huge, one-of-a-kind proprietary datasets of music, metadata and user interactions/feedback.</p>
<p>Design &amp; own evaluation/optimization frameworks for frontier music models. Dive deep under the hood of our music generation systems, applying computational &amp; human resources to understand model capabilities and identify areas for growth. Build optimization loops and apply your findings to our pretraining, post-training and inference systems as applicable.</p>
<p>Drive product &amp; research roadmap. Own our data roadmap end-to-end, formulating research questions, exploring/linking/expanding data sources and conducting experiments at your discretion. Your work will span data mining, machine learning, causal inference, survey design and more, and your results will be critical for decision-making in product development, research investment and overall business direction.</p>
<p>Build stable infrastructure. Your work will reach far beyond the Jupyter kernel, manifesting in robust integrations with our research &amp; product tech stacks, potentially in performance-critical paths. You&#39;ll also build large-scale standalone data processing systems, allocating resources as needed to manage the data ecosystem.</p>
<p>Champion scientific rigor. As our first quantitative researcher, you&#39;ll cultivate a culture of scientific rigor across the company and deepen common understanding of models, users and data. You&#39;ll proactively identify opportunities, define metrics, share results, and build a rigorous foundation upon which to understand our highly subjective domain.</p>
<p>We&#39;re looking for someone with deep quantitative expertise, preferably a Ph.D. in statistics, mathematics, physics, or another quantitative discipline, or 5+ years&#39; industry experience as a quantitative analyst / data scientist. Autonomy &amp; ownership are key, as you&#39;ll thrive in greenfield research domains, undefined product categories and small, flat teams. Engineering chops are also important, as you&#39;ll need to translate your ideas into clear, production-ready code and collaborate in an active research codebase.</p>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>$250k - $350k</Salaryrange>
      <Skills>Ph.D. in statistics, mathematics, physics, or another quantitative discipline, 5+ years&apos; industry experience as a quantitative analyst / data scientist, Deep learning frameworks, JAX, GCP, Apache Beam/DataFlow, Kubernetes, TensorFlow Data / TFRecord, Obsession with music &amp; the science of sound, Experience in DSP, MIR, music production / composition / performance, Big record collection</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Udio</Employername>
      <Employerlogo>https://logos.yubhub.co/udio.com.png</Employerlogo>
      <Employerdescription>Udio builds AI experiences to empower musical artists and super fans, using best-in-class AI models and partnerships across the music industry.</Employerdescription>
      <Employerwebsite>https://udio.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/udio/jobs/5081608008</Applyto>
      <Location>New York City (Remote possible for exceptional candidates)</Location>
      <Country></Country>
      <Postedate>2026-04-17</Postedate>
    </job>
    <job>
      <externalid>e06c831d-23a</externalid>
      <Title>Machine Learning Engineer</Title>
      <Description><![CDATA[<p>The Personalization team at Spotify makes deciding what to play next on Spotify easier and more enjoyable for every listener. We seek to understand the world of music, podcasts, and audiobooks better than anyone else so that we can make great recommendations to every individual person and keep the world listening.</p>
<p>Our Minesweeper squad produces Human Understandable Language Knowledge to enrich music and talk content understanding. We use AI and ML techniques, including Large Language Models, to understand music, podcasts and audiobooks, building reliable, scalable systems to distribute that knowledge to Spotify internal teams, users, and creators.</p>
<p>We are looking for a Machine Learning Engineer to join our team and help build the future of music, podcast and audiobook listening experiences for millions of listeners at Spotify. This is a unique opportunity to help develop and shape Spotify content enrichment and recommendations.</p>
<p>As a Machine Learning Engineer, you will:</p>
<ul>
<li>Utilize in-house and 3rd party LLMs to solve language understanding problems</li>
<li>Employ techniques such as fine-tuning and RAG to improve models</li>
<li>Contribute to designing, building, evaluating, shipping, and refining Spotify’s product by hands-on ML development</li>
<li>Help drive optimization, testing, and tooling to improve quality of our content enrichment assets</li>
<li>Collaborate with cross-functional teams of MLEs, data and backend engineers, and other stakeholders including tech research, data science, and product to develop new features and technologies</li>
<li>Perform data analysis to establish baselines and inform product decisions</li>
<li>Stay up-to-date on the latest machine learning algorithms and techniques</li>
</ul>
<p>You will be part of a motivated and supportive team that values agile software processes, data-driven development, reliability, and disciplined experimentation.</p>
<p>If you have a strong background in machine learning, especially experience with Large Language Models, and are passionate about fostering collaborative teams, we encourage you to apply.</p>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>mid</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>$138,250-$197,500</Salaryrange>
      <Skills>Large Language Models, Machine Learning, Python, Scala, Java, SQL, PyTorch, TensorFlow, Ray, TFX, Apache Beam, Dataflow, Spark</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Spotify</Employername>
      <Employerlogo>https://logos.yubhub.co/spotify.com.png</Employerlogo>
      <Employerdescription>Spotify is a music streaming service that offers users access to millions of songs, podcasts, and audiobooks. It has hundreds of millions of users worldwide.</Employerdescription>
      <Employerwebsite>https://www.spotify.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.lever.co/spotify/de3f6a47-4d75-4512-8351-b362f1d1c32e</Applyto>
      <Location>North America</Location>
      <Country></Country>
      <Postedate>2026-03-31</Postedate>
    </job>
    <job>
      <externalid>da758a3e-06e</externalid>
      <Title>Machine Learning Engineer</Title>
      <Description><![CDATA[<p>The Personalization (PZN) team at Spotify makes deciding what to play next on Spotify easier and more enjoyable for every listener. We seek to understand the world of music, podcasts and audiobooks better than anyone else so that we can make great recommendations to every individual and keep the world listening.</p>
<p>The TurnTable team’s mission is to own and innovate on AI DJ and the interactive listening experiences. Using a mixture of LLMs and traditional ML, we strive to provide depth and connection for all listeners. We are looking for a Machine Learning Engineer to join our team to build and improve our interactive listening experiences.</p>
<p><strong>Responsibilities</strong></p>
<ul>
<li>Design, build, evaluate, and ship an agent-based DJ solution that brings our DJ and interactive experiences to the next level.</li>
<li>Collaborate with cross-functional teams spanning user research, design, data science, product management, and engineering to build new product features that advance our mission to connect artists and fans in personalized and useful ways.</li>
<li>Prototype new approaches and productionize solutions at scale for our hundreds of millions of active users.</li>
<li>Promote and role-model best practices of ML systems development, testing, evaluation, etc., both inside the team as well as throughout the organization.</li>
<li>Be part of an active group of machine learning practitioners.</li>
</ul>
<p><strong>Requirements</strong></p>
<ul>
<li>An experienced ML practitioner motivated to work on complex real-world problems in a fast-paced and collaborative environment.</li>
<li>Strong background in machine learning, natural language processing, and generative AI, with experience in applying theory to develop real-world applications.</li>
<li>Hands-on expertise with implementing end-to-end production ML systems at scale. Experience with production-scale LLM-based systems is a plus.</li>
<li>Experience with incorporating human feedback to improve LLM-based systems using techniques like DPO, KTO, and reinforcement fine-tuning.</li>
<li>Experience with designing end-to-end tech specs and modular architectures for ML frameworks in complex problem spaces in collaboration with product teams.</li>
<li>Experience with large-scale, distributed data processing frameworks/tools like Apache Beam, Apache Spark, and cloud platforms like GCP or AWS.</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Health insurance</li>
<li>Six-month paid parental leave</li>
<li>401(k) retirement plan</li>
<li>Monthly meal allowance</li>
<li>23 paid days off</li>
<li>13 paid flexible holidays</li>
<li>Paid sick leave</li>
</ul>
<p>The United States base range for this position is $176,166 - $251,666 plus equity.</p>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>$176,166 - $251,666</Salaryrange>
      <Skills>Machine Learning, Natural Language Processing, Generative AI, Apache Beam, Apache Spark, GCP, AWS</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Spotify</Employername>
      <Employerlogo>https://logos.yubhub.co/spotify.com.png</Employerlogo>
      <Employerdescription>Spotify is a music streaming service that provides access to millions of songs, podcasts, and audiobooks. It has hundreds of millions of active users worldwide.</Employerdescription>
      <Employerwebsite>https://www.spotify.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.lever.co/spotify/0cd7549d-880c-4861-b343-c0564cc8e9de</Applyto>
      <Location>North America</Location>
      <Country></Country>
      <Postedate>2026-03-31</Postedate>
    </job>
    <job>
      <externalid>11a36eab-3cb</externalid>
      <Title>Senior Data Engineer</Title>
      <Description><![CDATA[<p><strong>Job Description</strong></p>
<p>Are you ready to contribute to the evolution of our data pipelines for our B2C division? At Future, we are transforming our data-driven decision-making processes and we are looking for a passionate and experienced Data Engineer to join us.</p>
<p>This is an exciting opportunity for someone who excels in a creative environment, enjoys solving complex data challenges, and is eager to build impactful business insights. In this role you will report directly to the Head of Data Engineering.</p>
<p><strong>Responsibilities</strong></p>
<ul>
<li>Develop and maintain new/current features of the data platform.</li>
<li>Responsible for delivery of development projects, including scoping, writing and sizing of stories involved.</li>
<li>Take ownership of BAU processes, develop area-specific domain mastery, and seek means to automate them or reduce their impact.</li>
<li>Propose and advocate for changes to reduce risk, cost and overhead.</li>
<li>Provide appropriate documentation for pipelines developed.</li>
<li>Parameterise pipelines so configuration can be changed easily without having to perform deep changes to the codebase.</li>
<li>Apply appropriate testing principles to ensure code is fit for purpose.</li>
</ul>
<p><strong>Experience</strong></p>
<ul>
<li>Experience using Python on Google Cloud Platform for Big Data projects, BigQuery, Dataflow (Apache Beam), Cloud Run Functions, Cloud Run, Cloud Workflows, Cloud Composer</li>
<li>SQL development skills</li>
<li>Experience using Dataform or dbt</li>
<li>Demonstrated strength in data modelling, ETL development, and data warehousing</li>
<li>Knowledge of data management fundamentals and data storage principles</li>
<li>Familiarity with statistical models or data mining algorithms and practical experience applying these to business problems</li>
</ul>
<p><strong>What&#39;s in it for you</strong></p>
<p>The expected range for this role is £50,000 - £60,000</p>
<p>This is a Hybrid role from our Bath Office, working three days from the office, two from home. Plus more great perks, which include:</p>
<ul>
<li>Uncapped leave, because we trust you to manage your workload and time</li>
<li>When we hit our targets, enjoy a share of our profits with a bonus</li>
<li>Refer a friend and get rewarded when they join Future</li>
<li>Wellbeing support with access to our Colleague Assistant Programmes</li>
<li>Opportunity to purchase shares in Future, with our Share Incentive Plan</li>
</ul>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>£50,000 - £60,000</Salaryrange>
      <Skills>Python, Google Cloud Platform, BigQuery, Dataflow, Apache Beam, Cloud Run Functions, Cloud Run, Cloud Workflows, Cloud Composer, SQL, Dataform, dbt, data modelling, ETL development, data warehousing, data management fundamentals, data storage principles, statistical models, data mining algorithms</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Future</Employername>
      <Employerlogo>https://logos.yubhub.co/j.com.png</Employerlogo>
      <Employerdescription>Future is a global leader in specialist media, with over 3,000 employees working across 200+ media brands.</Employerdescription>
      <Employerwebsite>https://apply.workable.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://apply.workable.com/j/3535C2B9B5</Applyto>
      <Location>Bath</Location>
      <Country></Country>
      <Postedate>2026-03-09</Postedate>
    </job>
    <job>
      <externalid>6d5e164b-74d</externalid>
      <Title>Data Engineer</Title>
      <Description><![CDATA[<p><strong>Data Engineer</strong></p>
<p>Are you ready to contribute to the evolution of our data pipelines for our B2C division? We are transforming our data-driven decision-making processes and we are looking for a passionate and experienced Data Engineer to join us. This is an exciting opportunity for someone who thrives in a creative environment and enjoys solving complex data challenges. You&#39;ll report into the Lead Data Engineer for this position and sit within the wider Data Engineering team.</p>
<p>The Data &amp; Business Intelligence team guides our organisation to become more data-driven. Our ability to respond to market changes gives us a competitive edge. By ensuring visibility of objective performance data, we empower our teams to make rapid, informed decisions that enhance overall performance.</p>
<p><strong>Responsibilities</strong></p>
<ul>
<li>Maintain new/current features of the data platform.</li>
<li>Responsible for delivery of development projects.</li>
<li>Utilise established software engineering practices and principles.</li>
<li>Take ownership of BAU processes and develop area-specific domain mastery.</li>
<li>Ensure compliance matters are followed.</li>
<li>Utilise CI/CD and infrastructure as code (Terraform) for rapid deployment of changes.</li>
</ul>
<p><strong>Experience</strong></p>
<ul>
<li>Experience using Python on Google Cloud Platform for Big Data projects, BigQuery, Dataflow (Apache Beam), Cloud Run Functions, Cloud Run, Cloud Workflows, Cloud Composer.</li>
<li>SQL development skills.</li>
<li>Demonstrated strength in data modelling, ETL development, and data warehousing.</li>
<li>Knowledge of data management fundamentals and data storage principles.</li>
<li>Familiarity with statistical models or data mining algorithms and practical experience applying these to business problems.</li>
</ul>
<p><strong>What&#39;s in it for you</strong></p>
<p>The expected range for this role is £45,000 - £50,000. This is a Hybrid role from our Bath Office, working three days from the office, two from home. Plus more great perks, which include:</p>
<ul>
<li>Uncapped leave, because we trust you to manage your workload and time.</li>
<li>When we hit our targets, enjoy a share of our profits with a bonus.</li>
<li>Refer a friend and get rewarded when they join Future.</li>
<li>Wellbeing support with access to our Colleague Assistant Programmes.</li>
<li>Opportunity to purchase shares in Future, with our Share Incentive Plan.</li>
</ul>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>mid</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>£45,000 - £50,000</Salaryrange>
      <Skills>Python, Google Cloud Platform, BigQuery, Dataflow, Apache Beam, Cloud Run Functions, Cloud Run, Cloud Workflows, Cloud Composer, SQL, data modelling, ETL development, data warehousing, data management fundamentals, data storage principles, statistical models, data mining algorithms</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Future</Employername>
      <Employerlogo>https://logos.yubhub.co/j.com.png</Employerlogo>
      <Employerdescription>Future is a global leader in specialist media, with over 3,000 employees working across 200+ media brands.</Employerdescription>
      <Employerwebsite>https://apply.workable.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://apply.workable.com/j/BDB1B6F4CF</Applyto>
      <Location>Bath</Location>
      <Country></Country>
      <Postedate>2026-03-09</Postedate>
    </job>
    <job>
      <externalid>7f345e34-fa0</externalid>
      <Title>Software Engineering Manager</Title>
      <Description><![CDATA[<p>At Ford Motor Company, we believe freedom of movement drives human progress. We are seeking a Software Engineering Manager to provide engineering leadership to multiple product lines within the Ford Customer Service Division (FCSD). FCSD is a true one-stop shop, offering comprehensive diagnostics, repair, and service capabilities for a full portfolio of electrified, hybrid, and internal combustion vehicles globally.</p>
<p><strong>Responsibilities</strong></p>
<p><strong>Provide Engineering Leadership</strong></p>
<ul>
<li>Provide engineering leadership to multiple product lines within FCSD</li>
<li>Help business partners understand our iterative development approach and focus on delivering a Minimum Viable Product (MVP) and releases</li>
<li>Design and deliver industry-leading products and services to maximize value and productivity for commercial customers</li>
</ul>
<p><strong>Ensure Software Engineering Excellence</strong></p>
<ul>
<li>Ensure software engineering excellence (e.g. best practices and quality) is achieved within the FCSD Tech product line</li>
<li>Collaborate with other Product Line Anchors to reduce complexity across the portfolio, enhance interoperability between services, and build reusable API services</li>
</ul>
<p><strong>Provide Thought Leadership</strong></p>
<ul>
<li>Provide thought leadership for the development, structure, technology, and tools used within FCSD</li>
<li>Innovate and operate with an iterative, agile, and user-centric perspective</li>
</ul>
<p><strong>Communicate Technology Strategy</strong></p>
<ul>
<li>Clearly communicate technology strategy and vision to team members and internal and external stakeholders</li>
<li>Demonstrate software engineering excellence through actively coding, pairing, and performing code and architecture reviews with the software engineers within the FCSD Tech product line</li>
</ul>
<p><strong>Qualifications</strong></p>
<ul>
<li>Bachelor&#39;s degree in Computer Science, Engineering, or a related field</li>
<li>5+ years of experience with progressive leadership responsibilities in Software Engineering, Architecture, and Agile Framework</li>
<li>Experience with Lean methodology &amp; eXtreme Programming</li>
<li>Must be able to operationalize and assist teams with abstract technology concepts</li>
<li>Strong communication, collaborative, and influencing skills</li>
<li>Proven ability to work closely with senior leadership</li>
<li>Strong personal presence and capabilities to resolve technical concerns</li>
<li>Demonstrated ability to drive development of highly technical technology services and capabilities</li>
<li>Demonstrated understanding and ability to drive API economy and solutions</li>
<li>Demonstrated understanding and ability to drive highly available consumer-ready Internet properties and technical platforms</li>
<li>Experience collaborating with engineers, designers, and product owners</li>
<li>Excellent communication skills with the ability to adapt your communication style to the audience</li>
<li>Ability to work collaboratively and navigate complex decision making in a rapidly changing environment</li>
<li>Strong leadership and communication skills and the ability to teach others</li>
<li>3+ years of experience building and supporting cloud-native applications using the Java, Spring Boot, and React tech stack</li>
<li>Experience with cloud services and platform knowledge</li>
<li>Modern databases (Relational and non-relational)</li>
<li>Continuous integration/continuous delivery tools and pipelines, such as Tekton, Jenkins, Terraform, SonarQube, Maven, Gradle, Harness, Apigee X, etc.</li>
<li>Experience developing and deploying to cloud platforms, such as Google Cloud Platform, Pivotal Cloud Foundry, Amazon Web Services, and Microsoft Azure</li>
<li>Experience with GCP Dataflow (Apache Beam) and workflow orchestration</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Immediate medical, dental, vision, and prescription drug coverage</li>
<li>Flexible family care days, paid parental leave, new parent ramp-up programs, subsidized back-up child care, and more</li>
<li>Family building benefits, including adoption and surrogacy expense reimbursement, fertility treatments, and more</li>
<li>Vehicle discount program for employees and family members and management leases</li>
<li>Tuition assistance</li>
<li>Established and active employee resource groups</li>
<li>Paid time off for individual and team community service</li>
<li>A generous schedule of paid holidays, including the week between Christmas and New Year&#39;s Day</li>
<li>Paid time off and the option to purchase additional vacation time</li>
</ul>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>This position is a range of salary grade LL6.</Salaryrange>
      <Skills>Software Engineering, Agile Framework, Lean methodology, eXtreme Programming, Java, Spring Boot, React, Cloud services, Platform knowledge, Modern databases, Continuous integration/continuous delivery tools, Pipelines, GCP Dataflow, Apache Beam, Workflow orchestration, Cloud-native applications, Cloud platforms, API economy, Highly available consumer-ready Internet properties, Technical platforms</Skills>
      <Category>Engineering</Category>
      <Industry>Automotive</Industry>
      <Employername>Ford Motor Company</Employername>
      <Employerlogo></Employerlogo>
      <Employerdescription>Ford Motor Company is a multinational automaker that designs, manufactures, and markets vehicles and automotive-related products. It is one of the largest automakers in the world.</Employerdescription>
      <Employerwebsite>https://efds.fa.em5.oraclecloud.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://efds.fa.em5.oraclecloud.com/hcmUI/CandidateExperience/en/sites/CX_1/job/59597</Applyto>
      <Location>United States</Location>
      <Country></Country>
      <Postedate>2026-03-09</Postedate>
    </job>
    <job>
      <externalid>0841fcf4-9ab</externalid>
      <Title>Data Engineer SE - II</Title>
      <Description><![CDATA[<p>We are on a mission to rid the world of bad customer service by “mobilizing” the way help is delivered. Today’s consumers want an always-available customer service experience that leaves them feeling valued and respected.</p>
<p>Helpshift helps B2B brands deliver this modern customer service experience through a mobile-first approach. We have changed how conversations take place, moving the conversation away from a slow, outdated email and desktop experience to an in-app chat experience that allows users to interact with brands in their own time.</p>
<p>Through our market-leading AI-powered chatbots and automation, we help brands deliver instant and rapid resolutions. Because agents play a key role in delivering help, our platform gives agents superpowers with automation and AI that simply works.</p>
<p><strong>About the Team</strong></p>
<p>Consumers care first and foremost about having their time valued by brands. Brands need insights into their customer service operation to serve their consumers effectively. Such insights and analytics are delivered through various data products like in-app analytics dashboards and data-sharing integrations.</p>
<p>The data platform team is responsible for designing, building, and maintaining the data infrastructure that enables such data and analytics products at scale. We build and manage data pipelines, databases, and other data structures to ensure that the data is reliable, accurate, and easily accessible.</p>
<p>We also support internal stakeholders, including the business intelligence and machine learning teams, with data ops. The team manages a platform that handles 2 million events per minute and processes 1+ terabytes of data daily.</p>
<p><strong>About the Role</strong></p>
<ul>
<li>Building maintainable data pipelines both for data ingestion and operational analytics for data collected from 2 billion devices and 900M Monthly active users</li>
<li>Building customer-facing analytics products that deliver actionable insights and data and make anomalies easy to detect</li>
<li>Collaborating with data stakeholders to see what their data needs are and being a part of the analysis process</li>
<li>Writing design specifications and test, deployment, and scaling plans for the data pipelines</li>
<li>Mentoring people in the team &amp; organization</li>
</ul>
<p><strong>Requirements</strong></p>
<ul>
<li>3+ years of experience in building and running data pipelines that scale for TBs of data</li>
<li>Proficiency in a high-level object-oriented programming language (Python or Java) is a must</li>
<li>Experience with cloud data platforms like Snowflake and AWS (EMR/Athena) is a must</li>
<li>Experience in building modern data lakehouse architectures using Snowflake and columnar formats like Apache Iceberg/Hudi, Parquet, etc</li>
<li>Proficiency in Data modeling, SQL query profiling, and data warehousing skills is a must</li>
<li>Experience in distributed data processing engines like Apache Spark, Apache Flink, Dataflow/Apache Beam, etc.</li>
<li>Knowledge of workflow orchestrators like Airflow, Dagster, etc. is a plus</li>
<li>Data visualization skills are a plus (PowerBI, Metabase, Tableau, Hex, Sigma, etc)</li>
<li>Excellent verbal and written communication skills</li>
<li>Bachelor’s Degree in Computer Science (or equivalent)</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Hybrid setup</li>
<li>Worker&#39;s insurance</li>
<li>Paid time off</li>
<li>Other employee benefits to be discussed by our Talent Acquisition team in India.</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange></Salaryrange>
      <Skills>Python, Java, Snowflake, AWS, EMR/Athena, Apache Iceberg/Hudi, Parquet, Apache Spark, Apache Flink, Dataflow/Apache Beam, Airflow, Data modeling, SQL query profiling, data warehousing, PowerBI, Metabase, Tableau, Hex, Sigma</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Helpshift</Employername>
      <Employerlogo>https://logos.yubhub.co/j.com.png</Employerlogo>
      <Employerdescription>Helpshift is a company that provides a mobile-first customer service experience for B2B brands. It has over 900 million active monthly consumers and is used by hundreds of leading brands.</Employerdescription>
      <Employerwebsite>https://apply.workable.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://apply.workable.com/j/D451DB2325</Applyto>
      <Location>Pune, Maharashtra, India</Location>
      <Country></Country>
      <Postedate>2026-03-09</Postedate>
    </job>
    <job>
      <externalid>5008b4f7-b62</externalid>
      <Title>Member of Technical Staff - Data Research Engineer - MAI Superintelligence Team</Title>
      <Description><![CDATA[<p>We are seeking Data Research Engineers to join our Multimodal team, where we are building the next generation of foundation models across vision, language, audio, and beyond. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you.</p>
<p>In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse multimodal data sources critical to model development. You will lead efforts to:</p>
<ul>
<li>Develop novel data collection strategies</li>
<li>Improve dataset quality and integrity</li>
<li>Understand data-driven model behaviors</li>
<li>Align datasets with ethical and societal values</li>
</ul>
<p>This is a cross-disciplinary, high-impact role ideal for engineers who want to push the boundaries of what AI can learn from data, especially in multimodal contexts.</p>
<p>Microsoft Superintelligence Team</p>
<p>The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control.</p>
<p>Responsibilities</p>
<ul>
<li>Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.</li>
<li>Develop and maintain scalable data pipelines for multimodal ingestion, preprocessing, filtering, and annotation.</li>
<li>Analyze real-world multimodal datasets to assess quality, diversity, relevance, and identify areas for improvement.</li>
<li>Build lightweight tools and workflows for dataset auditing, visualization, and versioning.</li>
<li>Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.</li>
<li>Embody our culture and values.</li>
</ul>
<p>Qualifications</p>
<ul>
<li>Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.) OR equivalent experience.</li>
<li>2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.</li>
<li>Proficiency in statistics and exploratory data analysis methods.</li>
<li>Familiarity with data processing frameworks such as Spark, Ray, or Apache Beam.</li>
<li>Ability to communicate technical findings clearly to research and product teams.</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>USD $119,800 – $234,700 per year</Salaryrange>
      <Skills>Python, Pandas, NumPy, Spark, Ray, Apache Beam, Data analysis, Data engineering, Statistics, Exploratory data analysis, Data processing frameworks, Lightweight tools and workflows, Dataset auditing, Visualization, Versioning</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft is a multinational technology company that develops, manufactures, licenses, and supports a wide range of software products, services, and devices.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-data-research-engineer-mai-superintelligence-team-6/</Applyto>
      <Location>New York</Location>
      <Country></Country>
      <Postedate>2026-03-08</Postedate>
    </job>
    <job>
      <externalid>9ee6a205-e17</externalid>
      <Title>Member of Technical Staff - Pretraining Text Data</Title>
      <Description><![CDATA[<p>We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you.</p>
<p>In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:</p>
<ul>
<li>Develop novel data collection strategies</li>
<li>Improve dataset quality and integrity</li>
<li>Understand data-driven model behaviors</li>
<li>Train models to understand the impact of data and data mixes</li>
<li>Align datasets with ethical and societal values</li>
</ul>
<p>This is a cross-disciplinary, high-impact role ideal for engineers and researchers who want to push the boundaries of what AI can learn from data.</p>
<p>Microsoft Superintelligence Team</p>
<p>The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control.</p>
<p>Responsibilities</p>
<ul>
<li>Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.</li>
<li>Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.</li>
<li>Analyze real-world text datasets to assess quality, diversity, relevance, and identify areas for improvement.</li>
<li>Build lightweight tools and workflows for dataset auditing, visualization, and versioning.</li>
<li>Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.</li>
<li>Embody our culture and values.</li>
</ul>
<p>Qualifications</p>
<ul>
<li>Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.) OR equivalent experience.</li>
<li>2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.</li>
<li>Proficiency in statistics and exploratory data analysis methods.</li>
<li>Familiarity with data processing frameworks such as Spark, Ray, or Apache Beam.</li>
<li>Ability to communicate technical findings clearly to research and product teams.</li>
</ul>
<p>Software Engineering IC4</p>
<ul>
<li>The typical base pay range for this role across the U.S. is USD $119,800 – $234,700 per year.</li>
</ul>
<p>Software Engineering IC5</p>
<ul>
<li>The typical base pay range for this role across the U.S. is USD $139,900 – $274,800 per year.</li>
</ul>
<p>This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>USD $119,800 – $234,700 per year</Salaryrange>
      <Skills>Python, Pandas, NumPy, Spark, Ray, Apache Beam, Data Science, Statistics, Physics, Engineering, Data Analysis, Data Engineering, Exploratory Data Analysis</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft is a multinational technology company that develops, manufactures, licenses, and supports a wide range of software products, services, and devices.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-pretraining-text-data-3/</Applyto>
      <Location>New York</Location>
      <Country></Country>
      <Postedate>2026-03-08</Postedate>
    </job>
    <job>
      <externalid>f0e01847-2e0</externalid>
      <Title>Member of Technical Staff - Data Research Engineer - MAI Superintelligence Team</Title>
      <Description><![CDATA[<p>We are seeking Data Research Engineers to join our Multimodal team, where we are building the next generation of foundation models across vision, language, audio, and beyond. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you. In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse multimodal data sources critical to model development. You will lead efforts to:</p>
<ul>
<li>Develop novel data collection strategies</li>
<li>Improve dataset quality and integrity</li>
<li>Understand data-driven model behaviors</li>
<li>Align datasets with ethical and societal values</li>
</ul>
<p>This is a cross-disciplinary, high-impact role ideal for engineers who want to push the boundaries of what AI can learn from data, especially in multimodal contexts.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>USD $119,800 – $234,700 per year (U.S.) or USD $158,400 – $258,000 per year (San Francisco Bay area and New York City metropolitan area)</Salaryrange>
      <Skills>Python, Pandas, NumPy, data libraries, data analysis, data engineering, large-scale datasets, unstructured or semi-structured data, statistics, exploratory data analysis methods, data processing frameworks, Spark, Ray, Apache Beam, Master’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline, 8+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft is a multinational technology company that develops, manufactures, licenses, and supports a wide range of software products, services, and devices.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-data-research-engineer-mai-superintelligence-team-4/</Applyto>
      <Location>Mountain View</Location>
      <Country></Country>
      <Postedate>2026-03-08</Postedate>
    </job>
    <job>
      <externalid>2bfc37e4-bc3</externalid>
      <Title>Researcher, Pretraining Safety</Title>
      <Description><![CDATA[<p><strong>Job Posting</strong></p>
<p><strong>Researcher, Pretraining Safety</strong></p>
<p><strong>Location</strong></p>
<p>San Francisco</p>
<p><strong>Employment Type</strong></p>
<p>Full time</p>
<p><strong>Department</strong></p>
<p>Safety Systems</p>
<p><strong>Compensation</strong></p>
<ul>
<li>$295K – $445K • Offers Equity</li>
</ul>
<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>
<ul>
<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>
<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>
<li>401(k) retirement plan with employer match</li>
<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>
<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>
<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>
<li>Mental health and wellness support</li>
<li>Employer-paid basic life and disability coverage</li>
<li>Annual learning and development stipend to fuel your professional growth</li>
<li>Daily meals in our offices, and meal delivery credits as eligible</li>
<li>Relocation support for eligible employees</li>
<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>
</ul>
<p>More details about our benefits are available to candidates during the hiring process.</p>
<p>This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.</p>
<p><strong>About the Team</strong></p>
<p>The Safety Systems team is responsible for the safety work that ensures our best models can be safely deployed to the real world to benefit society. It is at the forefront of OpenAI&#39;s mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.</p>
<p>The Pretraining Safety team’s goal is to build safer, more capable base models and enable earlier, more reliable safety evaluation during training. We aim to:</p>
<ol>
<li><strong>Develop upstream safety evaluations</strong> that monitor how and when unsafe behaviors and goals emerge;</li>
<li><strong>Create safer priors</strong> through targeted pretraining and mid-training interventions that make downstream alignment more effective and efficient;</li>
<li><strong>Design safe-by-design architectures</strong> that allow for more controllability of model capabilities.</li>
</ol>
<p>In addition, we will conduct the foundational research necessary for understanding how behaviors emerge, generalize, and can be reliably measured throughout training.</p>
<p><strong>About the Role</strong></p>
<p>The Pretraining Safety team is pioneering how safety is built into models before they reach post-training and deployment. In this role, you will work throughout the full stack of model development with a focus on pre-training:</p>
<ul>
<li>Identify safety-relevant behaviors as they first emerge in base models</li>
<li>Evaluate and reduce risk without waiting for full-scale training runs</li>
<li>Design architectures and training setups that make safer behavior the default</li>
<li>Strengthen models by incorporating richer, earlier safety signals</li>
</ul>
<p>We collaborate across OpenAI’s safety ecosystem—from Safety Systems to Training—to ensure that safety foundations are robust, scalable, and grounded in real-world risks.</p>
<p><strong>In this role, you will:</strong></p>
<ul>
<li>Develop new techniques to predict, measure, and evaluate unsafe behavior in early-stage models</li>
<li>Design data curation strategies that improve pretraining priors and reduce downstream risk</li>
<li>Explore safe-by-design architectures and training configurations that improve controllability</li>
<li>Introduce novel safety-oriented loss functions, metrics, and evals into the pretraining stack</li>
<li>Work closely with cross-functional safety teams to unify pre- and post-training risk reduction</li>
</ul>
<p><strong>You might thrive in this role if you:</strong></p>
<ul>
<li>Have experience developing or scaling pretraining architectures (LLMs, diffusion models, multimodal models, etc.)</li>
<li>Are comfortable working with training infrastructure, data pipelines, and evaluation frameworks (e.g., Python, PyTorch/JAX, Apache Beam)</li>
<li>Enjoy hands-on research: designing, implementing, and iterating on experiments</li>
<li>Enjoy collaborating with diverse technical and cross-functional partners (e.g., policy, legal, training)</li>
<li>Are data-driven with strong statistical reasoning and rigor in experimental design</li>
<li>Value building clean, scalable research workflows and streamlining processes for yourself and others</li>
</ul>
<p><strong>About OpenAI</strong></p>
<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>mid</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange>$295K – $445K • Offers Equity</Salaryrange>
      <Skills>pretraining architectures, training infrastructure, data pipelines, evaluation frameworks, Python, PyTorch/JAX, Apache Beam, hands-on research, collaboration, data-driven, statistical reasoning, LLMs, diffusion models, multimodal models, safe-by-design architectures, training configurations, loss functions, metrics, evals</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>OpenAI</Employername>
      <Employerlogo>https://logos.yubhub.co/openai.com.png</Employerlogo>
      <Employerdescription>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products.</Employerdescription>
      <Employerwebsite>https://jobs.ashbyhq.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.ashbyhq.com/openai/d829b701-5ee2-414f-8596-ef94911a168a</Applyto>
      <Location>San Francisco</Location>
      <Country></Country>
      <Postedate>2026-03-06</Postedate>
    </job>
    <job>
      <externalid>55f3e52b-904</externalid>
      <Title>Member of Technical Staff - Data Research Engineer</Title>
      <Description><![CDATA[<p><strong>Summary</strong></p>
<p>Microsoft AI are looking for a talented Member of Technical Staff - Data Research Engineer at their Redmond office. This role sits at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse multimodal data sources critical to model development. You will lead efforts to develop novel data collection strategies, improve dataset quality and integrity, understand data-driven model behaviors, and align datasets with ethical and societal values.</p>
<p><strong>About the Role</strong></p>
<p>As a Data Research Engineer, you will be responsible for creating high-quality datasets for training and evaluation, running experiments on new datasets (data ablations) to assess their impact and determine the most effective data. You will also develop and maintain scalable data pipelines for multimodal ingestion, preprocessing, filtering, and annotation. Additionally, you will analyze real-world multimodal datasets to assess quality, diversity, relevance, and identify areas for improvement. You will build lightweight tools and workflows for dataset auditing, visualization, and versioning. You will collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.</p>
<p><strong>Accountabilities</strong></p>
<ul>
<li>Create high-quality datasets for training and evaluation</li>
<li>Run experiments on new datasets (data ablations) to assess their impact and determine the most effective data</li>
<li>Develop and maintain scalable data pipelines for multimodal ingestion, preprocessing, filtering, and annotation</li>
<li>Analyze real-world multimodal datasets to assess quality, diversity, relevance, and identify areas for improvement</li>
<li>Build lightweight tools and workflows for dataset auditing, visualization, and versioning</li>
</ul>
<p><strong>The Candidate we&#39;re looking for</strong></p>
<p><strong>Experience:</strong></p>
<ul>
<li>4+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)</li>
</ul>
<p><strong>Technical skills:</strong></p>
<ul>
<li>Proficiency in statistics and exploratory data analysis methods</li>
<li>Familiarity with data processing frameworks such as Spark, Ray, or Apache Beam</li>
</ul>
<p><strong>Personal attributes:</strong></p>
<ul>
<li>Ability to communicate technical findings clearly to research and product teams</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Competitive salary</li>
<li>Comprehensive benefits package</li>
<li>Opportunities for professional growth and development</li>
<li>Collaborative and dynamic work environment</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange>USD $119,800 – $234,700 per year</Salaryrange>
      <Skills>Python, Pandas, NumPy, Spark, Ray, Apache Beam, statistics, exploratory data analysis, data processing frameworks</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft AI</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft AI is a leading technology company that specializes in artificial intelligence, machine learning, and data science. They are known for their innovative products and services that help organizations make data-driven decisions. Microsoft AI is committed to empowering every person and organization on the planet to achieve more.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-data-research-engineer-mai-superintelligence-team-5/</Applyto>
      <Location>Redmond</Location>
      <Country></Country>
      <Postedate>2026-03-06</Postedate>
    </job>
    <job>
      <externalid>41ac4a39-9a3</externalid>
      <Title>Member of Technical Staff - Pretraining Text Data</Title>
      <Description><![CDATA[<p><strong>Summary</strong></p>
<p>Microsoft AI are looking for a talented Member of Technical Staff - Pretraining Text Data at their Redmond office. This role sits at the intersection of data and innovation: you will collaborate with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development.</p>
<p><strong>About the Role</strong></p>
<p>We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you. In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:</p>
<ul>
<li>Develop novel data collection strategies</li>
<li>Improve dataset quality and integrity</li>
<li>Understand data-driven model behaviors</li>
<li>Train models to understand the impact of data and data mixes</li>
<li>Align datasets with ethical and societal values</li>
</ul>
<p><strong>Accountabilities</strong></p>
<ul>
<li>Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.</li>
<li>Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.</li>
<li>Analyze real-world text datasets to assess quality, diversity, relevance, and identify areas for improvement.</li>
<li>Build lightweight tools and workflows for dataset auditing, visualization, and versioning.</li>
<li>Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.</li>
</ul>
<p><strong>The Candidate we&#39;re looking for</strong></p>
<p><strong>Experience:</strong></p>
<ul>
<li>Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.) OR equivalent experience.</li>
</ul>
<p><strong>Technical skills:</strong></p>
<ul>
<li>Proficiency in statistics and exploratory data analysis methods.</li>
<li>Familiarity with data processing frameworks such as Spark, Ray, or Apache Beam.</li>
</ul>
<p><strong>Personal attributes:</strong></p>
<ul>
<li>Ability to communicate technical findings clearly to research and product teams.</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Competitive salary</li>
<li>Comprehensive benefits package</li>
<li>Opportunities for professional growth and development</li>
<li>Collaborative and dynamic work environment</li>
<li>Access to cutting-edge technology and resources</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange>USD $119,800 – $234,700 per year</Salaryrange>
      <Skills>Python, Pandas, NumPy, Spark, Ray, Apache Beam, statistics, exploratory data analysis, data processing frameworks</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft AI</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft AI is a leading technology company that specializes in artificial intelligence, machine learning, and data science. They are known for their innovative products and services that empower individuals and organizations to achieve more. Microsoft AI is committed to pushing the boundaries of what is possible with AI and making it accessible to everyone.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-pretraining-text-data-2/</Applyto>
      <Location>Redmond</Location>
      <Country></Country>
      <Postedate>2026-03-06</Postedate>
    </job>
    <job>
      <externalid>365605e7-0ca</externalid>
      <Title>Member of Technical Staff, Data Research Engineer</Title>
      <Description><![CDATA[<p><strong>Summary</strong></p>
<p>Microsoft AI are looking for a talented Member of Technical Staff, Data Research Engineer to join their MAI Superintelligence Team in Zürich, Switzerland. In this role, you will create high-quality datasets for training and evaluation and develop scalable data pipelines for multimodal ingestion, pre-processing, filtering, and annotation.</p>
<p><strong>About the Role</strong></p>
<p>As a Data Research Engineer, you will be responsible for creating high-quality datasets for training and evaluation, running experiments on new datasets to assess their impact, and developing and maintaining scalable data pipelines for multimodal ingestion, pre-processing, filtering, and annotation. You will also analyze real-world multimodal datasets to assess their quality, diversity, and relevance and to identify areas for improvement. Additionally, you will build lightweight tools and workflows for dataset auditing, visualization, and versioning, and collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.</p>
<p><strong>Accountabilities</strong></p>
<ul>
<li>Create high-quality datasets for training and evaluation</li>
<li>Run experiments on new datasets to assess their impact and determine the most effective data</li>
<li>Develop and maintain scalable data pipelines for multimodal ingestion, pre-processing, filtering, and annotation</li>
<li>Analyze real-world multimodal datasets to assess their quality, diversity, and relevance and to identify areas for improvement</li>
<li>Build lightweight tools and workflows for dataset auditing, visualization, and versioning</li>
</ul>
<p><strong>The Candidate we&#39;re looking for</strong></p>
<p><strong>Experience:</strong></p>
<ul>
<li>Bachelor&#39;s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or a related technical field</li>
<li>Technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)</li>
</ul>
<p><strong>Technical skills:</strong></p>
<ul>
<li>Proficiency in statistics and exploratory data analysis methods</li>
<li>Experience in data analysis or data engineering</li>
</ul>
<p><strong>Personal attributes:</strong></p>
<ul>
<li>Ability to communicate technical findings effectively to research and product teams</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Competitive salary and benefits package</li>
<li>Opportunity to work with a leading technology company in the AI industry</li>
<li>Collaborative and dynamic work environment</li>
<li>Professional development opportunities</li>
</ul>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange></Salaryrange>
      <Skills>Python, Pandas, NumPy, statistics, data analysis, data engineering, Spark, Ray, Apache Beam, large-scale data processing</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft AI</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft AI is a leading technology company that specializes in artificial intelligence, machine learning, and data science. It is known for its innovative products and services that empower individuals and organizations to achieve more. Microsoft AI is committed to making a positive impact on society through its work in AI.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-data-research-engineer-mai-superintelligence-team-2/</Applyto>
      <Location>Zürich, Switzerland</Location>
      <Country></Country>
      <Postedate>2026-03-06</Postedate>
    </job>
    <job>
      <externalid>88f19c96-557</externalid>
      <Title>Member of Technical Staff, Data Research Engineer</Title>
      <Description><![CDATA[<p><strong>Summary</strong></p>
<p>Microsoft AI are looking for a talented Data Research Engineer to join their MAI Superintelligence Team in London. This role sits at the heart of strategic decision-making, turning market data into actionable insights for a company that&#39;s revolutionising AI technology. You&#39;ll work directly with leadership to shape the company&#39;s direction in the AI market.</p>
<p><strong>About the Role</strong></p>
<p>As a Data Research Engineer, you will be responsible for creating high-quality datasets for training and evaluation, running experiments on new datasets to assess their impact, and developing and maintaining scalable data pipelines for multimodal ingestion, pre-processing, filtering, and annotation. You will also analyse real-world multimodal datasets to assess their quality, diversity, and relevance and to identify areas for improvement.</p>
<p><strong>Accountabilities</strong></p>
<ul>
<li>Create high-quality datasets for training and evaluation</li>
<li>Run experiments on new datasets to assess their impact and determine the most effective data</li>
<li>Develop and maintain scalable data pipelines for multimodal ingestion, pre-processing, filtering, and annotation</li>
</ul>
<p><strong>The Candidate we&#39;re looking for</strong></p>
<p><strong>Experience:</strong></p>
<ul>
<li>Bachelor&#39;s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or a related technical field</li>
<li>Technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)</li>
</ul>
<p><strong>Technical skills:</strong></p>
<ul>
<li>Proficiency in statistics and exploratory data analysis methods</li>
<li>Familiarity with data processing frameworks such as Spark, Ray, Apache Beam</li>
</ul>
<p><strong>Personal attributes:</strong></p>
<ul>
<li>Ability to communicate technical findings effectively to research and product teams</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Competitive salary and benefits package</li>
<li>Opportunities for professional growth and development</li>
<li>Collaborative and dynamic work environment</li>
</ul>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange>Competitive salary and benefits package</Salaryrange>
      <Skills>Python, Pandas, NumPy, Spark, Ray, Apache Beam, Data processing frameworks, Machine learning algorithms</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft AI</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft AI is a leading technology company that specializes in artificial intelligence, machine learning, and data science. It is known for its innovative products and services that empower individuals and organizations to achieve more. Microsoft AI is committed to making a positive impact on society through its technology and research.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-data-research-engineer-mai-superintelligence-team/</Applyto>
      <Location>London</Location>
      <Country></Country>
      <Postedate>2026-03-06</Postedate>
    </job>
    <job>
      <externalid>63bd919b-7b3</externalid>
      <Title>Member of Technical Staff - Pretraining Text Data</Title>
      <Description><![CDATA[<p><strong>Summary</strong></p>
<p>Microsoft AI are looking for a talented Member of Technical Staff - Pretraining Text Data at their Mountain View office. This role sits at the heart of strategic decision-making, turning market data into actionable insights for a company that&#39;s revolutionising AI technology. You&#39;ll work directly with leadership to shape the company&#39;s direction in the AI market.</p>
<p><strong>About the Role</strong></p>
<p>In this role, you&#39;ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:</p>
<ul>
<li>Develop novel data collection strategies</li>
<li>Improve dataset quality and integrity</li>
<li>Understand data-driven model behaviors</li>
<li>Train models to understand the impact of data and data mixes</li>
<li>Align datasets with ethical and societal values</li>
</ul>
<p><strong>Accountabilities</strong></p>
<ul>
<li>Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.</li>
<li>Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.</li>
</ul>
<p><strong>The Candidate we&#39;re looking for</strong></p>
<p><strong>Experience:</strong></p>
<ul>
<li>Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.) OR equivalent experience.</li>
</ul>
<p><strong>Technical skills:</strong></p>
<ul>
<li>Proficiency in statistics and exploratory data analysis methods.</li>
<li>Familiarity with data processing frameworks such as Spark, Ray, or Apache Beam.</li>
</ul>
<p><strong>Personal attributes:</strong></p>
<ul>
<li>Ability to communicate technical findings clearly to research and product teams.</li>
</ul>
<p><strong>Benefits</strong></p>
<ul>
<li>Competitive salary</li>
<li>Comprehensive benefits package</li>
<li>Opportunities for professional growth and development</li>
<li>Collaborative and dynamic work environment</li>
</ul>
]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange>USD $119,800 – $234,700 per year</Salaryrange>
      <Skills>Python, Pandas, NumPy, Spark, Ray, Apache Beam, Data analysis, Data engineering, Machine learning</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Microsoft AI</Employername>
      <Employerlogo>https://logos.yubhub.co/microsoft.ai.png</Employerlogo>
      <Employerdescription>Microsoft AI is a leading technology company that specializes in artificial intelligence, machine learning, and data science. It is known for its innovative products and services that empower individuals and organizations to achieve more. Microsoft AI is committed to pushing the boundaries of what is possible with AI and making it accessible to everyone.</Employerdescription>
      <Employerwebsite>https://microsoft.ai</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://microsoft.ai/job/member-of-technical-staff-pretraining-text-data/</Applyto>
      <Location>Mountain View</Location>
      <Country></Country>
      <Postedate>2026-03-06</Postedate>
    </job>
  </jobs>
</source>