<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>6a24f057-4f1</externalid>
      <Title>Staff Production Engineer</Title>
      <Description><![CDATA[<p>The Production Engineering Tools team builds and operates foundational platforms that make CoreWeave&#39;s cloud reliable, observable, and scalable. We are hiring a Staff Production Engineer to design, build, and own the foundational platforms and frameworks that underpin operational excellence across CoreWeave.</p>
<p>In this role, you will combine deep technical leadership with hands-on engineering to create systems that improve availability, resiliency, and delivery velocity at scale. This is a high-impact role with broad organisational influence. You will develop a deep understanding of CoreWeave&#39;s infrastructure and services, shape architecture and tooling decisions, and partner closely with service owners to operationalise reliability through automation and paved paths rather than manual process or advocacy.</p>
<p>Success requires the ability to pivot quickly between hot incidents, multi-team programs, and initiatives at all levels of the organisation. You will design, build, and own foundational platforms and frameworks from architecture through adoption and operation. You will lead technical strategy and execution for internal tooling that reduces manual operations, improves delivery velocity, and supports CoreWeave&#39;s revenue growth through faster, more reliable datacentre delivery.</p>
<p>You will partner with service owners and platform teams to translate reliability and operational requirements into automation, self-service capabilities, and opinionated paved paths. You will build and evolve systems for observability, alerting, automated remediation, resiliency testing, and authoritative sources of truth, operationalising best practices through tooling rather than manual enforcement.</p>
<p>You will participate in incident response for critical outages with the explicit goal of improving systems, tooling, and defaults to reduce future operational load,not as a long-term escalation path. You will ship production code, participate in on-call rotations as needed, and mentor engineers on platform ownership, operational design, and sustainable production practices.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$188,000 to $275,000</Salaryrange>
      <Skills>distributed systems, cloud platforms, Kubernetes, observability, incident practices, metrics, tracing, structured logs, SLIs/SLOs, PIRs, foundational internal platforms, service tiering, disaster recovery, chaos engineering, structured resilience programs</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>CoreWeave</Employername>
      <Employerlogo>https://logos.yubhub.co/coreweave.com.png</Employerlogo>
      <Employerdescription>CoreWeave is a cloud computing company that provides a platform for building and scaling AI applications.</Employerdescription>
      <Employerwebsite>https://www.coreweave.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/coreweave/jobs/4644302006</Applyto>
      <Location>Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
  </jobs>
</source>