{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/operational-rhythm"},"x-facet":{"type":"skill","slug":"operational-rhythm","display":"Operational Rhythm","count":4},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_17320bff-7cb"},"title":"Head of Partner Growth, Embedded Payroll","description":"<p>About Gusto</p>\n<p>At Gusto, we&#39;re on a mission to grow the small business economy. We handle the hard stuff , payroll, health insurance, 401(k)s, and HR , so owners can focus on their craft and their customers.</p>\n<p>With teams in Denver, San Francisco, and New York, we support more than 500,000 small businesses nationwide and are building a workplace that reflects the people we serve.</p>\n<p>All full-time employees receive competitive base pay, benefits, and equity (RSUs) , because everyone who helps build Gusto should share in its success. Offer amounts are determined by role, level, and location. Learn more about our Total Rewards philosophy.</p>\n<p>AI is a fundamental part of how work gets done at Gusto. We expect all team members to actively engage with AI tools relevant to their role and grow their fluency as the technology evolves. AI experience requirements vary by role and will be assessed during the interview process.</p>\n<p><strong>The Opportunity</strong></p>\n<p>Gusto Embedded Payroll (GEP) is one of Gusto&#39;s highest-growth bets , a platform business that enables software companies to offer Gusto-powered payroll directly within their own products. We&#39;re looking for a Head of Partner Growth to lead the team responsible for making our embedded partners successful: driving go-to-market execution, growing partner revenue, and building the operational and strategic frameworks that will scale this business toward $100M+.</p>\n<p>This is a player-coach leadership role. You&#39;ll manage a cross-functional team spanning Partner Success Management (PSM) and Partner enablement, while also staying deeply hands-on with our most strategic partner relationships. You&#39;ll own the partner growth playbook end-to-end: from GTM planning and launch sequencing through ongoing optimization, escalation management, and executive relationship development.</p>\n<p>Gusto&#39;s partnerships organization focuses on evaluating and executing strategic partnerships in new categories. Gusto Embedded Payroll allows us to bring our people platform to thousands more businesses than those we serve directly and will play a key role in Gusto&#39;s growth.</p>\n<p><strong>What It&#39;s Like to Work in This Role</strong></p>\n<p><strong>Lead &amp; Develop</strong></p>\n<ul>\n<li>Build, lead, and develop a high-performing team of Partner Success Managers and Partner Enablement</li>\n<li>Set the operational rhythm for the partner growth function , including planning cadences, escalation workflows, and performance management</li>\n<li>Coach team members on partner strategy, executive communication, and cross-functional navigation</li>\n<li>Drive Enablement with the ideation, creation, and execution of enablement frameworks and materials.</li>\n<li>Drive AI fluency across the team , embedding AI tools and workflows into day-to-day partner operations, GTM planning, and internal knowledge management to accelerate output and decision quality</li>\n</ul>\n<p><strong>Strategize &amp; Plan</strong></p>\n<ul>\n<li>Own the partner growth strategy across the GEP portfolio, including partner segmentation, prioritization, and resource allocation</li>\n<li>Develop and maintain per-partner operating plans with clear success metrics, in collaboration with Product, Engineering, Marketing, and Sales Enablement</li>\n<li>Build repeatable GTM playbooks , covering co-marketing motions, sales enablement, content roadmaps, and launch sequencing , tailored to each partner&#39;s distribution model and ICP</li>\n<li>Translate portfolio-level data (revenue, attach rates, funnel performance) into strategic narratives for leadership</li>\n</ul>\n<p><strong>Operate &amp; Execute</strong></p>\n<ul>\n<li>Drive success for our partners and their end customers through adoption, activation, and retention of Gusto-powered embedded products</li>\n<li>Own the full marketing funnel and sales motion with partners , from top-of-funnel demand generation through conversion, onboarding, and expansion</li>\n<li>Manage complex, high-stakes partner relationships directly , including contract negotiations, executive escalations, and cross-functional alignment on roadmap priorities</li>\n<li>Lead partner incident response and escalation management with structured, cross-functional workflows</li>\n</ul>\n<p><strong>Build for Scale</strong></p>\n<ul>\n<li>Create and expand the feedback loop between partners, their end users, and Gusto&#39;s internal product, marketing, engineering, and operations teams</li>\n<li>Design frameworks, templates, and processes that enable the team to manage a growing partner portfolio without linear headcount growth</li>\n<li>Identify and implement AI-powered workflows and tools that improve team efficiency, partner reporting, and knowledge management</li>\n</ul>\n<p><strong>What We&#39;re Looking For</strong></p>\n<ul>\n<li>12-15 years of experience, with 8+ years in a people management role and 7+ years in partner-facing or channel-facing roles</li>\n<li>Partner-side perspective: Has operated on the partner or channel side of the table , understands how partners evaluate, prioritize, and activate embedded or referral relationships</li>\n<li>Farmer mentality: Wired to grow and deepen existing partner relationships through trust-building, strategic account planning, and proactive expansion , not just hunting new deals</li>\n<li>Methodical operator: Builds repeatable playbooks, escalation frameworks, and operational rhythms rather than relying on one-off heroics. Strong systems thinker</li>\n<li>GTM playbook experience: Hands-on experience building go-to-market plans from scratch , including co-marketing, keyword strategies, content roadmaps, and launch sequencing</li>\n<li>Full-funnel fluency: Understands the complete marketing and sales journey from demand gen through conversion, activation, and retention. Can diagnose where a partner motion is leaking value and prescribe the right intervention</li>\n<li>AI fluency: Actively uses AI tools (e.g., LLMs, automation platforms) to accelerate workflows, improve decision quality, and build team capability. Comfortable leading an AI adoption agenda within a team</li>\n<li>Executive communicator: Strong ghostwriter and strategic communicator who can represent the partnership function credibly with C-suite stakeholders , both internally and at partner organizations</li>\n<li>Strong sense of ownership and resilience: Thrives in ambiguity, takes initiative, and drives outcomes without waiting for permission</li>\n<li>Exceptional written and verbal communication: Can shift register fluently between partner-facing emails, internal Slack, leadership documents, and board-level narratives</li>\n</ul>\n<p><strong>Strong Nice-to-Haves</strong></p>\n<ul>\n<li>Experience with embedded or platform business models (APIs, developer tools, B2B2B)</li>\n<li>Knowledge of payroll, HR tech, or fintech software ecosystems</li>\n<li>Experience managing partner P&amp;Ls, cost-to-manage models, or portfolio-level financial analysis</li>\n<li>&quot;Gets&quot; the building-mode opportunity , has scaled a partner function from early stage, not just inherited a mature program</li>\n</ul>\n<p>Our cash compensation amount for this role is targeted at:</p>\n<p>$230,215 - $270,500 in San Francisco, CA; New York, NY $195,745 - $230,000 in Denver, CO; Phoenix, AZ; Atlanta, GA; Chicago, IL; Las Vegas, NV</p>\n<p>If you are outside of the geographic areas above, your application will not be considered at this time. Final offer amounts are determined by multiple factors including candidate experience and expertise and may vary from the amounts listed above.</p>\n<p>Gusto has physical office spaces in Denver, San Francisco, and New York City. Employees who are based in those locations will be expected to work from the office on designated days approximately 2-3 days per week (or more depending on role). The same office expectations apply to all Symmetry roles, Gusto&#39;s subsidiary, whose physical office is in Scottsdale. Note: The San Francisco office expectations encompass both the San Francisco and San Jose metro areas. When approved to work from a location other than a Gusto office, a secure, reliable, and consistent internet connection is required. This includes non-office days for hybrid employees.</p>\n<p>Our customers come from all walks of life and so do we. We hire great people from a wide variety of backgrounds, not just because</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_17320bff-7cb","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Gusto","sameAs":"https://www.gusto.com/","logo":"https://logos.yubhub.co/gusto.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/gusto/jobs/7819484","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$230,215 - $270,500","x-skills-required":["partner growth","embedded payroll","partner success management","partner enablement","AI fluency","GTM planning","launch sequencing","escalation management","executive relationship development","cross-functional navigation","operational rhythm","performance management","repeatable playbooks","sales enablement","content roadmaps","co-marketing motions","demand generation","conversion","onboarding","expansion","partner incident response","structured workflows","feedback loop","frameworks","templates","processes","AI-powered workflows","team efficiency","partner reporting","knowledge management"],"x-skills-preferred":[],"datePosted":"2026-04-24T12:17:07.108Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Denver, CO;San Francisco, CA;New York, NY;Las Vegas, NV;Atlanta, GA;Chicago, IL;Phoenix, AZ"}},"employmentType":"FULL_TIME","occupationalCategory":"Sales","industry":"Technology","skills":"partner growth, embedded payroll, partner success management, partner enablement, AI fluency, GTM planning, launch sequencing, escalation management, executive relationship development, cross-functional navigation, operational rhythm, performance management, repeatable playbooks, sales enablement, content roadmaps, co-marketing motions, demand generation, conversion, onboarding, expansion, partner incident response, structured workflows, feedback loop, frameworks, templates, processes, AI-powered workflows, team efficiency, partner reporting, knowledge management","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":230215,"maxValue":270500,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d08d38d2-b72"},"title":"Engineering Manager, Agent Prompts & Evals","description":"<p><strong>About the Role</strong></p>\n<p>Anthropic is looking for an Engineering Manager to lead the Agent Prompts &amp; Evals team. This team owns the infrastructure that lets Anthropic ship model and prompt changes with confidence , the eval frameworks, system prompt pipelines, and regression-detection systems that every model launch depends on.</p>\n<p>When a new Claude model is ready to ship, this team is the one answering “is it actually better in our products?” When a product team wants to change how Claude behaves, this team owns the tooling that tells them whether they broke something. It’s a platform team whose platform is model behavior itself.</p>\n<p>The team sits deliberately at the seam between product engineering and research. You’ll partner closely with other evals groups across the company on shared infrastructure and methodology, with product teams who are shipping features on top of Claude, and with the TPMs and research PMs driving model launches. The pace is set by the model release cadence, and the team operates as both a platform owner and a hands-on partner during launch periods.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Lead and grow a team of prompt engineers and platform software engineers</li>\n<li>Own the product-side eval platform: the frameworks, dashboards, bulk runners, and CI integrations that product teams use to measure Claude’s behavior and catch regressions before they ship</li>\n<li>Own system prompt infrastructure: versioning, deployment, rollback, and review tooling for the prompts that run in production across claude.ai, the API, and agentic surfaces</li>\n<li>Be a steady hand through model launches , these are the team’s highest-stakes operational moments and the EM is the backstop when things get chaotic</li>\n<li>Build durable collaboration with other evals groups across the company; this means real work on ownership boundaries, shared roadmaps, and avoiding tragedy-of-the-commons on shared eval infrastructure</li>\n<li>Recruit, close, and retain engineers who want to work at the intersection of product engineering and model behavior</li>\n<li>Shape where the team invests next: there are credible paths into frontier eval development, model launch automation, and deeper prompt engineering support, and part of the job is sequencing them</li>\n<li>Push the team toward measuring things that are hard to measure , behavioral drift, prompt quality, harness parity , not just things that are easy</li>\n</ul>\n<p><strong>You May Be a Good Fit If You Have</strong></p>\n<ul>\n<li>8+ years in software engineering with 3+ years managing engineering teams, including experience leading a platform, infra, or developer-tooling team where your customers were other engineers</li>\n<li>A track record of building “pits of success” , tooling and process that made it easy for other teams to do the right thing without needing to understand all the details</li>\n<li>Comfort managing a team with a mixed charter: platform ownership, service-to-other-teams, and a launch-driven operational rhythm, all at once</li>\n<li>Enough technical depth to engage on system design, review pipeline architecture, and be credible in debates with strong ICs , you don’t need to be writing code by hand every day, but you should be able to read it, review it, and be comfortable leveraging Claude to understand, design, and occasionally build.</li>\n<li>A product mindset and willingness to wear multiple hats when the work calls for it</li>\n<li>Demonstrated ability to build and maintain peer relationships with partner orgs that have different cultures and incentives , negotiating ownership, aligning roadmaps, and holding ground when it matters without being territorial about it</li>\n<li>Experience recruiting and closing senior ICs in a competitive market</li>\n</ul>\n<p><strong>Strong Candidates May Also Have</strong></p>\n<ul>\n<li>Prior exposure to LLM evals, ML experimentation platforms, or model quality work , even tangentially</li>\n<li>Experience with A/B testing infrastructure, feature flagging, or gradual rollout systems</li>\n<li>Background in devtools, CI/CD platforms, or testing infrastructure at scale</li>\n<li>A history of managing teams that sit between two larger orgs and making that position an asset rather than a liability</li>\n<li>Interest in AI safety and alignment , not required, but it makes the “why” of the work land harder</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<ul>\n<li>Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience</li>\n<li>Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience</li>\n<li>Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position</li>\n<li>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</li>\n<li>Visa sponsorship: We do sponsor visas! However, we aren’t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</li>\n</ul>\n<p><strong>How we’re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact , advancing our long-term goals of steerable, trustworthy AI , rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We’re an extremely collaborative group, and we host frequent research discussions</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d08d38d2-b72","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5159608008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$320,000-$405,000 USD","x-skills-required":["software engineering","team management","platform ownership","service-to-other-teams","launch-driven operational rhythm","system design","pipeline architecture","product mindset","recruiting and closing senior ICs"],"x-skills-preferred":["LLM evals","ML experimentation platforms","model quality work","A/B testing infrastructure","feature flagging","gradual rollout systems","devtools","CI/CD platforms","testing infrastructure at scale"],"datePosted":"2026-04-18T15:54:35.018Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"software engineering, team management, platform ownership, service-to-other-teams, launch-driven operational rhythm, system design, pipeline architecture, product mindset, recruiting and closing senior ICs, LLM evals, ML experimentation platforms, model quality work, A/B testing infrastructure, feature flagging, gradual rollout systems, devtools, CI/CD platforms, testing infrastructure at scale","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":320000,"maxValue":405000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_a738803a-64f"},"title":"Head of Enterprise Marketing Strategy & Analytics","description":"<p><strong>About the Role</strong></p>\n<p>This foundational leadership role will build and lead the Enterprise Marketing Strategy &amp; Analytics function, serving as the operating system for a rapidly scaling marketing organisation. The primary mandate is to define and measure success across all marketing programmes,from demand generation (field events, ABM, EBCs, partner co-marketing) to pipeline contribution,creating a clear line from investment to pipeline to revenue.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Define and own the Enterprise Marketing measurement framework, targets, and reporting, covering the full funnel from top-of-funnel demand through pipeline influence and closed-won attribution.</li>\n<li>Build and maintain core analytics infrastructure (data models, attribution logic, dashboards) in partnership with Revenue Operations and Data Science, ensuring marketing and sales alignment on key metrics.</li>\n<li>Serve as one of the primary operating partner to Finance, HR, and Recruiting, leading budget tracking, headcount planning, and vendor management.</li>\n<li>Partner with marketing leadership and the central Marketing Ops &amp; Strategy team on annual and quarterly planning, resource allocation, and performance reviews.</li>\n<li>Establish the operating cadence for Enterprise Marketing (QBRs, pipeline reviews, program retros), coordinating with the central Marketing Ops &amp; Strategy team on organisation-wide rhythms, and drive the preparation needed to make these forums decision-useful.</li>\n<li>Lead the identification of high-leverage workflows to automate, partnering with the central GTM AI team on implementation and measuring productivity gains.</li>\n<li>Build and manage the Marketing Operations, Demand Analytics, and MarTech team, setting a high bar for analytical rigor and business partnership.</li>\n<li>Drive cross-functional alignment on shared definitions, tooling, and a single source of truth for marketing performance across the broader Marketing organisation and with Revenue Operations.</li>\n<li>Conduct strategic analyses to inform key organisational decisions, such as resource deployment, coverage ratios, and campaign capacity planning.</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>10+ years in marketing operations, analytics, revenue operations, or strategy roles, including at least 3 years leading a team.</li>\n<li>Experience building or significantly scaling a marketing ops/analytics function at a high-growth B2B technology company undergoing significant organisational expansion.</li>\n<li>Deep fluency in the enterprise demand funnel, including lead scoring, MQL/SQL definitions, pipeline attribution, and campaign influence models.</li>\n<li>Hands-on expertise with the modern GTM data stack (CRM, Marketing Automation, BI tools).</li>\n<li>Proven track record of strategic partnership with Finance and Revenue Operations, including experience building budget models and sitting in planning cycles.</li>\n<li>Expertise in running the core operational rhythm of a marketing organisation: QBRs, headcount tracking, budget pacing, and vendor renewals.</li>\n<li>Strong written and verbal communication, capable of translating complex datasets into clear business narratives.</li>\n<li>Genuine curiosity about AI and a willingness to be an early, hands-on adopter of automation tools in your team’s workflows.</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<p>Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. Visa sponsorship: We do sponsor visas!</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_a738803a-64f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5169101008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$400,000-$400,000 USD","x-skills-required":["marketing operations","analytics","revenue operations","strategy","demand generation","field events","ABM","EBCs","partner co-marketing","pipeline contribution","marketing programmes","investment","pipeline","revenue","measurement framework","targets","reporting","funnel","top-of-funnel demand","pipeline influence","closed-won attribution","core analytics infrastructure","data models","attribution logic","dashboards","Data Science","marketing","sales","alignment","key metrics","budget tracking","headcount planning","vendor management","marketing leadership","central Marketing Ops & Strategy team","annual planning","quarterly planning","resource allocation","performance reviews","operating cadence","QBRs","pipeline reviews","program retros","organisation-wide rhythms","decision-useful","high-leverage workflows","automation","GTM AI team","implementation","productivity gains","Demand Analytics","MarTech team","analytical rigor","business partnership","cross-functional alignment","shared definitions","tooling","single source of truth","marketing performance","strategic analyses","resource deployment","coverage ratios","campaign capacity planning","lead scoring","MQL/SQL definitions","pipeline attribution","campaign influence models","modern GTM data stack","CRM","Marketing Automation","BI tools","strategic partnership","Finance","budget models","planning cycles","core operational rhythm","headcount tracking","budget pacing","vendor renewals","written communication","verbal communication","complex datasets","business narratives","AI","automation tools"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:52:21.649Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Marketing","industry":"Technology","skills":"marketing operations, analytics, revenue operations, strategy, demand generation, field events, ABM, EBCs, partner co-marketing, pipeline contribution, marketing programmes, investment, pipeline, revenue, measurement framework, targets, reporting, funnel, top-of-funnel demand, pipeline influence, closed-won attribution, core analytics infrastructure, data models, attribution logic, dashboards, Data Science, marketing, sales, alignment, key metrics, budget tracking, headcount planning, vendor management, marketing leadership, central Marketing Ops & Strategy team, annual planning, quarterly planning, resource allocation, performance reviews, operating cadence, QBRs, pipeline reviews, program retros, organisation-wide rhythms, decision-useful, high-leverage workflows, automation, GTM AI team, implementation, productivity gains, Demand Analytics, MarTech team, analytical rigor, business partnership, cross-functional alignment, shared definitions, tooling, single source of truth, marketing performance, strategic analyses, resource deployment, coverage ratios, campaign capacity planning, lead scoring, MQL/SQL definitions, pipeline attribution, campaign influence models, modern GTM data stack, CRM, Marketing Automation, BI tools, strategic partnership, Finance, budget models, planning cycles, core operational rhythm, headcount tracking, budget pacing, vendor renewals, written communication, verbal communication, complex datasets, business narratives, AI, automation tools","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":400000,"maxValue":400000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0806749e-694"},"title":"Engineering Manager, Agent Prompts & Evals","description":"<p><strong>About the Role</strong></p>\n<p>Anthropic is looking for an Engineering Manager to lead the Agent Prompts &amp; Evals team. This team owns the infrastructure that lets Anthropic ship model and prompt changes with confidence , the eval frameworks, system prompt pipelines, and regression-detection systems that every model launch depends on.</p>\n<p>When a new Claude model is ready to ship, this team is the one answering “is it actually better in our products?” When a product team wants to change how Claude behaves, this team owns the tooling that tells them whether they broke something. It’s a platform team whose platform is model behavior itself.</p>\n<p>The team sits deliberately at the seam between product engineering and research. You’ll partner closely with other evals groups across the company on shared infrastructure and methodology, with product teams who are shipping features on top of Claude, and with the TPMs and research PMs driving model launches. The pace is set by the model release cadence, and the team operates as both a platform owner and a hands-on partner during launch periods.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Lead and grow a team of prompt engineers and platform software engineers</li>\n<li>Own the product-side eval platform: the frameworks, dashboards, bulk runners, and CI integrations that product teams use to measure Claude’s behavior and catch regressions before they ship</li>\n<li>Own system prompt infrastructure: versioning, deployment, rollback, and review tooling for the prompts that run in production across claude.ai, the API, and agentic surfaces</li>\n<li>Be a steady hand through model launches , these are the team’s highest-stakes operational moments and the EM is the backstop when things get chaotic</li>\n<li>Build durable collaboration with other evals groups across the company; this means real work on ownership boundaries, shared roadmaps, and avoiding tragedy-of-the-commons on shared eval infrastructure</li>\n<li>Recruit, close, and retain engineers who want to work at the intersection of product engineering and model behavior</li>\n<li>Shape where the team invests next: there are credible paths into frontier eval development, model launch automation, and deeper prompt engineering support, and part of the job is sequencing them</li>\n<li>Push the team toward measuring things that are hard to measure , behavioral drift, prompt quality, harness parity , not just things that are easy</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>8+ years in software engineering with 3+ years managing engineering teams, including experience leading a platform, infra, or developer-tooling team where your customers were other engineers</li>\n<li>A track record of building “pits of success” , tooling and process that made it easy for other teams to do the right thing without needing to understand all the details</li>\n<li>Comfort managing a team with a mixed charter: platform ownership, service-to-other-teams, and a launch-driven operational rhythm, all at once</li>\n<li>Enough technical depth to engage on system design, review pipeline architecture, and be credible in debates with strong ICs , you don’t need to be writing code by hand every day, but you should be able to read it, review it, and be comfortable leveraging Claude to understand, design, and occasionally build.</li>\n<li>A product mindset and willingness to wear multiple hats when the work calls for it</li>\n<li>Demonstrated ability to build and maintain peer relationships with partner orgs that have different cultures and incentives , negotiating ownership, aligning roadmaps, and holding ground when it matters without being territorial about it</li>\n<li>Experience recruiting and closing senior ICs in a competitive market</li>\n</ul>\n<p><strong>Nice to Have</strong></p>\n<ul>\n<li>Prior exposure to LLM evals, ML experimentation platforms, or model quality work , even tangentially</li>\n<li>Experience with A/B testing infrastructure, feature flagging, or gradual rollout systems</li>\n<li>Background in devtools, CI/CD platforms, or testing infrastructure at scale</li>\n<li>A history of managing teams that sit between two larger orgs and making that position an asset rather than a liability</li>\n<li>Interest in AI safety and alignment , not required, but it makes the “why” of the work land harder</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<ul>\n<li>Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience</li>\n<li>Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience</li>\n<li>Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position</li>\n<li>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</li>\n<li>Visa sponsorship: We do sponsor visas! However, we aren’t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0806749e-694","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5159608008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$320,000-$405,000 USD","x-skills-required":["Software engineering","Team management","Platform ownership","Service-to-other-teams","Launch-driven operational rhythm","System design","Pipeline architecture","Product mindset","Peer relationships","Recruiting and closing senior ICs"],"x-skills-preferred":["LLM evals","ML experimentation platforms","Model quality work","A/B testing infrastructure","Feature flagging","Gradual rollout systems","Devtools","CI/CD platforms","Testing infrastructure","AI safety and alignment"],"datePosted":"2026-04-18T15:39:18.064Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Software engineering, Team management, Platform ownership, Service-to-other-teams, Launch-driven operational rhythm, System design, Pipeline architecture, Product mindset, Peer relationships, Recruiting and closing senior ICs, LLM evals, ML experimentation platforms, Model quality work, A/B testing infrastructure, Feature flagging, Gradual rollout systems, Devtools, CI/CD platforms, Testing infrastructure, AI safety and alignment","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":320000,"maxValue":405000,"unitText":"YEAR"}}}]}