{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/incident-management"},"x-facet":{"type":"skill","slug":"incident-management","display":"Incident Management","count":45},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_a6c6e1c7-2a8"},"title":"Assistant Manager, SOX IT Lead","description":"<p>As the Assistant Manager, SOX IT Lead, you will lead the design, implementation, monitoring, and testing of IT General Controls (ITGC) and IT Application Controls (ITAC) under SOX compliance for American Honda Finance Corporation. This role ensures robust governance and risk management practices to mitigate risks and support the overall reliability of financial reporting by serving as the primary SME for complex IT control environments, system architectures, and emerging technologies impacting AHFC&#39;s SOX compliance.</p>\n<p>Key responsibilities will include:</p>\n<ul>\n<li>Leading the planning, execution, and monitoring of ITGC and ITAC for annual SOX compliance activities.</li>\n<li>Acting as the primary liaison between AHM IT GRC, CT IT, internal auditors, and external auditors for ITGC and ITAC Testing.</li>\n<li>Maintaining Risk Control Matrices (RCMS), data flow diagrams, and control documentation.</li>\n<li>Collaborating on technology projects to ensure SOX compliance requirements are integrated.</li>\n<li>Providing guidance and training to CH IT and AHFC Management on SOX requirements and control expectations.</li>\n</ul>\n<p>&#39;\\n To be successful in this role, you will need:</p>\n<ul>\n<li>A minimum of 8-10 years of experience in IT Audit, IT compliance, or IT risk management.</li>\n<li>Strong understanding of SOX, ITGCs, and frameworks such as COBIT, COSO, NIST.</li>\n<li>Experience working with ERP Systems.</li>\n<li>Experience in a public company or Big 4 audit environment.</li>\n<li>Experience as a technical SME for IT controls.</li>\n</ul>\n<p>&#39;\\n In addition to the above requirements, you will also need to possess excellent communication and stakeholder management skills, as well as the ability to interpret technical concepts and translate them into control requirements.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_a6c6e1c7-2a8","directApply":true,"hiringOrganization":{"@type":"Organization","name":"American Honda Finance Corporation","sameAs":"https://careers.honda.com","logo":"https://logos.yubhub.co/careers.honda.com.png"},"x-apply-url":"https://careers.honda.com/us/en/job/10377/Asst-Manager-SOX-IT-Lead","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$94,900.00 - $142,400.00","x-skills-required":["SOX","ITGC","ITAC","COBIT","COSO","NIST","ERP Systems","public company","Big 4 audit environment","technical SME"],"x-skills-preferred":["cloud environments","AWS","Azure","logical access","change","backup","incident management","application controls"],"datePosted":"2026-04-22T17:24:09.349Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Torrance"}},"employmentType":"FULL_TIME","occupationalCategory":"Finance","industry":"Finance","skills":"SOX, ITGC, ITAC, COBIT, COSO, NIST, ERP Systems, public company, Big 4 audit environment, technical SME, cloud environments, AWS, Azure, logical access, change, backup, incident management, application controls","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":94900,"maxValue":142400,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_dcc8a1d6-5a5"},"title":"Implementation Director","description":"<p>Asia &amp; Middle East Technology Our team partners with the businesses to build the platforms, systems, and products that our customers use every day. We keep people&#39;s money and data safe, and are at the forefront of driving innovation for our businesses, customers, and colleagues.</p>\n<p>In this role, you will define and own the overall implementation and cutover strategy, ensuring alignment across business and technology. You will develop comprehensive plans covering parallel run, big bang migration, and contingency scenarios. You will also lead execution of cutover activities, including worst-case scenario planning, rehearsals, and post-go-live hypercare operating model.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Defining and owning the overall implementation and cutover strategy</li>\n<li>Developing comprehensive plans covering parallel run, big bang migration, and contingency scenarios</li>\n<li>Leading execution of cutover activities, including worst-case scenario planning, rehearsals, and post-go-live hypercare operating model</li>\n<li>Ensuring robust mitigation steps for risks and issues</li>\n<li>Bringing together complex dependencies across all workstreams</li>\n<li>Ensuring BAU change is interlocked with the programme in broader implementation planning</li>\n</ul>\n<p>To be successful in the role, you should have technology expertise in delivering Big Bang migrations on key services, experience of leading post implementation activities and Incident Management, highly effective communication skills, business impact expertise, and driving partnership across Business, internal Technology and third party teams.</p>\n<p>You&#39;ll achieve more at HSBC. HSBC is committed to building a culture where all employees are valued, respected and opinions count. We take pride in providing a workplace that fosters continuous professional development, flexible working and opportunities to grow within and inclusive and diverse environment.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_dcc8a1d6-5a5","directApply":true,"hiringOrganization":{"@type":"Organization","name":"HSBC","sameAs":"https://portal.careers.hsbc.com","logo":"https://logos.yubhub.co/portal.careers.hsbc.com.png"},"x-apply-url":"https://portal.careers.hsbc.com/careers/job/563774610174523","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Big Bang migrations","Implementation strategy","Cutover planning","Risk management","Incident management","Communication skills","Business impact analysis","Partnership building"],"x-skills-preferred":[],"datePosted":"2026-04-18T22:10:49.743Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Shanghai"}},"employmentType":"FULL_TIME","occupationalCategory":"IT","industry":"Finance","skills":"Big Bang migrations, Implementation strategy, Cutover planning, Risk management, Incident management, Communication skills, Business impact analysis, Partnership building"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9938f384-dc3"},"title":"Major Incident Manager","description":"<p>As a Major Incident Manager at HSBC Technology &amp; Services (USA) Inc., you will be responsible for triaging and prioritising major IT services incidents, facilitating service recovery and business engagement, and issuing Incident Reports and Major Incident Notifications.</p>\n<p>You will contribute to Incident/Major Incident Reviews and execute incident management practice, participating in management escalation calls.</p>\n<p>This role requires you to be on-call 24x7 every 3rd week, plus some holidays, and work up to 60% remotely.</p>\n<p>The successful candidate will have a bachelor&#39;s degree in Information Technology, Computer Science, Computer Engineering, Electronic Engineering, or a related field, and 4 years of related work experience.</p>\n<p>Key responsibilities include: Triaging and managing major IT service incidents Analyzing IT service incident root causes and solutions Communicating major IT service incident status to business stakeholders Creating reports using ServiceNow Following Information Technology Infrastructure Library (ITIL) practices, framework and standards</p>\n<p>In return, you will have access to a competitive Total Reward Package, including a robust Wellness Hub, and tailored professional development opportunities to ensure you have the right skills for today and tomorrow.</p>\n<p>You will be empowered to drive HSBC&#39;s engagement with the communities we serve through an industry-leading volunteerism policy, a generous matching gift program, and a comprehensive program of immersive Sustainability and Climate Change Initiatives.</p>\n<p>Pay Range: $120,000.00 to $130,000.00 per year.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9938f384-dc3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"HSBC Technology & Services (USA) Inc.","sameAs":"https://portal.careers.hsbc.com","logo":"https://logos.yubhub.co/portal.careers.hsbc.com.png"},"x-apply-url":"https://portal.careers.hsbc.com/careers/job/563774610161843","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$120,000.00 to $130,000.00 per year","x-skills-required":["ServiceNow","ITIL","Incident Management","Communication","Analytical Skills"],"x-skills-preferred":[],"datePosted":"2026-04-18T22:09:19.191Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Buffalo"}},"employmentType":"FULL_TIME","occupationalCategory":"IT","industry":"Finance","skills":"ServiceNow, ITIL, Incident Management, Communication, Analytical Skills","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":120000,"maxValue":130000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c81cbaa1-56a"},"title":"Engineering Technical Program Manager - W&B Platform","description":"<p>The Weights &amp; Biases (W&amp;B) team builds the developer platform trusted by machine learning practitioners to track, manage, and scale their ML workflows. As a Technical Program Manager focused on platform reliability and release management, you&#39;ll be at the centre of our platform&#39;s growth and stability.</p>\n<p>You will partner with engineering teams within W&amp;B and CoreWeave AI/ML Platform Services (AMPS) to ensure W&amp;B integrates seamlessly into the broader ML ecosystem, while maintaining high reliability and predictable releases.</p>\n<p>This role is ideal for someone who thrives in cross-functional environments, has a strong grasp of developer workflows, and excels at creating repeatable, reliable program structures that scale.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Drive end-to-end program management for critical platform initiatives.</li>\n<li>Build and run release management processes, ensuring predictable and high-quality delivery cycles.</li>\n<li>Partner with engineering and product to define success metrics, manage risks, and ensure on-time delivery.</li>\n<li>Build and scale incident management and RCA processes for W&amp;B services.</li>\n<li>Improve the predictability and visibility of releases across teams, introducing dashboards, retrospectives, and program forums.</li>\n<li>Collaborate with TPMs and engineering leaders across W&amp;B and CoreWeave to ensure end-to-end reliability across the ML developer stack.</li>\n</ul>\n<p><strong>Qualifications</strong></p>\n<ul>\n<li>Bachelor&#39;s degree in a technical field or equivalent experience.</li>\n<li>5+ years of program management experience in SaaS, developer tools, or ML/AI platforms.</li>\n<li>Proven experience running release management programs and incident management processes.</li>\n<li>Strong technical fluency in cloud computing, developer workflows, and CI/CD practices.</li>\n<li>Excellent communication and facilitation skills with diverse technical and non-technical audiences.</li>\n<li>Track record of improving reliability, efficiency, and predictability in software delivery.</li>\n</ul>\n<p><strong>Additional Qualifications</strong></p>\n<ul>\n<li>Familiarity with ML workflows, model training/inference, and developer productivity tools.</li>\n<li>Experience building integrations between SaaS platforms, APIs, and cloud services.</li>\n<li>Strong background in reliability engineering practices and DevOps program leadership.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c81cbaa1-56a","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4610109006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$177,000 to $237,000","x-skills-required":["cloud computing","developer workflows","CI/CD practices","program management","release management","incident management","reliability engineering"],"x-skills-preferred":["ML workflows","model training/inference","developer productivity tools","integration between SaaS platforms, APIs, and cloud services"],"datePosted":"2026-04-18T15:56:43.785Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"cloud computing, developer workflows, CI/CD practices, program management, release management, incident management, reliability engineering, ML workflows, model training/inference, developer productivity tools, integration between SaaS platforms, APIs, and cloud services","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":177000,"maxValue":237000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c4e35d55-5d1"},"title":"Technical Program Manager, Safeguards (Infrastructure & Evals)","description":"<p>Job Title: Technical Program Manager, Safeguards (Infrastructure &amp; Evals)</p>\n<p>About Anthropic</p>\n<p>Anthropic&#39;s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.</p>\n<p>About the Role</p>\n<p>Safeguards Engineering builds and operates the infrastructure that keeps Anthropic&#39;s AI systems safe in production , the classifiers, detection pipelines, evaluation platforms, and monitoring systems that sit between our models and the real world. That infrastructure needs to be not just correct, but reliable: when a safety-critical pipeline goes down or degrades, the consequences can be serious, and they can be invisible until someone looks closely.</p>\n<p>As a Technical Program Manager for Safeguards Infrastructure and Evals, you&#39;ll own the operational health and forward momentum of this stack. Your primary responsibility is driving reliability , owning the incident-response and post-mortem process, ensuring SLOs are defined and met in partnership with various teams, and making sure that when things go wrong, the right people know, the right actions get taken, and those actions actually get closed out.</p>\n<p>Alongside that ongoing operational rhythm, you&#39;ll coordinate the larger platform investments: migrations, eval-platform improvements, and the cross-team dependencies that connect them. This role sits at the intersection of operations and program management. It requires genuine technical depth , you need to understand how these systems work well enough to triage effectively, judge what&#39;s actually safety-critical versus what can wait, and have informed conversations with the engineers building and maintaining them. But the core of the job is keeping the machine running well and the work moving.</p>\n<p>What You&#39;ll Do:</p>\n<ul>\n<li>Own the Safeguards Engineering ops review</li>\n<li>Drive the recurring cadence that keeps the team informed and coordinated: surfacing recent incidents and failures, bringing visibility to reliability trends, and making sure the right people are in the room when decisions need to be made.</li>\n<li>Drive incident tracking and post-mortem execution</li>\n<li>Establish and maintain SLOs with partner teams</li>\n<li>Maintain runbook quality and incident-ownership clarity</li>\n<li>Drive platform migrations and infrastructure projects</li>\n<li>Coordinate evals platform improvements</li>\n</ul>\n<p>You might be a good fit if you:</p>\n<ul>\n<li>Have solid technical program management experience, particularly in operational or infrastructure-heavy environments , you&#39;re comfortable owning a mix of ongoing operational cadences and discrete project work simultaneously.</li>\n<li>Understand how production ML systems work well enough to triage incidents intelligently and have substantive conversations with engineers about what&#39;s going wrong and why , you don&#39;t need to write the code, but you need to follow the technical thread.</li>\n<li>Are energized by closing loops. Post-mortem action items that never get done, SLOs that no one checks, runbooks that go stale , these things bother you, and you know how to build the processes and follow-ups that fix them.</li>\n<li>Can work effectively across team boundaries , comfortable coordinating with partner teams (like Inference) where you don&#39;t have direct authority, and skilled at keeping shared work moving through influence and clear communication.</li>\n<li>Thrive in environments where the work shifts between &#39;keep the lights on&#39; and &#39;build something new&#39; , and can context-switch between incident follow-ups and longer-horizon platform projects without dropping either.</li>\n<li>Have experience with or strong interest in AI safety , you understand why the reliability of a safety-critical pipeline is a different kind of problem than the reliability of a product feature, and that distinction motivates you.</li>\n</ul>\n<p>Strong candidates may also:</p>\n<ul>\n<li>Have experience with SRE practices, incident management frameworks, or on-call operations at scale.</li>\n<li>Have worked on or with evaluation infrastructure for ML systems , understanding how evals get designed, run, and interpreted.</li>\n<li>Have experience driving infrastructure migrations in complex, multi-team environments , particularly where the migration touches operational systems that can&#39;t go offline.</li>\n<li>Be familiar with monitoring and alerting tooling (PagerDuty, Datadog, or equivalents) and the operational culture around them.</li>\n</ul>\n<p>Deadline to apply: None, applications will be received on a rolling basis.</p>\n<p>The annual compensation range for this role is listed below. For sales roles, the range provided is the role&#39;s On Target Earnings (&#39;OTE&#39;) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.</p>\n<p>Annual Salary: $290,000-$365,000 USD</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c4e35d55-5d1","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5108695008","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$290,000-$365,000 USD","x-skills-required":["Technical Program Management","Operational or Infrastructure-heavy environments","Production ML systems","Incident management frameworks","On-call operations","Evaluation infrastructure for ML systems","Infrastructure migrations","Monitoring and alerting tooling"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:56:34.910Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY | Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Program Management, Operational or Infrastructure-heavy environments, Production ML systems, Incident management frameworks, On-call operations, Evaluation infrastructure for ML systems, Infrastructure migrations, Monitoring and alerting tooling","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":290000,"maxValue":365000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9d8d91da-52f"},"title":"Enterprise Risk Management Lead","description":"<p>About Gusto</p>\n<p>At Gusto, we&#39;re on a mission to grow the small business economy. We handle the hard stuff , payroll, health insurance, 401(k)s, and HR , so owners can focus on their craft and their customers.</p>\n<p>With teams in Denver, San Francisco, and New York, we support more than 400,000 small businesses nationwide and are building a workplace that reflects the people we serve.</p>\n<p>All full-time employees receive competitive base pay, benefits, and equity (RSUs) , because everyone who helps build Gusto should share in its success. Offer amounts are determined by role, level, and location. Learn more about our Total Rewards philosophy.</p>\n<p>AI is a fundamental part of how work gets done at Gusto. We expect all team members to actively engage with AI tools relevant to their role and grow their fluency as the technology evolves. AI experience requirements vary by role and will be assessed during the interview process.</p>\n<p>About the Role:</p>\n<p>Gusto is scaling our AI-powered risk function to support a complex, multi-entity business operating in highly regulated environments. As the Enterprise Risk Management Lead, you will own and operate Gusto&#39;s Enterprise Risk and Third Party Risk Management programs , built AI-first, designed to scale, and built to enable the business to move fast without breaking things.</p>\n<p>This is a People Empowerer (manager) role. You balance hands-on program leadership with managing and developing a team of compliance professionals. You navigate the tension between &quot;doing the work&quot; and &quot;leading the work&quot; , contributing directly to complex, high-impact programs while ensuring your team delivers with excellence.</p>\n<p>You are a change agent who influences how automated risk management gets done at Gusto, models AI-enabled ways of working, and helps others grow their own capabilities in the process.</p>\n<p>You will champion the adoption of AI, machine learning, and process automation across risk monitoring, control testing, incident management, and reporting , and you will partner with Product, Data Science, and Engineering to make it explainable, adopted, compliant, and scalable.</p>\n<p>Here’s what you’ll do day-to-day:</p>\n<p>You manage initiatives that are complex in both scope and impact, influencing the strategic direction of Gusto&#39;s compliance risk management framework.</p>\n<p>You apply a deep understanding of the regulatory landscape and how it intersects with Gusto&#39;s business model to proactively design and lead cross-functional risk programs.</p>\n<p>You translate complex risk topics into clear, actionable guidance that senior leaders can immediately understand and operationalize.</p>\n<p>You lead cross-functional working groups, align divergent perspectives, and drive cohesive progress toward shared goals , with minimal oversight.</p>\n<p>As a PE, you balance individual risk and compliance contribution with team leadership.</p>\n<p>You manage operations, professional development, resource allocation, and performance , while staying close enough to the work to be a credible, hands-on partner to your team and stakeholders.</p>\n<p>You model responsible AI use, and act as a source of knowledge and mentorship , supporting your team&#39;s AI journey and helping others apply it responsibly and effectively.</p>\n<p>AI-Enabled Risk Operations, Innovation &amp; Transformation</p>\n<p>This is how you and your team operate , not a side project.</p>\n<ul>\n<li>Champion the adoption of AI, machine learning, process automation, and advanced analytics to improve risk monitoring, control testing, and reporting across ERM, TPRM, and broader compliance functions</li>\n</ul>\n<ul>\n<li>Lead the integration of AI and automation into every phase of the risk lifecycle: vendor assessments, document ingestion and analysis, continuous monitoring and alerting, risk scoring, prioritization, and trend analysis</li>\n</ul>\n<ul>\n<li>Build intelligent risk monitoring and evaluation systems , including auto-tagging for risk issues, audit requests, and regulatory changes , that improve real-time visibility and eliminate manual effort across the enterprise risk portfolio</li>\n</ul>\n<ul>\n<li>Drive the digitalization of risk tools including RCSAs, KRIs, incident reporting, and audit tracking , transforming periodic, reactive processes into continuous intelligence systems with live leading and lagging indicators that enable real-time decision-making</li>\n</ul>\n<ul>\n<li>Partner with Product, Data Science, and Engineering to define requirements for AI-driven workflows, decisioning engines, and dashboards , ensuring explainability, auditability, and regulatory defensibility of all AI-enabled risk decisions</li>\n</ul>\n<ul>\n<li>Design and build intelligent dashboards and reporting tools that deliver real-time risk visibility and decision-quality insights to senior leadership and cross-functional stakeholders</li>\n</ul>\n<ul>\n<li>Design AI workflows with appropriate validation loops, human-in-the-loop checkpoints, and guardrails , ensuring outputs are reliable, governable, and meet regulatory standards before being used to frame risks, recommendations, or decisions</li>\n</ul>\n<ul>\n<li>Stay current on AI advancements and emerging technologies and proactively integrate new capabilities into team operations to increase velocity and scale</li>\n</ul>\n<ul>\n<li>Model responsible AI use , supporting ICs in their AI journeys and fostering a culture of intentional experimentation, accountability, and continuous improvement</li>\n</ul>\n<p>Enterprise Risk Management</p>\n<ul>\n<li>Design, implement, and continuously improve Gusto&#39;s ERM framework, ensuring alignment with best practices and Gusto&#39;s stage of growth and strategic priorities across all entities</li>\n</ul>\n<ul>\n<li>Define and maintain Gusto&#39;s enterprise risk taxonomy, risk appetite statement, and key risk indicators spanning operational, regulatory, technology, financial, and reputational risk domains</li>\n</ul>\n<ul>\n<li>Lead Gusto&#39;s Enterprise Risk Management process , driving integration of risk practices across business functions, promoting a proactive risk culture, and ensuring incident management, root cause analysis, and lessons learned are systematically captured in an automated, AI forward way.</li>\n</ul>\n<ul>\n<li>Apply AI-assisted insights to enterprise risk datasets to surface systemic patterns, validate assumptions, prioritize risks, and deliver proactive, data-driven advisory to senior leadership</li>\n</ul>\n<ul>\n<li>Monitor the regulatory landscape (OCC, FDIC, CFPB, SEC, FINRA, GDPR, NIST, ISO, SOC) and leverage AI to proactively incorporate changes before they become compliance gaps</li>\n</ul>\n<ul>\n<li>Act as a key advisor to senior compliance leadership , translating complex risk findings into clear, actionable recommendations with minimal oversight</li>\n</ul>\n<p>Third Party Risk Management (TPRM)</p>\n<ul>\n<li>Design, implement, and independently manage a high-impact, AI-first TPRM program with clear milestones, progress tracking, and measurable outcomes across all Gusto entities</li>\n</ul>\n<ul>\n<li>Manage the full third-party risk lifecycle , onboarding and risk profiling, periodic assessments, issue management, corrective action tracking, and offboarding , across suppliers, product partners, contractors, service providers, and cloud service providers , and do so in an AI and automated way.</li>\n</ul>\n<ul>\n<li>Maintain a centralized, authoritative vendor risk inventory and risk register, ensuring real-time visibility into Gusto&#39;s third-party risk posture</li>\n</ul>\n<ul>\n<li>Conduct periodic AI-driven audits and reviews of third-party compliance with contractual obligations and regulatory standards, identifying patterns that inform continuous program improvement</li>\n</ul>\n<ul>\n<li>Serve as the central orchestrator across Compliance, Security, Legal, Procurement, IT, and GRC for proactive and reactive third-party incident management</li>\n</ul>\n<ul>\n<li>Own Gusto&#39;s TPRM policy and maintain comprehensive documentation , risk assessments, audit findings, corrective actions , ensuring full accountability and traceability</li>\n</ul>\n<p>People Leadership &amp; Team Development</p>\n<ul>\n<li>Balance individual compliance contribution with team leadership , managing operations, professional development, resource allocation, and performance while staying close to the work</li>\n</ul>\n<ul>\n<li>Coach and develop ICs toward next</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9d8d91da-52f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Gusto","sameAs":"https://www.gusto.com/","logo":"https://logos.yubhub.co/gusto.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/gusto/jobs/7746997","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Risk Management","Compliance","AI","Machine Learning","Process Automation","Advanced Analytics","Risk Monitoring","Control Testing","Incident Management","Reporting","Vendor Assessments","Document Ingestion","Analysis","Continuous Monitoring","Alerting","Risk Scoring","Prioritization","Trend Analysis","RCSAs","KRIs","Incident Reporting","Audit Tracking","AI-Driven Workflows","Decisioning Engines","Dashboards","Explainability","Auditability","Regulatory Defensibility","Intelligent Dashboards","Reporting Tools","Real-Time Risk Visibility","Decision-Quality Insights","Senior Leadership","Cross-Functional Stakeholders","Validation Loops","Human-in-the-Loop Checkpoints","Guardrails","Reliable Outputs","Governable Outputs","Regulatory Standards","AI Advancements","Emerging Technologies","Velocity","Scale","Responsible AI Use","ICs","AI Journeys","Accountability","Continuous Improvement","ERM Framework","Best Practices","Gusto's Stage of Growth","Strategic Priorities","Enterprise Risk Taxonomy","Risk Appetite Statement","Key Risk Indicators","Operational Risk","Regulatory Risk","Technology Risk","Financial Risk","Reputational Risk","Root Cause Analysis","Lessons Learned","Automated AI Forward Way","AI-Assisted Insights","Systemic Patterns","Assumptions","Proactive Advisory","Regulatory Landscape","OCC","FDIC","CFPB","SEC","FINRA","GDPR","NIST","ISO","SOC","Proactive Incorporation","Compliance Gaps","Key Advisor","Senior Compliance Leadership","Complex Risk Findings","Clear Actionable Recommendations","Minimally Supervised","High-Impact AI-First TPRM Program","Clear Milestones","Progress Tracking","Measurable Outcomes","Third-Party Risk Lifecycle","Onboarding","Risk Profiling","Periodic Assessments","Issue Management","Corrective Action Tracking","Offboarding","Suppliers","Product Partners","Contractors","Service Providers","Cloud Service Providers","AI and Automated Way","Centralized Vendor Risk Inventory","Risk Register","Real-Time Visibility","Third-Party Risk Posture","Periodic Audits","Reviews","Contractual Obligations","Patterns","Continuous Program Improvement","Central Orchestrator","Security","Legal","Procurement","IT","GRC","Proactive Incident Management","Reactive Incident Management","TPRM Policy","Comprehensive Documentation","Risk Assessments","Audit Findings","Corrective Actions","Traceability","Balance Individual Contribution","Team Leadership","Operations","Professional Development","Resource Allocation","Performance","Close to the Work","Coach and Develop ICs","Next Level"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:56:16.772Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Denver, CO;San Francisco, CA;New York, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Legal","industry":"Finance","skills":"Risk Management, Compliance, AI, Machine Learning, Process Automation, Advanced Analytics, Risk Monitoring, Control Testing, Incident Management, Reporting, Vendor Assessments, Document Ingestion, Analysis, Continuous Monitoring, Alerting, Risk Scoring, Prioritization, Trend Analysis, RCSAs, KRIs, Incident Reporting, Audit Tracking, AI-Driven Workflows, Decisioning Engines, Dashboards, Explainability, Auditability, Regulatory Defensibility, Intelligent Dashboards, Reporting Tools, Real-Time Risk Visibility, Decision-Quality Insights, Senior Leadership, Cross-Functional Stakeholders, Validation Loops, Human-in-the-Loop Checkpoints, Guardrails, Reliable Outputs, Governable Outputs, Regulatory Standards, AI Advancements, Emerging Technologies, Velocity, Scale, Responsible AI Use, ICs, AI Journeys, Accountability, Continuous Improvement, ERM Framework, Best Practices, Gusto's Stage of Growth, Strategic Priorities, Enterprise Risk Taxonomy, Risk Appetite Statement, Key Risk Indicators, Operational Risk, Regulatory Risk, Technology Risk, Financial Risk, Reputational Risk, Root Cause Analysis, Lessons Learned, Automated AI Forward Way, AI-Assisted Insights, Systemic Patterns, Assumptions, Proactive Advisory, Regulatory Landscape, OCC, FDIC, CFPB, SEC, FINRA, GDPR, NIST, ISO, SOC, Proactive Incorporation, Compliance Gaps, Key Advisor, Senior Compliance Leadership, Complex Risk Findings, Clear Actionable Recommendations, Minimally Supervised, High-Impact AI-First TPRM Program, Clear Milestones, Progress Tracking, Measurable Outcomes, Third-Party Risk Lifecycle, Onboarding, Risk Profiling, Periodic Assessments, Issue Management, Corrective Action Tracking, Offboarding, Suppliers, Product Partners, Contractors, Service Providers, Cloud Service Providers, AI and Automated Way, Centralized Vendor Risk Inventory, Risk Register, Real-Time Visibility, Third-Party Risk Posture, Periodic Audits, Reviews, Contractual Obligations, Patterns, Continuous Program Improvement, Central Orchestrator, Security, Legal, Procurement, IT, GRC, Proactive Incident Management, Reactive Incident Management, TPRM Policy, Comprehensive Documentation, Risk Assessments, Audit Findings, Corrective Actions, Traceability, Balance Individual Contribution, Team Leadership, Operations, Professional Development, Resource Allocation, Performance, Close to the Work, Coach and Develop ICs, Next Level"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ca221b6f-dca"},"title":"Technical Program Manager, Safeguards (Infrastructure & Evals)","description":"<p><strong>About the Role</strong></p>\n<p>Safeguards Engineering builds and operates the infrastructure that keeps Anthropic&#39;s AI systems safe in production. As a Technical Program Manager for Safeguards Infrastructure and Evals, you&#39;ll own the operational health and forward momentum of this stack.</p>\n<p>Your primary responsibility is driving reliability , owning the incident-response and post-mortem process, ensuring SLOs are defined and met in partnership with various teams, and making sure that when things go wrong, the right people know, the right actions get taken, and those actions actually get closed out.</p>\n<p>Alongside that ongoing operational rhythm, you&#39;ll coordinate the larger platform investments: migrations, eval-platform improvements, and the cross-team dependencies that connect them.</p>\n<p>This role sits at the intersection of operations and program management. It requires genuine technical depth , you need to understand how these systems work well enough to triage effectively, judge what&#39;s actually safety-critical versus what can wait, and have informed conversations with the engineers building and maintaining them.</p>\n<p>But the core of the job is keeping the machine running well and the work moving.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Own the Safeguards Engineering ops review</li>\n<li>Drive the recurring cadence that keeps the team informed and coordinated: surfacing recent incidents and failures, bringing visibility to reliability trends, and making sure the right people are in the room when decisions need to be made.</li>\n<li>Drive incident tracking and post-mortem execution</li>\n<li>Establish and maintain SLOs with partner teams</li>\n<li>Maintain runbook quality and incident-ownership clarity</li>\n<li>Drive platform migrations and infrastructure projects</li>\n<li>Coordinate evals platform improvements</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>Solid technical program management experience, particularly in operational or infrastructure-heavy environments</li>\n<li>Understanding of how production ML systems work well enough to triage incidents intelligently and have substantive conversations with engineers about what&#39;s going wrong and why</li>\n<li>Ability to work effectively across team boundaries</li>\n<li>Experience with or strong interest in AI safety</li>\n</ul>\n<p><strong>Nice to Have</strong></p>\n<ul>\n<li>Experience with SRE practices, incident management frameworks, or on-call operations at scale</li>\n<li>Familiarity with monitoring and alerting tooling (PagerDuty, Datadog, or equivalents)</li>\n<li>Experience driving infrastructure migrations in complex, multi-team environments</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ca221b6f-dca","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://anthropic.ai/","logo":"https://logos.yubhub.co/anthropic.ai.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5108695008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$290,000-$365,000 USD","x-skills-required":["Technical Program Management","Operational or Infrastructure-heavy Environments","Production ML Systems","Incident Tracking and Post-Mortem Execution","Service-Level Objectives (SLOs)","Runbook Quality and Incident-Ownership Clarity","Platform Migrations and Infrastructure Projects","Evals Platform Improvements"],"x-skills-preferred":["SRE Practices","Incident Management Frameworks","On-Call Operations at Scale","Monitoring and Alerting Tooling","Infrastructure Migrations in Complex, Multi-Team Environments"],"datePosted":"2026-04-18T15:55:20.655Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY | Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Program Management, Operational or Infrastructure-heavy Environments, Production ML Systems, Incident Tracking and Post-Mortem Execution, Service-Level Objectives (SLOs), Runbook Quality and Incident-Ownership Clarity, Platform Migrations and Infrastructure Projects, Evals Platform Improvements, SRE Practices, Incident Management Frameworks, On-Call Operations at Scale, Monitoring and Alerting Tooling, Infrastructure Migrations in Complex, Multi-Team Environments","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":290000,"maxValue":365000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f2970275-8b3"},"title":"Incident Manager - Detection & Response","description":"<p><strong>About the Role</strong></p>\n<p>The Detection &amp; Response (D&amp;R) team plays a critical role in protecting our systems, users, and data from security threats. We’re looking for an experienced Technical Program Manager to own and evolve incident management within D&amp;R.</p>\n<p>You’ll be the driving force behind maturing and scaling our incident response lifecycle,from detection and triage through containment, remediation, and post-incident review. Critically, some of the highest-impact work in this role happens after the immediate response: gathering data on incident trends, reporting on patterns and root causes, and working cross-functionally across engineering, security, infrastructure, and product teams to ensure that broad fixes and systemic improvements are actually implemented.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Own the end-to-end D&amp;R incident management program: detection workflows, response processes, escalation paths, communication standards, and remediation tracking.</li>\n</ul>\n<ul>\n<li>Serve as incident commander for security incidents, driving clear coordination across executive, engineering, security, legal, and other appropriate stakeholders.</li>\n</ul>\n<ul>\n<li>Establish and run incident commander rotations within D&amp;R, ensuring clear ownership and effective coordination during incidents of varying severity.</li>\n</ul>\n<ul>\n<li>Drive post-incident accountability by defining how action items are captured, assigned, tracked, and completed across teams,ensuring follow-through on both tactical fixes and strategic improvements.</li>\n</ul>\n<ul>\n<li>Gather, analyse, and report on incident trends and patterns to surface systemic risks, recurring root causes, and areas where the organisation is most vulnerable.</li>\n</ul>\n<ul>\n<li>Translate trend analysis into actionable cross-functional initiatives: partner with engineering, infrastructure, security, and product teams to prioritise and implement broad fixes and preventive improvements that address root causes rather than symptoms.</li>\n</ul>\n<ul>\n<li>Lead incident review forums (post-mortems, retrospectives) and ensure learnings are captured, socialised, and acted upon across the organisation.</li>\n</ul>\n<ul>\n<li>Develop and maintain D&amp;R incident response documentation, playbooks, runbooks, and training materials; keep them current as the threat landscape and our systems evolve.</li>\n</ul>\n<ul>\n<li>Partner with detection engineering to improve alert fidelity, reduce noise, and shorten time-to-detection for security events.</li>\n</ul>\n<ul>\n<li>Define, develop, and track incident management KPIs and report regularly to D&amp;R and Security leadership.</li>\n</ul>\n<ul>\n<li>Support broad cross-functional training and initiatives to uplevel security awareness across the company (e.g. Tabletop exercises, training, talks).</li>\n</ul>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have 7+ years of experience in technical program management, incident management, or security operations, with significant time spent in a detection &amp; response or security incident response context.</li>\n</ul>\n<ul>\n<li>Have led or built incident response programs at a technology company, ideally in a high-growth or security-intensive environment.</li>\n</ul>\n<ul>\n<li>Have a demonstrated track record of turning incident data into organisational improvements,not just writing post-mortems, but driving the cross-functional work to implement systemic fixes.</li>\n</ul>\n<ul>\n<li>Are comfortable participating in on-call responsibilities and leading incident response during high-severity security events, including off-hours.</li>\n</ul>\n<ul>\n<li>Have experience building and scaling operational processes from the ground up in environments where structure didn’t previously exist.</li>\n</ul>\n<ul>\n<li>Excel at driving accountability and follow-through across multiple teams without direct authority,you know how to influence, track, and close the loop.</li>\n</ul>\n<ul>\n<li>Have strong analytical skills and experience with incident trend analysis, metrics reporting, and data-driven prioritisation.</li>\n</ul>\n<ul>\n<li>Are highly organised with a knack for bringing structure to ambiguous, fast-moving situations.</li>\n</ul>\n<ul>\n<li>Have excellent communication skills, especially under pressure and when coordinating across technical and non-technical stakeholders, including executive leadership.</li>\n</ul>\n<ul>\n<li>Thrive in fast-paced environments where priorities shift and you’re often working with incomplete information.</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<p>Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience</p>\n<p>Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience</p>\n<p>Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position</p>\n<p>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</p>\n<p>Visa sponsorship: We do sponsor visas! However, we aren’t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</p>\n<p><strong>How we’re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact , advancing our long-term goals of steerable, trustworthy AI , rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We’re an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.</p>\n<p>The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI &amp; Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.</p>\n<p><strong>Come work with us!</strong></p>\n<p>Anthropic is a public benefit corporation</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f2970275-8b3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5176570008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Technical Program Management","Incident Management","Security Operations","Detection & Response","Security Incident Response","Cross-functional collaboration","Data analysis","Metrics reporting","Communication","Leadership"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:54:24.369Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Zürich, CH"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Program Management, Incident Management, Security Operations, Detection & Response, Security Incident Response, Cross-functional collaboration, Data analysis, Metrics reporting, Communication, Leadership"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_86fc5e64-9f1"},"title":"Incident Manager - Detection & Response","description":"<p>We&#39;re looking for an experienced Technical Program Manager to own and evolve incident management within the Detection &amp; Response (D&amp;R) team. The role involves maturing and scaling our incident response lifecycle, from detection and triage through containment, remediation, and post-incident review. You&#39;ll be responsible for driving clear coordination across executive, engineering, security, legal, and other appropriate stakeholders. Your goal will be to ensure that we get meaningfully better after each incident.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Own the end-to-end D&amp;R incident management program: detection workflows, response processes, escalation paths, communication standards, and remediation tracking.</li>\n<li>Serve as incident commander for security incidents, driving clear coordination across executive, engineering, security, legal, and other appropriate stakeholders.</li>\n<li>Establish and run incident commander rotations within D&amp;R, ensuring clear ownership and effective coordination during incidents of varying severity.</li>\n<li>Drive post-incident accountability by defining how action items are captured, assigned, tracked, and completed across teams,ensuring follow-through on both tactical fixes and strategic improvements.</li>\n<li>Gather, analyse, and report on incident trends and patterns to surface systemic risks, recurring root causes, and areas where the organisation is most vulnerable.</li>\n<li>Translate trend analysis into actionable cross-functional initiatives: partner with engineering, infrastructure, security, and product teams to prioritise and implement broad fixes and preventive improvements that address root causes rather than symptoms.</li>\n<li>Lead incident review forums (post-mortems, retrospectives) and ensure learnings are captured, socialised, and acted upon across the organisation.</li>\n<li>Develop and maintain D&amp;R incident response documentation, playbooks, runbooks, and training materials; keep them current as the threat landscape and our systems evolve.</li>\n<li>Partner with detection engineering to improve alert fidelity, reduce noise, and shorten time-to-detection for security events.</li>\n<li>Define, develop, and track incident management KPIs and report regularly to D&amp;R and Security leadership.</li>\n<li>Support broad cross-functional training and initiatives to uplevel security awareness across the company (e.g. Tabletop exercises, training, talks).</li>\n</ul>\n<p>You may be a good fit if you:</p>\n<ul>\n<li>Have 7+ years of experience in technical program management, incident management, or security operations, with significant time spent in a detection &amp; response or security incident response context.</li>\n<li>Have led or built incident response programs at a technology company, ideally in a high-growth or security-intensive environment.</li>\n<li>Have a demonstrated track record of turning incident data into organisational improvements,not just writing post-mortems, but driving the cross-functional work to implement systemic fixes.</li>\n<li>Are comfortable participating in on-call responsibilities and leading incident response during high-severity security events, including off-hours.</li>\n<li>Have experience building and scaling operational processes from the ground up in environments where structure didn’t previously exist.</li>\n<li>Excel at driving accountability and follow-through across multiple teams without direct authority,you know how to influence, track, and close the loop.</li>\n<li>Have strong analytical skills and experience with incident trend analysis, metrics reporting, and data-driven prioritisation.</li>\n<li>Are highly organised with a knack for bringing structure to ambiguous, fast-moving situations.</li>\n<li>Have excellent communication skills, especially under pressure and when coordinating across technical and non-technical stakeholders, including executive leadership.</li>\n<li>Thrive in fast-paced environments where priorities shift and you’re often working with incomplete information.</li>\n</ul>\n<p>The annual compensation range for this role is $320,000-$405,000 USD.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_86fc5e64-9f1","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://anthropic.com","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5176481008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$320,000-$405,000 USD","x-skills-required":["Technical Program Management","Incident Management","Security Operations","Detection & Response","Cross-functional Team Leadership","Communication","Analytical Skills","Data-driven Prioritisation","Incident Trend Analysis","Metrics Reporting"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:53:23.634Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Program Management, Incident Management, Security Operations, Detection & Response, Cross-functional Team Leadership, Communication, Analytical Skills, Data-driven Prioritisation, Incident Trend Analysis, Metrics Reporting","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":320000,"maxValue":405000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ebf95cea-76b"},"title":"Technical Escalation Manager","description":"<p>As a Technical Escalation Manager at Databricks, you will be responsible for coordinating efforts to resolve critical customer issues, customer-impacting situations, and major incidents. You will work with multiple internal teams (engineering, product management, Customer Success Engineering, and Support) and external partners to effectively resolve these customer-impacting situations.</p>\n<p>Your key responsibilities will include:</p>\n<ul>\n<li>Managing support escalation in partnership with engineering, product management, Customer Success Engineering, Support, Customers, and Partners until resolution.</li>\n<li>Achieving customer satisfaction by ensuring incidents or escalations (and related cases) are well and fully documented with the timely execution of action items.</li>\n<li>Creating and executing a data-driven customer recovery plan for every escalation and incident that is addressed.</li>\n<li>Utilizing business and technical skills to manage customer escalations, coordinate meetings and deliverables, and analyze trends and patterns for reporting purposes.</li>\n<li>Using data, metrics, and feedback to inform operational and tactical decisions that improve incident and escalation management.</li>\n<li>Coordinating all necessary resources to fast-track and resolve new incidents and escalations from customers with a clear and detailed plan.</li>\n</ul>\n<p>We are looking for a candidate with a minimum of 8+ years of experience in customer support, escalation, SRE, or incident management. You should have excellent contextual interpretation and writing skills, as well as the ability to effectively summarize and communicate to both technical and business audiences.</p>\n<p>You will also need experience with a &#39;Distributed big data Computing&#39; environment, SQL-based databases, as well as data warehousing and ETL technologies such as Informatica, DataStage, Oracle, Teradata, SQL Server, and MySQL. Linux/Unix administration skills, networking, and Hands-on Cloud experience with AWS, Azure, or GCP are required.</p>\n<p>Experience working cross-functionally with support, engineering, product management, and directly with customers; ability to deeply understand product and customer personas is also essential.</p>\n<p>A Bachelor&#39;s or Master&#39;s degree in Computer Science or Computer Engineering, or related Engineering field is preferred. Written and spoken proficiency in both Japanese and English is also required.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ebf95cea-76b","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com/","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8407911002","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["customer support","escalation","SRE","incident management","distributed big data computing","SQL-based databases","data warehousing","ETL technologies","Linux/Unix administration","networking","cloud experience"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:51:57.996Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Japan"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"customer support, escalation, SRE, incident management, distributed big data computing, SQL-based databases, data warehousing, ETL technologies, Linux/Unix administration, networking, cloud experience"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f038a181-fa9"},"title":"Command Center Technician","description":"<p>The Global Data Center Operations team serves as the backbone of our infrastructure, ensuring the seamless performance of our global hyperscale environment. Operating in a high-stakes, 24/7 setting, this team is responsible for safeguarding the availability, stability, and reliability of mission-critical systems that power our most essential services.</p>\n<p>As a Command Center Technician, you will serve as the front-line mission control for our global data center fleet. In a 24/7 operations environment, you will be responsible for real-time monitoring, coordination, and incident response across critical electrical, mechanical, and environmental systems.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Providing continuous 24/7 monitoring of global data center infrastructure systems using BMS, EPMS, DCIM, and other monitoring platforms.</li>\n<li>Monitoring critical assets including UPS, generators, switchgear, chillers, and fire suppression systems.</li>\n<li>Serving as the first responder for infrastructure alarms, triaging incidents and initiating response actions per SOPs, MOPs, and EOPs.</li>\n<li>Escalating incidents promptly to on-site operations, engineering teams, and leadership based on defined escalation matrices.</li>\n<li>Coordinating incident response activities across multiple teams to minimize risk to production environments.</li>\n<li>Supporting root cause analysis (RCA) efforts by providing detailed timelines, logs, and incident documentation.</li>\n<li>Acting as a central communication hub, providing clear and accurate status updates during incidents and maintenance events.</li>\n</ul>\n<p>To succeed in this role, you will need:</p>\n<ul>\n<li>2+ years of experience working in mission-critical environments (data centers, utilities, or industrial operations).</li>\n<li>Foundational technical knowledge of data center electrical and mechanical infrastructure systems.</li>\n<li>Proven experience following and executing SOPs, MOPs, and EOPs in high-availability environments.</li>\n<li>Proficiency with monitoring tools, ticketing systems, and operational dashboards.</li>\n<li>Experience in incident management, technical troubleshooting, and structured escalation.</li>\n<li>Ability to work rotating shifts, including nights, weekends, and holidays, in a 24/7 operations environment.</li>\n<li>Availability to work a flexible schedule within a 24/7 environment.</li>\n<li>Excellent time management, organizational, and communication skills.</li>\n<li>Must be able to prioritize tasks and react quickly to issues.</li>\n</ul>\n<p>Preferred qualifications include:</p>\n<ul>\n<li>4+ years of experience working in mission-critical environments</li>\n<li>Prior experience in a hyperscale or large-scale data center environment.</li>\n<li>Hands-on technical background in electrical (UPS, switchgear) or mechanical (chillers, CRAC units) systems.</li>\n<li>Familiarity with specific BMS, EPMS, and DCIM platforms.</li>\n<li>Experience working in a centralized Command Center or Network Operations Center (NOC).</li>\n</ul>\n<p>At CoreWeave, we believe in investing in our people and value candidates who can bring their own diversified experiences to our teams. If you&#39;re a motivated and detail-oriented individual who is passionate about delivering exceptional results, we&#39;d love to hear from you.</p>\n<p>We offer a competitive salary range of $75,000 to $100,000, plus a comprehensive benefits package that includes medical, dental, and vision insurance, 401(k) matching, and paid time off.</p>\n<p>If you&#39;re interested in joining our team, please submit your application, including your resume and a cover letter, to [insert contact information].</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f038a181-fa9","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4668468006","x-work-arrangement":"onsite","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$75,000 to $100,000","x-skills-required":["BMS","EPMS","DCIM","UPS","generators","switchgear","chillers","fire suppression systems","incident management","technical troubleshooting","structured escalation"],"x-skills-preferred":["hands-on technical background in electrical (UPS, switchgear) or mechanical (chillers, CRAC units) systems","familiarity with specific BMS, EPMS, and DCIM platforms","experience working in a centralized Command Center or Network Operations Center (NOC)"],"datePosted":"2026-04-18T15:51:30.295Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Kenilworth, New Jersey"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"BMS, EPMS, DCIM, UPS, generators, switchgear, chillers, fire suppression systems, incident management, technical troubleshooting, structured escalation, hands-on technical background in electrical (UPS, switchgear) or mechanical (chillers, CRAC units) systems, familiarity with specific BMS, EPMS, and DCIM platforms, experience working in a centralized Command Center or Network Operations Center (NOC)","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":75000,"maxValue":100000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d34bbf18-2b2"},"title":"Senior Site Reliability Engineer (FinOps) - Platform","description":"<p>As a Senior Site Reliability Engineer (FinOps) - Platform, you will be part of the Platform Engineering department, responsible for designing, building, scaling, and maturing the multi-cloud platform for hosting internal and external services. You will lead technical initiatives for automating system engineering efforts to guarantee the reliability of the global Elastic infrastructure. You will also grow our global Platform infrastructure to meet the increasing scaling demands by developing and maintaining software, tooling, and automations.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Taking an engineering approach in leading technical initiatives for automating system engineering efforts to guarantee the reliability of the global Elastic infrastructure.</li>\n<li>Growing our global Platform infrastructure to meet the increasing scaling demands by developing and maintaining software, tooling, and automations.</li>\n<li>Using an inclusive approach at championing an environment focused on collaboration, operational excellence, and uplifting others.</li>\n<li>Responding to and preventing repeated customer impact in response to major incidents and prioritized problem management.</li>\n</ul>\n<p>The ideal candidate will have success and lessons of experiences from striving for &#39;progress not perfection&#39; in the name of Platform reliability. They will have a background in software engineering to collaborate with engineers to expertly identify, implement, and deliver solutions. An experience in public cloud and managed Kubernetes services is advantageous.</p>\n<p>The role requires passion for developing solutions that involve inclusive communication methods to grow and strengthen partner and team relationships. Examples of working in distributed teams or working remotely is desirable.</p>\n<p>Bonus points for experience in operating a SaaS product in a public cloud, building or operating a Kubernetes-at-scale infrastructure, writing non-trivial programs in Golang or other programming languages, working with containerized services, leading and improving alerting and major incident management standard processes metrics systems, and experience in system administration with professional skills in Linux on distributed systems at scale.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d34bbf18-2b2","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Elastic","sameAs":"https://www.elastic.co/","logo":"https://logos.yubhub.co/elastic.co.png"},"x-apply-url":"https://job-boards.greenhouse.io/elastic/jobs/7565188","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Cloud computing","Kubernetes","Golang","Containerization","Linux","System administration","Alerting and incident management"],"x-skills-preferred":["Infrastructure-as-Code","Terraform","Crossplane","Distributed systems","Self-organizing teams"],"datePosted":"2026-04-18T15:49:53.439Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Spain"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Cloud computing, Kubernetes, Golang, Containerization, Linux, System administration, Alerting and incident management, Infrastructure-as-Code, Terraform, Crossplane, Distributed systems, Self-organizing teams"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f838587f-1ee"},"title":"Software Engineer, Kubernetes","description":"<p>We&#39;re looking for a skilled Software Engineer to join our team and help us build and scale our Kubernetes environment. As a Software Engineer, you will play a key part in ensuring the availability, reliability, and scalability of our cloud infrastructure. You will drive operational excellence, implement robust automation, and help shape the systems that keep our cloud running smoothly.</p>\n<p>Key Responsibilities:</p>\n<ul>\n<li>Build, operate, and scale Kubernetes-based production infrastructure that delivers our products with high reliability and performance.</li>\n<li>Develop automation, tooling, and infrastructure as code in Go and other infrastructure-focused languages to enable zero-touch operations, rapid recovery, and seamless deployments.</li>\n<li>Design, implement, and maintain monitoring, alerting, and observability solutions,leveraging the Grafana ecosystem and related tools,to proactively identify and resolve production issues.</li>\n<li>Drive incident response efforts, participate in on-call rotations, and lead root cause analysis to prevent recurrence and improve incident handling processes.</li>\n<li>Partner with internal and cross-functional teams to ensure platform capabilities meet rigorous operational requirements and customer SLAs.</li>\n<li>Engineer for resiliency, implementing best practices for redundancy, fault tolerance, and disaster recovery across complex distributed systems.</li>\n<li>Advocate for security, reliability, and performance improvements throughout the stack, continuously seeking opportunities to strengthen operational standards.</li>\n<li>Contribute to the development of custom Kubernetes operators and intelligent orchestration frameworks that optimize AI workload performance and resource utilization at scale.</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>3+ years of experience in production engineering, SRE, or large-scale infrastructure/platform roles.</li>\n<li>Knowledgeable in Kubernetes administration, container orchestration, and microservices architectures, with a bias for automating every aspect of operations.</li>\n<li>Proven track record managing high-uptime, customer-facing systems in a fast-moving environment, with experience delivering measurable improvements in reliability and performance.</li>\n<li>Experience in monitoring, observability, and incident management using tools like Prometheus, Grafana, Datadog, Splunk, Loki, or VictoriaMetrics.</li>\n<li>Deep understanding of Linux systems and infrastructure-focused programming, especially in Go and Bash.</li>\n<li>Strong analytical skills and ability to troubleshoot complex production issues.</li>\n<li>Excellent communication skills and ability to share knowledge with technical and non-technical stakeholders.</li>\n</ul>\n<p>What Success Looks Like:</p>\n<ul>\n<li>Deliver stable, robust, and highly-available systems that consistently meet or exceed uptime and performance targets.</li>\n<li>Champion initiatives that drive automation, reduce operational toil, and increase the efficiency of incident response.</li>\n<li>Actively contribute to a blameless culture of learning, mentoring others in operational best practices and production engineering principles.</li>\n<li>Help CoreWeave maintain industry leadership through flawless execution in supporting demanding, AI-powered workloads at scale.</li>\n</ul>\n<p>Why CoreWeave?</p>\n<ul>\n<li>We work hard, have fun, and move fast!</li>\n<li>We&#39;re in an exciting stage of hyper-growth that you won&#39;t want to miss out on.</li>\n<li>We&#39;re not afraid of a little chaos, and we&#39;re constantly learning.</li>\n<li>Our team cares deeply about how we build our product and how we work together, which is represented through our core values:</li>\n</ul>\n<ul>\n<li>Be Curious at Your Core</li>\n<li>Act Like an Owner</li>\n<li>Empower Employees</li>\n<li>Deliver Best-in-Class Client Experiences</li>\n<li>Achieve More Together</li>\n</ul>\n<p>We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and enables the development of innovative solutions to complex problems. As we get set for takeoff, the organization&#39;s growth opportunities are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!</p>\n<p>The base salary range for this role is $120,000 to $176,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).</p>\n<p>What We Offer:</p>\n<ul>\n<li>The range we&#39;ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.</li>\n<li>In addition to a competitive salary, we offer a variety of benefits to support your needs, including:</li>\n</ul>\n<ul>\n<li>Medical, dental, and vision insurance - 100% paid for by CoreWeave</li>\n<li>Company-paid Life Insurance</li>\n<li>Voluntary supplemental life insurance</li>\n<li>Short and long-term disability insurance</li>\n<li>Flexible Spending Account</li>\n<li>Health Savings Account</li>\n<li>Tuition Reimbursement</li>\n<li>Ability to Participate in Employee Stock Purchase Program (ESPP)</li>\n<li>Mental Wellness Benefits through Spring Health</li>\n<li>Family-Forming support provided by Carrot</li>\n<li>Paid Parental Leave</li>\n<li>Flexible, full-service childcare support with Kinside</li>\n<li>401(k) with a generous employer match</li>\n<li>Flexible PTO</li>\n<li>Catered lunch each day in our office and data center locations</li>\n<li>A casual work environment</li>\n<li>A work culture focused on innovative disruption</li>\n</ul>\n<p>Our Workplace:</p>\n<ul>\n<li>While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f838587f-1ee","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4577764006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$120,000 to $176,000","x-skills-required":["Kubernetes administration","container orchestration","microservices architectures","Go","Bash","Linux systems","monitoring","observability","incident management","Prometheus","Grafana","Datadog","Splunk","Loki","VictoriaMetrics"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:49:38.881Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Kubernetes administration, container orchestration, microservices architectures, Go, Bash, Linux systems, monitoring, observability, incident management, Prometheus, Grafana, Datadog, Splunk, Loki, VictoriaMetrics","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":120000,"maxValue":176000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_106dcdc1-e14"},"title":"Command Center Systems Engineer","description":"<p>The Command Center Systems Engineer will be responsible for building and maintaining the operational backbone of the company&#39;s command center, ensuring the uptime and operational excellence of the world&#39;s largest GPU clusters. This includes strengthening, maintaining, and governing all SOPs, MOPs, and EOPs across the Command Center, enhancing and owning the escalation framework, leading change management governance, developing and managing shift structure, handover protocols, and staffing frameworks, owning the incident management lifecycle, defining and tracking operational KPIs, building and overseeing onboarding and ongoing training programs for Command Center Technicians, partnering with engineering, facilities, and site operations teams, and leading vendor governance.</p>\n<p>The ideal candidate will have 5+ years of experience in data center operations, operations management, or mission-critical infrastructure in a 24/7 environment, a proven track record of building and scaling operational frameworks, strong project and program management skills, excellent written and verbal communication, experience facilitating root cause analysis and driving corrective action to closure, and comfort working with operational metrics and reporting.</p>\n<p>In addition to a competitive salary, the company offers a variety of benefits, including medical, dental, and vision insurance, company-paid life insurance, voluntary supplemental life insurance, short and long-term disability insurance, flexible spending account, health savings account, tuition reimbursement, employee stock purchase program, mental wellness benefits, family-forming support, paid parental leave, flexible PTO, catered lunch, and a casual work environment.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_106dcdc1-e14","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4674028006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$109,000 to $145,000","x-skills-required":["data center operations","operations management","mission-critical infrastructure","project management","incident management","change management","training program development","vendor governance"],"x-skills-preferred":["Lean","Six Sigma","hyperscale","cloud","AI infrastructure"],"datePosted":"2026-04-18T15:49:25.260Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Kenilworth, NJ"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"data center operations, operations management, mission-critical infrastructure, project management, incident management, change management, training program development, vendor governance, Lean, Six Sigma, hyperscale, cloud, AI infrastructure","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":109000,"maxValue":145000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ac7263ce-de7"},"title":"Engineering Manager (Institutional - Custody, Prime Onchain Wallet)","description":"<p>Ready to be pushed beyond what you think you&#39;re capable of?</p>\n<p>At Coinbase, our mission is to increase economic freedom in the world.</p>\n<p>We&#39;re seeking a very specific candidate who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the financial system.</p>\n<p>Our work culture is intense and isn&#39;t for everyone. But if you want to build the future alongside others who excel in their disciplines and expect the same from you, there&#39;s no better place to be.</p>\n<p>While many roles at Coinbase are remote-first, we are not remote-only. In-person participation is required throughout the year. Team and company-wide offsites are held multiple times annually to foster collaboration, connection, and alignment.</p>\n<p>Attendance is expected and fully supported.</p>\n<p>The Prime Onchain Wallet team is looking for a leader to step in and lead a tightly knit group of highly talented and motivated engineers – someone who&#39;s genuinely passionate about paving the way for institutional clients to operate confidently on-chain.</p>\n<p>This person will set the vision, bring clarity and momentum to execution, and partner across product, engineering, compliance, and go-to-market to turn complex constraints into simple, scalable solutions that institutions can trust.</p>\n<p>Onchain is the new Online, it will transform the way we exchange value and increase economic freedom by creating new opportunities. Wallet is the new browser and has the opportunity to become a super app. Prime Onchain Wallet is the interface to manage on-chain assets &amp; interact with dapps.</p>\n<p>Our team is building the operating system for businesses to operate on-chain. Businesses need the necessary enterprise tooling to operate on-chain and embrace this paradigm shift.</p>\n<p>Web2 introduced a stack of web apps used by businesses: Salesforce, Slack, Gmail, Accounting software… to facilitate exchange of data &amp; information. They need a new stack of tools to operate on-chain.</p>\n<p>We are building the only fully integrated solution that makes it simple &amp; secure to get started on-chain.</p>\n<p>What you&#39;ll be doing:</p>\n<ul>\n<li>Lead the engineering teams responsible for building the mission critical systems powering institutional products that shape the crypto landscape.</li>\n</ul>\n<ul>\n<li>Collaborate with engineers, designers, product managers, and senior leadership to translate our vision into a tangible roadmap.</li>\n</ul>\n<ul>\n<li>Break down complex projects into smaller pieces and lead the iterative design and implementation process.</li>\n</ul>\n<ul>\n<li>Be a thoughtful technical voice within the team, aiding in diligent architectural decisions and fostering a culture of high-quality and operational excellence.</li>\n</ul>\n<ul>\n<li>Collaborate with Product and Engineering teams to ensure successful delivery and operation of complex, distributed systems at scale.</li>\n</ul>\n<ul>\n<li>Coach your direct reports to have a positive impact on the organization and support their career growth.</li>\n</ul>\n<ul>\n<li>Work closely with our talent organization to identify and recruit exceptional engineers who align with Coinbase&#39;s culture and contribute to our products.</li>\n</ul>\n<ul>\n<li>Contribute to and take ownership of processes that drive engineering quality and meet our engineering SLAs.</li>\n</ul>\n<p>What we look for in you:</p>\n<ul>\n<li>At least 7 years of experience in software engineering.</li>\n</ul>\n<ul>\n<li>At least 1 year of engineering management experience.</li>\n</ul>\n<ul>\n<li>An ability to balance long-term strategic thinking with short-term planning.</li>\n</ul>\n<ul>\n<li>Experience in creating, delivering, and operating multi-tenanted, distributed systems at scale.</li>\n</ul>\n<ul>\n<li>You can be hands-on when needed – whether that’s writing/reviewing code or technical documents, participating in on-call rotations and leading incidents, or triaging/troubleshooting bugs.</li>\n</ul>\n<ul>\n<li>Your passion for building an open financial system that brings the world together drives you to excel in this role.</li>\n</ul>\n<ul>\n<li>Demonstrates the ability to responsibly use generative AI tools and copilots (e.g., LibreChat, Gemini, Glean) in daily workflows, continuously learn as tools evolve, and apply human-in-the-loop practices to deliver business-ready outputs and drive measurable improvements in efficiency, cost, and quality.</li>\n</ul>\n<p>Nice to haves:</p>\n<ul>\n<li>You have gone through a rapid growth in your company (from 10 to 100s of engineers).</li>\n</ul>\n<ul>\n<li>You have experience with Blockchains (such as Bitcoin, Ethereum etc.).</li>\n</ul>\n<ul>\n<li>You’ve worked with Golang, Ruby, Docker, Sinatra, Rails, Postgres.</li>\n</ul>\n<ul>\n<li>You’ve built financial, high reliability or security systems.</li>\n</ul>\n<ul>\n<li>Crypto-forward experience, including familiarity with onchain activity such as interacting with Ethereum addresses, using ENS, and engaging with dApps or blockchain-based services.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ac7263ce-de7","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Coinbase","sameAs":"https://www.coinbase.com/","logo":"https://logos.yubhub.co/coinbase.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coinbase/jobs/7650637","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$218,025-$256,500 USD","x-skills-required":["software engineering","engineering management","distributed systems","multi-tenanted systems","code review","technical documentation","on-call rotations","incident management","bug triage","generative AI tools","copilots","LibreChat","Gemini","Glean"],"x-skills-preferred":["Golang","Ruby","Docker","Sinatra","Rails","Postgres","blockchain development","financial systems","high reliability systems","security systems","crypto-forward experience"],"datePosted":"2026-04-18T15:48:54.784Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote - USA"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"software engineering, engineering management, distributed systems, multi-tenanted systems, code review, technical documentation, on-call rotations, incident management, bug triage, generative AI tools, copilots, LibreChat, Gemini, Glean, Golang, Ruby, Docker, Sinatra, Rails, Postgres, blockchain development, financial systems, high reliability systems, security systems, crypto-forward experience","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":218025,"maxValue":256500,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9e898a04-26d"},"title":"Production Engineer, Support tooling (Tooling and Frameworks)","description":"<p>The Senior Production Engineering team sits at the heart of CoreWeave&#39;s reliability efforts. In this role, you&#39;ll partner closely with our Support/CX teams to build, operate, and evolve internal tooling that enables a &quot;Direct-to-Expert&quot; support model at scale.</p>\n<p>You&#39;ll define and ship AI-assisted workflows, self-service diagnostics, and platform integrations that reduce time-to-resolution and improve customer experience across our cloud.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Design, build, and own support-facing tools for case triage, intelligent routing, and expert engagement, integrating with incident and change management workflows.</li>\n</ul>\n<ul>\n<li>Develop AI-powered assistants and automations that accelerate root-cause discovery, knowledge retrieval, and resolution quality.</li>\n</ul>\n<ul>\n<li>Create and maintain dashboards, alerts, and signals that surface tooling issues early; integrate observability into new tooling to reduce MTTR.</li>\n</ul>\n<ul>\n<li>Build self-service and guided diagnostics that empower Support/CX to resolve common issues and collect high-quality context for escalations.</li>\n</ul>\n<ul>\n<li>Codify reliability and support practices into services, APIs, and Kubernetes-native controllers/operators where appropriate.</li>\n</ul>\n<ul>\n<li>Partner with engineering leadership and internal stakeholders to prioritise roadmap initiatives, land adoption, and measure business impact.</li>\n</ul>\n<ul>\n<li>Participate in an on-call rotation for the tooling you own.</li>\n</ul>\n<p>Minimum qualifications include:</p>\n<ul>\n<li>4+ years of software or infrastructure engineering experience building and operating production services.</li>\n</ul>\n<ul>\n<li>Proficiency in Go or Python (or equivalent experience).</li>\n</ul>\n<ul>\n<li>Strong fundamentals in Linux, containers, and Kubernetes; comfortable debugging in distributed systems.</li>\n</ul>\n<ul>\n<li>Experience with observability (metrics/logs/traces) and using data to improve reliability and support outcomes.</li>\n</ul>\n<ul>\n<li>Demonstrated experience with incident management and steady-state operational excellence (e.g., progressive delivery, testing strategies, error budgets, fault-tolerant design).</li>\n</ul>\n<ul>\n<li>Comfort collaborating with multiple stakeholders (Support/CX, Product, SRE, and service owners).</li>\n</ul>\n<p>Preferred qualifications include:</p>\n<ul>\n<li>Experience integrating or building support/operations tooling (e.g., ticketing/incident systems, status page, knowledge management, chat/alerting integrations).</li>\n</ul>\n<ul>\n<li>Experience automating manual workflows and stitching together productivity platforms.</li>\n</ul>\n<ul>\n<li>Familiarity with AI/ML tooling for retrieval, summarization, or copilot-style assistance.</li>\n</ul>\n<ul>\n<li>Experience codifying operational practices into Kubernetes controllers, operators, or platform services.</li>\n</ul>\n<p>The base salary range for this role is $139,000 to $204,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).</p>\n<p>In addition to a competitive salary, we offer a variety of benefits to support your needs, including:</p>\n<ul>\n<li>Medical, dental, and vision insurance - 100% paid for by CoreWeave</li>\n</ul>\n<ul>\n<li>Company-paid Life Insurance</li>\n</ul>\n<ul>\n<li>Voluntary supplemental life insurance</li>\n</ul>\n<ul>\n<li>Short and long-term disability insurance</li>\n</ul>\n<ul>\n<li>Flexible Spending Account</li>\n</ul>\n<ul>\n<li>Health Savings Account</li>\n</ul>\n<ul>\n<li>Tuition Reimbursement</li>\n</ul>\n<ul>\n<li>Ability to Participate in Employee Stock Purchase Program (ESPP)</li>\n</ul>\n<ul>\n<li>Mental Wellness Benefits through Spring Health</li>\n</ul>\n<ul>\n<li>Family-Forming support provided by Carrot</li>\n</ul>\n<ul>\n<li>Paid Parental Leave</li>\n</ul>\n<ul>\n<li>Flexible, full-service childcare support with Kinside</li>\n</ul>\n<ul>\n<li>401(k) with a generous employer match</li>\n</ul>\n<ul>\n<li>Flexible PTO</li>\n</ul>\n<ul>\n<li>Catered lunch each day in our office and data center locations</li>\n</ul>\n<ul>\n<li>A casual work environment</li>\n</ul>\n<ul>\n<li>A work culture focused on innovative disruption</li>\n</ul>\n<p>Our Workplace</p>\n<p>While we prioritise a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialised skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9e898a04-26d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4617128006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$139,000 to $204,000","x-skills-required":["Go","Python","Linux","containers","Kubernetes","observability","incident management","operational excellence"],"x-skills-preferred":["AI/ML tooling","ticketing/incident systems","status page","knowledge management","chat/alerting integrations","automating manual workflows","productivity platforms"],"datePosted":"2026-04-18T15:48:08.984Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Go, Python, Linux, containers, Kubernetes, observability, incident management, operational excellence, AI/ML tooling, ticketing/incident systems, status page, knowledge management, chat/alerting integrations, automating manual workflows, productivity platforms","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":139000,"maxValue":204000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_40d32156-365"},"title":"Reliability Lead, Common Services","description":"<p>As Reliability Lead, Common Services, you will establish and lead the Reliability Engineering and production operations practice for the Common Services organization. You&#39;ll partner closely with engineering leaders and teams across Common Services to define how we build, release, monitor, and operate critical services,raising the bar on reliability, availability, and operational excellence across the board.</p>\n<p>In this role, you will:</p>\n<ul>\n<li>Establish and lead the SRE / production engineering practice for the Common Services organization, including standards for reliability, incident management, and on-call, in partnership with the central Product Engineering organization.</li>\n<li>Develop an Operational Excellence strategy that focuses on not only improving system performance but also monitoring and reducing operational toil</li>\n<li>Partner with engineering and product teams to define SLOs, SLIs, and error budgets for critical Common Services, and ensure these become part of how teams plan and make tradeoffs.</li>\n<li>Own and improve the incident management lifecycle for Common Services, including on-call rotations, escalation paths, incident tooling, post-incident reviews, and follow-through on corrective actions.</li>\n<li>Drive the observability strategy (metrics, logs, traces, dashboards, alerts) for Common Services, ensuring we have actionable visibility into the health, performance, and capacity of key systems.</li>\n<li>Collaborate with engineering leads to design and review architectures for reliability, scalability, resilience, and operability, including failure modes, redundancy, and graceful degradation.</li>\n<li>Lead efforts to automate and harden operational workflows, including deployments, rollbacks, configuration management, change management, and routine maintenance tasks.</li>\n<li>Build strong, trust-based relationships with partner teams and stakeholders, becoming a go-to leader for production readiness and operational risk within Common Services.</li>\n<li>Hire, mentor, and develop SRE and production engineering talent, fostering a culture of continuous improvement, learning from incidents, and humane on-call.</li>\n<li>Partner with other SRE and production engineering leaders across CoreWeave to align on global practices, tools, and reliability goals, representing the needs and constraints of Common Services.</li>\n</ul>\n<p>You will be responsible for defining the reliability strategy, processes, and standards for the Common Services portfolio and driving consistent, high-quality operational practices across multiple teams.</p>\n<p>The base salary range for this role is $206,000 to $303,000.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_40d32156-365","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4650165006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$206,000 to $303,000","x-skills-required":["Site Reliability Engineering","Production Engineering","Linux-based production environments","Containers","Orchestration technologies","Observability stacks","Alerting systems","SLIs/SLOs","Error budgets","Incident management","On-call rotations","Escalation paths","Post-incident reviews","Corrective actions","Automation tooling","Infrastructure-as-code","CI/CD pipelines"],"x-skills-preferred":["GPU workloads","High-performance computing","Latency/throughput-sensitive systems","Multi-tenant environments","Multi-region environments","Regulated environments","Service ownership models","Mentoring","Managing senior engineers"],"datePosted":"2026-04-18T15:47:45.370Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York, NY / Sunnyvale, CA / Bellevue, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Site Reliability Engineering, Production Engineering, Linux-based production environments, Containers, Orchestration technologies, Observability stacks, Alerting systems, SLIs/SLOs, Error budgets, Incident management, On-call rotations, Escalation paths, Post-incident reviews, Corrective actions, Automation tooling, Infrastructure-as-code, CI/CD pipelines, GPU workloads, High-performance computing, Latency/throughput-sensitive systems, Multi-tenant environments, Multi-region environments, Regulated environments, Service ownership models, Mentoring, Managing senior engineers","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":206000,"maxValue":303000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_35ca76f9-e25"},"title":"Product Technical Support Associate, Edge Compute Systems","description":"<p>As a Product Technical Support Associate for Edge Compute Systems, you will play a critical role in ensuring the reliability and readiness of Anduril&#39;s fixed-site and expeditionary asset control solutions. GCS is designed to deliver real-time planning and control of autonomous systems at the tactical edge through several form-factor solutions to support system employment in any situation.</p>\n<p>In this role, you will support end users by improving field failure discovery, mitigation, and resolution processes, conducting root cause analysis, deploying fixes, and managing incidents across the GCS fleet. This position requires a strong problem-solving mindset and hands-on expertise in debugging and resolving complex compute hardware and software issues.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Sustain Anduril&#39;s GCS deployments by combining an understanding of our customers&#39; missions with familiarity of our products and delivered capabilities</li>\n<li>Triage, diagnose and root cause product incidents, driving postmortem actions including providing status visibility through resolution</li>\n<li>Consistently assess and seek to improve the quality of the fleet&#39;s observability and health telemetry in partnership with multiple functions across the GCS team</li>\n<li>Collect, organize, and analyze system failure data to define trends, drive proactive sustainment processes, and support resource allocation</li>\n<li>Support Anduril&#39;s global customers through proactive communications and detail-oriented execution</li>\n<li>Support the evaluation and improvement of product capabilities, analyzing customer communication and feedback for capability requirements, product performance indicators, and desired functionality</li>\n</ul>\n<p>Required qualifications include:</p>\n<ul>\n<li>4+ year of technical support experience with a focus on final-tier customer concern support</li>\n<li>Experience supporting and/or performing incident driven workflows requiring analysis, triage, and prioritization</li>\n<li>Experience in on-call support operations and working in limited risk tolerance environments</li>\n<li>Ability to work non-standard hours and weekends as needed</li>\n<li>Ability to obtain and maintain a U.S. Secret Security clearance</li>\n</ul>\n<p>Preferred qualifications include:</p>\n<ul>\n<li>BA or BS degree from accredited institution, STEM degree, preferably in computer science, software engineering, electrical engineering, information technology, or similar</li>\n<li>Experience supporting and/or operating compute-enabled communications systems, including electronic warfare domain experience, as a DOD employee, contractor, or end-user</li>\n<li>Experience with observability tooling such as DataDog, Grafana, and Victor Ops; exposure to software development tooling such as Git and Jira</li>\n<li>Applicable industry certifications (e.g. CompTIA Network+, CCNA, Linux+)</li>\n<li>Familiarity with and/or experience administrating NixOS systems</li>\n<li>Experience working as a system administrator</li>\n<li>Experience executing sustainment and reliability workflows for a defense-focused service or product</li>\n<li>DOD, Law Enforcement, or other Government agency experience preferred</li>\n<li>Demonstrated experience as a self-starter, able to find and resolve issues on your own</li>\n<li>Experience performing trend analysis to inform business decisions</li>\n<li>Strong aptitude for problem solving in unstructured situations at the interface of hardware, software, and networking</li>\n<li>Ability to drive challenging and vague technical problems to clarity and resolution</li>\n<li>Proven ability to master a technical system and support it in operational environments</li>\n<li>Must demonstrate an innate drive to be self-sufficient across the depth and breadth of a technical system</li>\n<li>Daily practice of excellence and rigor - you execute the 100th rep of a process with the same focus and care as the first five reps</li>\n<li>Confident with navigating ambiguity and crafting new ways of doing things</li>\n<li>Excellent written, visual, and verbal communication skills</li>\n<li>Active SECRET (or higher level) security clearance</li>\n</ul>\n<p>US Salary Range: $113,000-$149,000 USD</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_35ca76f9-e25","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anduril Industries","sameAs":"https://www.andurilindustries.com/","logo":"https://logos.yubhub.co/andurilindustries.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/andurilindustries/jobs/5083881007","x-work-arrangement":"onsite","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$113,000-$149,000 USD","x-skills-required":["Technical support","Problem-solving","Debugging","Root cause analysis","Incident management","Observability","Health telemetry","System failure analysis","Proactive sustainment","Resource allocation","Customer communication","Detail-oriented execution","Product evaluation","Capability requirements","Product performance indicators","Desired functionality"],"x-skills-preferred":["Computer science","Software engineering","Electrical engineering","Information technology","NixOS systems administration","System administration","Sustainment and reliability workflows","Defense-focused services","Government agency experience","Self-starting","Trend analysis","Ambiguity navigation","Communication skills"],"datePosted":"2026-04-18T15:47:38.962Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Costa Mesa, California, United States"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical support, Problem-solving, Debugging, Root cause analysis, Incident management, Observability, Health telemetry, System failure analysis, Proactive sustainment, Resource allocation, Customer communication, Detail-oriented execution, Product evaluation, Capability requirements, Product performance indicators, Desired functionality, Computer science, Software engineering, Electrical engineering, Information technology, NixOS systems administration, System administration, Sustainment and reliability workflows, Defense-focused services, Government agency experience, Self-starting, Trend analysis, Ambiguity navigation, Communication skills","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":113000,"maxValue":149000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c3299844-c42"},"title":"Senior Software Engineer","description":"<p>Secure Every Identity, from AI to Human</p>\n<p>Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organisations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.</p>\n<p><strong>The Opportunity</strong></p>\n<p>The Migration Services team builds the critical, data-driven services that seamlessly move customers across environments in real-time. We are looking for a Senior Software Engineer who is passionate about crafting elegant solutions to complex distributed systems problems. You will be a key player in driving innovation, collaborating with architects and product managers to build and own the crucial infrastructure that underpins the Auth0 ecosystem. If you are excited by the prospect of making a massive impact, we want to hear from you!</p>\n<p><strong>What You&#39;ll Achieve</strong></p>\n<ul>\n<li>Build for scale. You will develop, and operate highly scalable, data-intensive services, demonstrating code craftsmanship and an eye for detail.</li>\n<li>Master the data stream. You&#39;ll leverage streaming technologies and implement advanced change data capture (CDC) strategies to ensure the secure, reliable, and efficient transfer of data.</li>\n<li>Drive operational excellence. Through continuous monitoring and performance tuning, you will enhance the reliability of our migration processes and participate in our team&#39;s on-call rotation to ensure our services are always on.</li>\n</ul>\n<p><strong>What You&#39;ll Bring</strong></p>\n<ul>\n<li>Proven engineering background. With 3+ years of experience in fast-paced, agile environments, you have a proven track record of shipping high-quality software.</li>\n<li>Database familiarity. You possess a strong understanding of database fundamentals and have hands-on experience with datastores like MongoDB and PostgreSQL.</li>\n<li>Go is your go-to. You have a strong proficiency in Golang or optionally, in node.js.</li>\n<li>A passion for reliability. You have interest and experience in reliability engineering, with familiarity with observability and incident management.</li>\n<li>Collaborative skills. Your excellent written and verbal communication skills enable you to collaborate effectively with cross-functional and geo-dispersed teams.</li>\n</ul>\n<p><strong>Bonus Points</strong></p>\n<ul>\n<li>Experience with distributed streaming platforms like Kafka.</li>\n<li>Familiarity with concepts in the IAM (Identity and Access Management) domain.</li>\n<li>Experience with cloud providers (AWS, Azure) and container technologies such as Kubernetes and Docker.</li>\n</ul>\n<p>#Hybrid</p>\n<p>The Okta Experience</p>\n<ul>\n<li>Supporting Your Well-Being</li>\n<li>Driving Social Impact</li>\n<li>Developing Talent and Fostering Connection + Community</li>\n</ul>\n<p>We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c3299844-c42","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Okta","sameAs":"https://www.okta.com/","logo":"https://logos.yubhub.co/okta.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/okta/jobs/7809897","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Golang","MongoDB","PostgreSQL","Distributed systems","Reliability engineering","Observability","Incident management"],"x-skills-preferred":["Kafka","IAM","Cloud providers","Container technologies","Kubernetes","Docker"],"datePosted":"2026-04-18T15:46:29.103Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bengaluru, India"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Golang, MongoDB, PostgreSQL, Distributed systems, Reliability engineering, Observability, Incident management, Kafka, IAM, Cloud providers, Container technologies, Kubernetes, Docker"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_da7679a6-e4f"},"title":"Senior Technical Operations Lead","description":"<p>Job Title: Senior Technical Operations Lead</p>\n<p>We are seeking an experienced Senior Technical Operations Lead to drive operational excellence across our Infrastructure Engineering organization.</p>\n<p>As a Senior Technical Operations Lead, you will design and implement world-class operational processes, establish SRE best practices, and mentor technical teams to achieve exceptional reliability and efficiency.</p>\n<p>Key Responsibilities:</p>\n<p>SRE Leadership &amp; Transformation</p>\n<ul>\n<li>Lead the design and implementation of SRE practices and tooling across Infrastructure Engineering</li>\n</ul>\n<ul>\n<li>Establish and cultivate an SRE-focused culture at Zoominfo</li>\n</ul>\n<p>Operational Process Design &amp; Governance</p>\n<ul>\n<li>Establish clear governance frameworks and procedural consistency</li>\n</ul>\n<ul>\n<li>Make decisions about process exceptions and/or changes to accommodate different team contexts</li>\n</ul>\n<ul>\n<li>Design and/or implement process automations using scripts and integrations</li>\n</ul>\n<ul>\n<li>Define functional requirements and goals for process automations</li>\n</ul>\n<ul>\n<li>Conduct hands-on and/or automated audits to ensure process adherence and identify improvement opportunities</li>\n</ul>\n<p>Incident Management &amp; Root Cause Analysis</p>\n<ul>\n<li>Design, implement, and continuously improve Incident Management and Change Management procedures that scale across the organization, using tools such as PagerDuty, Slack, Jira, ServiceNow, and custom integrations</li>\n</ul>\n<ul>\n<li>Lead and participate in root cause analysis sessions, driving teams toward systemic improvements rather than blame</li>\n</ul>\n<ul>\n<li>Design and execute incident dry runs and tabletop exercises to build organizational resilience</li>\n</ul>\n<ul>\n<li>Establish metrics and KPIs that measure incident response effectiveness and drive continuous improvement</li>\n</ul>\n<p>Enable Data-Driven Decision Making</p>\n<ul>\n<li>Identify, define, and automate the tracking of operational KPIs and departmental metrics that matter, enabling senior managers to make informed decisions on the basis of data</li>\n</ul>\n<ul>\n<li>Build and maintain metric dashboards and automated reporting systems that provide real-time visibility into operational health</li>\n</ul>\n<ul>\n<li>Analyze trends and surface opportunities for optimization</li>\n</ul>\n<p>Stakeholder Engagement, Training &amp; Mentorship</p>\n<ul>\n<li>Build and maintain strong relationships with Engineering managers, Product Managers, and cross-functional stakeholders across geographies</li>\n</ul>\n<ul>\n<li>Maintain a feedback loop. Meet with stakeholders to understand process pain points.</li>\n</ul>\n<ul>\n<li>Influence others by fostering trust, leading by example, and inspiring them with your expertise and passion for reliability practices.</li>\n</ul>\n<ul>\n<li>Enhance internal knowledge of third-party tools such as Pagerduty, Datadog, and more, by educating Zoominfo employees on these tools.</li>\n</ul>\n<p>Deliver training sessions that make Operational Excellence engaging and motivating for diverse audiences.</p>\n<p>Required Experience &amp; Qualifications:</p>\n<ul>\n<li>Bachelor’s degree in Software Engineering, Operations Management, or related field</li>\n</ul>\n<ul>\n<li>7+ years of hands-on experience in technical operations, Site Reliability Engineering (SRE), Incident Management, or IT Service Management roles within SaaS or technical organizations</li>\n</ul>\n<ul>\n<li>Fluent English proficiency (written and verbal)</li>\n</ul>\n<ul>\n<li>Proven track record designing and implementing operational processes at scale</li>\n</ul>\n<ul>\n<li>Demonstrated expertise in SRE principles, practices, and tooling</li>\n</ul>\n<ul>\n<li>Strong data analysis skills with ability to define metrics, build or design dashboards, and use data to drive strategic decisions</li>\n</ul>\n<ul>\n<li>Proven ability to work effectively in a matrix organizational structure</li>\n</ul>\n<ul>\n<li>Ability and experience working with senior management at global organizations</li>\n</ul>\n<ul>\n<li>Hands-on experience with monitoring and observability tools such as PagerDuty and/or Datadog</li>\n</ul>\n<ul>\n<li>Familiarity with Jira, Confluence, Google Data Studio, or Tableau</li>\n</ul>\n<ul>\n<li>Experience with scripting and integrations (Python, JavaScript, Google AppScript, or similar)</li>\n</ul>\n<ul>\n<li>Background in SRE transformation or organizational process improvement initiatives</li>\n</ul>\n<p>#LI-SS4 #LI-Hybrid</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_da7679a6-e4f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"ZoomInfo","sameAs":"https://www.zoominfo.com/","logo":"https://logos.yubhub.co/zoominfo.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/zoominfo/jobs/8451386002","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Site Reliability Engineering (SRE)","Technical Operations","Incident Management","IT Service Management","Monitoring and Observability Tools","Jira","Confluence","Google Data Studio","Tableau","Scripting and Integrations","Python","JavaScript","Google AppScript"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:45:47.393Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Ra'anana, Israel"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Site Reliability Engineering (SRE), Technical Operations, Incident Management, IT Service Management, Monitoring and Observability Tools, Jira, Confluence, Google Data Studio, Tableau, Scripting and Integrations, Python, JavaScript, Google AppScript"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0a80aec8-c25"},"title":"Senior Software Engineer, Payments","description":"<p>We are looking for a self-motivated Senior Software Engineer to join our Payments team. As a member of this team, you will be responsible for designing, implementing, and maintaining systems and tools that support flow-level observability, payments reliability, and scalability.</p>\n<p>Your primary focus will be on building and managing large-scale platforms to improve the availability of our Payments platform for internal and external stakeholders. You will collaborate closely with other Payments engineering teams and Infra teams to ensure services are instrumented, scalable, and resilient to support our growing business.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Designing, implementing, and maintaining systems and tools at a platform level that support flow-level observability, payments reliability, and scalability.</li>\n<li>Identifying and driving improvements to increase the Payments Availability, Observability, and Resiliency of Airbnb Payments.</li>\n<li>Developing observability standards/framework for new product readiness to ensure service reliability in SOA and distributed systems.</li>\n<li>Building domain expertise to achieve scalability by understanding the nuances of Payments across processing, compliance, and infra.</li>\n<li>Driving large-scale migration and adoption projects on Observability &amp; Reliability by cross-collaborating with various Payments teams.</li>\n<li>Leading initiatives that promote a culture of reliability throughout the organization by improving incident management platforms and instrumentation.</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>7+ years of experience in back-end software development focusing on large-scale distributed systems.</li>\n<li>BE/B.Tech in Computer Science or a related technical field.</li>\n<li>Strong software development skills in one or more languages such as Java, Python, Kotlin, Scala, or Ruby on Rails.</li>\n<li>Experience in building intelligent AI agents and systems powered by Large Language Models is a plus.</li>\n<li>Evidence of exposure to architectural patterns of a large, high-scale web application (e.g., well-designed APIs, high-volume data pipelines, efficient algorithms).</li>\n<li>Familiarity with cloud platforms like AWS or Google Cloud Platform.</li>\n<li>Deep understanding of software development best practices, including version control, automated testing, CI/CD, and code reviews.</li>\n<li>Experience in incident management, monitoring, alerting, and root cause analysis.</li>\n<li>Effective leadership and communication skills to coordinate cross-functional teams during large-scale projects.</li>\n<li>Experience with initiatives across auto-scaling, self-healing mechanisms, chaos engineering, performance optimization techniques will be a plus.</li>\n<li>Previous experience in AI/ML will also be a plus.</li>\n</ul>\n<p>If you are a strong problem solver and have worked in a team that is on-call for production systems before, we encourage you to apply.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0a80aec8-c25","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Airbnb","sameAs":"https://www.airbnb.com/","logo":"https://logos.yubhub.co/airbnb.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/airbnb/jobs/7613550","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Java","Python","Kotlin","Scala","Ruby on Rails","Cloud platforms","Software development best practices","Incident management","Monitoring","Alerting","Root cause analysis"],"x-skills-preferred":["AI/ML","Auto-scaling","Self-healing mechanisms","Chaos engineering","Performance optimization techniques"],"datePosted":"2026-04-18T15:43:32.370Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bangalore, India"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Java, Python, Kotlin, Scala, Ruby on Rails, Cloud platforms, Software development best practices, Incident management, Monitoring, Alerting, Root cause analysis, AI/ML, Auto-scaling, Self-healing mechanisms, Chaos engineering, Performance optimization techniques"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_84d75f6f-187"},"title":"Staff Fullstack Engineer, Agentic Applications","description":"<p>The Enterprise Applications team at Databricks is on an ambitious journey to transform how we run the business. Our mission is to build resilient, in-house platforms and AI-powered capabilities that provide a genuine competitive advantage, powering our next doubling in size and revenue.</p>\n<p>As a Staff Software Engineer on the Enterprise Applications team, you will play a critical role delivering end-to-end applications that power our business and empower our employees with truly assistive technologies. You will be responsible for defining the technical roadmap, designing critical software systems, raising the bar on engineering excellence, mentoring junior engineers, and tackling the most challenging technical issues facing the team. You will be the technical leader for a team of software engineers.</p>\n<p><strong>The impact you’ll have:</strong></p>\n<ul>\n<li>Define the technical strategy and roadmap for building in-house solutions for critical internal workflows</li>\n<li>Partner closely with product management, design, and other engineering teams to build end-to-end, AI-powered solutions</li>\n<li>Eliminate technical risk by solving the most technically challenging issues or tackling the most complex development tasks facing the team</li>\n<li>Drive team best practices for engineering excellence, including design quality, operational reviews, testing strategies, and incident management.</li>\n</ul>\n<p><strong>What we look for:</strong></p>\n<ul>\n<li>12+ years of software engineering experience, with a strong track record of technical leadership and impact</li>\n<li>3+ years of engineering leadership experience, serving as the technical owner for the software systems owned by your team</li>\n<li>Experience designing and building end-to-end, full-stack cloud-native applications</li>\n<li>Customer obsession and product engineering mindset</li>\n<li>Strong communication skills and ability to influence technical and business stakeholders</li>\n<li>History of raising the engineering quality bar, focus on execution and operational rigor</li>\n<li>Strong ability to collaborate across product, engineering, and business teams to align technical strategy with company business objectives.</li>\n</ul>\n<p><strong>Why Join Us?</strong></p>\n<ul>\n<li>Work on high-impact, high-visibility initiatives that directly contribute to Databricks&#39; growth trajectory</li>\n<li>Lead and mentor a talented team of engineers solving challenging, customer-centric problems</li>\n<li>Drive innovation in a fast-paced, data-driven environment where your work shapes the future of the company</li>\n</ul>\n<p>If you are passionate about building AI-powered solutions for running the business, we would love to hear from you!</p>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Comprehensive health coverage including medical, dental, and vision</li>\n<li>401(k) Plan</li>\n<li>Equity awards</li>\n<li>Flexible time off</li>\n<li>Paid parental leave</li>\n<li>Family Planning</li>\n<li>Gym reimbursement</li>\n<li>Annual personal development fund</li>\n<li>Work headphones reimbursement</li>\n<li>Employee Assistance Program (EAP)</li>\n<li>Business travel accident insurance</li>\n<li>Mental wellness resources</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_84d75f6f-187","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8220836002","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$192,000-$260,000 USD","x-skills-required":["software engineering","technical leadership","engineering excellence","design quality","operational reviews","testing strategies","incident management","cloud-native applications","customer obsession","product engineering mindset","communication skills","influence technical and business stakeholders"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:41:35.618Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Mountain View, California"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"software engineering, technical leadership, engineering excellence, design quality, operational reviews, testing strategies, incident management, cloud-native applications, customer obsession, product engineering mindset, communication skills, influence technical and business stakeholders","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":192000,"maxValue":260000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0962c409-5f6"},"title":"Incident Manager - Detection & Response","description":"<p>We&#39;re looking for an experienced Technical Program Manager to own and evolve incident management within the Detection &amp; Response (D&amp;R) team. This is a senior-level specialization on the Technical Program Manager ladder, focused on how we detect, respond to, and learn from security and operational incidents.</p>\n<p>You&#39;ll be the driving force behind maturing and scaling our incident response lifecycle,from detection and triage through containment, remediation, and post-incident review. Critically, some of the highest-impact work in this role happens after the immediate response: gathering data on incident trends, reporting on patterns and root causes, and working cross-functionally across engineering, security, infrastructure, and product teams to ensure that broad fixes and systemic improvements are actually implemented.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Own the end-to-end D&amp;R incident management program: detection workflows, response processes, escalation paths, communication standards, and remediation tracking.</li>\n<li>Serve as incident commander for security incidents, driving clear coordination across executive, engineering, security, legal, and other appropriate stakeholders.</li>\n<li>Establish and run incident commander rotations within D&amp;R, ensuring clear ownership and effective coordination during incidents of varying severity.</li>\n<li>Drive post-incident accountability by defining how action items are captured, assigned, tracked, and completed across teams,ensuring follow-through on both tactical fixes and strategic improvements.</li>\n<li>Gather, analyze, and report on incident trends and patterns to surface systemic risks, recurring root causes, and areas where the organization is most vulnerable.</li>\n<li>Translate trend analysis into actionable cross-functional initiatives: partner with engineering, infrastructure, security, and product teams to prioritize and implement broad fixes and preventive improvements that address root causes rather than symptoms.</li>\n<li>Lead incident review forums (post-mortems, retrospectives) and ensure learnings are captured, socialized, and acted upon across the organization.</li>\n<li>Develop and maintain D&amp;R incident response documentation, playbooks, runbooks, and training materials; keep them current as the threat landscape and our systems evolve.</li>\n<li>Partner with detection engineering to improve alert fidelity, reduce noise, and shorten time-to-detection for security events.</li>\n<li>Define, develop, and track incident management KPIs and report regularly to D&amp;R and Security leadership.</li>\n<li>Support broad cross-functional training and initiatives to uplevel security awareness across the company (e.g. Tabletop exercises, training, talks).</li>\n</ul>\n<p>You may be a good fit if you:</p>\n<ul>\n<li>Have 7+ years of experience in technical program management, incident management, or security operations, with significant time spent in a detection &amp; response or security incident response context.</li>\n<li>Have led or built incident response programs at a technology company, ideally in a high-growth or security-intensive environment.</li>\n<li>Have a demonstrated track record of turning incident data into organizational improvements,not just writing post-mortems, but driving the cross-functional work to implement systemic fixes.</li>\n<li>Are comfortable participating in on-call responsibilities and leading incident response during high-severity security events, including off-hours.</li>\n<li>Have experience building and scaling operational processes from the ground up in environments where structure didn’t previously exist.</li>\n<li>Excel at driving accountability and follow-through across multiple teams without direct authority,you know how to influence, track, and close the loop.</li>\n<li>Have strong analytical skills and experience with incident trend analysis, metrics reporting, and data-driven prioritization.</li>\n<li>Are highly organized with a knack for bringing structure to ambiguous, fast-moving situations.</li>\n<li>Have excellent communication skills, especially under pressure and when coordinating across technical and non-technical stakeholders, including executive leadership.</li>\n<li>Thrive in fast-paced environments where priorities shift and you’re often working with incomplete information.</li>\n</ul>\n<p>The annual compensation range for this role is $320,000-$405,000 USD.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0962c409-5f6","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5176481008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$320,000-$405,000 USD","x-skills-required":["Technical Program Management","Incident Management","Security Operations","Detection & Response","Cross-functional Team Leadership","Communication","Analytical Skills","Data-driven Prioritization","Incident Trend Analysis","Metrics Reporting"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:39:59.642Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Program Management, Incident Management, Security Operations, Detection & Response, Cross-functional Team Leadership, Communication, Analytical Skills, Data-driven Prioritization, Incident Trend Analysis, Metrics Reporting","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":320000,"maxValue":405000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_55b94055-301"},"title":"Incident Manager - Detection & Response","description":"<p><strong>About the Role</strong></p>\n<p>The Detection &amp; Response (D&amp;R) team plays a critical role in protecting our systems, users, and data from security threats. We’re looking for an experienced Technical Program Manager to own and evolve incident management within D&amp;R.</p>\n<p>You’ll be the driving force behind maturing and scaling our incident response lifecycle,from detection and triage through containment, remediation, and post-incident review. Critically, some of the highest-impact work in this role happens after the immediate response: gathering data on incident trends, reporting on patterns and root causes, and working cross-functionally across engineering, security, infrastructure, and product teams to ensure that broad fixes and systemic improvements are actually implemented.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Own the end-to-end D&amp;R incident management program: detection workflows, response processes, escalation paths, communication standards, and remediation tracking.</li>\n</ul>\n<ul>\n<li>Serve as incident commander for security incidents, driving clear coordination across executive, engineering, security, legal, and other appropriate stakeholders.</li>\n</ul>\n<ul>\n<li>Establish and run incident commander rotations within D&amp;R, ensuring clear ownership and effective coordination during incidents of varying severity.</li>\n</ul>\n<ul>\n<li>Drive post-incident accountability by defining how action items are captured, assigned, tracked, and completed across teams,ensuring follow-through on both tactical fixes and strategic improvements.</li>\n</ul>\n<ul>\n<li>Gather, analyse, and report on incident trends and patterns to surface systemic risks, recurring root causes, and areas where the organisation is most vulnerable.</li>\n</ul>\n<ul>\n<li>Translate trend analysis into actionable cross-functional initiatives: partner with engineering, infrastructure, security, and product teams to prioritise and implement broad fixes and preventive improvements that address root causes rather than symptoms.</li>\n</ul>\n<ul>\n<li>Lead incident review forums (post-mortems, retrospectives) and ensure learnings are captured, socialised, and acted upon across the organisation.</li>\n</ul>\n<ul>\n<li>Develop and maintain D&amp;R incident response documentation, playbooks, runbooks, and training materials; keep them current as the threat landscape and our systems evolve.</li>\n</ul>\n<ul>\n<li>Partner with detection engineering to improve alert fidelity, reduce noise, and shorten time-to-detection for security events.</li>\n</ul>\n<ul>\n<li>Define, develop, and track incident management KPIs and report regularly to D&amp;R and Security leadership.</li>\n</ul>\n<ul>\n<li>Support broad cross-functional training and initiatives to uplevel security awareness across the company (e.g. Tabletop exercises, training, talks).</li>\n</ul>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have 7+ years of experience in technical program management, incident management, or security operations, with significant time spent in a detection &amp; response or security incident response context.</li>\n</ul>\n<ul>\n<li>Have led or built incident response programs at a technology company, ideally in a high-growth or security-intensive environment.</li>\n</ul>\n<ul>\n<li>Have a demonstrated track record of turning incident data into organisational improvements,not just writing post-mortems, but driving the cross-functional work to implement systemic fixes.</li>\n</ul>\n<ul>\n<li>Are comfortable participating in on-call responsibilities and leading incident response during high-severity security events, including off-hours.</li>\n</ul>\n<ul>\n<li>Have experience building and scaling operational processes from the ground up in environments where structure didn’t previously exist.</li>\n</ul>\n<ul>\n<li>Excel at driving accountability and follow-through across multiple teams without direct authority,you know how to influence, track, and close the loop.</li>\n</ul>\n<ul>\n<li>Have strong analytical skills and experience with incident trend analysis, metrics reporting, and data-driven prioritisation.</li>\n</ul>\n<ul>\n<li>Are highly organised with a knack for bringing structure to ambiguous, fast-moving situations.</li>\n</ul>\n<ul>\n<li>Have excellent communication skills, especially under pressure and when coordinating across technical and non-technical stakeholders, including executive leadership.</li>\n</ul>\n<ul>\n<li>Thrive in fast-paced environments where priorities shift and you’re often working with incomplete information.</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<ul>\n<li>Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience</li>\n</ul>\n<ul>\n<li>Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience</li>\n</ul>\n<ul>\n<li>Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position</li>\n</ul>\n<ul>\n<li>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</li>\n</ul>\n<ul>\n<li>Visa sponsorship: We do sponsor visas! However, we aren’t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</li>\n</ul>\n<p><strong>How we’re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact , advancing our long-term goals of steerable, trustworthy AI , rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We’re an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.</p>\n<p>The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI &amp; Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.</p>\n<p><strong>Come work with us!</strong></p>\n<p>Anthropic is a public benefit corporation</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_55b94055-301","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5176570008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Technical Program Management","Incident Management","Security Operations","Detection & Response","Cross-functional Teamwork","Communication","Analytical Skills","Data-driven Prioritisation","Influence and Close Loop","Strong Organisational Skills"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:39:51.436Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Zürich, CH"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Program Management, Incident Management, Security Operations, Detection & Response, Cross-functional Teamwork, Communication, Analytical Skills, Data-driven Prioritisation, Influence and Close Loop, Strong Organisational Skills"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ee95fc9b-ac3"},"title":"Community Operations & Executive Escalations Manager","description":"<p>We are seeking an Operations Manager to build and lead a new pillar of our Platform Operations function: Community Operations &amp; Executive Escalations. This role will stand up the team, processes, and infrastructure that protect Anthropic&#39;s reputation when high-stakes user issues surface on social media or arrive through executive channels.</p>\n<p>As the Community Operations &amp; Executive Escalations Manager, you will own end-to-end escalation management - detection, triage, incident coordination, and resolution- across two distinct but related workstreams: brand-impacting conversations on public social channels and high-sensitivity inbound from Anthropic employees on behalf of users.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Hire, lead, and develop a high-performing team of Community Operations specialists, covering detection through resolution</li>\n<li>Build the function from zero: stand up social listening infrastructure, define escalation criteria and severity tiers, author the incident management SOP, and establish coverage hours and on-call schedules</li>\n<li>Own social monitoring across public channels - triaging brand mentions and running escalations to resolution</li>\n<li>Own executive escalations end to end: receive inbound from Anthropic executives on behalf of users, lead your team in quarterbacking investigation and coordinate remediation across partner teams</li>\n<li>Serve as the central coordination point during live escalations and drive public/private response decisions to closure</li>\n<li>Build clear operating models with cross-functional teams so every stakeholder knows exactly where to send an escalation and what happens next</li>\n<li>Define and report on success metrics (time to triage, time to resolution, escalation volume by source) and run post-incident reviews that feed learnings back into criteria and process</li>\n<li>Develop response templates and playbooks for common scenarios so the team can move fast without sacrificing quality or judgment</li>\n<li>Provide thoughtful coaching and feedback to your direct reports, and partner with them on their career development and growth</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>8+ years of experience in community operations, escalations management, or a related field - ideally in a consumer or developer platform environment</li>\n<li>5+ years of people management experience, preferably in a fast-growing technology company</li>\n<li>Direct experience running incident or escalation workflows: you&#39;ve been the person managing the channel, paging the on-call, and writing the post-incident report</li>\n<li>Familiarity with social listening tooling and the dynamics of how issues spread across public social platforms</li>\n<li>Excellent judgment under pressure in reading a fast-moving situation, separating signal from noise, and making calls with incomplete information</li>\n<li>Proven track record of building operational programs from 0 to 1: writing the SOP, defining the criteria, standing up the tooling, and then iterating as reality breaks your assumptions</li>\n<li>Strong cross-functional influence and ability to align teams around a shared plan in real time, even when incentives differ</li>\n<li>Clear, concise written communication - you can brief an executive in three sentences and document an incident so the next person can learn from it</li>\n<li>Comfort with ambiguity and a bias toward action; this team doesn&#39;t exist yet and you&#39;ll be defining what good looks like</li>\n<li>Excitement about protecting Anthropic&#39;s reputation and relationships at a moment when public attention on AI has never been higher</li>\n</ul>\n<p>Salary:</p>\n<p>The annual compensation range for this role is $260,000-$310,000 USD.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ee95fc9b-ac3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5179769008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$260,000-$310,000 USD","x-skills-required":["community operations","escalations management","social listening tooling","incident management","cross-functional influence","clear communication","bias toward action"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:37:57.418Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY | Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"community operations, escalations management, social listening tooling, incident management, cross-functional influence, clear communication, bias toward action","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":260000,"maxValue":310000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d0318e21-ea2"},"title":"Environmental Health, Safety, and Radiation Manager","description":"<p>Standard Nuclear is fueling America&#39;s nuclear renaissance at industrial scale. We are seeking an Environmental Health, Safety, and Radiation Manager to lead and manage our environmental, health, safety, and radiological protection programs across manufacturing and laboratory operations.</p>\n<p>As a key member of our operations team, you will be responsible for ensuring that all activities are conducted safely and in compliance with company policies and applicable regulatory requirements. You will provide both strategic direction and hands-on oversight, working closely with operations, engineering, and quality teams to implement effective safety programs and maintain a strong safety culture.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Lead development, implementation, and continuous improvement of EH&amp;S programs across the site</li>\n<li>Establish and maintain policies, procedures, and standards aligned with regulatory requirements and industry best practices</li>\n<li>Ensure integration of EH&amp;S considerations into manufacturing, engineering, and operational activities</li>\n<li>Drive a proactive safety culture across all levels of the organization</li>\n</ul>\n<p>Radiological Protection Oversight:</p>\n<ul>\n<li>Oversee radiological protection programs, including contamination control, personnel monitoring, and area controls</li>\n<li>Ensure radiological work is conducted in accordance with approved procedures and regulatory expectations</li>\n<li>Provide leadership and guidance to radiological control personnel and activities</li>\n<li>Maintain compliance with applicable DOE and NRC requirements</li>\n</ul>\n<p>Regulatory Compliance &amp; Audits:</p>\n<ul>\n<li>Ensure compliance with environmental, health, safety, and radiological regulations and standards</li>\n<li>Lead preparation for and participation in internal and external audits, inspections, and assessments</li>\n<li>Interface with regulatory agencies as required</li>\n<li>Maintain audit-ready documentation and ensure timely resolution of findings</li>\n</ul>\n<p>Incident Management &amp; Risk Mitigation:</p>\n<ul>\n<li>Lead incident investigations, root cause analysis, and corrective action implementation</li>\n<li>Identify hazards and risks across operations and implement mitigation strategies</li>\n<li>Monitor and track safety performance metrics and trends</li>\n<li>Ensure timely reporting and resolution of safety incidents and concerns</li>\n</ul>\n<p>Operational &amp; Field Support:</p>\n<ul>\n<li>Maintain active presence on the manufacturing floor and in operational areas</li>\n<li>Partner with operations and engineering teams to support safe work planning and execution</li>\n<li>Ensure proper use of PPE, adherence to procedures, and implementation of safety controls</li>\n<li>Support high-risk activities and provide oversight as needed</li>\n</ul>\n<p>Team Leadership &amp; Development:</p>\n<ul>\n<li>Lead and develop EH&amp;S staff, including specialists and technicians</li>\n<li>Provide training, coaching, and performance management</li>\n<li>Support development and delivery of EH&amp;S and radiological training programs</li>\n</ul>\n<p>Systems &amp; Continuous Improvement:</p>\n<ul>\n<li>Develop and maintain EH&amp;S systems, tools, and reporting processes</li>\n<li>Identify opportunities to improve safety performance, compliance, and efficiency</li>\n<li>Support scaling of EH&amp;S programs as manufacturing operations grow</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>Bachelor&#39;s degree in Environmental Health &amp; Safety, Engineering, or a related field</li>\n<li>7+ years of experience in EH&amp;S within a manufacturing, industrial, or regulated environment</li>\n<li>Experience managing EH&amp;S programs and leading safety teams</li>\n<li>Strong knowledge of EH&amp;S regulations and compliance requirements (OSHA, EPA, DOE/NRC as applicable)</li>\n<li>Experience with radiological protection programs or nuclear environments required</li>\n<li>Proven experience leading incident investigations and implementing corrective actions</li>\n<li>Strong leadership, communication, and organizational skills</li>\n<li>Ability to work cross-functionally with operations, engineering, and leadership teams</li>\n<li>Experience in nuclear, energy, aerospace, defense, or other regulated industries is required</li>\n</ul>\n<p>Benefits:</p>\n<ul>\n<li>Health, Dental &amp; Vision Insurance</li>\n<li>Health Savings Account</li>\n<li>Disability and Life Insurance</li>\n<li>401K Plan</li>\n<li>Paid Time Off, Holidays</li>\n</ul>\n<p>Work Environment:</p>\n<ul>\n<li>This role is based on-site in Oak Ridge, TN and involves regular work in manufacturing, laboratory, and controlled or radiological areas.</li>\n<li>The position requires active engagement in operational environments, including presence on the production floor and oversight of field activities. Use of personal protective equipment (PPE) and adherence to EH&amp;S and radiological procedures is required. Reasonable accommodation will be provided to enable individuals with disabilities to perform the essential functions.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d0318e21-ea2","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Standard Nuclear","sameAs":"https://www.standardnuclear.com/","logo":"https://logos.yubhub.co/standardnuclear.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/standardnuclearinc/jobs/5191556008","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Environmental Health & Safety","Radiological Protection","Regulatory Compliance","Incident Management","Risk Mitigation","Leadership","Communication","Organizational Skills","Nuclear Industry Experience"],"x-skills-preferred":[],"datePosted":"2026-04-17T13:00:16.955Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Oak Ridge, TN"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Energy","skills":"Environmental Health & Safety, Radiological Protection, Regulatory Compliance, Incident Management, Risk Mitigation, Leadership, Communication, Organizational Skills, Nuclear Industry Experience"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_03cafe1e-283"},"title":"Head of Support","description":"<p>We believe that the way people interact with their finances will drastically improve in the next few years. We&#39;re dedicated to empowering this transformation by building the tools and experiences that thousands of developers use to create their own products.</p>\n<p>Plaid powers the tools millions of people rely on to live a healthier financial life. We work with thousands of companies like Venmo, SoFi, several of the Fortune 500, and many of the largest banks to make it easy for people to connect their financial accounts to the apps and services they want to use.</p>\n<p>As Head of Support, you will own our global support strategy and outcomes across customer and consumer support, leading a distributed team that works across channels, products, and industries. You&#39;ll be responsible for uniting our customer and consumer support motions, evolving our Customer Success Package business, and using support as a strategic lever to influence product quality, roadmap, and Plaid&#39;s brand in the market.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Own the global support strategy and outcomes across SLAs, CSAT, revenue, and support quality.</li>\n<li>Unite our customer and consumer support teams into a single, high-performing organization that is a true differentiator for Plaid.</li>\n<li>Lead, grow, and coach managers and ICs across regions and time zones; drive performance, calibration, and quality programs at scale.</li>\n<li>Manage critical incidents and executive-level escalations in tight partnership with Product, Engineering, Risk, Compliance, GTM, and CS Ops, including post-incident reviews and process fixes.</li>\n<li>Evolve support operations, tooling, and knowledge management to drive efficiency, deflection, and consistent, high-quality experiences for customers and consumers.</li>\n<li>Own the Customer Success Package business, balancing COGS, revenue, and customer experience</li>\n<li>Regularly report on support health and align plans and tradeoffs with Plaid&#39;s executive team and other stakeholders</li>\n</ul>\n<p><strong>Qualifications</strong></p>\n<ul>\n<li>10+ years in technical/customer support with at least 5+ years leading managers (manager-of-managers) in a scaling B2B SaaS or API company.</li>\n<li>3+ years running global support operations with measurable improvements in SLAs, CSAT/CES and quality.</li>\n<li>Background in fintech, payments, or developer/API platforms operating at significant scale, preferred</li>\n<li>Proven success owning support outcomes at scale, including incident management and executive-level escalations</li>\n<li>Deep experience building and leading distributed teams, with strong hiring, coaching, and performance management muscles across regions and time zones.</li>\n<li>Strong operational rigor: metrics design, forecasting and capacity planning, process improvement, and support tooling strategy.</li>\n<li>Demonstrated ability to partner with GTM, Product, and Engineering to influence roadmaps and improve product quality through support insights.</li>\n<li>Experience using AI and building content/deflection programs and quality frameworks.</li>\n</ul>\n<p><strong>Additional Information</strong></p>\n<p>Our mission at Plaid is to unlock financial freedom for everyone. To support that mission, we seek to build a diverse team of driven individuals who care deeply about making the financial ecosystem more equitable. We recognize that strong qualifications can come from both prior work experiences and lived experiences. We encourage you to apply to a role even if your experience doesn&#39;t fully match the job description.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_03cafe1e-283","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Plaid","sameAs":"https://plaid.com/","logo":"https://logos.yubhub.co/plaid.com.png"},"x-apply-url":"https://jobs.lever.co/plaid/31d1ef5f-c05a-4c71-8346-2f348d702e98","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"USD 124,800-223,200 per-year-salary","x-skills-required":["technical/customer support","global support operations","fintech, payments, or developer/API platforms","incident management","distributed teams","operational rigor","AI","content/deflection programs","quality frameworks"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:51:36.800Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Finance","industry":"Fintech","skills":"technical/customer support, global support operations, fintech, payments, or developer/API platforms, incident management, distributed teams, operational rigor, AI, content/deflection programs, quality frameworks","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":124800,"maxValue":223200,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_fe36611f-683"},"title":"Client Experience Team Lead, Strategic Partnerships and Integrations","description":"<p>At Anchorage Digital, we are building the world’s most advanced digital asset platform for institutions to participate in crypto. As Client Experience Team Lead for Strategic Partnerships and Integrations, you have the opportunity to help shape and build the client service model under which Anchorage will operate with some of its most exciting, integration-based strategic ventures.</p>\n<p>This role is focused on handling operational relationship management with integration partners, including developing a deep understanding of how these partners leverage Anchorage Digital’s products, services, and API integrations to serve their businesses and end customers. This is an internal- and external-facing role, requiring leadership of cross-functional teams to drive strategic initiatives, gain consensus among stakeholders, and manage against internal and external deadlines.</p>\n<p>The successful candidate will be comfortable with navigating API-based business models, building out strong client servicing teams, and maintaining an understanding of the crypto ecosystem’s users, participants, and products (including familiarity with the requirements, behaviors, needs of institutional &amp; retail clients engaging with liquidity providers, exchanges, agency desks, custodians, and prime brokers in both traditional finance and DeFi), as well as demonstrating strong experience in traditional financial services.</p>\n<p><strong>Key Responsibilities:</strong></p>\n<p><strong>Operational Support and Relationship Management:</strong></p>\n<p>Intimately understand the operational construct of our integration partners to help them navigate how Anchorage Digital serves their operational needs; manage a team that provides day to day client servicing and inquiry management support for these partners.</p>\n<p><strong>Cross-Segment Advocacy:</strong></p>\n<p>Apply expertise in both institutional and retail crypto landscapes to anticipate client needs and advocate internally on behalf of our partners for product enhancements and services that serve a diverse user base with a white-glove service mindset.</p>\n<p><strong>Execution-Oriented Thought Partnership:</strong></p>\n<p>In collaboration with GTM, operations, and technical teams, act as an owner in developing specialized processes and procedures to ensure our support model develops alongside the business strategy of our partners.</p>\n<p><strong>Technical Escalation &amp; Liaison:</strong></p>\n<p>Act as the lead point of contact for these partners, resolving high-priority escalations and managing cross-organizational dependencies to ensure a &quot;white-glove&quot; resolution.</p>\n<p><strong>Oversight and Regulatory Integrity:</strong></p>\n<p>Where applicable, support the design and execution of governance and quality controls to monitor the quality of external processes performed by these partners, which may impact Anchorage Digital’s regulatory standing and reputation.</p>\n<p><strong>Technical Skills &amp; Expertise:</strong></p>\n<p><strong>API Troubleshooting &amp; Incident Management:</strong></p>\n<p>Demonstrates familiarity/comfort with API architecture and the ability to partner with technical teams to investigate and help diagnose technical issues related to Anchorage Digital’s core product suite.</p>\n<p><strong>Crypto Domain Knowledge:</strong></p>\n<p>Maintains an excellent understanding of blockchains and their nuances, specifically how they impact both retail and institutional use cases.</p>\n<p><strong>Policy Implementation:</strong></p>\n<p>Applies Anchorage Digital policies and procedures to resolve multifaceted issues arising from integrations; has strong experience working within regulatory frameworks.</p>\n<p><strong>Seasoned Financial Services:</strong></p>\n<p>Understands requirements, behaviors, needs of institutional &amp; retail clients engaging with liquidity providers, exchanges, agency desks, custodians, and prime brokers in both traditional finance and DeFi.</p>\n<p><strong>Complexity and Impact of Work:</strong></p>\n<p><strong>Strategic Relationship Management:</strong></p>\n<p>Manages the technical and operational health of relationships with strategic clients, working in lock-step with Sales and Relationship Management to defend assets in custody.</p>\n<p><strong>Communication and Project Management:</strong></p>\n<p>Streamlines the integration and communication of new product releases, acting as the primary driver to alleviate roadblocks between integration partners and internal teams.</p>\n<p><strong>Advocacy:</strong></p>\n<p>Maintains the &quot;Book of Work&quot; for strategic integration clients, prioritizing bespoke technical requests and product/feature enhancements, as well engaging teams such as compliance, legal, and operations to design support for evolving service models and new business ventures.</p>\n<p><strong>Oversight:</strong></p>\n<p>Where applicable, support the design and execution of governance and quality controls to monitor the quality of external processes performed by these partners, which may impact Anchorage Digital’s regulatory standing and reputation.</p>\n<p><strong>Communication and Influence:</strong></p>\n<p><strong>Cross-Team Thought Leadership &amp; Influence:</strong></p>\n<p>Navigate ambiguity, build social capital, translate complexity, and champion the &quot;why&quot; by proactively identifying friction points, using data-driven insights to influence priorities &amp; roadmap, demonstrating clear understanding of our integration partners and their importance to Anchorage Digital’s overall long-term strategy,</p>\n<p><strong>Reporting:</strong></p>\n<p>Proactively shares information and status updates with internal stakeholders and integration partners to ensure alignment on organizational goals.</p>\n<p><strong>Mentorship:</strong></p>\n<p>Effectively manages, develops, and creates growth roadmaps for CXM team members within the segment.</p>\n<p><strong>Additional Information About Anchorage Digital:</strong></p>\n<p>Who we are</p>\n<p>The Anchorage Village, what we call our team, brings together the brightest minds from platform security, financial services, and distributed ledger technology to provide the building blocks that empower institutions to safely participate in the evolving digital asset ecosystem.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_fe36611f-683","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anchorage Digital","sameAs":"https://anchorage.com","logo":"https://logos.yubhub.co/anchorage.com.png"},"x-apply-url":"https://jobs.lever.co/anchorage/d5bd892a-c7ae-4df6-a0e8-2edcb29f14e6","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["API Troubleshooting & Incident Management","Crypto Domain Knowledge","Policy Implementation","Seasoned Financial Services"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:22:06.589Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Finance","industry":"Finance","skills":"API Troubleshooting & Incident Management, Crypto Domain Knowledge, Policy Implementation, Seasoned Financial Services"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_1e0498e4-e87"},"title":"Member of Client Experience, Strategic Partnerships and Integrations","description":"<p>At Anchorage Digital, we are building the world’s most advanced digital asset platform for institutions to participate in crypto. As a member of Client Experience, Strategic Partnerships and Integrations, you have the opportunity to help shape and build the client service model under which Anchorage will operate with some of its most exciting, integration-based strategic ventures.</p>\n<p>This role is focused on handling operational relationship management with integration partners, including developing a deep understanding of how these partners leverage Anchorage Digital’s products, services, and API integrations to serve their businesses and end customers. This is an internal- and external-facing role, requiring collaboration with cross-functional teams to drive strategic initiatives, gain consensus among stakeholders, and manage against internal and external deadlines.</p>\n<p>The successful candidate will be comfortable navigating API-based business models, building and maintaining an understanding of the crypto ecosystem’s users, participants, and products (including familiarity with the requirements, behaviors, needs of institutional &amp; retail clients engaging with liquidity providers, exchanges, agency desks, custodians, and prime brokers in both traditional finance and DeFi), as well as demonstrating strong experience in traditional financial services.</p>\n<p><strong>Key Responsibilities:</strong></p>\n<p><strong>Operational Support and Relationship Management:</strong></p>\n<p>Understand the operational construct of our integration partners to help them navigate how Anchorage Digital serves their operational needs;</p>\n<p><strong>Cross-Segment Advocacy:</strong></p>\n<p>Apply expertise in both institutional and retail crypto landscapes to anticipate client needs and advocate internally on behalf of our partners for product enhancements and services that serve a diverse user base with a white-glove service mindset.</p>\n<p><strong>Technical Escalation &amp; Liaison:</strong></p>\n<p>Act as the point of contact for these partners, resolving high-priority escalations and managing cross-organizational dependencies to ensure a &quot;white-glove&quot; resolution.</p>\n<p><strong>Oversight and Regulatory Integrity:</strong></p>\n<p>Where applicable, support the design and execution of governance and quality controls to monitor the quality of external processes performed by these partners, which may impact Anchorage Digital’s regulatory standing and reputation.</p>\n<p><strong>Technical Skills &amp; Expertise:</strong></p>\n<p><strong>API Troubleshooting &amp; Incident Management:</strong></p>\n<p>Demonstrates familiarity/comfort with API architecture and the ability to partner with technical teams to investigate and help diagnose technical issues related to Anchorage Digital’s core product suite.</p>\n<p><strong>Crypto Domain Knowledge:</strong></p>\n<p>Maintains an excellent understanding of blockchains and their nuances, specifically how they impact both retail and institutional use cases.</p>\n<p><strong>Policy Implementation:</strong></p>\n<p>Applies Anchorage Digital policies and procedures to resolve multifaceted issues arising from integrations; has intermediate experience working within regulatory frameworks.</p>\n<p><strong>Financial Services Proficiency:</strong></p>\n<p>Moderate understanding of requirements, behaviors, needs of institutional &amp; retail clients engaging with liquidity providers, exchanges, agency desks, custodians, and prime brokers in both traditional finance and DeFi.</p>\n<p><strong>Complexity and Impact of Work:</strong></p>\n<p><strong>Strategic Relationship Management:</strong></p>\n<p>Supports CXM Lead on the technical and operational health of relationships with strategic clients, working in lock-step with Sales and Relationship Management to defend assets in custody.</p>\n<p><strong>Communication and Project Management:</strong></p>\n<p>Assists in the integration and communication of new product releases.</p>\n<p><strong>Advocacy:</strong></p>\n<p>Assist, as needed, in maintaining the &quot;Book of Work&quot; for strategic integration clients, prioritizing bespoke technical requests and product/feature enhancements, as well as collaborating with teams such as compliance, legal, and operations in support of service models and new business ventures.</p>\n<p><strong>Communication and Influence:</strong></p>\n<p><strong>Reporting:</strong></p>\n<p>Proactively shares information and status updates with internal stakeholders and integration partners to ensure alignment on organizational goals.</p>\n<p><strong>Growth &amp; Development:</strong></p>\n<p>Effectively manages, develops, and creates growth roadmaps for CXM team members within the segment.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_1e0498e4-e87","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anchorage Digital","sameAs":"https://anchorage.com","logo":"https://logos.yubhub.co/anchorage.com.png"},"x-apply-url":"https://jobs.lever.co/anchorage/9ac26dbe-2e8d-443e-8530-02f9837be9d6","x-work-arrangement":"Remote","x-experience-level":null,"x-job-type":"Full-Time","x-salary-range":null,"x-skills-required":["API Troubleshooting & Incident Management","Crypto Domain Knowledge","Policy Implementation","Financial Services Proficiency","API architecture"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:17:42.946Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Finance","industry":"Technology","skills":"API Troubleshooting & Incident Management, Crypto Domain Knowledge, Policy Implementation, Financial Services Proficiency, API architecture"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_4f6e3d04-c70"},"title":"Information Security Analyst - GRC","description":"<p>At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content.</p>\n<p>Join us to transform the future through continuous technological innovation.</p>\n<p>As an Information Security Analyst, you will be an integral part of the Synopsys Corporate Information Security group, working within a mature Governance, Risk, and Compliance (GRC) Team. This team collaborates closely with the Director of Information Security, Manager of GRC, and stakeholders across the organization to raise the overall security and compliance posture for Synopsys.</p>\n<p>Your responsibilities will include:</p>\n<ul>\n<li>Identifying, documenting, monitoring, and reporting on risk register items, KPIs/KRIs, including the monitoring of security control efficacy.</li>\n<li>Demonstrating experience with governance, risk, and compliance tools.</li>\n<li>Working with security control frameworks such as ISO 27001, SOC 2 Type II, NIST 800-53, NIST CSF, and similar.</li>\n<li>Presenting security risks to a wide audience, including risk owners and other stakeholders.</li>\n<li>Interacting with Synopsys IT and business stakeholders to understand risks to critical infrastructure by defining potential business impact with the responsibility to apply effective mitigation strategies.</li>\n<li>Providing guidance on control implementations related to governance frameworks, regulations, and corporate security policies.</li>\n<li>Understanding of security functions including Incident Management, Change Management, Identity and Access Management, and Vendor Security Risk Management.</li>\n<li>Conducting third-party (vendor) risk assessments in collaboration with stakeholders.</li>\n<li>Providing security requirements to both internal partners and external third-party providers.</li>\n<li>Effectively communicating and working with a global team.</li>\n<li>Maintaining, enforcing, and tracking the Synopsys Information Security Exception process.</li>\n<li>Staying current with industry, regulatory, and legal requirements relevant to security, compliance, and privacy.</li>\n</ul>\n<p>You will be responsible for enhancing Synopsys&#39; overall security and compliance posture by building and improving the GRC portfolio. You will also enable and transform the risk management program to address the evolving cybersecurity threat landscape. Ensure regulatory compliance as the company continues to grow. Strengthen risk assessments of suppliers and partners, contributing to a robust security framework.</p>\n<p>To be successful in this role, you will need:</p>\n<ul>\n<li>A bachelor&#39;s degree in Computer Science, Information Systems, or a related field.</li>\n<li>Typically, 5-7 years of experience in a related field.</li>\n<li>Knowledge of common certification and attestation programs such as ISO 27001 and SOC 2 Type II, ISO 31000.</li>\n<li>Practical working experience with control frameworks like ISO 27001, NIST 800-53, SOC 2 Type II and NIST CSF.</li>\n<li>Excellent organizational skills with attention to detail and the ability to multitask for project prioritization.</li>\n<li>Effective communication skills with internal and external customers, executive managers, and team members.</li>\n<li>Ability to understand the intent of compliance requirements to provide effective and meaningful examination.</li>\n</ul>\n<p>We offer a comprehensive range of health, wellness, and financial benefits to cater to your needs. Our total rewards include both monetary and non-monetary offerings. Your recruiter will provide more details about the salary range and benefits during the hiring process.</p>\n<p>At Synopsys, we want talented people of every background to feel valued and supported to do their best work. Synopsys considers all applicants for employment without regard to race, color, religion, national origin, gender, sexual orientation, age, military veteran status, or disability.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_4f6e3d04-c70","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Synopsys","sameAs":"https://careers.synopsys.com","logo":"https://logos.yubhub.co/careers.synopsys.com.png"},"x-apply-url":"https://careers.synopsys.com/job/bengaluru/information-security-analyst-grc/44408/93409691360","x-work-arrangement":"onsite","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["governance, risk, and compliance","security control frameworks","ISO 27001","SOC 2 Type II","NIST 800-53","NIST CSF","incident management","change management","identity and access management","vendor security risk management"],"x-skills-preferred":[],"datePosted":"2026-04-05T13:16:53.710Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bengaluru"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"governance, risk, and compliance, security control frameworks, ISO 27001, SOC 2 Type II, NIST 800-53, NIST CSF, incident management, change management, identity and access management, vendor security risk management"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_03c6508d-2e2"},"title":"Senior Technical Account Manager - Engine by Starling","description":"<p>At Engine by Starling, we&#39;re on a mission to find and work with leading banks around the world who have the ambition to build rapid growth businesses on our technology. We&#39;re looking for a future leader to join our team as a Senior Technical Account Manager for Canada, based in Toronto.</p>\n<p>This is a rare opportunity to build something from the ground up. As Engine&#39;s inaugural Senior Technical Account Manager for Canada, you&#39;ll lead our entry into one of North America&#39;s most sophisticated and tightly regulated financial markets, anchored by a strategic partnership with a major Canadian financial institution.</p>\n<p>As a Senior Technical Account Manager, you&#39;ll serve as the primary executive-level technical advisor for our Canadian launch client, a well-known digital challenger bank backed by one of the Big Six. You&#39;ll cultivate a relationship that will define Engine&#39;s regional credibility for years to come.</p>\n<p>Your responsibilities will include:</p>\n<ul>\n<li>Driving the end-to-end technical success strategy for our Canadian launch client, with a clear focus on platform adoption, client satisfaction, and measurable business outcomes aligned to their Canadian market entry goals.</li>\n<li>Owning and localising Engine&#39;s global TAM playbook for the Canadian market, adapting frameworks, success metrics, and engagement models to reflect the regulatory and cultural context of Canadian financial services.</li>\n<li>Developing and executing joint success plans with the client, identifying and prioritising technical initiatives that accelerate their core use cases and long-term platform value.</li>\n<li>Leading Monthly and Quarterly Business Reviews (MBRs/QBRs), presenting strategic insights on platform performance, feature adoption, and forward-looking value realisation to VP and C-Suite stakeholders.</li>\n<li>Building and sustaining trusted advisory relationships at the executive level, translating complex platform capabilities into clear business value for Canadian market operations, and ensuring Engine is seen as a strategic partner, not a vendor.</li>\n</ul>\n<p>You&#39;ll also be responsible for:</p>\n<ul>\n<li>Owning the end-to-end Major Incident lifecycle for the Canadian client, driving cross-functional resolution with urgency, clear communication, and accountability across internal and client teams.</li>\n<li>Leading proactive Problem Management initiatives to identify systemic risks, address root causes, and reduce the frequency and impact of incidents over time.</li>\n<li>Serving as the Canadian client&#39;s primary escalation point for technical issues, maintaining composure and credibility under pressure in an environment where platform reliability directly affects financial operations.</li>\n</ul>\n<p>As a member of our global TAM community, you&#39;ll participate in a collaborative on-call rotation, working alongside TAMs in the UK and Australia to ensure our global client base always has access to expert support.</p>\n<p>Requirements:</p>\n<ul>\n<li>5+ years of progressive experience in Technical Account Management, Strategic Customer Success, or a comparable client-facing role.</li>\n<li>A demonstrated track record managing top-tier enterprise accounts or leading foundational client relationships in a new market or region.</li>\n<li>Proven ability to localise and execute a global strategy or methodology within a distinct regional context, adapting rather than replicating.</li>\n<li>Fluency with cloud-based SaaS platforms and APIs, with the ability to troubleshoot and resolve complex technical issues independently and with credibility.</li>\n<li>Executive presence: proven effectiveness building relationships with and influencing the technical strategy of C-level and VP-level stakeholders.</li>\n<li>Outstanding written and verbal communication skills, with a track record of translating complex technical concepts for non-technical executive audiences, confidently and concisely.</li>\n</ul>\n<p>Highly desirable:</p>\n<ul>\n<li>Direct experience working in, or supporting clients operating in, the Canadian financial services market.</li>\n<li>Familiarity with Canadian financial regulatory frameworks, including OSFI guidelines, FINTRAC reporting obligations, and relevant provincial regulatory requirements.</li>\n<li>Experience as a first-hire or market-entry employee, with a track record of building and scaling high-performing teams.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_03c6508d-2e2","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Starling","sameAs":"https://www.starlingbank.com/","logo":"https://logos.yubhub.co/starlingbank.com.png"},"x-apply-url":"https://apply.workable.com/j/365ADEC8FE","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Technical Account Management","Strategic Customer Success","Cloud-based SaaS platforms","APIs","Problem Management","Incident Management","Executive presence","Communication skills"],"x-skills-preferred":[],"datePosted":"2026-03-20T16:14:14.217Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Toronto"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"Technical Account Management, Strategic Customer Success, Cloud-based SaaS platforms, APIs, Problem Management, Incident Management, Executive presence, Communication skills"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_981e6f7e-ede"},"title":"Production Readiness Lead - Game Developer Experience (GDX)","description":"<p>Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter. A team where everyone makes play happen.</p>\n<p>The Electronic Arts Information Technology (EAIT) organization works as a global team to empower EA&#39;s employees and business operations to be creative, collaborative, and productive. As a digital entertainment company, EA&#39;s enterprise technology needs are diverse and span across game development, workforce collaboration, marketing, publishing, player experience, security, and corporate activities. Our mission is to bring creative technology services to each of these areas, working across the company to ensure better play.</p>\n<p>As part of the Game Developer Experience (GDX) organization, the Engineering and Operations team is building a structured, scalable operational lifecycle across GameKit. In this role, you will play a central part in shaping how operational excellence is embedded into product delivery from concept through launch and beyond.</p>\n<p>As the Product Readiness Lead, you will integrate operational standards directly into the Product Development Lifecycle (PDLC), ensuring that reliability, scalability, and support readiness are designed in, not added later. You will collaborate closely with Engineering, Product Management, Site Reliability Engineering (SRE), Customer Support, and Operations partners to help teams meet clearly defined expectations for observability, automation, documentation, and launch readiness.</p>\n<p>This is a hybrid role (3 days per week in the office) based in Vancouver, reporting to the Director of Operations and partnering broadly across the GameKit ecosystem to establish a repeatable, sustainable operational lifecycle model.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Enable a digital-first, automation-forward support strategy by ensuring products are designed with operational readiness from Day 0.</li>\n<li>Partner with product and engineering teams to embed automation, AI-enabled support capabilities, and agentic workflows into product designs before launch.</li>\n<li>Define and integrate standards for alerting, instrumentation, observability, runbooks, and workflow automation into the PDLC.</li>\n<li>Establish lifecycle checkpoints and measurable readiness indicators (e.g., MTTR, signal coverage, operational maturity).</li>\n<li>Lead structured operational readiness reviews and provide clear, actionable recommendations to support successful launches.</li>\n<li>Be the connector across teams, aligning technical and operational partners around shared reliability and support outcomes.</li>\n</ul>\n<p>Qualifications:</p>\n<ul>\n<li>8+ years of experience in Operations, Site Reliability Engineering (SRE), Technical Program Management, Platform Operations, or a related discipline.</li>\n<li>Demonstrated hands-on experience with Service Level Agreements (SLAs)/Service Level Objectives(SLOs), incident management, observability tooling, dashboards, and automation systems in large-scale, multi-product environments.</li>\n<li>Strong collaboration and influence skills, with the ability to work effectively across engineering, product, and operational teams.</li>\n<li>Experience driving operational consistency and continuous improvement in dynamic, technology-driven organizations.</li>\n</ul>\n<p>Pay Transparency - North America</p>\n<p>COMPENSATION AND BENEFITS</p>\n<p>The ranges listed below are what EA in good faith expects to pay applicants for this role in these locations at the time of this posting. If you reside in a different location, a recruiter will advise on the applicable range and benefits. Pay offered will be determined based on a number of relevant business and candidate factors (e.g. education, qualifications, certifications, experience, skills, geographic location, or business needs).</p>\n<p>PAY RANGES</p>\n<p>• British Columbia (depending on location e.g. Vancouver vs. Victoria) $130,800 - $183,000 CAD</p>\n<p>Pay is just one part of the overall compensation at EA.</p>\n<p>For Canada, we offer a package of benefits including vacation (3 weeks per year to start), 10 days per year of sick time, paid top-up to EI/QPIP benefits up to 100% of base salary when you welcome a new child (12 weeks for maternity, and 4 weeks for parental/adoption leave), extended health/dental/vision coverage, life insurance, disability insurance, retirement plan to regular full-time employees. Certain roles may also be eligible for bonus and equity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_981e6f7e-ede","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Electronic Arts","sameAs":"https://jobs.ea.com","logo":"https://logos.yubhub.co/jobs.ea.com.png"},"x-apply-url":"https://jobs.ea.com/en_US/careers/JobDetail/Production-Readiness-Lead-Game-Developer-Experience-GDX/212677","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$130,800 - $183,000 CAD","x-skills-required":["Service Level Agreements (SLAs)","Service Level Objectives (SLOs)","incident management","observability tooling","dashboards","automation systems"],"x-skills-preferred":[],"datePosted":"2026-03-10T12:18:03.330Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Vancouver"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Service Level Agreements (SLAs), Service Level Objectives (SLOs), incident management, observability tooling, dashboards, automation systems","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":130800,"maxValue":183000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_a3e7e545-094"},"title":"FBS Data Production Support Analyst (Data Pipelines)","description":"<p><strong>Role Overview</strong></p>\n<p>The purpose of this role is to ensure smooth operations of our production data assets. Activities will include monitoring production systems for incident occurrence, alerting applicable parties when incidents arise and incident triaging and management. They will also carry out activities to prevent production incidents.</p>\n<p><strong>Key Responsibilities</strong></p>\n<ul>\n<li>Work with Data Pipelines, handling incidents and RCA</li>\n<li>Administers, analyzes, and prioritizes systems issues and negotiates a course of action for resolution.</li>\n<li>Supports workflow and solutions; trouble shoots user errors and supports reporting capabilities.</li>\n<li>Utilizes system monitoring utilities to monitor system availability.</li>\n<li>Extracts and compiles data system monitoring data to create availability scorecards and reports.</li>\n<li>System Monitoring: Continuously monitor IT systems to ensure optimal performance and availability, identifying and addressing potential issues before they escalate.</li>\n<li>Monitoring and Maintenance: Regularly monitor production data assets to ensure they are functioning correctly and efficiently.  Alerting applicable parties if an issue arises in production.</li>\n<li>Issue Resolution: Work with data team to identify, diagnose, and resolve technical issues related to production data assets. Work with relevant teams to implement effective solutions.</li>\n<li>Incident Management: Manage and prioritize incidents, ensuring that they are resolved promptly and efficiently and follow the incident management process. Document incidents and resolutions for future reference.</li>\n<li>Incident Management: Respond to and resolve technical issues reported by users or automated monitoring alerts. This includes diagnosing problems, identifying solutions, and implementing fixes.</li>\n<li>Problem Analysis: Analyze recurring issues to identify root causes and implement long-term solutions to prevent future occurrences.</li>\n<li>Root Cause Analysis: Conduct thorough investigations to determine the underlying causes of recurring incidents and implement preventive measures.</li>\n<li>Preventative Measures: Identify incidents that recur and put solutions in place to prevent recurrence.</li>\n<li>Data Integrity: Work with data team to ensure the accuracy and integrity of data produced and provided to the business, work with the data teams to implement and maintain quality control measures to prevent errors.</li>\n<li>Documentation: Maintain comprehensive documentation of processes, system configurations, and troubleshooting procedures.  Ensure documentation is created and owned be it by the data team or the production support team.</li>\n<li>Support: Provide support to data teams, data users and stakeholders. Respond to inquiries and assist with requests as applicable.</li>\n<li>Optimization: Identify opportunities to optimize data production processes and implement improvements to enhance efficiency.</li>\n<li>Performance Optimization: Analyze system performance and identify areas for improvement. Suggest and implement changes to enhance system efficiency and reliability.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_a3e7e545-094","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Capgemini","sameAs":"https://jobs.workable.com","logo":"https://logos.yubhub.co/view.com.png"},"x-apply-url":"https://jobs.workable.com/view/ffvEvDAAYzjgBfJeCMdK9E/remote-fbs-data-production-support-analyst-(data-pipelines)-in-mexico-at-capgemini","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Data Pipelines","Incident Management","System Monitoring","Data Integrity","Documentation","Problem Analysis","Root Cause Analysis","Preventative Measures","SQL","Python","Java"],"x-skills-preferred":["ETL processes","Database management","Data warehousing"],"datePosted":"2026-03-09T17:03:31.282Z","jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"IT","industry":"Finance","skills":"Data Pipelines, Incident Management, System Monitoring, Data Integrity, Documentation, Problem Analysis, Root Cause Analysis, Preventative Measures, SQL, Python, Java, ETL processes, Database management, Data warehousing"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0b0a4da9-3f0"},"title":"Apl Sub Matter Expert II","description":"<p><strong>Role Description</strong></p>\n<p>The Application Subject Matter Expert II provides hands-on functional and technical expertise for Commercial IT business applications in a live production environment. This role focuses on production support, impact analysis, supplier deliverable validation, and audit support.</p>\n<p><strong>Requirements</strong></p>\n<p><strong>Skills and Capabilites</strong></p>\n<ul>\n<li>Application &amp; Production Support Expertise - Intermediate</li>\n<li>Business Process &amp; Commercial Insurance Domain Understanding - Intermediate</li>\n<li>Analytical Thinking &amp; Impact Analysis - Intermediate</li>\n<li>Communication &amp; Collaboration - Advanced</li>\n<li>Audit &amp; Compliance Awareness - Intermediate</li>\n<li>Operational Readiness &amp; Support Commitment - Intermediate</li>\n</ul>\n<p><strong>Software / Tool Skills</strong></p>\n<ul>\n<li>Commercial insurance applications - Entry Level (1-3 Years)</li>\n<li>Production support and incident management tools - Intermediate (4-6 Years)</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Competitive compensation and benefits package:</li>\n</ul>\n<ol>\n<li>Competitive salary and performance-based bonuses</li>\n<li>Comprehensive benefits package</li>\n<li>Career development and training opportunities</li>\n<li>Flexible work arrangements (remote and/or office-based)</li>\n<li>Dynamic and inclusive work culture within a globally renowned group</li>\n<li>Private Health Insurance</li>\n<li>Pension Plan</li>\n<li>Paid Time Off</li>\n<li>Training &amp; Development</li>\n</ol>\n<p>Note: Benefits differ based on employee level.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0b0a4da9-3f0","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Capgemini","sameAs":"https://jobs.workable.com","logo":"https://logos.yubhub.co/view.com.png"},"x-apply-url":"https://jobs.workable.com/view/jfmqQoMt97gTReDgsBCuon/hybrid-apl-sub-matter-expert-ii-in-hyderabad-at-capgemini","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Application & Production Support Expertise","Business Process & Commercial Insurance Domain Understanding","Analytical Thinking & Impact Analysis","Communication & Collaboration","Audit & Compliance Awareness","Operational Readiness & Support Commitment","Commercial insurance applications","Production support and incident management tools"],"x-skills-preferred":[],"datePosted":"2026-03-09T16:59:34.643Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Hyderabad, Telangana, India"}},"employmentType":"FULL_TIME","occupationalCategory":"IT","industry":"Technology","skills":"Application & Production Support Expertise, Business Process & Commercial Insurance Domain Understanding, Analytical Thinking & Impact Analysis, Communication & Collaboration, Audit & Compliance Awareness, Operational Readiness & Support Commitment, Commercial insurance applications, Production support and incident management tools"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d2955c92-774"},"title":"Network Security Engineering Enterprise Architect (GSR8)","description":"<p>As a Network Security Engineering Enterprise Architect (GSR8), you will be a technical lead supporting Ford&#39;s complete Enterprise Network &amp; Security architecture transformation. You will be taking care of dynamics for Network Security Engineering Products (Security Firewalls, Proxy, ISE, SDN Networks, Wireless) team to a centre of technical excellence and customer Advocacy.</p>\n<p>You will identify, analyse, and resolve existing network security design weaknesses and vulnerabilities which could possess risk to existing infrastructure. Expert in closing zero-day security vulnerabilities taking along all infrastructure domain teams which could impact Ford&#39;s reputation across globe.</p>\n<p>As a Network Security Engineering enterprise architect, you would lead future network security product development by contributing to the network Design (architecture) and Automation used across multiple Engineering Branches, Data Centres, Manufacturing Plants and Remote users.</p>\n<p>This Role requires defining road map for ZTNA/SASE deployment using Prisma Access/Cloud, setup support model, automation to accelerate end user experience. The Global Network Security Engineering enterprise architect is responsible for successful setup of the products by working closely with Software developers from Ford and OEMs in consultation with Ford&#39;s Network and Security Operations Team.</p>\n<p>This position will be part of Ford&#39;s Enterprise Tech department and will report to the Regional Network Delivery Manager, based in same or another region. The lead needs to ensure &#39;Always On&#39; (24 x 7) availability of Ford Global Network Product offerings, working with Network &amp; Security Peers from other regions.</p>\n<p><strong>Responsibilities</strong></p>\n<p>This role will also be driving towards supporting full observability and Monitoring, process response, and technical capability to ensure customer up time of 99.999%+. This position requires a wide range of skills and experience,</p>\n<ul>\n<li>This role involves collaborating closely with the network operations team to identify continuous improvement opportunities and working with the network engineering team and OEMs to devise and implement solutions. The implementation will be driven through automation in partnership with Ford&#39;s developers.</li>\n<li>Design and implement robust security architectures and frameworks to protect against threats and vulnerabilities.</li>\n<li>Ensure timely proactive identification and reporting of security gaps and vulnerabilities to the critical business information, systems and network infrastructure.</li>\n<li>Plan for End-to-end Network &amp; Security projects implementation.</li>\n</ul>\n<p><strong>Qualifications</strong></p>\n<ul>\n<li>Support the Major technical Incident Management Calls and Change Controls through STRONG Technical Network Knowledge, Operational capability, and strong communication skills.</li>\n<li>Perform configuration updates, such as modifying configurations, signature definitions or implementing new policies on various network security tools, as directed.</li>\n<li>Demonstrate technical excellence through technical knowledge.</li>\n<li>Collaborate with global leaders to support 24/7 network availability on a worldwide scale.</li>\n<li>Advocate and ensure that high quality Follow the Sun (FTS) is delivered to receiving teams. As well as support on-call schedule and shifts are available.</li>\n<li>Support continuous improvement in service management for Network Services leveraging enterprise tools and processes (Incident, Problem &amp; Change) and focusing on customer value optimization.</li>\n<li>Supports implement best practices and processes for Network &amp; Security Operations services to maintain availability, reliability, scalability, and security.</li>\n<li>Support for effective SRE Monitoring and FSO (Full Stack Observability) on system performance and overall health, troubleshoot issues, and implement corrective actions.</li>\n<li>Collaborate with the Network LAN/WAN &amp; security Engineering/development teams to optimize infrastructure for application performance and scalability.</li>\n<li>Support team members to achieve technical network excellence thru experience, and network Certifications and support training requirements.</li>\n<li>Able to support the team to develop continued improvements leading to an &#39;always on network capability.</li>\n<li>Be able to leverage other network management tools used by the NOC in the identification and response to security connectivity incidents and faults.</li>\n<li>Develop security policies, standards, and procedures.</li>\n<li>Assist with security compliance audits to verify completeness of required configurations and verify system hardening.</li>\n<li>Participate in the problem investigation connectivity incidents related to security devices, provide recommendations to improve reliability and availability, or reduce recovery time.</li>\n<li>Support assurance of up-to-date SW releases, targeted LDOS, and PSIRTS (security updates).</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d2955c92-774","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Ford","sameAs":"https://efds.fa.em5.oraclecloud.com"},"x-apply-url":"https://efds.fa.em5.oraclecloud.com/hcmUI/CandidateExperience/en/sites/CX_1/job/56878","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Network Security Engineering","Enterprise Architecture","Security Firewalls","Proxy","ISE","SDN Networks","Wireless","Prisma Access/Cloud","ZTNA/SASE","Automation","Network Design","Network Security","Security Operations","Incident Management","Change Controls","Technical Knowledge","Global Leadership","Follow the Sun","SRE Monitoring","FSO","Full Stack Observability","System Performance","Network Certifications","Training Requirements"],"x-skills-preferred":[],"datePosted":"2026-03-09T10:59:07.843Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Chennai, Tamil Nadu, India"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Automotive","skills":"Network Security Engineering, Enterprise Architecture, Security Firewalls, Proxy, ISE, SDN Networks, Wireless, Prisma Access/Cloud, ZTNA/SASE, Automation, Network Design, Network Security, Security Operations, Incident Management, Change Controls, Technical Knowledge, Global Leadership, Follow the Sun, SRE Monitoring, FSO, Full Stack Observability, System Performance, Network Certifications, Training Requirements"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d447da6b-8eb"},"title":"Site Reliability Engineer","description":"<p>Join Razer on a global mission to revolutionize the way the world games. As a Site Reliability Engineer, you will be part of Razer Gold&#39;s growing infrastructure and platform engineering team. We are seeking a skilled and driven individual with hands-on experience in Amazon Web Services (AWS), strong troubleshooting capabilities, and a passion for building scalable, observable, and resilient systems using modern Infrastructure as Code (IaC) and automation tools.</p>\n<p><strong>Job Responsibilities</strong></p>\n<p><strong>Design, Develop, and Maintain Infrastructure as Code (IaC)</strong></p>\n<p>Design, develop, and maintain Infrastructure as Code (IaC) using tools like Terraform or AWS CloudFormation.</p>\n<p><strong>Implement and Operate Reliable, Scalable Cloud Infrastructure</strong></p>\n<p>Implement and operate reliable, scalable cloud infrastructure primarily on AWS (e.g., EC2, ECS, RDS, S3, Lambda, ElastiCache, SQS, SES, Auto Scaling, Load Balancers).</p>\n<p><strong>Lead and Participate in Architecture Reviews</strong></p>\n<p>Lead and participate in architecture reviews focusing on reliability, scalability, security, and performance.</p>\n<p><strong>Develop and Manage Robust Monitoring, Alerting, and Logging Solutions</strong></p>\n<p>Develop and manage robust monitoring, alerting, and logging solutions (e.g., CloudWatch, Prometheus, Grafana, ELK, etc.) to detect and resolve issues proactively.</p>\n<p><strong>Perform Incident Management</strong></p>\n<p>Perform incident management, postmortems, root cause analysis, and implement continuous improvement strategies.</p>\n<p><strong>Collaborate with Software Engineering Teams</strong></p>\n<p>Collaborate with software engineering teams to improve CI/CD pipelines, deployment automation, and release management.</p>\n<p><strong>Automate Infrastructure Operations</strong></p>\n<p>Automate infrastructure operations, reduce manual toil, and improve reliability using scripting (Python, Bash, Node.js, or Ruby).</p>\n<p><strong>Maintain and Troubleshoot Environments</strong></p>\n<p>Maintain and troubleshoot environments involving web servers, databases, firewalls, DNS, load balancers, and networking.</p>\n<p><strong>Ensure Systems Compliance</strong></p>\n<p>Ensure systems are compliant with security standards, including patching, hardening, and secure access policies.</p>\n<p><strong>Provide On-Call Support</strong></p>\n<p>Provide on-call support, participate in incident rotations.</p>\n<p><strong>Monitor and Maintain Service-Level Objectives (SLOs)</strong></p>\n<p>Monitor and maintain service-level objectives (SLOs), SLAs, and error budgets to ensure reliability targets are met.</p>\n<p><strong>Provide Support and Solution Handling</strong></p>\n<p>Provide support and solution handling to incident and tickets assigned.</p>\n<p><strong>Pre-Requisites</strong></p>\n<ul>\n<li>Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.</li>\n<li>Minimum 2 years of experience in SRE, DevOps, Cloud Infrastructure, or Systems Administration roles.</li>\n<li>Solid hands-on experience with AWS Cloud services including (but not limited to):</li>\n<li>Compute: EC2, Lambda, ECS, Auto Scaling</li>\n<li>Networking: VPC, Load Balancers, Route 53</li>\n<li>Messaging &amp; Storage: SQS, S3, RDS, ElastiCache, SES</li>\n<li>Monitoring: CloudWatch, X-Ray</li>\n<li>Proficient in Infrastructure as Code using Terraform and/or CloudFormation.</li>\n<li>Experience with CI/CD tools (e.g., GitLab CI, Jenkins, CodePipeline, ArgoCD).</li>\n<li>Strong understanding of Linux and Windows system administration and troubleshooting.</li>\n<li>Comfortable with one or more scripting/programming languages such as Python, Node.js, Bash, Ruby, or JSON/YAML for automation.</li>\n<li>Strong grasp of network fundamentals, including DNS, HTTP(S), TLS/SSL, firewalls, and TCP/IP.</li>\n<li>Experience with containerization and orchestration (Docker, ECS, or Kubernetes is a plus).</li>\n<li>Familiar with observability tools and incident management best practices.</li>\n<li>Vietnamese citizen based in Ho Chi Minh City.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d447da6b-8eb","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Razer","sameAs":"https://razer.wd3.myworkdayjobs.com","logo":"https://logos.yubhub.co/razer.com.png"},"x-apply-url":"https://razer.wd3.myworkdayjobs.com/en-US/Careers/job/Ho-Chi-Minh/Site-Reliability---Engineering-2_JR2026006886","x-work-arrangement":"onsite","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["AWS","Terraform","CloudFormation","CI/CD","Linux","Windows","Python","Node.js","Bash","Ruby","JSON/YAML","DNS","HTTP(S)","TLS/SSL","firewalls","TCP/IP","containerization","orchestration","observability","incident management"],"x-skills-preferred":[],"datePosted":"2026-03-09T10:57:59.262Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Ho Chi Minh"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"AWS, Terraform, CloudFormation, CI/CD, Linux, Windows, Python, Node.js, Bash, Ruby, JSON/YAML, DNS, HTTP(S), TLS/SSL, firewalls, TCP/IP, containerization, orchestration, observability, incident management"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_eebf21c4-d1f"},"title":"Staff Site Reliability Engineer","description":"<p>Join our Site Reliability Engineering (SRE) team and help ensure the reliability, scalability, and performance of Replit&#39;s infrastructure that serves millions of developers worldwide.</p>\n<p>As a Staff Site Reliability Engineer, you will bridge the gap between development and operations, implementing automation and establishing best practices that enable our platform to scale efficiently while maintaining high availability.</p>\n<p>We are seeking Staff SREs who are passionate about building and maintaining resilient systems at scale. Your mission will be to proactively find and analyze reliability problems across our stack, then design and implement software and systems to create step-function improvements.</p>\n<p>You will design robust observability solutions, lead incident response, automate operational tasks, and continuously improve our infrastructure&#39;s reliability, all while mentoring and educating the broader engineering team to make reliability a core value at Replit.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Architect and Implement Observability: Design, build, and lead the implementation of comprehensive monitoring, logging, and tracing solutions. Create dashboards and metrics that provide real-time visibility into system health and performance, enabling proactive issue detection.</li>\n</ul>\n<ul>\n<li>Define and Drive Reliability Standards: Work with product and engineering teams to define, implement, and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Build systems to monitor and report on these metrics, holding teams accountable and ensuring we maintain high reliability standards while balancing innovation speed.</li>\n</ul>\n<ul>\n<li>Lead Incident Management and Response: Act as a senior leader during high-impact incidents, guiding the team to rapid resolution. Conduct thorough, blameless post-mortems and drive the implementation of preventative measures. Develop and refine runbooks and build automation to reduce Mean Time To Recovery (MTTR).</li>\n</ul>\n<ul>\n<li>Drive Automation and Infrastructure as Code: Architect, build, and improve automation to eliminate toil and operational work. Design and maintain CI/CD pipelines and infrastructure automation using tools like Terraform or Pulumi. Create self-healing systems that can automatically respond to common failure scenarios.</li>\n</ul>\n<ul>\n<li>Optimize Performance on Kubernetes: Collaborate with core infrastructure and product teams to performance-tune and optimize our large-scale cloud deployments, with a deep focus on Kubernetes, Docker, and GCP. Identify and resolve performance bottlenecks, implement capacity planning strategies, and reduce latency across global regions.</li>\n</ul>\n<ul>\n<li>Debug and Harden Distributed Systems: Dive deep into debugging extremely difficult technical problems across the stack. Use your findings to design and implement long-term fixes that make our systems and products more robust, operable, and easier to diagnose.</li>\n</ul>\n<ul>\n<li>Provide Staff-Level Guidance: Review feature and system designs from across the company, acting as a key owner for the reliability, scalability, security, and operational integrity of those designs.</li>\n</ul>\n<ul>\n<li>Educate and Mentor: Educate, mentor, and hold accountable the broader engineering team to improve the reliability of our systems, making reliability a core value of the Replit engineering culture.</li>\n</ul>\n<ul>\n<li>Build and Integrate: Write high-quality, well-tested code in Python or Go to meet the needs of your customers, whether it&#39;s building new internal tools or integrating with third-party vendors.</li>\n</ul>\n<p><strong>Required Skills and Experience</strong></p>\n<ul>\n<li>8-10 years of experience in Site Reliability Engineering or similar roles (e.g., DevOps, Systems Engineering, Infrastructure Engineering).</li>\n</ul>\n<ul>\n<li>Strong programming skills in languages like Python or Go. You write high-quality, well-tested code.</li>\n</ul>\n<ul>\n<li>Deep understanding of distributed systems. You’ve designed, built, scaled, and maintained production services and know how to compose a service-oriented architecture.</li>\n</ul>\n<ul>\n<li>Deep experience with container orchestration platforms, specifically Kubernetes, and cloud-native technologies.</li>\n</ul>\n<ul>\n<li>Proven track record of designing, implementing, and maintaining sophisticated monitoring and observability solutions (e.g., metrics, logging, tracing).</li>\n</ul>\n<ul>\n<li>Strong incident management skills with extensive experience leading incident response for complex systems and demonstrated critical thinking under pressure.</li>\n</ul>\n<ul>\n<li>Experience with infrastructure as code (e.g., Terraform, Pulumi) and configuration management tools.</li>\n</ul>\n<ul>\n<li>Excellent written and verbal communication skills, with an ability to explain complex technical concepts clearly and simply and a bias toward open, transparent cultural practices.</li>\n</ul>\n<ul>\n<li>Strong interpersonal skills, with experience working with and mentoring engineers from junior to principal levels.</li>\n</ul>\n<ul>\n<li>A willingness to dive into understanding, debugging, and improving any layer of the stack.</li>\n</ul>\n<ul>\n<li>You&#39;re passionate about making software creation accessible and empowering the next generation of builders.</li>\n</ul>\n<p><strong>Bonus Points</strong></p>\n<ul>\n<li>Deep experience with Google Cloud Platform (GCP) services and tools.</li>\n</ul>\n<ul>\n<li>Expert-level knowledge of modern observability platforms (e.g., Prometheus, Grafana, Datadog, OpenTelemetry).</li>\n</ul>\n<ul>\n<li>Experience designing and building reliable systems capable of handling high throughput and low latency.</li>\n</ul>\n<ul>\n<li>Significant experience with Go and Terraform.</li>\n</ul>\n<ul>\n<li>Familiarity with working in rapid-growth, startup environments.</li>\n</ul>\n<ul>\n<li>Experience writing company-facing blog posts and training materials.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_eebf21c4-d1f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Replit","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/replit.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/replit/d50ad15b-82d4-452f-b4ea-2a7f5e796170","x-work-arrangement":"remote","x-experience-level":"staff","x-job-type":"Full time","x-salary-range":"$220K - $325K","x-skills-required":["Site Reliability Engineering","DevOps","Systems Engineering","Infrastructure Engineering","Python","Go","Distributed Systems","Container Orchestration","Kubernetes","Cloud-Native Technologies","Monitoring and Observability","Incident Management","Infrastructure as Code","Terraform","Pulumi","Configuration Management"],"x-skills-preferred":["Google Cloud Platform","Prometheus","Grafana","Datadog","OpenTelemetry","Go","Terraform"],"datePosted":"2026-03-08T22:20:23.639Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote (United States)"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Site Reliability Engineering, DevOps, Systems Engineering, Infrastructure Engineering, Python, Go, Distributed Systems, Container Orchestration, Kubernetes, Cloud-Native Technologies, Monitoring and Observability, Incident Management, Infrastructure as Code, Terraform, Pulumi, Configuration Management, Google Cloud Platform, Prometheus, Grafana, Datadog, OpenTelemetry, Go, Terraform","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":220000,"maxValue":325000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_1f6d8d36-cd5"},"title":"Data Center Incident Program Manager","description":"<p><strong>Compensation</strong></p>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The salary range is $125.6K – $228K. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p><strong>About the Team:</strong></p>\n<p>OpenAI, in close collaboration with our capital partners, is embarking on a journey to build the world’s most advanced AI infrastructure ecosystem. Our Stargate program develops and deploys massive, state-of-the-art data center campuses in partnership with industry leaders such as Oracle today—and through future OpenAI infrastructure projects tomorrow. We design for scale, speed, and reliability, and we need experienced hardware professionals who can help ensure our high-density compute environment operates at peak performance.</p>\n<p><strong>About the Role:</strong></p>\n<p>The Data Center Incident Program Manager is responsible for designing, operating, and continuously improving the end-to-end incident management lifecycle across mission-critical data center environments. This role owns the “before, during, and after” mechanics of incidents — establishing standards and playbooks in steady state, serving as (or designating) Incident Commander during active events, and driving structured post-incident review and corrective action to closure.</p>\n<p><strong>In this role you will:</strong></p>\n<ul>\n<li>Define and maintain incident severity levels (SEV definitions), classification criteria, and escalation thresholds.</li>\n</ul>\n<ul>\n<li>Establish end-to-end incident response standards: protocols, lifecycle stages (declare → stabilize → mitigate → recover → close), and operating cadence.</li>\n</ul>\n<ul>\n<li>Build and maintain governance artifacts: runbooks, war room formats, reporting templates, and decision/communication standards.</li>\n</ul>\n<ul>\n<li>Create and operationalize notification trees, stakeholder comms templates (initial, periodic updates, recovery/closure), and executive escalation criteria.</li>\n</ul>\n<ul>\n<li>Define clear RACI across Facilities, Hardware Ops, Network, Security, and vendor/partner teams, including handoffs and accountability paths.</li>\n</ul>\n<ul>\n<li>Set and manage SLAs/OLAs for acknowledgment, escalation, containment, mitigation, and reporting.</li>\n</ul>\n<ul>\n<li>Implement and run incident management tooling (ticketing, paging, logging) and ensure integrations with monitoring and workflow systems.</li>\n</ul>\n<ul>\n<li>Establish dashboards and program health metrics to track incident performance and readiness.</li>\n</ul>\n<ul>\n<li>Lead readiness activities: tabletop exercises, cross-functional simulations, IC/Deputy training, and a rotating on-call IC bench with certification standards.</li>\n</ul>\n<ul>\n<li>Serve as Incident Commander as needed: declare severity, stand up the war room, assign functional leads, and drive structured execution under pressure.</li>\n</ul>\n<ul>\n<li>Maintain real-time documentation (decisions, timelines, impact scope) and ensure clear restoration objectives and scope control during active events.</li>\n</ul>\n<ul>\n<li>Run post-incident reviews (PIRs), validate timelines, drive structured RCA (e.g., 5 Whys, Fault Tree), and separate root cause vs contributing factors.</li>\n</ul>\n<ul>\n<li>Define corrective/preventative actions (CAPAs), assign accountable owners, track to verified closure, and escalate overdue actions.</li>\n</ul>\n<ul>\n<li>Publish trend reporting (incident taxonomy, counts by severity, MTTA/MTTR, repeat failure domains) and feed systemic gaps back into design and operations teams.</li>\n</ul>\n<p><strong>You might thrive in this role if you:</strong></p>\n<ul>\n<li>7+ years in mission-critical infrastructure, data center operations, or reliability engineering</li>\n</ul>\n<ul>\n<li>Direct experience leading major incidents (P1/P0 equivalent)</li>\n</ul>\n<ul>\n<li>Strong familiarity with facilities systems, hardware operations, or network infrastructure</li>\n</ul>\n<ul>\n<li>Demonstrated experience running war rooms and executive updates</li>\n</ul>\n<ul>\n<li>Experience conducting root cause analysis and corrective action tracking</li>\n</ul>\n<ul>\n<li>Ability to remain calm and decisive under high-pressure conditions</li>\n</ul>\n<p><strong>Preferred Skills:</strong></p>\n<ul>\n<li>Experience in hyperscale or high-density AI compute environments</li>\n</ul>\n<ul>\n<li>Background in facilities commissioning, facility operations, hardware operations, or network reliability</li>\n</ul>\n<ul>\n<li>Familiarity with ISO-based quality systems or structured operational documentation frameworks</li>\n</ul>\n<ul>\n<li>Experience implementing incident tooling (PagerDuty, ServiceNow, Jira, etc.)</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_1f6d8d36-cd5","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/16aaa47f-596d-4bbd-a02a-b03db3f40c23","x-work-arrangement":"Remote","x-experience-level":"senior","x-job-type":"Full time","x-salary-range":"$125.6K – $228K","x-skills-required":["incident management","data center operations","reliability engineering","facilities systems","hardware operations","network infrastructure","root cause analysis","corrective action tracking"],"x-skills-preferred":["hyperscale","high-density AI compute environments","facilities commissioning","facility operations","ISO-based quality systems","structured operational documentation frameworks","incident tooling"],"datePosted":"2026-03-08T22:17:57.466Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote - US"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"incident management, data center operations, reliability engineering, facilities systems, hardware operations, network infrastructure, root cause analysis, corrective action tracking, hyperscale, high-density AI compute environments, facilities commissioning, facility operations, ISO-based quality systems, structured operational documentation frameworks, incident tooling","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":125600,"maxValue":228000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_8c164f95-f8d"},"title":"Senior Infrastructure Engineer","description":"<p>Join our Infrastructure Engineering team and help ensure the reliability, scalability, and performance of Replit&#39;s infrastructure that serves millions of developers worldwide. As a Senior Infrastructure Engineer, you will bridge the gap between development and operations, implementing automation and establishing best practices that enable our platform to scale efficiently while maintaining high availability.</p>\n<p>We are seeking Senior Infrastructure Engineers who are passionate about building and maintaining resilient systems at scale. Your mission will be to proactively find and analyse reliability problems across our stack, then design and implement software and systems to address them. You will build robust monitoring solutions, automate operational tasks, and continuously improve our infrastructure&#39;s reliability.</p>\n<p><strong>You Will:</strong></p>\n<ul>\n<li>Drive Automation and Infrastructure as Code: Build and improve automation to eliminate toil and operational work. Maintain CI/CD pipelines and infrastructure automation using tools like Terraform or Pulumi. Create self-healing systems that can automatically respond to common failure scenarios.</li>\n<li>Optimise Performance and Infrastructure: Collaborate with core infrastructure and product teams to performance tune and optimise our cloud deployments (Kubernetes, Docker, GCP). Identify and resolve performance bottlenecks and implement capacity planning strategies.</li>\n<li>Elevate Developer Experience: Design and implement improvements to our build, test, and deployment systems to make software delivery faster, safer, and more reliable for all engineers.</li>\n<li>Drive Cross-Team Improvements: Partner with service owners across Replit to understand their pain points, and collaborate on implementing build/test/deploy enhancements within their specific services.</li>\n<li>Build Shared Tooling: Create and maintain centralized tooling and automation that improves the engineering lifecycle, from local development to production monitoring.</li>\n<li>Debug and Harden Systems: Dive deep into debugging difficult technical problems, making our systems and products more robust, operable, and easier to diagnose.</li>\n<li>Collaborate on Design Reviews: Participate in feature and system design reviews, contributing expertise on security, scale, and operational considerations.</li>\n<li>Build and Integrate: Write high-quality, well-tested code to meet the needs of your customers, including building pipelines to integrate with 3rd party vendors.</li>\n</ul>\n<p><strong>Required Skills and Experience:</strong></p>\n<ul>\n<li>4+ years of experience in Site Reliability Engineering or similar roles (DevOps, Systems Engineering, Infrastructure Engineering).</li>\n<li>Strong programming skills in languages like Python or Go.</li>\n<li>You write high-quality, well-tested code.</li>\n<li>Solid understanding of distributed systems. You&#39;ve built, scaled, and maintained production services and understand service-oriented architecture.</li>\n<li>Experience with container orchestration platforms (Kubernetes) and cloud-native technologies.</li>\n<li>Experience implementing and maintaining monitoring/observability solutions, with strong skills in debugging and performance tuning.</li>\n<li>Strong incident management skills with experience participating in incident response and demonstrated critical thinking under pressure.</li>\n<li>Experience with infrastructure as code (e.g., Terraform) and configuration management tools.</li>\n<li>Excellent written and verbal communication skills, with an ability to explain technical concepts clearly.</li>\n<li>A willingness to dive into understanding, debugging, and improving any layer of the stack.</li>\n<li>You&#39;re passionate about making software creation accessible and empowering the next generation of builders.</li>\n</ul>\n<p><strong>Bonus Points:</strong></p>\n<ul>\n<li>Experience with Google Cloud Platform (GCP) services and tools.</li>\n<li>Knowledge of modern observability platforms (Prometheus, Grafana, Datadog, etc.).</li>\n<li>Experience building reliable systems capable of handling high throughput and low latency.</li>\n<li>Experience with Go and Terraform.</li>\n<li>Familiarity with working in rapid-growth environments.</li>\n</ul>\n<p>_This is a full-time role that can be held from our Foster City, CA office. The role has an in-office requirement of Monday, Wednesday, and Friday._</p>\n<p><strong>Full-Time Employee Benefits Include:</strong></p>\n<ul>\n<li>Competitive Salary &amp; Equity</li>\n<li>401(k) Program with a 4% match</li>\n<li>Health, Dental, Vision and Life Insurance</li>\n<li>Short Term and Long Term Disability</li>\n<li>Paid Parental, Medical, Caregiver Leave</li>\n<li>Commuter Benefits</li>\n<li>Monthly Wellness Stipend</li>\n<li>Autonomous Work Environment</li>\n<li>In Office Set-Up Reimbursement</li>\n<li>Flexible Time Off (FTO) + Holidays</li>\n<li>Quarterly Team Gatherings</li>\n<li>In Office Amenities</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_8c164f95-f8d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Replit","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/replit.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/replit/16c85abc-763c-4f36-ab67-64f416343384","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$190K - $240K","x-skills-required":["Site Reliability Engineering","DevOps","Systems Engineering","Infrastructure Engineering","Python","Go","Terraform","Kubernetes","Docker","GCP","Monitoring/observability solutions","Debugging and performance tuning","Incident management","Infrastructure as code","Configuration management tools"],"x-skills-preferred":["Google Cloud Platform (GCP) services and tools","Modern observability platforms (Prometheus, Grafana, Datadog, etc.)","Building reliable systems capable of handling high throughput and low latency","Go and Terraform","Familiarity with working in rapid-growth environments"],"datePosted":"2026-03-07T15:20:28.138Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Foster City, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Site Reliability Engineering, DevOps, Systems Engineering, Infrastructure Engineering, Python, Go, Terraform, Kubernetes, Docker, GCP, Monitoring/observability solutions, Debugging and performance tuning, Incident management, Infrastructure as code, Configuration management tools, Google Cloud Platform (GCP) services and tools, Modern observability platforms (Prometheus, Grafana, Datadog, etc.), Building reliable systems capable of handling high throughput and low latency, Go and Terraform, Familiarity with working in rapid-growth environments","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":190000,"maxValue":240000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_b7de618e-5e1"},"title":"Site Reliability Engineer","description":"<p>Join our Site Reliability Engineering team and help ensure the reliability, scalability, and performance of Replit&#39;s infrastructure that serves millions of developers worldwide. As a Site Reliability Engineer, you will bridge the gap between development and operations, implementing automation and establishing best practices that enable our platform to scale efficiently while maintaining high availability.</p>\n<p>We are seeking SREs who are passionate about building and maintaining resilient systems at scale. Your mission will be to design and implement robust monitoring solutions, automate operational tasks, and continuously improve our infrastructure&#39;s reliability and performance.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Design and Implement Observability Solutions: Develop comprehensive monitoring and alerting systems using modern observability tools. Create dashboards and metrics that provide real-time visibility into system health and performance. Implement logging strategies that enable quick problem identification and resolution.</li>\n</ul>\n<ul>\n<li>Drive Automation and Infrastructure as Code: Architect and implement infrastructure automation solutions using tools like Terraform, Ansible, or Pulumi. Design and maintain CI/CD pipelines that enable reliable and consistent deployments. Create self-healing systems that can automatically respond to common failure scenarios.</li>\n</ul>\n<ul>\n<li>Establish SLOs and SLIs: Work with product and engineering teams to define and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Build systems to track and report on these metrics, ensuring we maintain high reliability standards while balancing innovation speed.</li>\n</ul>\n<ul>\n<li>Incident Management and Response: Lead incident response efforts, conducting thorough post-mortems, and implementing improvements to prevent future occurrences. Develop and maintain runbooks for critical services. Build tools and processes that reduce Mean Time To Recovery (MTTR).</li>\n</ul>\n<ul>\n<li>Performance Optimization: Identify and resolve performance bottlenecks across our infrastructure. Implement capacity planning strategies and optimize resource utilization. Work on reducing latency and improving system efficiency across global regions.</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>4-8 years of experience in Site Reliability Engineering or similar roles (DevOps, Systems Engineering, Infrastructure Engineering)</li>\n</ul>\n<ul>\n<li>Strong programming skills in languages commonly used for automation (Python, Go, or similar)</li>\n</ul>\n<ul>\n<li>Deep understanding of distributed systems</li>\n</ul>\n<ul>\n<li>Experience with container orchestration platforms (Kubernetes) and cloud-native technologies</li>\n</ul>\n<ul>\n<li>Proven track record of implementing and maintaining monitoring/observability solutions</li>\n</ul>\n<ul>\n<li>Strong incident management skills with experience leading incident response</li>\n</ul>\n<ul>\n<li>Experience with infrastructure as code and configuration management tools</li>\n</ul>\n<p><strong>Bonus Points</strong></p>\n<ul>\n<li>Experience with Google Cloud Platform (GCP) services and tools</li>\n</ul>\n<ul>\n<li>Knowledge of modern observability platforms (Prometheus, Grafana, Datadog, etc.)</li>\n</ul>\n<p><strong>What We Value</strong></p>\n<ul>\n<li>Problem-solving mindset: Ability to approach complex operational challenges systematically and devise effective solutions</li>\n</ul>\n<ul>\n<li>Self-directed and autonomous: Capable of working independently while collaborating effectively with cross-functional teams</li>\n</ul>\n<ul>\n<li>Strong communication skills: Ability to explain complex technical concepts to both technical and non-technical audiences</li>\n</ul>\n<ul>\n<li>Continuous learning: Passion for staying current with industry best practices and new technologies</li>\n</ul>\n<ul>\n<li>Focus on automation: Strong belief in automating repetitive tasks and building self-healing systems</li>\n</ul>\n<p><strong>Full-Time Employee Benefits Include</strong></p>\n<ul>\n<li>Competitive Salary &amp; Equity</li>\n</ul>\n<ul>\n<li>401(k) Program with a 4% match</li>\n</ul>\n<ul>\n<li>Health, Dental, Vision and Life Insurance</li>\n</ul>\n<ul>\n<li>Short Term and Long Term Disability</li>\n</ul>\n<ul>\n<li>Paid Parental, Medical, Caregiver Leave</li>\n</ul>\n<ul>\n<li>Commuter Benefits</li>\n</ul>\n<ul>\n<li>Monthly Wellness Stipend</li>\n</ul>\n<ul>\n<li>Autonomous Work Environment</li>\n</ul>\n<ul>\n<li>In Office Set-Up Reimbursement</li>\n</ul>\n<ul>\n<li>Flexible Time Off (FTO) + Holidays</li>\n</ul>\n<ul>\n<li>Quarterly Team Gatherings</li>\n</ul>\n<ul>\n<li>In Office Amenities</li>\n</ul>\n<p><strong>Want to Learn More About What We Are Up To?</strong></p>\n<ul>\n<li>Meet the Replit Agent</li>\n</ul>\n<ul>\n<li>Replit: Make an app for that</li>\n</ul>\n<ul>\n<li>Replit Blog</li>\n</ul>\n<ul>\n<li>Amjad TED Talk</li>\n</ul>\n<p><strong>Interviewing + Culture at Replit</strong></p>\n<ul>\n<li>Operating Principles</li>\n</ul>\n<ul>\n<li>Reasons not to work at Replit</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_b7de618e-5e1","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Replit","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/replit.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/replit/f6e6158e-eb89-4008-81ea-1b7512bc509d","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$160K - $250K","x-skills-required":["Site Reliability Engineering","DevOps","Systems Engineering","Infrastructure Engineering","Python","Go","Distributed systems","Container orchestration platforms","Cloud-native technologies","Monitoring/observability solutions","Incident management","Infrastructure as code","Configuration management tools"],"x-skills-preferred":["Google Cloud Platform","Prometheus","Grafana","Datadog"],"datePosted":"2026-03-07T15:20:24.140Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Site Reliability Engineering, DevOps, Systems Engineering, Infrastructure Engineering, Python, Go, Distributed systems, Container orchestration platforms, Cloud-native technologies, Monitoring/observability solutions, Incident management, Infrastructure as code, Configuration management tools, Google Cloud Platform, Prometheus, Grafana, Datadog","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":160000,"maxValue":250000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_7b5747bd-067"},"title":"Regulatory Operations Analyst","description":"<p><strong><strong>About the Team</strong></strong></p>\n<p>At OpenAI, our User Operations team safeguards our products and users from legal risk, regulatory non-compliance, fraud, and abuse. The team operates at the intersection of operations, compliance, and user trust, embedded within the broader User Operations organization and collaborating cross-functionally with Legal, Policy, Engineering, Product, and external vendors.</p>\n<p>We support a global and diverse user base across OpenAI’s suite of products ChatGPT, SORA, API, Enterprise offerings, and developer tools by managing sensitive inbound tickets, regulatory obligations, and fraud-related escalations. Every user-facing action is grounded in our commitment to legal integrity, ethical practice, and regulatory excellence.</p>\n<p><strong><strong>About the Role</strong></strong></p>\n<p>We are seeking a sharp, adaptive, and operations-minded Regulatory Operations Analyst to help scale and evolve OpenAI’s global compliance support infrastructure.</p>\n<p>In this role, you will own high-sensitivity workflows involving complex global regulatory escalations, trust and safety matters, privacy rights requests, and intellectual property matters. You’ll serve as both a frontline operator and strategic partner, triaging and resolving complex cases while also shaping the systems, documentation, and processes that support OpenAI’s regulatory and compliance goals. You will contribute as a subject-matter expert (SME) on high-stakes escalations, partnering with cross-functional stakeholders to drive fast, defensible outcomes. You will also help design the processes, tooling, and automation that power safe operations at scale.</p>\n<p>This role is essential to building scalable, high-integrity operations that protect user rights, meet our obligations under emerging and current regulations, and reduce OpenAI’s risk exposure. You’ll also contribute to multi-phase transitions and automation efforts that support our long-term operational model.</p>\n<p>We use a hybrid work model of 3 days in the office per week in our Dublin office.</p>\n<p>_<strong>Please note:</strong> This role may involve exposure to sensitive or concerning content, including complaints involving harassment, fraud, or regulatory violations. Strong personal discretion, empathy, and resilience are essential._</p>\n<p><strong><strong>In This Role, You Will:</strong></strong></p>\n<ul>\n<li>Handle and resolve complex user issues involving:</li>\n</ul>\n<ul>\n<li>Trust &amp; Safety incidents</li>\n</ul>\n<ul>\n<li>Regulatory, audit, or compliance inquiries and complaints</li>\n</ul>\n<ul>\n<li>Intellectual property matters (e.g., copyright takedowns, ownership disputes)</li>\n</ul>\n<ul>\n<li>AI governance and regulatory frameworks (e.g., EU AI Act, DSA/OSA)</li>\n<li>Perform risk evaluations and investigations using internal tools, documentation, and third-party data</li>\n</ul>\n<ul>\n<li>Act as incident manager for highly sensitive reviews requiring nuanced interpretation of legal and regulatory standards</li>\n</ul>\n<ul>\n<li>Interface directly with Legal, Privacy, Product, and Support teams to coordinate escalations and resolution paths</li>\n</ul>\n<ul>\n<li>Partner with Legal, Privacy, Policy, and Ops to implement world-class operational workflows for compliance and risk</li>\n</ul>\n<ul>\n<li>Build and maintain tooling, escalation decision trees, playbooks, and knowledge articles</li>\n</ul>\n<ul>\n<li>Contribute to vendor training and governance models, especially during transitions and ramp-up phases</li>\n</ul>\n<ul>\n<li>Lead or participate in cross-functional initiatives that strengthen our regulatory, fraud, and legal infrastructure</li>\n</ul>\n<ul>\n<li>Monitor operational health via case quality audits, SLA tracking, escalation accuracy, and data</li>\n</ul>\n<p><strong><strong>You Might Thrive in This Role If You:</strong></strong></p>\n<ul>\n<li>Have 5+ years of experience in legal operations, regulatory compliance, or trust &amp; safety especially in a global or high-growth tech environment</li>\n</ul>\n<ul>\n<li>Have partnered with in-house counsel, DPOs, or external regulators on audits or escalations</li>\n</ul>\n<ul>\n<li>Understand tiered support structures and have worked with vendor operations at scale</li>\n</ul>\n<ul>\n<li>Bring a structured, systems-first mindset to operational governance and risk evaluation</li>\n</ul>\n<ul>\n<li>Communicate clearly, empathetically, and effectively especially in writing responses to sensitive issues</li>\n</ul>\n<ul>\n<li>Operate well in ambiguity and can manage multiple priorities simultaneously with speed and precision</li>\n</ul>\n<p>Thrive in high-autonomy environments and hold a high bar for ownership and integrity</p>\n<p><strong><strong>About OpenAI</strong></strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_7b5747bd-067","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/a236b8f1-e5d1-494e-ad76-f6027935fafb","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Regulatory compliance","Trust and safety","Intellectual property","AI governance","Risk evaluation","Incident management","Communication","Operational governance","Vendor training","Cross-functional collaboration"],"x-skills-preferred":["Legal operations","Policy development","Process improvement","Data analysis","Project management"],"datePosted":"2026-03-06T18:37:13.564Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Dublin, Ireland"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Regulatory compliance, Trust and safety, Intellectual property, AI governance, Risk evaluation, Incident management, Communication, Operational governance, Vendor training, Cross-functional collaboration, Legal operations, Policy development, Process improvement, Data analysis, Project management"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_70806a42-556"},"title":"Senior Support Engineer","description":"<p><strong>Senior Support Engineer - Dublin</strong></p>\n<p><strong>Location</strong></p>\n<p>Dublin, Ireland</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p><strong>About the Team</strong></p>\n<p>The Technical Support team is responsible for ensuring that developers and enterprises can reliably build mission critical solutions using OpenAI models. We provide technical guidance, resolve complex issues and support customers in maximizing value and adoption from deploying our highly-capable models. We work closely with Technical Success, Product, Engineering and others to deliver the best possible experience to our customers at scale. We think from an automation-first mindset and leverage the latest in AI to scale our support operations. Join the Senior Support Engineering (SSE) team at OpenAI and help shape the future of Technical Support in the age of AI.</p>\n<p><strong>About the Role</strong></p>\n<p>We are looking for a Senior Support Engineer to collaborate directly with our strategic enterprise accounts and product teams, helping solve some of the most difficult problems faced by our Customers. You will be part of the best technical troubleshooting team at OpenAI, and our Customers and Engineering teams will look to you for technical guidance in addressing the most technically difficult issues in our environment.</p>\n<p>As a Senior Support Engineer, you will design and run operational processes to monitor our top strategic customers and a 24x7 response team. You’ll work closely with our Infrastructure and Engineering teams to deliver the best possible experience to customers at scale. Working directly with our most strategic Customers - You will be crucial to the success of the most innovative, disruptive, and high-scale AI solutions being built with the OpenAI API platform.</p>\n<p>The nature of this role will be low volume, high difficulty.</p>\n<p>This role is based in Dublin, Ireland. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.</p>\n<p><strong>In this role, you will:</strong></p>\n<ul>\n<li>Be among the foremost technical and troubleshooting experts for our API platform at OpenAI. You are the last line of defense before the core Engineering team.</li>\n</ul>\n<ul>\n<li>Proactively identify and implement opportunities to scale support operations by leveraging automation and advancements in AI technologies. Contribute to shaping the future of technical support in an AI-driven era.</li>\n</ul>\n<ul>\n<li>Configure and use advanced monitoring and alerting workflows to proactively detect customer impacting issues in real time.</li>\n</ul>\n<ul>\n<li>In partnership with engineering, contribute to reliability reviews and preparedness for new features, launches, or strategic customer requirement updates. Ensure that operational readiness (monitoring, alerting, and fallback plans) is in place for any such changes.</li>\n</ul>\n<ul>\n<li>Design and refine incident response processes and documentation across strategic customers, engineering and support teams.</li>\n</ul>\n<ul>\n<li>Analyze operational metrics and incident RCAs to identify areas for improvement. Proactively recommend and implement enhancements to monitoring dashboards, alert configurations, and support workflows.</li>\n</ul>\n<ul>\n<li>Provide support coverage during holidays and weekends based on business needs.</li>\n</ul>\n<p><strong>You might thrive in this role if you:</strong></p>\n<ul>\n<li>Have a Bachelor’s degree in Computer Science or a related field. A strong software engineering foundation is important for this role’s success.</li>\n</ul>\n<ul>\n<li>Have 5+ years of experience in technical operations roles such as SRE/NOC, designing monitoring systems and resolving production issues in fast-paced and mission-critical environments. A strong track record of troubleshooting complex technical problems at the systems level.</li>\n</ul>\n<ul>\n<li>Have deep familiarity with modern monitoring, alerting, and observability practices. Hands‑on experience setting up or managing metrics, logging, and tracing for distributed systems (e.g., understanding of SLIs/SLOs, alert tuning, dashboard creation).</li>\n</ul>\n<ul>\n<li>Have proven experience leading incident response for high‑severity outages or service disruptions. Able to perform real‑time incident coordination, root cause analysis, and drive follow‑ups (post‑mortems, action items) to prevent recurrence. Knowledge of industry best practices for incident management and fault diagnosis.</li>\n</ul>\n<ul>\n<li>Have strong skills in scripting or software engineering (e.g., Python or similar) to automate repetitive tasks and integrate tools.</li>\n</ul>\n<ul>\n<li>Have solid understanding of cloud infrastructure and distributed systems fundamentals. Comfortable working with cloud services, load balancers, databases, and containerized applications.</li>\n</ul>\n<ul>\n<li>Are effective at working cross‑functionally in a high‑trust environment. Strong communication skills to explain technical issues and resolutions to both engineering and non‑technical stakeholders. You can coordinate efforts across teams and are comfortable providing updates in the midst of an ongoing incident.</li>\n</ul>\n<p><strong>Compensation, Benefits and Perks</strong></p>\n<p>This is a position with OpenAI Ireland Ltd., which controls the hiring and management of this position.</p>\n<p>Total compensation includes an annual salary, generous equity, and benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>PRSA plan with 8% employer matching</li>\n</ul>\n<ul>\n<li>Unlimited time off</li>\n</ul>\n<ul>\n<li>Annual learning &amp; development stipend ($1,500 USD equivalent per year)</li>\n</ul>\n<p>#LI-NM2</p>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_70806a42-556","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/988016e1-de50-42be-925a-438b97291c5d","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Python","Cloud infrastructure","Distributed systems","Monitoring and alerting","Observability","Scripting","Software engineering","Cloud services","Load balancers","Databases","Containerized applications"],"x-skills-preferred":["SLIs/SLOs","Alert tuning","Dashboard creation","Incident management","Fault diagnosis","Cross-functional collaboration","Communication"],"datePosted":"2026-03-06T18:36:57.231Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Dublin"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Cloud infrastructure, Distributed systems, Monitoring and alerting, Observability, Scripting, Software engineering, Cloud services, Load balancers, Databases, Containerized applications, SLIs/SLOs, Alert tuning, Dashboard creation, Incident management, Fault diagnosis, Cross-functional collaboration, Communication"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0faec3dd-fe3"},"title":"Corporate Security Operations Manager, Tokyo","description":"<p><strong>Corporate Security Operations Manager, Tokyo</strong></p>\n<p><strong>Location</strong></p>\n<p>Tokyo, Japan</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p>Corporate Security</p>\n<p><strong>About the Team</strong></p>\n<p>The Corporate Security team is responsible for the security and protection of all OpenAI employees and executives. We are committed to creating and maintaining a secure environment that allows our team members to focus on their work without fear of harm or disruption.</p>\n<p><strong>About the Role</strong></p>\n<p>As Corporate Security Operations Manager for Tokyo, you will lead day-to-day corporate/physical security for our Tokyo office, and be responsible for delivering a safe, discreet, and professional security environment in a high-trust, high-confidentiality tech setting. Reporting up to the APAC Security Manager, you will be the primary security point of contact in Tokyo—balancing strong risk management with a calm, service-oriented presence—while delicately partnering with employees, cross-functional teams, and local external stakeholders.</p>\n<p>This role is based in Tokyo. Additionally, travel to Seoul and travel outside the region to support other CorpSec pillars may be required.</p>\n<p><strong>You’ll be responsible for:</strong></p>\n<ul>\n<li>Operationalizing the Tokyo office physical security program: access control, visitor management, incident response, and office security operations.</li>\n</ul>\n<ul>\n<li>Managing security vendors and the contract guard force (post orders, performance, coverage, KPIs/SLAs, and continuous improvement).</li>\n</ul>\n<ul>\n<li>Partnering closely with Workplace/Facilities, HR, IT/InfoSec, Legal, and office leadership to implement sensible, employee-friendly security controls.</li>\n</ul>\n<ul>\n<li>Leading local incident response and follow-through (documentation, after-action reviews, corrective actions).</li>\n</ul>\n<ul>\n<li>Supporting security planning for in-office events, leadership visits, and business travel into/out of Tokyo as needed. This may also include supporting the APAC regional manager in day to day administration of the Seoul office, as required.</li>\n</ul>\n<ul>\n<li>Driving emergency preparedness (earthquake readiness, evacuation/muster processes, tabletop exercises/drills in coordination with Workplace).</li>\n</ul>\n<p><strong>We’re looking for someone with:</strong></p>\n<ul>\n<li>8-10 years of experience in corporate security, protective services, law enforcement, military, or a combination of relevant fields, with demonstrated progression in responsibility.</li>\n</ul>\n<ul>\n<li>Proven experience leading physical security operations in a modern office environment, ideally within tech or other high-confidentiality settings.</li>\n</ul>\n<ul>\n<li>Demonstrated capability to manage security vendors and a contract guard force (performance management, post orders, SLAs/KPIs, incident standards).</li>\n</ul>\n<ul>\n<li>Strong incident management experience, including real-world response and after-action improvement.</li>\n</ul>\n<ul>\n<li>Exceptional interpersonal skills with a track record of delicately managing cross-functional stakeholders, employee concerns, and leadership expectations.</li>\n</ul>\n<ul>\n<li>Professional/Business English and Japanese fluency (written and spoken), including the ability to write clear incident reports and present risk decisions to regional/global partners.</li>\n</ul>\n<ul>\n<li>Sound judgment, discretion, and ability to handle sensitive issues with confidentiality.</li>\n</ul>\n<ul>\n<li>Comfort operating in a global environment across time zones, with a bias for collaboration and pragmatic solutions.</li>\n</ul>\n<ul>\n<li>A strong ethical foundation and a commitment to OpenAI’s mission and values.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0faec3dd-fe3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/6640e439-1217-4075-9441-602b543c9afa","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["corporate security","physical security","incident response","security vendors","contract guard force","security planning","emergency preparedness"],"x-skills-preferred":["Japanese fluency","incident management","interpersonal skills","risk management","security operations"],"datePosted":"2026-03-06T18:25:00.603Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Tokyo, Japan"}},"employmentType":"FULL_TIME","occupationalCategory":"Operations","industry":"Technology","skills":"corporate security, physical security, incident response, security vendors, contract guard force, security planning, emergency preparedness, Japanese fluency, incident management, interpersonal skills, risk management, security operations"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_b447a8bc-5f1"},"title":"Backend Software Engineer - B2B Connectors","description":"<p><strong>Location</strong></p>\n<p>San Francisco; New York City</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Location Type</strong></p>\n<p>Hybrid</p>\n<p><strong>Department</strong></p>\n<p>Applied AI</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>$230K – $385K • Offers Equity</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p>More details about our benefits are available to candidates during the hiring process.</p>\n<p>This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.</p>\n<p><strong><strong>About the Team</strong></strong></p>\n<p>OpenAI’s mission is to make AGI beneficial for all of humanity and our mission is successful only if AGI drives real benefits across all industries in the world. Our goal in B2B applications is to enable this mission by helping businesses, enterprises &amp; governments redefine how they operate to empower people and accelerate economic growth.</p>\n<p>Connectors are the bridge between OpenAI products (ChatGPT Enterprise, Frontier, and the API) and the systems where work actually happens—documents, tickets, messages, CRM records, knowledge bases, and more. The Connectors Platform team builds the infrastructure and control plane that makes these integrations reliable, secure, scalable, and enterprise-ready across a wide range of partners and customer environments.</p>\n<p><strong><strong>About the Role</strong></strong></p>\n<p>We’re looking for an infrastructure-focused engineer to build and operate the systems that make Connectors dependable at global scale. In this role, you’ll design the control plane, reliability foundations, and operational tooling that power connector execution—auth flows, sync and indexing pipelines, rate limiting, isolation, observability, incident response, and safe rollouts. You’ll work closely with product engineering, partner teams, and security to ship enterprise-grade connectivity while meeting high bars for privacy, compliance, and uptime.</p>\n<p><strong><strong>In this role, you will:</strong></strong></p>\n<ul>\n<li>Design and operate the infrastructure that powers connector sync, indexing, and retrieval at scale (job orchestration, queues, storage, caching, backpressure).</li>\n</ul>\n<ul>\n<li>Build the “control plane” primitives for connectors: rollout controls, configuration management, permissions, policy enforcement, and kill switches.</li>\n</ul>\n<ul>\n<li>Own reliability and operational excellence: SLOs, monitoring/alerting, incident response, postmortems, on-call health, and capacity planning.</li>\n</ul>\n<ul>\n<li>Create guardrails for safe multi-tenant execution: isolation boundaries, secrets handling, rate limits, abuse prevention, and blast-radius reduction.</li>\n</ul>\n<ul>\n<li>Partner with security and compliance teams to ensure enterprise requirements are met (audibility, least privilege, data retention, and secure-by-default architecture).</li>\n</ul>\n<ul>\n<li>Improve developer velocity via internal tooling: local dev workflows, canary environments, load testing, and observability dashboards.</li>\n</ul>\n<p><strong><strong>Your background might look something like:</strong></strong></p>\n<ul>\n<li>5+ years of professional engineering experience (excluding internships) in infra / SRE / platform roles at tech and product-driven companies.</li>\n</ul>\n<ul>\n<li>Strong distributed systems fundamentals and production instincts (availability, latency, correctness, resilience).</li>\n</ul>\n<ul>\n<li>Experience building and operating services with meaningful uptime and scale requirements (multi-region is a plus).</li>\n</ul>\n<ul>\n<li>Proficient in one or more backend languages (e.g. Python, Rust) and comfortable working close to systems concerns (networking, storage, queueing).</li>\n</ul>\n<ul>\n<li>Deep familiarity with observability (metrics, logs, tracing), incident management, and reliability engineering practices.</li>\n</ul>\n<ul>\n<li>Comfortable navigating ambiguous problem spaces and pushing pragmatic solutions into production.</li>\n</ul>\n<p>Interest in AI/ML is a plus, but not required.</p>\n<p><strong>About OpenAI</strong> OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_b447a8bc-5f1","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/cbacb6bd-aa41-41af-a5d5-13515a1be72b","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$230K – $385K","x-skills-required":["Backend languages (e.g. Python, Rust)","Distributed systems fundamentals","Production instincts (availability, latency, correctness, resilience)","Experience building and operating services with meaningful uptime and scale requirements","Proficient in one or more backend languages and comfortable working close to systems concerns (networking, storage, queueing)","Deep familiarity with observability (metrics, logs, tracing), incident management, and reliability engineering practices"],"x-skills-preferred":["AI/ML"],"datePosted":"2026-03-06T18:20:52.860Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco; New York City"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Backend languages (e.g. Python, Rust), Distributed systems fundamentals, Production instincts (availability, latency, correctness, resilience), Experience building and operating services with meaningful uptime and scale requirements, Proficient in one or more backend languages and comfortable working close to systems concerns (networking, storage, queueing), Deep familiarity with observability (metrics, logs, tracing), incident management, and reliability engineering practices, AI/ML","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":230000,"maxValue":385000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0b576be6-155"},"title":"Control Room Manager","description":"<p>We are excited to offer an outstanding opportunity for an ambitious and highly skilled Control Room Manager. In this capacity, you will be an essential member of our team, ensuring the operational integrity and availability of our gaming systems. Your work will strongly influence our operational success, helping us preserve our world-class standards.</p>\n<p><strong>What you&#39;ll do</strong></p>\n<ul>\n<li>Lead, supervise, and support Control Room Operators during all shifts.</li>\n<li>Ensure control room activities fully adhere to operational, security, and regulatory requirements.</li>\n</ul>\n<p><strong>What you need</strong></p>\n<ul>\n<li>Diploma or degree in IT, Operations, Business, or related field (or equivalent experience).</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0b576be6-155","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Aristocrat Interactive","sameAs":"https://aristocrat.wd3.myworkdayjobs.com","logo":"https://logos.yubhub.co/aristocrat.com.png"},"x-apply-url":"https://aristocrat.wd3.myworkdayjobs.com/en-US/AristocratExternalCareersSite/job/Lansing-MI-US/Control-Room-Manager_R0020775","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$48,790 - $90,610 per year","x-skills-required":["Diploma or degree in IT, Operations, Business, or related field (or equivalent experience)","Previous experience in a control room, operations centre, NOC, SOC, or gaming operations setting","Demonstrated leadership or supervisory experience"],"x-skills-preferred":["Strong knowledge of incident management and operational reporting","Proficient with computer systems, monitoring tools, and reporting platforms","Excellent verbal, written, and interpersonal communication skills"],"datePosted":"2026-03-01T05:05:36.479Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Lansing, MI, US"}},"employmentType":"FULL_TIME","occupationalCategory":"Operations","industry":"Technology","skills":"Diploma or degree in IT, Operations, Business, or related field (or equivalent experience), Previous experience in a control room, operations centre, NOC, SOC, or gaming operations setting, Demonstrated leadership or supervisory experience, Strong knowledge of incident management and operational reporting, Proficient with computer systems, monitoring tools, and reporting platforms, Excellent verbal, written, and interpersonal communication skills","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":48790,"maxValue":90610,"unitText":"YEAR"}}}]}