{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/service-mesh"},"x-facet":{"type":"skill","slug":"service-mesh","display":"Service Mesh","count":22},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_fd6d120d-6ff"},"title":"Senior Platform Software Engineer, Transport","description":"<p>About Us</p>\n<p>We&#39;re looking for a Senior Platform Software Engineer to join our Transport team, which is at the core of our evolution towards a resilient and scalable cloud future. As a member of this team, you&#39;ll design, build, and operate the foundational platform that allows our services to run in an isolated, highly available, and globally distributed fashion.</p>\n<p>As a Senior Platform Software Engineer, you&#39;ll have an outsized impact on every dbt Labs customer, tackling complex distributed systems problems while collaborating across product engineering, security, and infrastructure teams. 
This is a hands-on role where whatever you work on touches all of dbt Cloud and all of our customers at the same time.</p>\n<p>In this role, you can expect to:</p>\n<ul>\n<li>Join a senior, distributed team: Become part of a closely-knit group of senior engineers at the intersection of application and infrastructure, working asynchronously with ongoing communication in public Slack channels.</li>\n</ul>\n<ul>\n<li>Architect and build platform infrastructure: Design, build, and operate foundational components of our multi-cell platform, including service routing, cloud networking, and the control plane for managing account lifecycles.</li>\n</ul>\n<ul>\n<li>Drive seamless migrations: Develop and automate the tooling to migrate customer accounts from legacy environments to the new multi-cell architecture at scale.</li>\n</ul>\n<ul>\n<li>Develop scalable backend services: Write robust, high-quality backend services and infrastructure code, primarily in Go and Python, with opportunities to work with Rust.</li>\n</ul>\n<ul>\n<li>Tackle cloud networking challenges: Collaborate on network architecture design, including VPC management, load balancing, DNS, PrivateLink, and service mesh configurations to support single-tenant and multi-tenant deployments.</li>\n</ul>\n<ul>\n<li>Automate for scale: Design and implement automation using tools like Argo Workflows, Kubernetes, and Terraform to enhance the reliability, efficiency, and scalability of our platform.</li>\n</ul>\n<ul>\n<li>Collaborate and mentor: Work closely with product engineering teams, security, and customer support to unblock feature conformance, define technical direction, and mentor other engineers.</li>\n</ul>\n<ul>\n<li>Own and troubleshoot: Take strong ownership of distributed systems, troubleshoot complex issues across application and network layers, and participate in an on-call rotation to maintain high availability.</li>\n</ul>\n<p>You are a good fit if you have:</p>\n<ul>\n<li>Worked asynchronously 
as part of a fully-remote, distributed team</li>\n</ul>\n<ul>\n<li>Are an experienced backend or platform engineer, proficient in languages like Go or Python, with a history of building large-scale distributed systems.</li>\n</ul>\n<ul>\n<li>Have deep expertise in modern cloud infrastructure, including extensive hands-on experience with a major cloud provider (AWS, GCP, or Azure), containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform).</li>\n</ul>\n<ul>\n<li>Thrive at the intersection of product and infrastructure, with a passion for building internal platforms and automation that enhance developer productivity and platform reliability.</li>\n</ul>\n<ul>\n<li>Bring familiarity with cloud networking concepts, including load balancing, DNS, VPCs, proxies, and service mesh technologies, or have a strong desire to learn and grow in this domain.</li>\n</ul>\n<ul>\n<li>Take strong ownership of your work from end-to-end, demonstrating a systematic, customer-focused approach to problem-solving and a track record of contributing to complex technical projects.</li>\n</ul>\n<ul>\n<li>Are a proactive and collaborative communicator, skilled at articulating technical concepts to both technical and non-technical partners and working effectively across team boundaries.</li>\n</ul>\n<p>You&#39;ll have an edge if you have:</p>\n<ul>\n<li>Direct experience with cell-based or multi-tenant architectures, particularly with building tooling for large-scale account migrations.</li>\n</ul>\n<ul>\n<li>A proven track record of building internal developer platforms or self-service infrastructure that empowers other engineers.</li>\n</ul>\n<ul>\n<li>Hands-on experience with cloud networking tools such as nginx, Istio, Envoy, AWS Transit Gateway, PrivateLink, or Kubernetes CNI/service mesh implementations.</li>\n</ul>\n<ul>\n<li>Deep expertise in multi-cloud strategies, including tools for cross-cloud management and cost optimization.</li>\n</ul>\n<ul>\n<li>Advanced 
proficiency with our core technologies, including extensive professional experience with both Go and Python, and an interest in or exposure to Rust.</li>\n</ul>\n<ul>\n<li>Advanced industry certifications (e.g., AWS Certified Solutions Architect – Professional, AWS Advanced Networking Specialty, Certified Kubernetes Administrator) or contributions to open-source cloud-native projects.</li>\n</ul>\n<p>Qualifications</p>\n<ul>\n<li>5+ years of professional software engineering experience, particularly in platform, infrastructure, or backend roles supporting SaaS applications.</li>\n</ul>\n<ul>\n<li>A Bachelor&#39;s degree in Computer Science or a related technical field is preferred, though equivalent practical experience or bootcamp completion with relevant work history will be considered.</li>\n</ul>\n<p><strong>Compensation &amp; Benefits</strong></p>\n<p>Salary: We offer competitive compensation packages commensurate with experience, including salary, equity, and where applicable, performance-based pay. 
Our Talent Acquisition Team can answer questions around dbt Labs&#39; total rewards during your interview process.</p>\n<p>In select locations (including Boston, Chicago, Denver, Los Angeles, Philadelphia, New York Metro, San Francisco, DC Metro, Seattle, Austin), an alternate range may apply, as specified below.</p>\n<ul>\n<li>The typical starting salary range for this role is: $147,000 - $178,000 USD</li>\n</ul>\n<ul>\n<li>The typical starting salary range for this role in the select locations listed is: $163,000 - $198,000 USD</li>\n</ul>\n<p>Equity Stake Benefits</p>\n<ul>\n<li>dbt Labs offers: unlimited vacation, 401k w/3% guaranteed contribution, excellent healthcare, paid parental leave, wellness stipend, home office stipend, and more!</li>\n</ul>\n<ul>\n<li>Equity or comparable benefits may be offered depending on the legal limitations</li>\n</ul>\n<p><strong>Our Hiring Process (All Video Interviews)</strong></p>\n<ul>\n<li>Interview with a Talent Acquisition Partner (30 Mins)</li>\n</ul>\n<ul>\n<li>Technical Interview with Hiring Manager (60 Mins)</li>\n</ul>\n<ul>\n<li>Team Interviews with Cross Collaborators (4 rounds, 45 Mins each)</li>\n</ul>\n<ul>\n<li>Final Values Interview (30 Mins)</li>\n</ul>\n<p>dbt Labs is an equal opportunity employer, committed to building an inclusive team that welcomes diverse perspectives, backgrounds, and experiences. Even if your experience doesn’t perfectly align with the job description, we encourage you to apply; we value potential just as much as a perfect resume. Want to learn more about our focus on Diversity, Equity and Inclusion at dbt Labs? 
Check out our DEI page.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_fd6d120d-6ff","directApply":true,"hiringOrganization":{"@type":"Organization","name":"dbt Labs","sameAs":"https://www.getdbt.com/","logo":"https://logos.yubhub.co/getdbt.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/dbtlabsinc/jobs/4685888005","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$147,000 - $178,000 USD","x-skills-required":["Go","Python","Rust","Cloud infrastructure","Containerization","Infrastructure as Code","Cloud networking","Load balancing","DNS","VPCs","Proxies","Service mesh technologies"],"x-skills-preferred":["Cell-based or multi-tenant architectures","Building tooling for large-scale account migrations","Cloud networking tools","Multi-cloud strategies","Cross-cloud management and cost optimization"],"datePosted":"2026-04-18T15:57:06.377Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"US - Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Go, Python, Rust, Cloud infrastructure, Containerization, Infrastructure as Code, Cloud networking, Load balancing, DNS, VPCs, Proxies, Service mesh technologies, Cell-based or multi-tenant architectures, Building tooling for large-scale account migrations, Cloud networking tools, Multi-cloud strategies, Cross-cloud management and cost optimization","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":147000,"maxValue":178000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ac45e205-e7d"},"title":"Engineering Manager, Inference Routing and 
Performance","description":"<p><strong>About the role\\nEvery request that hits Claude (from claude.ai, the API, our cloud partners, or internal research) passes through a routing decision. Not a generic load balancer round-robin, but a decision that accounts for what&#39;s already cached where, which accelerator the request runs best on, and what else is in flight across the fleet.\\n\\nGet it right and you extract meaningfully more throughput from the same hardware. Get it wrong and you burn capacity, miss latency SLOs, or shed load that shouldn&#39;t have been shed.\\n\\nThe Inference Routing team owns this layer. We build the cluster-level routing and coordination plane for Anthropic&#39;s inference fleet: the system that sits between the API surface and the inference engines themselves, making fleet-wide efficiency decisions in real time.\\n\\nAs Anthropic moves from &quot;many independent inference replicas&quot; toward &quot;a single warehouse-scale computer running a coordinated program,&quot; Dystro is the coordination layer. 
This is a deeply technical team.\\n\\nThe engineers here design custom load-balancing algorithms, build quantitative models of system performance, debug latency spikes that cross kernel, network, and framework boundaries, and reason carefully about cache placement across thousands of accelerators.\\n\\nThey work shoulder-to-shoulder with teams that write kernels and ML framework internals.\\n\\nThe EM for this team doesn&#39;t need to write kernels, but they do need the systems depth to make architectural calls, evaluate deeply technical candidates, and spot when a proposed optimization will have second-order effects on the fleet.\\n\\nYou&#39;ll inherit a strong team of distributed-systems engineers, and you&#39;ll be accountable for two things that pull in different directions: shipping system-level performance improvements that measurably increase fleet throughput and efficiency, and running the team operationally so that deploys are safe, incidents are rare, and the teams who depend on Dystro can plan around you with confidence.\\n\\nThe job is holding both.\\n\\n## Representative work:\\nThings the Inference Routing EM actually spends time on:\\n- Deciding whether a proposed routing algorithm change is worth the deploy risk, given the modeled throughput gain and the blast radius if it regresses\\n- Sequencing a quarter where KV-cache offload, a new coordination protocol, and two model launches all compete for the same engineers\\n- Working through a persistent tail-latency regression with the team, walking down from fleet-level metrics to per-replica behavior to a root cause in the networking stack\\n- Building the case (with numbers) to peer teams for why a cross-team protocol change unlocks the next efficiency win\\n- Running the post-incident review after a cache-eviction bug caused a capacity event, and turning it into process changes that stick\\n- Interviewing a candidate who has built schedulers at supercomputing scale, and deciding whether they&#39;d 
be additive to a team that already goes deep\\n\\n## What you&#39;ll do:\\nDrive system-level performance\\n- Own the technical roadmap for cluster-level inference efficiency: routing decisions, cache placement and eviction, cross-replica coordination, and the protocols that keep routing and inference engines in sync\\n- Partner with the inference engine, kernels, and performance teams to identify fleet-level throughput and latency wins, then turn those into shipped improvements with measurable results\\n- Build the team&#39;s habit of quantitative performance modeling: claim a win only when you can measure it, and know before you ship what the expected effect is\\n\\nDeliver reliably and operate cleanly\\n- Set technical strategy for how routing evolves across heterogeneous hardware (GPUs, TPUs, Trainium) and across all our serving surfaces\\n- Run the team&#39;s operational backbone (on-call rotation, incident response, postmortem review, deploy safety) so the team can ship aggressively without the system becoming fragile\\n- Create clarity at a seam: Inference Routing sits between the API surface, the inference engines, and the cloud deployment teams. You&#39;ll make sure commitments are realistic, dependencies are understood, and nobody is surprised\\n\\nBuild and grow the team\\n- Develop and retain a strong existing team, and hire against the bar described above: people who can go to the OS and framework level when the problem demands it, and who care about production reliability\\n- Coach engineers through a roadmap where priorities shift with model launches, new hardware, and scaling demands. We pair a lot here; you&#39;ll help make that collaboration pattern productive\\n- Pick up slack when it matters. 
This is a small team in a critical path; sometimes the EM is the one unblocking a stuck deploy or synthesizing a design debate\\n\\n## You may be a good fit if you:\\n- Have 5+ years of engineering management experience, ideally with at least part of that leading teams on critical-path production infrastructure at scale\\n- Have a deep systems background: load balancing, scheduling, cache-coherent distributed state, high-performance networking, or similar. You need enough depth to make architectural calls about routing and efficiency, and to evaluate candidates who go to the kernel and framework level\\n- Have shipped performance improvements in large-scale systems and can explain, with numbers, what the impact was\\n- Have run production infrastructure with real operational stakes: on-call, incident response, capacity events, deploy discipline\\n- Are results-oriented with a bias toward impact, and comfortable working in a space where throughput, latency, stability, and feature velocity all pull in different directions\\n- Build strong relationships across team boundaries; this is a seam role, and much of the job is making sure other teams can rely on yours\\n- Are curious about machine learning systems. 
You don&#39;t need an ML research background, but you should want to learn how transformer inference actually works and how that shapes the systems problems\\n\\nStrong candidates may also have:\\n- Experience with LLM inference serving: KV caching, continuous batching, request scheduling, prefill/decode disaggregation\\n- Background in cluster schedulers, load balancers, service meshes, or coordination planes at scale\\n- Familiarity with heterogeneous accelerator fleets (GPU/TPU/Trainium) and how hardware differences affect workload placement\\n- Experience with GPU/accelerator programming, ML framework internals, or OS-level performance debugging, enough to follow and evaluate the technical work, not necessarily to do it daily\\n- Led teams at supercomputing or hyperscaler infrastructure scale\\n- Led teams through rapid-growth periods where hiring and onboarding competed with roadmap delivery\\n\\nThe annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings (&quot;OTE&quot;) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.\\nAnnual Salary: $405,000-$485,000 USD</strong></p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ac45e205-e7d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5155391008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$405,000-$485,000 USD","x-skills-required":["engineering management","distributed systems","load balancing","scheduling","cache-coherent distributed state","high-performance networking","machine learning 
systems"],"x-skills-preferred":["LLM inference serving","cluster schedulers","load balancers","service meshes","coordination planes","heterogeneous accelerator fleets","GPU/TPU/Trainium","GPU/accelerator programming","ML framework internals","OS-level performance debugging"],"datePosted":"2026-04-18T15:56:48.587Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"engineering management, distributed systems, load balancing, scheduling, cache-coherent distributed state, high-performance networking, machine learning systems, LLM inference serving, cluster schedulers, load balancers, service meshes, coordination planes, heterogeneous accelerator fleets, GPU/TPU/Trainium, GPU/accelerator programming, ML framework internals, OS-level performance debugging","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":405000,"maxValue":485000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d9b7d5ae-6bf"},"title":"Software Engineer, Distributed Systems","description":"<p>We&#39;re growing our team of passionate creatives and builders on a mission to make design accessible to all. Our platform helps teams bring ideas to life, whether you&#39;re brainstorming, creating a prototype, translating designs into code, or iterating with AI. From idea to product, Figma empowers teams to streamline workflows, move faster, and work together in real time from anywhere in the world.</p>\n<p>As a Software Engineer on our Infrastructure team, you’ll help design, build, and operate the systems that power our real-time collaborative design tools used by millions of people worldwide. 
We’re scaling fast, and we’re looking for experienced distributed systems engineers across a variety of teams. Whether you’re passionate about storage, compute orchestration, developer tooling, networking, or real-time data systems, this role offers an opportunity to shape the technical foundation of one of the most beloved design platforms in the world.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Design, build, and maintain scalable and reliable infrastructure systems that support product innovation and user collaboration at scale.</li>\n</ul>\n<ul>\n<li>Architect and evolve distributed systems including storage platforms, streaming infrastructure, and compute orchestration.</li>\n</ul>\n<ul>\n<li>Improve developer experience by building internal platforms, CI/CD systems, build tools, and APIs.</li>\n</ul>\n<ul>\n<li>Collaborate across product and infrastructure teams to design secure, maintainable, and performant systems.</li>\n</ul>\n<ul>\n<li>Participate in shaping platform strategy, roadmaps, and engineering best practices across the organization.</li>\n</ul>\n<ul>\n<li>Debug and resolve complex production issues that span services and layers of the stack.</li>\n</ul>\n<ul>\n<li>Mentor engineers and foster a culture of collaboration, inclusivity, and technical excellence.</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>5+ years of Software Engineering experience, specifically in backend or infrastructure engineering.</li>\n</ul>\n<ul>\n<li>Deep understanding of distributed systems concepts such as sharding, replication, consistency, and eventual convergence.</li>\n</ul>\n<ul>\n<li>Experience with cloud-native environments (AWS, GCP, or Azure), infrastructure-as-code, and container orchestration.</li>\n</ul>\n<ul>\n<li>Proficiency in languages such as Go, TypeScript, Python, Rust, or Ruby.</li>\n</ul>\n<ul>\n<li>Strong system design skills and a track record of architecting resilient production systems.</li>\n</ul>\n<ul>\n<li>Excellent communication skills, with 
experience collaborating across teams and mentoring others.</li>\n</ul>\n<p>Preferred Qualifications:</p>\n<ul>\n<li>Experience scaling storage platforms (e.g., Postgres, Redis, S3, DynamoDB) or operating streaming systems like Kafka.</li>\n</ul>\n<ul>\n<li>Background in traffic management, DDoS mitigation, or service mesh technologies (e.g., Envoy, Istio).</li>\n</ul>\n<ul>\n<li>A history of developing complex, real-time distributed systems at scale.</li>\n</ul>\n<ul>\n<li>A passion for building developer productivity tools, including development environments, CI/CD pipelines, and build systems.</li>\n</ul>\n<ul>\n<li>Experience with evolving large-scale, shared developer platforms to improve reliability and developer velocity.</li>\n</ul>\n<ul>\n<li>Strong problem-solving skills and a bias for action, especially when tackling high-impact, gritty challenges.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d9b7d5ae-6bf","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Figma","sameAs":"https://www.figma.com/","logo":"https://logos.yubhub.co/figma.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/figma/jobs/5552549004","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$153,000-$376,000 USD","x-skills-required":["distributed systems","cloud-native environments","infrastructure-as-code","container orchestration","Go","TypeScript","Python","Rust","Ruby","system design","resilient production systems"],"x-skills-preferred":["storage platforms","streaming infrastructure","compute orchestration","developer tooling","networking","real-time data systems","traffic management","DDoS mitigation","service mesh technologies","complex distributed systems","developer productivity 
tools"],"datePosted":"2026-04-18T15:56:47.168Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA • New York, NY • United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"distributed systems, cloud-native environments, infrastructure-as-code, container orchestration, Go, TypeScript, Python, Rust, Ruby, system design, resilient production systems, storage platforms, streaming infrastructure, compute orchestration, developer tooling, networking, real-time data systems, traffic management, DDoS mitigation, service mesh technologies, complex distributed systems, developer productivity tools","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":153000,"maxValue":376000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_5cad560f-dc3"},"title":"Engineering Manager, Cloud Networking (Brazil)","description":"<p>You will join Airbnb&#39;s mission-driven company dedicated to helping create a world where anyone can belong anywhere. As the first Network engineering lead in Airbnb&#39;s Brazil office, you will be responsible for bootstrapping and growing the networking team in our new São Paulo office.</p>\n<p>Your primary focus will be on delivering an Airbnb network platform that is flexible, efficient, always available, and scales with the needs of the business. 
You will work closely with peers across Cloud Infra, Security, Reliability, and many other partner teams across the company to achieve this goal.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Providing meaningful input to technical designs and direct hands-on contributions to projects in the cloud networking space</li>\n<li>Growing, leading, and managing a small team of talented engineers</li>\n<li>Supporting your team&#39;s professional growth and maintaining high performance through mentorship and coaching</li>\n<li>Working with tech leads, peers, and partners to define and execute on a coherent vision and roadmap for Airbnb&#39;s cloud network infrastructure and related components</li>\n<li>Working with open source communities (e.g. istio) to build the next generation service mesh for all Airbnb back-end services</li>\n<li>Building cross-region gateways and load balancers for global Airbnb services</li>\n<li>Working with external partners and internal engineering and security teams to deliver edge security systems that protect Airbnb services</li>\n<li>Nurturing a culture of technical quality from design, through code review, to production</li>\n<li>Building strong partnership and alignment with teams across engineering</li>\n<li>Nurturing relationships with open source communities and external service partners</li>\n</ul>\n<p>As a successful candidate, you will have a strong background in engineering management, with 2+ years of experience and 8+ years of relevant software development experience in a fast-paced tech environment. You will also have experience with a public cloud provider (AWS, GCP, Azure) and their networking service offerings, as well as experience running large-scale networking systems and software (e.g. proxies, DNS, gateways).</p>\n<p>Additionally, you will have excellent communication skills and the ability to work well with teams across the engineering organization (e.g. reliability, compute, security, etc.). 
You will also have strong problem-solving skills and experience leading teams on-call for production infrastructure.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_5cad560f-dc3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Airbnb","sameAs":"https://www.airbnb.com/","logo":"https://logos.yubhub.co/airbnb.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/airbnb/jobs/7381450","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Professional fluency in English","2+ years of engineering management experience","8+ years of relevant software development experience in a fast-paced tech environment","Experience with a public cloud provider (AWS, GCP, Azure) and their networking service offerings","Experience running large-scale networking systems and software (e.g. proxies, DNS, gateways)","Experience with Istio service mesh, k8s and cloud native technologies","Excellent communication skills and the ability to work well with teams across the engineering organization","Strong problem-solving skills and experience leading teams on-call for production infrastructure"],"x-skills-preferred":["Experience with open source communities (e.g. 
istio)","Experience building cross-region gateways and load balancers for global services","Experience working with external partners and internal engineering and security teams to deliver edge security systems"],"datePosted":"2026-04-18T15:55:03.519Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Brazil"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Professional fluency in English, 2+ years of engineering management experience, 8+ years of relevant software development experience in a fast-paced tech environment, Experience with a public cloud provider (AWS, GCP, Azure) and their networking service offerings, Experience running large-scale networking systems and software (e.g. proxies, DNS, gateways), Experience with Istio service mesh, k8s and cloud native technologies, Excellent communication skills and the ability to work well with teams across the engineering organization, Strong problem-solving skills and experience leading teams on-call for production infrastructure, Experience with open source communities (e.g. 
istio), Experience building cross-region gateways and load balancers for global services, Experience working with external partners and internal engineering and security teams to deliver edge security systems"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c0569537-539"},"title":"Staff Backend Engineer, Gitlab Delivery: Upgrades","description":"<p>As a Staff Engineer on the GitLab Delivery - Upgrades team, you&#39;ll guide the technical direction for GitLab&#39;s self-managed deployment strategy so customers can deploy, upgrade, and run GitLab reliably in their own infrastructure with minimal disruption.</p>\n<p>You&#39;ll serve as a technical anchor for the team, working closely with your engineering manager, product manager, and partners across Site Reliability Engineering, Release, Security, and Development to shape cloud-native, operator-driven deployment patterns that reduce operational complexity and upgrade friction.</p>\n<p>In your first year, you&#39;ll help define the architecture for zero-downtime upgrades, strengthen observability and reliability practices, and guide the next generation of deployment automation for self-managed GitLab environments.</p>\n<p>Some examples of our projects:</p>\n<ul>\n<li>Evolving GitLab Operator and Helm charts to support zero-downtime upgrades for complex, stateful GitLab installations</li>\n</ul>\n<ul>\n<li>Advancing the GitLab Environment Toolkit to simplify large-scale, production-ready self-managed deployments</li>\n</ul>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Guide the technical vision and architecture for GitLab&#39;s cloud-native, self-managed deployments and upgrade workflows.</li>\n</ul>\n<ul>\n<li>Establish operational maturity standards, service integration patterns, and deployment models that help development teams manage the lifecycle of their components.</li>\n</ul>\n<ul>\n<li>Design and maintain Kubernetes Operators, 
Helm charts, and upgrade orchestration tooling for self-managed GitLab deployments across varied environments.</li>\n</ul>\n<ul>\n<li>Develop automation and integration frameworks for database migrations, rolling deployments, compatibility checks, and rollback paths.</li>\n</ul>\n<ul>\n<li>Define database and application lifecycle strategies, including safe PostgreSQL migration approaches and validation mechanisms that reduce downtime risk.</li>\n</ul>\n<ul>\n<li>Work with Product Management, GitLab.com Site Reliability Engineering, GitLab Dedicated, and development teams to align deployment patterns with customer needs.</li>\n</ul>\n<ul>\n<li>Mentor engineers and enable customer-facing teams through design reviews, code reviews, documentation, and runbooks.</li>\n</ul>\n<ul>\n<li>Drive observability, testing, performance, and resilience practices for self-managed deployments, and contribute to incident response and post-incident learning.</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>Strong software engineering experience designing and delivering production systems that customers install and operate in their own infrastructure.</li>\n</ul>\n<ul>\n<li>Proficiency in Go for large, complex codebases, with familiarity with Ruby on Rails and Rails application architecture as a useful addition.</li>\n</ul>\n<ul>\n<li>Hands-on experience with Kubernetes in production, including building and maintaining Operators, designing Helm charts for stateful applications, and working with Custom Resource Definitions, admission controllers, and controller patterns.</li>\n</ul>\n<ul>\n<li>Knowledge of cloud-native systems and tooling, such as service mesh, observability stacks, infrastructure as code, and automation tools like Terraform or Ansible.</li>\n</ul>\n<ul>\n<li>Experience with stateful workloads and databases, including PostgreSQL schema design and migrations, persistent volumes, storage classes, and approaches for reducing downtime during 
upgrades.</li>\n</ul>\n<ul>\n<li>Understanding of Linux systems and production operations, including package management, systemd, system-level debugging, observability, incident response, and on-call participation.</li>\n</ul>\n<ul>\n<li>Ability to guide through influence, including writing clear technical proposals, documenting decisions, mentoring engineers, and working effectively across teams.</li>\n</ul>\n<ul>\n<li>Interest in open source infrastructure or deployment tooling, or transferable experience from adjacent domains, with the ability to explain technical concepts clearly to different audiences.</li>\n</ul>\n<p><strong>About the Team</strong></p>\n<p>The Delivery - Upgrades team sits within GitLab Delivery and focuses on delivering GitLab to self-managed users through supported, validated deployment tooling. We own and evolve the GitLab Omnibus package, Helm charts, GitLab Operator, and the GitLab Environment Toolkit, and we work asynchronously across regions with partners in Site Reliability Engineering, Release, Security, and Development.</p>\n<p>Our work centers on enabling zero-downtime upgrades, reducing operational complexity at scale, supporting GitLab’s cloud-native transition while continuing to serve existing deployments, and improving the upgrade experience for customers running GitLab in diverse environments.</p>\n<p>For more on how we work, see [Link: Team Handbook Page].</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c0569537-539","directApply":true,"hiringOrganization":{"@type":"Organization","name":"GitLab","sameAs":"https://about.gitlab.com/","logo":"https://logos.yubhub.co/about.gitlab.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/gitlab/jobs/8463922002","x-work-arrangement":"remote","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Go","Ruby on 
Rails","Kubernetes","Cloud-native systems","Service mesh","Observability stacks","Infrastructure as code","Automation tools","Linux systems","Production operations","Package management","Systemd","System-level debugging","Incident response","On-call participation"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:52:40.073Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote, India"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Go, Ruby on Rails, Kubernetes, Cloud-native systems, Service mesh, Observability stacks, Infrastructure as code, Automation tools, Linux systems, Production operations, Package management, Systemd, System-level debugging, Incident response, On-call participation"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_fa9a54d7-549"},"title":"Senior Site Reliability Engineer, Data Infrastructure","description":"<p>As a Senior Site Reliability Engineer, you will own the reliability and performance of our Kubernetes-based data platform. You will design and operate highly available, multi-region systems, ensuring our services meet strict uptime and latency targets.</p>\n<p>Day-to-day, you’ll work on scaling infrastructure, improving deployment pipelines, and hardening our security posture. You’ll play a key role in evolving our DevSecOps practices while partnering closely with engineering teams to ensure services are built for reliability from day one.</p>\n<p>We operate with production-grade discipline, supporting mission-critical services with stringent uptime requirements and a focus on automation, observability, and resilience.</p>\n<p>The Platform &amp; Infrastructure Engineering team in the Data Infrastructure organization is responsible for the reliability, scalability, and security of the company’s data platform. 
The team builds and operates the foundational systems that power data ingestion, transformation, analytics, and internal AI workloads at scale.</p>\n<p>About the role:</p>\n<ul>\n<li>5+ years of experience in Site Reliability Engineering, Platform Engineering, or Infrastructure Engineering roles</li>\n<li>Deep expertise in Kubernetes and containerized software services, including cluster design, operations, and troubleshooting in production environments</li>\n<li>Strong experience building and operating CI/CD systems, including tools such as Argo CD and GitHub Actions</li>\n<li>Proven experience owning production systems with high availability requirements (≥99.99% uptime), including incident response, SLI/SLO/SLA definition, error budgets, and postmortems</li>\n<li>Hands-on experience designing and operating geo-replicated, multi-region, active-active systems, including traffic routing, failover strategies, and data consistency tradeoffs</li>\n<li>Strong experience building and owning observability components, including metrics, logging, and tracing (e.g., Prometheus, Grafana, OpenTelemetry).</li>\n<li>Experience with infrastructure as code (e.g., Helm, Terraform, Pulumi) and automated environment provisioning</li>\n<li>Strong understanding of system performance tuning, capacity planning, and resource optimization in distributed systems</li>\n<li>Experience implementing and operating security best practices in cloud-native environments (e.g., secrets management, network policies, vulnerability scanning)</li>\n</ul>\n<p>Preferred:</p>\n<ul>\n<li>Experience operating data platforms or data-intensive workloads (e.g., Spark, Airflow, Kafka, Flink)</li>\n<li>Familiarity with service mesh technologies (e.g., Istio, Linkerd)</li>\n<li>Experience working in regulated environments with compliance frameworks such as GDPR, SOC 2, HIPAA, or SOX</li>\n<li>Background in building internal developer platforms or self-service infrastructure</li>\n</ul>\n<p>Wondering if you’re a 
good fit?</p>\n<p>We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren’t a 100% skill or experience match.</p>\n<p>Here are a few qualities we’ve found compatible with our team. If some of this describes you, we’d love to talk.</p>\n<ul>\n<li>You love building highly reliable systems that operate at scale</li>\n<li>You’re curious about how to continuously improve system resilience, security, and operations</li>\n<li>You’re an expert in diagnosing and solving complex distributed systems problems</li>\n</ul>\n<p>Why CoreWeave?</p>\n<p>At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning.</p>\n<p>Our team cares deeply about how we build our product and how we work together, which is represented through our core values:</p>\n<ul>\n<li>Be Curious at Your Core</li>\n<li>Act Like an Owner</li>\n<li>Empower Employees</li>\n<li>Deliver Best-in-Class Client Experiences</li>\n<li>Achieve More Together</li>\n</ul>\n<p>We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems.</p>\n<p>As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too.</p>\n<p>Come join us!</p>\n<p>The base salary range for this role is $165,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. 
We strive for both market alignment and internal equity when determining compensation.</p>\n<p>In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).</p>\n<p>What We Offer</p>\n<p>The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.</p>\n<p>In addition to a competitive salary, we offer a variety of benefits to support your needs, including:</p>\n<ul>\n<li>Medical, dental, and vision insurance</li>\n<li>100% paid for by CoreWeave</li>\n<li>Company-paid Life Insurance</li>\n<li>Voluntary supplemental life insurance</li>\n<li>Short and long-term disability insurance</li>\n<li>Flexible Spending Account</li>\n<li>Health Savings Account</li>\n<li>Tuition Reimbursement</li>\n<li>Ability to Participate in Employee Stock Purchase Program (ESPP)</li>\n<li>Mental Wellness Benefits through Spring Health</li>\n<li>Family-Forming support provided by Carrot</li>\n<li>Paid Parental Leave</li>\n<li>Flexible, full-service childcare support with Kinside</li>\n<li>401(k) with a generous employer match</li>\n<li>Flexible PTO</li>\n<li>Catered lunch each day in our office and data center locations</li>\n<li>A casual work environment</li>\n<li>A work culture focused on innovative disruption</li>\n</ul>\n<p>Our Workplace</p>\n<p>While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets.</p>\n<p>New hires will be invited to attend onboarding at one of our hubs within their first month.</p>\n<p>Teams also gather quarterly to support collaboration.</p>\n<p>California Consumer Privacy Act - California applicants only</p>\n<p>CoreWeave 
is an equal opportunity employer, committed to fostering an inclusive and supportive workplace.</p>\n<p>All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.</p>\n<p>As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship.</p>\n<p>If reasonable accommodation is needed, please contact: careers@coreweave.com.</p>\n<p>Export Control Compliance</p>\n<p>This position requires access to export controlled information.</p>\n<p>To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without restrictions, or (C) otherwise exempt from the export regulations.</p>\n<p>If you are not a U.S. person, you will be required to provide documentation of your eligibility to access the export controlled information before being considered for this position.</p>\n<p>Please note that CoreWeave is subject to the requirements of the U.S. Department of Commerce&#39;s Export Administration Regulations (EAR) and the U.S. 
Department of State&#39;s International Traffic in Arms Regulations (ITAR).</p>\n<p>By applying for this position, you acknowledge that you have read and understood the export control requirements and that you will comply with them.</p>\n<p>If you have any questions or concerns regarding the export control requirements, please contact: careers@coreweave.com.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_fa9a54d7-549","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4671535006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$165,000 to $242,000","x-skills-required":["Kubernetes","containerized software services","cluster design","operations","troubleshooting","CI/CD systems","Argo CD","GitHub Actions","production systems","high availability","incident response","SLI/SLO/SLA definition","error budgets","postmortems","geo-replicated","multi-region","active-active systems","traffic routing","failover strategies","data consistency tradeoffs","observability components","metrics","logging","tracing","Prometheus","Grafana","OpenTelemetry","infrastructure as code","Helm","Terraform","Pulumi","automated environment provisioning","system performance tuning","capacity planning","resource optimization","distributed systems","security best practices","cloud-native environments","secrets management","network policies","vulnerability scanning"],"x-skills-preferred":["Spark","Airflow","Kafka","Flink","service mesh technologies","Istio","Linkerd","regulated environments","compliance frameworks","GDPR","SOC 2","HIPAA","SOX","internal developer platforms","self-service 
infrastructure"],"datePosted":"2026-04-18T15:51:59.035Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York, NY / Bellevue, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Kubernetes, containerized software services, cluster design, operations, troubleshooting, CI/CD systems, Argo CD, GitHub Actions, production systems, high availability, incident response, SLI/SLO/SLA definition, error budgets, postmortems, geo-replicated, multi-region, active-active systems, traffic routing, failover strategies, data consistency tradeoffs, observability components, metrics, logging, tracing, Prometheus, Grafana, OpenTelemetry, infrastructure as code, Helm, Terraform, Pulumi, automated environment provisioning, system performance tuning, capacity planning, resource optimization, distributed systems, security best practices, cloud-native environments, secrets management, network policies, vulnerability scanning, Spark, Airflow, Kafka, Flink, service mesh technologies, Istio, Linkerd, regulated environments, compliance frameworks, GDPR, SOC 2, HIPAA, SOX, internal developer platforms, self-service infrastructure","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":165000,"maxValue":242000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_a14533c3-732"},"title":"Senior Engineer, Cilium CNI & Cloud Networking","description":"<p>Network Services Team</p>\n<p>The Network Services team builds and operates the foundational networking that powers CoreWeave&#39;s Kubernetes platforms at cloud scale. The team is responsible for container networking, connectivity, and network services that support large-scale, GPU-driven workloads across regions and environments. 
They focus on scalability, reliability, security, and performance while delivering intuitive platforms for internal teams and customers.</p>\n<p>About the Role</p>\n<p>As a Senior Engineer focused on our Cilium-based CNI, you will design, build, and operate the container networking layer that underpins CoreWeave&#39;s Kubernetes platforms. Day to day, you will work on evolving our CNI stack to support large, high-density GPU clusters with demanding throughput and latency requirements. You will partner closely with Kubernetes, Infrastructure, and Network Services engineers to ensure the platform is highly available, observable, and secure. This role spans architecture, implementation, and operations, with ownership from prototype through production. You will also help shape how our networking platform scales for future growth.</p>\n<p>Who You Are</p>\n<ul>\n<li>5+ years of experience as a Software Engineer or Systems Engineer working on cloud infrastructure or large-scale distributed systems.</li>\n<li>Hands-on production experience with Cilium CNI (or equivalent advanced CNIs), including cluster configuration and lifecycle management.</li>\n<li>Strong understanding of Cilium&#39;s eBPF datapath, policy model, and load-balancing mechanisms.</li>\n<li>Deep knowledge of cloud networking concepts, including VPCs, subnets, routing, security groups/ACLs, NAT, and ingress/egress architectures.</li>\n<li>Experience designing multi-tenant network architectures with strong isolation and security.</li>\n<li>Solid grounding in TCP/IP, dynamic routing (e.g., BGP), ECMP, MTU/fragmentation, and overlay/underlay networking (VXLAN, Geneve, encapsulation).</li>\n<li>Experience with network observability and troubleshooting across L3–L7.</li>\n<li>Proficiency in at least one systems language such as Golang or C/C++.</li>\n<li>Experience working in modern CI/CD environments.</li>\n<li>Experience operating Kubernetes at scale, including cluster lifecycle management and debugging 
networking issues across pods, nodes, and external services.</li>\n<li>Demonstrated ownership of complex systems end-to-end.</li>\n</ul>\n<p>Preferred</p>\n<ul>\n<li>Experience operating cloud-scale network services across tens of thousands of nodes and multiple regions.</li>\n<li>Contributions to Cilium, Kubernetes, or related open-source networking projects.</li>\n<li>Experience with eBPF development and performance tuning.</li>\n<li>Experience building Kubernetes operators or controllers.</li>\n<li>Familiarity with service meshes, multi-cluster networking, or cluster mesh solutions.</li>\n<li>Experience in GPU-heavy, HPC, or other performance-sensitive environments.</li>\n</ul>\n<p>Wondering if you’re a good fit?</p>\n<p>We believe in investing in our people and value candidates who bring diverse experiences, even if you’re not a 100% match on paper. If some of this sounds like you, we’d love to talk.</p>\n<ul>\n<li>You love solving complex distributed systems and networking challenges at scale.</li>\n<li>You’re curious about cloud-native networking, eBPF, and Kubernetes internals.</li>\n<li>You’re an expert in building reliable, scalable infrastructure that runs in production.</li>\n</ul>\n<p>Why CoreWeave?</p>\n<p>At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:</p>\n<ul>\n<li>Be Curious at Your Core</li>\n<li>Act Like an Owner</li>\n<li>Empower Employees</li>\n<li>Deliver Best-in-Class Client Experiences</li>\n<li>Achieve More Together</li>\n</ul>\n<p>The base salary range for this role is $165,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location.
We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).</p>\n<p>What We Offer</p>\n<p>The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location. In addition to a competitive salary, we offer a variety of benefits to support your needs, including:</p>\n<ul>\n<li>Medical, dental, and vision insurance</li>\n<li>100% paid for by CoreWeave</li>\n<li>Company-paid Life Insurance</li>\n<li>Voluntary supplemental life insurance</li>\n<li>Short and long-term disability insurance</li>\n<li>Flexible Spending Account</li>\n<li>Health Savings Account</li>\n<li>Tuition Reimbursement</li>\n<li>Ability to Participate in Employee Stock Purchase Program (ESPP)</li>\n<li>Mental Wellness Benefits through Spring Health</li>\n<li>Family-Forming support provided by Carrot</li>\n<li>Paid Parental Leave</li>\n<li>Flexible, full-service childcare support with Kinside</li>\n<li>401(k) with a generous employer match</li>\n<li>Flexible PTO</li>\n<li>Catered lunch each day in our office and data center locations</li>\n<li>A casual work environment</li>\n<li>A work culture focused on innovative disruption</li>\n</ul>\n<p>Our Workplace</p>\n<p>While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. 
Teams also gather quarterly to support collaboration.</p>\n<p>California Consumer Privacy Act - California applicants only</p>\n<p>CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information. As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: careers@coreweave.com.</p>\n<p>Export Control Compliance</p>\n<p>This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. 
CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_a14533c3-732","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4653971006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$165,000 to $242,000","x-skills-required":["Cilium CNI","cloud infrastructure","large-scale distributed systems","container networking","connectivity","network services","Kubernetes","eBPF datapath","policy model","load-balancing mechanisms","cloud networking concepts","VPCs","subnets","routing","security groups/ACLs","NAT","ingress/egress architectures","TCP/IP","dynamic routing","ECMP","MTU/fragmentation","overlay/underlay networking","Golang","C/C++","CI/CD environments","Kubernetes at scale","cluster lifecycle management","debugging networking issues"],"x-skills-preferred":["cloud-scale network services","Cilium","eBPF development","performance tuning","Kubernetes operators","controllers","service meshes","multi-cluster networking","cluster mesh solutions","GPU-heavy","HPC","performance-sensitive environments"],"datePosted":"2026-04-18T15:47:58.336Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Cilium CNI, cloud infrastructure, large-scale distributed systems, container networking, connectivity, network services, Kubernetes, eBPF datapath, policy model, load-balancing mechanisms, cloud networking concepts, VPCs, subnets, routing, security 
groups/ACLs, NAT, ingress/egress architectures, TCP/IP, dynamic routing, ECMP, MTU/fragmentation, overlay/underlay networking, Golang, C/C++, CI/CD environments, Kubernetes at scale, cluster lifecycle management, debugging networking issues, cloud-scale network services, Cilium, eBPF development, performance tuning, Kubernetes operators, controllers, service meshes, multi-cluster networking, cluster mesh solutions, GPU-heavy, HPC, performance-sensitive environments","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":165000,"maxValue":242000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_3a40dbfa-d00"},"title":"Staff Software Engineer, Non-Human Identity","description":"<p>Secure Every Identity, from AI to Human Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organisations to safely embrace this new era.</p>\n<p>We are looking for builders and owners who operate with speed and urgency and execute with excellence. This is an opportunity to do career-defining work. We&#39;re all in on this mission. If you are too, let&#39;s talk.</p>\n<p>The Team</p>\n<p>The Okta Privileged Access Management (PAM) team is building the future of identity for machines, services, and applications. We are seeking a world-class Staff Engineer to help us architect and build the high-performance core of our non-human identity platform.</p>\n<p>Your work, in close collaboration with our principal engineers and architects, will be the foundation of our strategy for managing privileged access in the modern enterprise. 
If you are a systems programmer who thrives on influencing the design of high-performance, concurrent, and resilient security software, this is the role for you.</p>\n<p>What you’ll be doing</p>\n<ul>\n<li>Contribute to Core Architecture:</li>\n<li>Partner with principal engineers and architects to design and implement a low-latency, high-throughput secrets engine for non-human identities</li>\n<li>Solve for Massive Scale:</li>\n<li>Write highly concurrent, performance-critical code capable of handling millions of machine-to-machine authentication and authorization requests</li>\n<li>Shape Technical Strategy:</li>\n<li>Play a key role in defining the long-term technical roadmap for scalability and performance, ensuring our platform can meet the demands of the largest enterprises</li>\n<li>Mentor and Elevate:</li>\n<li>As a senior engineer on the team, you will work with junior engineers to help them advance their SDLC expertise.</li>\n<li>On-Call:</li>\n<li>Participate in the rotational on-call activities with SRE and product development team</li>\n</ul>\n<p>What you’ll bring to the role</p>\n<ul>\n<li>Required Experience:</li>\n<li>8+ years of professional software engineering experience, with a heavy focus on backend or systems-level development</li>\n<li>Bachelor’s or Master’s degree in Computer Science, or equivalent practical experience</li>\n<li>Core Technical Expertise:</li>\n<li>Deep, hands-on expertise in multi-platform Go development and building high-performance, concurrent applications</li>\n<li>Experience designing or operating distributed systems</li>\n<li>Experience with secure systems (authn/authz, encryption, TLS, token handling, PKI, CAs, diagnosing TLS issues)</li>\n<li>Deep expertise in distributed storage systems, with a focus on replication, backup, and restore, and data management. 
(Postgres, etc.)</li>\n<li>Direct experience designing, building, or contributing to a secrets management, service mesh, or machine identity platform</li>\n<li>Expert-level at ergonomic API design (gRPC/openAPI), and building for reliability at scale</li>\n<li>Deep knowledge of cloud-native infrastructure</li>\n<li>Key Attributes:</li>\n<li>You are driven by the challenge of optimizing systems for performance, latency, and throughput, with a proven ability to diagnose complex, multi-system issues</li>\n<li>You have a proven track record of making significant contributions to the architecture of complex, mission-critical systems</li>\n<li>You thrive in an environment where you can focus on deep technical problems</li>\n<li>Bonus Points:</li>\n<li>Experience at a leading Cybersecurity or Infrastructure-as-Code company</li>\n<li>Contributions to open-source projects in the identity, security, or infrastructure space</li>\n</ul>\n<p>And extra credit if you have experience in any of the following!</p>\n<ul>\n<li>Deep expertise in backend systems engineering</li>\n<li>Experience building and scaling beyond standard three-tier monolithic architectures, with a focus on modern distributed systems</li>\n<li>Have worked on projects with complex, established systems</li>\n<li>Possess significant, hands-on experience in a Linux/Unix environment</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_3a40dbfa-d00","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Okta","sameAs":"https://www.okta.com/","logo":"https://logos.yubhub.co/okta.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/okta/jobs/7674829","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$194,000-$267,000 USD","x-skills-required":["Go development","Distributed systems","Secure systems","Distributed storage 
systems","Secrets management","Service mesh","Machine identity platform","Ergonomic API design","Cloud-native infrastructure"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:47:43.090Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, California"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Go development, Distributed systems, Secure systems, Distributed storage systems, Secrets management, Service mesh, Machine identity platform, Ergonomic API design, Cloud-native infrastructure","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":194000,"maxValue":267000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_fca5411d-4fb"},"title":"Staff Site Reliability Engineer - Kubernetes","description":"<p>Secure Every Identity, from AI to Human</p>\n<p>Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organisations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.</p>\n<p>This is an opportunity to do career-defining work. We&#39;re all in on this mission. If you are too, let&#39;s talk.</p>\n<p>Workforce Identity Cloud</p>\n<p>Okta Workforce Identity Cloud (WIC) provides easy, secure access for your workforce so you can focus on other strategic priorities, like reducing costs and doing more for your customers.</p>\n<p>If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you.
The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it” and who can rapidly self-educate on new concepts and tools.</p>\n<p><strong>Position Overview:</strong></p>\n<p>The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and services. This position focuses on architecting and managing reliable, scalable, and secure Kubernetes-based platforms on AWS, ensuring high availability and performance while optimising costs and automation. The ideal candidate will have hands-on experience with AWS infrastructure, Kubernetes platform creation, Helm charts, Karpenter scaling, and Istio service mesh.</p>\n<p><strong>Key Responsibilities:</strong></p>\n<ul>\n<li>Kubernetes Platform Creation: Design, implement, and maintain highly available, scalable, and fault-tolerant Kubernetes platforms. Ensure clusters are optimised for production workloads, providing high resilience and operational efficiency.</li>\n</ul>\n<ul>\n<li>AWS Infrastructure Management: Build, manage, and optimise AWS cloud infrastructure, including EKS, ECS, S3, VPCs, RDS, IAM, and more. Implement best practices for cost management, scaling, and security within AWS.</li>\n</ul>\n<ul>\n<li>Helm Management: Utilise Helm to automate and streamline the deployment of applications and services to Kubernetes clusters. Create, maintain, and manage Helm charts for production-ready deployments.</li>\n</ul>\n<ul>\n<li>Karpenter Implementation: Implement and manage Karpenter to dynamically scale Kubernetes clusters in response to workload demands.</li>\n</ul>\n<ul>\n<li>Istio Service Mesh Management: Configure and manage Istio to provide service-to-service communication, security, and observability within the Kubernetes clusters. 
Enable fine-grained traffic management, service discovery, and policy enforcement.</li>\n</ul>\n<ul>\n<li>Platform Automation &amp; Scaling: Automate the deployment, scaling, and management of infrastructure and applications. Work with CI/CD pipelines to ensure a seamless flow from development to production with minimal downtime.</li>\n</ul>\n<ul>\n<li>Incident Management &amp; Troubleshooting: Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security in a timely and effective manner.</li>\n</ul>\n<ul>\n<li>Security &amp; Compliance: Design and implement secure cloud infrastructure with appropriate access controls, network security, and compliance frameworks.</li>\n</ul>\n<ul>\n<li>Documentation &amp; Knowledge Sharing: Create and maintain detailed documentation for Kubernetes platform setup, operational procedures, and best practices. Promote knowledge sharing across teams.</li>\n</ul>\n<p><strong>Required Qualifications:</strong></p>\n<ul>\n<li>4+ years of experience with Kubernetes/Helm;</li>\n</ul>\n<ul>\n<li>4+ years of Experience with Terraform.</li>\n</ul>\n<ul>\n<li>5+ years of Experience with AWS</li>\n</ul>\n<ul>\n<li>Experience with multi-region cloud environments.</li>\n</ul>\n<ul>\n<li>Proven experience with AWS (EC2, RDS, S3, CloudFormation, IAM, etc.) 
and solid understanding of cloud-native architectures.</li>\n</ul>\n<ul>\n<li>Strong expertise in Kubernetes platform creation, management, and optimisation (e.g., setting up highly available clusters, networking, and storage).</li>\n</ul>\n<ul>\n<li>Hands-on experience with Helm for Kubernetes application deployment and management.</li>\n</ul>\n<ul>\n<li>Practical experience with Karpenter for dynamic scaling of Kubernetes clusters and optimising resource usage.</li>\n</ul>\n<ul>\n<li>Expertise in managing and securing Istio for service mesh, including traffic management, security, and observability features.</li>\n</ul>\n<ul>\n<li>Proficiency in CI/CD pipelines and automation tools (e.g., Jenkins, GitLab, CircleCI, Terraform, Ansible, Spinnaker).</li>\n</ul>\n<ul>\n<li>Strong scripting and automation skills in Python, Bash, or Go for infrastructure management and platform automation.</li>\n</ul>\n<ul>\n<li>Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, CloudWatch, and ELK Stack.</li>\n</ul>\n<p><strong>Preferred Qualifications:</strong></p>\n<ul>\n<li>Understanding of security best practices for cloud platforms and Kubernetes (e.g., role-based access control (RBAC), encryption, and compliance frameworks).</li>\n</ul>\n<ul>\n<li>Familiarity with Docker and containerization principles.</li>\n</ul>\n<ul>\n<li>Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience).</li>\n</ul>\n<ul>\n<li>Certifications (Preferred): CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), or AWS Certified DevOps Engineer are highly desirable.</li>\n</ul>\n<p>Additional requirements:</p>\n<ul>\n<li>This position requires the ability to access federal environments and/or have access to protected federal data. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. 
Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.</li>\n</ul>\n<ul>\n<li>Requires in-person onboarding and travel to our San Francisco, CA HQ office or our Chicago office during the first week of employment.</li>\n</ul>\n<p>#LI-Hybrid</p>\n<p>#LI-LSS1</p>\n<p>requisition ID- (P16373_3396241)</p>\n<p>The annual base salary range for this position for candidates located in the San Francisco Bay Area is between: $194,000-$267,000 USD</p>\n<p>Below is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: https://rewards.okta.com/us.</p>\n<p>The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington is between: $174,000-$214,000 USD</p>\n<p>The Okta Experience</p>\n<ul>\n<li>Supporting Your Well-Being</li>\n</ul>\n<ul>\n<li>Driving Social Impact</li>\n</ul>\n<ul>\n<li>Developing Talent and Fostering Connection + Community</li>\n</ul>\n<p>We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. 
Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_fca5411d-4fb","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Okta","sameAs":"https://www.okta.com/","logo":"https://logos.yubhub.co/okta.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/okta/jobs/7743339","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$174,000-$214,000 USD","x-skills-required":["Kubernetes","Helm","Terraform","AWS","Cloud-native architectures","Kubernetes platform creation","Kubernetes management","Kubernetes optimisation","Helm for Kubernetes application deployment","Karpenter for dynamic scaling","Istio for service mesh","CI/CD pipelines","Automation tools","Python","Bash","Go","Monitoring","Logging","Alerting"],"x-skills-preferred":["Security best practices for cloud platforms and Kubernetes","Docker and containerization principles","Certified Kubernetes Administrator","Certified Kubernetes Application Developer","AWS Certified DevOps Engineer"],"datePosted":"2026-04-18T15:46:19.185Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bellevue, Washington; Chicago, Illinois; New York, New York; San Francisco, California; Washington, DC"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Kubernetes, Helm, Terraform, AWS, Cloud-native architectures, Kubernetes platform creation, Kubernetes management, Kubernetes optimisation, Helm for Kubernetes application deployment, Karpenter for dynamic scaling, Istio for service mesh, CI/CD pipelines, Automation tools, Python, Bash, Go, Monitoring, Logging, Alerting, Security best practices for cloud platforms 
and Kubernetes, Docker and containerization principles, Certified Kubernetes Administrator, Certified Kubernetes Application Developer, AWS Certified DevOps Engineer","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":174000,"maxValue":214000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9f2e3373-2d6"},"title":"Senior Software Engineer - Platform Network","description":"<p>Secure Every Identity</p>\n<p>Okta secures AI by building the trusted, neutral infrastructure that enables organisations to safely embrace this new era.</p>\n<p>The Platform Network Engineering Team</p>\n<p>Auth0 by Okta is an easy-to-implement authentication and authorization platform designed by developers for developers. We make access to applications safe, secure, and seamless for over 100 million daily logins worldwide.</p>\n<p>Our modern approach to identity enables this Tier 0 global service to deliver convenience, privacy, and security so customers can focus on innovation.</p>\n<p>The Senior Software Engineer Opportunity</p>\n<p>You will be part of the Platform Network engineering team responsible for all connectivity of Auth0. 
You will play a key engineering role as we evolve our network architecture to meet the demands of enormous growth and support the hundreds of millions of users who rely on us to provide uninterrupted access.</p>\n<p>What you’ll be doing</p>\n<p>Implement internal and edge networking infrastructure and design solutions that work at global scale and with multi-cloud and multi-region constraints.</p>\n<p>Carry cross-team initiatives from end to end: code reviews, design reviews, operational robustness, security hygiene, etc.</p>\n<p>Design and develop new services, tools, and automation to expose network functionality to other Okta engineering and operations teams.</p>\n<p>Research and implement solutions addressing cross-cutting concerns such as routing, failover, and scaling.</p>\n<p>Participate in the team’s on-call rotation.</p>\n<p>What you’ll bring to the role</p>\n<p>Have 3+ years of software development experience in cloud-native services like API.</p>\n<p>Demonstrable knowledge of TCP/IP, DNS, HTTP, TLS.</p>\n<p>Have DevOps experience using cloud-agnostic, cloud-native technologies.</p>\n<p>Have experience managing infrastructure with Terraform.</p>\n<p>Have experience contributing to Go-based services.</p>\n<p>Have a passion for working on global distributed systems that are highly reliable, maintainable, scalable, and secure.</p>\n<p>Tend to deliver work incrementally to get feedback and iterate over solutions.</p>\n<p>Bring the right attitude to the team: ownership, accountability, and attention to detail.</p>\n<p>And extra credit if you have experience in any of the following!</p>\n<p>A &#39;Product Mindset&#39; toward infrastructure: building internal networking tools that are self-service, well-documented, and easy for application teams to consume.</p>\n<p>Experience with using cloud providers such as AWS or Azure and major content delivery networks.</p>\n<p>Experience implementing and scaling Service Mesh 
architectures to manage service-to-service communication, observability, and security.</p>\n<p>Knowledge of Istio/Envoy Proxy and the Kubernetes Gateway API to provide flexible, self-service ingress solutions for product teams.</p>\n<p>Experience designing and maintaining multi-cloud networking topologies and hybrid connectivity (Direct Connect, Cloud Interconnect) at scale</p>\n<p>Salary and Benefits</p>\n<p>The annual base salary range for this position for candidates located in Canada is between $136,000-$187,000 CAD.</p>\n<p>Okta offers equity (where applicable), bonus, and benefits, including health, dental, and vision insurance, RRSP with a match, healthcare spending, telemedicine, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies.</p>\n<p>To learn more about our Total Rewards program, please visit: https://rewards.okta.com/can</p>\n<p>The Okta Experience</p>\n<p>Supporting Your Well-being</p>\n<p>Driving Social Impact</p>\n<p>Developing Talent and Fostering Connection + Community</p>\n<p>We are intentional about connection. 
Our global community, spanning over 20 offices worldwide, is united by a drive to innovate.</p>\n<p>Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.</p>\n<p>Okta is an Equal Opportunity Employer.</p>\n<p>All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran.</p>\n<p>We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.</p>\n<p>If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.</p>\n<p>Notice for New York City Applicants &amp; Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process.</p>\n<p>In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please click here to view our full NYC AEDT Notice.</p>\n<p>Okta is committed to complying with applicable data privacy and security laws and regulations.</p>\n<p>For more information, please see our Personnel and Job Candidate Privacy Notice at https://www.okta.com/legal/personnel-policy/</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a 
href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9f2e3373-2d6","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Okta","sameAs":"https://www.okta.com","logo":"https://logos.yubhub.co/okta.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/okta/jobs/7653477","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$136,000-$187,000 CAD","x-skills-required":["software development experience in cloud-native services like API","TCP/IP","DNS","HTTP","TLS","DevOps experience using cloud-agnostic, cloud-native technologies","infrastructure with Terraform","Go-based services"],"x-skills-preferred":["Product Mindset toward infrastructure","cloud providers such as AWS or Azure","major content delivery networks","Service Mesh architectures","Istio/Envoy Proxy","Kubernetes Gateway API","multi-cloud networking topologies","hybrid connectivity"],"datePosted":"2026-04-18T15:45:29.712Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Toronto, Ontario, Canada"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"software development experience in cloud-native services like API, TCP/IP, DNS, HTTP, TLS, DevOps experience using cloud-agnostic, cloud-native technologies, infrastructure with Terraform, Go-based services, Product Mindset toward infrastructure, cloud providers such as AWS or Azure, major content delivery networks, Service Mesh architectures, Istio/Envoy Proxy, Kubernetes Gateway API, multi-cloud networking topologies, hybrid connectivity","baseSalary":{"@type":"MonetaryAmount","currency":"CAD","value":{"@type":"QuantitativeValue","minValue":136000,"maxValue":187000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_63af8568-789"},"title":"Engineering Manager, Inference Routing and 
Performance","description":"<p><strong>About the role\\nEvery request that hits Claude (from claude.ai, the API, our cloud partners, or internal research) passes through a routing decision. Not a generic load balancer round-robin, but a decision that accounts for what&#39;s already cached where, which accelerator the request runs best on, and what else is in flight across the fleet.\\n\\nGet it right and you extract meaningfully more throughput from the same hardware. Get it wrong and you burn capacity, miss latency SLOs, or shed load that shouldn&#39;t have been shed.\\n\\nThe Inference Routing team owns this layer. We build the cluster-level routing and coordination plane for Anthropic&#39;s inference fleet: the system that sits between the API surface and the inference engines themselves, making fleet-wide efficiency decisions in real time.\\n\\nAs Anthropic moves from &quot;many independent inference replicas&quot; toward &quot;a single warehouse-scale computer running a coordinated program,&quot; Dystro is the coordination layer. 
This is a deeply technical team.\\n\\nThe engineers here design custom load-balancing algorithms, build quantitative models of system performance, debug latency spikes that cross kernel, network, and framework boundaries, and reason carefully about cache placement across thousands of accelerators.\\n\\nThey work shoulder-to-shoulder with teams that write kernels and ML framework internals.\\n\\nThe EM for this team doesn&#39;t need to write kernels, but they do need the systems depth to make architectural calls, evaluate deeply technical candidates, and spot when a proposed optimization will have second-order effects on the fleet.\\n\\nYou&#39;ll inherit a strong team of distributed-systems engineers, and you&#39;ll be accountable for two things that pull in different directions: shipping system-level performance improvements that measurably increase fleet throughput and efficiency, and running the team operationally so that deploys are safe, incidents are rare, and the teams who depend on Dystro can plan around you with confidence.\\n\\nThe job is holding both.\\n\\n## Representative work:\\nThings the Inference Routing EM actually spends time on:\\n- Deciding whether a proposed routing algorithm change is worth the deploy risk, given the modeled throughput gain and the blast radius if it regresses\\n- Sequencing a quarter where KV-cache offload, a new coordination protocol, and two model launches all compete for the same engineers\\n- Working through a persistent tail-latency regression with the team, walking down from fleet-level metrics to per-replica behavior to a root cause in the networking stack\\n- Building the case (with numbers) to peer teams for why a cross-team protocol change unlocks the next efficiency win\\n- Running the post-incident review after a cache-eviction bug caused a capacity event, and turning it into process changes that stick\\n- Interviewing a candidate who has built schedulers at supercomputing scale, and deciding whether they&#39;d 
be additive to a team that already goes deep\\n\\n## What you&#39;ll do:\\nDrive system-level performance\\n- Own the technical roadmap for cluster-level inference efficiency: routing decisions, cache placement and eviction, cross-replica coordination, and the protocols that keep routing and inference engines in sync\\n- Partner with the inference engine, kernels, and performance teams to identify fleet-level throughput and latency wins, then turn those into shipped improvements with measurable results\\n- Build the team&#39;s habit of quantitative performance modeling: claim a win only when you can measure it, and know before you ship what the expected effect is\\n\\nDeliver reliably and operate cleanly\\n- Set technical strategy for how routing evolves across heterogeneous hardware (GPUs, TPUs, Trainium) and across all our serving surfaces\\n- Run the team&#39;s operational backbone (on-call rotation, incident response, postmortem review, deploy safety) so the team can ship aggressively without the system becoming fragile\\n- Create clarity at a seam: Inference Routing sits between the API surface, the inference engines, and the cloud deployment teams. You&#39;ll make sure commitments are realistic, dependencies are understood, and nobody is surprised\\n\\nBuild and grow the team\\n- Develop and retain a strong existing team, and hire against the bar described above: people who can go to the OS and framework level when the problem demands it, and who care about production reliability\\n- Coach engineers through a roadmap where priorities shift with model launches, new hardware, and scaling demands. We pair a lot here; you&#39;ll help make that collaboration pattern productive\\n- Pick up slack when it matters. 
This is a small team in a critical path; sometimes the EM is the one unblocking a stuck deploy or synthesizing a design debate\\n\\n## You may be a good fit if you:\\n- Have 5+ years of engineering management experience, ideally with at least part of that leading teams on critical-path production infrastructure at scale\\n- Have a deep systems background: load balancing, scheduling, cache-coherent distributed state, high-performance networking, or similar. You need enough depth to make architectural calls about routing and efficiency, and to evaluate candidates who go to the kernel and framework level\\n- Have shipped performance improvements in large-scale systems and can explain, with numbers, what the impact was\\n- Have run production infrastructure with real operational stakes: on-call, incident response, capacity events, deploy discipline\\n- Are results-oriented with a bias toward impact, and comfortable working in a space where throughput, latency, stability, and feature velocity all pull in different directions\\n- Build strong relationships across team boundaries: this is a seam role, and much of the job is making sure other teams can rely on yours\\n- Are curious about machine learning systems. 
You don&#39;t need an ML research background, but you should want to learn how transformer inference actually works and how that shapes the systems problems\\n\\nStrong candidates may also have:\\n- Experience with LLM inference serving: KV caching, continuous batching, request scheduling, prefill/decode disaggregation\\n- Background in cluster schedulers, load balancers, service meshes, or coordination planes at scale\\n- Familiarity with heterogeneous accelerator fleets (GPU/TPU/Trainium) and how hardware differences affect workload placement\\n- Experience with GPU/accelerator programming, ML framework internals, or OS-level performance debugging, enough to follow and evaluate the technical work, not necessarily to do it daily\\n- Led teams at supercomputing or hyperscaler infrastructure scale\\n- Led teams through rapid-growth periods where hiring and onboarding competed with roadmap delivery\\n\\nThe annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings (&quot;OTE&quot;) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.\\nAnnual Salary: $405,000-$485,000 USD</strong></p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_63af8568-789","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5155391008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$405,000-$485,000 USD","x-skills-required":["engineering management","deep systems background","load balancing","scheduling","cache-coherent distributed state","high-performance networking"],"x-skills-preferred":["LLM 
inference serving","cluster schedulers","load balancers","service meshes","coordination planes","heterogeneous accelerator fleets","GPU/TPU/Trainium","GPU/accelerator programming","ML framework internals","OS-level performance debugging"],"datePosted":"2026-04-18T15:37:38.038Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"engineering management, deep systems background, load balancing, scheduling, cache-coherent distributed state, high-performance networking, LLM inference serving, cluster schedulers, load balancers, service meshes, coordination planes, heterogeneous accelerator fleets, GPU/TPU/Trainium, GPU/accelerator programming, ML framework internals, OS-level performance debugging","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":405000,"maxValue":485000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_734a57ad-497"},"title":"Security Engineer","description":"<p>We&#39;re seeking a senior-level Security Engineer to own the design, implementation, and continuous improvement of security guardrails across our cloud infrastructure. You&#39;ll build the systems and patterns that enable every team at Saronic to move fast and ship with confidence, with security baked in from the start. 
You will be the technical authority on how we architect, govern, and defend our AWS environments across commercial and GovCloud.</p>\n<p><strong>Key Responsibilities</strong></p>\n<ul>\n<li>Own the security architecture for Saronic&#39;s AWS environments, including multi-account strategy, network segmentation, identity architecture, and data protection across commercial AWS and AWS GovCloud</li>\n</ul>\n<ul>\n<li>Design and maintain secure-by-default Terraform modules and IaC standards that teams adopt as the standard path, enforcing least privilege, secure defaults, and compliance requirements</li>\n</ul>\n<ul>\n<li>Implement preventive controls (SCPs, permission boundaries, policy-as-code) and detective controls (Config rules, CloudTrail analysis, GuardDuty) as a unified, layered security model</li>\n</ul>\n<ul>\n<li>Design and enforce IAM patterns across AWS accounts, services, and workloads including least-privilege policies, permission boundaries, cross-account access, federation, and service-to-service authentication</li>\n</ul>\n<ul>\n<li>Implement and govern secrets management using tools such as AWS Secrets Manager or Vault, integrated into CI/CD and runtime environments</li>\n</ul>\n<ul>\n<li>Partner with DevOps and Platform Engineering to embed security into CI/CD pipelines, infrastructure provisioning, and deployment workflows</li>\n</ul>\n<ul>\n<li>Build automated compliance validation into infrastructure pipelines and replace manual security gates with automated guardrails wherever possible</li>\n</ul>\n<ul>\n<li>Create self-service security tooling and patterns that allow teams to operate with speed and autonomy while maintaining compliance</li>\n</ul>\n<ul>\n<li>Integrate logging, monitoring, and alerting across cloud infrastructure to validate control effectiveness and detect misconfigurations or threats</li>\n</ul>\n<ul>\n<li>Build and tune cloud-native detections using CloudTrail, GuardDuty, Config, and SIEM 
integrations</li>\n</ul>\n<ul>\n<li>Support incident response for cloud security events, drive root-cause analysis, and translate findings into improved guardrails and controls</li>\n</ul>\n<p><strong>Required Qualifications:</strong></p>\n<ul>\n<li>6+ years of hands-on experience in cloud security engineering, infrastructure security, DevSecOps, or a closely related security engineering role</li>\n</ul>\n<ul>\n<li>Expert-level proficiency with Terraform, including module design, state management, policy-as-code, and managing complex multi-environment configurations</li>\n</ul>\n<ul>\n<li>Deep expertise in AWS security services and architecture, including IAM, Organizations, SCPs, Control Tower, CloudTrail, Config, GuardDuty, Security Hub, KMS, and VPC security</li>\n</ul>\n<ul>\n<li>Demonstrated experience building security guardrails and reusable infrastructure patterns that engineering teams adopt without friction</li>\n</ul>\n<ul>\n<li>Strong experience with CI/CD pipeline security, IaC review processes, and automated compliance validation</li>\n</ul>\n<ul>\n<li>Experience operating in AWS GovCloud or FedRAMP-regulated cloud environments</li>\n</ul>\n<ul>\n<li>Strong proficiency in Python, Go, Rust, or equivalent languages for building security automation and tooling</li>\n</ul>\n<ul>\n<li>Ability to obtain and maintain a security clearance</li>\n</ul>\n<p><strong>Preferred Qualifications:</strong></p>\n<ul>\n<li>Experience in defence, aerospace, robotics, autonomy, or other high-assurance environments</li>\n</ul>\n<ul>\n<li>Experience designing multi-account AWS landing zones and organisational security architectures from the ground up</li>\n</ul>\n<ul>\n<li>Hands-on experience with Kubernetes security, container security, and service mesh security in cloud-native environments</li>\n</ul>\n<ul>\n<li>Familiarity with NIST SP 800-171, NIST SP 800-53, FedRAMP, or Cloud Computing SRG Impact Levels</li>\n</ul>\n<ul>\n<li>Experience with infrastructure drift 
detection, automated remediation, and continuous compliance monitoring</li>\n</ul>\n<ul>\n<li>Relevant certifications such as AWS Security Specialty, AWS Solutions Architect Professional, HashiCorp Terraform Associate/Engineer, CCSP, or CISSP</li>\n</ul>\n<p><strong>Additional Information</strong></p>\n<p>Benefits: Medical Insurance: Comprehensive health insurance plans covering a range of services. Saronic pays 100% of the premium for employees and 80% for dependents. Dental and Vision Insurance: Coverage for routine dental check-ups, orthodontics, and vision care. Saronic pays 100% of the premium under the basic plan for employees and 80% for dependents. Time Off: Generous PTO and Holidays. Parental Leave: Paid maternity and paternity leave to support new parents. Competitive Salary: Industry-standard salaries with opportunities for performance-based bonuses. Retirement Plan: 401(k) plan. Stock Options: Equity options to give employees a stake in the company’s success. Life and Disability Insurance: Basic life insurance and short- and long-term disability coverage. Pet Insurance: Discounted pet insurance options including 24/7 Telehealth helpline. Additional Perks: Free lunch benefit and unlimited free drinks and snacks in the office</p>\n<p>This role requires access to export-controlled information or items that require “U.S. Person” status. As defined by U.S. law, individuals who are any one of the following are considered to be a “U.S. Person”: (1) U.S. citizens, (2) legal permanent residents (a.k.a. green card holders), and (3) certain protected classes of asylees and refugees, as defined in 8 U.S.C. 
1324b(a)(3).</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_734a57ad-497","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Saronic Technologies","sameAs":"https://www.saronictechnologies.com/","logo":"https://logos.yubhub.co/saronictechnologies.com.png"},"x-apply-url":"https://jobs.lever.co/saronic/18310005-a24b-4f4c-9538-465df614c4fa","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Terraform","AWS security services","IAM","Organizations","SCPs","Control Tower","CloudTrail","Config","GuardDuty","Security Hub","KMS","VPC security","Python","Go","Rust","CI/CD pipeline security","IaC review processes","automated compliance validation","AWS GovCloud","FedRAMP-regulated cloud environments"],"x-skills-preferred":["Kubernetes security","container security","service mesh security","NIST SP 800-171","NIST SP 800-53","FedRAMP","Cloud Computing SRG Impact Levels","infrastructure drift detection","automated remediation","continuous compliance monitoring","AWS Security Specialty","AWS Solutions Architect Professional","HashiCorp Terraform Associate/Engineer","CCSP","CISSP"],"datePosted":"2026-04-17T12:56:38.157Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Terraform, AWS security services, IAM, Organizations, SCPs, Control Tower, CloudTrail, Config, GuardDuty, Security Hub, KMS, VPC security, Python, Go, Rust, CI/CD pipeline security, IaC review processes, automated compliance validation, AWS GovCloud, FedRAMP-regulated cloud environments, Kubernetes security, container security, service mesh security, NIST SP 800-171, NIST SP 800-53, FedRAMP, Cloud Computing SRG Impact Levels, infrastructure drift 
detection, automated remediation, continuous compliance monitoring, AWS Security Specialty, AWS Solutions Architect Professional, HashiCorp Terraform Associate/Engineer, CCSP, CISSP"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_eeeb517e-3c5"},"title":"Staff Security Engineer, Infrastructure","description":"<p>We&#39;re looking for a Staff Security Engineer, Infrastructure to secure the core systems that power our platform: GPU compute, multi-cloud environments, networking, and data pipelines. You&#39;ll operate across the full stack, from cloud and Kubernetes to identity, networking, and secrets, designing and implementing security controls that scale with a high-performance AI platform.</p>\n<p>This role is highly hands-on and systems-oriented, sitting at the intersection of security, infrastructure, and distributed systems.</p>\n<p>Your primary responsibilities will be to:</p>\n<ul>\n<li>Build and harden infrastructure security by designing and implementing security controls across cloud infrastructure, Kubernetes and containerized workloads, networking, service meshes, and edge systems, CI/CD pipelines and deployment systems, and secure compute environments for GPU workloads and model execution.</li>\n<li>Implement identity, secrets, and access controls, including machine identity and workload authentication, secrets management and encryption, least-privilege access, and short-lived credentials.</li>\n<li>Protect model weights, inference endpoints, and customer data, design secure data access pathways and isolation mechanisms, and ensure safe multi-tenant execution environments.</li>\n<li>Automate security guardrails directly into infrastructure and CI/CD, use Infrastructure-as-Code to enforce secure defaults, and continuously identify and remediate security gaps through automation.</li>\n<li>Identify and mitigate risks across infrastructure layers, defend against both external 
attackers and insider threats, and drive projects like network isolation, encryption, and secure service communication.</li>\n</ul>\n<p>To succeed in this role, you&#39;ll need to have:</p>\n<ul>\n<li>8+ years in security engineering, infrastructure, or SRE.</li>\n<li>Strong understanding of cloud security, networking fundamentals, Linux systems, and container security.</li>\n<li>Experience building or securing production infrastructure at scale.</li>\n<li>Deep knowledge of authentication and authorization systems, secrets management and cryptography basics, common vulnerabilities and attack vectors, and ability to design security controls across multiple layers.</li>\n<li>Proficiency in at least one language, experience with Infrastructure-as-Code, and strong automation mindset.</li>\n</ul>\n<p>Nice to have experience with GPU infrastructure, multi-tenant platform isolation, service mesh architectures, and high-growth startup environments.</p>\n<p>What makes this role unique is that you&#39;ll work on cutting-edge AI infrastructure security, secure GPU clusters, model execution, and real-time inference systems, have high ownership, and direct impact on developer trust and platform reliability.</p>\n<p>Our security philosophy is to enable developers, automate everything, assume breach, and design for resilience.</p>\n<p>In terms of compensation and benefits, we offer competitive salary, equity, full health, dental, and vision coverage, and opportunity to work on frontier AI infrastructure.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a 
href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_eeeb517e-3c5","directApply":true,"hiringOrganization":{"@type":"Organization","name":"fal.ai","sameAs":"https://fal.ai","logo":"https://logos.yubhub.co/fal.ai.png"},"x-apply-url":"https://job-boards.greenhouse.io/fal/jobs/4200560009","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["cloud security","networking fundamentals","Linux systems","container security","Infrastructure-as-Code","authentication and authorization systems","secrets management and cryptography basics","common vulnerabilities and attack vectors"],"x-skills-preferred":["GPU infrastructure","multi-tenant platform isolation","service mesh architectures","high-growth startup environments"],"datePosted":"2026-04-17T12:32:36.163Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"cloud security, networking fundamentals, Linux systems, container security, Infrastructure-as-Code, authentication and authorization systems, secrets management and cryptography basics, common vulnerabilities and attack vectors, GPU infrastructure, multi-tenant platform isolation, service mesh architectures, high-growth startup environments"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_8e582153-6af"},"title":"Senior DevOps Lead - Cloud & Autonomous System","description":"<p>About Cyngn</p>\n<p>Cyngn is a publicly-traded autonomous technology company that deploys self-driving industrial vehicles to factories, warehouses, and other facilities throughout North America.</p>\n<p>We are a small company with under 100 employees, operating with the energy of a startup. 
However, we&#39;re also publicly traded, which means our employees get access to the liquidity of our publicly-traded equity.</p>\n<p>As a Senior DevOps Lead at Cyngn, you will play a vital role in architecting and managing infrastructure across cloud and autonomous vehicle systems. This position combines traditional cloud DevOps leadership with specialized expertise in robotics and autonomous systems infrastructure.</p>\n<p>Responsibilities</p>\n<ul>\n<li>Lead and architect cloud and vehicle infrastructure initiatives across AWS and ROS/Linux environments</li>\n<li>Design and implement scalable solutions for both cloud services and autonomous vehicle systems</li>\n<li>Establish and maintain DevOps best practices, CI/CD pipelines, and infrastructure as code</li>\n<li>Drive observability, monitoring, and incident response strategies</li>\n<li>Optimize performance and cost efficiency of cloud and edge computing resources</li>\n<li>Mentor team members and foster a developer-friendly environment</li>\n<li>Manage on-call rotations and incident response processes</li>\n<li>Architect solutions for processing and storing large-scale vehicle telemetry data</li>\n<li>Lead security initiatives and compliance efforts across infrastructure</li>\n</ul>\n<p>Requirements</p>\n<ul>\n<li>10+ years of relevant DevOps/Infrastructure experience</li>\n<li>Proven track record as a technical lead in platform or infrastructure teams</li>\n<li>Advanced expertise in AWS services, infrastructure as code (Terraform), and Kubernetes</li>\n<li>Strong experience with service mesh (Istio) and Helm/Kustomize</li>\n<li>Deep understanding of ROS/ROS2 and Linux kernel configurations</li>\n<li>Experience with GPU configurations and ML infrastructure</li>\n<li>Expertise in ARM and NVIDIA CUDA platform configurations</li>\n<li>Strong programming skills in Python and shell scripting</li>\n<li>Experience with infrastructure automation (Ansible)</li>\n<li>Expertise in CI/CD tools (Jenkins, GitHub 
Actions)</li>\n<li>Strong system architecture and design skills</li>\n<li>Excellence in technical documentation</li>\n<li>Outstanding problem-solving abilities</li>\n<li>Strong leadership and mentoring capabilities</li>\n</ul>\n<p>Nice to haves</p>\n<ul>\n<li>Experience with autonomous vehicle systems</li>\n<li>Track record of optimizing GPU-based ML infrastructure</li>\n<li>Experience with large-scale IoT deployments</li>\n<li>Contributions to open-source projects</li>\n<li>Experience with real-time systems and low-latency requirements</li>\n<li>Expertise in security implementations including SSO, IdP, and AWS Cognito</li>\n<li>Experience with JFrog artifactory and container registry management</li>\n<li>Proficiency in AWS IoT Greengrass</li>\n<li>Experience with container resource management on edge devices</li>\n<li>Understanding of CPU affinity and priority scheduling</li>\n<li>Track record of implementing cost optimization strategies</li>\n<li>Experience with scaling systems both horizontally and vertically</li>\n</ul>\n<p>Benefits &amp; Perks</p>\n<ul>\n<li>Health benefits (Medical, Dental, Vision, HSA and FSA (Health &amp; Dependent Daycare), Employee Assistance Program, 1:1 Health Concierge)</li>\n<li>Life, Short-term, and long-term disability insurance (Cyngn funds 100% of premiums)</li>\n<li>Company 401(k)</li>\n<li>Commuter Benefits</li>\n<li>Flexible vacation policy</li>\n<li>Sabbatical leave opportunity after five years with the company</li>\n<li>Paid Parental Leave</li>\n<li>Daily lunches for in-office employees</li>\n<li>Monthly meal and tech allowances for remote employees</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a 
href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_8e582153-6af","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Cyngn","sameAs":"https://www.cyngn.com/","logo":"https://logos.yubhub.co/cyngn.com.png"},"x-apply-url":"https://jobs.lever.co/cyngn/1c31b7d8-cf85-472f-9358-1e10189cf815","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$198,000-225,000 per year","x-skills-required":["AWS services","infrastructure as code (Terraform)","Kubernetes","service mesh (Istio)","Helm/Kustomize","ROS/ROS2","Linux kernel configurations","GPU configurations","ML infrastructure","ARM","NVIDIA CUDA platform configurations","Python","shell scripting","infrastructure automation (Ansible)","CI/CD tools (Jenkins, GitHub Actions)","system architecture and design skills","technical documentation","problem-solving abilities","leadership and mentoring capabilities"],"x-skills-preferred":["autonomous vehicle systems","optimizing GPU-based ML infrastructure","large-scale IoT deployments","open-source projects","real-time systems and low-latency requirements","security implementations including SSO, IdP, and AWS Cognito","JFrog artifactory and container registry management","AWS IoT Greengrass","container resource management on edge devices","CPU affinity and priority scheduling","cost optimization strategies","scaling systems both horizontally and vertically"],"datePosted":"2026-04-17T12:27:09.593Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Mountain View"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"AWS services, infrastructure as code (Terraform), Kubernetes, service mesh (Istio), Helm/Kustomize, ROS/ROS2, Linux kernel configurations, GPU configurations, ML infrastructure, ARM, NVIDIA CUDA platform configurations, Python, shell scripting, infrastructure 
automation (Ansible), CI/CD tools (Jenkins, GitHub Actions), system architecture and design skills, technical documentation, problem-solving abilities, leadership and mentoring capabilities, autonomous vehicle systems, optimizing GPU-based ML infrastructure, large-scale IoT deployments, open-source projects, real-time systems and low-latency requirements, security implementations including SSO, IdP, and AWS Cognito, JFrog artifactory and container registry management, AWS IoT Greengrass, container resource management on edge devices, CPU affinity and priority scheduling, cost optimization strategies, scaling systems both horizontally and vertically","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":198000,"maxValue":225000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_a560bd4c-a1a"},"title":"Cloud Security Engineer","description":"<p>We&#39;re looking for a Cloud Security Engineer to join our team. 
As a Cloud Security Engineer at Starling, you&#39;ll be building and supporting tooling and infrastructure that spans across AWS and GCP supporting our internal operations and interfacing with other teams to deliver the services that support our business.</p>\n<p>Key Responsibilities:</p>\n<ul>\n<li>Engineer Secure Foundations: You will lead the design and implementation of critical security services, with a heavy focus on building robust Identity and Access Management (IAM) systems and automated, API-driven certificate management workflows.</li>\n<li>Security-as-Code &amp; Scalability: Leveraging a software-first philosophy, you will develop and maintain high-quality, scalable security tooling and middleware within ECS and Kubernetes environments, ensuring security logic is integrated directly into the deployment pipeline.</li>\n<li>Collaborative Code Ownership: You will serve as a technical authority in cross-functional code reviews, acting as an engineering peer who helps teams bake security into their services from the first line of code to the final pull request.</li>\n<li>Proactive System Hardening: You will stay ahead of the evolving threat landscape by treating security as a continuous engineering challenge, proactively identifying vulnerabilities and architecting technical solutions to fortify our global ecosystem.</li>\n</ul>\n<p>Professional Requirements:</p>\n<ul>\n<li>Demonstrated ability to architect secure, distributed systems with a focus on programmatic IAM and automated, API-driven PKI management.</li>\n<li>Extensive experience with Infrastructure as Code (IaC) in Terraform and a deep commitment to writing clean, maintainable, and production-grade code, ideally in Golang.</li>\n<li>A test-first mentality toward security, with experience building unit and integration tests into CI/CD pipelines to ensure that security guardrails are as reliable as the features they protect.</li>\n<li>A strong conceptual grasp of cryptographic primitives and hands-on 
experience securing containerized workloads and service meshes within ECS and Kubernetes.</li>\n<li>A track record of taking end-to-end ownership of complex technical projects, from initial design docs and RFCs through to deployment and observability.</li>\n<li>A belief that if it isn&#39;t tested, it&#39;s broken, and a drive to proactively identify and fix vulnerabilities by treating security as a continuous engineering challenge.</li>\n</ul>\n<p>Our Team Philosophy:\nThe Security Engineering team is a diverse and dynamic group passionate about building secure and resilient systems. We&#39;re enthusiastic about security, but we&#39;re not about rigid, one-size-fits-all controls. We believe in striking a balance between protecting our systems and empowering our developers to build and innovate.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_a560bd4c-a1a","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Starling","sameAs":"https://www.starlingbank.com/","logo":"https://logos.yubhub.co/starlingbank.com.png"},"x-apply-url":"https://apply.workable.com/j/3B7E26FC24","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Cloud Security","AWS","GCP","Identity and Access Management","API-driven Certificate Management","Infrastructure as Code","Terraform","Golang","Cryptographic Primitives","Containerized Workloads","Service Meshes"],"x-skills-preferred":[],"datePosted":"2026-03-20T16:14:58.088Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"Cloud Security, AWS, GCP, Identity and Access Management, API-driven Certificate Management, Infrastructure as Code, Terraform, Golang, Cryptographic Primitives, 
Containerized Workloads, Service Meshes"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_41528416-21c"},"title":"Staff+ Software Security Engineer","description":"<p><strong>About Anthropic</strong></p>\n<p>Anthropic&#39;s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</p>\n<p><strong>About the Team</strong></p>\n<p>The Security Engineering team protects Anthropic&#39;s AI systems and maintains the trust of our users and society. We define the authentication architecture for our training infrastructure, design the cryptographic foundations that protect model weights and training data, and drive the developer security program that shapes how engineers build and ship software.</p>\n<p><strong>About the role:</strong></p>\n<ul>\n<li>Scope, design, and build complex security systems end to end, maintaining them through production and driving through ambiguous technical challenges with minimal oversight</li>\n<li>Identify systematic risks through threat modeling and risk assessment, then build the controls and infrastructure that address them</li>\n<li>Mentor engineers across the security team and broader engineering organisation, contribute to hiring, and grow security engineering culture at Anthropic</li>\n<li>Enable other teams to build their own security solutions by providing design pattern guidance and expanding security ownership beyond the security team</li>\n</ul>\n<p><strong>Developer security and supply chain</strong></p>\n<ul>\n<li>Build and advance our developer security program by embedding security practices into the software development lifecycle and developer workflows</li>\n<li>Harden CI/CD pipelines against supply chain 
attacks through isolated build environments, signed attestations, dependency verification, and automated policy enforcement</li>\n</ul>\n<p><strong>Identity and secrets management</strong></p>\n<ul>\n<li>Architect systems that protect sensitive assets including model weights, customer data, and training datasets</li>\n<li>Build and operate credential issuance, rotation, and workload authentication across our multi-cloud environments</li>\n</ul>\n<p><strong>Infrastructure security</strong></p>\n<ul>\n<li>Implement and maintain cloud security controls including IAM, network segmentation, VPC architecture, and encryption across our multi-cloud and on-prem environments</li>\n<li>Contribute to cluster security controls including RBAC policies, namespace isolation, workload identity, and pod security</li>\n<li>Contribute to continuous cloud security posture management using infrastructure-as-code scanning, misconfiguration detection, and automated remediation</li>\n</ul>\n<p><strong>Secure frameworks</strong></p>\n<ul>\n<li>Build critical security foundations including cryptographic frameworks, mTLS infrastructure, secure serialization, and authorization systems, designed to prevent entire classes of vulnerabilities and empower engineering teams to work securely without becoming security experts themselves</li>\n<li>Partner with product, research, infrastructure, and other security teams to ensure frameworks integrate smoothly with lower-layer security controls</li>\n</ul>\n<p><strong>You may be a good fit if you have:</strong></p>\n<ul>\n<li>At least 8 years of software engineering experience with deep security expertise, including leading complex security initiatives independently</li>\n<li>Bachelor&#39;s degree in Computer Science or equivalent industry experience</li>\n<li>Strong programming skills in Python or at least one systems language such as Go, Rust, or C/C++</li>\n<li>Deep understanding of identity systems, cryptographic primitives, and secrets 
management</li>\n<li>Working knowledge of Kubernetes security primitives including RBAC, namespaces, network policies, and service accounts</li>\n<li>Experience leading cross-functional security initiatives and navigating complex organisational dynamics</li>\n<li>Outstanding communication skills, translating technical concepts effectively across all levels of the organisation</li>\n<li>A track record of bringing clarity and ownership to ambiguous technical problems and driving them to resolution</li>\n<li>Low ego and high empathy, with a history of growing the engineers around you and supporting diverse, inclusive teams</li>\n<li>Passion for AI safety and the role security engineering plays in building trustworthy AI systems</li>\n</ul>\n<p><strong>Strong candidates may also have:</strong></p>\n<ul>\n<li>Designed or operated identity and secrets management systems for large-scale AI or cloud infrastructure</li>\n<li>Built security frameworks or libraries adopted across an engineering organisation</li>\n<li>Led a developer security program including supply chain security, secure build infrastructure, and SDLC integrations</li>\n<li>Built or secured CI infrastructure using Nix, Bazel, or Kubernetes-based deploy systems, with depth in toolchain issues, CI/CD pipelines, and developer workflow optimisation</li>\n<li>Implemented machine identity or workload authentication systems using SPIFFE/SPIRE, mTLS, or equivalent</li>\n<li>Understanding of Linux systems internals including namespaces, cgroups, and seccomp, and how these underpin container and workload isolation</li>\n<li>Contributed to the security architecture of multi-cloud environments including network segmentation, data protection, and access governance</li>\n<li>Experience with network security controls including admission controllers, CNI-level policy, service mesh security, and east-west traffic enforcement</li>\n<li>Experience building runtime security monitoring using eBPF or kernel security 
policies</li>\n</ul>\n<p><strong>Deadline to apply:</strong></p>\n<p>None, applications will be received on a rolling basis.</p>\n<p><strong>The annual compensation range for this role is listed below.</strong></p>\n<p>For sales roles, the range provided is the role’s On Target Earnings (&quot;OTE&quot;) range, meaning the total amount of money an employee is expected to earn in a year, including bonuses and other forms of compensation.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_41528416-21c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://job-boards.greenhouse.io","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5120512008","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"The annual compensation range for this role is listed below.\n\nFor sales roles, the range provided is the role’s On Target Earnings (\"OTE\") range, meaning the total amount of money an employee is expected to earn in a year, including bonuses and other forms of compensation.","x-skills-required":["Python","Go","Rust","C/C++","Kubernetes","RBAC","namespaces","network policies","service accounts","identity systems","cryptographic primitives","secrets management"],"x-skills-preferred":["Nix","Bazel","Kubernetes-based deploy systems","SPIFFE/SPIRE","mTLS","Linux systems internals","namespaces","cgroups","seccomp","container and workload isolation","multi-cloud environments","network segmentation","data protection","access governance","admission controllers","CNI-level policy","service mesh security","east-west traffic enforcement","runtime security monitoring","eBPF","kernel security 
policies"],"datePosted":"2026-03-08T13:52:38.657Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY | Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Go, Rust, C/C++, Kubernetes, RBAC, namespaces, network policies, service accounts, identity systems, cryptographic primitives, secrets management, Nix, Bazel, Kubernetes-based deploy systems, SPIFFE/SPIRE, mTLS, Linux systems internals, namespaces, cgroups, seccomp, container and workload isolation, multi-cloud environments, network segmentation, data protection, access governance, admission controllers, CNI-level policy, service mesh security, east-west traffic enforcement, runtime security monitoring, eBPF, kernel security policies"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_2fd7fc02-3ed"},"title":"Security Engineer, Agent Security","description":"<p><strong>Security Engineer, Agent Security</strong></p>\n<p><strong>Location</strong></p>\n<p>San Francisco</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p>Security</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>$293K – $385K • Offers Equity</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. 
In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p><strong>About the Team</strong></p>\n<p>The team’s mission is to accelerate the secure evolution of agentic AI systems at OpenAI. 
To achieve this, the team designs, implements, and continuously refines security policies, frameworks, and controls that defend OpenAI’s most critical assets—including the user and customer data embedded within them—against the unique risks introduced by agentic AI.</p>\n<p><strong>About the Role</strong></p>\n<p><strong>As a Security Engineer on the Agent Security Team</strong>, you will be at the forefront of securing OpenAI’s cutting-edge agentic AI systems. Your role will involve designing and implementing robust security frameworks, policies, and controls to safeguard OpenAI’s critical assets and ensure the safe deployment of agentic systems. You will develop comprehensive threat models, partner tightly with our Agent Infrastructure group to fortify the platforms that power OpenAI’s most advanced agentic systems, and lead efforts to enhance safety monitoring pipelines at scale.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Architecting security controls for agentic AI – design, implement, and iterate on identity, network, and runtime-level defenses (e.g., sandboxing, policy enforcement) that integrate directly with the Agent Infrastructure stack.</li>\n</ul>\n<ul>\n<li>Building production-grade security tooling – ship code that hardens safety monitoring pipelines across agent executions at scale.</li>\n</ul>\n<ul>\n<li>Collaborating cross-functionally – work daily with Agent Infrastructure, product, research, safety, and security teams to balance security, performance, and usability.</li>\n</ul>\n<ul>\n<li>Influencing strategy &amp; standards – shape the long-term Agent Security roadmap, publish best practices internally and externally, and help define industry standards for securing autonomous AI.</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>Strong software-engineering skills in Python or at least one systems language (Go, Rust, C/C++), plus a track record of shipping and operating secure, high-reliability 
services.</li>\n</ul>\n<ul>\n<li>Deep expertise in modern isolation techniques – experience with container security, kernel-level hardening, and other isolation methods.</li>\n</ul>\n<ul>\n<li>Hands-on network security experience – implementing identity-based controls, policy enforcement, and secure large-scale telemetry pipelines.</li>\n</ul>\n<ul>\n<li>Clear, concise communication that bridges engineering, research, and leadership audiences; comfort influencing roadmaps and driving consensus.</li>\n</ul>\n<ul>\n<li>Bias for action &amp; ownership – you thrive in ambiguity, move quickly without sacrificing rigor, and elevate the security bar company-wide from day one.</li>\n</ul>\n<ul>\n<li>Cloud security depth on at least one major provider (Azure, AWS, GCP), including identity federation, workload IAM, and infrastructure-as-code best practices.</li>\n</ul>\n<ul>\n<li>Familiarity with AI/ML security challenges – experience addressing risks associated with advanced AI systems (nice-to-have but valuable)</li>\n</ul>\n<p><strong>Preferred Qualifications</strong></p>\n<ul>\n<li>Experience with container orchestration (e.g., Kubernetes) and service mesh technologies (e.g., Istio, Linkerd).</li>\n</ul>\n<ul>\n<li>Knowledge of cloud security frameworks and compliance standards (e.g., HIPAA, PCI-DSS).</li>\n</ul>\n<ul>\n<li>Familiarity with machine learning and AI frameworks (e.g., TensorFlow, PyTorch).</li>\n</ul>\n<ul>\n<li>Experience with DevOps tools and practices (e.g., CI/CD pipelines, containerization).</li>\n</ul>\n<p><strong>What We Offer</strong></p>\n<ul>\n<li>Competitive salary and benefits package</li>\n</ul>\n<ul>\n<li>Opportunity to work with a talented team of engineers and researchers</li>\n</ul>\n<ul>\n<li>Collaborative and dynamic work environment</li>\n</ul>\n<ul>\n<li>Professional growth and development opportunities</li>\n</ul>\n<ul>\n<li>Flexible work arrangements</li>\n</ul>\n<ul>\n<li>Access to cutting-edge technology and 
tools</li>\n</ul>\n<p><strong>How to Apply</strong></p>\n<p>If you are a motivated and experienced security engineer looking to join a dynamic team, please submit your application, including your resume and a cover letter, to [insert contact information]. We look forward to hearing from you!</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_2fd7fc02-3ed","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/e9bea775-7eb6-438a-ab96-27d5f941e69d","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$293K – $385K • Offers Equity","x-skills-required":["Python","Go","Rust","C/C++","container security","kernel-level hardening","isolation methods","identity-based controls","policy enforcement","telemetry pipelines","cloud security","identity federation","workload IAM","infrastructure-as-code"],"x-skills-preferred":["container orchestration","service mesh technologies","cloud security frameworks","compliance standards","machine learning","AI frameworks","DevOps tools","CI/CD pipelines","containerization"],"datePosted":"2026-03-06T18:44:49.390Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Go, Rust, C/C++, container security, kernel-level hardening, isolation methods, identity-based controls, policy enforcement, telemetry pipelines, cloud security, identity federation, workload IAM, infrastructure-as-code, container orchestration, service mesh technologies, cloud security frameworks, compliance standards, machine learning, AI frameworks, DevOps tools, CI/CD pipelines, 
containerization","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":293000,"maxValue":385000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_cbb7e2e4-4bc"},"title":"Security Engineer, Infrastructure Security","description":"<p><strong>Security Engineer, Infrastructure Security</strong></p>\n<p><strong>Location</strong></p>\n<p>Remote - US; New York City; San Francisco; Seattle</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Location Type</strong></p>\n<p>Remote</p>\n<p><strong>Department</strong></p>\n<p>Security</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>SF, Seattle or NYC $230K – $385K • Offers Equity</li>\n<li>Zone A $207K – $346.5K • Offers Equity</li>\n<li>Zone B $184K – $308K • Offers Equity</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. 
In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n<li>401(k) retirement plan with employer match</li>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n<li>Mental health and wellness support</li>\n<li>Employer-paid basic life and disability coverage</li>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n<li>Relocation support for eligible employees</li>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p><strong>About the Team</strong></p>\n<p>Security is at the foundation of OpenAI’s mission to ensure that artificial general intelligence benefits all of humanity.</p>\n<p>The Security team protects OpenAI’s technology, people, and products. We are technical in what we build but are operational in how we do our work, and are committed to supporting all products and research at OpenAI. 
Our Security team tenets include: prioritizing for impact, enabling researchers, preparing for future transformative technologies, and engaging a robust security culture.</p>\n<p><strong>About the Role</strong></p>\n<p>OpenAI is seeking a Security Engineer to join our Infrastructure Security (InfraSec) team. InfraSec protects the foundations of OpenAI’s research and production environments, spanning GPU supercomputing clusters, multi-cloud infrastructure, datacenters, networking, storage, and the critical services that power our frontier AI models. Our charter includes securing everything from bare-metal hardware and firmware, to Kubernetes clusters and service meshes, to data storage and access pathways for highly sensitive model weights and user data.</p>\n<p><strong>In this role, you will:</strong></p>\n<ul>\n<li>Design and build security controls across diverse layers (e.g., physical hardware, firmware/BMC, OS, Kubernetes, networks, and CI/CD) to defend against sophisticated adversaries and insider threats.</li>\n<li>Collaborate with engineering and security teams to drive deployment of security enhancements and control changes across broad-scale infrastructure.</li>\n<li>Tackle high-impact projects such as checkpoint encryption, network isolation, secret management, and machine identity, while continuously raising the security bar for emerging AI workloads.</li>\n<li>Take a generalist approach to building security controls, balancing a mix of security expertise and broad technical skillsets to adapt to evolving challenges.</li>\n</ul>\n<p><strong>You will thrive in this role if you have:</strong></p>\n<ul>\n<li>Deep understanding of security principles, best practices, and common vulnerabilities.</li>\n<li>A proactive mindset, with the ability to identify and address security gaps or inefficiencies through automation and tooling.</li>\n<li>A track record of delivering scalable solutions and driving impactful changes across infrastructure in real-world 
projects.</li>\n<li>Expertise in the security of cloud platforms (e.g., Amazon AWS, Microsoft Azure), especially securing multi-cloud networks and infrastructure, and designing cloud agnostic systems.</li>\n<li>Experience securing on-prem deployments and datacenters from construction to multi-tenant use.</li>\n<li>Familiarity with container security, orchestration security, and authentication/authorization.</li>\n<li>Strong analytical and problem-solving skills, with an ability to think critically and objectively assess security risks.</li>\n<li>Excellent communication skills, with the ability to convey complex security concepts to technical and non-technical stakeholders.</li>\n<li>Excitement about collaborating with cross-functional teams to build secure, reliable systems that scale globally.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. 
AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_cbb7e2e4-4bc","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/f51f750f-a737-4441-8f96-30133a2a8049","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$230K – $385K","x-skills-required":["security principles","best practices","common vulnerabilities","cloud platforms","Amazon AWS","Microsoft Azure","container security","orchestration security","authentication/authorization","Kubernetes","service meshes","data storage","access pathways","firmware","BMC","OS","networks","CI/CD"],"x-skills-preferred":["security expertise","broad technical skillsets","cloud agnostic systems","on-prem deployments","datacenters","multi-tenant use","strong analytical skills","problem-solving skills","critical thinking","objectively assess security risks"],"datePosted":"2026-03-06T18:33:14.263Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote - US; New York City; San Francisco; Seattle"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"security principles, best practices, common vulnerabilities, cloud platforms, Amazon AWS, Microsoft Azure, container security, orchestration security, authentication/authorization, Kubernetes, service meshes, data storage, access pathways, firmware, BMC, OS, networks, CI/CD, security expertise, broad 
technical skillsets, cloud agnostic systems, on-prem deployments, datacenters, multi-tenant use, strong analytical skills, problem-solving skills, critical thinking, objectively assess security risks","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":230000,"maxValue":385000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_14dd5de2-4dc"},"title":"Software Engineer, Infrastructure Security","description":"<p><strong>Software Engineer, Infrastructure Security</strong></p>\n<p><strong>Location</strong></p>\n<p>Remote - US; New York City; San Francisco; Seattle</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Location Type</strong></p>\n<p>Remote</p>\n<p><strong>Department</strong></p>\n<p>Security</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>SF, Seattle or NYC $230K – $385K • Offers Equity</li>\n<li>Zone A $207K – $346.5K • Offers Equity</li>\n<li>Zone B $184K – $308K • Offers Equity</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. 
In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n<li>401(k) retirement plan with employer match</li>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n<li>Mental health and wellness support</li>\n<li>Employer-paid basic life and disability coverage</li>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n<li>Relocation support for eligible employees</li>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p><strong>About the Team</strong></p>\n<p>Security is at the foundation of OpenAI’s mission to ensure that artificial general intelligence benefits all of humanity.</p>\n<p>The Security team protects OpenAI’s technology, people, and products. We are technical in what we build but operational in how we execute, and we support every product and research effort at OpenAI. 
Our tenets include prioritizing for impact, enabling researchers and developers, preparing for future transformative technologies, and fostering a strong, collaborative security culture.</p>\n<p><strong>About the Role</strong></p>\n<p>OpenAI is seeking a Security Software Engineer to join the Infrastructure Security (InfraSec) team.</p>\n<p>InfraSec safeguards the core of OpenAI’s research and production environments—GPU supercomputing clusters, multi-cloud infrastructure, datacenters, networking, storage, and the critical services that power our frontier AI models. Our charter spans everything from bare-metal hardware and firmware to Kubernetes clusters, service meshes, and the data pathways that carry highly sensitive model weights and user data.</p>\n<p>As a Security Software Engineer, you will design and build critical foundational services, such as authentication systems, egress/ingress proxies, access brokers, and key management platforms, that demand high standards of reliability, scalability, and software craftsmanship. 
These systems form the security backbone of OpenAI’s supercomputing environment and must remain robust under intense scale and adversarial pressure.</p>\n<p><strong>In this role, you will:</strong></p>\n<ul>\n<li>Architect and implement production-grade security services (e.g., auth services, access brokers, secure proxies, key-management infrastructure) that provide strong guarantees across hardware, operating systems, Kubernetes, networks, and CI/CD.</li>\n<li>Partner with infrastructure and research engineers to embed security into high-performance compute clusters, enabling rapid model training and deployment without compromising protection.</li>\n<li>Develop automation and detection tooling to continuously identify and mitigate risks in large-scale cloud and on-prem environments.</li>\n<li>Drive high-impact initiatives such as line-speed encryption, machine identity, and network isolation, continuously raising the security bar for emerging AI workloads.</li>\n<li>Lead or participate in design reviews and threat models to ensure new systems launch with strong security foundations and operational excellence.</li>\n</ul>\n<p><strong>You will thrive in this role if you have:</strong></p>\n<ul>\n<li>Strong software engineering skills in languages such as Python, Go, Rust, or C/C++, with a track record of shipping and operating high-reliability distributed services.</li>\n<li>Experience building or operating critical security infrastructure (e.g., auth services, service-to-service proxies, certificate or key-management systems).</li>\n<li>Deep understanding of security principles, best practices, and common vulnerabilities.</li>\n<li>Expertise in securing large-scale cloud platforms (e.g., Azure, AWS, GCP), including multi-cloud networks and cloud-agnostic system design.</li>\n<li>Familiarity with container and orchestration security (Kubernetes, service meshes) and modern authentication/authorization standards (OIDC, mTLS, SPIFFE/SPIRE).</li>\n<li>A proactive 
mindset, with the ability to identify and address security gaps or inefficiencies through automation and tooling.</li>\n<li>A track record of delivering scalable solutions and driving impactful changes across infrastructure in real-world projects.</li>\n<li>Strong analytical and problem-solving skills, with an ability to think critically and objectively assess security risks.</li>\n<li>Excellent communication skills, with the ability to convey complex security concepts to technical and non-technical stakeholders.</li>\n<li>Excitement about collaborating</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_14dd5de2-4dc","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/98ad9beb-4f91-496c-bd16-ac0b2a8d5bb2","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$230K – $385K","x-skills-required":["Python","Go","Rust","C/C++","Kubernetes","Service meshes","OIDC","mTLS","SPIFFE/SPIRE","Cloud security","Container security","Orchestration security","Authentication","Authorization","Security principles","Best practices","Common vulnerabilities"],"x-skills-preferred":["Cloud platforms","Multi-cloud networks","Cloud-agnostic system design","Automation","Detection tooling","Line-speed encryption","Machine identity","Network isolation"],"datePosted":"2026-03-06T18:29:27.261Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote - US; New York City; San Francisco; Seattle"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Go, Rust, C/C++, Kubernetes, Service meshes, OIDC, mTLS, SPIFFE/SPIRE, Cloud security, Container 
security, Orchestration security, Authentication, Authorization, Security principles, Best practices, Common vulnerabilities, Cloud platforms, Multi-cloud networks, Cloud-agnostic system design, Automation, Detection tooling, Line-speed encryption, Machine identity, Network isolation","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":230000,"maxValue":385000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_3f16d353-491"},"title":"Software Engineer, Infrastructure Reliability","description":"<p><strong>Software Engineer, Infrastructure Reliability</strong></p>\n<p><strong>Location</strong></p>\n<p>San Francisco</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p>Applied AI</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>$255K – $385K</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. 
In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p><strong>About the Team</strong></p>\n<p>We’re hiring Software Engineers to join our Applied Infrastructure organization, and more specifically for our Database Systems and Online Storage teams. 
These teams operate with a high degree of autonomy and are deeply collaborative, with a shared mandate to raise the bar on safety, reliability, and velocity across OpenAI.</p>\n<p><strong>About the Role</strong></p>\n<p>You’ll be at the heart of scaling and hardening the infrastructure that powers some of the most widely used AI systems in the world. You’ll help ensure our systems are highly reliable, observable, performant, and secure—so researchers can iterate quickly, and products like ChatGPT and the OpenAI API can serve millions of users safely and effectively.</p>\n<p>This is a hands-on, high-leverage role for engineers who thrive on ownership, love solving deep technical problems across the stack, and want to work on systems that support cutting-edge research and deploy at global scale. You’ll play a key part in shaping technical direction, proactively improving system resilience, and collaborating closely with infra, product, and research teams to turn complex infrastructure into reliable platforms.</p>\n<p><strong>In this role you will:</strong></p>\n<ul>\n<li>Design, build, and operate reliable and performant systems used across engineering.</li>\n</ul>\n<ul>\n<li>Identify and fix performance bottlenecks and inefficiencies, ensuring our infrastructure can scale to the next order of magnitude.</li>\n</ul>\n<ul>\n<li>Dig deep to resolve complex issues.</li>\n</ul>\n<ul>\n<li>Continuously improve automation to reduce manual work. Improve internal tooling and our developer experience.</li>\n</ul>\n<ul>\n<li>Contribute to incident response, postmortems, and the development of best practices around system reliability and scalability.</li>\n</ul>\n<p><strong>You might thrive in this role if you:</strong></p>\n<ul>\n<li>Have a deep understanding of distributed systems principles and a proven track record in building and operating scalable and reliable systems.</li>\n</ul>\n<ul>\n<li>Have a keen eye for performance and optimization. 
You know how to squeeze the most performance out of complex, globally-distributed systems.</li>\n</ul>\n<ul>\n<li>Have experience operating orchestration systems such as Kubernetes at scale and building abstractions over cloud platforms</li>\n</ul>\n<ul>\n<li>Are comfortable working in Linux environments, and with tools like Kubernetes, Terraform, CI/CD pipelines, and modern observability stacks.</li>\n</ul>\n<ul>\n<li>Are experienced in collaborating with cross-functional teams to ensure that reliability and scalability are considered in the design and development of new features and services.</li>\n</ul>\n<ul>\n<li>Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed.</li>\n</ul>\n<ul>\n<li>Own problems end-to-end, and are willing to pick up whatever knowledge you&#39;re missing to get the job done.</li>\n</ul>\n<ul>\n<li>Are comfortable with ambiguity and rapid change.</li>\n</ul>\n<p><strong>Qualifications:</strong></p>\n<ul>\n<li>4+ years of relevant industry experience, with 2+ years leading large scale, complex projects or teams as an engineer or tech lead</li>\n</ul>\n<ul>\n<li>A passion for distributed systems at scale with a focus on reliability, scalability, security, and continuous improvement.</li>\n</ul>\n<ul>\n<li>Proven experience as a reliability engineer, production engineer, or a similar role in a fast-paced, rapidly scaling company.</li>\n</ul>\n<ul>\n<li>Strong proficiency in cloud infrastructure (like AWS, GCP, Azure) and IaC tools such as Terraform. 
Proficiency in programming / scripting languages.</li>\n</ul>\n<ul>\n<li>Experience with containerization technologies and container orchestration platforms like Kubernetes.</li>\n</ul>\n<ul>\n<li>Experience with observability tools such as Datadog, Prometheus, Grafana, Splunk and ELK stack.</li>\n</ul>\n<ul>\n<li>Experience with microservices architecture and service mesh technologies.</li>\n</ul>\n<ul>\n<li>Knowledge of security best practices in cloud environments.</li>\n</ul>\n<ul>\n<li>Strong understanding of distributed systems, networking, and database technologies.</li>\n</ul>\n<ul>\n<li>Excellent problem-solving skills and ability to work in a fast-paced environment.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company that aims to develop and apply general-purpose technologies to align with human values.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_3f16d353-491","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/779b340d-e645-4da1-a923-b3070a26d936","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$255K – $385K","x-skills-required":["cloud infrastructure","IaC tools","programming/scripting languages","containerization technologies","container orchestration platforms","observability tools","microservices architecture","service mesh technologies","security best practices","distributed systems","networking","database technologies"],"x-skills-preferred":["Kubernetes","Terraform","Datadog","Prometheus","Grafana","Splunk","ELK stack"],"datePosted":"2026-03-06T18:24:50.552Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San 
Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"cloud infrastructure, IaC tools, programming/scripting languages, containerization technologies, container orchestration platforms, observability tools, microservices architecture, service mesh technologies, security best practices, distributed systems, networking, database technologies, Kubernetes, Terraform, Datadog, Prometheus, Grafana, Splunk, ELK stack","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":255000,"maxValue":385000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_32d33889-c44"},"title":"Software Engineer, Caching Infrastructure","description":"<p><strong>Software Engineer, Caching Infrastructure</strong></p>\n<p><strong>Location</strong></p>\n<p>San Francisco</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p>Applied AI</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>$230K – $385K • Offers Equity</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. 
In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p>More details about our benefits are available to candidates during the hiring process.</p>\n<p>This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.</p>\n<p><strong><strong>About the Team</strong></strong></p>\n<p>At OpenAI, we’re building safe and beneficial 
artificial general intelligence. We deploy our models through ChatGPT, our APIs, and other cutting-edge products. Behind the scenes, making these systems fast, reliable, and cost-efficient requires world-class infrastructure.</p>\n<p>The Caching Infrastructure team is responsible for building a caching layer that powers many critical use cases at OpenAI. We aim to provide a high-availability, multi-tenant cache platform that scales automatically with workload, minimizes tail latency, and supports a diverse range of use cases.</p>\n<p>We’re looking for an experienced engineer to help design and scale this critical infrastructure. The ideal candidate has deep experience in distributed caching systems (e.g., Redis, Memcached), networking fundamentals, and Kubernetes-based service orchestration.</p>\n<p><strong><strong>In This Role, You Will:</strong></strong></p>\n<ul>\n<li>Design, build, and operate OpenAI’s multi-tenant caching platform used across inference, identity, quota, and product experiences.</li>\n</ul>\n<ul>\n<li>Define the long-term vision and roadmap for caching as a core infra capability, balancing performance, durability, and cost.</li>\n</ul>\n<ul>\n<li>Collaborate with other infra teams (e.g., networking, observability, databases) and product teams to ensure our caching platform meets their needs.</li>\n</ul>\n<p><strong><strong>You Might Thrive In This Role If You:</strong></strong></p>\n<ul>\n<li>Have 5+ years of experience building and scaling distributed systems, with a strong focus on caching, load balancing, or storage systems.</li>\n</ul>\n<ul>\n<li>Have deep expertise with Redis, Memcached, or similar solutions, including clustering, durability configurations, client-side connection patterns, and performance tuning.</li>\n</ul>\n<ul>\n<li>Have production experience with Kubernetes, service meshes (e.g., Envoy), and autoscaling systems.</li>\n</ul>\n<ul>\n<li>Think rigorously about latency, reliability, throughput, and cost in designing 
platform capabilities.</li>\n</ul>\n<ul>\n<li>Thrive in a fast-paced environment and enjoy balancing pragmatic engineering with long-term technical excellence.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_32d33889-c44","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/a20b7fc6-6f01-4618-ba35-37b40083f93e","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$230K – $385K • Offers Equity","x-skills-required":["distributed caching systems","Redis","Memcached","Kubernetes","service meshes","autoscaling systems"],"x-skills-preferred":["clustering","durability configurations","client-side connection patterns","performance tuning"],"datePosted":"2026-03-06T18:24:00.812Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"distributed caching systems, Redis, Memcached, Kubernetes, service meshes, autoscaling systems, clustering, durability configurations, client-side connection patterns, performance 
tuning","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":230000,"maxValue":385000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_67dcf42f-2dc"},"title":"Engineering Manager ChatGPT Infra","description":"<p><strong>Engineering Manager ChatGPT Infra</strong></p>\n<p><strong>Location</strong></p>\n<p>London, UK</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p>Applied AI</p>\n<p><strong><strong>About the Team:</strong></strong></p>\n<p>The ChatGPT Infrastructure team is responsible for the platform that powers ChatGPT, one of the fastest-growing consumer products in history. We build, scale, and operate the infrastructure that enables rapid experimentation, reliable deployment, and global delivery of AI-powered experiences. As we expand our global footprint, we’re investing in establishing a leadership presence in London to help shape our growing office and drive collaboration across OpenAI’s international teams.</p>\n<p><strong><strong>About the Role:</strong></strong></p>\n<p>We’re looking for an experienced Engineering Manager to lead the ChatGPT Infra team from our London office. In this dual role, you’ll be both a technical leader and the site lead for our London engineering hub. 
You’ll be responsible for building and mentoring a world-class infra team, helping to scale ChatGPT infrastructure, and fostering a strong, inclusive engineering culture at our growing international site.</p>\n<p>You will:</p>\n<ul>\n<li>Lead a team of infrastructure engineers focused on availability, scalability, and performance for ChatGPT.</li>\n</ul>\n<ul>\n<li>Collaborate closely with product and research teams to deliver a seamless and robust experience to millions of users.</li>\n</ul>\n<ul>\n<li>Define and drive technical strategy for key components such as deployment pipelines, service mesh, observability, and CI/CD systems.</li>\n</ul>\n<ul>\n<li>Partner with recruiting to grow the London engineering team and represent OpenAI in the local tech community.</li>\n</ul>\n<ul>\n<li>Serve as a cultural ambassador and people manager, supporting cross-functional collaboration and site operations.</li>\n</ul>\n<ul>\n<li>Operate with a high degree of autonomy and ownership, with support from global leaders and peers.</li>\n</ul>\n<p><strong><strong>Qualifications:</strong></strong></p>\n<ul>\n<li>7+ years of hands-on engineering experience, ideally in high-scale systems, distributed computing, or developer platforms.</li>\n</ul>\n<ul>\n<li>Demonstrated success in leading cross-functional projects and collaborating across product, infra, and research orgs.</li>\n</ul>\n<ul>\n<li>Passion for building strong, inclusive teams and mentoring engineers of all experience levels.</li>\n</ul>\n<ul>\n<li>Experience operating production services in cloud environments (e.g., AWS, GCP, Azure).</li>\n</ul>\n<ul>\n<li>Comfortable wearing multiple hats — from deep technical discussions to team planning and office leadership.</li>\n</ul>\n<ul>\n<li>Based in or willing to relocate to London.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. 
We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_67dcf42f-2dc","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/5a4ba7cb-4ba2-41d3-8e02-840617a0f571","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["high-scale systems","distributed computing","developer platforms","cloud environments","AWS","GCP","Azure","deployment pipelines","service mesh","observability","CI/CD systems"],"x-skills-preferred":["leadership","team management","cross-functional collaboration","site operations"],"datePosted":"2026-03-06T18:20:48.510Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London, UK"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"high-scale systems, distributed computing, developer platforms, cloud environments, AWS, GCP, Azure, deployment pipelines, service mesh, observability, CI/CD systems, leadership, team management, cross-functional collaboration, site operations"}]}