{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/load-balancers"},"x-facet":{"type":"skill","slug":"load-balancers","display":"Load Balancers","count":12},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ac45e205-e7d"},"title":"Engineering Manager, Inference Routing and Performance","description":"<p><strong>About the role\\nEvery request that hits Claude , from claude.ai, the API, our cloud partners, or internal research , passes through a routing decision. Not a generic load balancer round-robin, but a decision that accounts for what&#39;s already cached where, which accelerator the request runs best on, and what else is in flight across the fleet.\\n\\nGet it right and you extract meaningfully more throughput from the same hardware. Get it wrong and you burn capacity, miss latency SLOs, or shed load that shouldn&#39;t have been shed.\\n\\nThe Inference Routing team owns this layer. We build the cluster-level routing and coordination plane for Anthropic&#39;s inference fleet , the system that sits between the API surface and the inference engines themselves, making fleet-wide efficiency decisions in real time.\\n\\nAs Anthropic moves from &quot;many independent inference replicas&quot; toward &quot;a single warehouse-scale computer running a coordinated program,&quot; Dystro is the coordination layer. This is a deeply technical team.\\n\\nThe engineers here design custom load-balancing algorithms, build quantitative models of system performance, debug latency spikes that cross kernel, network, and framework boundaries, and reason carefully about cache placement across thousands of accelerators.\\n\\nThey work shoulder-to-shoulder with teams that write kernels and ML framework internals.\\n\\nThe EM for this team doesn&#39;t need to write kernels , but they do need the systems depth to make architectural calls, evaluate deeply technical candidates, and spot when a proposed optimization will have second-order effects on the fleet.\\n\\nYou&#39;ll inherit a strong team of distributed-systems engineers, and you&#39;ll be accountable for two things that pull in different directions: shipping system-level performance improvements that measurably increase fleet throughput and efficiency, and running the team operationally so that deploys are safe, incidents are rare, and the teams who depend on Dystro can plan around you with confidence.\\n\\nThe job is holding both.\\n\\n## Representative work:\\nThings the Inference Routing EM actually spends time on:\\n- Deciding whether a proposed routing algorithm change is worth the deploy risk, given the modeled throughput gain and the blast radius if it regresses\\n- Sequencing a quarter where KV-cache offload, a new coordination protocol, and two model launches all compete for the same engineers\\n- Working through a persistent tail-latency regression with the team , walking down from fleet-level metrics to per-replica behavior to a root cause in the networking stack\\n- Building the case (with numbers) to peer teams for why a cross-team protocol change unlocks the next efficiency win\\n- Running the post-incident review after a cache-eviction bug caused a capacity event, and turning it into process changes that stick\\n- Interviewing a candidate who has built schedulers at supercomputing scale, and deciding whether they&#39;d be additive to a team that already goes deep\\n\\n## What you&#39;ll do:\\nDrive system-level performance\\n- Own the technical roadmap for cluster-level inference efficiency , routing decisions, cache placement and eviction, cross-replica coordination, and the protocols that keep routing and inference engines in sync\\n- Partner with the inference engine, kernels, and performance teams to identify fleet-level throughput and latency wins, then turn those into shipped improvements with measurable results\\n- Build the team&#39;s habit of quantitative performance modeling: claim a win only when you can measure it, and know before you ship what the expected effect is\\n\\nDeliver reliably and operate cleanly\\n- Set technical strategy for how routing evolves across heterogeneous hardware (GPUs, TPUs, Trainium) and across all our serving surfaces\\n- Run the team&#39;s operational backbone , on-call rotation, incident response, postmortem review, deploy safety , so the team can ship aggressively without the system becoming fragile\\n- Create clarity at a seam: Inference Routing sits between the API surface, the inference engines, and the cloud deployment teams. You&#39;ll make sure commitments are realistic, dependencies are understood, and nobody is surprised\\n\\nBuild and grow the team\\n- Develop and retain a strong existing team, and hire against the bar described above: people who can go to the OS and framework level when the problem demands it, and who care about production reliability\\n- Coach engineers through a roadmap where priorities shift with model launches, new hardware, and scaling demands. We pair a lot here , you&#39;ll help make that collaboration pattern productive\\n- Pick up slack when it matters. This is a small team in a critical path; sometimes the EM is the one unblocking a stuck deploy or synthesizing a design debate\\n\\n## You may be a good fit if you:\\n- Have 5+ years of engineering management experience, ideally with at least part of that leading teams on critical-path production infrastructure at scale\\n- Have a deep systems background , load balancing, scheduling, cache-coherent distributed state, high-performance networking, or similar. You need enough depth to make architectural calls about routing and efficiency, and to evaluate candidates who go to the kernel and framework level\\n- Have shipped performance improvements in large-scale systems and can explain, with numbers, what the impact was\\n- Have run production infrastructure with real operational stakes: on-call, incident response, capacity events, deploy discipline\\n- Are results-oriented with a bias toward impact, and comfortable working in a space where throughput, latency, stability, and feature velocity all pull in different directions\\n- Build strong relationships across team boundaries , this is a seam role, and much of the job is making sure other teams can rely on yours\\n- Are curious about machine learning systems. You don&#39;t need an ML research background, but you should want to learn how transformer inference actually works and how that shapes the systems problems\\n\\nStrong candidates may also have:\\n- Experience with LLM inference serving , KV caching, continuous batching, request scheduling, prefill/decode disaggregation\\n- Background in cluster schedulers, load balancers, service meshes, or coordination planes at scale\\n- Familiarity with heterogeneous accelerator fleets (GPU/TPU/Trainium) and how hardware differences affect workload placement\\n- Experience with GPU/accelerator programming, ML framework internals, or OS-level performance debugging , enough to follow and evaluate the technical work, not necessarily to do it daily\\n- Led teams at supercomputing or hyperscaler infrastructure scale\\n- Led teams through rapid-growth periods where hiring and onboarding competed with roadmap delivery\\n\\nThe annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings (&quot;OTE&quot;) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.\\nAnnual Salary: $405,000-$485,000 USD</strong></p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ac45e205-e7d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5155391008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$405,000-$485,000 USD","x-skills-required":["engineering management","distributed systems","load balancing","scheduling","cache-coherent distributed state","high-performance networking","machine learning systems"],"x-skills-preferred":["LLM inference serving","cluster schedulers","load balancers","service meshes","coordination planes","heterogeneous accelerator fleets","GPU/TPU/Trainium","GPU/accelerator programming","ML framework internals","OS-level performance debugging"],"datePosted":"2026-04-18T15:56:48.587Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"engineering management, distributed systems, load balancing, scheduling, cache-coherent distributed state, high-performance networking, machine learning systems, LLM inference serving, cluster schedulers, load balancers, service meshes, coordination planes, heterogeneous accelerator fleets, GPU/TPU/Trainium, GPU/accelerator programming, ML framework internals, OS-level performance debugging","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":405000,"maxValue":485000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_5cad560f-dc3"},"title":"Engineering Manager, Cloud Networking (Brazil)","description":"<p>You will join Airbnb&#39;s mission-driven company dedicated to helping create a world where anyone can belong anywhere. As the first Network engineering lead in Airbnb&#39;s Brazil office, you will be responsible for bootstrapping and growing the networking team in our new San Paulo office.</p>\n<p>Your primary focus will be on delivering an Airbnb network platform that is flexible, efficient, always available, and scales with the needs of the business. You will work closely with peers across Cloud Infra, Security, Reliability, and many other partner teams across the company to achieve this goal.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Providing meaningful input to technical designs and direct hands-on contributions to projects in the cloud networking space</li>\n<li>Growing, leading, and managing a small team of talented engineers</li>\n<li>Supporting your team&#39;s professional growth and maintaining high performance through mentorship and coaching</li>\n<li>Working with tech leads, peers, and partners to define and execute on a coherent vision and roadmap for Airbnb&#39;s cloud network infrastructure and related components</li>\n<li>Working with open source communities (e.g. istio) to build the next generation service mesh for all Airbnb back-end services</li>\n<li>Building cross-region gateways and load balancers for global Airbnb services</li>\n<li>Working with external partners and internal engineering and security teams to deliver edge security systems that protect Airbnb services</li>\n<li>Nurturing a culture of technical quality from design, through code review, to production</li>\n<li>Building strong partnership and alignment with teams across engineering</li>\n<li>Nurturing relationships with open source communities and external service partners</li>\n</ul>\n<p>As a successful candidate, you will have a strong background in engineering management, with 2+ years of experience and 8+ years of relevant software development experience in a fast-paced tech environment. You will also have experience with a public cloud provider (AWS, GCP, Azure) and their networking service offerings, as well as experience running large-scale networking systems and software (e.g. proxies, DNS, gateways).</p>\n<p>Additionally, you will have excellent communication skills and the ability to work well with teams across the engineering organization (e.g. reliability, compute, security, etc.). You will also have strong problem-solving skills and experience leading teams on-call for production infrastructure.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_5cad560f-dc3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Airbnb","sameAs":"https://www.airbnb.com/","logo":"https://logos.yubhub.co/airbnb.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/airbnb/jobs/7381450","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Professional fluency in English","2+ years of engineering management experience","8+ years of relevant software development experience in a fast-paced tech environment","Experience with a public cloud provider (AWS, GCP, Azure) and their networking service offerings","Experience running large-scale networking systems and software (e.g. proxies, DNS, gateways)","Experience with Istio service mesh, k8s and cloud native technologies","Excellent communication skills and the ability to work well with teams across the engineering organization","Strong problem-solving skills and experience leading teams on-call for production infrastructure"],"x-skills-preferred":["Experience with open source communities (e.g. istio)","Experience building cross-region gateways and load balancers for global services","Experience working with external partners and internal engineering and security teams to deliver edge security systems"],"datePosted":"2026-04-18T15:55:03.519Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Brazil"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Professional fluency in English, 2+ years of engineering management experience, 8+ years of relevant software development experience in a fast-paced tech environment, Experience with a public cloud provider (AWS, GCP, Azure) and their networking service offerings, Experience running large-scale networking systems and software (e.g. proxies, DNS, gateways), Experience with Istio service mesh, k8s and cloud native technologies, Excellent communication skills and the ability to work well with teams across the engineering organization, Strong problem-solving skills and experience leading teams on-call for production infrastructure, Experience with open source communities (e.g. istio), Experience building cross-region gateways and load balancers for global services, Experience working with external partners and internal engineering and security teams to deliver edge security systems"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_127228e6-c1a"},"title":"Senior Support Engineer - Korean Speaking","description":"<p>We&#39;re seeking a Senior Support Engineer to join our Support team in South Korea. As a Senior Support Engineer, you will provide expert-level service to our APJ customers, ensuring technical customer issues are serviced within our contractual SLA and managed to resolution.</p>\n<p>You will document and share your knowledge with the rest of the organization and our customers using Knowledge Centered-Services (KCS) methodology. You will also have a mindset of continuous improvement, in terms of efficiency of support processes and customer satisfaction.</p>\n<p>To be successful in this role, you will need to work across multi-cultural and geographically distributed teams. You will have 3+ years of proven experience in Technical Support in a Software business, a technical background in fields like Information Technology, Network Engineering, Software Engineering, and a &#39;Customer First&#39; mindset.</p>\n<p>You will be a team player, able to work in a fast-paced environment with a positive and adaptable approach. You will have knowledge of databases (SQL / No SQL) or search software technologies, experience with SaaS and/or Distributed systems, experience with Linux/Unix, experience with APIs, familiarity with Knowledge Centered-Services (KCS), and highly collaborative.</p>\n<p>Native Korean language skills and professional working proficiency in English are required, as well as effective verbal and written communication skills.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_127228e6-c1a","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Elastic","sameAs":"https://www.elastic.co/","logo":"https://logos.yubhub.co/elastic.co.png"},"x-apply-url":"https://job-boards.greenhouse.io/elastic/jobs/7712961","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Technical Support","Knowledge Centered-Services (KCS)","Databases (SQL / No SQL)","Search software technologies","SaaS and/or Distributed systems","Linux/Unix","APIs","Native Korean language skills","Professional working proficiency in English"],"x-skills-preferred":["Experience with administering and/or troubleshooting Elastic products in a production environment","Experience with Networking and/or Load Balancers","Experience with Kubernetes","Experience with Message Brokering (e.g. Kafka)"],"datePosted":"2026-04-18T15:52:25.490Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"South Korea"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Support, Knowledge Centered-Services (KCS), Databases (SQL / No SQL), Search software technologies, SaaS and/or Distributed systems, Linux/Unix, APIs, Native Korean language skills, Professional working proficiency in English, Experience with administering and/or troubleshooting Elastic products in a production environment, Experience with Networking and/or Load Balancers, Experience with Kubernetes, Experience with Message Brokering (e.g. Kafka)"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_16dd7ebd-23f"},"title":"Staff Product Manager, Networking","description":"<p>This role sits within CoreWeave&#39;s Product organization, focused on building and scaling advanced networking capabilities that power AI, machine learning, and high-performance computing workloads.</p>\n<p>As a Staff Product Manager, Networking, you will own the strategy and roadmap for CoreWeave&#39;s advanced networking product portfolio. On a day-to-day basis, you will translate market insights, customer needs, and technical constraints into clear product requirements and execution plans. You will work closely with cross-functional partners to launch new products and evolve existing offerings, ensuring they meet CoreWeave&#39;s high standards for performance, scalability, and reliability.</p>\n<p>This is a highly visible role with significant influence over the future of CoreWeave&#39;s networking platform.</p>\n<p>CoreWeave is a rapidly growing company that prioritizes innovation and disruption. We believe in investing in our people and value candidates who can bring their own diversified experiences to our teams.</p>\n<p>If you love defining product strategy in technically complex, fast-evolving domains, are curious about emerging networking technologies, and are an expert at turning market insights and customer needs into scalable, high-impact products, then we&#39;d love to talk.</p>\n<p>At CoreWeave, we work hard, have fun, and move fast. We&#39;re in an exciting stage of hyper-growth that you will not want to miss. We&#39;re not afraid of a little chaos, and we&#39;re constantly learning. Our team cares deeply about how we build our product and how we work together, which is reflected in our core values:</p>\n<ul>\n<li>Be Curious at Your Core</li>\n<li>Act Like an Owner</li>\n<li>Empower Employees</li>\n<li>Deliver Best-in-Class Client Experiences</li>\n<li>Achieve More Together</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_16dd7ebd-23f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4642612006","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$188,000 to $275,000","x-skills-required":["product management","networking","infrastructure","distributed systems","VPCs","load balancers","HPC networking","Direct Connect–style solutions"],"x-skills-preferred":["building or scaling networking products for cloud, hyperscale, or high-performance computing environments","background working closely with infrastructure or platform engineering teams","advanced degree or specialized coursework in networking or distributed systems"],"datePosted":"2026-04-18T15:51:56.472Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bellevue, WA/  Livingston, NJ /  New York, NY /  San Francisco, CA/   Sunnyvale, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"product management, networking, infrastructure, distributed systems, VPCs, load balancers, HPC networking, Direct Connect–style solutions, building or scaling networking products for cloud, hyperscale, or high-performance computing environments, background working closely with infrastructure or platform engineering teams, advanced degree or specialized coursework in networking or distributed systems","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":188000,"maxValue":275000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_7df1b7d2-b71"},"title":"Network Engineer - Edge","description":"<p>About xAI</p>\n<p>xAI&#39;s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.</p>\n<p><strong>About the Role</strong></p>\n<p>Grok and X are powered largely from our own on-premise infrastructure which enables us to move at speed and efficiency when deploying vast amounts of capacity. People wouldn&#39;t be able to enjoy participating in the townhall on X or use Grok to understand the universe if it weren&#39;t for our Edge networking infrastructure.</p>\n<p>We are seeking two senior engineers that help architect, develop, and build our peering and transit infrastructure with associated routing policies, eDNS, cloud connectivity, CGNAT, and load balancer fleets. The two successful candidates will have great Python skills to remove repetitive engineering cycles and auto-mitigate customer impacts.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Architect, develop, and maintain peering and transit infrastructure, including routing policies, eDNS, cloud connectivity, CGNAT, and load balancer fleets.</li>\n</ul>\n<ul>\n<li>Leverage Python scripting to automate repetitive engineering tasks and proactively mitigate customer impacts.</li>\n</ul>\n<ul>\n<li>Manage and troubleshoot DNS infrastructure to ensure reliable performance.</li>\n</ul>\n<ul>\n<li>Oversee and resolve issues related to cloud VPCs and connected network hardware.</li>\n</ul>\n<ul>\n<li>Diagnose and resolve complex TCP/IP issues to maintain seamless network operations.</li>\n</ul>\n<ul>\n<li>Collaborate with cross-functional teams to enhance infrastructure efficiency and support xAI&#39;s AI platforms.</li>\n</ul>\n<p><strong>Required Qualifications</strong></p>\n<ul>\n<li>7+ years of experience with edge network hardware, including load balancers, CGNAT, routers, and switches.</li>\n</ul>\n<ul>\n<li>7+ years of routing experience in backbones, peering, and transit areas with expertise in traffic engineering.</li>\n</ul>\n<ul>\n<li>5+ years of experience using Python scripting to automate deployments and break/fix tasks.</li>\n</ul>\n<ul>\n<li>3+ years of experience managing DNS infrastructure.</li>\n</ul>\n<ul>\n<li>3+ years of experience managing and troubleshooting cloud VPCs and connected network hardware.</li>\n</ul>\n<ul>\n<li>Proven ability to troubleshoot complex TCP/IP issues.</li>\n</ul>\n<p><strong>Preferred Qualifications</strong></p>\n<ul>\n<li>Experience with A10 Networks, NGINX, or open-source load balancer/CGNAT software.</li>\n</ul>\n<ul>\n<li>Familiarity with Route53 and UltraDNS.</li>\n</ul>\n<ul>\n<li>Expertise in GCP, AWS, and OCI VPC architecture and troubleshooting.</li>\n</ul>\n<ul>\n<li>Knowledge of Kubernetes Ingress.</li>\n</ul>\n<ul>\n<li>Experience with CDN, Fastly, and Cloudfront.</li>\n</ul>\n<ul>\n<li>Demonstrated success in on-call rotations and incident response in high-stakes environments.</li>\n</ul>\n<ul>\n<li>Strong problem-solving skills and adaptability in a fast-paced, ambiguous setting.</li>\n</ul>\n<p><strong>Annual Base Salary</strong></p>\n<p>$180,000 - $440,000 USD</p>\n<p><strong>Benefits</strong></p>\n<p>Base salary is just one part of our total rewards package at X, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short &amp; long-term disability insurance, life insurance, and various other discounts and perks.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_7df1b7d2-b71","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/4950906007","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$180,000 - $440,000 USD","x-skills-required":["edge network hardware","load balancers","CGNAT","routers","switches","routing","traffic engineering","Python scripting","DNS infrastructure","cloud VPCs","TCP/IP issues"],"x-skills-preferred":["A10 Networks","NGINX","Route53","UltraDNS","GCP","AWS","OCI","Kubernetes Ingress","CDN","Fastly","Cloudfront"],"datePosted":"2026-04-18T15:48:41.822Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Palo Alto, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"edge network hardware, load balancers, CGNAT, routers, switches, routing, traffic engineering, Python scripting, DNS infrastructure, cloud VPCs, TCP/IP issues, A10 Networks, NGINX, Route53, UltraDNS, GCP, AWS, OCI, Kubernetes Ingress, CDN, Fastly, Cloudfront","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":180000,"maxValue":440000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_509502c4-bc6"},"title":"Network Engineer - Edge","description":"<p><strong>About the Role</strong></p>\n<p>Grok and X are powered largely from our own on-premise infrastructure which enables us to move at speed and efficiency when deploying vast amounts of capacity. People wouldn’t be able to enjoy participating in the townhall on X or use Grok to understand the universe if it weren’t for our Edge networking infrastructure.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Architect, develop, and maintain peering and transit infrastructure, including routing policies, eDNS, cloud connectivity, CGNAT, and load balancer fleets.</li>\n</ul>\n<ul>\n<li>Leverage Python scripting to automate repetitive engineering tasks and proactively mitigate customer impacts.</li>\n</ul>\n<ul>\n<li>Manage and troubleshoot DNS infrastructure to ensure reliable performance.</li>\n</ul>\n<ul>\n<li>Oversee and resolve issues related to cloud VPCs and connected network hardware.</li>\n</ul>\n<ul>\n<li>Diagnose and resolve complex TCP/IP issues to maintain seamless network operations.</li>\n</ul>\n<ul>\n<li>Collaborate with cross-functional teams to enhance infrastructure efficiency and support xAI’s AI platforms.</li>\n</ul>\n<p><strong>Required Qualifications</strong></p>\n<ul>\n<li>7+ years of experience with edge network hardware, including load balancers, CGNAT, routers, and switches.</li>\n</ul>\n<ul>\n<li>7+ years of routing experience in backbones, peering, and transit areas with expertise in traffic engineering.</li>\n</ul>\n<ul>\n<li>5+ years of experience using Python scripting to automate deployments and break/fix tasks.</li>\n</ul>\n<ul>\n<li>3+ years of experience managing DNS infrastructure.</li>\n</ul>\n<ul>\n<li>3+ years of experience managing and troubleshooting cloud VPCs and connected network hardware.</li>\n</ul>\n<ul>\n<li>Proven ability to troubleshoot complex TCP/IP issues.</li>\n</ul>\n<p><strong>Preferred Qualifications</strong></p>\n<ul>\n<li>Experience with A10 Networks, NGINX, or open-source load balancer/CGNAT software.</li>\n</ul>\n<ul>\n<li>Familiarity with Route53 and UltraDNS.</li>\n</ul>\n<ul>\n<li>Expertise in GCP, AWS, and OCI VPC architecture and troubleshooting.</li>\n</ul>\n<ul>\n<li>Knowledge of Kubernetes Ingress.</li>\n</ul>\n<ul>\n<li>Experience with CDN, Fastly, and Cloudfront.</li>\n</ul>\n<ul>\n<li>Demonstrated success in on-call rotations and incident response in high-stakes environments.</li>\n</ul>\n<ul>\n<li>Strong problem-solving skills and adaptability in a fast-paced, ambiguous setting.</li>\n</ul>\n<p><strong>Annual Base Salary</strong></p>\n<p>$180,000 - $440,000 USD</p>\n<p><strong>Benefits</strong></p>\n<p>Base salary is just one part of our total rewards package at X, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short &amp; long-term disability insurance, life insurance, and various other discounts and perks.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_509502c4-bc6","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/4950947007","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$180,000 - $440,000 USD","x-skills-required":["edge network hardware","load balancers","CGNAT","routers","switches","routing","traffic engineering","Python scripting","DNS infrastructure","cloud VPCs","TCP/IP"],"x-skills-preferred":["A10 Networks","NGINX","Route53","UltraDNS","GCP","AWS","OCI","Kubernetes Ingress","CDN","Fastly","Cloudfront"],"datePosted":"2026-04-18T15:48:00.617Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"edge network hardware, load balancers, CGNAT, routers, switches, routing, traffic engineering, Python scripting, DNS infrastructure, cloud VPCs, TCP/IP, A10 Networks, NGINX, Route53, UltraDNS, GCP, AWS, OCI, Kubernetes Ingress, CDN, Fastly, Cloudfront","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":180000,"maxValue":440000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_54d670f7-e78"},"title":"Senior Delivery Success Manager","description":"<p>We are looking for a Senior Delivery Success Manager to join our team. As a Senior Delivery Success Manager, you will be responsible for leading and developing Okta partners to successfully deliver Okta services to our valued customers.</p>\n<p>Reporting to the Manager of Partner Delivery Success, this role will be responsible for developing partner delivery capability within a defined product domain and regional partner portfolio. This role ensures service delivery partners are prepared to successfully deploy Okta solutions through capability development, go-live validation, and delivery oversight.</p>\n<p>Key Responsibilities:</p>\n<ul>\n<li>Develop delivery capability across the assigned partner portfolio.</li>\n<li>Guide partners through the Service Delivery Partner maturity model.</li>\n<li>Identify specialization gaps and recommend enablement paths.</li>\n<li>Support partners pursuing product specializations (OWI, Auth0, Workflows, etc.).</li>\n<li>Encourage partner certification and skill development.</li>\n<li>Review and validate partner go-live submissions.</li>\n<li>Classify deployments according to Standard, Advanced, and Strategic complexity levels.</li>\n<li>Ensure deployments meet Okta delivery standards.</li>\n<li>Capture delivery feedback from Professional Services and customers.</li>\n<li>Serve as the delivery advisor for sales teams when identifying partners for deployments.</li>\n<li>Recommend partners based on capability, specialization, and deployment complexity.</li>\n<li>Assist in identifying partners capable of supporting complex customer architectures.</li>\n<li>Track ecosystem capability trends across assigned partners.</li>\n<li>Identify gaps in partner expertise or geographic coverage.</li>\n<li>Provide recommendations for partner enablement and recruitment.</li>\n<li>Partner closely with Professional Services on partner delivery quality feedback and co-delivery planning.</li>\n<li>Partner closely with Alliances to support relationship management and service delivery partner recruitment.</li>\n<li>Drive and track partner adoption to post-sales enablement.</li>\n</ul>\n<p>What you’ll bring to the role:</p>\n<ul>\n<li>7+ years of experience in enterprise software delivery, professional services, or partner success roles.</li>\n<li>Experience working with system integrators or service delivery partners.</li>\n<li>Experience supporting enterprise software implementations.</li>\n<li>Strong understanding of identity and access management concepts.</li>\n<li>Familiarity with enterprise SaaS deployment models.</li>\n<li>Understanding of partner ecosystems and implementation services.</li>\n<li>Ability to guide partners through complex technology implementations.</li>\n<li>Strong cross-functional collaboration skills.</li>\n<li>Ability to evaluate partner capabilities and recommend improvement strategies.</li>\n<li>Excellent communication and stakeholder management skills.</li>\n<li>Ability to work with technical teams, sales teams, and executive stakeholders.</li>\n</ul>\n<p>And extra credit if you have experience in any of the following!</p>\n<ul>\n<li>Familiar with several programming languages (e.g., .NET, JavaScript, Python, Java).</li>\n<li>Familiar with open-source tools and development practices.</li>\n<li>Familiar with load balancers, reverse proxies, and Web Access Management tech (e.g., F5, Oracle, NGINX).</li>\n<li>Familiar with AI GPTs</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_54d670f7-e78","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Okta","sameAs":"https://www.okta.com","logo":"https://logos.yubhub.co/okta.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/okta/jobs/7779032","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$103,000-$140,000 CAD","x-skills-required":["enterprise software delivery","professional services","partner success","identity and access management","enterprise SaaS deployment models","partner ecosystems","implementation services","cross-functional collaboration","communication","stakeholder management"],"x-skills-preferred":[".NET","JavaScript","Python","Java","open-source tools","development practices","load balancers","reverse proxies","Web Access Management","AI GPTs"],"datePosted":"2026-04-18T15:45:05.013Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Toronto, Ontario, Canada"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"enterprise software delivery, professional services, partner success, identity and access management, enterprise SaaS deployment models, partner ecosystems, implementation services, cross-functional collaboration, communication, stakeholder management, .NET, JavaScript, Python, Java, open-source tools, development practices, load balancers, reverse proxies, Web Access Management, AI GPTs","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":103000,"maxValue":140000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_63af8568-789"},"title":"Engineering Manager, Inference Routing and Performance","description":"<p><strong>About the role\\nEvery request that hits Claude , from claude.ai, the API, our cloud partners, or internal research , passes through a routing decision. Not a generic load balancer round-robin, but a decision that accounts for what&#39;s already cached where, which accelerator the request runs best on, and what else is in flight across the fleet.\\n\\nGet it right and you extract meaningfully more throughput from the same hardware. Get it wrong and you burn capacity, miss latency SLOs, or shed load that shouldn&#39;t have been shed.\\n\\nThe Inference Routing team owns this layer. We build the cluster-level routing and coordination plane for Anthropic&#39;s inference fleet , the system that sits between the API surface and the inference engines themselves, making fleet-wide efficiency decisions in real time.\\n\\nAs Anthropic moves from &quot;many independent inference replicas&quot; toward &quot;a single warehouse-scale computer running a coordinated program,&quot; Dystro is the coordination layer. This is a deeply technical team.\\n\\nThe engineers here design custom load-balancing algorithms, build quantitative models of system performance, debug latency spikes that cross kernel, network, and framework boundaries, and reason carefully about cache placement across thousands of accelerators.\\n\\nThey work shoulder-to-shoulder with teams that write kernels and ML framework internals.\\n\\nThe EM for this team doesn&#39;t need to write kernels , but they do need the systems depth to make architectural calls, evaluate deeply technical candidates, and spot when a proposed optimization will have second-order effects on the fleet.\\n\\nYou&#39;ll inherit a strong team of distributed-systems engineers, and you&#39;ll be accountable for two things that pull in different directions: shipping system-level performance improvements that measurably increase fleet throughput and efficiency, and running the team operationally so that deploys are safe, incidents are rare, and the teams who depend on Dystro can plan around you with confidence.\\n\\nThe job is holding both.\\n\\n## Representative work:\\nThings the Inference Routing EM actually spends time on:\\n- Deciding whether a proposed routing algorithm change is worth the deploy risk, given the modeled throughput gain and the blast radius if it regresses\\n- Sequencing a quarter where KV-cache offload, a new coordination protocol, and two model launches all compete for the same engineers\\n- Working through a persistent tail-latency regression with the team , walking down from fleet-level metrics to per-replica behavior to a root cause in the networking stack\\n- Building the case (with numbers) to peer teams for why a cross-team protocol change unlocks the next efficiency win\\n- Running the post-incident review after a cache-eviction bug caused a capacity event, and turning it into process changes that stick\\n- Interviewing a candidate who has built schedulers at supercomputing scale, and deciding whether they&#39;d be additive to a team that already goes deep\\n\\n## What you&#39;ll do:\\nDrive system-level performance\\n- Own the technical roadmap for cluster-level inference efficiency , routing decisions, cache placement and eviction, cross-replica coordination, and the protocols that keep routing and inference engines in sync\\n- Partner with the inference engine, kernels, and performance teams to identify fleet-level throughput and latency wins, then turn those into shipped improvements with measurable results\\n- Build the team&#39;s habit of quantitative performance modeling: claim a win only when you can measure it, and know before you ship what the expected effect is\\n\\nDeliver reliably and operate cleanly\\n- Set technical strategy for how routing evolves across heterogeneous hardware (GPUs, TPUs, Trainium) and across all our serving surfaces\\n- Run the team&#39;s operational backbone , on-call rotation, incident response, postmortem review, deploy safety , so the team can ship aggressively without the system becoming fragile\\n- Create clarity at a seam: Inference Routing sits between the API surface, the inference engines, and the cloud deployment teams. You&#39;ll make sure commitments are realistic, dependencies are understood, and nobody is surprised\\n\\nBuild and grow the team\\n- Develop and retain a strong existing team, and hire against the bar described above: people who can go to the OS and framework level when the problem demands it, and who care about production reliability\\n- Coach engineers through a roadmap where priorities shift with model launches, new hardware, and scaling demands. We pair a lot here , you&#39;ll help make that collaboration pattern productive\\n- Pick up slack when it matters. This is a small team in a critical path; sometimes the EM is the one unblocking a stuck deploy or synthesizing a design debate\\n\\n## You may be a good fit if you:\\n- Have 5+ years of engineering management experience, ideally with at least part of that leading teams on critical-path production infrastructure at scale\\n- Have a deep systems background , load balancing, scheduling, cache-coherent distributed state, high-performance networking, or similar. You need enough depth to make architectural calls about routing and efficiency, and to evaluate candidates who go to the kernel and framework level\\n- Have shipped performance improvements in large-scale systems and can explain, with numbers, what the impact was\\n- Have run production infrastructure with real operational stakes: on-call, incident response, capacity events, deploy discipline\\n- Are results-oriented with a bias toward impact, and comfortable working in a space where throughput, latency, stability, and feature velocity all pull in different directions\\n- Build strong relationships across team boundaries , this is a seam role, and much of the job is making sure other teams can rely on yours\\n- Are curious about machine learning systems. You don&#39;t need an ML research background, but you should want to learn how transformer inference actually works and how that shapes the systems problems\\n\\nStrong candidates may also have:\\n- Experience with LLM inference serving , KV caching, continuous batching, request scheduling, prefill/decode disaggregation\\n- Background in cluster schedulers, load balancers, service meshes, or coordination planes at scale\\n- Familiarity with heterogeneous accelerator fleets (GPU/TPU/Trainium) and how hardware differences affect workload placement\\n- Experience with GPU/accelerator programming, ML framework internals, or OS-level performance debugging , enough to follow and evaluate the technical work, not necessarily to do it daily\\n- Led teams at supercomputing or hyperscaler infrastructure scale\\n- Led teams through rapid-growth periods where hiring and onboarding competed with roadmap delivery\\n\\nThe annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings (&quot;OTE&quot;) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.\\nAnnual Salary: $405,000-$485,000 USD</strong></p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_63af8568-789","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5155391008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$405,000-$485,000 USD","x-skills-required":["engineering management","deep systems background","load balancing","scheduling","cache-coherent distributed state","high-performance networking"],"x-skills-preferred":["LLM inference serving","cluster schedulers","load balancers","service meshes","coordination planes","heterogeneous accelerator fleets","GPU/TPU/Trainium","GPU/accelerator programming","ML framework internals","OS-level performance debugging"],"datePosted":"2026-04-18T15:37:38.038Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"engineering management, deep systems background, load balancing, scheduling, cache-coherent distributed state, high-performance networking, LLM inference serving, cluster schedulers, load balancers, service meshes, coordination planes, heterogeneous accelerator fleets, GPU/TPU/Trainium, GPU/accelerator programming, ML framework internals, OS-level performance debugging","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":405000,"maxValue":485000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_8980bea0-e13"},"title":"Senior Software Engineer, Java - Network Team","description":"<p>We are revolutionizing the way large networks are managed. Our Forward Enterprise platform delivers a vendor-agnostic &#39;digital twin&#39; of the network, based on a mathematical model. The platform scales to support hundreds of thousands of network devices, whether cloud, hybrid cloud, or on-prem. It serves as a single source of truth for the network, enabling network operators to instantly verify security posture, accelerate troubleshooting, avoid outages, and modernize network management.</p>\n<p>Our team is currently seeking experienced Java developers to work as part of our Network team. As a senior software engineer, you will help bring the best ideas from the software development world into the networking industry.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Contribute to our code base, systems and software architecture as a member of our engineering team.</li>\n<li>Help create and optimize network device models for different device vendors and protocols.</li>\n<li>Help create infrastructure needed to configure, collect and test network devices.</li>\n<li>Work with peers who are experts in Networking, Distributed Systems, Big Data and Search.</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>5+ years of work experience in software development</li>\n<li>3+ years of work experience with Java</li>\n<li>BS in Computer Science or related degree</li>\n<li>Solid software engineering experience with large code bases</li>\n<li>Basic understanding of networking and TCP/IP.</li>\n<li>Strong verbal and written communication skills.</li>\n</ul>\n<p>Nice to have:</p>\n<ul>\n<li>Working knowledge of how switches, routers, firewalls or load balancers work.</li>\n<li>Experience working with networking protocols such as BGP/OSPF/IS-IS, IPv4/IPv6, MPLS, VLAN, VXLAN, etc.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_8980bea0-e13","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Forward Networks","sameAs":"https://www.forward.net/","logo":"https://logos.yubhub.co/forward.net.png"},"x-apply-url":"https://job-boards.greenhouse.io/forwardnetworks/jobs/5967053003","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Java","networking","TCP/IP","software engineering","large code bases"],"x-skills-preferred":["switches","routers","firewalls","load balancers","BGP/OSPF/IS-IS","IPv4/IPv6","MPLS","VLAN","VXLAN"],"datePosted":"2026-04-17T12:36:15.038Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bengaluru, India"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Java, networking, TCP/IP, software engineering, large code bases, switches, routers, firewalls, load balancers, BGP/OSPF/IS-IS, IPv4/IPv6, MPLS, VLAN, VXLAN"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_70806a42-556"},"title":"Senior Support Engineer","description":"<p><strong>Senior Support Engineer - Dublin</strong></p>\n<p><strong>Location</strong></p>\n<p>Dublin, Ireland</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p><strong>About the Team</strong></p>\n<p>The Technical Support team is responsible for ensuring that developers and enterprises can reliably build mission critical solutions using OpenAI models. We provide technical guidance, resolve complex issues and support customers in maximizing value and adoption from deploying our highly-capable models. We work closely with Technical Success, Product, Engineering and others to deliver the best possible experience to our customers at scale. We think from an automation-first mindset and leverage the latest in AI to scale our support operations. Join the Senior Support Engineering (SSE) team at OpenAI and help shape the future of Technical Support in the age of AI.</p>\n<p><strong>About the Role</strong></p>\n<p>We are looking for a Senior Support Engineer to collaborate directly with our strategic enterprise accounts and product teams, helping solve some of the most difficult problems faced by our Customers. You will be part of the best technical troubleshooting team at OpenAI, and our Customers and Engineering teams will look to you for technical guidance in addressing the most technically difficult issues in our environment.</p>\n<p>As a Senior Support Engineer, you will design and run operational processes to monitor our top strategic customers and a 24x7 response team. You’ll work closely with our Infrastructure and Engineering teams to deliver the best possible experience to customers at scale. Working directly with our most strategic Customers - You will be crucial to the success of the most innovative, disruptive, and high-scale AI solutions being built with the OpenAI API platform.</p>\n<p>The nature of this role will be low volume, high difficulty.</p>\n<p>This role is based in Dublin, Ireland. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.</p>\n<p><strong>In this role, you will:</strong></p>\n<ul>\n<li>Be among the foremost technical and troubleshooting experts for our API platform at OpenAI. You are the last line of defense before the core Engineering team.</li>\n</ul>\n<ul>\n<li>Proactively identify and implement opportunities to scale support operations by leveraging automation and advancements in AI technologies. Contribute to shaping the future of technical support in an AI-driven era.</li>\n</ul>\n<ul>\n<li>Configure and use advanced monitoring and alerting workflows to proactively detect customer impacting issues in real time.</li>\n</ul>\n<ul>\n<li>In partnership with engineering, contribute to reliability reviews and preparedness for new features, launches, or strategic customer requirement updates. Ensure that operational readiness (monitoring, alerting, and fallback plans) is in place for any such changes.</li>\n</ul>\n<ul>\n<li>Design and refine incident response processes and documentation across strategic customers, engineering and support teams.</li>\n</ul>\n<ul>\n<li>Analyze operational metrics and incident RCAs to identify areas for improvement. Proactively recommend and implement enhancements to monitoring dashboards, alert configurations, and support workflows.</li>\n</ul>\n<ul>\n<li>Provide support coverage during holidays and weekends based on business needs.</li>\n</ul>\n<p><strong>You might thrive in this role if you:</strong></p>\n<ul>\n<li>Have a Bachelor’s degree in Computer Science or a related field. A strong software engineering foundation is important for this role’s success.</li>\n</ul>\n<ul>\n<li>Have 5+ years of experience in technical operations roles such as SRE/NOC, designing monitoring systems and resolving production issues in fast-paced and mission-critical environments. A strong track record of troubleshooting complex technical problems at the systems level.</li>\n</ul>\n<ul>\n<li>Have deep familiarity with modern monitoring, alerting, and observability practices. Hands‑on experience setting up or managing metrics, logging, and tracing for distributed systems (e.g., understanding of SLIs/SLOs, alert tuning, dashboard creation).</li>\n</ul>\n<ul>\n<li>Have proven experience leading incident response for high‑severity outages or service disruptions. Able to perform real‑time incident coordination, root cause analysis, and drive follow‑ups (post‑mortems, action items) to prevent recurrence. Knowledge of industry best practices for incident management and fault diagnosis.</li>\n</ul>\n<ul>\n<li>Have strong skills in scripting or software engineering (e.g., Python or similar) to automate repetitive tasks and integrate tools.</li>\n</ul>\n<ul>\n<li>Have solid understanding of cloud infrastructure and distributed systems fundamentals. Comfortable working with cloud services, load balancers, databases, and containerized applications.</li>\n</ul>\n<ul>\n<li>Are effective at working cross‑functionally in a high‑trust environment. Strong communication skills to explain technical issues and resolutions to both engineering and non‑technical stakeholders. You can coordinate efforts across teams and are comfortable providing updates in the midst of an ongoing incident.</li>\n</ul>\n<p><strong>Compensation, Benefits and Perks</strong></p>\n<p>This is a position with OpenAI Ireland Ltd., which controls the hiring and management of this position.</p>\n<p>Total compensation includes an annual salary, generous equity, and benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>PRSA plan with 8% employer matching</li>\n</ul>\n<ul>\n<li>Unlimited time off</li>\n</ul>\n<ul>\n<li>Annual learning &amp; development stipend ($1,500 USD equivalent per year)</li>\n</ul>\n<p>#LI-NM2</p>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_70806a42-556","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/988016e1-de50-42be-925a-438b97291c5d","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Python","Cloud infrastructure","Distributed systems","Monitoring and alerting","Observability","Scripting","Software engineering","Cloud services","Load balancers","Databases","Containerized applications"],"x-skills-preferred":["SLIs/SLOs","Alert tuning","Dashboard creation","Incident management","Fault diagnosis","Cross-functional collaboration","Communication"],"datePosted":"2026-03-06T18:36:57.231Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Dublin"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Cloud infrastructure, Distributed systems, Monitoring and alerting, Observability, Scripting, Software engineering, Cloud services, Load balancers, Databases, Containerized applications, SLIs/SLOs, Alert tuning, Dashboard creation, Incident management, Fault diagnosis, Cross-functional collaboration, Communication"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_e38e0353-95c"},"title":"Senior Support Engineer","description":"<p><strong>Senior Support Engineer - Tokyo</strong></p>\n<p><strong>Location</strong></p>\n<p>Tokyo, Japan</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p><strong>About the Team</strong></p>\n<p>The Technical Support team is responsible for ensuring that developers and enterprises can reliably build mission critical solutions using OpenAI models. We provide technical guidance, resolve complex issues and support customers in maximizing value and adoption from deploying our highly-capable models. We work closely with Technical Success, Product, Engineering and others to deliver the best possible experience to our customers at scale. We think from an automation-first mindset and leverage the latest in AI to scale our support operations. Join the Senior Support Engineering (SSE) team at OpenAI and help shape the future of Technical Support in the age of AI.</p>\n<p><strong>About the Role</strong></p>\n<p>We are looking for a Senior Support Engineer to collaborate directly with our strategic enterprise accounts and product teams, helping solve some of the most difficult problems faced by our Customers. You will be part of the best technical troubleshooting team at OpenAI, and our Customers and Engineering teams will look to you for technical guidance in addressing the most technically difficult issues in our environment.</p>\n<p>As a Senior Support Engineer, you will design and run operational processes to monitor our top strategic customers and a 24x7 response team. You’ll work closely with our Infrastructure and Engineering teams to deliver the best possible experience to customers at scale. Working directly with our most strategic Customers - You will be crucial to the success of the most innovative, disruptive, and high-scale AI solutions being built with the OpenAI API platform.</p>\n<p>The nature of this role will be low volume, high difficulty.</p>\n<p>This role is based in Tokyo, Japan. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.</p>\n<p><strong>In this role, you will:</strong></p>\n<ul>\n<li>Be among the foremost technical and troubleshooting experts for our API platform at OpenAI. You are the last line of defense before the core Engineering team.</li>\n</ul>\n<ul>\n<li>Proactively identify and implement opportunities to scale support operations by leveraging automation and advancements in AI technologies. Contribute to shaping the future of technical support in an AI-driven era.</li>\n</ul>\n<ul>\n<li>Configure and use advanced monitoring and alerting workflows to proactively detect customer impacting issues in real time.</li>\n</ul>\n<ul>\n<li>In partnership with engineering, contribute to reliability reviews and preparedness for new features, launches, or strategic customer requirement updates. Ensure that operational readiness (monitoring, alerting, and fallback plans) is in place for any such changes.</li>\n</ul>\n<ul>\n<li>Design and refine incident response processes and documentation across strategic customers, engineering and support teams.</li>\n</ul>\n<ul>\n<li>Analyze operational metrics and incident RCAs to identify areas for improvement. Proactively recommend and implement enhancements to monitoring dashboards, alert configurations, and support workflows.</li>\n</ul>\n<ul>\n<li>Provide support coverage during holidays and weekends based on business needs.</li>\n</ul>\n<p><strong>You might thrive in this role if you:</strong></p>\n<ul>\n<li>Have a Bachelor’s degree in Computer Science or a related field. A strong software engineering foundation is important for this role’s success.</li>\n</ul>\n<ul>\n<li>Have 8+ years of experience in technical operations roles such as SRE/NOC, designing monitoring systems and resolving production issues in fast-paced and mission-critical environments. A strong track record of troubleshooting complex technical problems at the systems level.</li>\n</ul>\n<ul>\n<li>Have deep familiarity with modern monitoring, alerting, and observability practices. Hands‑on experience setting up or managing metrics, logging, and tracing for distributed systems (e.g., understanding of SLIs/SLOs, alert tuning, dashboard creation).</li>\n</ul>\n<ul>\n<li>Have proven experience leading incident response for high‑severity outages or service disruptions. Able to perform real‑time incident coordination, root cause analysis, and drive follow‑ups (post‑mortems, action items) to prevent recurrence. Knowledge of industry best practices for incident management and fault diagnosis.</li>\n</ul>\n<ul>\n<li>Have strong skills in scripting or software engineering (e.g., Python or similar) to automate repetitive tasks and integrate tools.</li>\n</ul>\n<ul>\n<li>Have solid understanding of cloud infrastructure and distributed systems fundamentals. Comfortable working with cloud services, load balancers, databases, and containerized applications.</li>\n</ul>\n<ul>\n<li>Are effective at working cross‑functionally in a high‑trust environment. Strong communication skills to explain technical issues and resolutions to both engineering and non‑technical stakeholders. You can coordinate efforts across teams and are comfortable providing updates in the midst of an ongoing incident.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_e38e0353-95c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/b2fd550d-3e04-434e-bb91-c5b7bc8ac8b7","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Python","Cloud infrastructure","Distributed systems","Monitoring and alerting","Observability","Scripting","Software engineering","Cloud services","Load balancers","Databases","Containerized applications"],"x-skills-preferred":["Automation","AI technologies","Incident response","Reliability reviews","Post-mortems","Action items","Cross-functional collaboration","Communication","Technical writing"],"datePosted":"2026-03-06T18:36:56.708Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Tokyo, Japan"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Cloud infrastructure, Distributed systems, Monitoring and alerting, Observability, Scripting, Software engineering, Cloud services, Load balancers, Databases, Containerized applications, Automation, AI technologies, Incident response, Reliability reviews, Post-mortems, Action items, Cross-functional collaboration, Communication, Technical writing"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ff45eb13-34b"},"title":"Senior Cloud Architect","description":"<p>As a Senior Cloud Architect, you will design, build, and evolve Azure cloud architectures aligned with business, security, and compliance requirements. You will define and maintain Azure landing zones, governance models, and cloud standards (RBAC, policies, naming, tagging). You will lead cloud modernization and migration initiatives (on prem → Azure, hybrid, multi region).</p>\n<p><strong>What you&#39;ll do</strong></p>\n<ul>\n<li>Design, build, and evolve Azure cloud architectures aligned with business, security, and compliance requirements</li>\n<li>Define and maintain Azure landing zones, governance models, and cloud standards (RBAC, policies, naming, tagging)</li>\n</ul>\n<p><strong>What you need</strong></p>\n<ul>\n<li>Proven experience as a Cloud / Azure Architect in enterprise environments</li>\n<li>Deep hands-on knowledge of Microsoft Azure, including Azure Virtual Networks, ExpressRoute/VPN, Load Balancers, Azure Compute (VM, App Services, AKS), Azure Storage, SQL / PaaS data services, Azure Entra ID (Azure AD), identity, and access management</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ff45eb13-34b","directApply":true,"hiringOrganization":{"@type":"Organization","name":"MHP - A Porsche Company","sameAs":"https://www.mhp.com/","logo":"https://logos.yubhub.co/mhp.com.png"},"x-apply-url":"https://jobs.porsche.com/index.php?ac=jobad&id=19939","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Proven experience as a Cloud / Azure Architect in enterprise environments","Deep hands-on knowledge of Microsoft Azure, including Azure Virtual Networks, ExpressRoute/VPN, Load Balancers, Azure Compute (VM, App Services, AKS), Azure Storage, SQL / PaaS data services, Azure Entra ID (Azure AD), identity, and access management"],"x-skills-preferred":[],"datePosted":"2026-03-04T14:08:33.697Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bucharest, Cluj, Timisoara"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Proven experience as a Cloud / Azure Architect in enterprise environments, Deep hands-on knowledge of Microsoft Azure, including Azure Virtual Networks, ExpressRoute/VPN, Load Balancers, Azure Compute (VM, App Services, AKS), Azure Storage, SQL / PaaS data services, Azure Entra ID (Azure AD), identity, and access management"}]}