Senior Production Engineer

3b419874-946 Senior Production Engineer Production Engineering ensures CoreWeave's cloud delivers world-class reliability, performance, and operational excellence. We are hiring a Senior Production Engineer to take direct, hands-on ownership of critical tooling that drives reliability and delivery success.

In this role, you will work broadly across the cloud stack designing, implementing, deploying, and operating systems that improve delivery velocity, service availability, and operational safety. You’ll be responsible for leading end-to-end technical projects, maintaining long-lived systems the team owns, and strengthening our operational foundations through durable engineering investments.

This is a role for someone who enjoys building, debugging, and operating production systems. You will collaborate closely with service owners, but your primary impact comes from the reliability, quality, and maturity of the systems you deliver and maintain over time.

Responsibilities

Take hands-on ownership of critical systems and frameworks, driving their architecture, implementation, and long-term evolution.
Lead end-to-end delivery of engineering projects that improve availability, scalability, operational automation, and failure recovery.
Build and maintain observability, alerting, automated remediation, and resilience testing for the systems you support.
Participate in incident response as a subject-matter expert; drive deep root-cause investigations and implement lasting fixes.
Improve runbooks, sources of truth, deployment workflows, and operational tooling to harden production readiness.
Eliminate single points of failure and reduce operational toil through automation, refactors, and system redesigns.
Ship production code regularly in Python, Go, or similar languages, and participate in on-call rotations.
Maintain and mature long-term projects and frameworks owned by the team, ensuring they remain reliable, well-instrumented, and easy to operate.
Collaborate with platform teams to ensure new features and services integrate cleanly with our reliability best-practices and tooling.

What You’ve Worked On (Minimum Qualifications)

7+ years of engineering experience building and operating distributed systems or cloud platforms.
Demonstrated ability to debug complex production issues end-to-end, across services, infrastructure layers, and automation.
Strong programming or scripting ability (Python, Go, or similar), with experience shipping and operating production services and tools.
Deep knowledge of cloud-native technologies and distributed system patterns, particularly Kubernetes.
Experience with modern observability stacks: metrics, tracing, structured logs, SLOs/SLIs, and incident lifecycle practices.
A track record of successfully delivering hands-on reliability improvements through engineering execution.

Preferred Qualifications

Experience building internal tooling, frameworks, or automation that supports high-availability cloud operations.
Familiarity with DR/BCP, service tiering, capacity planning, or chaos engineering.
Background operating or building large-scale AI or GPU-accelerated infrastructure.
Experience maintaining multi-year ownership of foundational production systems.

Why CoreWeave

At CoreWeave, we work hard, have fun, and move fast. You’ll join a team that values curiosity, ownership, and creative problem-solving. Production Engineering sits at the intersection of reliability and AI infrastructure, building systems that enable the world’s most powerful AI cloud.

Core Values

Be Curious at Your Core
Act Like an Owner
Empower Employees
Deliver Best-in-Class Client Experiences
Achieve More Together

We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and enables the development of innovative solutions to complex problems. As we get set for takeoff, the organization's growth opportunities are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!

Compensation

The base salary range for this role is 160,000 to 214,000 SGD. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).

What We Offer

The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

XML job scraping automation by YubHub

]]> full-time senior hybrid 160,000 to 214,000 SGD cloud computing, distributed systems, Kubernetes, observability stacks, metrics, tracing, structured logs, SLOs/SLIs, incident lifecycle practices, Python, Go, engineering experience, internal tooling, frameworks, automation, DR/BCP, service tiering, capacity planning, chaos engineering, large-scale AI, GPU-accelerated infrastructure Engineering Technology CoreWeave https://logos.yubhub.co/coreweave.com.png CoreWeave is a cloud computing company that provides a platform for building and scaling AI applications. https://www.coreweave.com https://job-boards.greenhouse.io/coreweave/jobs/4675297006?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply Singapore 2026-04-24 c0569537-539 Staff Backend Engineer, Gitlab Delivery: Upgrades As a Staff Engineer on the GitLab Delivery - Upgrades team, you'll guide the technical direction for GitLab's self-managed deployment strategy so customers can deploy, upgrade, and run GitLab reliably in their own infrastructure with minimal disruption.

You'll serve as a technical anchor for the team, working closely with your engineering manager, product manager, and partners across Site Reliability Engineering, Release, Security, and Development to shape cloud-native, operator-driven deployment patterns that reduce operational complexity and upgrade friction.

In your first year, you'll help define the architecture for zero-downtime upgrades, strengthen observability and reliability practices, and guide the next generation of deployment automation for self-managed GitLab environments.

Some examples of our projects:

Evolving GitLab Operator and Helm charts to support zero-downtime upgrades for complex, stateful GitLab installations

Advancing the GitLab Environment Toolkit to simplify large-scale, production-ready self-managed deployments

Responsibilities

Guide the technical vision and architecture for GitLab's cloud-native, self-managed deployments and upgrade workflows.

Establish operational maturity standards, service integration patterns, and deployment models that help development teams manage the lifecycle of their components.

Design and maintain Kubernetes Operators, Helm charts, and upgrade orchestration tooling for self-managed GitLab deployments across varied environments.

Develop automation and integration frameworks for database migrations, rolling deployments, compatibility checks, and rollback paths.

Define database and application lifecycle strategies, including safe PostgreSQL migration approaches and validation mechanisms that reduce downtime risk.

Work with Product Management, GitLab.com Site Reliability Engineering, GitLab Dedicated, and development teams to align deployment patterns with customer needs.

Mentor engineers and enable customer-facing teams through design reviews, code reviews, documentation, and runbooks.

Drive observability, testing, performance, and resilience practices for self-managed deployments, and contribute to incident response and post-incident learning.

Requirements

Strong software engineering experience designing and delivering production systems that customers install and operate in their own infrastructure.

Proficiency in Go for large, complex codebases, with familiarity with Ruby on Rails and Rails application architecture as a useful addition.

Hands-on experience with Kubernetes in production, including building and maintaining Operators, designing Helm charts for stateful applications, and working with Custom Resource Definitions, admission controllers, and controller patterns.

Knowledge of cloud-native systems and tooling, such as service mesh, observability stacks, infrastructure as code, and automation tools like Terraform or Ansible.

Experience with stateful workloads and databases, including PostgreSQL schema design and migrations, persistent volumes, storage classes, and approaches for reducing downtime during upgrades.

Understanding of Linux systems and production operations, including package management, systemd, system-level debugging, observability, incident response, and on-call participation.

Ability to guide through influence, including writing clear technical proposals, documenting decisions, mentoring engineers, and working effectively across teams.

Interest in open source infrastructure or deployment tooling, or transferable experience from adjacent domains, with the ability to explain technical concepts clearly to different audiences.

About the Team

The Delivery - Upgrades team sits within GitLab Delivery and focuses on delivering GitLab to self-managed users through supported, validated deployment tooling. We own and evolve the GitLab Omnibus package, Helm charts, GitLab Operator, and the GitLab Environment Toolkit, and we work asynchronously across regions with partners in Site Reliability Engineering, Release, Security, and Development.

Our work centers on enabling zero-downtime upgrades, reducing operational complexity at scale, supporting GitLab’s cloud-native transition while continuing to serve existing deployments, and improving the upgrade experience for customers running GitLab in diverse environments.

For more on how we work, see [Link: Team Handbook Page].

XML job scraping automation by YubHub

]]> full-time staff remote Go, Ruby on Rails, Kubernetes, Cloud-native systems, Service mesh, Observability stacks, Infrastructure as code, Automation tools, Linux systems, Production operations, Package management, Systemd, System-level debugging, Incident response, On-call participation Engineering Technology GitLab https://logos.yubhub.co/about.gitlab.com.png GitLab is a software development platform that provides tools for version control, issue tracking, and project management. With over 50 million registered users and more than 50% of the Fortune 100 trusting GitLab, it is a large and established company. https://about.gitlab.com/ https://job-boards.greenhouse.io/gitlab/jobs/8463922002?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply Remote, India 2026-04-18 15a29cc3-0bf Senior Production Engineer CORPORATION

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025.

About the Role

Production Engineering ensures CoreWeave’s cloud delivers world-class reliability, performance, and operational excellence. We are hiring a Senior Production Engineer to take direct, hands-on ownership of critical tooling that drives reliability and delivery success.

What You’ll Do

Take hands-on ownership of critical systems and frameworks, driving their architecture, implementation, and long-term evolution.

Lead end-to-end delivery of engineering projects that improve availability, scalability, operational automation, and failure recovery.

Build and maintain observability, alerting, automated remediation, and resilience testing for the systems you support.

Participate in incident response as a subject-matter expert; drive deep root-cause investigations and implement lasting fixes.

Improve runbooks, sources of truth, deployment workflows, and operational tooling to harden production readiness.

Eliminate single points of failure and reduce operational toil through automation, refactors, and system redesigns.

Ship production code regularly in Python, Go, or similar languages, and participate in on-call rotations.

Maintain and mature long-term projects and frameworks owned by the team, ensuring they remain reliable, well-instrumented, and easy to operate.

Collaborate with platform teams to ensure new features and services integrate cleanly with our reliability best-practices and tooling.

What You’ve Worked On (Minimum Qualifications)

7+ years of engineering experience building and operating distributed systems or cloud platforms.

Demonstrated ability to debug complex production issues end-to-end, across services, infrastructure layers, and automation.

Strong programming or scripting ability (Python, Go, or similar), with experience shipping and operating production services and tools.

Deep knowledge of cloud-native technologies and distributed system patterns, particularly Kubernetes.

Experience with modern observability stacks: metrics, tracing, structured logs, SLOs/SLIs, and incident lifecycle practices.

A track record of successfully delivering hands-on reliability improvements through engineering execution.

Preferred Qualifications

Experience building internal tooling, frameworks, or automation that supports high-availability cloud operations.

Familiarity with DR/BCP, service tiering, capacity planning, or chaos engineering.

Background operating or building large-scale AI or GPU-accelerated infrastructure.

Experience maintaining multi-year ownership of foundational production systems.

Why CoreWeave?

At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:

Be Curious at Your Core

Act Like an Owner

Empower Employees

Deliver Best-in-Class Client Experiences

Achieve More Together

The base salary range for this role is $139,000 to $204,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).

What We Offer

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

Medical, dental, and vision insurance - 100% paid for by CoreWeave

Company-paid Life Insurance

Voluntary supplemental life insurance

Short and long-term disability insurance

Flexible Spending Account

Health Savings Account

Tuition Reimbursement

Ability to Participate in Employee Stock Purchase Program (ESPP)

Mental Wellness Benefits through Spring Health

Family-Forming support provided by Carrot

Paid Parental Leave

Flexible, full-service childcare support with Kinside

401(k) with a generous employer match

Flexible PTO

Catered lunch each day in our office and data center locations

A casual work environment

A work culture focused on innovative disruption

Our Workplace

While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.

California Consumer Privacy Act - California applicants only

XML job scraping automation by YubHub

]]> full-time senior hybrid $139,000 to $204,000 cloud computing, distributed systems, cloud platforms, Kubernetes, observability stacks, metrics, tracing, structured logs, SLOs/SLIs, incident lifecycle practices, Python, Go, programming, scripting, production services, tools, internal tooling, frameworks, automation, high-availability cloud operations, DR/BCP, service tiering, capacity planning, chaos engineering, large-scale AI, GPU-accelerated infrastructure Engineering Technology CoreWeave https://logos.yubhub.co/coreweave.com.png CoreWeave is a cloud computing company that provides a platform for building and scaling AI applications. https://www.coreweave.com https://job-boards.greenhouse.io/coreweave/jobs/4670172006?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA 2026-04-18 9701c504-1a6 Senior Software Engineer I, Inference We're looking for a Senior Software Engineer I to join our team. As a senior engineer, you'll lead designs, raise engineering standards, and deliver measurable improvements to latency, throughput, and reliability across multiple services. You'll partner with product, orchestration, and hardware teams to evolve our Kubernetes-native inference platform and meet strict P99 SLAs at scale.

Key responsibilities include:

Lead design reviews and drive architecture within the team; decompose multi-service work into clear milestones.
Define and own SLIs/SLOs; ensure post-incident actions land and reliability improves release-over-release.
Implement advanced optimizations (e.g., micro-batch schedulers, speculative decoding, KV-cache reuse) and quantify impact.
Strengthen incident posture: capacity planning, autoscaling policy, graceful degradation, rollback/traffic-shift strategies.
Mentor IC1/IC2 engineers; review cross-team designs and elevate coding/testing standards.

Requirements include:

3-5 years of industry experience building distributed systems or cloud services.
Strong coding in Python or Go (C++ a plus) and deep familiarity with networked systems and performance.
Hands-on experience with Kubernetes at production scale, CI/CD, and observability stacks (Prometheus, Grafana, OpenTelemetry).
Practical knowledge of inference internals: batching, caching, mixed precision (BF16/FP8), streaming token delivery.
Proven track record improving tail latency (P95/P99) and service reliability through metrics-driven work.

Preferred qualifications include contributions to inference frameworks, experience with CUDA kernels, NCCL/SHARP, RDMA/NUMA, or GPU interconnect topologies, and leading multi-team initiatives or partnering with customers on mission-critical launches.

XML job scraping automation by YubHub

]]> full-time senior hybrid $139,000 to $204,000 Python, Go, Kubernetes, CI/CD, Observability stacks, Inference internals, Batching, Caching, Mixed precision, Streaming token delivery, Contributions to inference frameworks, CUDA kernels, NCCL/SHARP, RDMA/NUMA, GPU interconnect topologies Engineering Technology CoreWeave https://logos.yubhub.co/coreweave.com.png CoreWeave is a cloud computing company that provides a platform for building and scaling AI. It was founded in 2017 and became a publicly traded company in March 2025. https://www.coreweave.com https://job-boards.greenhouse.io/coreweave/jobs/4647603006?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply Sunnyvale, CA / Bellevue, WA 2026-04-18 40d32156-365 Reliability Lead, Common Services As Reliability Lead, Common Services, you will establish and lead the Reliability Engineering and production operations practice for the Common Services organization. You'll partner closely with engineering leaders and teams across Common Services to define how we build, release, monitor, and operate critical services,raising the bar on reliability, availability, and operational excellence across the board.

In this role, you will:

Establish and lead the SRE / production engineering practice for the Common Services organization, including standards for reliability, incident management, and on-call, in partnership with the central Product Engineering organization.
Develop an Operational Excellence strategy that focuses on not only improving system performance but also monitoring and reducing operational toil
Partner with engineering and product teams to define SLOs, SLIs, and error budgets for critical Common Services, and ensure these become part of how teams plan and make tradeoffs.
Own and improve the incident management lifecycle for Common Services, including on-call rotations, escalation paths, incident tooling, post-incident reviews, and follow-through on corrective actions.
Drive the observability strategy (metrics, logs, traces, dashboards, alerts) for Common Services, ensuring we have actionable visibility into the health, performance, and capacity of key systems.
Collaborate with engineering leads to design and review architectures for reliability, scalability, resilience, and operability, including failure modes, redundancy, and graceful degradation.
Lead efforts to automate and harden operational workflows, including deployments, rollbacks, configuration management, change management, and routine maintenance tasks.
Build strong, trust-based relationships with partner teams and stakeholders, becoming a go-to leader for production readiness and operational risk within Common Services.
Hire, mentor, and develop SRE and production engineering talent, fostering a culture of continuous improvement, learning from incidents, and humane on-call.
Partner with other SRE and production engineering leaders across CoreWeave to align on global practices, tools, and reliability goals, representing the needs and constraints of Common Services.

You will be responsible for defining the reliability strategy, processes, and standards for the Common Services portfolio and driving consistent, high-quality operational practices across multiple teams.

The base salary range for this role is $206,000 to $303,000.

XML job scraping automation by YubHub

]]> full-time senior hybrid $206,000 to $303,000 Site Reliability Engineering, Production Engineering, Linux-based production environments, Containers, Orchestration technologies, Observability stacks, Alerting systems, SLIs/SLOs, Error budgets, Incident management, On-call rotations, Escalation paths, Post-incident reviews, Corrective actions, Automation tooling, Infrastructure-as-code, CI/CD pipelines, GPU workloads, High-performance computing, Latency/throughput-sensitive systems, Multi-tenant environments, Multi-region environments, Regulated environments, Service ownership models, Mentoring, Managing senior engineers Engineering Technology CoreWeave https://logos.yubhub.co/coreweave.com.png CoreWeave is a cloud computing company that provides a platform for AI development and deployment. https://www.coreweave.com https://job-boards.greenhouse.io/coreweave/jobs/4650165006?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply New York, NY / Sunnyvale, CA / Bellevue, WA 2026-04-18 86622b48-10e Software Engineer, Site Reliability We are looking for a Site Reliability Engineer who thinks like a software engineer first. You will own critical production systems end-to-end, designing, building, and improving them rather than simply operating them. You will write production-quality code that keeps the platform reliable at scale, embed with product engineering teams to influence architecture from the start, and build the internal tooling that every engineer at Hebbia depends on.

Responsibilities:

Own critical production services end-to-end, from design and code review through deployment, operation, and incident response
Profile, benchmark, and rewrite hot paths to eliminate bottlenecks as Hebbia scales
Lead incident response and drive post-mortem culture, translating findings into code changes and architectural improvements rather than runbooks
Design and build observability frameworks from scratch, writing custom instrumentation, alerting logic, and debugging tooling that surfaces production issues before customers feel them
Define and enforce SLOs across platform services and build the feedback loops that keep engineering teams accountable to them
Own capacity planning and cost efficiency: model growth, right-size infrastructure, and write automation that prevents over-provisioning and resource exhaustion
Build robust, well-tested internal platforms and deployment tooling held to the same engineering standards as customer-facing code
Own and continuously improve CI/CD systems so engineering teams can ship safely and quickly
Embed with product engineering teams as a peer software engineer, contributing directly to production codebases and co-designing systems for reliability from the start
Partner on infrastructure security through threat modeling, hardening, and automated compliance tooling

Who You Are:

5+ years software development with a track record of writing, shipping, and maintaining production services, not just operating infrastructure
Production-grade proficiency in at least one systems or backend language: Go, Python, C++, or Rust
Proven experience as a Production Engineer, SRE, or software engineer with a deep infrastructure focus, comfortable owning services end-to-end across the full stack
Deep understanding of distributed systems
Container orchestration expertise and hands-on experience debugging complex distributed failures in production
Working knowledge of OS-level concepts
Cloud platform fluency (AWS preferred)
Experience in building and maintaining observability stacks
Strong CI/CD pipeline expertise and a track record of improving developer velocity without sacrificing safety
Background at a company with a Production Engineering or software-focused SRE culture is a strong plus
Experience building platforms for AI/ML workloads or high-throughput document processing pipelines is a plus

Compensation: The salary range for this role is $160,000 to $300,000. This range may be inclusive of several career levels at Hebbia and will be narrowed during the interview process based on the candidate’s experience and qualifications. Adjustments outside of this range may be considered for candidates whose qualifications significantly differ from those outlined in the job description.

XML job scraping automation by YubHub

]]> full-time senior onsite $160,000 - $300,000 Go, Python, C++, Rust, Distributed systems, Container orchestration, OS-level concepts, Cloud platform fluency (AWS), Observability stacks, CI/CD pipeline expertise Engineering Technology Hebbia https://logos.yubhub.co/hebbia.com.png Hebbia is an AI platform that generates alpha and drives upside for investors and bankers. It was founded in 2020 and backed by Peter Thiel and Andreessen Horowitz. https://hebbia.com https://job-boards.greenhouse.io/hebbia/jobs/4666955005?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply New York City; San Francisco, CA 2026-04-17 089e27b0-40a Backend Engineer We're looking for a skilled Backend Engineer to join our Data Infrastructure engineering organisation. As a member of this team, you will play a key role in helping us build analytics for internal and music industry-facing tools. Our platform enables capabilities such as showing artists how many streams their latest release has to informing internal teams about their cloud resource usage.

As a Backend Engineer, you will help us exemplify, measure and raise the reliability of data infrastructure of squads across different verticals within Spotify. You'll work closely with engineers to provide OLAP capabilities to build dynamic, reliable data visualizations and share responsibility with them in diagnosing, resolving, and preventing production issues.

Key responsibilities include:

Building, operating, and evolving data analytics platforms that include backend services as well as OLAP data stores (Druid) for teams building analytics across Spotify.
Building internal tooling, libraries, and services that streamline integration patterns with our analytics platform.
Advocating for best practices in service design, data modeling, schema evolution, and contract testing to ensure long-term maintainability.
Working in an autonomous, multi-functional environment and collaborating with squads across Spotify to continuously iterate and deliver on new product objectives.

To succeed in this role, you will need:

3+ years of relevant experience with distributed datastores and backend services.
Proficiency in Java and a willingness to learn Kubernetes and Terraform.
Understanding of data modeling, dimensional schemas, and analytical query patterns.
Experience building internal developer tools, libraries, or shared services that support large engineering organisations.
A strong sense of ownership of service quality, SLOs, and operational excellence.
Familiarity with OLAP databases or analytics warehouses (e.g., Druid, ClickHouse, Pinot, BigQuery, Snowflake).
Comfort with metrics-driven development and observability stacks (Prometheus, Grafana, similar).
Excellent communication and interpersonal skills, with the ability to work effectively with cross-functional teams.

In return, we offer a competitive salary range of $125,562-$179,374, plus equity, as well as a comprehensive benefits package including health insurance, six months' paid parental leave, 401(k) retirement plan, monthly meal allowance, 23 paid days off, 13 paid flexible holidays, and paid sick leave.

XML job scraping automation by YubHub

]]> full-time mid hybrid $125,562-$179,374 Java, Kubernetes, Terraform, OLAP databases, Analytics warehouses, Metrics-driven development, Observability stacks Engineering Technology Spotify https://logos.yubhub.co/spotify.com.png Spotify is a music streaming service with millions of users worldwide. https://www.spotify.com https://jobs.lever.co/spotify/66492688-d5b0-4cf8-b1a4-4a715157edd9?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply NYC 2026-03-31 2d29b93a-388 Software Engineer III We are seeking an experienced Software Engineer skilled in Python and/or Golang to build and maintain automation, APIs, and services that manage the lifecycle of our infrastructure platforms.

What you'll do

Design, develop, and maintain scalable APIs and automation services in Python and/or Go to manage platform operations (e.g., provisioning, configuration, access control, DNS updates).
Automate infrastructure lifecycle workflows across Kubernetes/OpenShift and related systems.

What you need

5+ years of professional software engineering experience using Python and/or Go (Golang).
Proven experience designing and implementing RESTful or gRPC APIs.

XML job scraping automation by YubHub

]]> full-time senior hybrid $119,600 - $167,300 CAD Python, Golang, Kubernetes, OpenShift, GitOps, CI/CD pipelines, containerization, Linux-based systems, PKI, certificate management, secrets automation, infrastructure-as-code, policy-as-code, DNS automation, RBAC, access control systems, observability stacks Engineering Technology Electronic Arts https://logos.yubhub.co/jobs.ea.com.png Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter. https://jobs.ea.com https://jobs.ea.com/en_US/careers/JobDetail/Software-Engineer-III/212100?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply Vancouver 2026-01-24