<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>709b405a-48b</externalid>
      <Title>Staff / Senior Software Engineer, AI Reliability</Title>
      <Description><![CDATA[<p>We&#39;re seeking a Staff / Senior Software Engineer, AI Reliability to join our team. As a key member of our AIRE (AI Reliability Engineering) team, you will partner with teams across Anthropic to improve reliability across our most critical serving paths. You will develop Service Level Objectives for large language model serving systems, design and implement monitoring and observability systems, assist in the design and implementation of high-availability serving infrastructure, lead incident response for critical AI services, and support the reliability of safeguard model serving.</p>
<p>You may be a good fit for this role if you have strong distributed systems, infrastructure, or reliability backgrounds, are curious and brave, think holistically about how systems compose and where the seams are, can build lasting relationships across teams, care about users and feel ownership over outcomes, have excellent communication and collaboration skills, and bring diverse experience.</p>
<p>Strong candidates may also have experience operating large-scale model serving or training infrastructure, experience with one or more ML hardware accelerators, understanding of ML-specific networking optimizations, expertise in AI-specific observability tools and frameworks, experience with chaos engineering and systematic resilience testing, and contributions to open-source infrastructure or ML tooling.</p>
<p>We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. We value impact and believe that the highest-impact AI research will be big science. We work as a single cohesive team on just a few large-scale research efforts and value communication skills.</p>
<p>If you&#39;re interested in this role, please submit an application even if you don&#39;t believe you meet every single qualification. We encourage diversity and strive to include a range of diverse perspectives on our team.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$325,000-$485,000 USD</Salaryrange>
      <Skills>distributed systems, infrastructure, reliability, Service Level Objectives, monitoring and observability systems, high-availability serving infrastructure, incident response, safeguard model serving, large-scale model serving or training infrastructure, ML hardware accelerators, ML-specific networking optimizations, AI-specific observability tools and frameworks, chaos engineering and systematic resilience testing, open-source infrastructure or ML tooling</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a public benefit corporation that creates reliable, interpretable, and steerable AI systems.</Employerdescription>
      <Employerwebsite>https://www.anthropic.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/5113224008</Applyto>
      <Location>San Francisco, CA | New York City, NY | Seattle, WA</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>f14ee3e5-931</externalid>
      <Title>Software Engineer, UI Platform</Title>
      <Description><![CDATA[<p>As a Software Engineer on the UI Platform team at Anthropic, you will be hands-on building the platform that other engineers depend on every day.scope of work includes designing and shipping shared components and design-system-level abstractions, evolving the backend-for-frontend (BFF) APIs that power our client applications, and improving the build, deploy, and observability systems that keep Claude.ai running smoothly across surfaces.</p>
<p>This is a great fit if you care deeply about developer experience and want your engineering work to have outsized leverage: instead of shipping one feature, you&#39;re building the tools and systems that make dozens of features possible.</p>
<p>Responsibilities:</p>
<ul>
<li>Design and build shared UI components, libraries, and abstractions that product teams across Anthropic use to ship consistently and efficiently on web and mobile</li>
</ul>
<ul>
<li>Contribute to the BFF API layer that powers Claude.ai&#39;s client applications,thinking carefully about clean contracts, performance, and reliability at the boundary between frontend and backend</li>
</ul>
<ul>
<li>Improve developer velocity across the organization by reducing friction in our build, deploy, and testing pipelines</li>
</ul>
<ul>
<li>Work on performance and reliability: identify and resolve latency issues, improve observability, and help establish high standards that the rest of the platform team can build on</li>
</ul>
<ul>
<li>Partner closely with product engineering teams to understand their needs, unblock them when possible, and shape platform investments around where the most impact is</li>
</ul>
<ul>
<li>Help maintain and evolve documentation and tooling that make the platform approachable for engineers joining or building on top of it</li>
</ul>
<p>You may be a good fit if you:</p>
<ul>
<li>Have 5+ years of software engineering experience, with significant time spent building shared platforms, developer tools, or infrastructure that other engineers rely on</li>
</ul>
<ul>
<li>Have strong practical skills in modern web technologies (React, TypeScript, Next.js) and experience designing or consuming APIs that serve frontend applications</li>
</ul>
<ul>
<li>Care about developer experience and have a track record of building things that make other engineers more productive</li>
</ul>
<ul>
<li>Have solid instincts around reliability, observability, and performance,and enjoy operationalizing those instincts in production systems</li>
</ul>
<ul>
<li>Thrive in fast-paced, collaborative environments and enjoy working closely with cross-functional partners</li>
</ul>
<ul>
<li>Pick up slack, even if it goes outside your job description</li>
</ul>
<p>Strong candidates may also have experience with:</p>
<ul>
<li>Building shared component libraries or design systems for multiple surfaces (web, mobile, desktop)</li>
</ul>
<ul>
<li>BFF architectures and API patterns that balance flexibility with consistency across client platforms</li>
</ul>
<ul>
<li>Performance optimization and latency reduction in consumer-facing applications</li>
</ul>
<ul>
<li>CI/CD, build systems, and deployment automation</li>
</ul>
<ul>
<li>Observability and monitoring (metrics, logging, tracing)</li>
</ul>
<ul>
<li>Working on AI/ML products or in rapidly evolving product environments</li>
</ul>
<p>Candidates need not have:</p>
<ul>
<li>100% of the skills needed to perform the job</li>
</ul>
<ul>
<li>Formal certifications or education credentials</li>
</ul>
<p>The annual compensation range for this role is $320,000-$405,000 USD.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$320,000-$405,000 USD</Salaryrange>
      <Skills>software engineering, UI platform, shared components, design-system-level abstractions, backend-for-frontend (BFF) APIs, build, deploy, observability systems, developer experience, React, TypeScript, Next.js, APIs, performance optimization, latency reduction, CI/CD, build systems, deployment automation, observability and monitoring</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a public benefit corporation that creates reliable, interpretable, and steerable AI systems.</Employerdescription>
      <Employerwebsite>https://www.anthropic.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/4673416008</Applyto>
      <Location>San Francisco, CA | New York City, NY</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>67b4ccd7-51d</externalid>
      <Title>Senior Software Engineer, Observability Insights</Title>
      <Description><![CDATA[<p>Join CoreWeave&#39;s Observability team, where we are building the next-generation insights layer for AI systems.</p>
<p>Our team empowers internal and external users to understand, troubleshoot, and optimize complex AI workloads by transforming telemetry into actionable insights.</p>
<p>As a Senior Software Engineer on the Observability Insights team, you will lead the development of agentic interfaces and product experiences that sit atop CoreWeave&#39;s telemetry layer.</p>
<p>You&#39;ll design multi-tenant APIs, managed Grafana experiences, and MCP-based tool servers to help customers and internal teams interact with data in innovative ways.</p>
<p>Collaborating closely with PMs and engineering leadership, your work will shape the end-to-end observability experience and influence how people engage with cutting-edge AI infrastructure.</p>
<p><strong>About the role</strong></p>
<ul>
<li>6+ years of experience in software or infrastructure engineering building production-grade backend systems and distributed APIs.</li>
</ul>
<ul>
<li>Strong focus on developer-facing infrastructure, with a customer-obsessed approach to SDKs, CLIs, and APIs.</li>
</ul>
<ul>
<li>Proficient in reliability engineering, including fault-tolerant design, SLOs, error budgets, and multi-tenant system resilience.</li>
</ul>
<ul>
<li>Familiar with observability systems such as ClickHouse, Loki, VictoriaMetrics, Prometheus, and Grafana.</li>
</ul>
<ul>
<li>Experienced in agentic applications or LLM-based features, including grounding, tool calling, and operational safety.</li>
</ul>
<ul>
<li>Comfortable writing production code primarily in Go, with the ability to integrate Python components when needed.</li>
</ul>
<ul>
<li>Collaborative experience in agile teams delivering end-to-end telemetry-to-insights pipelines.</li>
</ul>
<p><strong>Preferred</strong></p>
<ul>
<li>Experience operating Kubernetes clusters at scale, especially for AI workloads.</li>
</ul>
<ul>
<li>Hands-on experience with logging, tracing, and metrics platforms in production, with deep knowledge of cardinality, indexing, and query optimization.</li>
</ul>
<ul>
<li>Experienced in running distributed systems or API services at cloud scale, including event streaming and data pipeline management.</li>
</ul>
<ul>
<li>Familiarity with LLM frameworks, MCP, and agentic tooling (e.g., Langchain, AgentCore).</li>
</ul>
<p><strong>Why CoreWeave?</strong></p>
<p>At CoreWeave, we work hard, have fun, and move fast!</p>
<p>We&#39;re in an exciting stage of hyper-growth that you will not want to miss out on.</p>
<p>We&#39;re not afraid of a little chaos, and we&#39;re constantly learning.</p>
<p>Our team cares deeply about how we build our product and how we work together, which is represented through our core values:</p>
<ul>
<li>Be Curious at Your Core</li>
</ul>
<ul>
<li>Act Like an Owner</li>
</ul>
<ul>
<li>Empower Employees</li>
</ul>
<ul>
<li>Deliver Best-in-Class Client Experiences</li>
</ul>
<ul>
<li>Achieve More Together</li>
</ul>
<p>We support and encourage an entrepreneurial outlook and independent thinking.</p>
<p>We foster an environment that encourages collaboration and enables the development of innovative solutions to complex problems.</p>
<p>As we get set for takeoff, the organization&#39;s growth opportunities are constantly expanding.</p>
<p>You will be surrounded by some of the best talent in the industry, who will want to learn from you, too.</p>
<p>Come join us!</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$165,000 to $242,000</Salaryrange>
      <Skills>software engineering, infrastructure engineering, backend systems, distributed APIs, reliability engineering, fault-tolerant design, SLOs, error budgets, multi-tenant system resilience, observability systems, ClickHouse, Loki, VictoriaMetrics, Prometheus, Grafana, agentic applications, LLM-based features, grounding, tool calling, operational safety, Go, Python, Kubernetes, logging, tracing, metrics platforms, cardinality, indexing, query optimization, event streaming, data pipeline management, LLM frameworks, MCP, agent tooling, operating Kubernetes clusters</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>CoreWeave</Employername>
      <Employerlogo>https://logos.yubhub.co/coreweave.com.png</Employerlogo>
      <Employerdescription>CoreWeave is a cloud computing company that provides a platform for building and scaling AI.</Employerdescription>
      <Employerwebsite>https://www.coreweave.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/coreweave/jobs/4650163006</Applyto>
      <Location>New York, NY / Sunnyvale, CA</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>cbeabfab-916</externalid>
      <Title>Software Engineer, Observability</Title>
      <Description><![CDATA[<p>As a Software Engineer on the Observability team, you will design, build, and maintain scalable systems that process and surface telemetry data across distributed environments.</p>
<p>You&#39;ll contribute production-quality code in languages like Go and Python, while improving system reliability through enhanced monitoring, alerting, and incident response practices.</p>
<p>Day to day, you&#39;ll collaborate with cross-functional engineering teams to implement observability best practices, support production systems, and help optimize performance across large-scale infrastructure.</p>
<p>You will also participate in on-call rotations and contribute to continuous improvements based on real-world system behavior.</p>
<p>CoreWeave is looking for a talented software engineer to join our Observability team. You will be responsible for designing, building, and maintaining scalable systems that process and surface telemetry data across distributed environments.</p>
<p>The ideal candidate will have experience with Go and Python, as well as a strong understanding of system reliability and observability best practices.</p>
<p>In addition to your technical skills, you should be able to collaborate effectively with cross-functional teams and communicate complex technical concepts to non-technical stakeholders.</p>
<p>If you&#39;re passionate about building scalable systems and improving system reliability, we&#39;d love to hear from you!</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>mid</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$109,000 to $145,000</Salaryrange>
      <Skills>Go, Python, Kubernetes, containerization, microservices architectures, observability systems, metrics, logging, tracing, ClickHouse, Elastic, Loki, VictoriaMetrics, Prometheus, Thanos, OpenTelemetry, Grafana, Terraform, modern testing frameworks, deployment strategies, data streaming technologies, AI/ML infrastructure</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>CoreWeave</Employername>
      <Employerlogo>https://logos.yubhub.co/coreweave.com.png</Employerlogo>
      <Employerdescription>CoreWeave is a cloud computing company that provides a platform for building and scaling AI workloads.</Employerdescription>
      <Employerwebsite>https://www.coreweave.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/coreweave/jobs/4587675006</Applyto>
      <Location>New York, NY / Sunnyvale, CA</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>277cf2a4-232</externalid>
      <Title>Research Engineer, AI Observability</Title>
      <Description><![CDATA[<p>As a Research Engineer on our team, you&#39;ll design and build systems that let AI analyze large, unstructured datasets , think tens or hundreds of thousands of conversations or documents , and produce structured, trustworthy insights.</p>
<p>This is a high-leverage role. The tools you build will be used by dozens of researchers and investigators, and directly shape our ability to measure and mitigate both misuse and misalignment.</p>
<p>You&#39;ll work across the full stack, from core analysis frameworks through user-facing apps and interfaces.</p>
<p>Responsibilities:</p>
<ul>
<li>Design and implement AI-based monitoring systems for AI training and deployment</li>
<li>Extend and improve core frameworks for processing large volumes of unstructured text</li>
<li>Partner with researchers and safety teams across Anthropic to understand their analytical needs and build solutions</li>
<li>Develop agentic integrations that allow AI systems to autonomously investigate and act on analytical findings</li>
<li>Contribute to the strategic direction of the team, including decisions about what to build, what to partner on, and where to invest</li>
</ul>
<p>You May Be a Good Fit If You:</p>
<ul>
<li>Have 5+ years of software engineering experience, with meaningful exposure to ML systems</li>
<li>Are excited about the problem of scaling human oversight of AI systems</li>
<li>Are familiar with LLM application development and evaluation</li>
<li>Enjoy building tools that other people use , you care about UX, reliability, and documentation</li>
<li>Thrive in collaborative, cross-functional environments</li>
</ul>
<p>Strong Candidates May Also Have:</p>
<ul>
<li>Experience with productionizing internal tools or building developer-facing platforms</li>
<li>Background in building monitoring or observability systems</li>
<li>Comfort with ambiguity , our team is small and growing, and you&#39;ll help define what we become</li>
</ul>
<p>The annual compensation range for this role is $320,000-$405,000 USD.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$320,000-$405,000 USD</Salaryrange>
      <Skills>software engineering, ML systems, LLM application development, evaluation, UX, reliability, documentation, productionizing internal tools, building developer-facing platforms, monitoring or observability systems, ambiguity</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a public benefit corporation that creates reliable, interpretable, and steerable AI systems.</Employerdescription>
      <Employerwebsite>https://www.anthropic.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/5125083008</Applyto>
      <Location>San Francisco, CA</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>1bdd60c5-d3c</externalid>
      <Title>Senior Software Engineer - Network Dev</Title>
      <Description><![CDATA[<p>About Us</p>
<p>At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world&#39;s largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies.</p>
<p>Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks.</p>
<p>About the Department</p>
<p>Cloudflare&#39;s Network Engineering Team builds and runs the infrastructure that runs our software. The Engineering Team is split into two groups: one handles product development and the other handles operations. Product development covers both new features and functionality and scaling our existing software to meet the challenges of a massively growing customer base. The operations team handles one of the world&#39;s largest networks with data centers in 190 cities worldwide and a couple of large specialized data centers for internal needs.</p>
<p>About the role</p>
<p>Cloudflare operates a large global network spanning hundreds of cities (data centers). You will join a team of talented network automation engineers who are building software solutions to improve network resilience and reduce engineering operational toil. You will work on a range of tools, infrastructure and services - new and existing - with an aim to elegantly and efficiently solve problems and deliver practical, maintainable and scalable solutions.</p>
<p>Responsibilities</p>
<ul>
<li>Join a team of talented network automation engineers who are building software solutions to improve network resilience and reduce engineering operational toil.</li>
<li>Work on a range of tools, infrastructure and services - new and existing - with an aim to elegantly and efficiently solve problems and deliver practical, maintainable and scalable solutions.</li>
</ul>
<p>Requirements</p>
<ul>
<li>BA/BS in Computer Science or equivalent experience</li>
<li>5+ years of proven experience in developing software components for network automation.</li>
<li>Strong understanding of software development principles, design patterns, and various programming languages (like python and golang)</li>
<li>Highly Proficient with modern Unix/Linux operating systems/distributions</li>
<li>Experience in MySQL, Postgres, Clickhouse (or equivalent SQL language)</li>
<li>Experience with CI/CD, containers and/or virtualization</li>
<li>Experience with Observability systems like prometheus, grafana (or equivalents)</li>
</ul>
<p>Bonus Points</p>
<ul>
<li>Knowledge of Networking engineering, with competencies in Layer 2 and Layer 3 protocols and vendor equipment: Cisco, Juniper, etc.</li>
<li>Experience building and maintaining large distributed systems</li>
<li>Experience managing internal and/or external customer requirements and expectations</li>
</ul>
<p>What Makes Cloudflare Special?</p>
<p>We&#39;re not just a highly ambitious, large-scale technology company. We&#39;re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.</p>
<p>Project Galileo: Since 2014, we&#39;ve equipped more than 2,400 journalism and civil society organizations in 111 countries with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare’s enterprise customers--at no cost.</p>
<p>Athenian Project: In 2017, we created the Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration. Since the project, we&#39;ve provided services to more than 425 local government election websites in 33 states.</p>
<p>1.1.1.1: We released 1.1.1.1 to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released.</p>
<p>Here’s the deal - we don’t store client IP addresses never, ever. We will continue to abide by our privacy commitment and ensure that no user data is sold to advertisers or used to target consumers.</p>
<p>Sound like something you’d like to be a part of? We’d love to hear from you!</p>
<p>This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.</p>
<p>Cloudflare is proud to be an equal opportunity employer. We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness. All qualified applicants will be considered for employment without regard to their, or any other person&#39;s, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law.</p>
<p>We are an AA/Veterans/Disabled Employer. Cloudflare provides reasonable accommodations to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment. If you require a reasonable accommodation to apply for a job, please contact us via e-mail at hr@cloudflare.com or via mail at 101 Townsend St. San Francisco, CA 94107.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>onsite</Workarrangement>
      <Salaryrange></Salaryrange>
      <Skills>BA/BS in Computer Science or equivalent experience, 5+ years of proven experience in developing software components for network automation, Strong understanding of software development principles, design patterns, and various programming languages (like python and golang), Highly Proficient with modern Unix/Linux operating systems/distributions, Experience in MySQL, Postgres, Clickhouse (or equivalent SQL language), Experience with CI/CD, containers and/or virtualization, Experience with Observability systems like prometheus, grafana (or equivalents), Knowledge of Networking engineering, with competencies in Layer 2 and Layer 3 protocols and vendor equipment: Cisco, Juniper, etc., Experience building and maintaining large distributed systems, Experience managing internal and/or external customer requirements and expectations</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Cloudflare</Employername>
      <Employerlogo>https://logos.yubhub.co/cloudflare.com.png</Employerlogo>
      <Employerdescription>Cloudflare operates one of the world&apos;s largest networks that powers millions of websites and other Internet properties.</Employerdescription>
      <Employerwebsite>https://www.cloudflare.com/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/cloudflare/jobs/7167953</Applyto>
      <Location>In-Office</Location>
      <Country></Country>
      <Postedate>2026-04-18</Postedate>
    </job>
    <job>
      <externalid>c930b80e-7a6</externalid>
      <Title>Staff / Senior Software Engineer, AI Reliability</Title>
      <Description><![CDATA[<p><strong>About Anthropic</strong></p>
<p>Anthropic&#39;s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</p>
<p><strong>About the Role</strong></p>
<p>AIRE (AI Reliability Engineering) partners with teams across Anthropic to improve reliability across our most critical serving paths -- every hop from the SDK through our network, API layers, serving infrastructure, and accelerators and back. We jump into the trenches alongside partner teams to make the systems that deliver Claude more robust and resilient, be it during an incident or collaborating on projects.</p>
<p>Reliability here is an emergent phenomenon that transcends any single team&#39;s boundaries, so someone has to zoom out and look at the whole picture. That&#39;s us -- and it means few teams at Anthropic offer this kind of dynamic, cross-cutting exposure to the systems that matter most.</p>
<p>Claude has your back. AIRE has Claude&#39;s. Help us keep Claude reliable for everyone who depends on it.</p>
<p><strong>Responsibilities:</strong></p>
<ul>
<li>Develop appropriate Service Level Objectives for large language model serving systems, balancing availability and latency with development velocity.</li>
</ul>
<ul>
<li>Design and implement monitoring and observability systems across the token path.</li>
</ul>
<ul>
<li>Assist in the design and implementation of high-availability serving infrastructure across multiple regions and cloud providers</li>
</ul>
<ul>
<li>Lead incident response for critical AI services, ensuring rapid recovery, thorough incident reviews, and systematic improvements.</li>
</ul>
<ul>
<li>Support the reliability of safeguard model serving -- critical for both site reliability and Anthropic&#39;s safety commitments.</li>
</ul>
<p><strong>You may be a good fit if you:</strong></p>
<ul>
<li>Have strong distributed systems, infrastructure, or reliability backgrounds -- we&#39;re looking for reliability-minded software engineers and SREs.</li>
</ul>
<ul>
<li>Are curious and brave -- comfortable jumping into unfamiliar systems during an incident and helping drive resolution even when you don&#39;t have deep expertise yet.</li>
</ul>
<ul>
<li>Think holistically about how systems compose and where the seams are.</li>
</ul>
<ul>
<li>Can build lasting relationships across teams -- our engagement model depends on being welcomed as teammates, not outsiders with opinions.</li>
</ul>
<ul>
<li>Care about users and feel ownership over outcomes, even for systems you don&#39;t own.</li>
</ul>
<ul>
<li>Have excellent communication and collaboration skills -- you&#39;ll be partnering across the entire company.</li>
</ul>
<ul>
<li>Bring diverse experience -- the team&#39;s strength comes from people who&#39;ve built product stacks, scaled databases, run massive distributed systems, and everything in between.</li>
</ul>
<p><strong>Strong candidates may also:</strong></p>
<ul>
<li>Have been an SRE, Production Engineer, or in similar reliability-focused roles on large scale systems</li>
</ul>
<ul>
<li>Have experience operating large-scale model serving or training infrastructure (&gt;1000 GPUs).</li>
</ul>
<ul>
<li>Have experience with one or more ML hardware accelerators (GPUs, TPUs, Trainium).</li>
</ul>
<ul>
<li>Understand ML-specific networking optimizations like RDMA and InfiniBand.</li>
</ul>
<ul>
<li>Have expertise in AI-specific observability tools and frameworks.</li>
</ul>
<ul>
<li>Have experience with chaos engineering and systematic resilience testing.</li>
</ul>
<ul>
<li>Have contributed to open-source infrastructure or ML tooling.</li>
</ul>
<p><strong>Logistics</strong></p>
<p><strong>Education requirements:</strong> We require at least a Bachelor&#39;s degree in a related field or equivalent experience. <strong>Location-based hybrid policy:</strong> Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</p>
<p><strong>Visa sponsorship:</strong> We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</p>
<p><strong>We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you&#39;re interested in this work.</strong></p>
<p><strong>Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you&#39;re ever unsure about a communication, don&#39;t click any links—visit anthropic.com/careers directly for confirmed position openings.</strong></p>
<p><strong>How we&#39;re different</strong></p>
<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as a team sport, where everyone contributes to the overall success of the team.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>staff</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$325,000 - $485,000 USD</Salaryrange>
      <Skills>distributed systems, infrastructure, reliability, large language model serving systems, monitoring and observability systems, high-availability serving infrastructure, incident response, safeguard model serving, SRE, Production Engineer, ML hardware accelerators, ML-specific networking optimizations, AI-specific observability tools and frameworks, chaos engineering, systematic resilience testing, open-source infrastructure or ML tooling</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a company that creates reliable, interpretable, and steerable AI systems. It has a quickly growing team of researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</Employerdescription>
      <Employerwebsite>https://job-boards.greenhouse.io</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/5113224008</Applyto>
      <Location>San Francisco, CA | New York City, NY | Seattle, WA</Location>
      <Country></Country>
      <Postedate>2026-03-08</Postedate>
    </job>
    <job>
      <externalid>453f53c5-e0d</externalid>
      <Title>Research Engineer, AI Observability</Title>
      <Description><![CDATA[<p><strong>About Anthropic</strong></p>
<p>Anthropic&#39;s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</p>
<p><strong>About the Team</strong></p>
<p>As AI training and deployments scale, the volume of data we need to monitor and understand is exploding. Our team uses Claude itself to make sense of this data. We own an integrated set of tools enabling Anthropic to ask open-ended questions, surface unexpected patterns, and maintain meaningful human oversight over massive datasets.</p>
<p>Our tools are widely adopted internally — powering ongoing enforcement, threat intelligence investigations, model audits, and more — and we’re looking for experienced engineers and researchers to both scale up existing applications and go zero-to-one on new ones.</p>
<p><strong>About the Role</strong></p>
<p>As a Research Engineer on our team, you&#39;ll design and build systems that let AI analyse large, unstructured datasets — think tens or hundreds of thousands of conversations or documents — and produce structured, trustworthy insights. You&#39;ll work across the full stack, from core analysis frameworks through user-facing apps and interfaces.</p>
<p>This is a high-leverage role. The tools you build will be used by dozens of researchers and investigators, and directly shape our ability to measure and mitigate both misuse and misalignment.</p>
<p><strong>Responsibilities:</strong></p>
<ul>
<li>Design and implement AI-based monitoring systems for AI training and deployment</li>
</ul>
<ul>
<li>Extend and improve core frameworks for processing large volumes of unstructured text</li>
</ul>
<ul>
<li>Partner with researchers and safety teams across Anthropic to understand their analytical needs and build solutions</li>
</ul>
<ul>
<li>Develop agentic integrations that allow AI systems to autonomously investigate and act on analytical findings</li>
</ul>
<ul>
<li>Contribute to the strategic direction of the team, including decisions about what to build, what to partner on, and where to invest</li>
</ul>
<p><strong>You May Be a Good Fit If You:</strong></p>
<ul>
<li>Have 5+ years of software engineering experience, with meaningful exposure to ML systems</li>
</ul>
<ul>
<li>Are excited about the problem of scaling human oversight of AI systems</li>
</ul>
<ul>
<li>Are familiar with LLM application development (context engineering, evaluation, orchestration)</li>
</ul>
<ul>
<li>Enjoy building tools that other people use — you care about UX, reliability, and documentation</li>
</ul>
<ul>
<li>Can context-switch between deep infrastructure work and user-facing product thinking</li>
</ul>
<ul>
<li>Thrive in collaborative, cross-functional environments</li>
</ul>
<p><strong>Strong Candidates May Also Have:</strong></p>
<ul>
<li>Research experience in AI safety, alignment, or responsible deployment</li>
</ul>
<ul>
<li>Practical experience with both data science and engineering, including developing and using large-scale data processing frameworks</li>
</ul>
<ul>
<li>Experience with productionizing internal tools or building developer-facing platforms</li>
</ul>
<ul>
<li>Background in building monitoring or observability systems</li>
</ul>
<ul>
<li>Comfort with ambiguity — our team is small and growing, and you&#39;ll help define what we become</li>
</ul>
<p><strong>Logistics</strong></p>
<p><strong>Education requirements:</strong> We require at least a Bachelor&#39;s degree in a related field or equivalent experience. <strong>Location-based hybrid policy:</strong> Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</p>
<p><strong>Visa sponsorship:</strong> We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</p>
<p><strong>We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you&#39;re interested in this work.</strong></p>
<p><strong>Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you&#39;re ever unsure about a communication, don&#39;t click any links—visit anthropic.com/careers directly for confirmed position openings.</strong></p>
<p><strong>How we&#39;re different</strong></p>
<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We&#39;re an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work.</p>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$320,000 - $405,000 USD</Salaryrange>
      <Skills>software engineering, ML systems, LLM application development, context engineering, evaluation, orchestration, UX, reliability, documentation, data science, engineering, large-scale data processing frameworks, productionizing internal tools, developer-facing platforms, monitoring, observability systems, research experience in AI safety, alignment, responsible deployment, practical experience with both data science and engineering, experience with productionizing internal tools or building developer-facing platforms, background in building monitoring or observability systems, comfort with ambiguity</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Anthropic</Employername>
      <Employerlogo>https://logos.yubhub.co/anthropic.com.png</Employerlogo>
      <Employerdescription>Anthropic is a quickly growing organisation with a mission to create reliable, interpretable, and steerable AI systems. Our team is a group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</Employerdescription>
      <Employerwebsite>https://job-boards.greenhouse.io</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/anthropic/jobs/5125083008</Applyto>
      <Location>San Francisco, CA</Location>
      <Country></Country>
      <Postedate>2026-03-08</Postedate>
    </job>
    <job>
      <externalid>28fd37f4-a07</externalid>
      <Title>Devops Developer</Title>
      <Description><![CDATA[<p>Join us for an opportunity to work with the best game development teams in the world. We are looking for a Devops Engineer to join the tools development and automation team supporting BioWare, Motive, Maxis, Full Circle.</p>
<p><strong>What you&#39;ll do</strong></p>
<p>This DevOps Developer role in the Software Quality organization works with Quality Assurance and Game Development teams to create tools and technical strategies. Our goal is to improve automation infrastructure and increase efficiencies in the Game Development and QA processes.</p>
<ul>
<li>Operate and maintain tools, ensuring exceptional uptime, secure environments.</li>
<li>First responder and driving continuous improvement based on root cause analysis.</li>
</ul>
<p><strong>What you need</strong></p>
<ul>
<li>5+ year experience in managing distributed, scalable and resilient high-performing systems</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>mid</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange></Salaryrange>
      <Skills>C#/.NET experience, Experience implementing data and infrastructure security best practices, Experience with container workload technologies such as Kubernetes, Helm and Docker, Experience with monitoring/observability systems such as Prometheus, Grafana and/or Datadog, Experience with continuous integration and delivery, using pipeline automation systems such as Jenkins, GitLab and GitHub</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Electronic Arts</Employername>
      <Employerlogo>https://logos.yubhub.co/jobs.ea.com.png</Employerlogo>
      <Employerdescription>Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter.</Employerdescription>
      <Employerwebsite>https://jobs.ea.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.ea.com/en_US/careers/JobDetail/Software-Developer-II/212007</Applyto>
      <Location>Montreal</Location>
      <Country></Country>
      <Postedate>2026-02-06</Postedate>
    </job>
    <job>
      <externalid>c3f17689-b79</externalid>
      <Title>Development Director</Title>
      <Description><![CDATA[<p>We are looking for an experienced Development Director to lead Quality Verification (QV) efforts for the Battlefield franchise. You will oversee quality execution across development and live service, ensuring a high-quality, scalable, and reliable player experience.</p>
<p><strong>What you&#39;ll do</strong></p>
<ul>
<li>Live Service Quality &amp; Game Operations</li>
<li>Lead quality strategy for live service operations supporting Battlefield&#39;s Free-to-Play and Battle Royale experiences.</li>
</ul>
<p><strong>What you need</strong></p>
<ul>
<li>Bachelor&#39;s degree or equivalent professional experience in the games or software industry.</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>hybrid</Workarrangement>
      <Salaryrange>$138,400 - $211,700 USD</Salaryrange>
      <Skills>Quality Assurance, Quality Verification, Leadership, People Management, Agile development methodologies, Test automation frameworks, Telemetry or observability systems</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Electronic Arts</Employername>
      <Employerlogo>https://logos.yubhub.co/jobs.ea.com.png</Employerlogo>
      <Employerdescription>Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter. A team where everyone makes play happen.</Employerdescription>
      <Employerwebsite>https://jobs.ea.com</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://jobs.ea.com/en_US/careers/JobDetail/Development-Director/212005</Applyto>
      <Location>Los Angeles</Location>
      <Country></Country>
      <Postedate>2026-01-17</Postedate>
    </job>
  </jobs>
</source>