{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/inference-workloads"},"x-facet":{"type":"skill","slug":"inference-workloads","display":"Inference workloads","count":2},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_09e4131c-a18"},"title":"AI Infrastructure Engineer","description":"<p>We&#39;re looking for Senior+ AI Infrastructure Engineers to build the systems that train and serve Intercom&#39;s next generation of AI products.</p>\n<p>Intercom is an AI company that builds from the GPU all the way up to a user agent that resolves millions of customer service queries a month. You&#39;ll join a small, highly technical team working at the cutting edge of modern AI infrastructure.</p>\n<p>The AI Infra team built the training pipelines and runs the inference for custom models like Fin Apex, which outperforms frontier models in customer service tasks, and is the foundation of the AI Group&#39;s full stack approach to AI.</p>\n<p>As a Senior AI Infrastructure Engineer focused on model training and inference, you will:</p>\n<p>Implement and scale training pipelines for large transformer and LLM models, from data ingestion and preprocessing through distributed training and evaluation.</p>\n<p>Build and optimize inference services that deliver low-latency, high-reliability experiences for our customers, including autoscaling, routing, and fallbacks.</p>\n<p>Work on GPU-level performance: tuning kernels, improving utilization, and identifying bottlenecks across our training and inference stack.</p>\n<p>Collaborate closely with ML scientists to implement cutting edge training and inference methods and bring them to production.</p>\n<p>Play an active role in hiring, mentoring, and developing other engineers on the team.</p>\n<p>Raise the bar for technical standards, reliability, and operational excellence across Intercom’s AI platform.</p>\n<p>We’re looking to hire Senior+ AI Infrastructure Engineers. You’re likely a great fit if:</p>\n<p>You have 5+ years of experience in software engineering, with a strong track record of shipping high-quality products or platforms.</p>\n<p>You hold a degree in Computer Science, Computer Engineering, or a related field (or you have equivalent experience with very strong fundamentals).</p>\n<p>You have hands-on experience with one or more of the following:</p>\n<p>Model training (especially transformers and LLMs).</p>\n<p>Model inference at scale (again, especially transformers and LLMs).</p>\n<p>Low-level GPU work, such as writing CUDA or Triton kernels.</p>\n<p>You communicate clearly, can explain complex technical topics to different audiences, and enjoy close collaboration with both engineers and non-engineers.</p>\n<p>You take pride in strong technical fundamentals, love learning, and are willing to invest in your own development.</p>\n<p>Have deep knowledge of at least one programming language (for example Python, Ruby, Java, Go, etc.).</p>\n<p>Experience at AI native companies that train and/or run inference for their own models (e.g. modern AI labs or AI-native product companies).</p>\n<p>Experience running training or inference workloads on Kubernetes.</p>\n<p>Experience with AWS or other major cloud providers.</p>\n<p>Production experience with Python in ML or infrastructure contexts.</p>\n<p>Demonstrated passion for technology through personal projects, open source, meetups, or publishing content about your work and learnings</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_09e4131c-a18","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Intercom","sameAs":"https://www.intercom.com/","logo":"https://logos.yubhub.co/intercom.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/intercom/jobs/7824137?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["model training","model inference","low-level GPU work","CUDA","Triton","Python","Kubernetes","AWS","cloud computing"],"x-skills-preferred":["AI native companies","training workloads","inference workloads","ML scientists","cutting edge training","inference methods","operational excellence"],"datePosted":"2026-04-25T20:59:56.022Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London, England"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"model training, model inference, low-level GPU work, CUDA, Triton, Python, Kubernetes, AWS, cloud computing, AI native companies, training workloads, inference workloads, ML scientists, cutting edge training, inference methods, operational excellence"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_64780097-d2c"},"title":"Software Engineer, Backend","description":"<p>You&#39;ll build and scale the backend systems that power millions of users creating content every day on Gamma. This role is about solving real distributed systems challenges at scale while maintaining the performance and reliability users expect from a modern AI-powered product. You&#39;ll work across the full stack, shipping features that directly impact how people create and share their ideas.</p>\n<p>While this role is backend focused, you&#39;ll work across the entire product with our frontend, product, and design teams. Our full TypeScript stack is built on modern technologies including React, Node.js, PostgreSQL, Redis, and cutting-edge AI models.</p>\n<p>Our team has a strong in-office culture and works in person 4–5 days per week in San Francisco. We love working together to stay creative and connected, with flexibility to work from home when focus matters most.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Scale backend systems to hundreds of millions of users while maintaining high performance and availability</li>\n<li>Build and optimize APIs that power real-time collaborative editing and AI content generation</li>\n<li>Design and implement distributed systems that handle massive scale with reliability</li>\n<li>Ship features across the full stack, working closely with frontend engineers to deliver polished experiences</li>\n<li>Architect solutions for complex technical challenges in areas like data consistency, caching, and query optimization</li>\n<li>Collaborate with product and design to turn ideas into production-ready features</li>\n</ul>\n<p><strong>What You&#39;ll Bring</strong></p>\n<ul>\n<li>3+ years building production backend systems with strong fundamentals in distributed systems, databases, and API design</li>\n<li>Deep proficiency in TypeScript/Node.js or similar backend languages, with eagerness to work in our TypeScript stack</li>\n<li>Experience scaling systems to handle millions of users and high throughput workloads</li>\n<li>Strong understanding of PostgreSQL, Redis, or similar database technologies</li>\n<li>Passion for building APIs, scaling complex systems, and creating excellent web applications</li>\n<li>Curiosity and attitude that matches your technical knowledge</li>\n<li>Prior experience working with websockets, streaming, or scaling inference workloads (Nice to have)</li>\n</ul>\n<p><strong>Compensation Range</strong></p>\n<p>The base salary for this full-time position, which spans multiple internal levels depending on qualifications, ranges between $180K - $275K plus benefits &amp; equity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_64780097-d2c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Gamma","sameAs":"https://gamma.com","logo":"https://logos.yubhub.co/gamma.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/gamma/fb12356a-e868-4a4a-801c-882a6b0ac83f?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"Full time","x-salary-range":"$180K - $275K","x-skills-required":["TypeScript","Node.js","PostgreSQL","Redis","API design","Distributed systems","Database design"],"x-skills-preferred":["Websockets","Streaming","Inference workloads"],"datePosted":"2026-04-24T12:16:02.239Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"TypeScript, Node.js, PostgreSQL, Redis, API design, Distributed systems, Database design, Websockets, Streaming, Inference workloads","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":180000,"maxValue":275000,"unitText":"YEAR"}}}]}