{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/cost-optimisation"},"x-facet":{"type":"skill","slug":"cost-optimisation","display":"Cost Optimisation","count":2},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_68e28ed5-d02"},"title":"Head of PB CLM and Client Reporting","description":"<p>Some careers have more impact than others. If you&#39;re looking for a career where you can make a real impression, join HSBC and discover how valued you&#39;ll be.</p>\n<p>We are currently seeking an experienced professional to join our team in the role of Head of PB CLM and Client Reporting.</p>\n<p>As a senior technology leader, you will be responsible for overseeing the delivery of technology solutions that enable faster and safer delivery, with clear VSIP delivery priorities, roadmaps, and capacity plans.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Ensuring resilient and quality engineering, driving engineering best practices and automation, cloud adoption and optimisation, and service resilience across your domain.</li>\n<li>Empowering delivery teams to be efficient, driving Agile Ways of Operating and managing Technology financials and resources within approved budgets.</li>\n<li>Driving convergence towards FSA and global scalable solutions.</li>\n<li>Focusing on the improvement of customer experience, whilst clarifying and prioritising demand across all markets.</li>\n</ul>\n<p>You will partner with the business to define a coherent vision and supporting technology roadmap, aligned to the GPB Future State Architecture in line with business Target Operating models and Enterprise Technology architecture principles.</p>\n<p>All aspects of change delivered in S/VS, including adherence to global HSBC principles and frameworks.</p>\n<p>Input into S/VS backlog prioritisation and decision making, via transparent communication with key stakeholders including Domain Heads and Market CIOs.</p>\n<p>Defining and shaping technology components of the Capabilities, Features and Customer Journeys.</p>\n<p>Supporting other GPB domains, by providing a common framework for the delivery of consistent and complementary data and reporting solutions.</p>\n<p>Reducing RTB costs through minimising solution variation across Markets, demising legacy solutions and driving alignment to Future State Architecture.</p>\n<p>Developing Country Technology delivery roadmap in partnership with Business and ensuring Country demand is clearly articulated and prioritised within global portfolios.</p>\n<p>Engineering &amp; Technical Service Management</p>\n<ul>\n<li>Delivery of Technology specific development of new and existing solutions required to enable realisation of OKRs in line with the Technology resilience standards.</li>\n<li>Enabling customisation/configuration of technology solutions to factor technology-specific local market requirements and regulations.</li>\n<li>Ensuring effective service management for GPB CLM &amp; Client Reporting services and incidents, constantly aiming for high levels of customer satisfaction.</li>\n<li>Ensuring adherence to security and compliance standards, including data privacy regulations.</li>\n</ul>\n<p>Collaborating with cybersecurity and compliance teams to implement best practices, conduct audits, and mitigate risks.</p>\n<p>Leadership &amp; Teamwork</p>\n<ul>\n<li>Managing a globally dispersed Technology team, developing talent and instilling a culture of high-performance and cross-functional collaboration.</li>\n<li>Maintaining strong relationships with Business stakeholders, partnering to deliver world-class solutions within your domains.</li>\n<li>Guiding Technology teams in clarification of needs and possible solution paths, together with Business, Transformation and Flow-to-Work teams.</li>\n</ul>\n<p>Knowledge &amp; Experience/Qualifications</p>\n<ul>\n<li>Domain Knowledge – Experience in a global Private Banking technology organisation, a global and multi-divisional setting being a significant advantage.</li>\n<li>Cross-Functional Collaboration – Unifying cross-functional Business and Technology teams to drive and optimise achievement of common objectives and customer outcomes.</li>\n<li>Technology &amp; Resilience – Understanding of relevant platforms, tools, and processes to drive delivery efficiency, resilience, and engineering excellence (e.g. Mode2, TRMF, SonarQube), as well as experience with high availability, high-scale, and performant systems.</li>\n<li>Lean-Agile Practices – Proven experience with Lean-Agile ways of working (DevSecOps, sprints, Jira/Kanban workflows etc.), to increase efficiency and optimise value delivery.</li>\n<li>Financial – VSIP budget and resource allocation, value-based decision making, cost optimisation and business case development.</li>\n<li>Stakeholder Management &amp; Communication Skills – Conveying complex concepts, facilitating discussions, and driving consensus among stakeholders at all levels, with the ability to balance conflicting and changing demands through prioritisation and a pragmatic approach.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_68e28ed5-d02","directApply":true,"hiringOrganization":{"@type":"Organization","name":"HSBC","sameAs":"https://portal.careers.hsbc.com","logo":"https://logos.yubhub.co/portal.careers.hsbc.com.png"},"x-apply-url":"https://portal.careers.hsbc.com/careers/job/563774608885239","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Mode2","TRMF","SonarQube","Lean-Agile","DevSecOps","Jira/Kanban","Agile","Cloud","Automation","Service Resilience","Agile Ways of Operating","Technology Financials","Resource Allocation","Value-Based Decision Making","Cost Optimisation","Business Case Development","Stakeholder Management","Communication Skills"],"x-skills-preferred":[],"datePosted":"2026-04-18T22:11:38.210Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Guangzhou"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"Mode2, TRMF, SonarQube, Lean-Agile, DevSecOps, Jira/Kanban, Agile, Cloud, Automation, Service Resilience, Agile Ways of Operating, Technology Financials, Resource Allocation, Value-Based Decision Making, Cost Optimisation, Business Case Development, Stakeholder Management, Communication Skills"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d50772ab-afe"},"title":"Staff / Senior Software Engineer, Cloud Inference","description":"<p>We are seeking a Staff / Senior Software Engineer to join our Cloud Inference team. The successful candidate will design and build infrastructure that serves Claude across multiple cloud service providers (CSPs), accounting for differences in compute hardware, networking, APIs, and operational models.</p>\n<p>The ideal candidate will have significant software engineering experience, with a strong background in high-performance, large-scale distributed systems serving millions of users. They will also have experience building or operating services on at least one major cloud platform (AWS, GCP, or Azure), with exposure to Kubernetes, Infrastructure as Code or container orchestration.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Design and build infrastructure that serves Claude across multiple CSPs, accounting for differences in compute hardware, networking, APIs, and operational models</li>\n</ul>\n<ul>\n<li>Collaborate with CSP partner engineering teams to resolve operational issues, influence provider roadmaps, and stand up end-to-end serving on new cloud platforms</li>\n</ul>\n<ul>\n<li>Design and evolve CI/CD automation systems, including validation and deployment pipelines, that reliably ship new model versions to millions of users across cloud platforms without regressions</li>\n</ul>\n<ul>\n<li>Design interfaces and tooling abstractions across CSPs that enable cost-effective inference management, scale across providers, and reduce per-platform complexity</li>\n</ul>\n<ul>\n<li>Contribute to capacity planning and autoscaling strategies that dynamically match supply with demand across CSP validation and production workloads</li>\n</ul>\n<ul>\n<li>Optimise inference cost and performance across providers,designing workload placement and routing systems that direct requests to the most cost-effective accelerator and region</li>\n</ul>\n<ul>\n<li>Contribute to inference features that must work consistently across all platforms</li>\n</ul>\n<ul>\n<li>Analyse observability data across providers to identify performance bottlenecks, cost anomalies, and regressions, and drive remediation based on real-world production workloads</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>Significant software engineering experience, with a strong background in high-performance, large-scale distributed systems serving millions of users</li>\n</ul>\n<ul>\n<li>Experience building or operating services on at least one major cloud platform (AWS, GCP, or Azure), with exposure to Kubernetes, Infrastructure as Code or container orchestration</li>\n</ul>\n<ul>\n<li>Strong interest in inference</li>\n</ul>\n<ul>\n<li>Thrive in cross-functional collaboration with both internal teams and external partners</li>\n</ul>\n<ul>\n<li>Are a fast learner who can quickly ramp up on new technologies, hardware platforms, and provider ecosystems</li>\n</ul>\n<ul>\n<li>Are highly autonomous and self-driven, taking ownership of problems end-to-end with a bias toward flexibility and high-impact work</li>\n</ul>\n<ul>\n<li>Pick up slack, even when it goes outside your job description</li>\n</ul>\n<p>Preferred skills:</p>\n<ul>\n<li>Direct experience working with CSP partner teams to scale infrastructure or products across multiple platforms, navigating differences in networking, security, privacy, billing, and managed service offerings</li>\n</ul>\n<ul>\n<li>A background in building platform-agnostic tooling or abstraction layers that work across cloud providers</li>\n</ul>\n<ul>\n<li>Hands-on experience with capacity management, cost optimisation, or resource planning at scale across heterogeneous environments</li>\n</ul>\n<ul>\n<li>Strong familiarity with LLM inference optimisation, batching, caching, and serving strategies</li>\n</ul>\n<ul>\n<li>Experience with Machine learning infrastructure including GPUs, TPUs, Trainium, or other AI accelerators</li>\n</ul>\n<ul>\n<li>Background designing and building CI/CD systems that automate deployment and validation across cloud environments</li>\n</ul>\n<ul>\n<li>Solid understanding of multi-region deployments, geographic routing, and global traffic management</li>\n</ul>\n<ul>\n<li>Proficiency in Python or Rust</li>\n</ul>\n<p>Salary Range: $300,000-$485,000 USD</p>\n<p>Experience Level: Staff</p>\n<p>Employment Type: Full-time</p>\n<p>Workplace Type: Hybrid</p>\n<p>Category: Engineering</p>\n<p>Industry: Technology</p>\n<p>Required Skills:</p>\n<ul>\n<li>High-performance, large-scale distributed systems</li>\n</ul>\n<ul>\n<li>Cloud computing (AWS, GCP, Azure)</li>\n</ul>\n<ul>\n<li>Kubernetes</li>\n</ul>\n<ul>\n<li>Infrastructure as Code</li>\n</ul>\n<ul>\n<li>Container orchestration</li>\n</ul>\n<ul>\n<li>Inference</li>\n</ul>\n<ul>\n<li>Cross-functional collaboration</li>\n</ul>\n<ul>\n<li>Autonomy and self-driven</li>\n</ul>\n<ul>\n<li>Platform-agnostic tooling</li>\n</ul>\n<ul>\n<li>Capacity management</li>\n</ul>\n<ul>\n<li>Cost optimisation</li>\n</ul>\n<ul>\n<li>Resource planning</li>\n</ul>\n<ul>\n<li>LLM inference optimisation</li>\n</ul>\n<ul>\n<li>Machine learning infrastructure</li>\n</ul>\n<ul>\n<li>CI/CD systems</li>\n</ul>\n<ul>\n<li>Multi-region deployments</li>\n</ul>\n<ul>\n<li>Geographic routing</li>\n</ul>\n<ul>\n<li>Global traffic management</li>\n</ul>\n<ul>\n<li>Python</li>\n</ul>\n<ul>\n<li>Rust</li>\n</ul>\n<p>Preferred Skills:</p>\n<ul>\n<li>Direct experience working with CSP partner teams</li>\n</ul>\n<ul>\n<li>Building platform-agnostic tooling</li>\n</ul>\n<ul>\n<li>Hands-on experience with capacity management</li>\n</ul>\n<ul>\n<li>Strong familiarity with LLM inference optimisation</li>\n</ul>\n<ul>\n<li>Experience with Machine learning infrastructure</li>\n</ul>\n<ul>\n<li>Background designing and building CI/CD systems</li>\n</ul>\n<ul>\n<li>Solid understanding of multi-region deployments</li>\n</ul>\n<ul>\n<li>Proficiency in Python or Rust</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d50772ab-afe","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5107466008","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$300,000-$485,000 USD","x-skills-required":["high-performance, large-scale distributed systems","cloud computing (AWS, GCP, Azure)","kubernetes","infrastructure as code","container orchestration","inference","cross-functional collaboration","autonomy and self-driven","platform-agnostic tooling","capacity management","cost optimisation","resource planning","llm inference optimisation","machine learning infrastructure","ci/cd systems","multi-region deployments","geographic routing","global traffic management","python","rust"],"x-skills-preferred":["direct experience working with csp partner teams","building platform-agnostic tooling","hands-on experience with capacity management","strong familiarity with llm inference optimisation","experience with machine learning infrastructure","background designing and building ci/cd systems","solid understanding of multi-region deployments","proficiency in python or rust"],"datePosted":"2026-04-18T15:53:24.048Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"engineering","industry":"technology","skills":"high-performance, large-scale distributed systems, cloud computing (AWS, GCP, Azure), kubernetes, infrastructure as code, container orchestration, inference, cross-functional collaboration, autonomy and self-driven, platform-agnostic tooling, capacity management, cost optimisation, resource planning, llm inference optimisation, machine learning infrastructure, ci/cd systems, multi-region deployments, geographic routing, global traffic management, python, rust, direct experience working with csp partner teams, building platform-agnostic tooling, hands-on experience with capacity management, strong familiarity with llm inference optimisation, experience with machine learning infrastructure, background designing and building ci/cd systems, solid understanding of multi-region deployments, proficiency in python or rust","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":300000,"maxValue":485000,"unitText":"YEAR"}}}]}