{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/api-rate-limits-throttling"},"x-facet":{"type":"skill","slug":"api-rate-limits-throttling","display":"API rate limits/throttling","count":1},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_3335e005-31a"},"title":"Senior Cloud Engineer","description":"<p>We&#39;re looking for a Senior Cloud Engineer to join our team. As a Senior Cloud Engineer, you will be responsible for owning Helpshift&#39;s production services, ensuring complete monitoring coverage, troubleshooting and fixing production issues. You will also design and maintain scalable GCP infrastructure using Terraform, build deployment pipelines for AI agents, manage vector databases, and implement &#39;Security-by-Design&#39; principles.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Infrastructure Ownership: Ensure complete monitoring coverage, troubleshoot and fix production issues.</li>\n<li>Infrastructure as Code (IaC): Design and maintain scalable GCP infrastructure using Terraform.</li>\n<li>AI Orchestration &amp; LLMOps: Build deployment pipelines for AI agents, managing vector databases (e.g., Vertex AI Search, Pinecone, Weaviate, ElasticSearch) and model endpoints.</li>\n<li>Security (DevSecOps): Implement &#39;Security-by-Design,&#39; including IAM least-privilege access, secret management (Secret Manager), and automated vulnerability scanning for AI workloads.</li>\n<li>CI/CD Excellence: Architect high-velocity pipelines for both traditional microservices and AI model prompts/configurations.</li>\n<li>Observability: Set up comprehensive monitoring for system health and LLM-specific metrics (latency, token usage, and cost).</li>\n<li>Cloud Governance: Optimise GCP costs and manage resource quotas, especially for GPU/TPU-intensive AI tasks.</li>\n<li>Cross Cloud Deployment: Establish &amp; Optimise the connectivity among apps deployed in different cloud environments (AWS &lt; GCP)</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>Relevant experience of 6+ years and above</li>\n<li>Expert-level Google Cloud Platform (GCP) administration skills: GKE, Cloud Run, Vertex AI, GCS, NEG etc</li>\n<li>Experience deploying Vector Databases (Pinecone, Weaviate, ElasticSearch or Vertex Search) and managing API rate limits/throttling for LLM providers.</li>\n<li>Setting up Cloud Monitoring/Logging specifically for AI metrics: token consumption, inference latency, and model error rates.</li>\n<li>In-depth knowledge of running/managing UNIX-like operating systems (we use Ubuntu)</li>\n<li>Strong knowledge of networking protocols, security architectures, and identity and access management (IAM) principles.</li>\n<li>Experience with containerisation technologies (e.g., Docker, Kubernetes) and securing containerised environments.</li>\n<li>Proficiency in Python and Bash</li>\n<li>Experience in designing and building solutions that are highly scalable, fault tolerant and cost-effective</li>\n<li>Experience with IaaC tools like Ansible, Terraform.</li>\n<li>Ability to analyse bottlenecks in architecture and quickly debug to reach a resolution for issues</li>\n<li>Have an automation mindset and ability to reason and work with complex systems.</li>\n<li>Excellent communication and documentation skills</li>\n<li>Quick learner and good mentor for junior team members</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_3335e005-31a","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Helpshift","sameAs":"https://www.helpshift.com/","logo":"https://logos.yubhub.co/helpshift.com.png"},"x-apply-url":"https://apply.workable.com/j/5C2241AE6D","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Google Cloud Platform (GCP)","Terraform","Vertex AI","Vector Databases","API rate limits/throttling","Cloud Monitoring/Logging","UNIX-like operating systems","Networking protocols","Security architectures","Identity and access management (IAM)","Containerisation technologies","Python","Bash","IaaC tools","Ansible"],"x-skills-preferred":[],"datePosted":"2026-04-24T13:06:30.939Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Google Cloud Platform (GCP), Terraform, Vertex AI, Vector Databases, API rate limits/throttling, Cloud Monitoring/Logging, UNIX-like operating systems, Networking protocols, Security architectures, Identity and access management (IAM), Containerisation technologies, Python, Bash, IaaC tools, Ansible"}]}