{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/thanos"},"x-facet":{"type":"skill","slug":"thanos","display":"Thanos","count":3},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_6b0282a9-9ee"},"title":"Staff Software Engineer, Observability","description":"<p>We are seeking a highly experienced Staff Software Engineer to lead our efforts in building, maintaining, and optimizing highly scalable, reliable, and secure systems. The Observability team is responsible for deploying and maintaining critical infrastructure at CoreWeave including our logging, tracing, and metrics platforms as well as the pipelines that feed them.</p>\n<p>Key Responsibilities:</p>\n<ul>\n<li>Lead and mentor engineers, fostering a culture of collaboration and continuous improvement.</li>\n<li>Scale logging, tracing, and metrics platforms to support a global datacenter footprint.</li>\n<li>Develop and refine monitoring and alerting to enhance system reliability.</li>\n<li>Advise engineers across CoreWeave on optimal usage of Observability systems.</li>\n<li>Automate interactions with CoreWeave&#39;s Compute Infrastructure layer.</li>\n<li>Manage production clusters and ensure development teams follow best practices for deployments.</li>\n</ul>\n<p>Required Qualifications:</p>\n<ul>\n<li>7+ years of experience in Software Engineering, Site Reliability Engineering, DevOps, or a related field.</li>\n<li>Deep expertise across all observability pillars using tools like ClickHouse, Elastic, Loki, Victoria Metrics, Prometheus, Thanos and/or Grafana.</li>\n<li>Expertise in Kubernetes, containerization, and microservices architectures.</li>\n<li>Proven track record of leading incident management and post-mortem analysis.</li>\n<li>Excellent problem-solving, analytical, and communication skills.</li>\n</ul>\n<p>Preferred Qualifications:</p>\n<ul>\n<li>Experience running and scaling observability tools as a cloud provider.</li>\n<li>Experience administering large-scale kubernetes clusters.</li>\n<li>Deep understanding of data-streaming systems.</li>\n</ul>\n<p>The base salary range for this role is $188,000 to $250,000.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_6b0282a9-9ee","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4577361006","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$188,000 to $250,000","x-skills-required":["ClickHouse","Elastic","Loki","Victoria Metrics","Prometheus","Thanos","Grafana","Kubernetes","containerization","microservices architectures"],"x-skills-preferred":["Experience running and scaling observability tools as a cloud provider","Experience administering large-scale kubernetes clusters","Deep understanding of data-streaming systems"],"datePosted":"2026-04-18T15:54:03.521Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"ClickHouse, Elastic, Loki, Victoria Metrics, Prometheus, Thanos, Grafana, Kubernetes, containerization, microservices architectures, Experience running and scaling observability tools as a cloud provider, Experience administering large-scale kubernetes clusters, Deep understanding of data-streaming systems","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":188000,"maxValue":250000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_e37e01a3-23d"},"title":"Systems Engineer, Metrics and Alerting","description":"<p>At Cloudflare, we&#39;re on a mission to help build a better Internet. We&#39;re looking for a Systems Engineer to join our Observability Team, responsible for the observability platform and stack to make our engineering teams productive. In this role, you will design, deliver, and operate software and a platform that progresses Cloudflare&#39;s Observability competency. You will solve scaling bottlenecks in critical services in our Metrics &amp; Alerting pipeline and work on highly distributed and scalable systems.</p>\n<p>As a member of our team, you will participate in the constant cycle of knowledge sharing and mentoring, participate in the global on-call rotation for the services your team owns, research and introduce cutting-edge technologies, and contribute to open-source.</p>\n<p>We are a small team, well-funded, growing and focused on building an extraordinary company. This is a software engineering/systems engineering role and is a superb opportunity to be part of a high performing team to help to support Cloudflare’s mission and help build a better internet.</p>\n<p>You may be a good fit for our team if you have a Software Engineering background and proficiency in high-level programming languages (e.g., Go), proficiency in Data structures and databases like TSDBs, Columnar stores or related, proficiency in distributed Linux environments, proficiency in designing high-scale distributed systems, proficiency in Prometheus, Alertmanager, Thanos, experience working in a fast, high-growth environment, experience working in a 24/7/365 service environment, exquisite written and verbal communication skills, familiarity with Internetworking, networking protocols Layer 2-7 of the OSI model and BGP, strong bias for action.</p>\n<p>Bonus points if you have experience with high-bandwidth transit Internetworking and routing, passion for code simplicity and performance.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_e37e01a3-23d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Cloudflare","sameAs":"https://www.cloudflare.com/","logo":"https://logos.yubhub.co/cloudflare.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/cloudflare/jobs/6673579","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Go","Data structures","Databases","Linux","Distributed systems","Prometheus","Alertmanager","Thanos"],"x-skills-preferred":["High-bandwidth transit Internetworking","Routing","Code simplicity","Performance"],"datePosted":"2026-04-18T15:49:46.565Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Hybrid"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Go, Data structures, Databases, Linux, Distributed systems, Prometheus, Alertmanager, Thanos, High-bandwidth transit Internetworking, Routing, Code simplicity, Performance"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_cbeabfab-916"},"title":"Software Engineer, Observability","description":"<p>As a Software Engineer on the Observability team, you will design, build, and maintain scalable systems that process and surface telemetry data across distributed environments.</p>\n<p>You&#39;ll contribute production-quality code in languages like Go and Python, while improving system reliability through enhanced monitoring, alerting, and incident response practices.</p>\n<p>Day to day, you&#39;ll collaborate with cross-functional engineering teams to implement observability best practices, support production systems, and help optimize performance across large-scale infrastructure.</p>\n<p>You will also participate in on-call rotations and contribute to continuous improvements based on real-world system behavior.</p>\n<p>CoreWeave is looking for a talented software engineer to join our Observability team. You will be responsible for designing, building, and maintaining scalable systems that process and surface telemetry data across distributed environments.</p>\n<p>The ideal candidate will have experience with Go and Python, as well as a strong understanding of system reliability and observability best practices.</p>\n<p>In addition to your technical skills, you should be able to collaborate effectively with cross-functional teams and communicate complex technical concepts to non-technical stakeholders.</p>\n<p>If you&#39;re passionate about building scalable systems and improving system reliability, we&#39;d love to hear from you!</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_cbeabfab-916","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4587675006","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$109,000 to $145,000","x-skills-required":["Go","Python","Kubernetes","containerization","microservices architectures","observability systems","metrics","logging","tracing"],"x-skills-preferred":["ClickHouse","Elastic","Loki","VictoriaMetrics","Prometheus","Thanos","OpenTelemetry","Grafana","Terraform","modern testing frameworks","deployment strategies","data streaming technologies","AI/ML infrastructure"],"datePosted":"2026-04-18T15:46:41.788Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York, NY / Sunnyvale, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Go, Python, Kubernetes, containerization, microservices architectures, observability systems, metrics, logging, tracing, ClickHouse, Elastic, Loki, VictoriaMetrics, Prometheus, Thanos, OpenTelemetry, Grafana, Terraform, modern testing frameworks, deployment strategies, data streaming technologies, AI/ML infrastructure","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":109000,"maxValue":145000,"unitText":"YEAR"}}}]}