{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/monitoring-and-logging"},"x-facet":{"type":"skill","slug":"monitoring-and-logging","display":"Monitoring & Logging","count":3},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c45bee07-fb6"},"title":"IT Systems Administrator - Linux","description":"<p>We&#39;re seeking a Systems Administrator with deep Linux expertise to help maintain and evolve our infrastructure. In this role, you&#39;ll be responsible for ensuring the reliability, performance, and security of core systems that support day-to-day operations.</p>\n<p>You&#39;ll work closely with technical teams to implement improvements, streamline operations, and keep our environment resilient.</p>\n<p>Your responsibilities will include: Administering and maintaining Linux servers and services, focusing on stability, scalability, and security Performing patching, upgrades, and configuration management to keep systems current and compliant Managing authentication, and access controls across Linux and integrated platforms Supporting and troubleshooting core infrastructure services (DNS, DHCP, VPN, SSH, NFS, etc.) Developing and maintaining automation workflows to reduce manual work and improve consistency Monitoring system performance, responding to incidents, and implementing preventative measures Collaborating with different departments to understand and meet their technical requirements and support needs Testing and evaluating new technology to determine its potential benefits for the company Documenting procedures, configurations, and troubleshooting steps for team use</p>\n<p>Requirements: 3–5+ years of experience as a Systems Administrator with a strong Linux focus Hands-on expertise with major Linux distributions (e.g. RHEL, CentOS, Ubuntu) in server environments Solid foundation in networking concepts and services (TCP/IP, DNS, DHCP, VPNs) Proficiency with virtualization platforms (VMware, KVM, or similar) Experience with authentication and access management in Linux environments (e.g. LDAP, Kerberos, SSSD, PAM) Strong troubleshooting skills and ability to manage multiple priorities effectively Clear, collaborative communication skills for cross-team work</p>\n<p>Bonus: Experience with automation and configuration management tools (e.g., Ansible) Familiarity with identity management platforms such as FreeIPA or Active Directory integration Exposure to cloud environments (AWS, GCP, Azure) Knowledge of containerization and orchestration (Docker, Kubernetes) Experience with monitoring and logging tools (Grafana, Prometheus, ELK, Nagios)</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c45bee07-fb6","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Astranis","sameAs":"https://astranis.com/","logo":"https://logos.yubhub.co/astranis.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/astranis/jobs/4243225006","x-work-arrangement":"onsite","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$120,000-$150,000 USD","x-skills-required":["Linux","Systems Administration","Networking","Virtualization","Authentication and Access Management"],"x-skills-preferred":["Automation and Configuration Management","Identity Management","Cloud Computing","Containerization and Orchestration","Monitoring and Logging"],"datePosted":"2026-04-24T15:19:05.635Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"IT","industry":"Technology","skills":"Linux, Systems Administration, Networking, Virtualization, Authentication and Access Management, Automation and Configuration Management, Identity Management, Cloud Computing, Containerization and Orchestration, Monitoring and Logging","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":120000,"maxValue":150000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f4d78b19-44f"},"title":"FBS AWS Data Engineer","description":"<p>We are seeking a skilled and self-driven AWS Data Engineer to design, develop, and maintain scalable data ingestion frameworks that support enterprise analytics and reporting.</p>\n<p>The ideal candidate will have deep expertise in AWS technologies, data lake architecture, and cross-functional collaboration to deliver high-quality data solutions.</p>\n<p>Key Responsibilities:</p>\n<p>Data Ingestion &amp; Framework Development</p>\n<ul>\n<li>Design, build, and maintain reusable, modular, and configuration-driven frameworks for ingesting both historical and incremental data from diverse sources into Iceberg tables on AWS S3.</li>\n</ul>\n<ul>\n<li>Expose ingested data to Snowflake via Snowflake external tables, ensuring seamless integration and accessibility.</li>\n</ul>\n<ul>\n<li>Implement robust logging mechanisms to monitor all data processes, ensuring completeness, timeliness, accuracy, and validity (ABC metrics).</li>\n</ul>\n<ul>\n<li>Configure automated notifications to alert support teams of process statuses and anomalies.</li>\n</ul>\n<ul>\n<li>Adhere to architectural standards and development best practices throughout the lifecycle.</li>\n</ul>\n<p>Solution Design &amp; Execution</p>\n<ul>\n<li>Translate complex business requirements into scalable and efficient technical solutions.</li>\n</ul>\n<ul>\n<li>Independently plan and execute the implementation of new data capabilities, including:</li>\n</ul>\n<ul>\n<li>Development of project plans with clear milestones and delivery timelines.</li>\n</ul>\n<ul>\n<li>Task breakdown, assignment, and management.</li>\n</ul>\n<ul>\n<li>Comprehensive documentation and tracking of work using Rally or equivalent tools.</li>\n</ul>\n<ul>\n<li>Identification and management of dependencies across cross-functional teams.</li>\n</ul>\n<p>Cross-Team Collaboration</p>\n<ul>\n<li>Coordinate effectively with internal and external stakeholders, including:</li>\n</ul>\n<ul>\n<li>Cloud Operations</li>\n</ul>\n<ul>\n<li>Information Security</li>\n</ul>\n<ul>\n<li>Business Units</li>\n</ul>\n<ul>\n<li>Other Development Teams</li>\n</ul>\n<ul>\n<li>Facilitate alignment and secure commitment from partner teams to meet project deliverables and dependency timelines.</li>\n</ul>\n<p>Proactive, Timely, Concise and Audience Appropriate Communication</p>\n<ul>\n<li>Communicates complex technical concepts to technical and non-technical personnel.</li>\n</ul>\n<ul>\n<li>Delivers routine progress and status to stakeholders.</li>\n</ul>\n<ul>\n<li>Communicates information in line with the target audience experience, background, and expectations; uses terms, examples, and analogies that are meaningful to the audience.</li>\n</ul>\n<ul>\n<li>Ensures accuracy of information communicated to effectively support project leadership decision making.</li>\n</ul>\n<p>Continuous Improvement</p>\n<ul>\n<li>Proactively accumulates and maintains knowledge of current and emerging/evolving technologies, concepts, and trends in the IT field.</li>\n</ul>\n<ul>\n<li>Provides input on improving or enhancing existing organizational processes based on lessons learned and experiences from project work.</li>\n</ul>\n<ul>\n<li>Performs root cause analysis to quickly identify and resolve issues causing recurring technical problems.</li>\n</ul>\n<p>Self-Driven Problem Solving &amp; Initiative</p>\n<ul>\n<li>Demonstrates a high degree of independence and ownership in driving initiatives from concept to completion.</li>\n</ul>\n<ul>\n<li>Proactively identifies challenges and inefficiencies, and takes swift action to resolve them without waiting for direction.</li>\n</ul>\n<ul>\n<li>Navigates complex organizational structures to engage the right stakeholders and ensure timely delivery.</li>\n</ul>\n<ul>\n<li>Maintains a solution-oriented mindset, continuously seeking opportunities to improve processes, enhance collaboration, and deliver value.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f4d78b19-44f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Capgemini","sameAs":"https://www.capgemini.com/","logo":"https://logos.yubhub.co/capgemini.com.png"},"x-apply-url":"https://jobs.workable.com/view/xzJcJMrshbQVkrddwFjuqG/remote-fbs-aws-data-engineer-in-brazil-at-capgemini","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Data Processing & Orchestration","Storage & Lakehouse Architecture","Security & Access Management","Monitoring & Logging","Development & Automation"],"x-skills-preferred":[],"datePosted":"2026-04-24T14:15:59.443Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Brazil"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"Data Processing & Orchestration, Storage & Lakehouse Architecture, Security & Access Management, Monitoring & Logging, Development & Automation"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_3c88160f-6ad"},"title":"Site Reliability Manager","description":"<p>We are a technology organisation operating high-performance, large-scale Linux production environments that support critical platforms and engineering teams. Our focus is on operational excellence, service reliability, automation, and continuous improvement. We run 24x7 operations and partner closely with platform, network, security, and engineering teams to deliver stable, secure, and scalable infrastructure.</p>\n<p>You will lead and manage a 24x7 L1 Linux Engineering / SRE team operating in rotational shifts. Your responsibilities will include owning hiring, onboarding, performance management, coaching, and career development for L1 engineers. You will also own L1 production support operations for Linux systems in a 24x7 environment, acting as the first leadership escalation point during major production incidents.</p>\n<p>Key responsibilities include ensuring adherence to SLAs, OLAs, and operational KPIs such as availability and MTTR. You will provide technical oversight across Linux OS, bare metal and virtualized platforms, and monitoring/logging systems. Driving automation adoption using Ansible, Bash, and Python to reduce manual toil is also a key aspect of this role.</p>\n<p>You will partner with platform, network, security, and engineering teams to improve system reliability and resilience. Your impact will be ensuring stable, reliable, and efficient 24x7 L1 Linux/SRE operations, reducing incident recurrence and improving incident response and resolution times, building a skilled, motivated, and well-governed L1 engineering team, and improving operational maturity through automation, standardization, and documentation.</p>\n<p>To succeed in this role, you will need 10–14+ years of experience in IT Infrastructure, Linux Operations, or SRE, with 4–6+ years of people management experience, preferably managing 24x7 support teams. You will also need a strong hands-on background in Linux system administration and production support, experience with incident management, on-call models, and rotational shifts, advanced knowledge of Linux OS internals, experience with virtualization platforms (VMware, KVM, OpenStack, oVirt), knowledge of monitoring and logging tools (e.g., Nagios, ELK), experience with automation and configuration management (Ansible), and scripting skills in Bash and/or Python.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_3c88160f-6ad","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Synopsys","sameAs":"https://careers.synopsys.com","logo":"https://logos.yubhub.co/careers.synopsys.com.png"},"x-apply-url":"https://careers.synopsys.com/job/bengaluru/site-reliability-manager/44408/94212497792","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Linux","Site Reliability Engineering","Automation","Configuration Management","Scripting","Virtualization","Monitoring and Logging"],"x-skills-preferred":[],"datePosted":"2026-04-24T14:14:47.010Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bengaluru"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Linux, Site Reliability Engineering, Automation, Configuration Management, Scripting, Virtualization, Monitoring and Logging"}]}