{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/large-scale-data-processing-systems"},"x-facet":{"type":"skill","slug":"large-scale-data-processing-systems","display":"Large Scale Data Processing Systems","count":3},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_1be5f2b8-044"},"title":"Principal Machine Learning Engineer","description":"<p>As a Principal Machine Learning Engineer, you will work on the Data Labeling and classification on large scale multi modal Copilot data part of the Microsoft AI (MAI) organization.</p>\n<p>We’re looking for a hands-on ML engineer to prototype and productionize complex classification flows on real production logs, operate prompted classifiers at scale (ad hoc and scheduled), and build secure, compliant data-labeling pipelines.</p>\n<p>We’re looking for someone with experience in data pipelines, data science, and machine learning, as well as a strong communicator and great teammate.</p>\n<p>The right candidate takes the initiative and enjoys building world-class consumer experiences and products in a fast-paced environment.</p>\n<p>Microsoft’s mission is to empower every person and every organization on the planet to achieve more.</p>\n<p>As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals.</p>\n<p>Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond</p>\n<p>Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.</p>\n<p>This expectation is subject to local law and may vary by jurisdiction.</p>\n<p>Responsibilities:</p>\n<p>Build evaluation loops (precision/recall, calibration, drift, human-in-the-loop) and publish dashboards/SLOs.</p>\n<p>Generalize machine learning (ML) solutions into repeatable frameworks.</p>\n<p>Operationalize prompted classifiers at scale (batch &amp; streaming), including orchestration, autoscaling, monitoring, and cost guardrails.</p>\n<p>Conduct thorough review of data analysis and techniques used to summarize the process review and highlight areas that have been missed or need re-examining.</p>\n<p>Collaborate cross-functionally with DS, Security, and Platform to define schemas, access patterns, and governance.</p>\n<p>Independently write efficient, readable, extensible code and model pipelines.</p>\n<p>Commit to a customer-oriented focus by acknowledging customer needs and perspectives, validating customer perspectives, focusing on broader customer context, and serving as a trusted advisor.</p>\n<p>Qualifications:</p>\n<p>Required Qualifications:</p>\n<p>Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.</p>\n<p>Preferred Qualifications:</p>\n<p>7+ years’ experience writing production-quality Python or Java or Scala code.</p>\n<p>5+ years’ experience in distributed systems design and implementation of large scale data processing systems</p>\n<p>3+ years’ experience building ML data pipelines using atleast one of AML, Promptflow, Langchain or LangGraph</p>\n<p>Demonstrated interest in Responsible AI.</p>\n<p>Experience prompting, evaluating, and working with large language models.</p>\n<p>#MicrosoftAI #mai-datainsights #mai-datainsights</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_1be5f2b8-044","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/principal-machine-learning-engineer-5/","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$139,900 - $274,800 per year","x-skills-required":["C++","C#","Java","JavaScript","Python","Machine Learning","Data Science","Distributed Systems Design","Large Scale Data Processing Systems","AML","Promptflow","Langchain","LangGraph"],"x-skills-preferred":["Responsible AI","Large Language Models"],"datePosted":"2026-04-24T12:16:06.393Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Redmond"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C++, C#, Java, JavaScript, Python, Machine Learning, Data Science, Distributed Systems Design, Large Scale Data Processing Systems, AML, Promptflow, Langchain, LangGraph, Responsible AI, Large Language Models","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":139900,"maxValue":274800,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_2da1b504-b73"},"title":"Principal Machine Learning Engineer","description":"<p>As a Principal Machine Learning Engineer, you will work on the Data Labeling and classification on large scale multi modal Copilot data part of the Microsoft AI (MAI) organization.</p>\n<p>We&#39;re looking for a hands-on ML engineer to prototype and productionize complex classification flows on real production logs, operate prompted classifiers at scale (ad hoc and scheduled), and build secure, compliant data-labeling pipelines.</p>\n<p>We&#39;re looking for someone with experience in data pipelines, data science, and machine learning, as well as a strong communicator and great teammate.</p>\n<p>The right candidate takes the initiative and enjoys building world-class consumer experiences and products in a fast-paced environment.</p>\n<p>Microsoft&#39;s mission is to empower every person and every organization on the planet to achieve more.</p>\n<p>As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals.</p>\n<p>Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond</p>\n<p>Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.</p>\n<p>This expectation is subject to local law and may vary by jurisdiction.</p>\n<p>Responsibilities:</p>\n<p>Build evaluation loops (precision/recall, calibration, drift, human-in-the-loop) and publish dashboards/SLOs.</p>\n<p>Generalize machine learning (ML) solutions into repeatable frameworks.</p>\n<p>Operationalize prompted classifiers at scale (batch &amp; streaming), including orchestration, autoscaling, monitoring, and cost guardrails.</p>\n<p>Conduct thorough review of data analysis and techniques used to summarize the process review and highlight areas that have been missed or need re-examining.</p>\n<p>Collaborate cross-functionally with DS, Security, and Platform to define schemas, access patterns, and governance.</p>\n<p>Independently write efficient, readable, extensible code and model pipelines.</p>\n<p>Commit to a customer-oriented focus by acknowledging customer needs and perspectives, validating customer perspectives, focusing on broader customer context, and serving as a trusted advisor.</p>\n<p>Qualifications:</p>\n<p>Required Qualifications:</p>\n<p>Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.</p>\n<p>Preferred Qualifications:</p>\n<p>7+ years’ experience writing production-quality Python or Java or Scala code.</p>\n<p>5+ years’ experience in distributed systems design and implementation of large scale data processing systems</p>\n<p>3+ years’ experience building ML data pipelines using atleast one of AML, Promptflow, Langchain or LangGraph</p>\n<p>Demonstrated interest in Responsible AI.</p>\n<p>Experience prompting, evaluating, and working with large language models.</p>\n<p>#MicrosoftAI #mai-datainsights #mai-datainsights</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_2da1b504-b73","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/principal-machine-learning-engineer-4/","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"Full-time","x-salary-range":"$139,900 - $274,800 per year","x-skills-required":["C","C++","C#","Java","JavaScript","Python","Data pipelines","Data science","Machine learning","Responsible AI","Large language models"],"x-skills-preferred":["Production-quality Python or Java or Scala code","Distributed systems design and implementation of large scale data processing systems","AML","Promptflow","Langchain","LangGraph"],"datePosted":"2026-04-24T12:11:19.103Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, Data pipelines, Data science, Machine learning, Responsible AI, Large language models, Production-quality Python or Java or Scala code, Distributed systems design and implementation of large scale data processing systems, AML, Promptflow, Langchain, LangGraph","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":139900,"maxValue":274800,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_de15c74c-c71"},"title":"Principal Machine Learning Engineer","description":"<p>As a Principal Machine Learning Engineer, you will work on the Data Labeling and classification on large scale multi modal Copilot data part of the Microsoft AI (MAI) organization.</p>\n<p>We’re looking for a hands-on ML engineer to prototype and productionize complex classification flows on real production logs, operate prompted classifiers at scale (ad hoc and scheduled), and build secure, compliant data-labeling pipelines.</p>\n<p>We’re looking for someone with experience in data pipelines, data science, and machine learning, as well as a strong communicator and great teammate.</p>\n<p>The right candidate takes the initiative and enjoys building world-class consumer experiences and products in a fast-paced environment.</p>\n<p>Microsoft’s mission is to empower every person and every organization on the planet to achieve more.</p>\n<p>As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals.</p>\n<p>Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond</p>\n<p>Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.</p>\n<p>This expectation is subject to local law and may vary by jurisdiction.</p>\n<p>Responsibilities:</p>\n<p>Build evaluation loops (precision/recall, calibration, drift, human-in-the-loop) and publish dashboards/SLOs.</p>\n<p>Generalize machine learning (ML) solutions into repeatable frameworks.</p>\n<p>Operationalize prompted classifiers at scale (batch &amp; streaming), including orchestration, autoscaling, monitoring, and cost guardrails.</p>\n<p>Conduct thorough review of data analysis and techniques used to summarize the process review and highlight areas that have been missed or need re-examining.</p>\n<p>Collaborate cross-functionally with DS, Security, and Platform to define schemas, access patterns, and governance.</p>\n<p>Independently write efficient, readable, extensible code and model pipelines.</p>\n<p>Commit to a customer-oriented focus by acknowledging customer needs and perspectives, validating customer perspectives, focusing on broader customer context, and serving as a trusted advisor.</p>\n<p>Qualifications:</p>\n<p>Required Qualifications:</p>\n<p>Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.</p>\n<p>Preferred Qualifications:</p>\n<p>7+ years’ experience writing production-quality Python or Java or Scala code.</p>\n<p>5+ years’ experience in distributed systems design and implementation of large scale data processing systems</p>\n<p>3+ years’ experience building ML data pipelines using atleast one of AML, Promptflow, Langchain or LangGraph</p>\n<p>Demonstrated interest in Responsible AI.</p>\n<p>Experience prompting, evaluating, and working with large language models.</p>\n<p>#MicrosoftAI #mai-datainsights #mai-datainsights</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_de15c74c-c71","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/principal-machine-learning-engineer-6/","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$139,900 - $274,800 per year","x-skills-required":["C","C++","C#","Java","JavaScript","Python","Data pipelines","Data science","Machine learning","Distributed systems","Large scale data processing systems","AML","Promptflow","Langchain","LangGraph","Responsible AI","Large language models"],"x-skills-preferred":[],"datePosted":"2026-04-24T12:11:06.556Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Mountain View"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, Data pipelines, Data science, Machine learning, Distributed systems, Large scale data processing systems, AML, Promptflow, Langchain, LangGraph, Responsible AI, Large language models","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":139900,"maxValue":274800,"unitText":"YEAR"}}}]}