{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/pretraining"},"x-facet":{"type":"skill","slug":"pretraining","display":"Pretraining","count":13},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9ecceef8-349"},"title":"Research Engineer/Research Scientist, Audio","description":"<p>We are seeking a Research Engineer/Research Scientist to join our Audio team. As a member of this team, you will work across the full stack of audio ML, developing audio codecs and representations, sourcing and synthesizing high-quality audio data, training large-scale speech language models and large audio diffusion models, and developing novel architectures for incorporating continuous signals into LLMs.</p>\n<p>Our team focuses primarily but not exclusively on speech, building advanced steerable systems spanning end-to-end conversational systems, speech and audio understanding models, and speech synthesis capabilities. The team works closely with many collaborators across pretraining, finetuning, reinforcement learning, production inference, and product to get advanced audio technologies from early research to high-impact real-world deployments.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Develop and train audio models, including conversational speech-to-speech, speech translation, speech recognition, text-to-speech, diarization, codecs, and generative audio models</li>\n<li>Work across abstraction levels, from signal processing fundamentals to large-scale model training and inference optimization</li>\n<li>Collaborate with teams across the company to develop and deploy audio technologies</li>\n<li>Communicate clearly and effectively with colleagues and stakeholders</li>\n</ul>\n<p>Strong candidates may also have experience with:</p>\n<ul>\n<li>Large language model pretraining and finetuning</li>\n<li>Training diffusion models for image and audio generation</li>\n<li>Reinforcement learning for large language models and diffusion models</li>\n<li>End-to-end system optimization, from performance benchmarking to kernel optimization</li>\n<li>GPUs, Kubernetes, PyTorch, or distributed training infrastructure</li>\n</ul>\n<p>Representative projects:</p>\n<ul>\n<li>Training state-of-the-art neural audio codecs for 48 kHz stereo audio</li>\n<li>Developing novel algorithms for diffusion pretraining and reinforcement learning</li>\n<li>Scaling audio datasets to millions of hours of high-quality audio</li>\n<li>Creating robust evaluation methodologies for hard-to-measure qualities such as naturalness or expressiveness</li>\n<li>Studying training dynamics of mixed audio-text language models</li>\n<li>Optimizing latency and inference throughput for deployed streaming audio systems</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9ecceef8-349","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5074815008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$350,000-$500,000 USD","x-skills-required":["JAX","PyTorch","large-scale distributed training","signal processing fundamentals","speech language models","audio diffusion models","continuous signals","LLMs"],"x-skills-preferred":["large language model pretraining","diffusion models","reinforcement learning","end-to-end system optimization","GPUs","Kubernetes","distributed training infrastructure"],"datePosted":"2026-04-18T15:42:59.425Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"JAX, PyTorch, large-scale distributed training, signal processing fundamentals, speech language models, audio diffusion models, continuous signals, LLMs, large language model pretraining, diffusion models, reinforcement learning, end-to-end system optimization, GPUs, Kubernetes, distributed training infrastructure","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":500000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_cc9d92de-913"},"title":"Research Engineer / Research Scientist, Vision","description":"<p>We&#39;re looking for research engineers with a strong computer vision background to work on research, development, and evaluation for state-of-the-art Claude models. In this role, you&#39;ll run experiments to evaluate architectural variants, data strategies, and SL and RL techniques to improve Claude&#39;s vision. You&#39;ll also develop and test tools, skills, and agentic infrastructure that enable Claude to reason over visual inputs. Additionally, you&#39;ll create evaluations and benchmarks that measure progress on multimodal capabilities across training and deployment.</p>\n<p>As a research engineer, you&#39;ll partner with the product org to ensure that the vision improvements you deliver impact Claude&#39;s performance on real-world tasks. You&#39;ll also work with our product org to find solutions to our most vexing API customer challenges related to vision and spatial reasoning.</p>\n<p>Strong candidates may also have experience with large-scale pretraining, SL, and RL on language models, deep learning research on images, video, or other modalities, developing complex agentic systems using LLMs, high-performance ML systems (GPUs, TPUs, JAX, PyTorch), and large-scale ETL and data pipeline development.</p>\n<p>The annual compensation range for this role is $350,000-$850,000 USD.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_cc9d92de-913","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5074217008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$350,000-$850,000 USD","x-skills-required":["computer vision","ML","software engineering","large vision language models","synthetic and real-world visual training datasets","systematic prompting, finetuning, or evaluation"],"x-skills-preferred":["large-scale pretraining","SL","RL","deep learning research","agentic systems","high-performance ML systems","ETL and data pipeline development"],"datePosted":"2026-04-18T15:42:18.530Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York City, NY; San Francisco, CA; Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"computer vision, ML, software engineering, large vision language models, synthetic and real-world visual training datasets, systematic prompting, finetuning, or evaluation, large-scale pretraining, SL, RL, deep learning research, agentic systems, high-performance ML systems, ETL and data pipeline development","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":850000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_4bd6468f-bc0"},"title":"Senior Applied Scientist","description":"<p>We&#39;re building the next-generation Grounding Service that powers the latest AI applications—chat assistants, copilots, and autonomous agents—with factual, cited, and trustworthy responses. Our platform stitches together retrieval, reasoning, and real-time data so that large language models stay anchored to enterprise knowledge, the public web, and proprietary tools.</p>\n<p>We&#39;re looking for a Senior Applied Scientist to lead end-to-end science for grounding: inventing retrieval and attribution methods, defining factuality/faithfulness metrics, and shipping production models and APIs that scale to billions of queries. You&#39;ll partner closely with engineering, product, research, and customers to deliver fast, reliable, and explainable answers with source citations across a diverse set of domains and modalities.</p>\n<p>As a team, we value curiosity, pragmatic rigor, and inclusive collaboration. We believe great systems emerge when scientists and engineers co-design metrics, models, and infrastructure—and when we obsess over customer impact, privacy, and safety.</p>\n<p>Responsibilities\n Owns the science roadmap for grounding—including retrieval, re-ranking, attribution, and reasoning—driving initiatives from problem framing to production impact.\n Designs and evolves state-of-the-art retrieval and RAG orchestration across documents, tables, code, and images.\n Builds citation and provenance systems (e.g., passage highlighting, quote-level alignment, confidence scoring) to reduce hallucinations and increase user trust.\n Leads experimentation and evaluation using A/B testing, interleaving, NDCG, MRR, precision/recall, and calibration curves to guide measurable trade-offs.\n Advances tool-augmented grounding through schema-aware retrieval, function calling, knowledge graph joins, and real-time connectors to databases, cloud object stores, search indexes, and the web.\n Partners with platform engineering to productionize models with scalable inference, embedding services, feature stores, caching, and privacy-compliant multi-tenant systems.\n Nurtures collaborative relationships with product and business leaders across Microsoft, influencing strategic decisions and driving business impact through technology.\n Authors white papers, contributes to internal tools and services, and may publish research to generate intellectual property.\n Bridges the gap between researchers (e.g., Microsoft Research) and development teams, applying long-term research to solve immediate product needs.\n Leads high-stakes negotiations to ensure cutting-edge technologies are applied practically and effectively.\n Identifies and solves significant business problems using novel, scalable, and data-driven solutions.\n Shapes the direction of Microsoft and the broader industry through pioneering product and tooling work.\n Mentors applied scientists and data scientists, establishing best practices in experimentation, error analysis, and incident review.\n Collaborates cross-functionally with PMs, research, infrastructure, and security teams to align on milestones, SLAs, and safety protocols.\n Communicates clearly through design documentation, progress updates, and presentations to executives and customers.\n Contributes to ethics and privacy policies, identifies bias in product development, and proposes mitigation strategies.</p>\n<p>Qualifications\n Required Qualifications:\n  Bachelor’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research)\n  OR Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)\n  OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)\n  OR equivalent experience.\n  Minimum of 4 years of hands-on experience designing and building search, retrieval, or ranking systems.\n  Proven track record of shipping LLM-powered or Retrieval-Augmented Generation (RAG) systems into production environments.\n  Solid coding skills and solid foundation in machine learning, with the ability to implement and optimize models effectively.\n  Demonstrated ability to lead through ambiguity, make principled trade-offs, and deliver measurable impact in cross-functional, fast-paced settings.\n Preferred Qualifications:\n  Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)\n  OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)\n  OR equivalent experience.\n  Demonstrated expertise in information retrieval, with publications in top-tier conferences or journals such as NeurIPS, ICML, ICLR, SIGIR, or ACL.\n  Hands-on experience in large language model (LLM) development, including pretraining, supervised fine-tuning (SFT), and reinforcement learning (RL).\n  Proven track record in optimizing LLM inference, or active contributions to open-source frameworks like vLLM, SGLang, or related projects.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_4bd6468f-bc0","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/senior-applied-scientist-38/","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Statistics","Econometrics","Computer Science","Electrical or Computer Engineering","Search","Retrieval","Ranking","Machine Learning","Information Retrieval","Large Language Models","Pretraining","Supervised Fine-Tuning","Reinforcement Learning"],"x-skills-preferred":["Information Retrieval","Large Language Models","Pretraining","Supervised Fine-Tuning","Reinforcement Learning"],"datePosted":"2026-03-08T22:18:58.169Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Suzhou"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, Search, Retrieval, Ranking, Machine Learning, Information Retrieval, Large Language Models, Pretraining, Supervised Fine-Tuning, Reinforcement Learning, Information Retrieval, Large Language Models, Pretraining, Supervised Fine-Tuning, Reinforcement Learning"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d0214534-b6a"},"title":"Senior Applied Scientist","description":"<p>We&#39;re building the next-generation Grounding Service that powers the latest AI applications—chat assistants, copilots, and autonomous agents—with factual, cited, and trustworthy responses. Our platform stitches together retrieval, reasoning, and real-time data so that large language models stay anchored to enterprise knowledge, the public web, and proprietary tools. We&#39;re looking for a Senior Applied Scientist to lead end-to-end science for grounding: inventing retrieval and attribution methods, defining factuality/faithfulness metrics, and shipping production models and APIs that scale to billions of queries. You&#39;ll partner closely with engineering, product, research, and customers to deliver fast, reliable, and explainable answers with source citations across a diverse set of domains and modalities. As a team, we value curiosity, pragmatic rigor, and inclusive collaboration. We believe great systems emerge when scientists and engineers co-design metrics, models, and infrastructure—and when we obsess over customer impact, privacy, and safety. Microsoft&#39;s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction. Responsibilities</p>\n<p>Owns the science roadmap for grounding—including retrieval, re-ranking, attribution, and reasoning—driving initiatives from problem framing to production impact. Designs and evolves state-of-the-art retrieval and RAG orchestration across documents, tables, code, and images. Builds citation and provenance systems (e.g., passage highlighting, quote-level alignment, confidence scoring) to reduce hallucinations and increase user trust. Leads experimentation and evaluation using A/B testing, interleaving, NDCG, MRR, precision/recall, and calibration curves to guide measurable trade-offs. Advances tool-augmented grounding through schema-aware retrieval, function calling, knowledge graph joins, and real-time connectors to databases, cloud object stores, search indexes, and the web. Partners with platform engineering to productionize models with scalable inference, embedding services, feature stores, caching, and privacy-compliant multi-tenant systems. Nurtures collaborative relationships with product and business leaders across Microsoft, influencing strategic decisions and driving business impact through technology. Authors white papers, contributes to internal tools and services, and may publish research to generate intellectual property. Bridges the gap between researchers (e.g., Microsoft Research) and development teams, applying long-term research to solve immediate product needs. Leads high-stakes negotiations to ensure cutting-edge technologies are applied practically and effectively. Identifies and solves significant business problems using novel, scalable, and data-driven solutions. Shapes the direction of Microsoft and the broader industry through pioneering product and tooling work. Mentors applied scientists and data scientists, establishing best practices in experimentation, error analysis, and incident review. Collaborates cross-functionally with PMs, research, infrastructure, and security teams to align on milestones, SLAs, and safety protocols. Communicates clearly through design documentation, progress updates, and presentations to executives and customers. Contributes to ethics and privacy policies, identifies bias in product development, and proposes mitigation strategies.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d0214534-b6a","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/senior-applied-scientist-37/","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Statistics","Econometrics","Computer Science","Electrical or Computer Engineering","Machine Learning","Information Retrieval","Large Language Model Development","Pretraining","Supervised Fine-Tuning","Reinforcement Learning","Optimizing LLM Inference"],"x-skills-preferred":["Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field","6+ years related experience (e.g., statistics, predictive analytics, research)","Demonstrated expertise in information retrieval, with publications in top-tier conferences or journals such as NeurIPS, ICML, ICLR, SIGIR, or ACL","Hands-on experience in large language model (LLM) development, including pretraining, supervised fine-tuning (SFT), and reinforcement learning (RL)","Proven track record in optimizing LLM inference, or active contributions to open-source frameworks like vLLM, SGLang, or related projects"],"datePosted":"2026-03-08T22:16:41.766Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Beijing"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, Machine Learning, Information Retrieval, Large Language Model Development, Pretraining, Supervised Fine-Tuning, Reinforcement Learning, Optimizing LLM Inference, Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field, 6+ years related experience (e.g., statistics, predictive analytics, research), Demonstrated expertise in information retrieval, with publications in top-tier conferences or journals such as NeurIPS, ICML, ICLR, SIGIR, or ACL, Hands-on experience in large language model (LLM) development, including pretraining, supervised fine-tuning (SFT), and reinforcement learning (RL), Proven track record in optimizing LLM inference, or active contributions to open-source frameworks like vLLM, SGLang, or related projects"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_58928a28-64d"},"title":"Research Engineer/Research Scientist, Audio","description":"<p><strong>About Anthropic</strong></p>\n<p>Anthropic&#39;s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</p>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have hands-on experience with training audio models, whether that&#39;s conversational speech-to-speech, speech translation, speech recognition, text-to-speech, diarization, codecs, or generative audio models</li>\n<li>Genuinely enjoy both research and engineering work, and you&#39;d describe your ideal split as roughly 50/50 rather than heavily weighted toward one or the other</li>\n<li>Are comfortable working across abstraction levels, from signal processing fundamentals to large-scale model training and inference optimization</li>\n<li>Have deep expertise with JAX, PyTorch, or large-scale distributed training, and can debug performance issues across the full stack</li>\n<li>Thrive in fast-moving environments where the most important problem might shift as we learn more about what works</li>\n<li>Communicate clearly and collaborate effectively; audio touches many parts of our systems, so you&#39;ll work closely with teams across the company</li>\n<li>Are passionate about building conversational AI that feels natural, steerable, and safe</li>\n<li>Care about the societal impacts of voice AI and want to help shape how these systems are developed responsibly</li>\n</ul>\n<p><strong>Strong candidates may also have experience with:</strong></p>\n<ul>\n<li>Large language model pretraining and finetuning</li>\n<li>Training diffusion models for image and audio generation</li>\n<li>Reinforcement learning for large language models and diffusion models</li>\n<li>End-to-end system optimization, from performance benchmarking to kernel optimization</li>\n<li>GPUs, Kubernetes, PyTorch, or distributed training infrastructure</li>\n</ul>\n<p><strong>Representative projects:</strong></p>\n<ul>\n<li>Training state-of-the art neural audio codecs for 48 kHz stereo audio</li>\n<li>Developing novel algorithms for diffusion pretraining and reinforcement learning</li>\n<li>Scaling audio datasets to millions of hours of high quality audio</li>\n<li>Creating robust evaluation methodologies for hard-to-measure qualities such as naturalness or expressiveness</li>\n<li>Studying training dynamics of mixed audio-text language models</li>\n<li>Optimizing latency and inference throughput for deployed streaming audio systems</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<p><strong>Education requirements:</strong> We require at least a Bachelor&#39;s degree in a related field or equivalent experience. <strong>Location-based hybrid policy:</strong> Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</p>\n<p><strong>Visa sponsorship:</strong> We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</p>\n<p><strong>We encourage you to apply even if you do not believe you meet every single qualification.</strong> Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you&#39;re interested in this work.</p>\n<p><strong>Your safety matters to us.</strong> To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you&#39;re ever unsure about a communication, don&#39;t click any links—visit anthropic.com/careers directly for confirmed position openings.</p>\n<p><strong>How we&#39;re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI systems that benefit society.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_58928a28-64d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://job-boards.greenhouse.io","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5074815008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$350,000 - $500,000 USD","x-skills-required":["audio models","speech-to-speech","speech translation","speech recognition","text-to-speech","diarization","codecs","generative audio models","JAX","PyTorch","large-scale distributed training"],"x-skills-preferred":["large language model pretraining","training diffusion models","reinforcement learning","end-to-end system optimization","GPUs","Kubernetes","PyTorch","distributed training infrastructure"],"datePosted":"2026-03-08T13:46:24.550Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"audio models, speech-to-speech, speech translation, speech recognition, text-to-speech, diarization, codecs, generative audio models, JAX, PyTorch, large-scale distributed training, large language model pretraining, training diffusion models, reinforcement learning, end-to-end system optimization, GPUs, Kubernetes, PyTorch, distributed training infrastructure","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":500000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c33b2d78-cc9"},"title":"Research Lead, Training Insights","description":"<p><strong>About the role</strong></p>\n<p>As a Research Lead on the Training Insights team, you&#39;ll develop the strategy for, and lead execution on, how we measure and characterise model capabilities across training and deployment. This is a hands-on leadership role: you&#39;ll drive original research into new evaluation methodologies while leading a small team of researchers and research engineers doing the same.</p>\n<p>Your work will span the full lifecycle of model development. You&#39;ll research and build new long-horizon evaluations that test the boundaries of what our models can achieve, develop novel approaches to measuring emerging capabilities, and deepen our understanding of how those capabilities develop — both during production RL training and after. You&#39;ll also take a cross-organisational view, working across Reinforcement Learning, Pretraining, Inference, Product, Alignment, Safeguards, and other teams to map the landscape of model evaluations at Anthropic and identify critical gaps in coverage.</p>\n<p>This role carries significant visibility and impact. You&#39;ll help shape the evaluation narrative for model releases, contributing directly to how Anthropic communicates about its models to both internal and external audiences. Done well, you will change how the industry measures and understands model capabilities, significantly furthering our safety mission.</p>\n<p><strong>Responsibilities:</strong></p>\n<ul>\n<li>Build new novel and long-horizon evaluations</li>\n<li>Develop novel measurement approaches for understanding how model capabilities emerge and evolve during RL training</li>\n<li>Lead strategic evaluation coverage across the company</li>\n<li>Shape the evaluation narrative for model releases</li>\n<li>Lead and mentor a small team of researchers and research engineers, setting research direction and fostering a culture of rigorous, creative research</li>\n<li>Design evaluation frameworks that balance scientific rigor with the practical demands of production training schedules</li>\n<li>Build and maintain relationships across Anthropic&#39;s research organisation to ensure evaluation insights inform training and deployment decisions</li>\n<li>Contribute to the broader research community through publications, open-source contributions, or external engagement on evaluation best practices</li>\n</ul>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have significant experience designing and running evaluations for large language models or similar complex ML systems</li>\n<li>Have led technical projects or teams, either formally or through sustained ownership of critical research directions</li>\n<li>Are equally comfortable designing experiments and writing code—you can move between research and implementation fluidly</li>\n<li>Think strategically about what to measure and why, not just how to measure it</li>\n<li>Can synthesise information across multiple teams and workstreams to form a coherent picture of model capabilities</li>\n<li>Communicate complex technical findings clearly to both technical and non-technical audiences</li>\n<li>Are results-oriented and thrive in fast-paced environments where priorities shift based on research findings</li>\n<li>Care deeply about AI safety and want your work to directly influence how capable AI systems are developed and deployed</li>\n</ul>\n<p><strong>Strong candidates may also have:</strong></p>\n<ul>\n<li>Experience building evaluations for long-horizon or agentic tasks</li>\n<li>Deep familiarity with Reinforcement Learning training dynamics and how model behaviour changes during training</li>\n<li>Published research in machine learning evaluation, benchmarking, or related areas</li>\n<li>Experience with safety evaluation frameworks and red teaming methodologies</li>\n<li>Background in psychometrics, experimental psychology, or other measurement-focused disciplines</li>\n<li>A track record of communicating evaluation results to inform high-stakes decisions about model development or deployment</li>\n<li>Experience managing or mentoring researchers and engineers</li>\n</ul>\n<p><strong>Representative projects:</strong></p>\n<ul>\n<li>Designing and implementing a suite of long-horizon evaluations that test model capabilities on tasks requiring sustained reasoning, planning, and tool use over extended interactions</li>\n<li>Building systems to track capability development across RL training checkpoints, surfacing insights about when and how specific capabilities emerge</li>\n<li>Conducting a cross-org audit of evaluation coverage, identifying blind spots, and prioritising new evaluations to fill critical gaps across Pretraining, RL, Inference, and Product</li>\n<li>Developing the evaluation methodology and narrative for a major model release, working with research leads and communications to clearly characterise model capabilities and limitations</li>\n<li>Researching and prototyping novel evaluation approaches for capabilities that are difficult to measure with existing benchmarks</li>\n<li>Leading a team effort to build reusable evaluation infrastructure that serves multiple teams across the research organisation</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<p><strong>Education requirements:</strong> We require at least a Bachelor&#39;s degree in a related field or equivalent experience. <strong>Location-based hybrid policy:</strong> Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices repsectively.</p>\n<p><strong>Visa sponsorship:</strong> We do sponsor visas!</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c33b2d78-cc9","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://job-boards.greenhouse.io","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5139654008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$850,000 - $850,000USD","x-skills-required":["machine learning","evaluation methodologies","Reinforcement Learning","Pretraining","Inference","Product","Alignment","Safeguards"],"x-skills-preferred":["psychometrics","experimental psychology","safety evaluation frameworks","red teaming methodologies"],"datePosted":"2026-03-08T13:45:37.187Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"machine learning, evaluation methodologies, Reinforcement Learning, Pretraining, Inference, Product, Alignment, Safeguards, psychometrics, experimental psychology, safety evaluation frameworks, red teaming methodologies","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":850000,"maxValue":850000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f0ed63ad-d69"},"title":"Research Engineer / Research Scientist, Vision","description":"<p><strong>About the role</strong></p>\n<p>We&#39;re looking for research engineers with a strong computer vision background who believe that visual and spatial reasoning are core to fully unlocking the capabilities of LLMs. In this role, you&#39;ll work on research, development, and evaluation for state-of-the-art Claude models, with a focus on visual and spatial capabilities.</p>\n<p><strong>What you&#39;ll do:</strong></p>\n<ul>\n<li>Run experiments to evaluate architectural variants, data strategies, and SL and RL techniques to improve Claude&#39;s vision</li>\n</ul>\n<ul>\n<li>Develop and test tools, skills, and agentic infrastructure that enable Claude to reason over visual inputs</li>\n</ul>\n<ul>\n<li>Create evaluations and benchmarks that measure progress on multimodal capabilities across training and deployment</li>\n</ul>\n<ul>\n<li>Work with our product org to find solutions to our most vexing API customer challenges related to vision and spatial reasoning</li>\n</ul>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have 7+ years of ML, computer vision, and software engineering experience through industry, academia, or other projects</li>\n</ul>\n<ul>\n<li>Are familiar with the architecture, training, and operation of large vision language models</li>\n</ul>\n<ul>\n<li>Have experience creating and evaluating large synthetic and real-world visual training datasets</li>\n</ul>\n<ul>\n<li>Have experience engaging in systematic prompting, finetuning, or evaluation</li>\n</ul>\n<ul>\n<li>Are results-oriented, with a bias towards flexibility and impact</li>\n</ul>\n<ul>\n<li>Enjoy pair programming and cross-team collaboration</li>\n</ul>\n<ul>\n<li>Care about the societal impacts of your work</li>\n</ul>\n<p><strong>Strong candidates may also have experience with:</strong></p>\n<ul>\n<li>Large-scale pretraining, SL, and RL on language models</li>\n</ul>\n<ul>\n<li>Deep learning research on images, video, or other modalities</li>\n</ul>\n<ul>\n<li>Developing complex agentic systems using LLMs</li>\n</ul>\n<ul>\n<li>High-performance ML systems (GPUs, TPUs, JAX, PyTorch)</li>\n</ul>\n<ul>\n<li>Large-scale ETL and data pipeline development</li>\n</ul>\n<p><strong>Representative projects:</strong></p>\n<ul>\n<li>Running experiments to determine ideal training datamixes and parameters for a synthetically generated vision dataset</li>\n</ul>\n<ul>\n<li>Finetuning Claude to maximise its performance using a particular set of agent tools/skills</li>\n</ul>\n<ul>\n<li>Building a pipeline to ingest and process a novel source of visual training data</li>\n</ul>\n<ul>\n<li>Designing and running experiments to evaluate the scalability of two architectural variants</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<ul>\n<li>Education requirements: We require at least a Bachelor&#39;s degree in a related field or equivalent experience.</li>\n</ul>\n<ul>\n<li>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</li>\n</ul>\n<ul>\n<li>Visa sponsorship: We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</li>\n</ul>\n<p><strong>How we&#39;re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We&#39;re an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.</p>\n<p>The easiest way to understand our research directions is to read our recent research. This research can be found on our website.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f0ed63ad-d69","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://job-boards.greenhouse.io","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5074217008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$350,000 - $850,000 USD","x-skills-required":["computer vision","large vision language models","deep learning research","high-performance ML systems","large-scale ETL and data pipeline development"],"x-skills-preferred":["large-scale pretraining","SL and RL on language models","agentic systems using LLMs"],"datePosted":"2026-03-08T13:45:30.573Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York City, NY; San Francisco, CA; Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"computer vision, large vision language models, deep learning research, high-performance ML systems, large-scale ETL and data pipeline development, large-scale pretraining, SL and RL on language models, agentic systems using LLMs","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":850000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_2bfc37e4-bc3"},"title":"Researcher, Pretraining Safety","description":"<p><strong>Job Posting</strong></p>\n<p><strong>Researcher, Pretraining Safety</strong></p>\n<p><strong>Location</strong></p>\n<p>San Francisco</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p>Safety Systems</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>$295K – $445K • Offers Equity</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p>More details about our benefits are available to candidates during the hiring process.</p>\n<p>This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.</p>\n<p><strong><strong>About the Team</strong></strong></p>\n<p>The Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to the real world to benefit the society and is at the forefront of OpenAI&#39;s mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.</p>\n<p>The Pretraining Safety team’s goal is to build safer, more capable base models and enable earlier, more reliable safety evaluation during training. We aim to:</p>\n<ol>\n<li><strong>Develop upstream safety evaluations</strong> that to monitor how and when unsafe behaviors and goals emerge;</li>\n</ol>\n<ol>\n<li><strong>Create safer priors</strong> through targeted pretraining and mid-training interventions that make downstream alignment more effective and efficient</li>\n</ol>\n<ol>\n<li><strong>Design safe-by-design architectures</strong> that allow for more controllability of model capabilities</li>\n</ol>\n<p>In addition, we will conduct the foundational research necessary for understanding how behaviors emerge, generalize, and can be reliably measured throughout training.</p>\n<p><strong><strong>About the Role</strong></strong></p>\n<p>The Pretraining Safety team is pioneering how safety is built into models before they reach post-training and deployment. In this role, you will work throughout the full stack of model development with a focus on pre-training:</p>\n<ul>\n<li>Identify safety-relevant behaviors as they first emerge in base models</li>\n</ul>\n<ul>\n<li>Evaluate and reduce risk without waiting for full-scale training runs</li>\n</ul>\n<ul>\n<li>Design architectures and training setups that make safer behavior the default</li>\n</ul>\n<ul>\n<li>Strengthen models by incorporating richer, earlier safety signals</li>\n</ul>\n<p>We collaborate across OpenAI’s safety ecosystem—from Safety Systems to Training—to ensure that safety foundations are robust, scalable, and grounded in real-world risks.</p>\n<p><strong><strong>In this role, you will:</strong></strong></p>\n<ul>\n<li>Develop new techniques to predict, measure, and evaluate unsafe behavior in early-stage models</li>\n</ul>\n<ul>\n<li>Design data curation strategies that improve pretraining priors and reduce downstream risk</li>\n</ul>\n<ul>\n<li>Explore safe-by-design architectures and training configurations that improve controllability</li>\n</ul>\n<ul>\n<li>Introduce novel safety-oriented loss functions, metrics, and evals into the pretraining stack</li>\n</ul>\n<ul>\n<li>Work closely with cross-functional safety teams to unify pre- and post-training risk reduction</li>\n</ul>\n<p><strong><strong>You might thrive in this role if you:</strong></strong></p>\n<ul>\n<li>Have experience developing or scaling pretraining architectures (LLMs, diffusion models, multimodal models, etc.)</li>\n</ul>\n<ul>\n<li>Are comfortable working with training infrastructure, data pipelines, and evaluation frameworks (e.g., Python, PyTorch/JAX, Apache Beam)</li>\n</ul>\n<ul>\n<li>Enjoy hands-on research — designing, implementing, and iterating on experiments</li>\n</ul>\n<ul>\n<li>Enjoy collaborating with diverse technical and cross-functional partners (e.g., policy, legal, training)</li>\n</ul>\n<ul>\n<li>Are data-driven with strong statistical reasoning and rigor in experimental design</li>\n</ul>\n<ul>\n<li>Value building clean, scalable research workflows and streamlining processes for yourself and others</li>\n</ul>\n<p><strong><strong>About OpenAI</strong></strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_2bfc37e4-bc3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/d829b701-5ee2-414f-8596-ef94911a168a","x-work-arrangement":"onsite","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$295K – $445K • Offers Equity","x-skills-required":["pretraining architectures","training infrastructure","data pipelines","evaluation frameworks","Python","PyTorch/JAX","Apache Beam","hands-on research","collaboration","data-driven","statistical reasoning"],"x-skills-preferred":["LLMs","diffusion models","multimodal models","safe-by-design architectures","training configurations","loss functions","metrics","evals"],"datePosted":"2026-03-06T18:36:25.493Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"pretraining architectures, training infrastructure, data pipelines, evaluation frameworks, Python, PyTorch/JAX, Apache Beam, hands-on research, collaboration, data-driven, statistical reasoning, LLMs, diffusion models, multimodal models, safe-by-design architectures, training configurations, loss functions, metrics, evals","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":295000,"maxValue":445000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_7a8a04e7-245"},"title":"Member of Technical Staff, AI Platform Engineer","description":"<p><strong>Summary</strong></p>\n<p>Microsoft are looking for a talented Member of Technical Staff, AI Platform Engineer at their Mountain View office. This role sits at the heart of strategic decision-making, turning market data into actionable insights for a company that&#39;s revolutionising AI technology. You&#39;ll work directly with leadership to shape the company&#39;s direction in the AI market.</p>\n<p><strong>About the Role</strong></p>\n<p>As an AI Platform Engineer, you will play a crucial role in building and releasing production software at the platform level. You will work on APIs, data flows, systems, and services that form the backbone of our AI products. Your expertise will help us create robust and scalable platforms that drive innovation and efficiency.</p>\n<p><strong>Accountabilities</strong></p>\n<ul>\n<li>Design, develop, and maintain platform-level software solutions.</li>\n<li>Collaborate with cross-functional teams to integrate AI capabilities into various products.</li>\n<li>Ensure the reliability, scalability, and performance of platform components.</li>\n</ul>\n<p><strong>The Candidate we&#39;re looking for</strong></p>\n<p><strong>Experience:</strong></p>\n<ul>\n<li>Bachelor’s Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to TypeScript, Python, C, C++, C#, Java OR equivalent experience.</li>\n</ul>\n<p><strong>Technical skills:</strong></p>\n<ul>\n<li>Deep experience with all of the following languages: Golang, Java/Scala, Typescript (React/Next.js)</li>\n<li>Experience in model pretraining, post-training, evaluation, and inference</li>\n<li>Experience using Machine Learning frameworks, including experience using, deploying, and scaling language learning models, either personally or professionally.</li>\n</ul>\n<p><strong>Personal attributes:</strong></p>\n<ul>\n<li>Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders.</li>\n<li>Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, designers, and other engineers.</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Software Engineering IC4 – The typical base pay range for this role across the U.S. is USD $119,800 – $234,700 per year.</li>\n<li>Software Engineering IC5 – The typical base pay range for this role across the U.S. is USD $139,900 – $274,800 per year.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_7a8a04e7-245","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/member-of-technical-staff-ai-platform-engineer/","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"USD $119,800 – $234,700 per year or USD $139,900 – $274,800 per year","x-skills-required":["Golang","Java/Scala","Typescript (React/Next.js)","model pretraining","post-training","evaluation","inference","Machine Learning frameworks"],"x-skills-preferred":["Golang","Java/Scala","Typescript (React/Next.js)"],"datePosted":"2026-03-06T07:32:28.272Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Mountain View"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Golang, Java/Scala, Typescript (React/Next.js), model pretraining, post-training, evaluation, inference, Machine Learning frameworks, Golang, Java/Scala, Typescript (React/Next.js)","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":119800,"maxValue":274800,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_cc868c06-d58"},"title":"Member of Technical Staff - AI Pretraining - MAI Superintelligence Team","description":"<p><strong>Summary</strong></p>\n<p>Microsoft AI are looking for a talented Member of Technical Staff - AI Pretraining - MAI Superintelligence Team at their Zürich office. This role sits at the heart of strategic decision-making, turning market data into actionable insights for a company that&#39;s revolutionising AI technology. You&#39;ll work directly with leadership to shape the company&#39;s direction in the AI market.</p>\n<p><strong>About the Role</strong></p>\n<p>As a Member of Technical Staff - AI Pretraining - MAI Superintelligence Team, you will develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations. You will drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack. You will collaborate closely with teams on infrastructure, data, post-training, and multimodality.</p>\n<p><strong>Accountabilities</strong></p>\n<ul>\n<li>Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations</li>\n<li>Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack</li>\n</ul>\n<p><strong>The Candidate we&#39;re looking for</strong></p>\n<p><strong>Experience:</strong></p>\n<ul>\n<li>Bachelor&#39;s Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python</li>\n</ul>\n<p><strong>Technical skills:</strong></p>\n<ul>\n<li>Proven expertise in the area of pretraining</li>\n</ul>\n<p><strong>Personal attributes:</strong></p>\n<ul>\n<li>Passionate about conversational AI and its deployment</li>\n<li>Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location</li>\n<li>Health and wellbeing benefits</li>\n<li>Professional development opportunities</li>\n<li>Financial benefits (bonus, equity, pension, etc.)</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_cc868c06-d58","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft AI","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/member-of-technical-staff-ai-pretraining-mai-superintelligence-team-2/","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["C","C++","C#","Java","JavaScript","Python"],"x-skills-preferred":["pretraining","conversational AI"],"datePosted":"2026-03-06T07:29:44.921Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Zürich"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, pretraining, conversational AI"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_25a3ff19-9bd"},"title":"Member of Technical Staff - AI Pretraining - MAI Superintelligence Team","description":"<p><strong>Summary</strong></p>\n<p>Microsoft AI are looking for a talented Member of Technical Staff - AI Pretraining - MAI Superintelligence Team at their London office. This role sits at the heart of strategic decision-making, turning market data into actionable insights for a company that&#39;s revolutionising AI technology. You&#39;ll work directly with leadership to shape the company&#39;s direction in the AI market.</p>\n<p><strong>About the Role</strong></p>\n<p>As a Member of Technical Staff - AI Pretraining - MAI Superintelligence Team, you will develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations. You will drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack. You will collaborate closely with teams on infrastructure, data, post-training, and multimodality.</p>\n<p><strong>Accountabilities</strong></p>\n<ul>\n<li>Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations.</li>\n<li>Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack.</li>\n</ul>\n<p><strong>The Candidate we&#39;re looking for</strong></p>\n<p><strong>Experience:</strong></p>\n<ul>\n<li>Bachelor&#39;s Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.</li>\n</ul>\n<p><strong>Technical skills:</strong></p>\n<ul>\n<li>Proven expertise in the area of pretraining.</li>\n</ul>\n<p><strong>Personal attributes:</strong></p>\n<ul>\n<li>Passionate about conversational AI and its deployment.</li>\n<li>Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.</li>\n<li>Health and wellbeing benefits.</li>\n<li>Professional development opportunities.</li>\n<li>Financial benefits (bonus, equity, pension, etc.).</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_25a3ff19-9bd","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft AI","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/member-of-technical-staff-ai-pretraining-mai-superintelligence-team-3/","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["C","C++","C#","Java","JavaScript","Python"],"x-skills-preferred":["pretraining","conversational AI"],"datePosted":"2026-03-06T07:29:18.175Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, pretraining, conversational AI"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_41c4fc0c-b2c"},"title":"Member of Technical Staff - AI Pretraining - MAI Superintelligence Team","description":"<p><strong>Summary</strong></p>\n<p>Microsoft AI are looking for a talented Member of Technical Staff - AI Pretraining - MAI Superintelligence Team at their Zürich office. This role sits at the heart of strategic decision-making, turning market data into actionable insights for a company that&#39;s revolutionising AI technology. You&#39;ll work directly with leadership to shape the company&#39;s direction in the AI market.</p>\n<p><strong>About the Role</strong></p>\n<p>As a Member of Technical Staff - AI Pretraining - MAI Superintelligence Team, you will develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations. You will drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack. You will collaborate closely with teams on infrastructure, data, post-training, and multimodality.</p>\n<p><strong>Accountabilities</strong></p>\n<ul>\n<li>Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations</li>\n<li>Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack</li>\n</ul>\n<p><strong>The Candidate we&#39;re looking for</strong></p>\n<p><strong>Experience:</strong></p>\n<ul>\n<li>Bachelor&#39;s Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python</li>\n</ul>\n<p><strong>Technical skills:</strong></p>\n<ul>\n<li>Proven expertise in the area of pretraining</li>\n</ul>\n<p><strong>Personal attributes:</strong></p>\n<ul>\n<li>Passionate about conversational AI and its deployment</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location</li>\n<li>Health and wellbeing benefits</li>\n<li>Professional development opportunities</li>\n<li>Financial benefits (bonus, equity, pension, etc.)</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_41c4fc0c-b2c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft AI","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/member-of-technical-staff-ai-pretraining-mai-superintelligence-team-4/","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["C","C++","C#","Java","JavaScript","Python"],"x-skills-preferred":["pretraining"],"datePosted":"2026-03-06T07:28:42.934Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Zürich"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, pretraining"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_37d2db65-306"},"title":"Member of Technical Staff - AI Pretraining - MAI Superintelligence Team","description":"<p><strong>Summary</strong></p>\n<p>Microsoft AI are looking for a talented Member of Technical Staff - AI Pretraining - MAI Superintelligence Team at their London office. This role sits at the heart of strategic decision-making, turning market data into actionable insights for a company that&#39;s revolutionising AI technology. You&#39;ll work directly with leadership to shape the company&#39;s direction in the AI market.</p>\n<p><strong>About the Role</strong></p>\n<p>As a Member of Technical Staff - AI Pretraining - MAI Superintelligence Team, you will develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations. You will drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack. You will collaborate closely with teams on infrastructure, data, post-training, and multimodality.</p>\n<p><strong>Accountabilities</strong></p>\n<ul>\n<li>Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations</li>\n<li>Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack</li>\n</ul>\n<p><strong>The Candidate we&#39;re looking for</strong></p>\n<p><strong>Experience:</strong></p>\n<ul>\n<li>Bachelor&#39;s Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python</li>\n</ul>\n<p><strong>Technical skills:</strong></p>\n<ul>\n<li>Proven expertise in the area of pretraining</li>\n</ul>\n<p><strong>Personal attributes:</strong></p>\n<ul>\n<li>Passionate about conversational AI and its deployment</li>\n<li>Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location</li>\n<li>Health and wellbeing benefits</li>\n<li>Professional development opportunities</li>\n<li>Financial benefits (bonus, equity, pension, etc.)</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_37d2db65-306","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft AI","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/member-of-technical-staff-ai-pretraining-mai-superintelligence-team/","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["C","C++","C#","Java","JavaScript","Python"],"x-skills-preferred":["pretraining","conversational AI"],"datePosted":"2026-03-06T07:26:57.778Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, pretraining, conversational AI"}]}