{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/neurips-iclr-icml-publications"},"x-facet":{"type":"skill","slug":"neurips-iclr-icml-publications","display":"NeurIPS, ICLR, ICML publications","count":1},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d2f5b1e5-545"},"title":"Research Scientist, Gemini Safety","description":"<p>We&#39;re seeking a versatile Research Scientist to join our Gemini Safety team. As a Research Scientist, you will apply and develop data and algorithmic cutting-edge solutions to advance our latest user-facing models. 
Your work will focus on advancing the safety and fairness behavior of state-of-the-art AI models, driving the development of foundational technology adopted by numerous product areas, including Gemini App, Cloud API, and Search.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Post-training/instruction tuning state-of-the-art LLMs, focusing on text-to-text, image/video/audio-to-text modalities and agentic capabilities</li>\n<li>Exploring data, reasoning, and algorithmic solutions to ensure Gemini Models are safe, maximally helpful, and work for everyone</li>\n<li>Improving Gemini&#39;s adversarial robustness, with a focus on high-stakes abuse risks</li>\n<li>Designing and maintaining high-quality evaluation protocols to assess model behavior gaps and headroom related to safety and fairness</li>\n<li>Developing and executing experimental plans to address known gaps, or construct entirely new capabilities</li>\n<li>Driving innovation and enhancing understanding of Supervised Fine Tuning and Reinforcement Learning fine-tuning at scale</li>\n</ul>\n<p>For a Research Scientist on the Gemini Safety team, we look for the following skills and experience:</p>\n<ul>\n<li>PhD in Computer Science, a related field, or equivalent practical experience</li>\n<li>Significant LLM post-training experience</li>\n<li>Experience in Reward modeling and Reinforcement Learning for LLMs Instruction tuning</li>\n<li>Experience with Long-range Reinforcement learning</li>\n<li>Experience in areas such as Safety, Fairness, and Alignment</li>\n<li>Track record of publications at NeurIPS, ICLR, ICML</li>\n<li>Experience taking research from concept to product</li>\n<li>Experience collaborating on or leading an applied research project</li>\n<li>Strong experimental taste: Good judgment regarding baselines, ablations, and what is worth testing</li>\n<li>Experience with JAX</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a 
href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d2f5b1e5-545","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Google DeepMind","sameAs":"https://deepmind.com/","logo":"https://logos.yubhub.co/deepmind.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/deepmind/jobs/7731944?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["PhD in Computer Science","LLM post-training experience","Reward modeling and Reinforcement Learning for LLMs Instruction tuning","Long-range Reinforcement learning","Safety, Fairness, and Alignment","NeurIPS, ICLR, ICML publications","Research from concept to product","Collaborating or leading an applied research project","JAX"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:40:08.109Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Zurich, Switzerland"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"PhD in Computer Science, LLM post-training experience, Reward modeling and Reinforcement Learning for LLMs Instruction tuning, Long-range Reinforcement learning, Safety, Fairness, and Alignment, NeurIPS, ICLR, ICML publications, Research from concept to product, Collaborating or leading an applied research project, JAX"}]}