{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/large-scale-model-training"},"x-facet":{"type":"skill","slug":"large-scale-model-training","display":"Large-Scale Model Training","count":4},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0a60466d-c3b"},"title":"Senior Machine Learning Engineer (Research Scientist) - Data Foundation & AI","description":"<p>We believe that the way people interact with their finances will drastically improve in the next few years. We&#39;re dedicated to empowering this transformation by building the tools and experiences that thousands of developers use to create their own products.</p>\n<p>The Data Foundation and AI team within Plaid&#39;s Data organization builds and maintains the shared machine learning and AI infrastructure that powers capabilities across Plaid&#39;s product suite. The team transforms Plaid&#39;s unique financial network data into general-purpose representations that can be leveraged by teams across the company.</p>\n<p>As a Senior Research Scientist on the Data Foundation and AI team, you will lead applied research on Plaid&#39;s foundation model by designing model architectures, pretraining objectives, and fine-tuning strategies that generalize across a wide range of downstream product use cases. You will also build and maintain end-to-end production machine learning systems, including training pipelines, model serving infrastructure, feature engineering, and monitoring.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Building a foundation model on one of the world&#39;s richest financial datasets that no one else has.</li>\n<li>Doing research that ships: moving from experimentation and prototypes to production systems serving real customers.</li>\n<li>Working across the full ML stack, from pretraining objectives and architectures to serving infrastructure and monitoring.</li>\n<li>Collaborating with a high-caliber team and seeing your work amplify the capabilities of multiple product teams.</li>\n<li>Helping hundreds of millions of consumers achieve greater financial freedom through data-driven products.</li>\n</ul>\n<p>Qualifications:</p>\n<ul>\n<li>Strong applied ML research skills with production delivery experience.</li>\n<li>Depth in Transformers/LLMs, representation learning, or large-scale model training.</li>\n<li>Demonstrated ability to ship models to production (not just prototype).</li>\n<li>Distributed training experience and strong Python + software engineering fundamentals.</li>\n<li>Fintech / financial data domain experience is a plus.</li>\n<li>External publications or open-source contributions is a plus.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0a60466d-c3b","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Plaid","sameAs":"https://plaid.com/","logo":"https://logos.yubhub.co/plaid.com.png"},"x-apply-url":"https://jobs.lever.co/plaid/2e7c75c3-1da5-4eb4-9b60-44de246a0fd8?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$228,960-$315,360 per year","x-skills-required":["Transformers/LLMs","Representation learning","Large-scale model training","Distributed training","Python","Software engineering fundamentals"],"x-skills-preferred":[],"datePosted":"2026-04-24T16:09:35.664Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"Transformers/LLMs, Representation learning, Large-scale model training, Distributed training, Python, Software engineering fundamentals","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":228960,"maxValue":315360,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_02a96eda-a01"},"title":"Researcher, Agentic Post-Training","description":"<p><strong>Compensation</strong></p>\n<p>Estimated Base Salary $295K – $445K</p>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p><strong>Team Description</strong></p>\n<p>OpenAI is looking for exceptional researchers to join the Post-Training Frontiers team, which is responsible for post-training the agentic models we ship across Codex, the API, ChatGPT Thinking, and ChatGPT Pro. The Post-Training Frontiers team sets up the pipeline for deciding which integrations can go into the post-training run, develops its own horizontal improvements to the model, and trains the final model.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Own end-to-end research and engineering projects that improve the final post-training of OpenAI’s agentic models.</li>\n</ul>\n<ul>\n<li>Decide, together with partner teams, which integrations are ready for inclusion in major model runs.</li>\n</ul>\n<ul>\n<li>Develop horizontal model improvements across factuality, instruction following, tool/function calling, multi-agent behavior, reasoning-effort calibration, and other broad capabilities.</li>\n</ul>\n<ul>\n<li>Build and improve training, evaluation, grading, and data infrastructure for large-scale RL/post-training runs.</li>\n</ul>\n<ul>\n<li>Create evals and diagnostics that help us understand whether a model is ready to ship.</li>\n</ul>\n<ul>\n<li>Improve the feedback loop from real product usage into post-training, including better ways to learn from implicit user feedback.</li>\n</ul>\n<ul>\n<li>Collaborate closely with Codex, API, ChatGPT, product, training, and other post-training teams to make frontier models more useful, reliable, and agentic.</li>\n</ul>\n<p><strong>Nice to have</strong></p>\n<ul>\n<li>Experience with large-scale model training or RL systems.</li>\n</ul>\n<ul>\n<li>Experience building evals, graders, reward models, or data pipelines for LLM training.</li>\n</ul>\n<ul>\n<li>Experience with coding agents, tool-using agents, browser/computer-use agents, function calling, or multi-agent systems.</li>\n</ul>\n<ul>\n<li>Background in quant, systems, infra, or other environments where you built reliable machinery for high-stakes experimentation.</li>\n</ul>\n<ul>\n<li>Evidence of strong product taste, especially around writing, design, code generation, or agent workflows.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_02a96eda-a01","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://openai.com/","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/4acc3515-887f-4bc9-8c2a-00f36ae480c3?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply","x-work-arrangement":null,"x-experience-level":null,"x-job-type":"Full time","x-salary-range":"$295K – $445K","x-skills-required":["ML fundamentals","LLMs","RL","RLHF","post-training","evals","model training"],"x-skills-preferred":["large-scale model training","RL systems","graders","reward models","data pipelines","LLM training","coding agents","tool-using agents","browser/computer-use agents","function calling","multi-agent systems","quant","systems","infra","product taste","writing","design","code generation","agent workflows"],"datePosted":"2026-04-24T12:22:09.599Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"ML fundamentals, LLMs, RL, RLHF, post-training, evals, model training, large-scale model training, RL systems, graders, reward models, data pipelines, LLM training, coding agents, tool-using agents, browser/computer-use agents, function calling, multi-agent systems, quant, systems, infra, product taste, writing, design, code generation, agent workflows","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":295000,"maxValue":445000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_2e513a92-ec5"},"title":"Research Scientist (Generative Modeling)","description":"<p>We are seeking a talented Research Scientist with a strong background in generative modeling, particularly diffusion models, to join our modeling team. This role is ideal for candidates with deep expertise in diffusion models applied to images, videos, or 3D assets and scenes.</p>\n<p>While experience in one or more of the following areas is a strong plus: large-scale model training, research in 3D computer vision.</p>\n<p>You will collaborate closely with researchers, engineers, and product teams to bring advanced 3D modeling and machine learning techniques into real-world applications, ensuring that our technology remains at the forefront of visual innovation. This role involves significant hands-on research and engineering work, driving projects from conceptualization through to production deployment.</p>\n<p>Key responsibilities include designing, implementing, and training large-scale diffusion models for generating 3D worlds, developing and experimenting with large-scale diffusion models to add novel control signals, adapting to target aesthetic preferences, or distilling for efficient inference, collaborating closely with research and product teams to understand and translate product requirements into effective technical roadmaps, contributing hands-on to all stages of model development including data curation, experimentation, evaluation, and deployment, continuously exploring and integrating cutting-edge research in diffusion and generative AI more broadly, acting as a key technical resource within the team, mentoring colleagues, and driving best practices in generative modeling and ML engineering.</p>\n<p>Ideal candidate profile includes 3+ years of experience in generative modeling or applied ML roles, extensive experience with machine learning frameworks such as PyTorch or TensorFlow, especially in the context of diffusion models and other generative models, deep expertise in at least one area of generative modeling, strong history of publications or open-source contributions involving large-scale diffusion models, strong coding proficiency in Python and experience with GPU-accelerated computing, ability to engage effectively with researchers and cross-functional teams, clearly translating complex technical ideas into actionable tasks and outcomes, comfortable operating within a dynamic startup environment with high levels of ambiguity, ownership, and innovation.</p>\n<p>Nice to have includes contributions to open-source projects in the fields of computer vision, graphics, or ML, familiarity with large-scale training infrastructure, experience integrating machine learning models into production environments, led or been involved with the development or training of large-scale, state-of-the-art generative models.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_2e513a92-ec5","directApply":true,"hiringOrganization":{"@type":"Organization","name":"World Labs","sameAs":"https://worldlabs.ai","logo":"https://logos.yubhub.co/worldlabs.ai.png"},"x-apply-url":"https://job-boards.greenhouse.io/worldlabs/jobs/4089324009?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$250,000 - $325,000 base salary (good-faith estimate for San Francisco Bay Area upon hire; actual offer based on experience, skills, and qualifications)","x-skills-required":["generative modeling","diffusion models","PyTorch","TensorFlow","machine learning frameworks","large-scale model training","research in 3D computer vision","data curation","experimentation","evaluation","deployment","GPU-accelerated computing","Python"],"x-skills-preferred":["open-source contributions","large-scale training infrastructure","integrating machine learning models into production environments","leading or being involved with the development or training of large-scale, state-of-the-art generative models"],"datePosted":"2026-04-17T13:09:56.134Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"generative modeling, diffusion models, PyTorch, TensorFlow, machine learning frameworks, large-scale model training, research in 3D computer vision, data curation, experimentation, evaluation, deployment, GPU-accelerated computing, Python, open-source contributions, large-scale training infrastructure, integrating machine learning models into production environments, leading or being involved with the development or training of large-scale, state-of-the-art generative models","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":250000,"maxValue":325000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_e5396d5d-402"},"title":"Research Scientist, AQUA","description":"<p>Job Title: Research Scientist, AQUA</p>\n<p>We are seeking a highly motivated and innovative Research Scientist to join our team in Bangalore, focused on advancing the state-of-the-art in autonomous agents through reinforcement learning and ML optimization methods.</p>\n<p>As a Research Scientist at Google DeepMind, you will conduct cutting-edge research on large language models (LLMs), focusing on the development of more capable, robust autonomous agents.</p>\n<p><strong>Key Responsibilities:</strong></p>\n<ul>\n<li>Design, implement and evaluate models, agents and software prototypes of large foundational models.</li>\n<li>Push the boundary of state of the art RL methods and machine learning optimization methods to build autonomous agents.</li>\n<li>Report and present research findings and developments including status and results clearly and efficiently both internally and externally, verbally and in writing.</li>\n<li>Suggest and engage in team collaborations to meet ambitious research goals.</li>\n<li>Work with external collaborators and maintain relationships with relevant research labs and key individuals as appropriate.</li>\n<li>Work in collaboration with our Responsible AI teams to ensure our advances in intelligence are developed ethically and provide broad benefits to humanity.</li>\n</ul>\n<p><strong>About You:</strong></p>\n<ul>\n<li>You are a passionate and talented researcher with a strong foundation and a proven ability to conduct impactful research in AI.</li>\n<li>You have a collaborative mindset and are excited to work as part of a team to tackle ambitious research challenges.</li>\n<li>You are passionate about seeing your research translated into real-world products that improve the lives of users and are eager to work in an environment where research has a direct path to product impact.</li>\n</ul>\n<p><strong>Requirements:</strong></p>\n<ul>\n<li>PhD in Computer Science, Artificial Intelligence, or a related field.</li>\n<li>Strong publication record in top-tier machine learning conferences or journals.</li>\n<li>Solid understanding of deep learning, natural language processing, computer vision, and/or speech processing.</li>\n<li>Experience with relevant ML frameworks such as JAX, TensorFlow, or PyTorch.</li>\n</ul>\n<p><strong>Preferred Qualifications:</strong></p>\n<ul>\n<li>Experience with multimodal learning, large language models, and/or assistive AI agents.</li>\n<li>Experience with prompt engineering, few-shot learning, post-training techniques, and evaluations.</li>\n<li>Familiarity with large-scale model training and deployment.</li>\n<li>Strong programming skills in Python or similar languages.</li>\n<li>Excellent communication and collaboration skills.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_e5396d5d-402","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Google DeepMind","sameAs":"https://deepmind.com/","logo":"https://logos.yubhub.co/deepmind.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/deepmind/jobs/7640947?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Deep Learning","Natural Language Processing","Computer Vision","Speech Processing","JAX","TensorFlow","PyTorch"],"x-skills-preferred":["Multimodal Learning","Large Language Models","Assistive AI Agents","Prompt Engineering","Few-Shot Learning","Post-Training Techniques","Evaluations","Large-Scale Model Training","Deployment"],"datePosted":"2026-03-31T18:26:50.617Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bangalore, India"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Deep Learning, Natural Language Processing, Computer Vision, Speech Processing, JAX, TensorFlow, PyTorch, Multimodal Learning, Large Language Models, Assistive AI Agents, Prompt Engineering, Few-Shot Learning, Post-Training Techniques, Evaluations, Large-Scale Model Training, Deployment"}]}