{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/trustworthy-ai"},"x-facet":{"type":"skill","slug":"trustworthy-ai","display":"Trustworthy Ai","count":3},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_769c0070-5b2"},"title":"Research Scientist, Agent Robustness","description":"<p>As a Research Scientist working on Agent Robustness, you will work on the fundamental challenges of building AI agents that are safe and aligned with humans.</p>\n<p>For example, you might:</p>\n<ul>\n<li>Research the science of AI agent capabilities with a focus on how they relate to safety, risk factors, and methodologies for benchmarking them;</li>\n<li>Design and build harnesses to test AI agents&#39; tendency to take harmful actions when pressured to do so by users or tricked into doing so by elements of their environment;</li>\n<li>Design and build exploits and mitigations for new and unique failure modes that arise as AI agents gain affordances like coding, web browsing, and computer use;</li>\n<li>Characterize and design mitigations for potential failure modes or broader risks of systems involving multiple interacting AI agents.</li>\n</ul>\n<p>Ideally you&#39;d have:</p>\n<ul>\n<li>Commitment to our mission of promoting safe, secure, and trustworthy AI deployments in the industry as frontier AI capabilities continue to advance;</li>\n<li>Practical experience conducting technical research collaboratively;</li>\n<li>Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches;</li>\n<li>A track record of published research in machine learning, particularly in generative AI;</li>\n<li>At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development;</li>\n<li>Strong written and verbal communication skills to operate in a cross-functional team.</li>\n</ul>\n<p>Nice to have:</p>\n<ul>\n<li>Hands-on experience with agent evaluation frameworks such as SWE-bench, WebArena, OSWorld, Inspect, or similar tools;</li>\n<li>Experience with red-teaming, prompt injection, or adversarial testing of AI systems.</li>\n</ul>\n<p>Our research interviews are crafted to assess candidates&#39; skills in practical ML prototyping and debugging, their grasp of research concepts, and their alignment with our organisational culture. We will not ask any LeetCode-style questions. If you&#39;re excited about advancing AI safety and contributing to our mission, we encourage you to apply, even if your experience doesn&#39;t perfectly align with every requirement.</p>\n<p>Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You&#39;ll also receive benefits including, but not limited to: Comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_769c0070-5b2","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Scale","sameAs":"https://scale.com/","logo":"https://logos.yubhub.co/scale.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/scaleai/jobs/4675684005","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$216,000-$270,000 USD","x-skills-required":["Commitment to our mission of promoting safe, secure, and trustworthy AI deployments in the industry as frontier AI capabilities continue to advance","Practical experience conducting technical research collaboratively","Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches","A track record of published research in machine learning, particularly in generative AI","At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development"],"x-skills-preferred":["Hands-on experience with agent evaluation frameworks such as SWE-bench, WebArena, OSWorld, Inspect, or similar tools","Experience with red-teaming, prompt injection, or adversarial testing of AI systems"],"datePosted":"2026-04-18T15:57:29.447Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA; New York, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Commitment to our mission of promoting safe, secure, and trustworthy AI deployments in the industry as frontier AI capabilities continue to advance, Practical experience conducting technical research collaboratively, Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches, A track record of published research in machine learning, particularly in generative AI, At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development, Hands-on experience with agent evaluation frameworks such as SWE-bench, WebArena, OSWorld, Inspect, or similar tools, Experience with red-teaming, prompt injection, or adversarial testing of AI systems","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":216000,"maxValue":270000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_8d715def-b40"},"title":"Technical Program Manager, Consumer Engineering","description":"<p><strong>About Anthropic</strong></p>\n<p>Anthropic&#39;s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</p>\n<p><strong>About the Role</strong></p>\n<p>Claude is used by millions of people every day through our web, mobile, and desktop applications. As these products scale, so does the complexity of coordinating across engineering teams and cross-functional partners. We need an experienced Technical Program Manager to drive critical initiatives across our product engineering organisation.</p>\n<p>You&#39;ll be instrumental in coordinating high-complexity cross-team efforts, driving launch readiness, and ensuring seamless execution across platform and product teams. This role is critical as our products grow in scale and complexity.</p>\n<p><strong>Responsibilities:</strong></p>\n<ul>\n<li>Lead cross-team engineering initiatives, coordinating between platform, product, and infrastructure teams to deliver high-quality releases</li>\n<li>Drive program execution for major product features, ensuring alignment across engineering, legal, policy, and other cross-functional partners</li>\n<li>Partner with engineering leadership to improve development velocity, manage dependencies, and remove blockers</li>\n<li>Build and maintain relationships across engineering teams to understand technical requirements, constraints, and tradeoffs</li>\n<li>Create comprehensive program documentation including roadmaps, status reports, risk assessments, and communication plans</li>\n<li>Facilitate technical decision-making by bringing together stakeholders, driving consensus, and ensuring timely resolution of blocking issues</li>\n<li>Establish processes and frameworks for scaling product development and improving engineering excellence</li>\n<li>Drive launch readiness across teams, ensuring predictable and well-communicated releases</li>\n</ul>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have several years of experience in technical program management, with a track record of successfully delivering complex, cross-functional programs</li>\n<li>Have experience with web, mobile, or client application development and the ability to engage meaningfully with frontend and backend engineers</li>\n<li>Have experience coordinating across multiple engineering teams and cross-functional partners (legal, policy, etc.) simultaneously</li>\n<li>Are highly organised and can manage multiple parallel workstreams effectively across distributed teams</li>\n<li>Thrive in unstructured environments with a knack for bringing order to chaos and creating clarity in ambiguous situations</li>\n<li>Have excellent written and verbal communication skills, with the ability to influence without authority</li>\n<li>Have a track record of building trust with technical teams and driving change through influence</li>\n<li>Are passionate about Anthropic&#39;s mission and interested in the challenges of bringing AI capabilities to users safely and reliably</li>\n<li>Have experience with high-growth products serving large user bases (preferred but not required)</li>\n</ul>\n<p><strong>Deadline to apply:</strong></p>\n<p>None, applications will be received on a rolling basis.</p>\n<p><strong>Logistics</strong></p>\n<ul>\n<li>Education requirements: We require at least a Bachelor&#39;s degree in a related field or equivalent experience.</li>\n<li>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</li>\n<li>Visa sponsorship: We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</li>\n</ul>\n<p><strong>We encourage you to apply even if you do not believe you meet every single qualification.</strong></p>\n<p>Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you&#39;re interested in this work.</p>\n<p><strong>Your safety matters to us.</strong></p>\n<p>To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you&#39;re ever unsure about a communication, don&#39;t click any links—visit anthropic.com/careers directly for confirmed position openings.</p>\n<p><strong>How we&#39;re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We&#39;re an extremely collaborative group, and we</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_8d715def-b40","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://job-boards.greenhouse.io","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5062968008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$290,000 - $365,000 USD","x-skills-required":["Technical Program Management","Web Development","Mobile Development","Client Application Development","Frontend Engineering","Backend Engineering","Cross-functional Team Management","Program Execution","Launch Readiness","Engineering Excellence"],"x-skills-preferred":["High-growth Products","Large User Bases","AI Research","Steerable AI","Trustworthy AI","Empirical Science","Collaborative Teamwork"],"datePosted":"2026-03-08T13:51:05.142Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY | Seattle, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Technical Program Management, Web Development, Mobile Development, Client Application Development, Frontend Engineering, Backend Engineering, Cross-functional Team Management, Program Execution, Launch Readiness, Engineering Excellence, High-growth Products, Large User Bases, AI Research, Steerable AI, Trustworthy AI, Empirical Science, Collaborative Teamwork","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":290000,"maxValue":365000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_28cb565e-69a"},"title":"Researcher, Health AI","description":"<p><strong>Researcher, Health AI</strong></p>\n<p><strong>Location</strong></p>\n<p>San Francisco</p>\n<p><strong>Employment Type</strong></p>\n<p>Full time</p>\n<p><strong>Department</strong></p>\n<p>Safety Systems</p>\n<p><strong>Compensation</strong></p>\n<ul>\n<li>$295K – $445K • Offers Equity</li>\n</ul>\n<p>The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.</p>\n<ul>\n<li>Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts</li>\n</ul>\n<ul>\n<li>Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)</li>\n</ul>\n<ul>\n<li>401(k) retirement plan with employer match</li>\n</ul>\n<ul>\n<li>Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)</li>\n</ul>\n<ul>\n<li>Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees</li>\n</ul>\n<ul>\n<li>13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)</li>\n</ul>\n<ul>\n<li>Mental health and wellness support</li>\n</ul>\n<ul>\n<li>Employer-paid basic life and disability coverage</li>\n</ul>\n<ul>\n<li>Annual learning and development stipend to fuel your professional growth</li>\n</ul>\n<ul>\n<li>Daily meals in our offices, and meal delivery credits as eligible</li>\n</ul>\n<ul>\n<li>Relocation support for eligible employees</li>\n</ul>\n<ul>\n<li>Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.</li>\n</ul>\n<p>More details about our benefits are available to candidates during the hiring process.</p>\n<p>This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.</p>\n<p><strong>About the Team</strong></p>\n<p>The Safety Systems team is dedicated to ensuring the safety, robustness, and reliability of AI models towards their deployment in the real world.</p>\n<p>OpenAI’s charter calls on us to ensure the benefits of AI are distributed widely. Our Health AI team is focused on enabling universal access to high-quality medical information. We work at the intersection of AI safety research and healthcare applications, aiming to create trustworthy AI models that can assist medical professionals and improve patient outcomes.</p>\n<p><strong>About the Role</strong></p>\n<p>We’re seeking strong researchers who are passionate about advancing AI safety and improving global health outcomes. As a Research Scientist, you will contribute to the development of safe and effective AI models for healthcare applications. You will implement practical and general methods to improve the behavior, knowledge, and reasoning of our models in these settings. This will require research into safety and alignment techniques that we aim to generalize towards safe and beneficial AGI.</p>\n<p>This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.</p>\n<p><strong>In this role, you will:</strong></p>\n<ul>\n<li>Design and apply practical and scalable methods to improve safety and reliability of our models, including RLHF, automated red teaming, scalable oversight, etc.</li>\n</ul>\n<ul>\n<li>Evaluate methods using health-related data, ensuring models provide accurate, reliable, and trustworthy information.</li>\n</ul>\n<ul>\n<li>Build reusable libraries for applying general alignment techniques to our models.</li>\n</ul>\n<ul>\n<li>Proactively understand the safety of our models and systems, identifying areas of risk.</li>\n</ul>\n<ul>\n<li>Work with cross-team stakeholders to integrate methods in core model training and launch safety improvements in OpenAI’s products.</li>\n</ul>\n<p><strong>You might thrive in this role if you:</strong></p>\n<ul>\n<li>Are excited about OpenAI’s mission of ensuring AGI is universally beneficial and are aligned with OpenAI’s charter.</li>\n</ul>\n<ul>\n<li>Demonstrate passion for AI safety and improving global health outcomes.</li>\n</ul>\n<ul>\n<li>Have 4+ years of experience with deep learning research and LLMs, especially practical alignment topics such as RLHF, automated red teaming, scalable oversight, etc.</li>\n</ul>\n<ul>\n<li>Hold a Ph.D. or other degree in computer science, AI, machine learning, or a related field.</li>\n</ul>\n<ul>\n<li>Stay goal-oriented instead of method-oriented, and are not afraid of unglamorous but high-value work when needed.</li>\n</ul>\n<ul>\n<li>Possess experience making practical model improvements for AI model deployment.</li>\n</ul>\n<ul>\n<li>Own problems end-to-end, and are willing to pick up whatever knowledge you&#39;re missing to get the job done.</li>\n</ul>\n<ul>\n<li>Are a team player who enjoys collaborative work environments.</li>\n</ul>\n<ul>\n<li>Bonus: possess experience in health-related AI research or deployments.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_28cb565e-69a","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/bcbe08e3-9593-431d-bc99-37e35e035742","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$295K – $445K • Offers Equity","x-skills-required":["Deep learning research","LLMs","RLHF","Automated red teaming","Scalable oversight","Health-related data","AI safety research","Healthcare applications","Trustworthy AI models","Medical professionals","Patient outcomes","Ph.D. or other degree in computer science, AI, machine learning, or a related field"],"x-skills-preferred":["Team player","Collaborative work environments","Health-related AI research or deployments"],"datePosted":"2026-03-06T18:40:30.820Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Deep learning research, LLMs, RLHF, Automated red teaming, Scalable oversight, Health-related data, AI safety research, Healthcare applications, Trustworthy AI models, Medical professionals, Patient outcomes, Ph.D. or other degree in computer science, AI, machine learning, or a related field, Team player, Collaborative work environments, Health-related AI research or deployments","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":295000,"maxValue":445000,"unitText":"YEAR"}}}]}