{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/adversarial-examples"},"x-facet":{"type":"skill","slug":"adversarial-examples","display":"Adversarial Examples","count":2},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_1f33d5a1-6ed"},"title":"Model Behavior Tutor - Epistemic Rigor & Truthfulness","description":"<p>You will ensure Grok reasons carefully, resists motivated reasoning, and communicates uncertainty and evidence proportionately.</p>\n<p>Responsibilities: Assess model outputs for factual accuracy, logical coherence, fallacious reasoning, and hidden assumptions. Identify subtle ideological capture, statistical fallacies, and rhetorical sleights of hand. Write exemplary reasoning that models intellectual honesty, source evaluation, nuanced weighing of primary and secondary sources, and scoping of confidence. Construct adversarial examples and red-team prompts to expose remaining epistemic weaknesses. Contribute to the definition and scaling of constitutional principles for truth-seeking behavior.</p>\n<p>Basic Qualifications: Published analytical work and academic training in a high-rigor field. Strong Forecasting track record (e.g., Metaculus, Good Judgment), rigorous analysis, or public updating on errors. Deep knowledge in at least three of: philosophy of science, cognitive psychology, statistics, logic, linguistics, history, economics, or related disciplines. Ability to steel-man opposing views and separate settled knowledge from speculation. Habitual reliance on primary sources and base rates.</p>\n<p>Preferred Skills and Experience: Experience in intelligence analysis, investigative journalism, or academic peer review.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_1f33d5a1-6ed","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5017518007","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time|part-time|contract","x-salary-range":"$40/hour - $70/hour","x-skills-required":["factual accuracy","logical coherence","fallacious reasoning","ideological capture","statistical fallacies","rhetorical sleights of hand","intellectual honesty","source evaluation","nuanced weighing of primary and secondary sources","scoping of confidence","adversarial examples","red-team prompts","epistemic weaknesses","definition and scaling of constitutional principles for truth-seeking behavior"],"x-skills-preferred":["intelligence analysis","investigative journalism","academic peer review"],"datePosted":"2026-04-18T15:48:46.461Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"factual accuracy, logical coherence, fallacious reasoning, ideological capture, statistical fallacies, rhetorical sleights of hand, intellectual honesty, source evaluation, nuanced weighing of primary and secondary sources, scoping of confidence, adversarial examples, red-team prompts, epistemic weaknesses, definition and scaling of constitutional principles for truth-seeking behavior, intelligence analysis, investigative journalism, academic peer review"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f73f108d-30a"},"title":"Senior Security Engineer, Agentic Red Team","description":"<p>Job Title: Senior Security Engineer, Agentic Red Team</p>\n<p>We&#39;re a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence.</p>\n<p><strong>About Us</strong> The Agentic Red Team is a specialized, high-velocity unit within Google DeepMind Security. Our mission is to close the &#39;Agentic Launch Gap&#39;,the critical window where novel AI capabilities outpace traditional security reviews.</p>\n<p><strong>The Role</strong> As a Senior Security Engineer on the Agentic Red Team, you will be the primary technical executor of our adversarial engagements. You will work &#39;in the room&#39; with product builders, identifying architectural flaws during the design phase long before formal reviews begin.</p>\n<p><strong>Key Responsibilities:</strong></p>\n<ul>\n<li>Execute Agile Red Teaming: Conduct rapid, high-impact security assessments on agentic services, focusing on vulnerabilities unique to GenAI such as prompt injection, tool-use escalation, and autonomous lateral movement.</li>\n<li>Develop Advanced Exploits: Engineer and execute complex attack sequences that exploit non-deterministic model behaviors, agentic logic errors, and data poisoning vectors.</li>\n<li>Build Automated Defenses: Write code to transform manual vulnerability discoveries into automated regression testing frameworks (&#39;Auto Red Teaming&#39;) that prevent regression in future model versions.</li>\n<li>Embed with Product Teams: Partner directly with developers during the design and build phases to provide immediate feedback, effectively shortening the feedback loop between offensive findings and defensive engineering.</li>\n<li>Curate Threat Intelligence: Maintain and expand a library of agent-specific attack patterns and exploit primitives to establish robust release criteria for new models.</li>\n</ul>\n<p><strong>About You</strong> In order to set you up for success as a Software Engineer at Google DeepMind, we look for the following skills and experience:</p>\n<ul>\n<li>Bachelor&#39;s degree in Computer Science, Information Security, or equivalent practical experience.</li>\n<li>Experience in Red Teaming, Offensive Security, or Adversarial Machine Learning.</li>\n<li>Strong coding skills in Python, Go, or C++ with experience building security tools or automation.</li>\n<li>Technical understanding of LLM architectures, agentic workflows (e.g., chain-of-thought reasoning), and common AI vulnerability classes.</li>\n</ul>\n<p><strong>Preferred Qualifications</strong></p>\n<ul>\n<li>Hands-on experience developing exploits for GenAI models (e.g., prompt injection, adversarial examples, training data extraction).</li>\n<li>Experience working in a consulting capacity with product teams or in a fast-paced &#39;startup-like&#39; environment.</li>\n<li>Familiarity with AI safety benchmarks, evaluation frameworks, and fuzzing techniques.</li>\n<li>Ability to translate complex probabilistic risks into actionable engineering fixes for developers.</li>\n</ul>\n<p><strong>Salary &amp; Benefits</strong> The US base salary range for this full-time position is between $166,000 - $244,000 + bonus + equity + benefits.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f73f108d-30a","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Google DeepMind","sameAs":"https://deepmind.com/","logo":"https://logos.yubhub.co/deepmind.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/deepmind/jobs/7596438","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$166,000 - $244,000 + bonus + equity + benefits","x-skills-required":["Python","Go","C++","Red Teaming","Offensive Security","Adversarial Machine Learning","LLM architectures","agentic workflows","chain-of-thought reasoning","AI vulnerability classes"],"x-skills-preferred":["prompt injection","adversarial examples","training data extraction","AI safety benchmarks","evaluation frameworks","fuzzing techniques"],"datePosted":"2026-03-16T14:39:43.939Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Mountain View, California, US; New York City, New York, US; Zurich, Switzerland"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Go, C++, Red Teaming, Offensive Security, Adversarial Machine Learning, LLM architectures, agentic workflows, chain-of-thought reasoning, AI vulnerability classes, prompt injection, adversarial examples, training data extraction, AI safety benchmarks, evaluation frameworks, fuzzing techniques","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":166000,"maxValue":244000,"unitText":"YEAR"}}}]}