{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/labeling"},"x-facet":{"type":"skill","slug":"labeling","display":"Labeling","count":16},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_38ba8f1b-783"},"title":"Occupational Math Tutor","description":"<p>As an AI Tutor – Occupational Math Specialist, you&#39;ll help advance xAI&#39;s mission by enhancing our AI technologies through high-quality inputs, labels, and annotations using specialized software. You&#39;ll focus on math as it is used in real-world occupational settings, such as finance, accounting, insurance, operations, logistics, skilled trades, and healthcare analytics.</p>\n<p>Responsibilities: Use proprietary tools to label and evaluate data. Support and ensure the delivery of high-quality curated data. Work with engineers to refine tasks, tools, and workflows. Design, select, and refine tasks grounded in real-world occupational math, for example: Insurance and risk: premiums, deductibles, expected loss calculations, simple risk modeling. Logistics and operations: inventory and reorder policies, capacity/throughput, basic cost optimization. Skilled trades: surveying and land measurement, electrical load calculations, machining tolerances, material estimation, blueprint and scale calculations. Finance and banking: loan amortization schedules, time value of money, portfolio and risk metrics. Accounting and tax: financial statements, reconciliations, depreciation, multi-bracket tax calculations, payroll. Health and social data: rates, ratios, simple biostatistics, survey-based metrics and policy-relevant indicators. Provide detailed, step-by-step solutions and evaluate model responses for correctness, adherence to domain rules (e.g., tax codes, building codes, basic regulatory or business constraints), clarity, and plausibility. Interpret, analyze, and execute tasks based on given instructions.</p>\n<p>Basic Qualifications: A Master&#39;s or PhD in a quantitative field such as Mathematics, Statistics, Operations Research, Economics, Finance, Actuarial Science, or a closely related discipline with strong training in applied/occupational math; or A Bachelor&#39;s degree in one of the fields above plus substantial professional experience (e.g., 2+ years) in a math-heavy occupational domain such as finance, banking, insurance, accounting, logistics/supply chain, healthcare analytics, or policy analysis Professional licensure or certification in a skilled trade (e.g., licensed surveyor, master electrician, journeyman machinist) with demonstrated expertise in trade-specific mathematical calculations. Strong proficiency in applied mathematics relevant to at least one of the above domains (e.g., probability and statistics, financial math, optimization under constraints, geometric and measurement calculations, or similar). Proficiency in reading and writing, both in informal and professional English. Strong ability to navigate various information resources and databases. Outstanding communication, interpersonal, analytical, and organizational capabilities. Solid reading comprehension skills combined with the capacity to exercise autonomous judgment even when presented with limited data/material. A strong passion for and commitment to technological advancements and innovation.</p>\n<p>Preferred Skills and Experience: Professional certifications in relevant fields (e.g., CFA, FRM, SOA exams, CPA, Six Sigma, PE, licensed surveyor, or similar) or demonstrably equivalent experience. Prior professional experience in one or more domains such as asset management, retail or commercial banking, insurance pricing/reserving, accounting/audit, logistics/supply chain planning, healthcare analytics, public policy analysis, or skilled trades. Previous AI Tutoring experience and/or experience teaching or training others in applied or occupational math topics. Experience building and reviewing complex spreadsheets, financial or risk models, dashboards, technical drawings, or similar artifacts.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_38ba8f1b-783","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/4997616007","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time|part-time|contract","x-salary-range":"$45/hour - $75/hour","x-skills-required":["applied mathematics","probability and statistics","financial math","optimization under constraints","geometric and measurement calculations","specialized software","proprietary tools","data labeling","data evaluation","curated data","task refinement","workflow refinement","domain expertise","tax codes","building codes","regulatory compliance","professional certifications","prior professional experience","AI Tutoring experience","experience building and reviewing complex spreadsheets","financial or risk models","dashboards","technical drawings"],"x-skills-preferred":["CFA","FRM","SOA exams","CPA","Six Sigma","PE","licensed surveyor","asset management","retail or commercial banking","insurance pricing/reserving","accounting/audit","logistics/supply chain planning","healthcare analytics","public policy analysis","skilled trades"],"datePosted":"2026-04-18T15:55:17.270Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"applied mathematics, probability and statistics, financial math, optimization under constraints, geometric and measurement calculations, specialized software, proprietary tools, data labeling, data evaluation, curated data, task refinement, workflow refinement, domain expertise, tax codes, building codes, regulatory compliance, professional certifications, prior professional experience, AI Tutoring experience, experience building and reviewing complex spreadsheets, financial or risk models, dashboards, technical drawings, CFA, FRM, SOA exams, CPA, Six Sigma, PE, licensed surveyor, asset management, retail or commercial banking, insurance pricing/reserving, accounting/audit, logistics/supply chain planning, healthcare analytics, public policy analysis, skilled trades"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_b2637f59-e14"},"title":"Full-Stack Software Engineer, Reinforcement Learning","description":"<p>As a Full-Stack Software Engineer in RL, you&#39;ll build the platforms, tools, and interfaces that power environment creation, data collection, and training observability. The quality of Claude&#39;s next generation depends on the quality of the data we train it on , and the systems you build are what make that data possible. You&#39;ll own product surfaces end-to-end , from backend services and APIs to the web UIs that researchers, external vendors, and thousands of data labelers use every day.\\n\\nYou don&#39;t need a background in ML research. What matters is that you can take an ambiguous, high-stakes problem and ship a polished, reliable product against it, fast. This team moves very quickly. Claude writes a lot of the code we commit, which means the bottleneck isn&#39;t typing , it&#39;s judgment, taste, and the ability to react to what researchers need next.\\n\\nYou&#39;ll iterate on data collection strategies to distill the knowledge of thousands of human experts around the world into our models, and you&#39;ll do it in a loop that closes in hours and days, not quarters or months.\\n\\nAnthropic&#39;s Reinforcement Learning organization leads the research and development that trains Claude to be capable, reliable, and safe. We&#39;ve contributed to every Claude model, with significant impact on the autonomy and coding capabilities of our most advanced models.\\n\\nOur work spans teaching models to use computers effectively, advancing code generation through RL, pioneering fundamental RL research for large language models, and building the scalable training methodologies behind our frontier production models.\\n\\nThe RL org is organized around four goals: solving the science of long-horizon tasks and continual learning, scaling RL data and environments to be comprehensive and diverse, automating software engineering end-to-end, and training the frontier production model.\\n\\nOur engineering teams build the environments, evaluation systems, data pipelines, and tooling that make all of this possible , from realistic agentic training environments and scalable code data generation to human data collection platforms and production training operations.\\n\\n### Responsibilities\\n\\n<em>   Build and extend web platforms for RL environment creation, management, and quality review , including environment configuration, versioning, and validation workflows\\n</em>   Develop vendor-facing interfaces and tooling that let external partners create, submit, and iterate on training environments with minimal friction\\n<em>   Design and implement platforms for human data collection at scale, including labeling workflows, quality assurance systems, and feedback mechanisms that surface reward signal integrity issues early\\n</em>   Build evaluation dashboards and observability UIs that give researchers real-time insight into environment quality, training run health, and reward hacking\\n<em>   Create backend services and APIs that connect environment authoring tools, data collection systems, and RL training infrastructure\\n</em>   Build and expand scalable code data generation pipelines, producing diverse programming tasks with robust reward signals across languages and difficulty levels\\n<em>   Develop onboarding automation and documentation tooling so new vendors and internal users ramp up in hours, not weeks\\n</em>   Partner closely with RL researchers, data operations, and vendor management to translate ambiguous requirements into well-scoped, well-designed products\\n\\n### Requirements\\n\\n<em>   Strong software engineering fundamentals and real full-stack range , you&#39;re comfortable owning a surface from database schema to frontend\\n</em>   Proficient in Python and a modern web stack (React, TypeScript, or similar)\\n<em>   Track record of shipping systems that solved a hard problem, not just shipped on time , e.g. you built the thing that made your team 10x faster, or the internal tool nobody thought was possible\\n</em>   Operate with high agency: you identify what needs to be done and drive it forward without waiting for a ticket\\n<em>   Found yourself wondering &quot;why isn&#39;t this moving faster?&quot; in previous roles , and then have done something about it\\n</em>   Care about UX and can build interfaces that are intuitive for both technical researchers and non-technical labelers\\n<em>   Communicate clearly with researchers, operations teams, and engineers, and can turn vague asks into well-scoped work\\n</em>   Thrive in a fast-moving environment where priorities shift, Claude is your pair programmer, and the next problem is often one nobody has solved before\\n<em>   Care about Anthropic&#39;s mission to build safe, beneficial AI and want your work to contribute directly to it\\n\\n### Nice to Have\\n\\n</em>   Built data collection, labeling, or annotation platforms , ideally ones that had to scale across many vendors or many task types\\n<em>   Background building multi-tenant platforms with role-based access, audit trails, and vendor management workflows\\n</em>   Experience with cloud infrastructure (GCP or AWS), Docker, and CI/CD pipelines\\n<em>   Familiarity with LLM training, fine-tuning, or evaluation workflows\\n</em>   Experience with async Python (Trio, asyncio) or high-throughput API design\\n<em>   Background in dashboards, monitoring, or observability tooling\\n</em>   Experience working directly with external vendors or partners on technical integrations\\n<em>   A background that isn&#39;t a straight line , e.g. math or physics into SWE, competitive programming, research into engineering, or a side project that outgrew its scope\\n\\n### Representative Projects\\n\\n</em>   Building a unified platform for human data collection that integrates labeling workflows, vendor management, and QA for complex agentic tasks\\n<em>   Developing vendor onboarding automation that handles Docker registry access, API token management, and environment validation\\n</em>   Creating evaluation and observability dashboards that catch reward hacks, measure environment difficulty, and give real-time feedback during production training\\n<em>   Building environment quality review workflows that let researchers browse, grade, and provide feedback on training environments\\n</em>   Developing automated environment quality pipelines that validate correctness and difficulty calibration before environments hit production training\\n*   Building internal tools for browsing and analyzing training run results, environment statistics, and data collection progress</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_b2637f59-e14","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5186067008","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$300,000-$405,000 USD","x-skills-required":["Python","Modern web stack","React","TypeScript","Strong software engineering fundamentals","Full-stack range","Database schema","Frontend","Cloud infrastructure","Docker","CI/CD pipelines","LLM training","Fine-tuning","Evaluation workflows","Async Python","High-throughput API design","Dashboards","Monitoring","Observability tooling"],"x-skills-preferred":["Data collection","Labeling","Annotation platforms","Multi-tenant platforms","Role-based access","Audit trails","Vendor management workflows"],"datePosted":"2026-04-18T15:54:27.784Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Modern web stack, React, TypeScript, Strong software engineering fundamentals, Full-stack range, Database schema, Frontend, Cloud infrastructure, Docker, CI/CD pipelines, LLM training, Fine-tuning, Evaluation workflows, Async Python, High-throughput API design, Dashboards, Monitoring, Observability tooling, Data collection, Labeling, Annotation platforms, Multi-tenant platforms, Role-based access, Audit trails, Vendor management workflows","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":300000,"maxValue":405000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_5f12e37f-373"},"title":"Sr. Staff Data & Applied Scientist, GenAI & Labeling Platforms","description":"<p>We&#39;re seeking an experienced Data Scientist to drive step function improvements in our data labeling capabilities at Pinterest. As a Sr. Staff Data &amp; Applied Scientist, GenAI &amp; Labeling Platforms, you will lead high-impact technical work in the space of Generative AI-powered labeling and evaluation systems.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Driving high-impact scientific work across GenAI-powered labeling and evaluation systems</li>\n<li>Identifying opportunities where LLMs and related methods can improve quality, speed, coverage, and cost efficiency</li>\n<li>Developing prototypes that demonstrate value in areas such as prompt optimization, task decomposition, quality estimation, routing, and human-in-the-loop workflows</li>\n<li>Designing rigorous measurement frameworks to evaluate model performance, workflow outcomes, and operational tradeoffs</li>\n<li>Partnering with engineering, product, and data science teams to shape roadmaps and productionize successful approaches</li>\n<li>Establishing strong standards for trustworthiness, including bias measurement, calibration, quality control, and responsible oversight</li>\n<li>Creating reusable methods and frameworks that can scale across teams and use cases</li>\n</ul>\n<p>Requirements include:</p>\n<ul>\n<li>10+ years of combined post-graduate academic and industry experience applying scientific methods to solve real-world problems on large-scale data</li>\n<li>Deep hands-on experience as an individual contributor solving technically complex, high-impact data science or ML problems</li>\n<li>Strong experience applying LLMs or other generative AI techniques to practical workflows, systems, or products</li>\n<li>Demonstrated ability to turn ambiguous problems into rigorous analyses, experiments, prototypes, and scalable solutions</li>\n<li>Proven track record of writing high-quality code and using technical work to influence product or platform direction</li>\n<li>Strong cross-functional collaboration skills and a track record of influencing leaders and peers through data and technical judgment</li>\n<li>Strong business and product sense with the ability to define meaningful success metrics and prioritize high-value opportunities</li>\n<li>Self-directed learning mindset and comfort working in a rapidly evolving technical landscape</li>\n<li>Experience with labeling systems, evaluation frameworks, human judgment workflows, or internal AI tooling is strongly preferred</li>\n</ul>\n<p>In-office requirement statement: This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country.</p>\n<p>Relocation statement: This position is not eligible for relocation assistance.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_5f12e37f-373","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Pinterest","sameAs":"https://www.pinterest.com/","logo":"https://logos.yubhub.co/pinterest.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/pinterest/jobs/7775686","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$195,738-$402,990 USD","x-skills-required":["Data Science","Machine Learning","Generative AI","Labeling Systems","Evaluation Frameworks","Human Judgment Workflows","Internal AI Tooling"],"x-skills-preferred":["LLMs","Prompt Optimization","Task Decomposition","Quality Estimation","Routing","Human-in-the-Loop Workflows"],"datePosted":"2026-04-18T15:51:34.310Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Palo Alto, CA, US; Remote, US"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Data Science, Machine Learning, Generative AI, Labeling Systems, Evaluation Frameworks, Human Judgment Workflows, Internal AI Tooling, LLMs, Prompt Optimization, Task Decomposition, Quality Estimation, Routing, Human-in-the-Loop Workflows","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":195738,"maxValue":402990,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_40a343d1-388"},"title":"Senior Data Scientist, Platform (Algorithms/Trust)","description":"<p>We&#39;re looking for a Senior Data Scientist to join our Content Integrity Data Science team. As a Senior Data Scientist, you will play a key role in building and protecting trust on the Airbnb platform by ensuring that the content on Airbnb, including listings, profiles, messages and other user-generated experiences, is accurate, authentic and aligned with our policies and community standards.</p>\n<p>This role is unique in that it directly improves the safety, trust, and quality of real-world user experiences by advancing Airbnb&#39;s ability to understand, interpret and act on content at scale. You will help shape how the platform reasons about listings, profiles, messages and other user-generated content by building the next generation of Trust Content Understanding Models.</p>\n<p>The ideal candidate is a motivated and talented &#39;full-stack&#39; Data Scientist with strong applied ML intuition and a bias toward impact, who can own and drive forward challenging, high-visibility initiatives such as:</p>\n<ul>\n<li>Advance Airbnb&#39;s content integrity capabilities by building Natural Language Processing (NLP) and LLM-based models that understand intent, policy compliance, quality and risk across listings, profiles, and user communications</li>\n</ul>\n<ul>\n<li>Develop high-performing models for detecting problematic or misleading content, including text classification, semantic similarity, information extraction and generative model-based reasoning for policy interpretation and enforcement</li>\n</ul>\n<ul>\n<li>Design and optimize human-in-the-loop Machine Learning (ML) systems for content review, labeling, escalation and continuous model improvement</li>\n</ul>\n<ul>\n<li>Build systems to detect emerging content risks and abuse patterns across regions, cohorts and surfaces using statistical, ML and representation-learning approaches</li>\n</ul>\n<ul>\n<li>Design intelligent sampling and evaluation strategies to measure rare events, policy recall, false positives/negatives and model blind spots in large-scale content systems</li>\n</ul>\n<p>A Typical Day:</p>\n<ul>\n<li>Artificial Intelligence / Machine Learning: Build and deploy production AI/ML systems for content integrity and trust content understanding, including feature engineering, model development and evaluation, thresholding, error analysis and end-to-end model lifecycle management. This includes working with NLP and LLM-based models in real production settings.</li>\n</ul>\n<ul>\n<li>Inference: Partner with inference data scientists to conduct rigorous quantitative analyses, applying working knowledge of causal inference to interpret results, assess impact, and identify gaps and opportunities to improve content quality and trust outcomes.</li>\n</ul>\n<ul>\n<li>Optimization: Develop frameworks to analyze tradeoffs between enforcement accuracy, user experience, operational cost and coverage, and propose strategies to optimize overall system effectiveness.</li>\n</ul>\n<ul>\n<li>Communication &amp; Collaboration: Deliver robust research reports with effective data visualizations, clear storytelling and bullet-proof accuracy to drive forward impact in collaboration with cross-functional partners in product, engineering and operations</li>\n</ul>\n<ul>\n<li>Empowerment: Think strategically about how to scale and evolve Airbnb&#39;s content integrity defenses, helping define the long-term vision for the role of AI-driven content understanding across the Trust ecosystem.</li>\n</ul>\n<p>Your Expertise:</p>\n<ul>\n<li>5+ years of industry experience in a quantitative analysis role with a Master’s degree in a quantitative field (computer science, statistics etc.), or 2+ years of experience with a Ph.D.</li>\n</ul>\n<ul>\n<li>State-of-the-art knowledge of AI/ML models</li>\n</ul>\n<ul>\n<li>Hands-on experience building, evaluating, and deploying NLP and LLM-based solutions, including text classification, information extraction, semantic understanding or generative applications.</li>\n</ul>\n<ul>\n<li>Working knowledge of causal inference</li>\n</ul>\n<ul>\n<li>Skilled in statistical programming (Python or R) and database usage (SQL)</li>\n</ul>\n<ul>\n<li>Proven ability to communicate clearly and effectively to audiences of varying technical levels</li>\n</ul>\n<ul>\n<li>Ability to translate complex findings and results into compelling narratives that drive impact</li>\n</ul>\n<ul>\n<li>Excellent project management, communication, and collaboration skills</li>\n</ul>\n<ul>\n<li>Trust &amp; Safety experience is a plus</li>\n</ul>\n<p>Your Location:</p>\n<p>This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. Click here for the up-to-date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list. If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from.</p>\n<p>Our Commitment To Inclusion &amp; Belonging:</p>\n<p>Airbnb is committed to working with the broadest talent pool possible. We believe diverse ideas foster innovation and engagement, and allow us to attract creatively-led people, and to develop the best products, services and solutions. All qualified individuals are encouraged to apply. We strive to also provide a disability inclusive application and interview process. If you are a candidate with a disability and require reasonable accommodation in order to submit an application, please contact us at: reasonableaccommodations@airbnb.com. Please include your full name, the role you’re applying for and the accommodation necessary to assist you with the recruiting process. We ask that you only reach out to us if you are a candidate whose disability prevents you from being able to complete our online application.</p>\n<p>How We&#39;ll Take Care of You:</p>\n<p>Our job titles may span more than one career level. The actual base pay is dependent upon many factors, such as: training, transferable skills, work experience, business needs and market demands. The base pay range is subject to change and may be modified in the future. This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.</p>\n<p>Pay Range $177,000-$208,000 USD</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_40a343d1-388","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Airbnb","sameAs":"https://www.airbnb.com/","logo":"https://logos.yubhub.co/airbnb.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/airbnb/jobs/7594971","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$177,000-$208,000 USD","x-skills-required":["Natural Language Processing (NLP)","LLM-based models","text classification","semantic similarity","information extraction","generative model-based reasoning","policy interpretation and enforcement","human-in-the-loop Machine Learning (ML)","content review","labeling","escalation","continuous model improvement","statistical","representation-learning approaches","intelligent sampling","evaluation strategies","rare events","policy recall","false positives/negatives","model blind spots","large-scale content systems","Python","R","SQL","causal inference","statistical programming","database usage"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:47:24.814Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Natural Language Processing (NLP), LLM-based models, text classification, semantic similarity, information extraction, generative model-based reasoning, policy interpretation and enforcement, human-in-the-loop Machine Learning (ML), content review, labeling, escalation, continuous model improvement, statistical, representation-learning approaches, intelligent sampling, evaluation strategies, rare events, policy recall, false positives/negatives, model blind spots, large-scale content systems, Python, R, SQL, causal inference, statistical programming, database usage","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":177000,"maxValue":208000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_bd9625d9-99b"},"title":"ML Infrastructure Engineer, Safeguards","description":"<p>We are seeking a Machine Learning Infrastructure Engineer to join our Safeguards organization, where you&#39;ll build and scale the critical infrastructure that powers our AI safety systems.</p>\n<p>As part of the Safeguards team, you&#39;ll design and implement ML infrastructure that powers Claude safety. Your work will directly contribute to making AI systems more trustworthy and aligned with human values, ensuring our models operate safely as they become more capable.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Design and build scalable ML infrastructure to support real-time and batch classifier and safety evaluations across our model ecosystem</li>\n<li>Build monitoring and observability tools to track model performance, data quality, and system health for safety-critical applications</li>\n<li>Collaborate with research teams to productionize safety research, translating experimental safety techniques into robust, scalable systems</li>\n<li>Optimize inference latency and throughput for real-time safety evaluations while maintaining high reliability standards</li>\n<li>Implement automated testing, deployment, and rollback systems for ML models in production safety applications</li>\n<li>Partner with Safeguards, Security, and Alignment teams to understand requirements and deliver infrastructure that meets safety and production needs</li>\n<li>Contribute to the development of internal tools and frameworks that accelerate safety research and deployment</li>\n</ul>\n<p>You may be a good fit if you:</p>\n<ul>\n<li>Have 5+ years of experience building production ML infrastructure, ideally in safety-critical domains like fraud detection, content moderation, or risk assessment</li>\n<li>Are proficient in Python and have experience with ML frameworks like PyTorch, TensorFlow, or JAX</li>\n<li>Have hands-on experience with cloud platforms (AWS, GCP) and container orchestration (Kubernetes)</li>\n<li>Understand distributed systems principles and have built systems that handle high-throughput, low-latency workloads</li>\n<li>Have experience with data engineering tools and building robust data pipelines (e.g., Spark, Airflow, streaming systems)</li>\n<li>Are results-oriented, with a bias towards reliability and impact in safety-critical systems</li>\n<li>Enjoy collaborating with researchers and translating cutting-edge research into production systems</li>\n<li>Care deeply about AI safety and the societal impacts of your work</li>\n</ul>\n<p>Strong candidates may have experience with:</p>\n<ul>\n<li>Working with large language models and modern transformer architectures</li>\n<li>Implementing A/B testing frameworks and experimentation infrastructure for ML systems</li>\n<li>Developing monitoring and alerting systems for ML model performance and data drift</li>\n<li>Building automated labeling systems and human-in-the-loop workflows</li>\n<li>Experience in trust &amp; safety, fraud prevention, or content moderation domains</li>\n<li>Knowledge of privacy-preserving ML techniques and compliance requirements</li>\n<li>Contributing to open-source ML infrastructure projects</li>\n</ul>\n<p>We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_bd9625d9-99b","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/4778843008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$320,000-$405,000 USD","x-skills-required":["Python","PyTorch","TensorFlow","JAX","Cloud platforms (AWS, GCP)","Container orchestration (Kubernetes)","Distributed systems principles","Data engineering tools (Spark, Airflow, streaming systems)"],"x-skills-preferred":["Large language models and modern transformer architectures","A/B testing frameworks and experimentation infrastructure for ML systems","Monitoring and alerting systems for ML model performance and data drift","Automated labeling systems and human-in-the-loop workflows","Trust & safety, fraud prevention, or content moderation domains","Privacy-preserving ML techniques and compliance requirements","Open-source ML infrastructure projects"],"datePosted":"2026-04-18T15:44:06.907Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, PyTorch, TensorFlow, JAX, Cloud platforms (AWS, GCP), Container orchestration (Kubernetes), Distributed systems principles, Data engineering tools (Spark, Airflow, streaming systems), Large language models and modern transformer architectures, A/B testing frameworks and experimentation infrastructure for ML systems, Monitoring and alerting systems for ML model performance and data drift, Automated labeling systems and human-in-the-loop workflows, Trust & safety, fraud prevention, or content moderation domains, Privacy-preserving ML techniques and compliance requirements, Open-source ML infrastructure projects","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":320000,"maxValue":405000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_76fd624c-e23"},"title":"Full-Stack Software Engineer, Reinforcement Learning","description":"<p>As a Full-Stack Software Engineer in RL, you&#39;ll build the platforms, tools, and interfaces that power environment creation, data collection, and training observability. The quality of Claude&#39;s next generation depends on the quality of the data we train it on , and the systems you build are what make that data possible. You&#39;ll own product surfaces end-to-end , from backend services and APIs to the web UIs that researchers, external vendors, and thousands of data labelers use every day. You don&#39;t need a background in ML research. What matters is that you can take an ambiguous, high-stakes problem and ship a polished, reliable product against it, fast.</p>\n<p>This team moves very quickly. Claude writes a lot of the code we commit, which means the bottleneck isn&#39;t typing , it&#39;s judgment, taste, and the ability to react to what researchers need next. You&#39;ll iterate on data collection strategies to distill the knowledge of thousands of human experts around the world into our models, and you&#39;ll do it in a loop that closes in hours and days, not quarters or months.</p>\n<p>Our work spans teaching models to use computers effectively, advancing code generation through RL, pioneering fundamental RL research for large language models, and building the scalable training methodologies behind our frontier production models. The RL org is organized around four goals: solving the science of long-horizon tasks and continual learning, scaling RL data and environments to be comprehensive and diverse, automating software engineering end-to-end, and training the frontier production model.</p>\n<p>Our engineering teams build the environments, evaluation systems, data pipelines, and tooling that make all of this possible , from realistic agentic training environments and scalable code data generation to human data collection platforms and production training operations.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Build and extend web platforms for RL environment creation, management, and quality review , including environment configuration, versioning, and validation workflows</li>\n<li>Develop vendor-facing interfaces and tooling that let external partners create, submit, and iterate on training environments with minimal friction</li>\n<li>Design and implement platforms for human data collection at scale, including labeling workflows, quality assurance systems, and feedback mechanisms that surface reward signal integrity issues early</li>\n<li>Build evaluation dashboards and observability UIs that give researchers real-time insight into environment quality, training run health, and reward hacking</li>\n<li>Create backend services and APIs that connect environment authoring tools, data collection systems, and RL training infrastructure</li>\n<li>Build and expand scalable code data generation pipelines, producing diverse programming tasks with robust reward signals across languages and difficulty levels</li>\n<li>Develop onboarding automation and documentation tooling so new vendors and internal users ramp up in hours, not weeks</li>\n<li>Partner closely with RL researchers, data operations, and vendor management to translate ambiguous requirements into well-scoped, well-designed products</li>\n</ul>\n<p>You May Be a Good Fit If You:</p>\n<ul>\n<li>Have strong software engineering fundamentals and real full-stack range , you&#39;re comfortable owning a surface from database schema to frontend</li>\n<li>Are proficient in Python and a modern web stack (React, TypeScript, or similar)</li>\n<li>Have a track record of shipping systems that solved a hard problem, not just shipped on time , e.g. you built the thing that made your team 10x faster, or the internal tool nobody thought was possible</li>\n<li>Operate with high agency: you identify what needs to be done and drive it forward without waiting for a ticket</li>\n<li>Have found yourself wondering &quot;why isn&#39;t this moving faster?&quot; in previous roles , and then have done something about it</li>\n<li>Care about UX and can build interfaces that are intuitive for both technical researchers and non-technical labelers</li>\n<li>Communicate clearly with researchers, operations teams, and engineers, and can turn vague asks into well-scoped work</li>\n<li>Thrive in a fast-moving environment where priorities shift, Claude is your pair programmer, and the next problem is often one nobody has solved before</li>\n<li>Care about Anthropic&#39;s mission to build safe, beneficial AI and want your work to contribute directly to it</li>\n</ul>\n<p>Strong Candidates May Also Have:</p>\n<ul>\n<li>Built data collection, labeling, or annotation platforms , ideally ones that had to scale across many vendors or many task types</li>\n<li>Background building multi-tenant platforms with role-based access, audit trails, and vendor management workflows</li>\n<li>Experience with cloud infrastructure (GCP or AWS), Docker, and CI/CD pipelines</li>\n<li>Familiarity with LLM training, fine-tuning, or evaluation workflows</li>\n<li>Experience with async Python (Trio, asyncio) or high-throughput API design</li>\n<li>Background in dashboards, monitoring, or observability tooling</li>\n<li>Experience working directly with external vendors or partners on technical integrations</li>\n<li>A background that isn&#39;t a straight line , e.g. math or physics into SWE, competitive programming, research into engineering, or a side project that outgrew its scope</li>\n</ul>\n<p>Representative Projects:</p>\n<ul>\n<li>Building a unified platform for human data collection that integrates labeling workflows, vendor management, and QA for complex agentic tasks</li>\n<li>Developing vendor onboarding automation that handles Docker registry access, API token management, and environment validation</li>\n<li>Creating evaluation and observability dashboards that catch reward hacks, measure environment difficulty, and give real-time feedback during production training</li>\n<li>Building environment quality review workflows that let researchers browse, grade, and provide feedback on training environments</li>\n<li>Developing automated environment quality pipelines that validate correctness and difficulty calibration before environments hit production training</li>\n<li>Building internal tools for browsing and analyzing training run results, environment statistics, and data collection progress</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_76fd624c-e23","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://www.anthropic.com/","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5186067008","x-work-arrangement":"hybrid","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$300,000-$405,000 USD","x-skills-required":["Python","Modern web stack","React","TypeScript","Cloud infrastructure","Docker","CI/CD pipelines","LLM training","Fine-tuning","Evaluation workflows","Async Python","High-throughput API design","Dashboards","Monitoring","Observability tooling"],"x-skills-preferred":["Data collection","Labeling","Annotation","Multi-tenant platforms","Role-based access","Audit trails","Vendor management workflows"],"datePosted":"2026-04-18T15:39:16.596Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA | New York City, NY"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Modern web stack, React, TypeScript, Cloud infrastructure, Docker, CI/CD pipelines, LLM training, Fine-tuning, Evaluation workflows, Async Python, High-throughput API design, Dashboards, Monitoring, Observability tooling, Data collection, Labeling, Annotation, Multi-tenant platforms, Role-based access, Audit trails, Vendor management workflows","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":300000,"maxValue":405000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_e617011a-be8"},"title":"Data Operations","description":"<p>Our Data Ops team partners with Research to answer two core questions: What data do we need to train outstanding AI audio models — and how do we source and scale it effectively? In this role, you will own the end-to-end lifecycle of data acquisition and labeling operations. You&#39;ll translate research needs into clear data specifications and labeling workflows, manage external data vendors, and ensure datasets are delivered on time and to high quality standards. You&#39;ll oversee both in-house and third-party labeling teams — setting guidelines, prioritizing work, implementing QC processes, and monitoring throughput and performance. You&#39;ll also help scale our operations by improving tooling and processes, and by supporting the hiring, training, and development of a high-performing labeling workforce.</p>\n<p>We&#39;re looking for exceptional individuals who combine technical excellence with ethical awareness, who are excited by hard problems and motivated by human impact. You&#39;ll strive with us if you:</p>\n<ul>\n<li><p>Are passionate about audio AI driven by a desire to make content universally accessible and breaking the frontiers of new tech.</p>\n</li>\n<li><p>Are a highly motivated and driven individual with a strong work ethic. Our team is aware of this critical moment of audio AI evolution and is committed to going the extra mile to lead.</p>\n</li>\n<li><p>Are analytical, efficient, and strive on solving complex challenges with a first principles mindset.</p>\n</li>\n<li><p>Consistently strive for excellence, delivering high-quality work quickly and exceeding expectations.</p>\n</li>\n<li><p>Take initiative and work autonomously from day one, prioritizing learning and contribution while leaving ego aside.</p>\n</li>\n</ul>\n<p>We don&#39;t require any formal experience, certifications, or degrees. Instead, we&#39;re looking for someone with:</p>\n<ul>\n<li><p>Strong attention to detail – data is all about the details; we expect you to craft clear project guidelines together with Research and build processes to ensure our data meets them</p>\n</li>\n<li><p>People skills – you will interact with Researchers, external service providers, and a large number of data labellers and labelling managers; you should feel comfortable providing feedback and keeping large teams moving along to meet deadlines</p>\n</li>\n<li><p>Bonus points for:</p>\n<ul>\n<li><p>Prior experience running large-scale data labelling projects for a FAANG/similar company, AI research lab, or data labelling platform</p>\n</li>\n<li><p>Python skills</p>\n</li>\n</ul>\n</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_e617011a-be8","directApply":true,"hiringOrganization":{"@type":"Organization","name":"ElevenLabs","sameAs":"https://elevenlabs.io","logo":"https://logos.yubhub.co/elevenlabs.io.png"},"x-apply-url":"https://elevenlabs.io/careers/4d12e97f-e637-49f1-a39c-7f32d6d364bb/data-operations","x-work-arrangement":"remote","x-experience-level":"entry","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["data acquisition","data labeling","project management","quality control","team management"],"x-skills-preferred":["Python","data analysis","machine learning"],"datePosted":"2026-03-10T12:19:51.081Z","jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"data acquisition, data labeling, project management, quality control, team management, Python, data analysis, machine learning"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_db67438e-963"},"title":"Director, System Software Engineering - Metropolis Accelerated and Inferencing Software","description":"<p><strong>Director, System Software Engineering - Metropolis Accelerated and Inferencing Software</strong></p>\n<p>We are looking for an engineering leader who is hands-on with deep learning—comfortable reading/modeling code, not just running it. You will lead, encourage, and develop world-class engineering and data teams distributed across Europe, Asia and the United States.</p>\n<p><strong>Key Responsibilities:</strong></p>\n<ul>\n<li>Architect and operationalize NVIDIA’s end-to-end data Inference Acceleration strategy, powering Inferencing and continuous performance improvements.</li>\n<li>Drive Strategic Implementations of TensorRT, VLLM and other accelerated frameworks for inference solutions for Edge and Enterprise devices: Lead Accelerated Computing efforts and solutions for key Metropolis verticals. Set up Proofs of Readiness (PORs) and guide their implementations.</li>\n<li>Leading customer solutions: Collaborate with major Metropolis OEMs and Partners to architect highly accelerated and optimized custom deep learning models and inference pipelines for their specific requirements. Offer direct customer support, including debugging, technical education, and handling customer inquiries for our Metropolis partner and customers. Responsible for drafting and finalizing SOWs with internal customers and partners.</li>\n<li>Performance Benchmarking: Orchestrate efforts to achieve leading performance results on industry benchmarks like MLPerf on various edge and Enterprise devices.</li>\n<li>Technical Leadership &amp; Influence: Function as a technical leader for deep learning across multiple teams, giving oversight and build support. Apply customer insights to influence the composition and structure of upcoming SOC / GPU deep learning hardware.</li>\n<li>Scaling the team: Strategically hiring to meet new demands while also mentoring and adjusting existing teams to new deep learning challenges.</li>\n<li>Representing Nvidia Deep learning solutions in webinars, conferences and partner events</li>\n</ul>\n<p><strong>Requirements:</strong></p>\n<ul>\n<li>Masters in Computer Science/Electrical Engineering or equivalent experience.</li>\n<li>A minimum of 8 years of meaningful involvement in machine learning/deep learning research or practical experience, coupled with 7+ years of leadership background and overall 15+ years of industry experience.</li>\n<li>Over 10 years of validated expertise in the embedded software sector, holding technical leadership positions accountable for delivering outstanding production software within a multifaceted setting.</li>\n<li>Deep Knowledge of GPU, CPU and dedicated deep learning architecture fundamentals and low-level performance optimizations using heterogeneous computing.</li>\n<li>Hands-on experience with VLMs, LLMs, or multimodal AI systems applied to perception, data triage, or automated labeling.</li>\n<li>Strong expertise in large-scale data processing, systems build, or machine learning pipelines.</li>\n<li>Strong communication, careful planning, and technical leadership capabilities.</li>\n</ul>\n<p><strong>Benefits:</strong></p>\n<ul>\n<li>Competitive salary package and benefits</li>\n<li>Eligible for equity</li>\n</ul>\n<p><strong>How to Apply:</strong></p>\n<p>Applications for this job will be accepted at least until March 13, 2026.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_db67438e-963","directApply":true,"hiringOrganization":{"@type":"Organization","name":"NVIDIA","sameAs":"https://nvidia.wd5.myworkdayjobs.com","logo":"https://logos.yubhub.co/nvidia.com.png"},"x-apply-url":"https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Director--Metropolis-Accelerated-and-Inferencing-Software_JR2011299","x-work-arrangement":"onsite","x-experience-level":"executive","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Machine Learning","Deep Learning","GPU","CPU","Heterogeneous Computing","TensorRT","VLLM","Proof of Readiness","Customer Support","Technical Education","Performance Benchmarking","Technical Leadership","Team Scaling","Webinars","Conferences","Partner Events"],"x-skills-preferred":["VLMs","LLMs","Multimodal AI Systems","Perception","Data Triage","Automated Labeling","Large-Scale Data Processing","Systems Build","Machine Learning Pipelines"],"datePosted":"2026-03-09T20:43:31.482Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Santa Clara"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Machine Learning, Deep Learning, GPU, CPU, Heterogeneous Computing, TensorRT, VLLM, Proof of Readiness, Customer Support, Technical Education, Performance Benchmarking, Technical Leadership, Team Scaling, Webinars, Conferences, Partner Events, VLMs, LLMs, Multimodal AI Systems, Perception, Data Triage, Automated Labeling, Large-Scale Data Processing, Systems Build, Machine Learning Pipelines"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_e5e1a9a5-3c9"},"title":"Research Associate - Fresher","description":"<p>We are hiring Research Associates for a AI project in Gurgaon. As a Research Associate, you will work on various client projects to train generative AI models by creating prompts and responses based on instructions provided. You will use established best practices for quality prompts and generate similar prompts and responses based on given examples. You will also learn new software programs on the job and provide supporting documentation when the AI fails.</p>\n<p>Key Responsibilities:</p>\n<ul>\n<li>Work on various client projects to train generative AI models by creating prompts and responses based on instructions provided and established best practices for quality prompts.</li>\n<li>Generate similar prompts and responses based on given examples.</li>\n<li>Use communication channels such as Slack, Teams, and SharePoint to learn about new projects, collaborate with your team, and ask questions.</li>\n<li>Learn new software programs on the job.</li>\n<li>Provide supporting documentation when the AI fails.</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>Experience: 0-2 years</li>\n<li>Excellent communication skills (oral and written)</li>\n<li>Qualification: Bachelor&#39;s degree (any stream)</li>\n<li>Preferred: Good understanding of AI</li>\n<li>Ability to gain new skills and knowledge through hands-on experience</li>\n<li>Keen eye for detail</li>\n<li>Demonstrated ability to work independently</li>\n<li>Strong time management skills</li>\n<li>Exemplify the quality of doing &#39;get it done&#39; attitude, including a high level of accountability, transparency, and teamwork first &amp; foremost</li>\n</ul>\n<p>Benefits:</p>\n<ul>\n<li>Medical Insurance</li>\n<li>Term Insurance</li>\n<li>Accidental Insurance</li>\n<li>Lunch/Dinner provided at subsidized rates</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_e5e1a9a5-3c9","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Keywords Studios","sameAs":"https://www.keywordsstudios.com/","logo":"https://logos.yubhub.co/keywordsstudios.com.png"},"x-apply-url":"https://apply.workable.com/j/29F19E0674","x-work-arrangement":"onsite","x-experience-level":"entry","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["AI","prompt writing","annotation","labeling","communication"],"x-skills-preferred":["AI","writing","reading comprehension"],"datePosted":"2026-03-09T10:55:31.690Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bengaluru"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"AI, prompt writing, annotation, labeling, communication, AI, writing, reading comprehension"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f6122e0e-516"},"title":"Research Associate - Fresher","description":"<p><strong>Research Associate Role</strong></p>\n<p>We are hiring Research Associates for a AI project in Gurgaon. The Research Associate role is focused on Prompt Writing, Annotation and Labeling, where you will work on improving an AI engine.</p>\n<p><strong>Key Responsibilities</strong></p>\n<ul>\n<li>Work on various client projects to train generative AI models by creating prompts and responses based on instructions provided and established best practices for quality prompts.</li>\n<li>Generate similar prompts and responses based on given examples.</li>\n<li>Use communication channels such as Slack, Teams, and SharePoint to learn about new projects, collaborate with your team, and ask questions.</li>\n<li>Learn new software programs on the job.</li>\n<li>Provide supporting documentation when the AI fails.</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>0-2 years of experience</li>\n<li>Excellent communication skills (oral and written)</li>\n<li>Bachelor&#39;s degree (any stream)</li>\n<li>Good understanding of AI (preferred)</li>\n<li>Ability to gain new skills and knowledge through hands-on experience</li>\n<li>Keen eye for detail</li>\n<li>Demonstrated ability to work independently</li>\n<li>Strong time management skills</li>\n<li>Get it done attitude, including high level of accountability, transparency, and teamwork</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Cab facility within hiring zones</li>\n<li>Medical insurance, term insurance, and accidental insurance</li>\n<li>Lunch/dinner provided at subsidized rates</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f6122e0e-516","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Keywords Studios","sameAs":"https://apply.workable.com","logo":"https://logos.yubhub.co/j.com.png"},"x-apply-url":"https://apply.workable.com/j/BC2E215524","x-work-arrangement":"onsite","x-experience-level":"entry","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["AI","Prompt writing","Annotation and labeling","Communication skills","Time management skills"],"x-skills-preferred":["Good understanding of AI"],"datePosted":"2026-03-09T10:55:25.321Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Gurgaon"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"AI, Prompt writing, Annotation and labeling, Communication skills, Time management skills, Good understanding of AI"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_b5258a48-495"},"title":"Senior Data and Applied Scientist","description":"<p>Imagine shaping the future of local search for millions of users worldwide. At Bing Places, you&#39;ll join a team that powers business entity relevance on the search results page. You&#39;ll work on cutting-edge tools and metrics that ensure users find the most accurate and meaningful local results. Our team thrives on innovation, leveraging large and small language models, and advanced measurement systems to deliver exceptional quality.</p>\n<p>As a Data Scientist in Bing Places, you will design new relevance metrics, build labeling pipelines, and fine-tune language models to improve search quality. You&#39;ll work on prompt engineering, implement modern language models techniques like Retrieval Augmented Generation, and create scalable workflows for measurement and evaluation.</p>\n<p>This opportunity will allow you to:</p>\n<ul>\n<li>Accelerate your career growth by working on state-of-the-art AI systems.</li>\n<li>Develop deep expertise in prompt engineering and model tuning.</li>\n<li>Hone your skills in building robust data pipelines and quality frameworks.</li>\n</ul>\n<p>Microsoft&#39;s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Design and implement new relevance metrics to measure and improve local search quality.</li>\n<li>Develop and optimize LLM/SLM labeling pipelines for high-throughput, consistent quality judgments.</li>\n<li>Engineer and fine-tune prompts for LLMs to enhance query understanding and classification accuracy.</li>\n<li>Apply modern LLM techniques such as retrieval-augmented generation for improved grounding and relevance.</li>\n<li>Build scalable workflows and dashboards for measurement, evaluation cycles, and quality checks.</li>\n<li>Analyze failure modes and improve prompt rubrics to reduce defect rates and enhance labeling consistency.</li>\n<li>Collaborate with cross-functional teams to integrate metrics and labeling systems into production environments.</li>\n</ul>\n<p>Qualifications:</p>\n<ul>\n<li>Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND relevant data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR Master’s Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND relevant data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR Bachelor’s Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND relevant data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR equivalent experience.</li>\n<li>Relevant years customer-facing, project-delivery experience, professional services, and/or consulting experience.</li>\n</ul>\n<p>Other Requirements:</p>\n<ul>\n<li>Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_b5258a48-495","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/senior-data-and-applied-scientist/","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Data Science","Mathematics","Statistics","Econometrics","Economics","Operations Research","Computer Science","LLM/SLM labeling pipelines","Prompt engineering","Model tuning","Data pipelines","Quality frameworks"],"x-skills-preferred":[],"datePosted":"2026-03-08T22:19:32.251Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Barcelona"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, LLM/SLM labeling pipelines, Prompt engineering, Model tuning, Data pipelines, Quality frameworks"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_714d4a02-1c4"},"title":"Applied Scientist","description":"<p>Imagine shaping the future of local search for millions of users worldwide. At Bing Places, you&#39;ll join a team that powers business entity relevance on the search results page. You&#39;ll work on cutting-edge tools and metrics that ensure users find the most accurate and meaningful local results. Our team thrives on innovation, leveraging large and small language models, and advanced measurement systems to deliver exceptional quality.</p>\n<p>As a Applied Scientist in Bing Places, you will design new relevance metrics, build labeling pipelines, and fine-tune language models to improve search quality. You&#39;ll work on prompt engineering, implement modern language models techniques like Retrieval Augmented Generation, and create scalable workflows for measurement and evaluation.</p>\n<p>This opportunity will allow you to:</p>\n<ul>\n<li>Accelerate your career growth by working on state-of-the-art AI systems.</li>\n<li>Develop deep expertise in prompt engineering and model tuning.</li>\n<li>Hone your skills in building robust data pipelines and quality frameworks.</li>\n</ul>\n<p>Microsoft&#39;s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Design and implement new relevance metrics to measure and improve local search quality.</li>\n<li>Develop and optimize LLM/SLM labeling pipelines for high-throughput, consistent quality judgments.</li>\n<li>Engineer and fine-tune prompts for LLMs to enhance query understanding and classification accuracy.</li>\n<li>Apply modern LLM techniques such as retrieval-augmented generation for improved grounding and relevance.</li>\n<li>Build scalable workflows and dashboards for measurement, evaluation cycles, and quality checks.</li>\n<li>Analyze failure modes and improve prompt rubrics to reduce defect rates and enhance labeling consistency.</li>\n<li>Collaborate with cross-functional teams to integrate metrics and labeling systems into production environments.</li>\n</ul>\n<p>Qualifications:</p>\n<ul>\n<li>Bachelor’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND hands on experience (e.g., statistics, predictive analytics, research) OR Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND hands on experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field OR equivalent experience.</li>\n</ul>\n<p>Other Requirements:</p>\n<ul>\n<li>Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_714d4a02-1c4","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/applied-scientist-7/","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Statistics","Econometrics","Computer Science","Electrical or Computer Engineering","LLM/SLM labeling pipelines","Prompt engineering","Model tuning","Data pipelines","Quality frameworks"],"x-skills-preferred":["Retrieval Augmented Generation","Scalable workflows","Dashboards","Measurement","Evaluation cycles","Quality checks"],"datePosted":"2026-03-08T22:15:35.607Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Barcelona"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, LLM/SLM labeling pipelines, Prompt engineering, Model tuning, Data pipelines, Quality frameworks, Retrieval Augmented Generation, Scalable workflows, Dashboards, Measurement, Evaluation cycles, Quality checks"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_6cc383e0-ff6"},"title":"ML Infrastructure Engineer, Safeguards","description":"<p><strong>About the role</strong></p>\n<p>We are seeking a Machine Learning Infrastructure Engineer to join our Safeguards organization, where you&#39;ll build and scale the critical infrastructure that powers our AI safety systems. You&#39;ll work at the intersection of machine learning, large-scale distributed systems, and AI safety, developing the platforms and tools that enable our safeguards to operate reliably at scale.</p>\n<p><strong>Responsibilities:</strong></p>\n<ul>\n<li>Design and build scalable ML infrastructure to support real-time and batch classifier and safety evaluations across our model ecosystem</li>\n<li>Build monitoring and observability tools to track model performance, data quality, and system health for safety-critical applications</li>\n<li>Collaborate with research teams to productionize safety research, translating experimental safety techniques into robust, scalable systems</li>\n<li>Optimize inference latency and throughput for real-time safety evaluations while maintaining high reliability standards</li>\n<li>Implement automated testing, deployment, and rollback systems for ML models in production safety applications</li>\n<li>Partner with Safeguards, Security, and Alignment teams to understand requirements and deliver infrastructure that meets safety and production needs</li>\n<li>Contribute to the development of internal tools and frameworks that accelerate safety research and deployment</li>\n</ul>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have 5+ years of experience building production ML infrastructure, ideally in safety-critical domains like fraud detection, content moderation, or risk assessment</li>\n<li>Are proficient in Python and have experience with ML frameworks like PyTorch, TensorFlow, or JAX</li>\n<li>Have hands-on experience with cloud platforms (AWS, GCP) and container orchestration (Kubernetes)</li>\n<li>Understand distributed systems principles and have built systems that handle high-throughput, low-latency workloads</li>\n<li>Have experience with data engineering tools and building robust data pipelines (e.g., Spark, Airflow, streaming systems)</li>\n<li>Are results-oriented, with a bias towards reliability and impact in safety-critical systems</li>\n<li>Enjoy collaborating with researchers and translating cutting-edge research into production systems</li>\n<li>Care deeply about AI safety and the societal impacts of your work</li>\n</ul>\n<p><strong>Strong candidates may have experience with:</strong></p>\n<ul>\n<li>Working with large language models and modern transformer architectures</li>\n<li>Implementing A/B testing frameworks and experimentation infrastructure for ML systems</li>\n<li>Developing monitoring and alerting systems for ML model performance and data drift</li>\n<li>Building automated labeling systems and human-in-the-loop workflows</li>\n<li>Experience in trust &amp; safety, fraud prevention, or content moderation domains</li>\n<li>Knowledge of privacy-preserving ML techniques and compliance requirements</li>\n<li>Contributing to open-source ML infrastructure projects</li>\n</ul>\n<p><strong>Deadline to apply:</strong></p>\n<p>None. Applications will be reviewed on a rolling basis.</p>\n<p><strong>Logistics</strong></p>\n<ul>\n<li>Education requirements: We require at least a Bachelor&#39;s degree in a related field or equivalent experience.</li>\n<li>Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</li>\n<li>Visa sponsorship: We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</li>\n</ul>\n<p><strong>We encourage you to apply even if you do not believe you meet every single qualification.</strong></p>\n<p>Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you&#39;re interested in this work.</p>\n<p><strong>Your safety matters to us.</strong></p>\n<p>To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you&#39;re ever unsure about a communication, don&#39;t click any links—visit anthropic.com/careers directly for confirmed position openings.</p>\n<p><strong>How we&#39;re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing the state of the art in AI safety and making a meaningful difference in the world.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_6cc383e0-ff6","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://job-boards.greenhouse.io","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/4778843008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$320,000 - $405,000 USD","x-skills-required":["Python","PyTorch","TensorFlow","JAX","AWS","GCP","Kubernetes","Spark","Airflow","streaming systems"],"x-skills-preferred":["large language models","modern transformer architectures","A/B testing frameworks","experimentation infrastructure","monitoring and alerting systems","automated labeling systems","human-in-the-loop workflows","trust & safety","fraud prevention","content moderation domains","privacy-preserving ML techniques","compliance requirements"],"datePosted":"2026-03-08T13:46:05.401Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, PyTorch, TensorFlow, JAX, AWS, GCP, Kubernetes, Spark, Airflow, streaming systems, large language models, modern transformer architectures, A/B testing frameworks, experimentation infrastructure, monitoring and alerting systems, automated labeling systems, human-in-the-loop workflows, trust & safety, fraud prevention, content moderation domains, privacy-preserving ML techniques, compliance requirements","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":320000,"maxValue":405000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_7e3331e3-3f3"},"title":"Software Engineer, Research - Human Data","description":"<p><strong>Software Engineer, Research - Human Data</strong></p>\n<p><strong>About the Team</strong></p>\n<p>OpenAI’s mission is to ensure that artificial general intelligence (AGI) benefits all of humanity. A key part of achieving that mission is training models that deeply understand and reflect human preferences — the <strong>Human Data</strong> team is at the heart of that effort.</p>\n<p>The Human Data engineering team creates the systems that enable scalable, high-quality human feedback. These systems are essential to how OpenAI trains and improves its most advanced models. Engineers on this team collaborate closely with world-class researchers to bring alignment techniques to life — from experimental ideas to production-ready feedback loops.</p>\n<p><strong>About the Role</strong></p>\n<p>We’re looking for software engineers to join the Human Data team and build the platforms, prototypes, tools, and infrastructure that power how our AI models are trained, aligned, and evaluated. You’ll partner with researchers and cross-functional teams to bring alignment ideas to life, influence future model training, and shape how models interact with the real world.</p>\n<p>We’re looking for people who are excited by technical ownership, enjoy working across the stack, and are eager to solve ambiguous problems in a high-impact, fast-paced environment.</p>\n<p>This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.</p>\n<p><strong>In this role, you will:</strong></p>\n<ul>\n<li>Build and maintain robust full-stack systems for feedback collection, data labeling, and evaluation pipelines, while maintaining high levels of security.</li>\n</ul>\n<ul>\n<li>Translate experimental alignment research into scalable production infrastructure, including inference and model training stacks.</li>\n</ul>\n<ul>\n<li>Design and iterate on user-facing tools and backend services to support high-quality data workflows</li>\n</ul>\n<ul>\n<li>Partner with researchers, engineers, and program leads to shape feedback loops and model interaction paradigms</li>\n</ul>\n<ul>\n<li>Drive infrastructure improvements that enable faster iteration and scaling across OpenAI’s frontier models, from internal research tooling all the way to production ChatGPT.</li>\n</ul>\n<p><strong>You might thrive in this role if you:</strong></p>\n<ul>\n<li>Have strong software engineering fundamentals and experience building production systems at scale</li>\n</ul>\n<ul>\n<li>Enjoy full-stack development with end-to-end ownership — from backend pipelines to user interfaces</li>\n</ul>\n<ul>\n<li>Are motivated by high-impact collaboration with research teams and solving novel, ambiguous problems</li>\n</ul>\n<ul>\n<li>Are excited to shape how AI systems learn from human preferences and reflect a broad range of human values</li>\n</ul>\n<ul>\n<li>Care deeply about inclusive tooling and building systems that enhance model safety, reliability, and usefulness</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_7e3331e3-3f3","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/4d6a5951-9838-434c-830a-22cb938ea228","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"US$230K – $385K • Offers Equity\nLondon£131K – £245K • Offers Equity","x-skills-required":["software engineering","full-stack development","data labeling","evaluation pipelines","security","inference and model training stacks","user-facing tools","backend services","data workflows","research collaboration","model interaction paradigms","infrastructure improvements","AI systems","human preferences","inclusive tooling","model safety","reliability","usefulness"],"x-skills-preferred":["strong software engineering fundamentals","experience building production systems at scale","full-stack development with end-to-end ownership","high-impact collaboration with research teams","solving novel, ambiguous problems","shaping how AI systems learn from human preferences"],"datePosted":"2026-03-06T18:39:21.723Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco; London, UK"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"software engineering, full-stack development, data labeling, evaluation pipelines, security, inference and model training stacks, user-facing tools, backend services, data workflows, research collaboration, model interaction paradigms, infrastructure improvements, AI systems, human preferences, inclusive tooling, model safety, reliability, usefulness, strong software engineering fundamentals, experience building production systems at scale, full-stack development with end-to-end ownership, high-impact collaboration with research teams, solving novel, ambiguous problems, shaping how AI systems learn from human preferences","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":131000,"maxValue":385000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_684d1cb4-32c"},"title":"Operations Analyst, User Operations (Trust & Safety) - EMEA","description":"<p><strong>Operations Analyst, User Operations (Trust &amp; Safety) - EMEA</strong></p>\n<p><strong>About the Team</strong></p>\n<p>At OpenAI, our <strong>User Operations</strong> team safeguards our products and users from abuse, fraud, and safety risks. We serve as part of the company’s frontline safety infrastructure—helping ensure real-world user experiences translate into timely, high-integrity decisions and durable improvements. We operate at the intersection of operations, user trust, and risk management, collaborating cross-functionally with partners across Legal, Policy, Engineering, and Product, as well as external vendors.</p>\n<p><strong>About the Role</strong></p>\n<p>We are seeking a sharp, adaptive, and technically capable Operations Analyst to help scale and evolve our user safety and risk operations—focused on complex, time-critical issues where the right decision can materially change outcomes for users and the business.</p>\n<p>In this role, you’ll sit on the frontline of user safety: detecting and assessing high-impact risks that surface in real-world product usage and driving them to resolution alongside teams across the company. You’ll turn messy, ambiguous signals into clear actions—improving how we respond in the moment while also strengthening the systems (processes, documentation, and automation) that make consistent, high-quality decisions possible at scale. This is a high-autonomy role with visible impact: you’ll own workstreams end-to-end and measure success through faster response, higher quality outcomes, and reduced risk.</p>\n<p>Location / work model: London (hybrid, 3 days/week in-office).</p>\n<p>_<strong>Please note:</strong> This role may involve exposure to sensitive or concerning content. Strong discretion, good judgment, and resilience are essential._</p>\n<p><strong>In This Role, You Will</strong></p>\n<ul>\n<li>Triage and resolve complex, high-sensitivity user issues and escalations, including trust &amp; safety incidents and other time-critical risk reviews.</li>\n</ul>\n<ul>\n<li>Conduct risk evaluations and investigations using internal tooling, operational documentation, and appropriate external sources when relevant.</li>\n</ul>\n<ul>\n<li>Serve as an incident manager for sensitive reviews requiring nuanced interpretation, clear decision-making, and strong cross-functional coordination.</li>\n</ul>\n<ul>\n<li>Partner with stakeholders across Legal, Policy, Product, Engineering, and Support to drive fast, defensible outcomes—and ensure lessons learned translate into improvements to user experience, policy interpretation, and safety operations.</li>\n</ul>\n<ul>\n<li>Design and improve operational workflows (intake, triage, escalation pathways, QA, training, and governance) with a strong focus on consistency, scalability, and auditability.</li>\n</ul>\n<ul>\n<li>Build and maintain playbooks, decision trees, knowledge articles, and macros, and continuously refine them based on new learnings.</li>\n</ul>\n<ul>\n<li>Take an automation-first approach: identify repetitive work, redesign the workflow, and prototype automation using AI tools, no-code platforms, or lightweight scripting (in partnership with technical counterparts as needed).</li>\n</ul>\n<ul>\n<li>Monitor operational health through quality audits, SLA tracking, escalation accuracy, and trend analysis, and propose interventions grounded in clear metrics.</li>\n</ul>\n<ul>\n<li>Contribute to vendor enablement and governance, including training, calibration, and process improvements—especially during transitions or ramp periods.</li>\n</ul>\n<p><strong>You Might Thrive in This Role If You</strong></p>\n<ul>\n<li>Have <strong>5+ years</strong> of experience in trust &amp; safety, risk operations, investigations, incident response, or comparable high-judgment operational work in a fast-moving environment.</li>\n</ul>\n<ul>\n<li>Are highly technical for an ops role: strong analytical skills (e.g., SQL), comfortable working with data tooling/dashboards, and able to translate insights into operational changes.</li>\n</ul>\n<ul>\n<li>Can operationalize ambiguous risk signals into structured inputs for classifier/detection development—including taxonomy design, labeling guidance and quality standards, and feedback loops that improve performance over time (in collaboration with technical stakeholders).</li>\n</ul>\n<ul>\n<li>Operate with high standards and high conviction: you can form clear, defensible points of view, communicate them crisply, and drive decisions to closure.</li>\n</ul>\n<ul>\n<li>Default to systems thinking: you routinely turn one-off fixes into repeatable processes, and you measure impact (time saved, accuracy improved, risk reduced).</li>\n</ul>\n<ul>\n<li>Are AI-fluent and know how to apply model/agent tooling in real workflows—while maintaining strong QA discipline and appropriate safeguards.</li>\n</ul>\n<ul>\n<li>Communicate clearly and effectively—especially in writing—when dealing with sensitive topics and complex tradeoffs.</li>\n</ul>\n<ul>\n<li>Thrive in ambiguity, manage multiple priorities simultaneously, and stay effective as context changes.</li>\n</ul>\n<p><strong>Nice to Have</strong></p>\n<ul>\n<li>Experience building QA programs, calibrations, sampling plans, or standardized review practices for sensitive workflows.</li>\n</ul>\n<ul>\n<li>Experience working with scaled vendor operations and governance models.</li>\n</ul>\n<ul>\n<li>Familiarity with building lightweight automation (scripts, no-code tools, workflow automation) in operational contexts.</li>\n</ul>\n<p><strong>About OpenAI</strong></p>\n<p>OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_684d1cb4-32c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"OpenAI","sameAs":"https://jobs.ashbyhq.com","logo":"https://logos.yubhub.co/openai.com.png"},"x-apply-url":"https://jobs.ashbyhq.com/openai/986732f2-56a0-44ca-9030-a7cdeccf9454","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["SQL","data tooling/dashboards","AI tools","no-code platforms","lightweight scripting"],"x-skills-preferred":["taxonomy design","labeling guidance and quality standards","feedback loops","vendor enablement and governance","training and calibration"],"datePosted":"2026-03-06T18:32:08.483Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London, UK"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"SQL, data tooling/dashboards, AI tools, no-code platforms, lightweight scripting, taxonomy design, labeling guidance and quality standards, feedback loops, vendor enablement and governance, training and calibration"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_601ca6bf-9b1"},"title":"Senior Machine Learning Engineer, Natural Language Processing - PhD Early Career","description":"<p><strong>[2026] Senior Machine Learning Engineer, Natural Language Processing - PhD Early Career</strong></p>\n<p>San Mateo, CA, United States</p>\n<p>Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.</p>\n<p>At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.</p>\n<p>A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.</p>\n<p>Natural Language Processing (NLP) is central to enabling massive-scale communication, creation, and safety across the Roblox platform. This role offers the unique opportunity to build and deploy cutting-edge <strong>NLP, speech, and generative AI models</strong> that operate at an unprecedented scale, impacting hundreds of millions of daily users.</p>\n<p>You will solve an extremely diverse range of high-scale language-related problems—from <strong>real-time moderation of voice and text</strong> to <strong>automatically localizing experiences</strong> and empowering users through <strong>LLM-driven creation tools</strong>. We combine cutting-edge research with large-scale engineering to bridge experimentation and production, designing algorithms that shape the next generation of language services for our immersive, user-generated content platform.</p>\n<p><strong><strong>Teams Hiring for This Role</strong></strong></p>\n<ul>\n<li><strong>Safety AI Systems:</strong>Dedicated to building end-to-end ML systems for maintaining civility and safety across the platform, operating at massive scale. This includes:</li>\n</ul>\n<ul>\n<li><strong>Real-time Moderation:</strong> Building world-class NLP and speech models for <strong>real-time moderation of voice and text</strong> (processing over 6 billion messages daily) and advanced interventions that measurably improve user civility.</li>\n</ul>\n<ul>\n<li><strong>Critical Harms &amp; Advanced Detection:</strong> Developing specialized LLM agents, behavioral analysis, and graph systems for detecting and preventing rare, high-risk scenarios (e.g., child safety, terrorism), requiring adversarial thinking and multi-step reasoning.</li>\n</ul>\n<ul>\n<li><strong>Safety Data Quality:</strong> Ensuring all Safety AI systems are robust by managing the core data infrastructure, MLOps, and Active Learning initiatives for continuous model improvement.</li>\n</ul>\n<p><strong>You Will</strong></p>\n<ul>\n<li>Design and implement <strong>deep learning-based NLP and speech solutions</strong> that address problems across Roblox, from creation to safety.</li>\n</ul>\n<ul>\n<li>Develop advanced models, including <strong>Large Language Models (LLMs), machine translation, and generative AI</strong>, for user interactions, content creation, and moderation.</li>\n</ul>\n<ul>\n<li>Have the independence and <strong>end-to-end responsibility</strong> to develop NLP/ML-based services that are scalable and resilient.</li>\n</ul>\n<ul>\n<li>Be a <strong>technical bar-raiser</strong> for cutting-edge ML technology, high code quality, and architectural designs.</li>\n</ul>\n<ul>\n<li>Work backward from user and product needs to deliver ML solutions that drive engagement, safety, and ecosystem growth.</li>\n</ul>\n<p><strong>You Have</strong></p>\n<ul>\n<li>Possessing or pursuing a Ph.D. in Computer Science, Computer Engineering, Mathematics, Statistics, or a related technical field, with a thesis aligned to Roblox’s research areas.</li>\n</ul>\n<ul>\n<li>Expertise in one or more areas: NLP, Speech Models, Large Language Models, Machine Translation, or Generative AI (including diffusion models).</li>\n</ul>\n<ul>\n<li>Experience with transformer-based model design, training, serving, and product integration.</li>\n</ul>\n<ul>\n<li>A strong research track record, evidenced by multiple publications and presentations in top-tier, peer-reviewed venues (e.g., ACL, EMNLP, Interspeech, ICML, NeurIPS).</li>\n</ul>\n<ul>\n<li>Proficiency in one or more programming languages (e.g., Python, C++, Go, Java) and experience building and optimizing large-scale systems.</li>\n</ul>\n<p>You may redact age, date of birth, and dates of attendance/graduation from your resume if you prefer.</p>\n<p>As you apply, you can find more information about our process by signing up for Speak\\_. You&#39;ll gain access to our practice assessment, comprehensive guides, FAQs, and modules designed to help you ace the hiring process.</p>\n<p>For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on <strong>this page</strong>.</p>\n<p>Annual Salary Range</p>\n<p>$195,780—$242,100 USD</p>\n<p>Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_601ca6bf-9b1","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Roblox","sameAs":"https://careers.roblox.com","logo":"https://logos.yubhub.co/careers.roblox.com.png"},"x-apply-url":"https://careers.roblox.com/jobs/7324377","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$195,780—$242,100 USD","x-skills-required":["NLP","Speech Models","Large Language Models","Machine Translation","Generative AI","Python","C++","Go","Java","Transformer-based model design","Training","Serving","Product integration"],"x-skills-preferred":["Deep learning","Computer vision","Natural language processing","Speech recognition","Text analysis","Sentiment analysis","Named entity recognition","Part-of-speech tagging","Dependency parsing","Semantic role labeling"],"datePosted":"2026-03-06T14:18:50.958Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Mateo, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"NLP, Speech Models, Large Language Models, Machine Translation, Generative AI, Python, C++, Go, Java, Transformer-based model design, Training, Serving, Product integration, Deep learning, Computer vision, Natural language processing, Speech recognition, Text analysis, Sentiment analysis, Named entity recognition, Part-of-speech tagging, Dependency parsing, Semantic role labeling","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":195780,"maxValue":242100,"unitText":"YEAR"}}}]}