{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/proprietary-datasets"},"x-facet":{"type":"skill","slug":"proprietary-datasets","display":"Proprietary Datasets","count":1},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_ed5725bb-311"},"title":"Applied Research Engineer, Agents","description":"<p>Shape the Future of AI</p>\n<p>At Labelbox, we&#39;re building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018, we&#39;ve been pioneering data-centric approaches that are fundamental to AI development, and our work becomes even more essential as AI capabilities expand exponentially.</p>\n<p>As an Applied Research Engineer at Labelbox, you&#39;ll sit at the junction of advanced AI research and real product impact, with a focus on the data that makes modern agents work,browser interactions, SWE/code traces, GUI sessions, and multi-turn workflows. You&#39;ll drive the data landscape required to advance capable, adaptable agents and help shape Labelbox&#39;s strategy for collecting, synthesizing, and evaluating it.</p>\n<p>Create frameworks and tools to construct, train, benchmark and evaluate autonomous agent capabilities.</p>\n<p>Design agent-focused data programs using supervised fine-tuning (SFT) and reinforcement learning (RL) methodologies.</p>\n<p>Develop data pipelines from diverse sources like code repositories, web browsers, and computer systems.</p>\n<p>Implement and adapt popular open-source agent libraries and benchmarks with proprietary datasets and models.</p>\n<p>Engage with research teams in frontier AI labs and the wider AI community to understand evolving agent data needs for frontier models and share best practices.</p>\n<p>Collaborate closely with frontier AI lab customers to understand requirements and guide model development.</p>\n<p>Publish research findings in academic journals, conferences, and blog posts.</p>\n<p>What You Bring</p>\n<p>Ph.D. or Master&#39;s degree in Computer Science, Machine Learning, AI, or related field.</p>\n<p>At least 3 years of experience addressing sophisticated ML problems with successful delivery to customers.</p>\n<p>Experience building and training autonomous agents,tool use, structured outputs, multi-step planning,across browsers/GUI, codebases, and databases using SFT and RL.</p>\n<p>Constructed and evaluated agentic benchmarks (e.g. SWE-bench, WebArena, τ-bench, OSWorld) and reliability/efficiency suites (e.g. WABER).</p>\n<p>Adept at interpreting research literature and quickly turning new ideas into prototypes.</p>\n<p>Deep understanding of frontier models (autoregressive, diffusion), post-training (SFT, RLVR, RLAIF, RLHF, et al.), and their human data requirements.</p>\n<p>Proficient in Python, data science libraries and deep learning frameworks (e.g., PyTorch, JAX, TensorFlow).</p>\n<p>Strong analytical and problem-solving abilities in ambiguous situations.</p>\n<p>Excellent communication skills.</p>\n<p>Track record of publications in top-tier AI/ML venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, etc.).</p>\n<p>Labelbox Applied Research</p>\n<p>At Labelbox Applied Research, we&#39;re committed to pushing the boundaries of AI and data-centric machine learning, with a particular focus on advanced human-AI interaction techniques. We believe that high-quality human data and sophisticated human feedback integration methods are key to unlocking the next generation of AI capabilities. Our research team works at the intersection of machine learning, human-computer interaction, and AI ethics to develop innovative solutions that can be practically applied in real-world scenarios.</p>\n<p>Life at Labelbox</p>\n<p>Location: Join our dedicated tech hubs in San Francisco or Wrocław, Poland</p>\n<p>Work Style: Hybrid model with 2 days per week in office, combining collaboration and flexibility</p>\n<p>Environment: Fast-paced and high-intensity, perfect for ambitious individuals who thrive on ownership and quick decision-making</p>\n<p>Growth: Career advancement opportunities directly tied to your impact</p>\n<p>Vision: Be part of building the foundation for humanity&#39;s most transformative technology</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_ed5725bb-311","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Labelbox","sameAs":"https://www.labelbox.com/","logo":"https://logos.yubhub.co/labelbox.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/labelbox/jobs/4829775007","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$250,000-$300,000 USD","x-skills-required":["Python","data science libraries","deep learning frameworks","PyTorch","JAX","TensorFlow","supervised fine-tuning","reinforcement learning","agent libraries","benchmarks","proprietary datasets","human-AI interaction","AI ethics"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:52:38.777Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco Bay Area"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, data science libraries, deep learning frameworks, PyTorch, JAX, TensorFlow, supervised fine-tuning, reinforcement learning, agent libraries, benchmarks, proprietary datasets, human-AI interaction, AI ethics","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":250000,"maxValue":300000,"unitText":"YEAR"}}}]}