{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/huggingface"},"x-facet":{"type":"skill","slug":"huggingface","display":"Huggingface","count":17},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_19fc414d-dcc"},"title":"Specialist Solutions Architect - AI & ML (Communications, Media, Entertainment & Games)","description":"<p>As a Specialist Solutions Architect - AI &amp; ML Engineer, you will be the trusted technical ML &amp; AI expert to both Databricks customers and the Field Engineering organisation.</p>\n<p>You will work with Solution Architects to guide customers in architecting production-grade ML &amp; AI applications on Databricks, while aligning their technical roadmap with the continually evolving Databricks Data Intelligence Platform.</p>\n<p>You will continue to strengthen your technical skills through applying cutting-edge technologies in GenAI, MLOps, and ML more broadly, expanding your impact through mentorship, and establishing yourself as an AI thought leader.</p>\n<p>The impact you will have:</p>\n<ul>\n<li>Architect production-level ML &amp; AI workloads for customers using our unified platform, including agents, end-to-end ML pipelines, training/inference optimisation, integration with cloud-native services, MLOps, etc.</li>\n</ul>\n<ul>\n<li>Serve as trusted practitioner for enterprise GenAI solutions, including RAG architectures, agentic systems 
(tool-calling agents, multi-agent orchestration, guardrails), natural language querying of structured data, AI evaluation and observability, and monitoring systems</li>\n</ul>\n<ul>\n<li>Build, scale, and optimise customer AI workloads and apply best-in-class MLOps to productionise these workloads across a variety of domains</li>\n</ul>\n<ul>\n<li>Provide advanced technical support to Solution Architects during the technical sale ranging from feature engineering, training, tracking, serving to model monitoring all within a single platform, as well as participating in the larger ML SME community in Databricks</li>\n</ul>\n<ul>\n<li>Collaborate cross-functionally with the product and engineering teams to represent the voice of the customer, define priorities and influence the product roadmap, helping with the adoption of Databricks&#39; AI offerings</li>\n</ul>\n<p>What we look for:</p>\n<ul>\n<li>5+ years of hands-on industry ML experience in at least one of the following:</li>\n</ul>\n<ul>\n<li>ML Engineer: Build and maintain production-grade cloud (AWS/Azure/GCP) infrastructure that supports the deployment of ML applications, including drift monitoring.</li>\n</ul>\n<ul>\n<li>AI Engineer: Experience with the latest techniques in LLMs &amp; agentic systems including vector databases, fine-tuning LLMs, AI guardrail systems, and deploying LLMs with tools such as HuggingFace, Langchain, and OpenAI</li>\n</ul>\n<ul>\n<li>Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) 
or equivalent practical experience</li>\n</ul>\n<ul>\n<li>Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike</li>\n</ul>\n<ul>\n<li>Passion for collaboration, life-long learning, and driving business value through ML &amp; AI</li>\n</ul>\n<ul>\n<li>[Preferred] 2+ years customer-facing experience in a pre-sales or post-sales role</li>\n</ul>\n<ul>\n<li>Can meet expectations for technical training and role-specific outcomes within 3 months of hire</li>\n</ul>\n<ul>\n<li>This role can be remote, but we prefer that you be located in the job listing area and can travel up to 30% when needed</li>\n</ul>\n<p>Pay Range Transparency Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilising the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here. 
Local Pay Range $219,100-$301,300 USD</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_19fc414d-dcc","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8480547002","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$219,100-$301,300 USD","x-skills-required":["ML Engineer","AI Engineer","GenAI","MLOps","Cloud-Native Services","Vector Databases","Fine-Tuning LLMs","AI Guardrail Systems","HuggingFace","Langchain","OpenAI"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:58:35.900Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"ML Engineer, AI Engineer, GenAI, MLOps, Cloud-Native Services, Vector Databases, Fine-Tuning LLMs, AI Guardrail Systems, HuggingFace, Langchain, OpenAI","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":219100,"maxValue":301300,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_a38ec886-62e"},"title":"AI Engineer - FDE (Forward Deployed Engineer)","description":"<p>Mission</p>\n<p>The AI Forward Deployed Engineering (AI FDE) team is a highly specialized customer-facing AI team at Databricks. 
We deliver professional services engagements to help our customers build and productionize first-of-its-kind AI applications.</p>\n<p>We work cross-functionally to shape long-term strategic priorities and initiatives alongside engineering, product, and developer relations, as well as support internal subject matter expert (SME) teams. We view our team as an ensemble: we look for individuals with strong, unique specializations to improve the overall strength of the team.</p>\n<p>This team is the right fit for you if you love working with customers, teammates, and fueling your curiosity for the latest trends in GenAI, LLMOps, and ML more broadly. This role can be remote.</p>\n<p>The impact you will have:</p>\n<ul>\n<li>Develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI research to solve customer problems</li>\n</ul>\n<ul>\n<li>Own production rollouts of consumer and internally facing GenAI applications</li>\n</ul>\n<ul>\n<li>Serve as a trusted technical advisor to customers across a variety of domains</li>\n</ul>\n<ul>\n<li>Present at conferences such as Data + AI Summit, recognized as a thought leader internally and externally</li>\n</ul>\n<ul>\n<li>Collaborate cross-functionally with the product and engineering teams to influence priorities and shape the product roadmap</li>\n</ul>\n<p>What we look for:</p>\n<ul>\n<li>Experience building GenAI applications, including RAG, multi-agent systems, Text2SQL, fine-tuning, etc., with tools such as HuggingFace, LangChain, and DSPy</li>\n</ul>\n<ul>\n<li>Minimum of 5+ years of relevant experience as a Data Scientist preferably working in a consulting role</li>\n</ul>\n<ul>\n<li>Expertise in deploying production-grade GenAI applications, including evaluation and optimizations</li>\n</ul>\n<ul>\n<li>Extensive years of hands-on industry data science experience, leveraging common machine learning and data science tools, i.e. 
pandas, scikit-learn, PyTorch, etc.</li>\n</ul>\n<ul>\n<li>Experience building production-grade machine learning deployments on AWS, Azure, or GCP</li>\n</ul>\n<ul>\n<li>Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) or equivalent practical experience</li>\n</ul>\n<ul>\n<li>Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike</li>\n</ul>\n<ul>\n<li>Passion for collaboration, life-long learning, and driving business value through AI</li>\n</ul>\n<ul>\n<li>Preferred experience using the Databricks Intelligence Platform and Apache Spark to process large-scale distributed datasets</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_a38ec886-62e","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8099751002","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["GenAI","HuggingFace","LangChain","DSPy","pandas","scikit-learn","PyTorch","AWS","Azure","GCP","Apache Spark"],"x-skills-preferred":["Databricks Intelligence Platform","Mosaic AI research"],"datePosted":"2026-04-18T15:58:10.707Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote - India"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"GenAI, HuggingFace, LangChain, DSPy, pandas, scikit-learn, PyTorch, AWS, Azure, GCP, Apache Spark, Databricks Intelligence Platform, Mosaic AI 
research"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_10ceb713-2cf"},"title":"Specialist Solutions Architect - AI & ML (Financial Services)","description":"<p>As a Specialist Solutions Architect - AI &amp; ML Engineer, you will be the trusted technical ML &amp; AI expert to both Databricks customers and the Field Engineering organization.</p>\n<p>You will work with Solution Architects to guide customers in architecting production-grade ML &amp; AI applications on Databricks, while aligning their technical roadmap with the continually evolving Databricks Data Intelligence Platform.</p>\n<p>Your responsibilities will include:</p>\n<ul>\n<li>Architecting production-level ML &amp; AI workloads for customers using our unified platform, including agents, end-to-end ML pipelines, training/inference optimization, integration with cloud-native services, MLOps, etc.</li>\n</ul>\n<ul>\n<li>Serving as a trusted practitioner for enterprise GenAI solutions, including RAG architectures, agentic systems (tool-calling agents, multi-agent orchestration, guardrails), natural language querying of structured data, AI evaluation and observability, and monitoring systems</li>\n</ul>\n<ul>\n<li>Building, scaling, and optimizing customer AI workloads and applying best-in-class MLOps to productionize these workloads across a variety of domains</li>\n</ul>\n<ul>\n<li>Providing advanced technical support to Solution Architects during the technical sale ranging from feature engineering, training, tracking, serving to model monitoring all within a single platform, as well as participating in the larger ML SME community in Databricks</li>\n</ul>\n<ul>\n<li>Collaborating cross-functionally with the product and engineering teams to represent the voice of the customer, define priorities and influence the product roadmap, helping with the adoption of Databricks&#39; AI offerings</li>\n</ul>\n<p>We are looking for someone 
with 5+ years of hands-on industry ML experience in at least one of the following areas:</p>\n<ul>\n<li>ML Engineer: Build and maintain production-grade cloud (AWS/Azure/GCP) infrastructure that supports the deployment of ML applications, including drift monitoring.</li>\n</ul>\n<ul>\n<li>AI Engineer: Experience with the latest techniques in LLMs &amp; agentic systems including vector databases, fine-tuning LLMs, AI guardrail systems, and deploying LLMs with tools such as HuggingFace, Langchain, and OpenAI</li>\n</ul>\n<p>A graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) or equivalent practical experience is also required.</p>\n<p>Additionally, experience communicating and/or teaching technical concepts to non-technical and technical audiences alike is highly valued.</p>\n<p>The salary range for this position is $180,000-$247,500 USD, depending on location.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_10ceb713-2cf","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8434243002","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$180,000-$247,500 USD","x-skills-required":["ML Engineer","AI Engineer","GenAI","MLOps","Cloud Native Services","Vector Databases","Fine-Tuning LLMs","AI Guardrail Systems","HuggingFace","Langchain","OpenAI"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:53:44.261Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Central - United States; Northeast - United States; Southeast - United 
States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"ML Engineer, AI Engineer, GenAI, MLOps, Cloud Native Services, Vector Databases, Fine-Tuning LLMs, AI Guardrail Systems, HuggingFace, Langchain, OpenAI","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":180000,"maxValue":247500,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_46c30960-10d"},"title":"AI Applied Scientist","description":"<p>We&#39;re looking for applied scientists with a Machine Learning and Artificial Intelligence background to build AI technologies and make Figma products more magical. You will be driving fundamental and applied research in this area. You will be combining industry best practices and a first-principles approach to design and build AI/ML models and systems to improve Figma&#39;s products.</p>\n<p>What you&#39;ll do at Figma:</p>\n<ul>\n<li>Drive fundamental and applied research in AI.</li>\n<li>Explore the boundaries of what is possible with the current technology set to build best in class models for Figma&#39;s domains.</li>\n<li>Combine industry best practices and a first-principles approach to build cutting edge Generative AI models, using techniques like Supervised Finetuning (SFT), Reinforcement Learning (RL), prompt improvements and synthetic data generation.</li>\n<li>Work in concert with product and infrastructure engineers to improve Figma&#39;s products through AI powered features.</li>\n<li>Collaborate closely with product managers and engineers to transform user feedback into requirements for AI systems.</li>\n<li>Build evaluation systems to measure and improve quality of AI features in Figma products.</li>\n</ul>\n<p>We&#39;d love to hear from you if you have:</p>\n<ul>\n<li>Extensive experience in building generative AI features 
through prompt engineering, and fine tuning models in production environments.</li>\n<li>Experience working on deep learning and generative AI frameworks like PyTorch, JAX, HuggingFace etc.</li>\n<li>Experience training LLMs with Reinforcement Learning techniques such as preference-based RL (DPO, PPO) and/or RL with verifiable rewards (RLVR) such as GRPO/DAPO.</li>\n<li>4+ years in Generative AI, and 6+ years of experience in one or more of the following areas: machine learning, natural language processing/understanding, computer vision.</li>\n<li>Strong software engineering skills with 5+ years of experience in programming languages (Python, C++, Java or R).</li>\n<li>Experience communicating and working across functions to drive solutions.</li>\n</ul>\n<p>While not required, It’s an added plus if you also have:</p>\n<ul>\n<li>Proven track record of planning multi-year roadmap in which shorter-term projects ladder to the long-term vision.</li>\n<li>Experience in mentoring/influencing senior engineers across organizations.</li>\n<li>Expertise working on large scale and distributed AI training.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_46c30960-10d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Figma","sameAs":"https://www.figma.com/","logo":"https://logos.yubhub.co/figma.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/figma/jobs/5707966004","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$153,000-$376,000 USD","x-skills-required":["Generative AI","Machine Learning","Artificial Intelligence","Deep Learning","PyTorch","JAX","HuggingFace","Reinforcement Learning","Natural Language Processing","Computer 
Vision","Python","C++","Java","R"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:49:15.355Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA • New York, NY • United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Generative AI, Machine Learning, Artificial Intelligence, Deep Learning, PyTorch, JAX, HuggingFace, Reinforcement Learning, Natural Language Processing, Computer Vision, Python, C++, Java, R","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":153000,"maxValue":376000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c1802213-f81"},"title":"AI Engineer - FDE (Forward Deployed Engineer) - U.S. Federal Sector","description":"<p>Job Title: AI Engineer - FDE (Forward Deployed Engineer) - U.S. Federal Sector</p>\n<p>Job Description: The AI Forward Deployed Engineering (AI FDE) team is a highly specialized customer-facing AI team at Databricks. We deliver professional services engagements to help our federal government customers build and productionize first-of-its-kind AI applications.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI Research to solve customer problems.</li>\n<li>Own production rollouts of consumer and internally facing GenAI applications.</li>\n<li>Serve as a trusted technical advisor to customers across a variety of domains.</li>\n<li>Present at conferences such as Data + AI Summit, recognized as a thought leader internally and externally.</li>\n<li>Collaborate cross-functionally with the product and engineering teams to influence priorities and shape the product roadmap.</li>\n</ul>\n<p>What We Look For:</p>\n<ul>\n<li>Experience working with U.S. 
federal civilian, state, or local agencies.</li>\n<li>Experience building GenAI applications, including RAG, multi-agent systems, Text2SQL, fine-tuning, etc., with tools such as HuggingFace, LangChain, and DSPy.</li>\n<li>Expertise in deploying production-grade GenAI applications, including evaluation and optimizations.</li>\n<li>Extensive years of hands-on industry data science experience, leveraging common machine learning and data science tools (i.e., pandas, scikit-learn, PyTorch, etc.).</li>\n<li>Experience building production-grade machine learning deployments on AWS, Azure, or GCP.</li>\n<li>Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) or equivalent practical experience.</li>\n<li>Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike.</li>\n<li>Passion for collaboration, life-long learning, and driving business value through AI.</li>\n</ul>\n<p>Pay Range Transparency:</p>\n<p>Databricks is committed to fair and equitable compensation practices. 
The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles.</p>\n<p>Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location.</p>\n<p>Based on the factors above, Databricks anticipates utilizing the full width of the range.</p>\n<p>The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above.</p>\n<p>For more information regarding which range your location is in visit our page here.</p>\n<p>Local Pay Range $180,656-$248,360 USD</p>\n<p>Benefits:</p>\n<p>At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees.</p>\n<p>For specific details on the benefits offered in your region click here.</p>\n<p>Our Commitment to Diversity and Inclusion:</p>\n<p>At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel.</p>\n<p>We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards.</p>\n<p>Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.</p>\n<p>Compliance:</p>\n<p>If access to export-controlled technology or source code is required for performance of job duties, it is within Employer&#39;s discretion whether to apply for a U.S. 
government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c1802213-f81","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8415203002","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["GenAI","HuggingFace","LangChain","DSPy","pandas","scikit-learn","PyTorch","AWS","Azure","GCP","Graduate degree in Computer Science, Engineering, Statistics, Operations Research, etc."],"x-skills-preferred":[],"datePosted":"2026-04-18T15:47:10.995Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Maryland; Virginia; Washington, D.C."}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"GenAI, HuggingFace, LangChain, DSPy, pandas, scikit-learn, PyTorch, AWS, Azure, GCP, Graduate degree in Computer Science, Engineering, Statistics, Operations Research, etc."},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f3dfc09f-532"},"title":"AI Engineer - FDE (Forward Deployed Engineer)","description":"<p>We are seeking an AI Engineer - FDE (Forward Deployed Engineer) to join our team. As an AI Engineer, you will develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI research to solve customer problems. 
You will own production rollouts of consumer and internally facing GenAI applications, serve as a trusted technical advisor to customers across a variety of domains, and present at conferences such as Data + AI Summit.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI research to solve customer problems</li>\n<li>Own production rollouts of consumer and internally facing GenAI applications</li>\n<li>Serve as a trusted technical advisor to customers across a variety of domains</li>\n<li>Present at conferences such as Data + AI Summit</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>Experience building GenAI applications, including RAG, multi-agent systems, Text2SQL, fine-tuning, etc., with tools such as HuggingFace, LangChain, and DSPy</li>\n<li>Expertise in deploying production-grade GenAI applications, including evaluation and optimizations</li>\n<li>Extensive years of hands-on industry data science experience, leveraging common machine learning and data science tools, i.e. pandas, scikit-learn, PyTorch, etc.</li>\n<li>Experience building production-grade machine learning deployments on AWS, Azure, or GCP</li>\n<li>Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) or equivalent practical experience</li>\n<li>Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike</li>\n<li>Passion for collaboration, life-long learning, and driving business value through AI</li>\n</ul>\n<p>Benefits: At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. 
For specific details on the benefits offered in your region, please click here.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f3dfc09f-532","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com/","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8024004002","x-work-arrangement":"onsite","x-experience-level":"all levels","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["GenAI","HuggingFace","LangChain","DSPy","pandas","scikit-learn","PyTorch","AWS","Azure","GCP"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:45:26.463Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London, United Kingdom"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"GenAI, HuggingFace, LangChain, DSPy, pandas, scikit-learn, PyTorch, AWS, Azure, GCP"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c7431c7c-8d7"},"title":"AI Engineer - FDE (Forward Deployed Engineer)","description":"<p>We are seeking an AI Engineer - FDE (Forward Deployed Engineer) to join our team. As a member of our highly specialized customer-facing AI team, you will deliver professional services engagements to help our customers build and productionize first-of-its-kind AI applications. 
You will work cross-functionally to shape long-term strategic priorities and initiatives alongside engineering, product, and developer relations, as well as support internal subject matter expert (SME) teams.</p>\n<p>The impact you will have:</p>\n<ul>\n<li>Develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI Research to solve customer problems.</li>\n<li>Own production rollouts of consumer and internally facing GenAI applications.</li>\n<li>Serve as a trusted technical advisor to customers across a variety of domains.</li>\n<li>Present at conferences such as Data + AI Summit, recognized as a thought leader internally and externally.</li>\n<li>Collaborate cross-functionally with the product and engineering teams to influence priorities and shape the product roadmap.</li>\n</ul>\n<p>What we look for:</p>\n<ul>\n<li>Experience building GenAI applications, including RAG, multi-agent systems, Text2SQL, fine-tuning, etc., with tools such as HuggingFace, LangChain, and DSPy.</li>\n<li>Expertise in deploying production-grade GenAI applications, including evaluation and optimizations.</li>\n<li>Extensive years of hands-on industry data science experience, leveraging common machine learning and data science tools (i.e., pandas, scikit-learn, PyTorch, etc.).</li>\n<li>Experience building production-grade machine learning deployments on AWS, Azure, or GCP.</li>\n<li>Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) 
or equivalent practical experience.</li>\n<li>Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike.</li>\n<li>Passion for collaboration, life-long learning, and driving business value through AI.</li>\n<li>Preferred experience using the Databricks Intelligence Platform and Apache Spark™ to process large-scale distributed datasets.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c7431c7c-8d7","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8335860002","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$180,656-$248,360 USD","x-skills-required":["GenAI","HuggingFace","LangChain","DSPy","pandas","scikit-learn","PyTorch","AWS","Azure","GCP","Apache Spark"],"x-skills-preferred":["Databricks Intelligence Platform"],"datePosted":"2026-04-18T15:42:34.365Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"GenAI, HuggingFace, LangChain, DSPy, pandas, scikit-learn, PyTorch, AWS, Azure, GCP, Apache Spark, Databricks Intelligence Platform","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":180656,"maxValue":248360,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_daa2eb47-aa0"},"title":"AI Engineer - FDE (Forward Deployed Engineer)","description":"<p>The AI Forward Deployed Engineering (AI FDE) team at Databricks is a highly specialized 
customer-facing AI team. We deliver professional services engagements to help our customers build and productionize first-of-its-kind AI applications. We work cross-functionally to shape long-term strategic priorities and initiatives alongside engineering, product, and developer relations, as well as support internal subject matter expert (SME) teams.</p>\n<p>We view our team as an ensemble: we look for individuals with strong, unique specializations to improve the overall strength of the team. This team is the right fit for you if you love working with customers, teammates, and fueling your curiosity for the latest trends in GenAI, LLMOps, and ML more broadly. Open to remote locations.</p>\n<p>The impact you will have:</p>\n<ul>\n<li>Develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI Research to solve customer problems</li>\n<li>Own production rollouts of consumer and internally facing GenAI applications</li>\n<li>Serve as a trusted technical advisor to customers across a variety of domains</li>\n<li>Present at conferences such as Data + AI Summit, recognized as a thought leader internally and externally</li>\n<li>Collaborate cross-functionally with the product and engineering teams to influence priorities and shape the product roadmap</li>\n</ul>\n<p>What we look for:</p>\n<ul>\n<li>Experience building GenAI applications, including RAG, multi-agent systems, Text2SQL, fine-tuning, etc., with tools such as HuggingFace, LangChain, and DSPy</li>\n<li>Expertise in deploying production-grade GenAI applications, including evaluation and optimizations</li>\n<li>Extensive years of hands-on industry data science experience, leveraging common machine learning and data science tools, i.e. 
pandas, scikit-learn, PyTorch, etc.</li>\n<li>Experience building production-grade machine learning deployments on AWS, Azure, or GCP</li>\n<li>Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) or equivalent practical experience</li>\n<li>Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike</li>\n<li>Passion for collaboration, life-long learning, and driving business value through AI</li>\n</ul>\n<p>Benefits:</p>\n<p>At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit our website.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_daa2eb47-aa0","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com/","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8330188002","x-work-arrangement":"remote","x-experience-level":null,"x-job-type":"full-time","x-salary-range":null,"x-skills-required":["GenAI","HuggingFace","LangChain","DSPy","pandas","scikit-learn","PyTorch","AWS","Azure","GCP"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:42:25.160Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Melbourne, Australia"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"GenAI, HuggingFace, LangChain, DSPy, pandas, scikit-learn, PyTorch, AWS, Azure, GCP"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_bb7aa015-076"},"title":"AI Engineer - FDE (Forward Deployed Engineer)","description":"<p>We 
are seeking an AI Engineer - FDE (Forward Deployed Engineer) to join our team. As an AI Engineer, you will develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI Research to solve customer problems. You will own production rollouts of consumer and internally facing GenAI applications, serve as a trusted technical advisor to customers across a variety of domains, and present at conferences such as Data + AI Summit, recognized as a thought leader internally and externally. You will collaborate cross-functionally with the product and engineering teams to influence priorities and shape the product roadmap.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Develop cutting-edge GenAI solutions, incorporating the latest techniques from our Mosaic AI Research to solve customer problems</li>\n<li>Own production rollouts of consumer and internally facing GenAI applications</li>\n<li>Serve as a trusted technical advisor to customers across a variety of domains</li>\n<li>Present at conferences such as Data + AI Summit, recognized as a thought leader internally and externally</li>\n<li>Collaborate cross-functionally with the product and engineering teams to influence priorities and shape the product roadmap</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>Experience building GenAI applications, including RAG, multi-agent systems, Text2SQL, fine-tuning, etc., with tools such as HuggingFace, LangChain, and DSPy</li>\n<li>Expertise in deploying production-grade GenAI applications, including evaluation and optimizations</li>\n<li>Extensive years of hands-on industry data science experience, leveraging common machine learning and data science tools, i.e. pandas, scikit-learn, PyTorch, etc.</li>\n<li>Experience building production-grade machine learning deployments on AWS, Azure, or GCP</li>\n<li>Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) 
or equivalent practical experience</li>\n<li>Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike</li>\n<li>Passion for collaboration, life-long learning, and driving business value through AI</li>\n</ul>\n<p>Benefits: At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please click here.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_bb7aa015-076","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8298792002","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["GenAI","HuggingFace","LangChain","DSPy","pandas","scikit-learn","PyTorch","AWS","Azure","GCP"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:42:06.992Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Sydney, Australia"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"GenAI, HuggingFace, LangChain, DSPy, pandas, scikit-learn, PyTorch, AWS, Azure, GCP"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9f6fed50-cc0"},"title":"Applied AI, AI Engineer","description":"<p>About the Job</p>\n<p>We are seeking an Applied AI, AI Engineer to join our customer-facing technical organization. 
As a member of our team, you will work directly with enterprise clients from pre-sales through implementation to deploy cutting-edge AI solutions that deliver measurable business impact.</p>\n<p>Your primary responsibility will be to identify high-value internal use cases across engineering, legal, HR, sales, and operations, and build or vibe code end-to-end LLM applications. You will own the full lifecycle of these applications, from prototype to production, maintenance, and iteration.</p>\n<p>In addition to your technical skills, you will also be responsible for documenting learnings and sharing insights with product and research teams, and converting successful internal tools into customer demos or case studies where appropriate.</p>\n<p>How We Work in Applied AI</p>\n<p>We care about people and outputs. What matters is what you ship, not the time you spend on it. Bureaucracy is where urgency goes to vanish. You talk to whoever you need to talk to. The best idea wins, whether it comes from a principal engineer or someone in their first week. Always ask why. The best solutions come from deep understanding, not from copying what worked before. We say what we mean. Feedback is direct, timely, and given because we care. No politics. Low ego, high standards. We embrace an unstructured environment and find joy in it.</p>\n<p>About You</p>\n<p>You are fluent in English and have 3+ years of experience building production software, with meaningful experience deploying LLM applications. You have a bias toward shipping, preferring a working prototype over a perfect specification. You possess strong technical coding skills in Python and front-end skills with React Frameworks. 
You are comfortable working autonomously across teams with different needs and constraints, and have strong communication skills to bridge non-technical teams and AI capabilities.</p>\n<p>Ideally, you have contributions to open-source evaluation frameworks or published research on LLM evaluation, experience as a Customer Engineer, Forward Deployed Engineer, Sales Engineer, Solutions Architect, or Technical Product Manager, and experience with ML frameworks (PyTorch, HuggingFace Transformers).</p>\n<p>Benefits</p>\n<p>PTO: The CDI contract will be a &#39;Forfait 218 jours&#39;, corresponding to 25 days of holidays and on average 8 to 10 days of RTT days, and complete autonomy on working hours.</p>\n<p>Health: Full health insurance coverage for you and your family.</p>\n<p>Transportation: We offer a €600 annual mobility allowance, covering 50% of your public transportation costs and including the Sustainable Mobility Allowance (FMD), encouraging eco-friendly travel options such as cycling or carpooling.</p>\n<p>Food: Swile meal vouchers with 10,83€ per worked day, including 60% offered by the company.</p>\n<p>Sport: Gymlib - sponsorship by Mistral of a significant part of the monthly fee (depending on the program you chose).</p>\n<p>Parental policy: 4 additional weeks for parents on top of what is offered by the French state.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9f6fed50-cc0","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Mistral AI","sameAs":"https://mistral.ai","logo":"https://logos.yubhub.co/mistral.ai.png"},"x-apply-url":"https://jobs.lever.co/mistral/3d9a6ece-1f8c-4e0b-a275-fde6300ed1f8","x-work-arrangement":"onsite","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Python","React Frameworks","LLM applications","PyTorch","HuggingFace 
Transformers"],"x-skills-preferred":["Open-source evaluation frameworks","Published research on LLM evaluation","Customer Engineer","Forward Deployed Engineer","Sales Engineer","Solutions Architect","Technical Product Manager"],"datePosted":"2026-04-17T12:46:55.240Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Paris"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, React Frameworks, LLM applications, PyTorch, HuggingFace Transformers, Open-source evaluation frameworks, Published research on LLM evaluation, Customer Engineer, Forward Deployed Engineer, Sales Engineer, Solutions Architect, Technical Product Manager"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c7da135a-ebe"},"title":"Applied AI, Evaluation Engineer","description":"<p>About the Job</p>\n<p>The Applied AI team is Mistral&#39;s customer-facing technical organization. We work directly with enterprise clients from pre-sales through implementation to deploy cutting-edge AI solutions that deliver measurable business impact.</p>\n<p>As our first Evaluation Engineer, you&#39;ll design the methodology, build the infrastructure, and define what &#39;ready for production&#39; means across verticals and use cases. You will design and implement evaluation systems that help our customers understand model performance across their specific use cases, build robust evaluation infrastructure, and work closely with both research and customer-facing teams.</p>\n<p>Research builds evals for frontier capabilities, but customers don&#39;t care about MMLU scores. In Applied AI, we need evals and frameworks for customer reality: domain-specific, risk-aware, production-grade. 
The kind that tells you whether your medical summarization model will hallucinate drug interactions, or whether your legal assistant will invent case citations.</p>\n<p>This role sits at the intersection of research, engineering, and solutions: you will play a critical cross-functional role in measuring, understanding, and improving the capabilities of our models for our enterprise customers.</p>\n<p>Responsibilities</p>\n<ul>\n<li><p>Design and implement comprehensive evaluation frameworks to measure LLM capabilities across diverse customer use cases, including text generation, reasoning, code, and domain-specific applications</p>\n</li>\n<li><p>Build scalable evaluation infrastructure and pipelines that enable rapid, reproducible assessment of model performance</p>\n</li>\n<li><p>Develop novel evaluation methodologies to assess emerging capabilities or verticalized use cases (cybersecurity, finance, healthcare, etc.) and enable the Solutions (Deployment Strategist and Applied AI) on these topics</p>\n</li>\n<li><p>Create custom evaluation suites tailored to enterprise customers&#39; specific needs, working closely with them to understand their requirements and success criteria</p>\n</li>\n<li><p>Collaborate with research teams to translate evaluation insights into model improvements and training decisions</p>\n</li>\n<li><p>Partner with product teams to continuously improve our evaluation tooling based on customer feedback</p>\n</li>\n</ul>\n<p>How We Work in Applied AI</p>\n<ul>\n<li><p>We care about people and outputs</p>\n</li>\n<li><p>What matters is what you ship, not the time you spend on it</p>\n</li>\n<li><p>Bureaucracy is where urgency goes to vanish. You talk to whoever you need to talk to</p>\n</li>\n<li><p>The best idea wins, whether it comes from a principal engineer or someone in their first week</p>\n</li>\n<li><p>Always ask why. The best solutions come from deep understanding, not from copying what worked before</p>\n</li>\n<li><p>We say what we mean. 
Feedback is direct, timely, and given because we care</p>\n</li>\n<li><p>No politics. Low ego, high standards</p>\n</li>\n<li><p>We embrace an unstructured environment and find joy in it</p>\n</li>\n</ul>\n<p>About You</p>\n<ul>\n<li><p>You are fluent in English</p>\n</li>\n<li><p>3+ years of experience in ML evaluation, benchmarking for LLM or agentic systems</p>\n</li>\n<li><p>You have proven experience in AI or machine learning product implementation with APIs, back-end</p>\n</li>\n<li><p>You have deep understanding of concepts and algorithms underlying machine learning and LLMs</p>\n</li>\n<li><p>You have strong technical coding skills in Python</p>\n</li>\n<li><p>You hold strong communication skills with an ability to explain complex technical concepts in simple terms with technical and non-technical audiences</p>\n</li>\n</ul>\n<p>Ideally You Have:</p>\n<ul>\n<li><p>Contributions to open-source evaluation frameworks (e.g., LM Eval Harness, OpenAI Evals) or published research on LLM evaluation</p>\n</li>\n<li><p>Experience as a Customer Engineer, Forward Deployed Engineer, Sales Engineer, Solutions Architect or Technical Product Manager</p>\n</li>\n<li><p>Experience with ML frameworks (PyTorch, HuggingFace Transformers)</p>\n</li>\n</ul>\n<p>Benefits</p>\n<ul>\n<li><p>PTO: The CDI contract will be a &#39;Forfait 218 jours&#39;, corresponding to 25 days of holidays and on average 8 to 10 days of RTT days, and complete autonomy on working hours</p>\n</li>\n<li><p>Health: Full health insurance coverage for you and your family</p>\n</li>\n<li><p>Transportation: We offer a €600 annual mobility allowance. 
This package covers 50% of your public transportation costs and includes the Sustainable Mobility Allowance (FMD), encouraging eco-friendly travel options such as cycling or carpooling</p>\n</li>\n<li><p>Food: Swile meal vouchers with 10,83€ per worked day, including 60% offered by the company</p>\n</li>\n<li><p>Sport: Gymlib - sponsorship by Mistral of a significant part of the monthly fee (depending on the program you choose)</p>\n</li>\n<li><p>Parental policy: 4 additional weeks for parents on top of what is offered by the French state</p>\n</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c7da135a-ebe","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Mistral AI","sameAs":"https://mistral.ai","logo":"https://logos.yubhub.co/mistral.ai.png"},"x-apply-url":"https://jobs.lever.co/mistral/e0db3860-0a80-47a8-958a-f8e62f3bb59c","x-work-arrangement":"onsite","x-experience-level":"entry","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["ML evaluation","benchmarking for LLM or agentic systems","AI or machine learning product implementation with APIs, back-end","Python","evaluation frameworks","open-source evaluation frameworks","PyTorch","HuggingFace Transformers"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:46:40.159Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Paris"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"ML evaluation, benchmarking for LLM or agentic systems, AI or machine learning product implementation with APIs, back-end, Python, evaluation frameworks, open-source evaluation frameworks, PyTorch, HuggingFace Transformers"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_62efca6f-b6f"},"title":"Senior AI 
Engineer","description":"<p>We&#39;re looking for a Senior AI Engineer who is obsessed with building AI systems that actually work in production: reliable, observable, cost-efficient, and genuinely useful. This is not a research role. You will ship AI-powered features that process real financial data for real businesses.</p>\n<p>LLM &amp; AI Pipeline Engineering - Design, build, and maintain production-grade LLM integration pipelines, including retrieval-augmented generation (RAG), prompt engineering, output parsing, and chain orchestration.</p>\n<p>Develop and operate AI features within Jeeves&#39;s core financial products: spend categorization, document extraction, anomaly detection, financial Q&amp;A, and automated reconciliation.</p>\n<p>Implement structured output validation, fallback handling, and confidence scoring to ensure AI decisions meet reliability standards for financial use cases.</p>\n<p>Evaluate and integrate AI frameworks and tools (LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases) and advocate for the right tool for the job.</p>\n<p>Establish prompt versioning and evaluation practices to ensure AI outputs remain accurate and consistent as models and data evolve.</p>\n<p>Retrieval &amp; Vector Search - Design and maintain vector search pipelines using databases such as Pinecone, Weaviate, or pgvector to power semantic search and RAG-based features.</p>\n<p>Build document ingestion and chunking pipelines for Jeeves&#39;s financial data, processing invoices, receipts, policy documents, and transaction records.</p>\n<p>Optimize retrieval quality through embedding model selection, chunk strategy, metadata filtering, and re-ranking techniques.</p>\n<p>ML Model Serving &amp; Operations - Collaborate with data scientists to take trained ML models from experimental notebooks to production serving infrastructure.</p>\n<p>Build and maintain model serving endpoints with appropriate latency SLOs, input validation, and output 
monitoring.</p>\n<p>Implement model performance monitoring and data drift detection to ensure production models remain accurate over time.</p>\n<p>Support model retraining workflows by designing clean data pipelines and feature engineering that can be continuously updated.</p>\n<p>Backend Integration &amp; Reliability - Integrate AI services cleanly with Jeeves&#39;s backend microservices, designing clear API contracts, circuit breakers, and graceful degradation patterns.</p>\n<p>Write high-quality, testable backend code in Python or Go/Node.js to power AI-integrated features.</p>\n<p>Instrument AI components with structured logging, distributed tracing, latency dashboards, and alerting to ensure operational visibility.</p>\n<p>Collaboration &amp; Growth - Partner with Product, Backend Engineering, and Data Science to define the AI roadmap and translate requirements into reliable systems.</p>\n<p>Contribute to a culture of quality by writing design docs, reviewing peers&#39; AI system designs, and sharing learnings openly.</p>\n<p>Help grow the AI engineering practice at Jeeves by establishing patterns, tooling, and best practices that the broader team can build on.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_62efca6f-b6f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Jeeves","sameAs":"https://www.jeeves.com/","logo":"https://logos.yubhub.co/jeeves.com.png"},"x-apply-url":"https://jobs.lever.co/tryjeeves/ded9e04e-f18e-4d4c-ae43-4b7882c6200b","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["LLM","AI","Python","LangChain","LlamaIndex","OpenAI API","Anthropic API","HuggingFace","vector databases","Pinecone","Weaviate","pgvector","semantic search","RAG-based features","document ingestion","chunking pipelines","embedding model 
selection","chunk strategy","metadata filtering","re-ranking techniques","model serving infrastructure","latency SLOs","input validation","output monitoring","model performance monitoring","data drift detection","clean data pipelines","feature engineering","API contracts","circuit breakers","graceful degradation patterns","structured logging","distributed tracing","latency dashboards","alerting"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:39:23.341Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"India"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"LLM, AI, Python, LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases, Pinecone, Weaviate, pgvector, semantic search, RAG-based features, document ingestion, chunking pipelines, embedding model selection, chunk strategy, metadata filtering, re-ranking techniques, model serving infrastructure, latency SLOs, input validation, output monitoring, model performance monitoring, data drift detection, clean data pipelines, feature engineering, API contracts, circuit breakers, graceful degradation patterns, structured logging, distributed tracing, latency dashboards, alerting"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_e2350d04-53f"},"title":"Senior AI Engineer","description":"<p>We&#39;re looking for a Senior AI Engineer who is obsessed with building AI systems that actually work in production: reliable, observable, cost-efficient, and genuinely useful. This is not a research role. 
You will ship AI-powered features that process real financial data for real businesses.</p>\n<p>LLM &amp; AI Pipeline Engineering - Design, build, and maintain production-grade LLM integration pipelines, including retrieval-augmented generation (RAG), prompt engineering, output parsing, and chain orchestration.</p>\n<p>Develop and operate AI features within Jeeves&#39;s core financial products: spend categorization, document extraction, anomaly detection, financial Q&amp;A, and automated reconciliation.</p>\n<p>Implement structured output validation, fallback handling, and confidence scoring to ensure AI decisions meet reliability standards for financial use cases.</p>\n<p>Evaluate and integrate AI frameworks and tools (LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases) and advocate for the right tool for the job.</p>\n<p>Establish prompt versioning and evaluation practices to ensure AI outputs remain accurate and consistent as models and data evolve.</p>\n<p>Retrieval &amp; Vector Search - Design and maintain vector search pipelines using databases such as Pinecone, Weaviate, or pgvector to power semantic search and RAG-based features.</p>\n<p>Build document ingestion and chunking pipelines for Jeeves&#39;s financial data, processing invoices, receipts, policy documents, and transaction records.</p>\n<p>Optimize retrieval quality through embedding model selection, chunk strategy, metadata filtering, and re-ranking techniques.</p>\n<p>ML Model Serving &amp; Operations - Collaborate with data scientists to take trained ML models from experimental notebooks to production serving infrastructure.</p>\n<p>Build and maintain model serving endpoints with appropriate latency SLOs, input validation, and output monitoring.</p>\n<p>Implement model performance monitoring and data drift detection to ensure production models remain accurate over time.</p>\n<p>Support model retraining workflows by designing clean data pipelines and feature 
engineering that can be continuously updated.</p>\n<p>Backend Integration &amp; Reliability - Integrate AI services cleanly with Jeeves&#39;s backend microservices, designing clear API contracts, circuit breakers, and graceful degradation patterns.</p>\n<p>Write high-quality, testable backend code in Python or Go/Node.js to power AI-integrated features.</p>\n<p>Instrument AI components with structured logging, distributed tracing, latency dashboards, and alerting to ensure operational visibility.</p>\n<p>Collaboration &amp; Growth - Partner with Product, Backend Engineering, and Data Science to define the AI roadmap and translate requirements into reliable systems.</p>\n<p>Contribute to a culture of quality by writing design docs, reviewing peers&#39; AI system designs, and sharing learnings openly.</p>\n<p>Help grow the AI engineering practice at Jeeves by establishing patterns, tooling, and best practices that the broader team can build on.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_e2350d04-53f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Jeeves","sameAs":"https://www.jeeves.com/","logo":"https://logos.yubhub.co/jeeves.com.png"},"x-apply-url":"https://jobs.lever.co/tryjeeves/66241934-7138-4d7d-8b05-a211ec5d6e24","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["LLM","AI","Python","LangChain","LlamaIndex","OpenAI API","Anthropic API","HuggingFace","vector databases","Pinecone","Weaviate","pgvector","PostgreSQL","async patterns","cloud infrastructure","AWS","GCP","Azure","structured logging","distributed tracing","latency 
dashboards","alerting"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:38:54.694Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Colombia"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"LLM, AI, Python, LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases, Pinecone, Weaviate, pgvector, PostgreSQL, async patterns, cloud infrastructure, AWS, GCP, Azure, structured logging, distributed tracing, latency dashboards, alerting"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d477874c-cf5"},"title":"Senior AI Engineer","description":"<p>We&#39;re looking for a Senior AI Engineer who is obsessed with building AI systems that actually work in production: reliable, observable, cost-efficient, and genuinely useful. This is not a research role. You will ship AI-powered features that process real financial data for real businesses.</p>\n<p>LLM &amp; AI Pipeline Engineering - Design, build, and maintain production-grade LLM integration pipelines, including retrieval-augmented generation (RAG), prompt engineering, output parsing, and chain orchestration.</p>\n<p>Develop and operate AI features within Jeeves&#39;s core financial products: spend categorization, document extraction, anomaly detection, financial Q&amp;A, and automated reconciliation.</p>\n<p>Implement structured output validation, fallback handling, and confidence scoring to ensure AI decisions meet reliability standards for financial use cases.</p>\n<p>Evaluate and integrate AI frameworks and tools (LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases) and advocate for the right tool for the job.</p>\n<p>Establish prompt versioning and evaluation practices to ensure AI outputs remain accurate and consistent as models and data evolve.</p>\n<p>Retrieval &amp; Vector 
Search - Design and maintain vector search pipelines using databases such as Pinecone, Weaviate, or pgvector to power semantic search and RAG-based features.</p>\n<p>Build document ingestion and chunking pipelines for Jeeves&#39;s financial data, processing invoices, receipts, policy documents, and transaction records.</p>\n<p>Optimize retrieval quality through embedding model selection, chunk strategy, metadata filtering, and re-ranking techniques.</p>\n<p>ML Model Serving &amp; Operations - Collaborate with data scientists to take trained ML models from experimental notebooks to production serving infrastructure.</p>\n<p>Build and maintain model serving endpoints with appropriate latency SLOs, input validation, and output monitoring.</p>\n<p>Implement model performance monitoring and data drift detection to ensure production models remain accurate over time.</p>\n<p>Support model retraining workflows by designing clean data pipelines and feature engineering that can be continuously updated.</p>\n<p>Backend Integration &amp; Reliability - Integrate AI services cleanly with Jeeves&#39;s backend microservices, designing clear API contracts, circuit breakers, and graceful degradation patterns.</p>\n<p>Write high-quality, testable backend code in Python or Go/Node.js to power AI-integrated features.</p>\n<p>Instrument AI components with structured logging, distributed tracing, latency dashboards, and alerting to ensure operational visibility.</p>\n<p>Collaboration &amp; Growth - Partner with Product, Backend Engineering, and Data Science to define the AI roadmap and translate requirements into reliable systems.</p>\n<p>Contribute to a culture of quality by writing design docs, reviewing peers&#39; AI system designs, and sharing learnings openly.</p>\n<p>Help grow the AI engineering practice at Jeeves by establishing patterns, tooling, and best practices that the broader team can build on.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping 
automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d477874c-cf5","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Jeeves","sameAs":"https://www.jeeves.com/","logo":"https://logos.yubhub.co/jeeves.com.png"},"x-apply-url":"https://jobs.lever.co/tryjeeves/639e39d0-b357-4bc2-aff2-968cdedb14b6","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["LLM","AI","Python","Go","Node.js","Pinecone","Weaviate","pgvector","LangChain","LlamaIndex","OpenAI API","Anthropic API","HuggingFace","vector databases","API contracts","circuit breakers","graceful degradation patterns","structured logging","distributed tracing","latency dashboards","alerting"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:38:44.910Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Argentina"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Finance","skills":"LLM, AI, Python, Go, Node.js, Pinecone, Weaviate, pgvector, LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases, API contracts, circuit breakers, graceful degradation patterns, structured logging, distributed tracing, latency dashboards, alerting"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_d48b0655-2fa"},"title":"Data/Infrastructure Advocate Engineer","description":"<p>At Hugging Face, we&#39;re on a journey to democratise good AI. As our first Data/Infrastructure Advocate Engineer, you&#39;ll bridge the gap between cutting-edge data infrastructure and the global community of data engineers, researchers, and developers.</p>\n<p>You&#39;ll champion Xet storage on the Hugging Face Hub, empowering users to efficiently store, version, and collaborate on large-scale datasets. 
This role is for someone who thrives at the intersection of technical depth (storage, Parquet, deduplication) and community advocacy—helping define the future of open data workflows.</p>\n<p>Your main missions will be:</p>\n<ul>\n<li>Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organise events or challenges.</li>\n<li>Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet.</li>\n<li>Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub&#39;s value for data workflows.</li>\n<li>Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning.</li>\n<li>Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.</li>\n<li>Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.</li>\n<li>Share insights on storage optimisation, dataset versioning, and deduplication to empower developers.</li>\n<li>Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.</li>\n<li>Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.</li>\n</ul>\n<p><strong>About you</strong></p>\n<p>You&#39;re a great fit if you:</p>\n<ul>\n<li>Have strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).</li>\n<li>Are a hands-on builder who loves experimenting with data tools, storage optimisation, and dataset versioning.</li>\n<li>Can clearly explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.</li>\n<li>Are active in developer 
communities (GitHub, Discord, forums) and passionate about open source and knowledge sharing.</li>\n<li>Thrive in fast-moving environments and enjoy building in public to inspire others.</li>\n</ul>\n<p>If you&#39;re interested in joining us but don&#39;t tick every box above, we still encourage you to apply! We&#39;re building a diverse team whose skills, experiences, and backgrounds complement one another.</p>\n<p><strong>More about Hugging Face</strong></p>\n<p>We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where you feel respected and supported—regardless of who you are or where you come from.</p>\n<p>Hugging Face is an equal opportunity employer, and we do not discriminate based on race, ethnicity, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status, or ability status.</p>\n<p>We value development. You will work with some of the smartest people in our industry.</p>\n<p>We provide all employees with reimbursement for relevant conferences, training, and education.</p>\n<p>We care about your well-being. We offer flexible working hours and remote options.</p>\n<p>We offer health, dental, and vision benefits for employees and their dependents.</p>\n<p>We also offer parental leave and flexible paid time off.</p>\n<p>We support our employees wherever they are. While we have office spaces in NYC and Paris, we&#39;re very distributed, and all remote employees have the opportunity to visit our offices.</p>\n<p>If needed, we&#39;ll also outfit your workstation to ensure you succeed.</p>\n<p>We want our teammates to be shareholders. 
All employees have company equity as part of their compensation package.</p>\n<p>If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_d48b0655-2fa","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Hugging Face","sameAs":"https://huggingface.co/"},"x-apply-url":"https://apply.workable.com/j/5CA82A9A98","x-work-arrangement":"remote","x-experience-level":"entry","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Python","data libraries","pandas","pyarrow","huggingface/datasets","storage systems","Parquet","Open Table Formats","S3"],"x-skills-preferred":[],"datePosted":"2026-03-10T11:34:41.656Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, data libraries, pandas, pyarrow, huggingface/datasets, storage systems, Parquet, Open Table Formats, S3"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_f81a1dc8-ca4"},"title":"Data/Infrastructure Advocate Engineer - EMEA Remote","description":"<p>At Hugging Face, we&#39;re on a journey to democratize good AI. We are building the fastest growing platform for AI builders with over 5 million users &amp; 100k organisations who collectively shared over 1M models, 300k datasets &amp; 300k apps. Our open-source libraries have more than 400k stars on GitHub.</p>\n<p>As our first Data/Infrastructure Advocate Engineer, you&#39;ll bridge the gap between cutting-edge data infrastructure and the global community of data engineers, researchers, and developers. 
You&#39;ll champion Xet storage on the Hugging Face Hub, empowering users to efficiently store, version, and collaborate on large-scale datasets.</p>\n<p>This role is for someone who thrives at the intersection of technical depth (storage, Parquet, deduplication) and community advocacy—helping define the future of open data workflows. You&#39;ll collaborate with teams like Datasets, Hub, and Infrastructure to shape how developers interact with data on our platform, and inspire a community to build better, faster, and more scalable data pipelines.</p>\n<p>Your Main Missions:</p>\n<ul>\n<li>Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organise events or challenges. Engage with communities like Apache Parquet, Open Tables Formats, and data engineering forums to promote best practices and Hugging Face tools.</li>\n</ul>\n<ul>\n<li>Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet.</li>\n</ul>\n<ul>\n<li>Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows.</li>\n</ul>\n<ul>\n<li>Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning.</li>\n</ul>\n<ul>\n<li>Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.</li>\n</ul>\n<ul>\n<li>Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.</li>\n</ul>\n<ul>\n<li>Share insights on storage optimisation, dataset versioning, and deduplication to empower developers.</li>\n</ul>\n<ul>\n<li>Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.</li>\n</ul>\n<ul>\n<li>Ensure datasets and tools released on the Hub are 
well-documented, with clear examples, benchmarks, and use cases.</li>\n</ul>\n<p><strong>About you</strong></p>\n<p>You’re a great fit if you:</p>\n<ul>\n<li>Have strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).</li>\n</ul>\n<ul>\n<li>Are a hands-on builder who loves experimenting with data tools, storage optimisation, and dataset versioning.</li>\n</ul>\n<ul>\n<li>Can clearly explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.</li>\n</ul>\n<ul>\n<li>Are active in developer communities (GitHub, Discord, forums) and passionate about open source and knowledge sharing.</li>\n</ul>\n<ul>\n<li>Thrive in fast-moving environments and enjoy building in public to inspire others.</li>\n</ul>\n<p>If you&#39;re interested in joining us but don&#39;t tick every box above, we still encourage you to apply! We&#39;re building a diverse team whose skills, experiences, and backgrounds complement one another. We&#39;re happy to consider where you might be able to make the biggest impact.</p>\n<p><strong>More about Hugging Face</strong></p>\n<p>We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where you feel respected and supported—regardless of who you are or where you come from. We believe this is foundational to building a great company and community, as well as the future of machine learning more broadly. 
Hugging Face is an equal opportunity employer, and we do not discriminate based on race, ethnicity, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status, or ability status.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_f81a1dc8-ca4","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Hugging Face","sameAs":"https://huggingface.co/"},"x-apply-url":"https://apply.workable.com/j/7C7F63E87A","x-work-arrangement":"remote","x-experience-level":"entry","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["Python","data libraries","pandas","pyarrow","huggingface/datasets","storage systems","Parquet","Open Table Formats","S3"],"x-skills-preferred":[],"datePosted":"2026-03-10T11:34:10.184Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Paris"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, data libraries, pandas, pyarrow, huggingface/datasets, storage systems, Parquet, Open Table Formats, S3"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_67fcb604-29e"},"title":"Applied AI, Evaluation Engineer","description":"<p>About Mistral AI</p>\n<p>At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.</p>\n<p>We are a global organisation with teams distributed between France, USA, UK, Germany, and Singapore. 
Our comprehensive AI platform meets enterprise needs, whether on-premises or in cloud environments.</p>\n<p>Our offerings include le Chat, the AI assistant for life and work.</p>\n<p>About The Job</p>\n<p>The Applied AI team is Mistral&#39;s customer-facing technical organisation. We work directly with enterprise clients from pre-sales through implementation to deploy cutting-edge AI solutions that deliver measurable business impact.</p>\n<p>As our first Evaluation Engineer, you&#39;ll design the methodology, build the infrastructure, and define what &#39;ready for production&#39; means across verticals and use cases. You will design and implement evaluation systems that help our customers understand model performance across their specific use cases, build robust evaluation infrastructure, and work closely with both research and customer-facing teams.</p>\n<p>Research builds evals for frontier capabilities, but customers don&#39;t care about MMLU scores. In Applied AI, we need evals and frameworks for customer reality: domain-specific, risk-aware, production-grade. The kind that tell you whether your medical summarization model will hallucinate drug interactions, or whether your legal assistant will invent case citations.</p>\n<p>This role sits at the intersection of research, engineering, and solutions. You will play a critical cross-functional role in measuring, understanding, and improving the capabilities of our models for our enterprise customers.</p>\n<p>Responsibilities</p>\n<ul>\n<li><p>Design and implement comprehensive evaluation frameworks to measure LLM capabilities across diverse customer use cases, including text generation, reasoning, code, and domain-specific applications</p>\n</li>\n<li><p>Build scalable evaluation infrastructure and pipelines that enable rapid, reproducible assessment of model performance</p>\n</li>\n<li><p>Develop novel evaluation methodologies to assess emerging capabilities or verticalized use cases (cybersecurity, finance, healthcare, etc.) 
and enable the Solutions (Deployment Strategist and Applied AI) on these topics</p>\n</li>\n<li><p>Create custom evaluation suites tailored to enterprise customers&#39; specific needs, working closely with them to understand their requirements and success criteria</p>\n</li>\n<li><p>Collaborate with research teams to translate evaluation insights into model improvements and training decisions</p>\n</li>\n<li><p>Partner with product teams to continuously improve our evaluation tooling based on customer feedback</p>\n</li>\n</ul>\n<p>How We Work in Applied AI</p>\n<ul>\n<li><p>We care about people and outputs</p>\n</li>\n<li><p>What matters is what you ship, not the time you spend on it</p>\n</li>\n<li><p>Bureaucracy is where urgency goes to vanish. You talk to whoever you need to talk to. The best idea wins, whether it comes from a principal engineer or someone in their first week</p>\n</li>\n<li><p>Always ask why. The best solutions come from deep understanding, not from copying what worked before</p>\n</li>\n<li><p>We say what we mean. Feedback is direct, timely, and given because we care</p>\n</li>\n<li><p>No politics. 
Low ego, high standards</p>\n</li>\n<li><p>We embrace an unstructured environment and find joy in it</p>\n</li>\n</ul>\n<p>About You</p>\n<ul>\n<li><p>You are fluent in English</p>\n</li>\n<li><p>3+ years of experience in ML evaluation, benchmarking for LLM or agentic systems</p>\n</li>\n<li><p>You have proven experience in AI or machine learning product implementation with APIs, back-end</p>\n</li>\n<li><p>You have deep understanding of concepts and algorithms underlying machine learning and LLMs</p>\n</li>\n<li><p>You have strong technical coding skills in Python</p>\n</li>\n</ul>\n<p>Ideally You Have:</p>\n<ul>\n<li><p>Contributions to open-source evaluation frameworks (e.g., LM Eval Harness, OpenAI Evals) or published research on LLM evaluation</p>\n</li>\n<li><p>Experience as a Customer Engineer, Forward Deployed Engineer, Sales Engineer, Solutions Architect or Technical Product Manager</p>\n</li>\n<li><p>Experience with ML frameworks (PyTorch, HuggingFace Transformers)</p>\n</li>\n</ul>\n<p>Benefits</p>\n<p>PTO: The CDI contract will be a &#39;Forfait 218 jours&#39;, corresponding to 25 days of holidays and on average 8 to 10 days of RTT days, and complete autonomy on working hours</p>\n<p>Health: Full health insurance coverage for you and your family</p>\n<p>Transportation: We offer a €600 annual mobility allowance. 
This package covers 50% of your public transportation costs and includes the Sustainable Mobility Allowance (FMD), encouraging eco-friendly travel options such as cycling or carpooling</p>\n<p>Food: Swile meal vouchers with 10,83€ per worked day, incl 60% offered by company</p>\n<p>Sport: Gymlib - sponsorship by Mistral of a significant part of the monthly fee (depending on the program you chose)</p>\n<p>Parental policy: 4 additional weeks for parents on top of what is offered by the French state</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_67fcb604-29e","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Mistral AI","sameAs":"https://mistral.ai"},"x-apply-url":"https://jobs.lever.co/mistral/e0db3860-0a80-47a8-958a-f8e62f3bb59c","x-work-arrangement":"onsite","x-experience-level":"entry","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["ML evaluation","benchmarking","LLM","agentic systems","AI","machine learning","APIs","back-end","Python","PyTorch","HuggingFace Transformers"],"x-skills-preferred":[],"datePosted":"2026-03-10T11:25:45.717Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Paris"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"ML evaluation, benchmarking, LLM, agentic systems, AI, machine learning, APIs, back-end, Python, PyTorch, HuggingFace Transformers"}]}