{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/data-ecosystem"},"x-facet":{"type":"skill","slug":"data-ecosystem","display":"Data Ecosystem","count":5},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_4c199de2-f4c"},"title":"Staff Backend Engineer, Core Entities Data Foundation","description":"<p>Job Title: Staff Backend Engineer, Core Entities Data Foundation</p>\n<p>Location: Remote - US</p>\n<p>Department: Software Engineering</p>\n<p>Job Description:</p>\n<p>Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe.</p>\n<p>Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way.</p>\n<p>The Community You Will Join:</p>\n<p>Marketplaces Data and AI is a group of passionate machine learning, software, data, and analytics engineers. We are responsible for developing new, cutting-edge AI and data products that leverage Airbnb’s massive datasets across Users, Listings, Pricing, and Supply/Demand.</p>\n<p>You will be a crucial part of the Guest and Host organization, developing a semantic model for our Core Entities and Events that powers the experiences of millions of guests and hosts globally.</p>\n<p>The Difference You Will Make:</p>\n<p>You will own some of the most critical data systems at Airbnb, building a semantic model that autonomously drives both infrastructure and code changes. This role will focus on building new capabilities in our data ecosystem: a platform that proactively detects issues, orchestrates solutions and seamlessly integrates human-in-the-loop workflows for expert guidance.</p>\n<p>Your contributions will shift Airbnb from storing massive amounts of data to intelligently organizing and utilizing it, empowering our product and operations teams to move faster and deliver a resilient, high quality experience for our global community of Guests and Hosts.</p>\n<p>A Typical Day:</p>\n<ul>\n<li>Develop an actionable technical strategy from our ambitious vision to drive infrastructure and code from a semantic model of the business</li>\n</ul>\n<ul>\n<li>Improve and expand our detection systems to find more complex issues and integrate human-in-the-loop workflows for subject matter expert guidance</li>\n</ul>\n<ul>\n<li>Architect and develop systems that autonomously orchestrate infrastructure and code changes based on findings from our detection platforms</li>\n</ul>\n<ul>\n<li>Partner closely with Machine Learning, Data Engineering, and Product teams to ensure deep integration in the product space, not just a siloed project.</li>\n</ul>\n<ul>\n<li>Identify areas for improvement, champion the adoption of best practices in engineering architecture, perform technical design reviews, and enhance our software engineers across team boundaries.</li>\n</ul>\n<ul>\n<li>Research the latest innovations in semantic modeling and AI-driven infrastructure, actively sharing these insights to act as a thought leader within Airbnb’s engineering organization.</li>\n</ul>\n<p>Your Expertise:</p>\n<ul>\n<li>9+ years of relevant software development industry experience in a fast-paced tech environment</li>\n</ul>\n<ul>\n<li>BS, MS or PhD in CS or related field</li>\n</ul>\n<ul>\n<li>Expertise with backend systems in large-scale service-oriented architectures</li>\n</ul>\n<ul>\n<li>Good judgment in making tradeoffs to balance short-term business needs with long-term technical quality</li>\n</ul>\n<ul>\n<li>Strong understanding of how deep backend systems are expressed in the UX shown to customers</li>\n</ul>\n<ul>\n<li>End-to-end mentality that transcends team boundaries and helps find globally optimal solutions</li>\n</ul>\n<ul>\n<li>Excellent communication skills and the ability to work well within a team and with teams across the engineering organization</li>\n</ul>\n<ul>\n<li>Passionate about efficiency, availability, system quality and user experience</li>\n</ul>\n<p>Your Location:</p>\n<p>This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. Click here for the up-to-date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list . If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from.</p>\n<p>Our Commitment To Inclusion &amp; Belonging:</p>\n<p>Airbnb is committed to working with the broadest talent pool possible. We believe diverse ideas foster innovation and engagement, and allow us to attract creatively-led people, and to develop the best products, services and solutions. All qualified individuals are encouraged to apply. We strive to also provide a disability inclusive application and interview process. If you are a candidate with a disability and require reasonable accommodation in order to submit an application, please contact us at: reasonableaccommodations@airbnb.com. Please include your full name, the role you’re applying for and the accommodation necessary to assist you with the recruiting process. We ask that you only reach out to us if you are a candidate whose disability prevents you from being able to complete our online application.</p>\n<p>How We&#39;ll Take Care of You:</p>\n<p>Our job titles may span more than one career level. The actual base pay is dependent upon many factors, such as: training, transferable skills, work experience, business needs and market demands. The base pay range is subject to change and may be modified in the future. This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.</p>\n<p>Pay Range $212,000-$265,000 USD</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_4c199de2-f4c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Airbnb","sameAs":"https://www.airbnb.com/","logo":"https://logos.yubhub.co/airbnb.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/airbnb/jobs/7774153","x-work-arrangement":"remote","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$212,000-$265,000 USD","x-skills-required":["backend systems","large-scale service-oriented architectures","semantic modeling","AI-driven infrastructure","data systems","infrastructure and code changes","human-in-the-loop workflows","expert guidance","data ecosystem","detecting issues","orchestrating solutions","technical design reviews","engineering architecture","software engineers","team boundaries","globally optimal solutions","communication skills","team collaboration"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:55:43.689Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote - US"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"backend systems, large-scale service-oriented architectures, semantic modeling, AI-driven infrastructure, data systems, infrastructure and code changes, human-in-the-loop workflows, expert guidance, data ecosystem, detecting issues, orchestrating solutions, technical design reviews, engineering architecture, software engineers, team boundaries, globally optimal solutions, communication skills, team collaboration","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":212000,"maxValue":265000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_7eb73baf-db6"},"title":"Engineering Manager - Streaming","description":"<p>We are seeking a dedicated Engineering Leader to spearhead Spark Structured Streaming development initiatives. Your primary mission will be to make Spark Structured Streaming state of the art Stream Processing engine by adding advanced features such as sophisticated state management, new operators and making the engine performance both from latency and throughput point of view by reimagining engine architecture.</p>\n<p>Key responsibilities include:</p>\n<ul>\n<li>Leading a talented engineering team in Spark Structured Streaming team developing and promoting the engine in OSS and the Databricks Data Intelligence Platform</li>\n<li>Overseeing sustained recruitment of top-tier talent, and upskilling talent on the team</li>\n<li>Implementing robust processes to efficiently execute product vision, strategy, and roadmap in alignment with organisational goals and priorities</li>\n<li>Build software that is not just high quality but easy to operate</li>\n<li>Make company wide impact by driving Stream Processing adoption across the Databricks product portfolio</li>\n<li>Manage technical debt, including long term technical architecture decisions and balance product roadmap</li>\n</ul>\n<p>What we look for:</p>\n<ul>\n<li>5+ years experience working in a related system, streaming, query processing, query optimisation, including big-data ecosystem, Apache Spark or database internal</li>\n<li>A passion for database systems, storage systems, distributed systems, language design, or performance optimisation</li>\n<li>Can ensure the team builds high quality and reliable infrastructure services. Experience being responsible for testing, quality, and SLAs of a product</li>\n<li>Previous experience building and leading teams in a complex technical domain, such as on distributed data systems or database internals</li>\n<li>Ability to attract, hire, and coach engineers who meet the Databricks hiring standards. Can up level existing team via hiring top-notch senior talent, growing leaders and helping struggling members. Can gain trust of the team and guide their careers</li>\n<li>Comfortable working cross functionally with product management and directly with customers; ability to deeply understand product and customer personas</li>\n</ul>\n<p>Pay Range Transparency</p>\n<p>Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_7eb73baf-db6","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Databricks","sameAs":"https://databricks.com","logo":"https://logos.yubhub.co/databricks.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/databricks/jobs/8324875002","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$181,000-$253,750 USD","x-skills-required":["Apache Spark","Streaming","Query processing","Query optimisation","Big-data ecosystem","Database internal"],"x-skills-preferred":[],"datePosted":"2026-04-18T15:50:40.103Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Bellevue, Washington"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Apache Spark, Streaming, Query processing, Query optimisation, Big-data ecosystem, Database internal","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":181000,"maxValue":253750,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_7b750523-8ff"},"title":"Staff Software Engineer, Data Engineering","description":"<p>We are seeking a Staff Software Engineer to lead the technical strategy and implementation of our enterprise data architecture, governance foundations, and analytics enablement tooling.</p>\n<p>In this role, you will be the primary engineering counterpart to the Senior Product Manager for Data Enablement &amp; Governance, jointly shaping the roadmap for enterprise analytics, shared definitions, and the tools that help Omada answer questions faster and more reliably.</p>\n<p>You will design and evolve core data products, define patterns and standards used across the company, and drive the technical execution of initiatives that ensure our metrics, reports, and data products are scalable, governed, and trustworthy.</p>\n<p>This is a high-impact, cross-functional Staff role working across Data Engineering, Data Science, Analytics, Product, IT, and business leaders.</p>\n<p><strong>Key Responsibilities:</strong></p>\n<p><strong>Enterprise Data Architecture</strong></p>\n<ul>\n<li>Own the vision and technical roadmap for Omada&#39;s enterprise data architecture, spanning ingestion, storage, modeling, and serving layers for analytics and applied statistics use cases.</li>\n<li>Design, implement, and evolve scalable, secure, and cost-efficient data solutions (datalakes, warehouses, marts, semantic layers) that support governed, cross-functional analytics and self-service.</li>\n<li>Define and socialize architectural patterns, data contracts, and integration standards used by data and product teams across the organization.</li>\n<li>Anticipate future needs (e.g., new product lines, new modalities, AI/ML workloads) and drive proactive architectural changes rather than reacting to incidents or point-in-time requests.</li>\n</ul>\n<p><strong>Data Modeling, Quality, and Governance Foundations</strong></p>\n<ul>\n<li>Lead the design of logical and physical data models to support enterprise metrics, dashboards, and ad hoc analytics, with a focus on reusability and clear ownership.</li>\n<li>Implement robust data quality, validation, and monitoring frameworks that underpin trusted “single source of truth” definitions for core concepts (e.g., active member, MAU, GLP-1 member).</li>\n<li>Partner with the Senior Product Manager, Data Enablement &amp; Governance to translate governance decisions (definitions, ownership, change-management processes) into concrete technical implementations in the data platform.</li>\n<li>Set standards and review mechanisms to ensure new pipelines, marts, and reports align with enterprise definitions and governance policies.</li>\n<li>Continuously improve performance, scalability, and cost-efficiency of data workflows and storage; lead deep dives and remediation for complex production issues.</li>\n</ul>\n<p><strong>Enterprise Data Products Lifecycle</strong></p>\n<ul>\n<li>In close partnership with the Senior PM, define and deliver core, reusable data products (e.g., engagement, clinical, financial, client, care delivery datasets) that power dashboards, reporting, and self-service analytics.</li>\n<li>Co-Architect and implement technical foundations for AI-assisted analytics tools, governed semantic layers, and reporting applications that make analysts and business users more efficient.</li>\n<li>Partner with Product and Engineering teams owning tools like Amplitude, Tableau, and internal reporting tools to ensure consistent instrumentation, mapping to enterprise definitions, and scalable access patterns.</li>\n<li>Translate business and product requirements into resilient schemas, data services, and interfaces that are usable, maintainable, and auditable.</li>\n<li>Ensure production data delivery meets defined SLAs and supports downstream BI, reporting apps, and applied statistics workloads.</li>\n<li>Play a key role in cross-functional forums (e.g., Data Governance Committee, analytics communities) as the technical voice for feasibility, risk, and long-term platform health.</li>\n</ul>\n<p><strong>Technical Leadership, Mentorship, and Culture</strong></p>\n<ul>\n<li>Lead large, multi-team technical initiatives,from design to implementation and rollout,setting a high bar for design docs, reviews, and execution quality.</li>\n<li>Mentor senior and mid-level engineers, elevating the team’s skills in data modeling, pipeline design, governance, and platform thinking.</li>\n<li>Help shape playbooks for how product squads and spokes engage with central data teams on new metrics, data products, and applied stats projects.</li>\n<li>Partner closely with Analytics, Data Science, Product, and business leaders to ensure data architecture and governance decisions are aligned with company OKRs and measurable business value.</li>\n<li>Proactively identify complexity, duplication, and fragility in existing systems; drive simplification and standardization with sustainable solutions.</li>\n<li>Model Omada’s values in day-to-day work, fostering a culture of trust, context-seeking, bold thinking, and high-impact delivery.</li>\n</ul>\n<p><strong>About You:</strong></p>\n<ul>\n<li>8+ years of experience building, maintaining, and orchestrating scalable data platforms and high-quality production pipelines, including significant experience in analytics or warehousing environments.</li>\n<li>Demonstrated Staff-level impact: leading cross-team technical initiatives, making architectural decisions that shaped a multi-year roadmap, and influencing stakeholders beyond your immediate team.</li>\n<li>Deep experience with cloud data ecosystems (e.g., AWS) and modern data warehouses (e.g., Redshift, Snowflake, BigQuery), including MPP query optimization.</li>\n<li>Strong background in data modeling for OLTP and OLAP, and designing reusable data products for BI, reporting, and advanced analytics.</li>\n<li>Hands-on experience implementing data quality, observability, and governance frameworks, ideally in a regulated or PHI/PII-sensitive environment.</li>\n<li>Experience partnering with Product Management and Analytics to define and deliver platform capabilities, not just point solutions.</li>\n</ul>\n<p><strong>Technical Skills:</strong></p>\n<ul>\n<li>Strong proficiency in SQL (analytical and performance-tuned) and experience with relational and MPP databases.</li>\n<li>Proficiency in at least one modern programming language used in data engineering (e.g., Python, Java, Scala) and comfort applying software engineering best practices (testing, CI/CD, code review).</li>\n<li>Experience with workflow orchestration and data integration tools (e.g., Airflow) and event-driven or streaming patterns where appropriate.</li>\n<li>Familiarity with BI and analytics tools (e.g., Tableau, Amplitude, or similar) and how they integrate with governed data layers.</li>\n<li>Experience with data governance concepts (ownership, lineage, definitions, access controls) and their technical implementation in a modern data stack.</li>\n<li>Familiarity with AI tools for development.</li>\n</ul>\n<p><strong>Communication &amp; Working Style:</strong></p>\n<ul>\n<li>Excellent communication and collaboration skills, with the ability to convey complex technical concepts to non-technical stakeholders.</li>\n<li>Highly self-directed and comfortable operating in ambiguous, cross-functional problem spaces, creating clarity and direction where none exists.</li>\n<li>Strong sense of ownership and bias for impact; you care about outcomes for members, customers, and internal users, not just elegant systems.</li>\n</ul>\n<p><strong>Benefits:</strong></p>\n<ul>\n<li>Competitive salary with generous annual cash bonus</li>\n<li>Equity grants</li>\n<li>Remote first work from home culture</li>\n<li>Flexible Time Off to help you recharge</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_7b750523-8ff","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Omada Health","sameAs":"https://www.omadahealth.com/","logo":"https://logos.yubhub.co/omadahealth.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/omadahealth/jobs/7753330","x-work-arrangement":"remote","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["SQL","Cloud data ecosystems","Modern data warehouses","MPP query optimization","Data modeling","Data quality","Data governance","Workflow orchestration","Data integration","Event-driven or streaming patterns","BI and analytics tools","AI tools for development"],"x-skills-preferred":[],"datePosted":"2026-04-17T12:50:06.765Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote, USA"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Healthcare","skills":"SQL, Cloud data ecosystems, Modern data warehouses, MPP query optimization, Data modeling, Data quality, Data governance, Workflow orchestration, Data integration, Event-driven or streaming patterns, BI and analytics tools, AI tools for development"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_24513047-6d1"},"title":"Licensing Executive","description":"<p>About the Role\nWe are seeking an experienced Licensing Executive to join our legal team to lead the sourcing, licensing, and partnership for high-value content (e.g., books, journals, research papers, academic datasets, and multimedia) for AI training, retrieval-augmented generation (RAG) and distribution partnerships.</p>\n<p>In this role, you will drive the acquisition of premium content, negotiate complex multi-pronged agreements, and build long-term relationships with publishers, universities, research institutions, and data providers. You will work closely with cross-functional teams to ensure access to quality and relevant data and content sources that are aligned with Mistral AI’s interests, and enable innovative use of data and content.</p>\n<p>This role is ideal for a results-driven negotiator and strategic thinker with a passion for AI, academic content, and ethical data practices, and a proven track record of closing high-stakes deals in the publishing, technology, or research sectors.</p>\n<p>Key Responsibilities</p>\n<p>Strategic Sourcing &amp; Pipeline Development\n• Build and manage a robust pipeline of high-quality content (e.g., STEM, academic, robotics, multimedia).\n• Qualify and vet data &amp; content providers to ensure compliance with legal (copyright, data provenance) and business (relevance, cost, scalability) requirements.\n• Provide regular reports and analytics on procurement activities, investments, and performance to support data-driven decision-making.</p>\n<p>Licensing &amp; Partnership Management\n• Serve as a key point of contact for external partners (e.g., publishers, universities, and research institutions) to understand their goals and interests, addressing their needs and priorities.\n• Develop multi-pronged relationships (e.g., revenue-sharing, co-development) to create long-term collaboration.\n• Develop new programs that promote fair compensation and sustainability for content creators, owners and curators.</p>\n<p>Cross-Functional Collaboration\n• Collaborate with internal stakeholders (e.g., Science, Product, and Go To Market teams) to understand their needs and ensure procurement activities support their objectives.\n• Evaluate “make vs. buy” options for content sourcing in collaboration with the Human Data team, balancing data development with external access/licensing opportunities</p>\n<p>Required Qualifications and Skills\n• Proven track record of negotiating and closing complex deals ($10M+), including revenue-sharing, licensing, or co-development agreements.\n• Deep understanding of AI training data ecosystems and ability to translate this into business terms.\n• Legal acumen: Understanding of legal concepts involved in data acquisition and content licensing.\n• Strong STEM background (e.g., degree in Science, Technology, Engineering, Mathematics, or related field) and a passion for academic content and research.\n• Excellent communication and stakeholder management skills (experience negotiating with C-level stakeholders), with the ability to build trust and influence partners at all levels.\n• Business acumen with experience in market analysis and financial modeling (e.g., DCF analysis)\n• Fluency in English and French; additional languages (e.g., German) are a plus.\n• Knowledge of global copyright laws.\n• Experience working in a fast-paced, global environment, with distributed teams.</p>\n<p>Nice-to-Have Skills\n• Existing network in the publishing, academic, or research communities (e.g., relationships with major publishers, universities, or data providers).\n• Experience with AI training data, including familiarity with pretraining, RAG, or synthetic data generation.\n• Direct experience working for a tech company sourcing data/content for LLMs\n• Technical literacy in data formats (e.g., JSON, XML), APIs, or content management systems.</p>\n<p>Benefits\n• Competitive cash salary and equity\n• Daily lunch vouchers\n• Monthly contribution to a Gympass subscription\n• Monthly contribution to a mobility pass\n• Full health insurance for you and your family\n• Generous parental leave policy\n• Visa sponsorship</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_24513047-6d1","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Mistral AI","sameAs":"https://mistral.ai","logo":"https://logos.yubhub.co/mistral.ai.png"},"x-apply-url":"https://jobs.lever.co/mistral/b84413c1-00a1-4663-8697-aa6548cc87f8","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["proven track record of negotiating and closing complex deals","deep understanding of AI training data ecosystems","legal acumen","strong STEM background","excellent communication and stakeholder management skills","business acumen","fluency in English and French","knowledge of global copyright laws","experience working in a fast-paced, global environment"],"x-skills-preferred":["existing network in the publishing, academic, or research communities","experience with AI training data","direct experience working for a tech company sourcing data/content for LLMs","technical literacy in data formats, APIs, or content management systems"],"datePosted":"2026-04-17T12:47:36.791Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Paris"}},"employmentType":"FULL_TIME","occupationalCategory":"Legal","industry":"Technology","skills":"proven track record of negotiating and closing complex deals, deep understanding of AI training data ecosystems, legal acumen, strong STEM background, excellent communication and stakeholder management skills, business acumen, fluency in English and French, knowledge of global copyright laws, experience working in a fast-paced, global environment, existing network in the publishing, academic, or research communities, experience with AI training data, direct experience working for a tech company sourcing data/content for LLMs, technical literacy in data formats, APIs, or content management systems"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_54414b3a-610"},"title":"Licensing Executive","description":"<p>About the Role\\nWe are seeking an experienced Licensing Executive to join our legal team to lead the sourcing, licensing, and partnership for high-value content (e.g., books, journals, research papers, academic datasets, and multimedia) for AI training, retrieval-augmented generation (RAG) and distribution partnerships.\\n\\nIn this role, you will drive the acquisition of premium content, negotiate complex multi-pronged agreements, and build long-term relationships with publishers, universities, research institutions, and data providers. You will work closely with cross-functional teams to ensure access to quality and relevant data and content sources that are aligned with Mistral AI’s interests, and enable innovative use of data and content.\\n\\nThis role is ideal for a results-driven negotiator and strategic thinker with a passion for AI, academic content, and ethical data practices, and a proven track record of closing high-stakes deals in the publishing, technology, or research sectors.\\n\\nKey Responsibilities\\n\\nStrategic Sourcing &amp; Pipeline Development\\n• Build and manage a robust pipeline of high-quality content (e.g., STEM, academic, robotics, multimedia).\\n• Qualify and vet data &amp; content providers to ensure compliance with legal (copyright, data provenance) and business (relevance, cost, scalability) requirements.\\n• Provide regular reports and analytics on procurement activities, investments, and performance to support data-driven decision-making.\\n\\nLicensing &amp; Partnership Management\\n• Serve as a key point of contact for external partners (e.g., publishers, universities, and research institutions) to understand their goals and interests, addressing their needs and priorities.\\n• Develop multi-pronged relationships (e.g., revenue-sharing, co-development) to create long-term collaboration.\\n• Develop new programs that promote fair compensation and sustainability for content creators, owners and curators.\\n\\nCross-Functional Collaboration\\n• Collaborate with internal stakeholders (e.g., Science, Product, and Go To Market teams) to understand their needs and ensure procurement activities support their objectives.\\n• Evaluate &quot;make vs. buy&quot; options for content sourcing in collaboration with the Human Data team, balancing data development with external access/licensing opportunities\\n\\nRequired Qualifications and Skills\\n• Proven track record of negotiating and closing complex deals ($10M+), including revenue-sharing, licensing, or co-development agreements.\\n• Deep understanding of AI training data ecosystems and ability to translate this into business terms.\\n• Legal acumen: Understanding of legal concepts involved in data acquisition and content licensing.\\n• Strong STEM background (e.g., degree in Science, Technology, Engineering, Mathematics, or related field) and a passion for academic content and research.\\n• Excellent communication and stakeholder management skills (experience negotiating with C-level stakeholders), with the ability to build trust and influence partners at all levels.\\n• Business acumen with experience in market analysis and financial modeling (e.g., DCF analysis)\\n• Fluency in English and French; additional languages (e.g., German) are a plus.\\n• Knowledge of global copyright laws.\\n• Experience working in a fast-paced, global environment, with distributed teams.\\n\\nNice-to-Have Skills\\n• Existing network in the publishing, academic, or research communities (e.g., relationships with major publishers, universities, or data providers).\\n• Experience with AI training data, including familiarity with pretraining, RAG, or synthetic data generation.\\n• Direct experience working for a tech company sourcing data/content for LLMs\\n• Technical literacy in data formats (e.g., JSON, XML), APIs, or content management systems.\\n\\nBenefits\\n• Competitive cash salary and equity\\n• Daily lunch vouchers\\n• Monthly contribution to a Gympass subscription\\n• Monthly contribution to a mobility pass\\n• Full health insurance for you and your family\\n• Generous parental leave policy\\n• Visa sponsorship</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_54414b3a-610","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Mistral AI","sameAs":"https://mistral.ai"},"x-apply-url":"https://jobs.lever.co/mistral/b84413c1-00a1-4663-8697-aa6548cc87f8","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["proven track record of negotiating and closing complex deals","deep understanding of AI training data ecosystems","legal acumen","strong STEM background","excellent communication and stakeholder management skills","business acumen with experience in market analysis and financial modeling","fluency in English and French","knowledge of global copyright laws","experience working in a fast-paced, global environment"],"x-skills-preferred":["existing network in the publishing, academic, or research communities","experience with AI training data","direct experience working for a tech company sourcing data/content for LLMs","technical literacy in data formats (e.g., JSON, XML), APIs, or content management systems"],"datePosted":"2026-03-10T11:29:26.466Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Paris"}},"employmentType":"FULL_TIME","occupationalCategory":"Legal","industry":"Technology","skills":"proven track record of negotiating and closing complex deals, deep understanding of AI training data ecosystems, legal acumen, strong STEM background, excellent communication and stakeholder management skills, business acumen with experience in market analysis and financial modeling, fluency in English and French, knowledge of global copyright laws, experience working in a fast-paced, global environment, existing network in the publishing, academic, or research communities, experience with AI training data, direct experience working for a tech company sourcing data/content for LLMs, technical literacy in data formats (e.g., JSON, XML), APIs, or content management systems"}]}