{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/big-data-stack"},"x-facet":{"type":"skill","slug":"big-data-stack","display":"Big Data Stack","count":3},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_06fb74fd-d12"},"title":"Senior MLE","description":"<p><strong>About the Role</strong></p>\n<p>We&#39;re looking for a Senior MLE to join our Machine Learning Recall team. As a Senior MLE, you will help us build and optimize ML/DL models to improve customer experience by providing the best results in terms of relevancy and marginality.</p>\n<p><strong>Role Overview</strong></p>\n<p>In the second part of 2025, we plan to focus our attention on three key areas:</p>\n<ul>\n<li>Recall: we don&#39;t want to lose good results</li>\n<li>Visual solutions: we would like to deliver end-to-end visual solutions for our customer, including (but not limited to) image search, shop the look, visual recommendations, etc</li>\n<li>Technical platform: we have many different technologies/models inside a team, and we would like to allow other teams to use them widely and integrate in their pipelines</li>\n</ul>\n<p><strong>Challenges You Will Tackle</strong></p>\n<ul>\n<li>Build and deploy robust ML systems for search (including text/image &amp; multimodal approaches, etc)</li>\n<li>Tune LLMs to improve our system in different aspects, not limited to what we already have</li>\n<li>Improve business KPIs by using new techniques/models and validating hypotheses</li>\n<li>Collaborate with other technical teams to exchange experiences to improve the overall Constructor.io system</li>\n<li>Be responsible for what you and your team do</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>3+ years of professional experience in applied machine learning</li>\n<li>Excellent NLP knowledge (especially transformer-based approaches)</li>\n<li>Comprehensive knowledge of classical machine learning</li>\n<li>Extensive Python knowledge</li>\n<li>Experience with any DL framework (we’re using torch)</li>\n<li>Experience with any SQL dialect (we’re using SparkSQL, MySQL and a couple more dialects)</li>\n<li>You have delivered production ML systems</li>\n<li>Proficiency with big data stack for end-to-end ML product development (we’re using Pyspark for most of our pipelines)</li>\n<li>You are able to translate intuition into data-driven hypotheses that result in engineering solutions that bring significant business value</li>\n<li>Proactivity: you can&#39;t close your eyes to problems, but are ready to solve them</li>\n<li>You are friendly and willing to help your teammates &amp; others</li>\n<li>Nice to have:</li>\n</ul>\n<p>+ Experience designing, conducting, and analyzing A/B tests \t+ Experience with Rust (or C/C++) \t+ Experience with a public cloud like AWS, Azure, or GCP \t+ Strong knowledge of data structures, algorithms and their trade-off \t+ Empathy \t+ Ability to explain difficult concepts \t+ You love to work on performance optimization, such as increasing result quality and improving code performance</p>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Unlimited vacation time - we strongly encourage all of our employees take at least 3 weeks per year</li>\n<li>Fully remote team - choose where you live</li>\n<li>Work from home stipend! We want you to have the resources you need to set up your home office</li>\n<li>Apple laptops provided for new employees</li>\n<li>Training and development budget for every employee, refreshed each year</li>\n<li>Maternity &amp; Paternity leave for qualified employees</li>\n<li>Work with smart people who will help you grow and make a meaningful impact</li>\n<li>Base salary: $80k–$120k USD, depending on knowledge, skills, experience, and interview results</li>\n<li>Stock options - offered in addition to the base salary</li>\n<li>Regular team offsites to connect and collaborate</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_06fb74fd-d12","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Constructor","sameAs":"https://apply.workable.com","logo":"https://logos.yubhub.co/j.com.png"},"x-apply-url":"https://apply.workable.com/j/AA636BFBB2","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$80k–$120k USD","x-skills-required":["NLP knowledge","Classical machine learning","Python knowledge","DL framework (torch)","SQL dialect (SparkSQL, MySQL)","Big data stack (Pyspark)","Data-driven hypotheses","Proactivity","Friendly and willing to help teammates"],"x-skills-preferred":["Experience designing, conducting, and analyzing A/B tests","Experience with Rust (or C/C++)","Experience with a public cloud like AWS, Azure, or GCP","Strong knowledge of data structures, algorithms and their trade-off","Empathy","Ability to explain difficult concepts","Performance optimization"],"datePosted":"2026-03-09T10:58:38.277Z","jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"NLP knowledge, Classical machine learning, Python knowledge, DL framework (torch), SQL dialect (SparkSQL, MySQL), Big data stack (Pyspark), Data-driven hypotheses, Proactivity, Friendly and willing to help teammates, Experience designing, conducting, and analyzing A/B tests, Experience with Rust (or C/C++), Experience with a public cloud like AWS, Azure, or GCP, Strong knowledge of data structures, algorithms and their trade-off, Empathy, Ability to explain difficult concepts, Performance optimization","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":80000,"maxValue":120000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_cd5ee5de-c1b"},"title":"Data Analyst: Retail Media","description":"<p><strong>About the Role</strong></p>\n<p>We&#39;re looking for a Senior Data Analyst to join our Retail Media team. As a key member of our cross-functional team, you will help us make data-driven product decisions, optimize customer and advertiser metrics, and scale the product.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Research product hypotheses to optimize main platform metrics for retailers and advertisers: ROAS, Ad spent, Churn Rate, etc.</li>\n<li>Work with PMs and Engineers to scope the most important insights we expect to extract from our data and improvements that we’d like to bring to the platform.</li>\n<li>Collaborate with technical and non-technical business partners to develop / update functionalities.</li>\n<li>Communicate with stakeholders within and outside the team.</li>\n<li>Own the results and business metrics for advertisers and retailers.</li>\n<li>Participate in late-stage sales process and building demos for prospect customers.</li>\n</ul>\n<p><strong>Requirements</strong></p>\n<ul>\n<li>You are proficient in BI tools (data analysis, building dashboards for engineers and non-technical folks).</li>\n<li>You are an excellent communicator with the ability to translate business needs into a technical language and vice versa.</li>\n<li>You are excited to leverage massive amounts of data to drive product innovation &amp; deliver business value.</li>\n<li>You are proficient at SQL (any variant), well-versed in exploratory data analysis with Python (pandas &amp; numpy, data visualization libraries).</li>\n<li>Big plus is practical familiarity with the big data stack (Spark, Presto/Athena, Hive).</li>\n<li>You are adept at fast prototyping and providing analytical support for initiatives in the e-commerce space by identifying &amp; focusing on relevant features &amp; metrics.</li>\n<li>You are willing to develop and maintain effective communication tools to report business performance and inform decision-making at a cross-functional level.</li>\n<li>Big plus is AdTech industry experience.</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Unlimited vacation time - we strongly encourage all of our employees take at least 3 weeks per year</li>\n<li>Fully remote team - choose where you live</li>\n<li>Work from home stipend! We want you to have the resources you need to set up your home office</li>\n<li>Apple laptops provided for new employees</li>\n<li>Training and development budget for every employee, refreshed each year</li>\n<li>Maternity &amp; Paternity leave for qualified employees</li>\n<li>Work with smart people who will help you grow and make a meaningful impact</li>\n<li>Base salary: $80k–$120k USD, depending on knowledge, skills, experience, and interview results</li>\n<li>Stock options - offered in addition to the base salary</li>\n<li>Regular team offsites to connect and collaborate</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_cd5ee5de-c1b","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Constructor","sameAs":"https://apply.workable.com","logo":"https://logos.yubhub.co/j.com.png"},"x-apply-url":"https://apply.workable.com/j/A29359EF00","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$80k–$120k USD","x-skills-required":["BI tools","SQL","Python","data visualization libraries","big data stack"],"x-skills-preferred":["AdTech industry experience"],"datePosted":"2026-03-09T10:58:29.901Z","jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"BI tools, SQL, Python, data visualization libraries, big data stack, AdTech industry experience","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":80000,"maxValue":120000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c952dc65-160"},"title":"AI Machine Learning Engineer: AI Shopping Agents","description":"<p><strong>About Us</strong></p>\n<p>Constructor is a U.S. based company that has been in the market since 2019, building a next-generation platform for search and discovery in ecommerce. Its search engine is entirely invented in-house, utilizing transformers and generative LLMs, and powers over 1 billion queries every day across 150 languages and roughly 100 countries.</p>\n<p><strong>Responsibilities</strong></p>\n<ul>\n<li>Architect and build real-time agentic workflows to handle complex, multi-step user tasks and open-ended queries, providing users with accurate and contextually relevant answers and product suggestions</li>\n<li>Own the end-to-end data lifecycle for AI workflows, including vector database ingestion and indexing</li>\n<li>Design metrics to evaluate the relevance and performance of query results, ensuring alignment with business goals and user expectations</li>\n<li>Generate and rapidly prototype novel product hypotheses that leverage LLMs, RAG, and agentic systems</li>\n<li>Collaborate closely with Product, Design, Analytics, and other engineering teams to translate AI capabilities into tangible, high-quality product features</li>\n<li>Improve the speed, quality, and efficiency of our AI systems and engineering processes</li>\n<li>Take ownership of systems and designs from conception through to deployment and maintenance</li>\n</ul>\n<p><strong>Qualifications</strong></p>\n<ul>\n<li>4+ years of industry experience in related fields, including search, information retrieval, recommendation systems, applied machine learning, and NLP</li>\n<li>Excellent skills in delivering and communicating business value</li>\n<li>Proficient in Python, SQL, and the big data stack for end-to-end ML product development, with experience across the entire pipeline in typical recommendation systems or LLM-based solutions</li>\n<li>Strong grasp of Information Retrieval (IR) techniques (e.g., dense retrieval, re-ranking, chunking strategies)</li>\n<li>Direct experience with Retrieval-Augmented Generation (RAG); experience building autonomous agents is a strong plus</li>\n<li>Nice to have: experience with automatic prompt optimization techniques (e.g., DSPy)</li>\n<li>Solid understanding of ML evaluation methodologies and key IR metrics</li>\n<li>Passion for shipping high-quality products and a self-motivated drive to take ownership of tasks</li>\n</ul>\n<p><strong>Tech Stack</strong></p>\n<ul>\n<li>Core: Python, FastAPI, asyncio, Airflow, Luigi, PySpark, Docker, LangGraph</li>\n<li>Data Stores: Vector Databases, DynamoDB, AWS S3, AWS RDS</li>\n<li>Cloud &amp; MLOps: AWS, Databricks, Ray</li>\n</ul>\n<p><strong>Benefits</strong></p>\n<ul>\n<li>Unlimited vacation time - we strongly encourage all of our employees take at least 3 weeks per year</li>\n<li>Fully remote team - choose where you live</li>\n<li>Work from home stipend! We want you to have the resources you need to set up your home office</li>\n<li>Apple laptops provided for new employees</li>\n<li>Training and development budget for every employee, refreshed each year</li>\n<li>Maternity &amp; Paternity leave for qualified employees</li>\n<li>Work with smart people who will help you grow and make a meaningful impact</li>\n<li>This position has a base salary range between $80k and $120k USD.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c952dc65-160","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Constructor","sameAs":"https://apply.workable.com","logo":"https://logos.yubhub.co/j.com.png"},"x-apply-url":"https://apply.workable.com/j/D15079EEBA","x-work-arrangement":"remote","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$80k - $120k USD","x-skills-required":["Python","SQL","big data stack","Information Retrieval (IR) techniques","Retrieval-Augmented Generation (RAG)","automatic prompt optimization techniques"],"x-skills-preferred":["DSPy"],"datePosted":"2026-03-09T10:57:19.254Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Oregon"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, SQL, big data stack, Information Retrieval (IR) techniques, Retrieval-Augmented Generation (RAG), automatic prompt optimization techniques, DSPy","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":80000,"maxValue":120000,"unitText":"YEAR"}}}]}