{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/distributed-compute"},"x-facet":{"type":"skill","slug":"distributed-compute","display":"Distributed Compute","count":10},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_94897623-5b7"},"title":"Software Engineer II","description":"<p>Overview About the Team Copilot Security builds the foundations that make Microsoft’s AI experiences trusted, resilient, and safe. We design and implement security capabilities that protect users across Windows, Edge, web, mobile, and third-party ecosystems. Our work spans secure identity flows, defenses against threats like prompt injection, and privacy-first systems that scale globally.</p>\n<p>About the Role Copilot is entering a new era of agentic AI, where intelligent agents take actions on behalf of users. We’re looking for a Software Engineer II with solid fundamentals and high growth potential,someone who can quickly deepen their expertise in AI-driven security and expand their ownership over time. You’ll contribute to secure orchestration frameworks, AI-powered defenses, and the core systems that ensure Copilot’s actions remain trustworthy.</p>\n<p>Responsibilities Build and ship security features that protect Copilot from threats such as prompt injection, adversarial manipulation, and unsafe agentic workflows. Implement secure orchestration components that allow Copilot to safely delegate and execute actions across devices, services, and platforms. Contribute to developing intelligent agents that apply information-flow reasoning, guardrails, and common-sense constraints for security and privacy. Collaborate with partner teams across engineering, product, security, privacy, and AI to adopt secure agentic patterns and best practices. Instrument and monitor key metrics for agentic AI security, using data to improve reliability, safety, and user trust. Write clear documentation for secure agentic patterns, including safe-delegation guidelines and emerging risk considerations. Demonstrate high growth potential by progressively expanding technical scope, autonomy, and ownership as you gain experience with agentic AI and security systems.</p>\n<p>Qualifications Required Qualifications: Bachelor’s Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.</p>\n<p>Preferred Qualifications: Master’s Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor’s Degree in Computer Science or related technical field AND 5+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.</p>\n<p>Experience building production-quality software systems. 1–2+ years building or operating large-scale distributed systems or services. Experience working on security-critical, privacy-sensitive, or AI-powered systems. Familiarity with agentic AI concepts such as tool calling, orchestration, or multi-agent workflows. Experience with modern cloud development, containerization (Docker, Kubernetes), or distributed compute frameworks. Exposure to evaluation or observability tooling for LLM-based applications (e.g., LangFuse, MLFlow, Phoenix) or interest in learning these systems. Ability to communicate technical concepts clearly and collaborate effectively across teams. Demonstrated high growth potential, with solid learning velocity and the ability to quickly take on broader areas of ownership. Growth mindset with interest in developing deeper expertise in AI security, orchestration, and emerging threat models.</p>\n<p>#MicrosoftAI #MAI DPS</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_94897623-5b7","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft AI","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/software-engineer-ii-32/","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$100,600 - $199,000 per year","x-skills-required":["C","C++","C#","Java","JavaScript","Python","Agentic AI","Secure Orchestration","Information-Flow Reasoning","Guardrails","Common-Sense Constraints","Security","Privacy","Cloud Development","Containerization","Distributed Compute Frameworks"],"x-skills-preferred":["Modern Cloud Development","Evaluation or Observability Tooling","LLM-Based Applications"],"datePosted":"2026-04-24T12:14:42.974Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, Agentic AI, Secure Orchestration, Information-Flow Reasoning, Guardrails, Common-Sense Constraints, Security, Privacy, Cloud Development, Containerization, Distributed Compute Frameworks, Modern Cloud Development, Evaluation or Observability Tooling, LLM-Based Applications","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":100600,"maxValue":199000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_447681a8-24c"},"title":"Software Engineer II","description":"<p>About the Team Copilot Security builds the foundations that make Microsoft’s AI experiences trusted, resilient, and safe. We design and implement security capabilities that protect users across Windows, Edge, web, mobile, and third-party ecosystems. Our work spans secure identity flows, defenses against threats like prompt injection, and privacy-first systems that scale globally.</p>\n<p>About the Role Copilot is entering a new era of agentic AI, where intelligent agents take actions on behalf of users. We’re looking for a Software Engineer II with solid fundamentals and high growth potential,someone who can quickly deepen their expertise in AI-driven security and expand their ownership over time. You’ll contribute to secure orchestration frameworks, AI-powered defenses, and the core systems that ensure Copilot’s actions remain trustworthy. This role is ideal for engineers who enjoy solving complex technical problems, learning new AI-driven patterns, and building secure, scalable systems that balance innovation with user trust.</p>\n<p>Why This Role Matters Your work will directly shape how hundreds of millions of users experience safe, trustworthy, and innovative AI. You’ll be at the forefront of defining how agentic AI can proactively defend users, mitigate emerging threats, and unlock new secure scenarios, making a global impact on Microsoft’s most transformative products.</p>\n<p>Responsibilities Build and ship security features that protect Copilot from threats such as prompt injection, adversarial manipulation, and unsafe agentic workflows. Implement secure orchestration components that allow Copilot to safely delegate and execute actions across devices, services, and platforms. Contribute to developing intelligent agents that apply information-flow reasoning, guardrails, and common-sense constraints for security and privacy. Collaborate with partner teams across engineering, product, security, privacy, and AI to adopt secure agentic patterns and best practices. Instrument and monitor key metrics for agentic AI security, using data to improve reliability, safety, and user trust. Write clear documentation for secure agentic patterns, including safe-delegation guidelines and emerging risk considerations. Demonstrate high growth potential by progressively expanding technical scope, autonomy, and ownership as you gain experience with agentic AI and security systems.</p>\n<p>Qualifications Required Qualifications: Bachelor’s Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience. Preferred Qualifications: Master’s Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor’s Degree in Computer Science or related technical field AND 5+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_447681a8-24c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft AI","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/software-engineer-ii-30/","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":"$100,600 - $199,000 per year","x-skills-required":["C","C++","C#","Java","JavaScript","Python","Agentic AI","Secure Orchestration","Information-Flow Reasoning","Guardrails","Common-Sense Constraints"],"x-skills-preferred":["Modern Cloud Development","Containerization (Docker, Kubernetes)","Distributed Compute Frameworks","Evaluation or Observability Tooling (e.g., LangFuse, MLFlow, Phoenix)"],"datePosted":"2026-04-24T12:13:11.434Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Redmond"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, Agentic AI, Secure Orchestration, Information-Flow Reasoning, Guardrails, Common-Sense Constraints, Modern Cloud Development, Containerization (Docker, Kubernetes), Distributed Compute Frameworks, Evaluation or Observability Tooling (e.g., LangFuse, MLFlow, Phoenix)","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":100600,"maxValue":199000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_38a4b00f-02f"},"title":"Software Engineer II","description":"<p>About the Team Copilot Security builds the foundations that make Microsoft’s AI experiences trusted, resilient, and safe. We design and implement security capabilities that protect users across Windows, Edge, web, mobile, and third-party ecosystems. Our work spans secure identity flows, defenses against threats like prompt injection, and privacy-first systems that scale globally.</p>\n<p>About the Role Copilot is entering a new era of agentic AI, where intelligent agents take actions on behalf of users. We’re looking for a Software Engineer II with solid fundamentals and high growth potential,someone who can quickly deepen their expertise in AI-driven security and expand their ownership over time. You’ll contribute to secure orchestration frameworks, AI-powered defenses, and the core systems that ensure Copilot’s actions remain trustworthy.</p>\n<p>Responsibilities Build and ship security features that protect Copilot from threats such as prompt injection, adversarial manipulation, and unsafe agentic workflows. Implement secure orchestration components that allow Copilot to safely delegate and execute actions across devices, services, and platforms. Contribute to developing intelligent agents that apply information-flow reasoning, guardrails, and common-sense constraints for security and privacy. Collaborate with partner teams across engineering, product, security, privacy, and AI to adopt secure agentic patterns and best practices. Instrument and monitor key metrics for agentic AI security, using data to improve reliability, safety, and user trust. Write clear documentation for secure agentic patterns, including safe-delegation guidelines and emerging risk considerations. Demonstrate high growth potential by progressively expanding technical scope, autonomy, and ownership as you gain experience with agentic AI and security systems.</p>\n<p>Qualifications Bachelor’s Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_38a4b00f-02f","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Microsoft AI","sameAs":"https://microsoft.ai","logo":"https://logos.yubhub.co/microsoft.ai.png"},"x-apply-url":"https://microsoft.ai/job/software-engineer-ii-31/","x-work-arrangement":"hybrid","x-experience-level":"mid","x-job-type":"full-time","x-salary-range":null,"x-skills-required":["C","C++","C#","Java","JavaScript","Python"],"x-skills-preferred":["modern cloud development","containerization (Docker, Kubernetes)","distributed compute frameworks","evaluation or observability tooling for LLM-based applications"],"datePosted":"2026-04-24T12:12:20.814Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Mountain View"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"C, C++, C#, Java, JavaScript, Python, modern cloud development, containerization (Docker, Kubernetes), distributed compute frameworks, evaluation or observability tooling for LLM-based applications"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_7e28478b-c37"},"title":"Research, Audio Expertise","description":"<p>We&#39;re seeking a researcher to advance the frontier of audio capabilities. You&#39;ll explore how audio models enable more natural and efficient communication/collaboration, preserving more information and capturing user intent.</p>\n<p>This is a highly collaborative role. You&#39;ll work closely across pre-training, post-training, and product with world-class researchers, infrastructure engineers, and designers.</p>\n<p>As a researcher in this role, you&#39;ll be expected to:</p>\n<ul>\n<li>Own research projects on audio training, low-latency inference, and conversational responsiveness.</li>\n<li>Design and train large-scale models that natively support audio input and output.</li>\n<li>Investigate scaling behaviour such as how data, model size, and compute affect capability and efficiency.</li>\n<li>Build and maintain audio data pipelines, including preprocessing, filtering, segmentation, and alignment for training and evaluation.</li>\n<li>Collaborate with data and infrastructure teams to scale audio training efficiently across distributed systems.</li>\n<li>Publish and present research that moves the entire community forward.</li>\n</ul>\n<p>Share code, datasets, and insights that accelerate progress across industry and academia.</p>\n<p>This role blends fundamental research and practical engineering, as we do not distinguish between the two roles internally. You will be expected to write high-performance code and read technical reports.</p>\n<p>It&#39;s an excellent fit for someone who enjoys both deep theoretical exploration and hands-on experimentation, and who wants to shape the foundations of how AI learns.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_7e28478b-c37","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Thinking Machines Lab","sameAs":"https://thinkingmachines.ai/","logo":"https://logos.yubhub.co/thinkingmachines.ai.png"},"x-apply-url":"https://job-boards.greenhouse.io/thinkingmachines/jobs/5002212008","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$350,000 - $475,000 USD","x-skills-required":["Python","PyTorch","TensorFlow","JAX","Machine Learning","Deep Learning","Distributed Compute Environments"],"x-skills-preferred":["Probability","Statistics","Real-time Inference","Streaming Architectures","Optimization for Low Latency","Large-Scale Audio or Multimodal Models","Speech, Audio, Voice, or Similar Areas"],"datePosted":"2026-04-18T15:57:29.075Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, PyTorch, TensorFlow, JAX, Machine Learning, Deep Learning, Distributed Compute Environments, Probability, Statistics, Real-time Inference, Streaming Architectures, Optimization for Low Latency, Large-Scale Audio or Multimodal Models, Speech, Audio, Voice, or Similar Areas","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":475000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_24176cb8-311"},"title":"Member of Technical Staff - Compute Infrastructure","description":"<p>We&#39;re seeking a highly skilled Member of Technical Staff to join our Compute Infrastructure team. As a key member of this team, you will design, build, and operate massive-scale clusters and orchestration platforms that power frontier AI training, inference, and agent workloads at unprecedented scale.</p>\n<p>In this role, you will push the boundaries of container orchestration far beyond existing systems like Kubernetes, manage exascale compute resources, optimize for high-performance training runs and production serving, and collaborate closely with research and systems teams to deliver reliable, ultra-scalable infrastructure that enables xAI&#39;s next-generation models and applications.</p>\n<p>Responsibilities include building and managing massive-scale clusters, designing, developing, and extending an in-house container orchestration platform, collaborating with research teams to architect and optimize compute clusters, profiling, debugging, and resolving complex system-level performance bottlenecks, and owning end-to-end infrastructure initiatives.</p>\n<p>To succeed in this role, you will need deep expertise in virtualization technologies and advanced containerization/sandboxing, strong proficiency in systems programming languages such as C/C++ and Rust, and proven track record profiling, debugging, and optimizing complex system-level performance issues.</p>\n<p>Preferred skills and experience include experience in Linux kernel development, hypervisor extensions, or low-level system programming for compute-intensive workloads, operating or designing large-scale AI training/inference clusters, and familiarity with performance tools, tracing, and debugging in production distributed environments.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_24176cb8-311","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5052040007","x-work-arrangement":"onsite","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$180,000 - $440,000 USD","x-skills-required":["Deep expertise in virtualization technologies (KVM, Xen, QEMU) and advanced containerization/sandboxing (Kata, Firecracker, gVisor, Sysbox, or equivalent)","Strong proficiency in systems programming languages such as C/C++ and Rust","Proven track record profiling, debugging, and optimizing complex system-level performance issues, with deep knowledge of Linux kernel internals, resource management, scheduling, memory management, and low-level engineering","Hands-on experience building or significantly enhancing distributed compute platforms, orchestration systems, or high-performance infrastructure at scale"],"x-skills-preferred":["Experience in Linux kernel development, hypervisor extensions, or low-level system programming for compute-intensive workloads","Proven track record operating or designing large-scale AI training/inference clusters (GPU/TPU scale)","Experience with custom runtimes, isolation techniques, or bespoke platforms for specialized AI compute","Familiarity with performance tools, tracing, and debugging in production distributed environments"],"datePosted":"2026-04-18T15:55:50.213Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Palo Alto, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Deep expertise in virtualization technologies (KVM, Xen, QEMU) and advanced containerization/sandboxing (Kata, Firecracker, gVisor, Sysbox, or equivalent), Strong proficiency in systems programming languages such as C/C++ and Rust, Proven track record profiling, debugging, and optimizing complex system-level performance issues, with deep knowledge of Linux kernel internals, resource management, scheduling, memory management, and low-level engineering, Hands-on experience building or significantly enhancing distributed compute platforms, orchestration systems, or high-performance infrastructure at scale, Experience in Linux kernel development, hypervisor extensions, or low-level system programming for compute-intensive workloads, Proven track record operating or designing large-scale AI training/inference clusters (GPU/TPU scale), Experience with custom runtimes, isolation techniques, or bespoke platforms for specialized AI compute, Familiarity with performance tools, tracing, and debugging in production distributed environments","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":180000,"maxValue":440000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_231ce599-c30"},"title":"Staff Machine Learning Engineer, Content Quality Signals","description":"<p>We&#39;re seeking a Staff Machine Learning Engineer to join our Content Understanding team. As a key member of this team, you will lead modeling strategy for content understanding, including architecture selection, training approach, and evaluation methodology. You will design and ship production models that generate content signals such as embeddings and classifications used across multiple product surfaces. The ideal candidate will have significant industry experience building software and ML pipelines/systems, including technical leadership. They will have strong proficiency in Python and at least one ML stack such as PyTorch / TensorFlow, plus solid software engineering fundamentals. The role requires proven experience training and deploying ML models to production, including model versioning, rollouts, monitoring, and retraining strategies. The successful candidate will have deep hands-on experience in content understanding domains, such as computer vision, NLP, and multimodal/embedding models. They will also have experience working with large-scale datasets and distributed compute. The ideal candidate will be able to influence across teams and drive ambiguous problem areas to measurable outcomes. They will have strong applied skills in evaluation and experimentation, including defining metrics, offline/online alignment, A/B testing, debugging regressions, and model quality analysis.</p>\n<p>The role is ideal for a senior modeler who also enjoys developing, productionizing models and leading technical direction across teams. The successful candidate will be able to provide technical leadership through design reviews, mentoring, and raising the quality bar for modeling and ML engineering practices.</p>\n<p>In addition to the above responsibilities, the successful candidate will be expected to:</p>\n<ul>\n<li>Collaborate with infra/platform teams to ensure scalable, reliable training/serving (latency, cost, observability, rollout safety).</li>\n<li>Partner with signal-consuming teams (ranking, retrieval, integrity, ads) to define signal contracts, adoption patterns, and success metrics.</li>\n<li>Own the full ML lifecycle: data/labeling strategy (human labels + weak supervision), training pipelines, offline evaluation, online experimentation, deployment, and monitoring/retraining.</li>\n<li>Provide technical leadership through design reviews, mentoring, and raising the quality bar for modeling and ML engineering practices.</li>\n</ul>\n<p>Nice to have: experience with Cursor, Copilot, Codex, or similar AI coding assistants for development, debugging, testing, and refactoring; familiarity with LLM-powered productivity tools for documentation search, experiment analysis, SQL/data exploration, and engineering workflow acceleration.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_231ce599-c30","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Pinterest","sameAs":"https://www.pinterest.com/","logo":"https://logos.yubhub.co/pinterest.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/pinterest/jobs/7531060","x-work-arrangement":"remote","x-experience-level":"staff","x-job-type":"full-time","x-salary-range":"$189,308-$389,753 USD","x-skills-required":["Python","PyTorch","TensorFlow","Computer Vision","NLP","Multimodal Embedding Models","Large-Scale Datasets","Distributed Compute"],"x-skills-preferred":["Cursor","Copilot","Codex","LLM-Powered Productivity Tools"],"datePosted":"2026-04-18T15:54:53.925Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA, US; Remote, US"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, PyTorch, TensorFlow, Computer Vision, NLP, Multimodal Embedding Models, Large-Scale Datasets, Distributed Compute, Cursor, Copilot, Codex, LLM-Powered Productivity Tools","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":189308,"maxValue":389753,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9be280f4-cbc"},"title":"Software Engineer, Data Infrastructure","description":"<p>We&#39;re looking for an engineer to join our small, high-impact team responsible for architecting and scaling the core infrastructure behind distributed training pipelines, multimodal data catalogs, and intelligent processing systems that operate over petabytes of data.</p>\n<p>As a software engineer on our data infrastructure team, you&#39;ll design, build, and operate scalable, fault-tolerant infrastructure for LLM Research: distributed compute, data orchestration, and storage across modalities. You&#39;ll develop high-throughput systems for data ingestion, processing, and transformation , including training data catalogs, deduplication, quality checks, and search. You&#39;ll also build systems for traceability, reproducibility, and robust quality control at every stage of the data lifecycle.</p>\n<p>You&#39;ll collaborate with research teams to unlock new features, improve data quality, and accelerate training cycles. You&#39;ll implement and maintain monitoring and alerting to support platform reliability and performance.</p>\n<p>If you&#39;re excited by distributed systems, large-scale data mining, open-source tools like Spark, Kafka, Beam, Ray, and Delta Lake, and enjoy building from the ground up, we&#39;d love to hear from you.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9be280f4-cbc","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Thinking Machines Lab","sameAs":"https://thinkingmachines.ai/","logo":"https://logos.yubhub.co/thinkingmachines.ai.png"},"x-apply-url":"https://job-boards.greenhouse.io/thinkingmachines/jobs/5013919008","x-work-arrangement":"onsite","x-experience-level":null,"x-job-type":"full-time","x-salary-range":"$350,000 - $475,000 USD","x-skills-required":["backend language (Python or Rust)","distributed compute frameworks (Apache Spark or Ray)","cloud infrastructure","data lake architectures","batch and streaming pipelines"],"x-skills-preferred":["Kafka","dbt","Terraform","Airflow","web crawler","deduplication","data mining","search","file formats and storage systems"],"datePosted":"2026-04-18T15:54:00.309Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"backend language (Python or Rust), distributed compute frameworks (Apache Spark or Ray), cloud infrastructure, data lake architectures, batch and streaming pipelines, Kafka, dbt, Terraform, Airflow, web crawler, deduplication, data mining, search, file formats and storage systems","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":475000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_4ced2159-802"},"title":"Research, Vision Expertise","description":"<p>Thinking Machines Lab is seeking a researcher to join their team in San Francisco. The successful candidate will work on advancing the science of visual perception and multimodal learning. They will design architectures that fuse pixels and text, build datasets and evaluation methods that test real-world comprehension, and develop representations that let models ground abstract concepts in the physical world.</p>\n<p>The ideal candidate will have expertise in multimodality and experience running large-scale experiments. They will be comfortable contributing to complex engineering systems and have a strong grasp of probability, statistics, and machine learning fundamentals.</p>\n<p>This is an evergreen role, meaning that the position is open on an ongoing basis. The company receives many applications, and there may not always be an immediate role that aligns perfectly with the candidate&#39;s experience and skills. However, they encourage candidates to apply and continuously review applications.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Own research projects on training and performance analysis of multimodal AI models.</li>\n<li>Curate and build large-scale datasets and evaluation benchmarks to advance vision capabilities.</li>\n<li>Work with data infrastructure engineers, pretraining researchers and engineers, and product teams to create frontier multimodal models and the products that leverage them.</li>\n<li>Publish and present research that moves the entire community forward.</li>\n</ul>\n<p>Skills and Qualifications:</p>\n<ul>\n<li>Ability to design, run, and analyze experiments thoughtfully, with demonstrated research judgment and empirical rigor.</li>\n<li>Understanding of machine learning fundamentals, large-scale training, and distributed compute environments.</li>\n<li>Proficiency in Python and familiarity with at least one deep learning framework (e.g., PyTorch, TensorFlow, or JAX).</li>\n<li>Comfortable with debugging distributed training and writing code that scales.</li>\n<li>Bachelor&#39;s degree or equivalent experience in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding.</li>\n</ul>\n<p>Preferred qualifications include research or engineering contributions in visual reasoning, spatial understanding, or multimodal architecture design, experience developing evaluation frameworks for multimodal tasks, publications or open-source contributions in vision-language modeling, video understanding, or multimodal AI, and a strong grasp of probability, statistics, and ML fundamentals.</p>\n<p>Logistics:</p>\n<ul>\n<li>Location: San Francisco, California.</li>\n<li>Compensation: $350,000 - $475,000 USD per year, depending on background, skills, and experience.</li>\n<li>Visa sponsorship: Yes.</li>\n<li>Benefits: Generous health, dental, and vision benefits, unlimited PTO, paid parental leave, and relocation support as needed.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_4ced2159-802","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Thinking Machines Lab","sameAs":"https://thinkingmachines.ai/","logo":"https://logos.yubhub.co/thinkingmachines.ai.png"},"x-apply-url":"https://job-boards.greenhouse.io/thinkingmachines/jobs/5002288008","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$350,000 - $475,000 USD per year","x-skills-required":["Python","Deep learning framework (e.g., PyTorch, TensorFlow, or JAX)","Machine learning fundamentals","Large-scale training","Distributed compute environments"],"x-skills-preferred":["Visual reasoning","Spatial understanding","Multimodal architecture design","Evaluation frameworks for multimodal tasks","Vision-language modeling","Video understanding","Multimodal AI"],"datePosted":"2026-04-18T15:52:43.848Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, Deep learning framework (e.g., PyTorch, TensorFlow, or JAX), Machine learning fundamentals, Large-scale training, Distributed compute environments, Visual reasoning, Spatial understanding, Multimodal architecture design, Evaluation frameworks for multimodal tasks, Vision-language modeling, Video understanding, Multimodal AI","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":475000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_372999e8-579"},"title":"Senior Software Engineer II, AI Workload Orchestration","description":"<p>As a Senior Software Engineer II on the AI Workload Orchestration team, you will help build and operate CoreWeave&#39;s Kubernetes-native platform for admitting, scheduling, and operating AI workloads at scale.</p>\n<p>This platform integrates multiple orchestration and scheduling frameworks such as Kueue, Volcano, and Ray to support modern AI training and inference workflows. It complements SUNK (Slurm on Kubernetes) by providing a Kubernetes-first, cloud-native orchestration layer with deep platform integration.</p>\n<p>You will own meaningful components of the platform, drive reliability and performance improvements, and help scale the system as customer demand and workload complexity continue to grow.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Design, build, and operate Kubernetes-native services for AI workload orchestration and scheduling</li>\n<li>Own one or more platform components end-to-end, including design, implementation, testing, and on-call support</li>\n<li>Improve scheduling latency, cluster utilization, and workload reliability through metrics-driven engineering</li>\n<li>Contribute to architectural discussions across services and influence design decisions within the platform</li>\n<li>Work closely with adjacent teams (CKS, infrastructure, managed inference) to ensure clean interfaces and integrations</li>\n<li>Mentor junior engineers and raise the quality bar for code, design, and operations</li>\n</ul>\n<p>About the role:</p>\n<ul>\n<li>5–8 years of professional software engineering experience in distributed systems, cloud infrastructure, or platform engineering</li>\n<li>Strong experience building production systems in Go (Python or C++ a plus)</li>\n<li>Solid understanding of Kubernetes fundamentals, APIs, controllers, and operating services in production</li>\n<li>Experience working with scheduling, resource management, or quota-based systems</li>\n<li>Proven ability to improve system reliability and performance using data and operational metrics</li>\n<li>Comfortable owning services in production and participating in on-call rotations</li>\n</ul>\n<p>Preferred:</p>\n<ul>\n<li>Experience with Kubernetes-native orchestration frameworks such as Kueue, Volcano, Ray, Kubeflow, or Argo Workflows</li>\n<li>Familiarity with GPU-based workloads, ML training, or inference pipelines</li>\n<li>Knowledge of scheduling concepts such as quota enforcement, pre-emption, and backfilling</li>\n<li>Experience with reliability practices including SLOs, alerting, and incident response</li>\n<li>Exposure to AI infrastructure, HPC, or large-scale distributed compute environments</li>\n</ul>\n<p>Why CoreWeave?</p>\n<p>At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:</p>\n<ul>\n<li>Be Curious at Your Core</li>\n<li>Act Like an Owner</li>\n<li>Empower Employees</li>\n<li>Deliver Best-in-Class Client Experiences</li>\n<li>Achieve More Together</li>\n</ul>\n<p>The base salary range for this role is $165,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).</p>\n<p>What We Offer</p>\n<p>The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.</p>\n<p>In addition to a competitive salary, we offer a variety of benefits to support your needs, including:</p>\n<ul>\n<li>Medical, dental, and vision insurance - 100% paid for by CoreWeave</li>\n<li>Company-paid Life Insurance</li>\n<li>Voluntary supplemental life insurance</li>\n<li>Short and long-term disability insurance</li>\n<li>Flexible Spending Account</li>\n<li>Health Savings Account</li>\n<li>Tuition Reimbursement</li>\n<li>Ability to Participate in Employee Stock Purchase Program (ESPP)</li>\n<li>Mental Wellness Benefits through Spring Health</li>\n<li>Family-Forming support provided by Carrot</li>\n<li>Paid Parental Leave</li>\n<li>Flexible, full-service childcare support with Kinside</li>\n<li>401(k) with a generous employer match</li>\n<li>Flexible PTO</li>\n<li>Catered lunch each day in our office and data center locations</li>\n<li>A casual work environment</li>\n<li>A work culture focused on innovative disruption</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_372999e8-579","directApply":true,"hiringOrganization":{"@type":"Organization","name":"CoreWeave","sameAs":"https://www.coreweave.com","logo":"https://logos.yubhub.co/coreweave.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/coreweave/jobs/4647595006","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$165,000 to $242,000","x-skills-required":["Kubernetes","Go","Distributed systems","Cloud infrastructure","Platform engineering","Scheduling","Resource management","Quota-based systems"],"x-skills-preferred":["Kueue","Volcano","Ray","Kubeflow","Argo Workflows","GPU-based workloads","ML training","Inference pipelines","SLOs","Alerting","Incident response","AI infrastructure","HPC","Large-scale distributed compute environments"],"datePosted":"2026-04-18T15:50:19.636Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Sunnyvale, CA / Bellevue, WA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Kubernetes, Go, Distributed systems, Cloud infrastructure, Platform engineering, Scheduling, Resource management, Quota-based systems, Kueue, Volcano, Ray, Kubeflow, Argo Workflows, GPU-based workloads, ML training, Inference pipelines, SLOs, Alerting, Incident response, AI infrastructure, HPC, Large-scale distributed compute environments","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":165000,"maxValue":242000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_2bf2ef11-7d6"},"title":"Senior Backend Engineer, Data Modeling and Ingestion Platform","description":"<p>We are looking for a Senior Backend Engineer to lead the unification of large, highly rich, and heterogeneous datasets sourced from a wide range of external providers. These datasets are used to power our generative audio models.</p>\n<p>Your work will create the foundational dataset that powers our research by building robust, scalable systems for linking, deduplicating, reconciling, and enriching data at massive scale. This role centres on high-impact bulk ingestion and advanced data linkage. You will design the logic, algorithms, and strategies that transform many independent datasets into a unified, high-quality canonical asset used throughout the company.</p>\n<p>You will collaborate closely with ML researchers and product teams, working with tools such as BigQuery, Dataflow/Beam, TFRecords, and,where beneficial,distributed systems frameworks like Ray. Familiarity with ML workflows using JAX or multihost training is a plus, as the datasets you produce will directly support that ecosystem.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Build high-throughput bulk ingestion workflows to integrate datasets from multiple external providers.</li>\n<li>Design and implement scalable entity-resolution solutions, including record linking, deduplication, clustering, and conflict arbitration.</li>\n<li>Create and refine matching logic, decision rules, and similarity functions to align datasets with high accuracy and strong coverage.</li>\n<li>Define and track data quality indicators, such as overlap metrics, match precision/recall, duplicate rates, and completeness.</li>\n<li>Prepare training-ready datasets in formats such as TFRecords, and structure data to meet ML research requirements.</li>\n<li>Develop processing components using Dataflow (Beam) and manage large analytical workloads in BigQuery.</li>\n<li>Leverage frameworks like Ray to accelerate large-scale experiments, feature extraction, and research-oriented data preparation.</li>\n<li>Collaborate with ML researchers to anticipate downstream requirements and evolve linkage strategies as new sources and use cases emerge.</li>\n</ul>\n<p>Requirements:</p>\n<ul>\n<li>Experience working with large, heterogeneous datasets from multiple providers or domains.</li>\n<li>Strong background in entity resolution, deduplication, data unification, or related large-scale data integration techniques.</li>\n<li>Proficiency in Python, with an emphasis on efficient, scalable data processing.</li>\n<li>Experience with BigQuery, Google Dataflow/Apache Beam, or similar batch-processing frameworks.</li>\n<li>Familiarity with data validation, normalization, reconciliation, and building consistent views across diverse data sources.</li>\n<li>Ability to craft well-structured matching and decision strategies that balance accuracy, completeness, and computational efficiency.</li>\n<li>Comfortable iterating quickly on pragmatic solutions, balancing correctness with time-to-delivery.</li>\n<li>Clear communication skills and the ability to collaborate closely with ML and research teams.</li>\n</ul>\n<p>Nice to Have:</p>\n<ul>\n<li>Knowledge of architecting Google Cloud Platform systems at scale.</li>\n<li>Experience with distributed compute frameworks such as Ray, Spark, or Flink.</li>\n<li>Understanding of JAX-based ML pipelines, multihost training setups, or large-scale data preparation for accelerator-backed workflows.</li>\n<li>Familiarity with TFRecords or other high-volume training data formats.</li>\n<li>Exposure to ranking, clustering, or statistical similarity modeling.</li>\n<li>Experience with Go, NextJS, and/or React Native to contribute to full-stack development.</li>\n</ul>\n<p>Why Join Us:</p>\n<ul>\n<li>You will design the core dataset that underpins our research, product development, and generative audio models.</li>\n<li>You&#39;ll work on large-scale data challenges that require creativity, algorithmic thinking, and engineering excellence.</li>\n<li>You&#39;ll join a small, fast-moving team where your decisions shape the direction of our data and research capabilities.</li>\n</ul>\n<p>Benefits:</p>\n<ul>\n<li>Highly competitive salary and equity.</li>\n<li>Quarterly productivity budget.</li>\n<li>Flexible time off.</li>\n<li>Fantastic office location in Manhattan.</li>\n<li>Productivity package, including ChatGPT Plus, Claude Code, and Copilot.</li>\n<li>Top-notch private health, dental, and vision insurance for you and your dependents.</li>\n<li>401(k) plan options with employer matching.</li>\n<li>Concierge medical/primary care through One Medical and Rightway.</li>\n<li>Mental health support from Spring Health.</li>\n<li>Personalized life insurance, travel assistance, and many other perks.</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_2bf2ef11-7d6","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Udio","sameAs":"https://udio.com","logo":"https://logos.yubhub.co/udio.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/udio/jobs/4988140008","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$180,000 - $220,000","x-skills-required":["Python","BigQuery","Dataflow/Beam","TFRecords","Ray","JAX","Multihost training","Entity resolution","Deduplication","Data unification","Large-scale data integration"],"x-skills-preferred":["Go","NextJS","React Native","Distributed compute frameworks","Ranking","Clustering","Statistical similarity modeling"],"datePosted":"2026-04-17T13:04:41.720Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"New York"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Python, BigQuery, Dataflow/Beam, TFRecords, Ray, JAX, Multihost training, Entity resolution, Deduplication, Data unification, Large-scale data integration, Go, NextJS, React Native, Distributed compute frameworks, Ranking, Clustering, Statistical similarity modeling","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":180000,"maxValue":220000,"unitText":"YEAR"}}}]}