AI Infrastructure Engineer

9af8d812-df8 AI Infrastructure Engineer We're looking for Senior+ AI Infrastructure Engineers to build the systems that train and serve Intercom's next generation of AI products.

As a Senior AI Infrastructure Engineer focused on model training and inference, you will:

Implement and scale training pipelines for large transformer and LLM models, from data ingestion and preprocessing through distributed training and evaluation.

Build and optimize inference services that deliver low-latency, high-reliability experiences for our customers, including autoscaling, routing, and fallbacks.

Work on GPU-level performance: tuning kernels, improving utilization, and identifying bottlenecks across our training and inference stack.

Collaborate closely with ML scientists to implement cutting edge training and inference methods and bring them to production.

Play an active role in hiring, mentoring, and developing other engineers on the team.

Raise the bar for technical standards, reliability, and operational excellence across Intercom’s AI platform.

We’re looking to hire Senior+ AI Infrastructure Engineers. You’re likely a great fit if:

You have 5+ years of experience in software engineering, with a strong track record of shipping high-quality products or platforms.

You hold a degree in Computer Science, Computer Engineering, or a related field (or you have equivalent experience with very strong fundamentals).

You have hands-on experience with one or more of the following:

Model training (especially transformers and LLMs).

Model inference at scale (again, especially transformers and LLMs).

Low-level GPU work, such as writing CUDA or Triton kernels.

Comfortable working in production environments at meaningful scale (traffic, data, or organizational).

You communicate clearly, can explain complex technical topics to different audiences, and enjoy close collaboration with both engineers and non-engineers.

You take pride in strong technical fundamentals, love learning, and are willing to invest in your own development.

Have deep knowledge of at least one programming language (for example Python, Ruby, Java, Go, etc.). Specific language experience is less important than your ability to write clean, reliable code and learn new stacks quickly.

We are a well-treated bunch, with awesome benefits! If there’s something important to you that’s not on this list, talk to us!

Competitive salary, annual bonus and equity

Regular compensation reviews - we reward great work!

Unlimited access to Claude Code and best-in-class AI tools; experimentation & building is encouraged & celebrated.

Generous paid time off above statutory minimum

Hybrid working

MacBooks are our standard, but we also offer Windows for certain roles when needed.

Fun events for employees, friends, and family!

XML job scraping automation by YubHub

]]> full-time senior hybrid model training, model inference, low-level GPU work, CUDA, Triton, Python, Ruby, Java, Go, experience at AI native companies, running training or inference workloads on Kubernetes, AWS, cloud providers, production experience with Python in ML or infrastructure contexts Engineering Technology Intercom https://logos.yubhub.co/intercom.com.png Intercom is an AI company that builds customer service solutions. It was founded in 2011 and serves nearly 30,000 global businesses. https://www.intercom.com/ https://job-boards.greenhouse.io/intercom/jobs/7824142 Berlin, Germany 2026-04-18 586b9fef-509 Senior Software Engineer - Network Enablement (Applied ML) We believe that the way people interact with their finances will drastically improve in the next few years. We're dedicated to empowering this transformation by building the tools and experiences that thousands of developers use to create their own products.

On this team, you will build and operate the ML infrastructure and product services that enable trust and intelligence across Plaid's network. You'll own feature engineering, offline training and batch scoring, online feature serving, and real-time inference so model outputs directly power partner-facing fraud & trust products and bank intelligence features.

Responsibilities

Embed model inference into Network Enablement product flows and decision logic (APIs, feature flags, backend flows).
Define and instrument product + ML success metrics (fraud reduction, retention lift, false positives, downstream impact).
Design and run experiments and rollout plans (backtesting, shadow scoring, A/B tests, feature-flagged releases) to validate product hypotheses.
Build and operate offline training pipelines and production batch scoring for bank intelligence products.
Ship and maintain online feature serving and low-latency model inference endpoints for real-time partner/bank scoring.
Implement model CI/CD, model/version registry, and safe rollout/rollback strategies.
Monitor model/data health: drift/regression detection, model-quality dashboards, alerts, and SLOs targeted to partner product needs.
Ensure offline and online parity, data lineage, and automated validation / data contracts to reduce regressions.
Optimize inference performance and cost for real-time scoring (batching, caching, runtime selection).
Ensure fairness, explainability and PII-aware handling for partner-facing ML features; maintain auditability for compliance.
Partner with platform and cross-functional teams to scale the ML/data foundation (graph features, sequence embeddings, unified pipelines).
Mentor engineers and document team standards for ML productization and operations.

Qualifications

Must-haves:
Strong software engineering skills including systems design, APIs, and building reliable backend services (Go or Python preferred).
Production experience with batch and streaming data pipelines and orchestration tools such as Airflow or Spark.
Experience building or operating real-time scoring and online feature-serving systems, including feature stores and low-latency model inference.
Experience integrating model outputs into product flows (APIs, feature flags) and measuring impact through experiments and product metrics.
Experience with model lifecycle and operations: model registries, CI/CD for models, reproducible training, offline & online parity, monitoring and incident response.
Nice to have:
Experience in fraud, risk, or marketing intelligence domains.
Experience with feature-store products (Tecton / Chronon / Feast / internal) and unified pipelines.
Experience with graph frameworks, graph feature engineering, or sequence embeddings.
Experience optimizing inference at scale (Triton/ONNX/quantization, batching, caching).

Additional Information

Our mission at Plaid is to unlock financial freedom for everyone. To support that mission, we seek to build a diverse team of driven individuals who care deeply about making the financial ecosystem more equitable.

XML job scraping automation by YubHub

]]> full-time senior hybrid $190,800-$286,800 per year software engineering, systems design, APIs, backend services, Go, Python, batch and streaming data pipelines, orchestration tools, Airflow, Spark, real-time scoring, online feature-serving systems, feature stores, low-latency model inference, model outputs, product flows, experiments, product metrics, model lifecycle, operations, model registries, CI/CD, reproducible training, offline & online parity, monitoring, incident response, fraud, risk, marketing intelligence, feature-store products, unified pipelines, graph frameworks, graph feature engineering, sequence embeddings, inference at scale, Triton, ONNX, quantization, batching, caching Engineering Technology Plaid https://logos.yubhub.co/plaid.com.png Plaid is a technology company that powers the tools millions of people rely on to live a healthier financial life. The company has a presence in multiple countries and works with thousands of companies. https://plaid.com/ https://jobs.lever.co/plaid/43b1374d-5c5e-4b63-b710-a95e3cb76bbe San Francisco 2026-04-17 82a4d6f7-01c Staff Research Engineer, Discovery Team About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Team

Our team is organised around the north star goal of building an AI scientist – a system capable of solving the long term reasoning challenges and basic capabilities necessary to push the scientific frontier. Our team likes to think across the whole model stack. Currently the team is focused on improving models' abilities to use computers – as a laboratory for long horizon tasks and a key blocker to many scientific workflows.

About the role

As a Research Engineer on our team you will work end to end, identifying and addressing key blockers on the path to scientific AGI. Strong candidates should have familiarity with language model training, evaluation, and inference, be comfortable triaging research ideas and diagnosing problems and enjoy working collaboratively. Familiarity with performance optimisation, distributed systems, vm/sandboxing/container deployment, and large scale data pipelines is highly encouraged.

Join us in our mission to develop advanced AI systems that are both powerful and beneficial for humanity.

Responsibilities:

Working across the full stack to identify and remove bottlenecks preventing progress toward scientific AGI
Develop approaches to address long-horizon task completion and complex reasoning challenges essential for scientific discovery
Scaling research ideas from prototype to production
Create benchmarks and evaluation frameworks to measure model capabilities in scientific workflows and computer use
Implement distributed training systems and performance optimisations to support large-scale model development

You may be a good fit if you:

Have 8+ years of ML research experience
Are familiar with large scale language model training, evaluation, and inference pipelines
Enjoy obsessively iterating on immediate blockers towards longterm goals
Thrive working collaboratively to solve problems
Have expertise in performance optimisation and distributed computing systems
Show strong problem-solving skills and ability to identify technical bottlenecks in complex systems
Can translate research concepts into scalable engineering solutions
Have a track record of shipping ML systems that tackle challenging multi-step reasoning problems

Strong candidates may also have:

Expertise with performance optimisation for language model inference and training
Experience with computer use automation and agentic AI systems
A history working on reinforcement learning approaches for complex task completion
Knowledge of containerisation technologies (Docker, Kubernetes) and cloud deployment at scale
Demonstrated ability to work across multiple domains (language modelling, systems engineering, scientific computing)
Have experience with VM/sandboxing/container deployment and large-scale data processing
Experience working with large scale data problem solving and infrastructure
Published research or practical experience in scientific AI applications or long-horizon reasoning

Logistics

Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience. Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification.** Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us.** To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science.

XML job scraping automation by YubHub

]]> full-time staff hybrid $350,000 - $850,000USD language model training, evaluation, inference, performance optimisation, distributed systems, vm/sandboxing/container deployment, large scale data pipelines, performance optimisation for language model inference and training, computer use automation and agentic AI systems, reinforcement learning approaches for complex task completion, containerisation technologies (Docker, Kubernetes) and cloud deployment at scale, VM/sandboxing/container deployment and large-scale data processing Engineering Technology Anthropic https://logos.yubhub.co/anthropic.com.png Anthropic is a company that aims to create reliable, interpretable, and steerable AI systems. It has a team of researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. https://job-boards.greenhouse.io https://job-boards.greenhouse.io/anthropic/jobs/4593216008 San Francisco, CA 2026-03-08 da726093-b19 Research Engineer, Discovery About the Role

As a Research Engineer on our team, you will work end to end across the whole model stack, identifying and addressing key infra blockers on the path to scientific AGI. Strong candidates should have familiarity with elements of language model training, evaluation, and inference and eagerness to quickly dive and get up to speed in areas they are not yet an expert on. This may include performance optimization, distributed systems, VM/sandboxing/container deployment, and large scale data pipelines.

Responsibilities:

Design and implement large-scale infrastructure systems to support AI scientist training, evaluation, and deployment across distributed environments
Identify and resolve infrastructure bottlenecks impeding progress toward scientific capabilities
Develop robust and reliable evaluation frameworks for measuring progress towards scientific AGI.
Build scalable and performant VM/sandboxing/container architectures to safely execute long-horizon AI tasks and scientific workflows
Collaborate to translate experimental requirements into production-ready infrastructure
Develop large scale data pipelines to handle advanced language model training requirements
Optimize large scale training and inference pipelines for stable and efficient reinforcement learning

You may be a good fit if you:

Have 6+ years of highly-relevant experience in infrastructure engineering with demonstrated expertise in large-scale distributed systems
Are a strong communicator and enjoy working collaboratively
Possess deep knowledge of performance optimization techniques and system architectures for high-throughput ML workloads
Have experience with containerization technologies (Docker, Kubernetes) and orchestration at scale
Have proven track record of building large-scale data pipelines and distributed storage systems
Excel at diagnosing and resolving complex infrastructure challenges in production environments
Can work effectively across the full ML stack from data pipelines to performance optimization
Have experience collaborating with other researchers to scale experimental ideas
Thrive in fast-paced environments and can rapidly iterate from experimentation to production

Strong candidates may also have:

Experience with language model training infrastructure and distributed ML frameworks (PyTorch, JAX, etc.)
Background in building infrastructure for AI research labs or large-scale ML organizations
Knowledge of GPU/TPU architectures and language model inference optimization
Experience with cloud platforms (AWS, GCP) at enterprise scale
Familiarity with VM and container orchestration.
Experience with workflow orchestration tools and experiment management systems
History working with large scale reinforcement learning
Comfort with large scale data pipelines (Beam, Spark, Dask, …)

Logistics

Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale projects, and we're committed to making a positive impact on the world.

XML job scraping automation by YubHub

]]> full-time senior hybrid $350,000 - $850,000 USD infrastructure engineering, large-scale distributed systems, performance optimization, containerization technologies, orchestration at scale, data pipelines, distributed storage systems, complex infrastructure challenges, ML stack, workflow orchestration tools, experiment management systems, reinforcement learning, large scale data pipelines, language model training infrastructure, distributed ML frameworks, GPU/TPU architectures, language model inference optimization, cloud platforms, VM and container orchestration, workflow orchestration tools, experiment management systems, large scale reinforcement learning, large scale data pipelines Engineering Technology Anthropic https://logos.yubhub.co/anthropic.com.png Anthropic is a company that aims to create reliable, interpretable, and steerable AI systems. It has a team of researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. https://job-boards.greenhouse.io https://job-boards.greenhouse.io/anthropic/jobs/4669581008 San Francisco, CA 2026-03-08 a21c0ab8-098 Researcher, Training Location

San Francisco

Employment Type

Full time

Department

Research

Compensation

$360K – $440K • Offers Equity

The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If the role is non-exempt, overtime pay will be provided consistent with applicable laws. In addition to the salary range listed above, total compensation also includes generous equity, performance-related bonus(es) for eligible employees, and the following benefits.

Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts

Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)

401(k) retirement plan with employer match

Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)

Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees

13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)

Mental health and wellness support

Employer-paid basic life and disability coverage

Annual learning and development stipend to fuel your professional growth

Daily meals in our offices, and meal delivery credits as eligible

Relocation support for eligible employees

Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided.

More details about our benefits are available to candidates during the hiring process.

This role is at-will and OpenAI reserves the right to modify base pay and other compensation components at any time based on individual performance, team or company results, or market conditions.

About the Team

OpenAI's Training team is responsible for producing the large language models that power our research, our products, and ultimately bring us closer to AGI. Achieving this goal requires combining deep research into improving our current architecture, datasets and optimization techniques, alongside long-term bets aimed at improving the efficiency and capability of future generations of models. We are responsible for integrating these techniques and producing model artifacts used by the rest of the company, and ensuring that these models are world-class in every respect. Recent examples of artifacts with major contributions from our team include GPT4-Turbo, GPT-4o and o1-mini.

About the Role

As a member of the architecture team, you will push the frontier of architecture development for OpenAI's flagship models, enhancing intelligence, efficiency, and adding new capabilities.

Ideal candidates have a deep understanding of LLM architectures, a sophisticated understanding of model inference, and a hands-on empirical approach. A good fit for this role will be equally happy coming up with a creative breakthrough, investing in strengthening a baseline, designing an eval, debugging a thorny regression, or tracking down a bottleneck.

This role is based in San Francisco. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

Design, prototype and scale up new architectures to improve model intelligence
Execute and analyze experiments autonomously and collaboratively
Study, debug, and optimize both model performance and computational performance
Contribute to training and inference infrastructure

You might thrive in this role if you:

Have experience landing contributions to major LLM training runs
Can thoroughly evaluate and improve deep learning architectures in a self-directed fashion
Are motivated by safely deploying LLMs in the real world
Are well-versed in the state of the art transformer modifications for efficiency

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

XML job scraping automation by YubHub

]]> full-time senior hybrid $360K – $440K • Offers Equity Deep learning, Transformers, Model inference, Architecture development, Experiment design, Optimization techniques, LLM architectures, Model performance, Computational performance, Training and inference infrastructure Engineering Technology OpenAI https://logos.yubhub.co/openai.com.png OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. The company was founded in 2015 and has since grown to become a leading player in the field of artificial intelligence. https://jobs.ashbyhq.com https://jobs.ashbyhq.com/openai/97d3670c-e75a-4bb2-a235-171765f5f10e San Francisco 2026-03-06 61433df5-3e7 Member of Technical Staff, Multimodal Infrastructure Summary

Microsoft AI are looking for a talented Member of Technical Staff, Multimodal Infrastructure to help build the next wave of capabilities of our personalized AI assistant, Copilot. We're looking for someone who will bring an abundance of positive energy, empathy, and kindness to the team every day, in addition to being highly effective.

About the Role

We are seeking a highly skilled and experienced engineer to join our team as a Member of Technical Staff, Multimodal Infrastructure. The successful candidate will be responsible for designing, developing, and maintaining large-scale multimodal data processing pipelines, model pretraining and post-training frameworks, and model inference and serving frameworks. They will work closely with research scientists and product engineers to solve infra-related problems and find a path to get things done despite roadblocks to get their work into the hands of users quickly and iteratively.

Accountabilities

Design, develop, and maintain large-scale multimodal data processing pipelines.
Design, develop, and maintain large-scale multimodal model pretraining and post-training frameworks.
Design, develop, and maintain large-scale multimodal model inference and serving frameworks.

The Candidate we're looking for

Experience:

Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.

Technical skills:

Strong proficiency in distributed data processing infra (resource utilization management, fault tolerance, ray & spark) and CPU/GPU batch processing optimizations.
Experience with state-of-art model inference and serving frameworks.
Experience with image/video/audio data processing.
Experience with common data formats for efficient I/O.

Personal attributes:

Enjoy working in a fast-paced, design-driven, product development cycle.
Embody our Culture and Values.

Benefits

Competitive salary and benefits package.
Opportunities for professional growth and development.
Collaborative and dynamic work environment.
Access to cutting-edge technology and tools.
Flexible work arrangements.

XML job scraping automation by YubHub

]]> full-time staff hybrid Competitive salary and benefits package C, C++, C#, Java, JavaScript, Python, Distributed data processing infra, CPU/GPU batch processing optimizations, State-of-art model inference and serving frameworks, Image/video/audio data processing, Common data formats for efficient I/O, Ray & spark, TensorRT-LLM, SGLang, xDiT, Cache-DiT Engineering Technology Microsoft AI https://logos.yubhub.co/microsoft.ai.png Microsoft AI is a leading technology company that specializes in artificial intelligence and machine learning. They are known for their innovative products and services that aim to make a positive impact on people's lives. Microsoft AI is committed to advancing the field of AI and making it more accessible to everyone. https://microsoft.ai https://microsoft.ai/job/member-of-technical-staff-multimodal-infrastructure-mai-superintelligence-team-3/ New York 2026-03-06 cfee4a87-9c7 Member of Technical Staff, Multimodal Infrastructure Summary

About the Role

We're looking for someone who will design, develop and maintain large-scale multimodal data processing pipelines, model pretraining and post-training frameworks, and model inference and serving frameworks. You will work closely with research scientists and product engineers on multimodal data processing, model training, inference and serving tasks. As a contributing member of the core group of engineers, you would also bring to the table best practices driving architectural changes and influence roadmap of relevant software and hardware components.

Accountabilities

Design, develop and maintain large-scale multimodal data processing pipelines.
Design, develop and maintain large-scale multimodal model pretraining and post-training frameworks.
Design, develop and maintain large-scale multimodal model inference and serving frameworks.

The Candidate we're looking for

Experience:

Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.

Technical skills:

Strong proficiency in distributed data processing infra (resource utilization management, fault tolerance, ray & spark) and CPU/GPU batch processing optimizations.
Experience with state-of-art model inference and serving frameworks.
Experience with image/video/audio data processing.
Experience with common data formats for efficient I/O.

Personal attributes:

Enjoy working in a fast-paced, design-driven, product development cycle.
Embody our Culture and Values.

Benefits

Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.
Comprehensive health and wellbeing benefits.
Professional development opportunities.
Financial benefits (bonus, equity, pension, etc.).

XML job scraping automation by YubHub

]]> full-time staff hybrid C, C++, C#, Java, JavaScript, Python, Distributed data processing infra, CPU/GPU batch processing optimizations, State-of-art model inference and serving frameworks, Image/video/audio data processing, Common data formats for efficient I/O, Deep learning frameworks, Auto-regressive and diffusion transformer models, Distributed training techniques, Image/video generation and editing, Efficient architectures, Efficient model design, Reinforcement learning training methods Engineering Technology Microsoft AI https://logos.yubhub.co/microsoft.ai.png Microsoft AI is a leading technology company that specializes in artificial intelligence and machine learning. They are known for their innovative products and services that aim to make a positive impact on people's lives. Microsoft AI is committed to advancing the field of AI and making it more accessible to everyone. https://microsoft.ai https://microsoft.ai/job/member-of-technical-staff-multimodal-infrastructure-mai-superintelligence-team-2/ Redmond 2026-03-06 a82f064b-623 Member of Technical Staff, Multimodal Infrastructure Summary

Microsoft AI are looking for a talented Member of Technical Staff, Multimodal Infrastructure to help build the next wave of capabilities of our personalized AI assistant, Copilot. We’re looking for someone who will bring an abundance of positive energy, empathy, and kindness to the team every day, in addition to being highly effective.

About the Role

As a Member of Technical Staff, Multimodal Infrastructure, you will be responsible for designing, developing, and maintaining large-scale multimodal data processing pipelines, model pretraining and post-training frameworks, and model inference and serving frameworks. You will work closely with research scientists and product engineers to solve infra-related problems and find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively.

Accountabilities

Design, develop, and maintain large-scale multimodal data processing pipelines.
Design, develop, and maintain large-scale multimodal model pretraining and post-training frameworks.
Design, develop, and maintain large-scale multimodal model inference and serving frameworks.

The Candidate we're looking for

Experience:

Bachelor’s Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.

Technical skills:

Strong proficiency in distributed data processing infra (resource utilization management, fault tolerance, ray & spark) and CPU/GPU batch processing optimizations.
Experience with state-of-art model inference and serving frameworks.
Experience with image/video/audio data processing.
Experience with common data formats for efficient I/O.

Personal attributes:

Enjoy working in a fast-paced, design-driven, product development cycle.
Embody our Culture and Values.

Benefits

Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.
Comprehensive health and wellbeing benefits.
Professional development opportunities.
Financial benefits (bonus, equity, pension, etc.).

XML job scraping automation by YubHub

]]> full-time staff hybrid C, C++, C#, Java, JavaScript, Python, Distributed data processing infra, CPU/GPU batch processing optimizations, State-of-art model inference and serving frameworks, Image/video/audio data processing, Common data formats for efficient I/O, Auto-regressive and diffusion transformer models, Distributed training techniques, Image/video generation and editing, Efficient architectures, Efficient model design, Reinforcement learning training methods Engineering Technology Microsoft AI https://logos.yubhub.co/microsoft.ai.png Microsoft AI is a leading technology company that specializes in artificial intelligence and machine learning. They are known for their innovative products and services that aim to make a positive impact on people's lives. Microsoft AI is a subsidiary of Microsoft Corporation, a multinational technology company that was founded in 1975. https://microsoft.ai https://microsoft.ai/job/member-of-technical-staff-multimodal-infrastructure-mai-superintelligence-team/ Mountain View 2026-03-06