{"version":"0.1","company":{"name":"YubHub","url":"https://yubhub.co","jobsUrl":"https://yubhub.co/jobs/skill/speech-recognition"},"x-facet":{"type":"skill","slug":"speech-recognition","display":"Speech Recognition","count":17},"x-feed-size-limit":100,"x-feed-sort":"enriched_at desc","x-feed-notice":"This feed contains at most 100 jobs (the most recently enriched). For the full corpus, use the paginated /stats/by-facet endpoint or /search.","x-generator":"yubhub-xml-generator","x-rights":"Free to redistribute with attribution: \"Data by YubHub (https://yubhub.co)\"","x-schema":"Each entry in `jobs` follows https://schema.org/JobPosting. YubHub-native raw fields carry `x-` prefix.","jobs":[{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c85087bb-6a8"},"title":"AI Tutor - Danish","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Danish with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<ul>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n</ul>\n<ul>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n</ul>\n<ul>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<ul>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n</ul>\n<ul>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c85087bb-6a8","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090189007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Diverse languages","Accents","Cultural contexts","Linguistic and prosodic details","Professional audio standards","Speech modulation","Accent variation","Noise in real-world recordings","Multilingual audio processing","Audio workflows","Transcription","Audio quality","Voice recordings","Feedback on audio samples","Independent judgments","Ambiguous audio scenarios","Defensible annotation decisions","Portfolio","Voice samples","Annotated transcripts","Audio-related work","Quality","Methodology","Attention to detail","Professional experience in voice","Linguistics","Speech data","Speech evaluation and research"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Disfluencies","Prosodic features","Intonation","Stress","Rhythm","Emotion","Phonetics","Phonology","Sociolinguistics","Speech sciences","Cognitive science","Pronunciation differences","Multilingual speech patterns","Speech/audio datasets","Annotation workflows","AI training data","Training voice models","Data quality impacts model performance","Voice acting","Voice recording","Podcasting","Measurable audience","Clarity and recording quality"],"datePosted":"2026-04-18T15:57:57.773Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Diverse languages, Accents, Cultural contexts, Linguistic and prosodic details, Professional audio standards, Speech modulation, Accent variation, Noise in real-world recordings, Multilingual audio processing, Audio workflows, Transcription, Audio quality, Voice recordings, Feedback on audio samples, Independent judgments, Ambiguous audio scenarios, Defensible annotation decisions, Portfolio, Voice samples, Annotated transcripts, Audio-related work, Quality, Methodology, Attention to detail, Professional experience in voice, Linguistics, Speech data, Speech evaluation and research, Exceptional attention to linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Disfluencies, Prosodic features, Intonation, Stress, Rhythm, Emotion, Phonetics, Phonology, Sociolinguistics, Speech sciences, Cognitive science, Pronunciation differences, Multilingual speech patterns, Speech/audio datasets, Annotation workflows, AI training data, Training voice models, Data quality impacts model performance, Voice acting, Voice recording, Podcasting, Measurable audience, Clarity and recording quality"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_47216abf-f85"},"title":"AI Tutor - Dutch","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities: Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards. Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. Work with technical staff to improve annotation tools for efficient audio workflows.</p>\n<p>Basic Qualifications: Native proficiency in Dutch with exposure to diverse accents, dialects, or regional variations. Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form. Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality. Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages. Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech. Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively. Commitment to developing AI that masters sophisticated multilingual audio capabilities.</p>\n<p>Preferred Skills and Experience: Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work. Deep understanding and taste of what good/useful Audio data is. Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy. Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns. Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance. Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality. Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions. Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail. Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</p>\n<p>Location and Other Expectations: Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit. For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables. Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs. For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time. We are unable to provide visa sponsorship. For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</p>\n<p>Compensation and Benefits: US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process. Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_47216abf-f85","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090197007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Diverse languages","Accents","Cultural contexts"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Linguistics","Speech sciences","Cognitive science","Voice work","Voice acting","Voice recording","Podcasting"],"datePosted":"2026-04-18T15:53:42.796Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Diverse languages, Accents, Cultural contexts, Exceptional attention to linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Linguistics, Speech sciences, Cognitive science, Voice work, Voice acting, Voice recording, Podcasting"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_96e9554d-edc"},"title":"AI Tutor - Arabic","description":"<p>As an AI Tutor specialised in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities: Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards. Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. Work with technical staff to improve annotation tools for efficient audio workflows.</p>\n<p>Basic Qualifications: Native proficiency in Arabic with exposure to diverse accents, dialects, or regional variations. Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form. Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality. Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages. Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech. Strong communication, interpersonal, analytical, detail-oriented, and organisational skills, with the ability to articulate audio-related feedback effectively. Commitment to developing AI that masters sophisticated multilingual audio capabilities.</p>\n<p>Preferred Skills and Experience: Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work. Deep understanding and taste of what good/useful Audio data is. Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy. Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyse accent variation, pronunciation differences, and multilingual speech patterns. Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance. Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality. Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions. Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail. Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</p>\n<p>Location and Other Expectations: Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit. For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables. Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs. For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time. We are unable to provide visa sponsorship. For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</p>\n<p>Compensation and Benefits: US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process. Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_96e9554d-edc","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090171007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Natural language processing","Accent variation","Noise reduction","Multilingual audio processing"],"x-skills-preferred":["Linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Disfluencies","Accents","Prosodic features","Linguistics","Speech sciences","Cognitive science","Voice work","Voice acting","Voice recording","Podcasting"],"datePosted":"2026-04-18T15:52:57.094Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Natural language processing, Accent variation, Noise reduction, Multilingual audio processing, Linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Disfluencies, Accents, Prosodic features, Linguistics, Speech sciences, Cognitive science, Voice work, Voice acting, Voice recording, Podcasting"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_993beba7-87d"},"title":"AI Tutor - Vietnamese","description":"<p>As an AI Tutor specializing in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Vietnamese with exposure to diverse accents, dialects, or regional variations.</li>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n<li>We are unable to provide visa sponsorship.</li>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits: US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process. Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_993beba7-87d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090274007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Speech recognition","Auditory experiences","Accent variation","Noise in real-world recordings","Multilingual audio processing","Annotation tools","Efficient audio workflows","Native proficiency in Vietnamese","Proficiency in English","Strong auditory perception","Multilingual audio content","Speech accuracy","Cultural vocal expressions","Contextual interpretation","Transcription","High-quality voice recordings","Feedback on audio samples","Independent judgments","Ambiguous audio scenarios","Defensible annotation decisions","Portfolio","Voice samples","Annotated transcripts","Audio-related work","Quality","Methodology","Attention to detail"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Disfluencies","Accents","Prosodic features","Linguistics","Phonetics","Phonology","Sociolinguistics","Speech sciences","Cognitive science","Pronunciation differences","Multilingual speech patterns","Speech/audio datasets","Annotation workflows","AI training data","Training voice models","Data quality impacts model performance","Professional experience in voice work","Voice acting","Voice recording","Podcasting","Measurable audience","Similar audio production","Clarity and recording quality"],"datePosted":"2026-04-18T15:22:57.303Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Speech recognition, Auditory experiences, Accent variation, Noise in real-world recordings, Multilingual audio processing, Annotation tools, Efficient audio workflows, Native proficiency in Vietnamese, Proficiency in English, Strong auditory perception, Multilingual audio content, Speech accuracy, Cultural vocal expressions, Contextual interpretation, Transcription, High-quality voice recordings, Feedback on audio samples, Independent judgments, Ambiguous audio scenarios, Defensible annotation decisions, Portfolio, Voice samples, Annotated transcripts, Audio-related work, Quality, Methodology, Attention to detail, Exceptional attention to linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Disfluencies, Accents, Prosodic features, Linguistics, Phonetics, Phonology, Sociolinguistics, Speech sciences, Cognitive science, Pronunciation differences, Multilingual speech patterns, Speech/audio datasets, Annotation workflows, AI training data, Training voice models, Data quality impacts model performance, Professional experience in voice work, Voice acting, Voice recording, Podcasting, Measurable audience, Similar audio production, Clarity and recording quality"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_0b4750c4-02c"},"title":"AI Tutor - Thai","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Thai with exposure to diverse accents, dialects, or regional variations.</li>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n<li>We are unable to provide visa sponsorship.</li>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits: US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process. Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_0b4750c4-02c","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090272007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Thai language proficiency","English language proficiency","Auditory perception","Speech accuracy","Cultural vocal expressions","Contextual interpretation","Transcription","Voice recordings","Feedback on audio samples","Comprehension skills","Independent judgments","Communication skills","Interpersonal skills","Analytical skills","Detail-oriented skills","Organizational skills"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Disfluencies","Accents","Prosodic features","Linguistics","Speech sciences","Cognitive science","Accent variation","Pronunciation differences","Multilingual speech patterns","Speech/audio datasets","Annotation workflows","AI training data","Voice models","Data quality impacts model performance","Voice work","Voice acting","Voice recording","Podcasting","Audio production","Independent judgment","Defensible annotation decisions","Portfolio","Voice samples","Annotated transcripts","Audio-related work"],"datePosted":"2026-04-18T15:22:13.426Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Thai language proficiency, English language proficiency, Auditory perception, Speech accuracy, Cultural vocal expressions, Contextual interpretation, Transcription, Voice recordings, Feedback on audio samples, Comprehension skills, Independent judgments, Communication skills, Interpersonal skills, Analytical skills, Detail-oriented skills, Organizational skills, Exceptional attention to linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Disfluencies, Accents, Prosodic features, Linguistics, Speech sciences, Cognitive science, Accent variation, Pronunciation differences, Multilingual speech patterns, Speech/audio datasets, Annotation workflows, AI training data, Voice models, Data quality impacts model performance, Voice work, Voice acting, Voice recording, Podcasting, Audio production, Independent judgment, Defensible annotation decisions, Portfolio, Voice samples, Annotated transcripts, Audio-related work"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_738a6055-653"},"title":"AI Tutor - Italian","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Italian with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<ul>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n</ul>\n<ul>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n</ul>\n<ul>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<ul>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n</ul>\n<ul>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_738a6055-653","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090209007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["multilingual audio capabilities","speech recognition","auditory experiences","linguistic and prosodic details","professional audio standards","accent variation","noise in real-world recordings","multilingual audio processing","annotation tools","efficient audio workflows","native proficiency in Italian","English (minimum B2 level)","strong auditory perception","nuances in speech","accents","pronunciation","audio quality","speech accuracy","cultural vocal expressions","contextual interpretation","transcription","voice recordings","feedback on audio samples","independent judgments","ambiguous audio material","communication","interpersonal","analytical","detail-oriented","organizational","audio-related feedback","commitment to developing AI"],"x-skills-preferred":["exceptional attention to linguistic nuance","auditory detail","data quality","deep understanding of good/useful Audio data","advanced transcription and annotation practices","handling disfluencies","prosodic features","background in linguistics","phonetics","phonology","sociolinguistics","speech sciences","cognitive science","equivalent practical experience","analysis of accent variation","pronunciation differences","multilingual speech patterns","experience working with speech/audio datasets","annotation workflows","AI training data","training voice models","understanding of data quality impacts model performance","professional experience in voice work","voice acting","voice recording","podcasting","similar audio production","exercise independent judgment","defensible annotation decisions","portfolio","voice samples","annotated transcripts","audio-related work"],"datePosted":"2026-04-18T15:21:33.699Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"multilingual audio capabilities, speech recognition, auditory experiences, linguistic and prosodic details, professional audio standards, accent variation, noise in real-world recordings, multilingual audio processing, annotation tools, efficient audio workflows, native proficiency in Italian, English (minimum B2 level), strong auditory perception, nuances in speech, accents, pronunciation, audio quality, speech accuracy, cultural vocal expressions, contextual interpretation, transcription, voice recordings, feedback on audio samples, independent judgments, ambiguous audio material, communication, interpersonal, analytical, detail-oriented, organizational, audio-related feedback, commitment to developing AI, exceptional attention to linguistic nuance, auditory detail, data quality, deep understanding of good/useful Audio data, advanced transcription and annotation practices, handling disfluencies, prosodic features, background in linguistics, phonetics, phonology, sociolinguistics, speech sciences, cognitive science, equivalent practical experience, analysis of accent variation, pronunciation differences, multilingual speech patterns, experience working with speech/audio datasets, annotation workflows, AI training data, training voice models, understanding of data quality impacts model performance, professional experience in voice work, voice acting, voice recording, podcasting, similar audio production, exercise independent judgment, defensible annotation decisions, portfolio, voice samples, annotated transcripts, audio-related work"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_17620573-a87"},"title":"AI Tutor - Tagalog","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Tagalog with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<ul>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n</ul>\n<ul>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n</ul>\n<ul>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<ul>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n</ul>\n<ul>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_17620573-a87","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090268007","x-work-arrangement":"remote","x-experience-level":null,"x-job-type":"full-time|part-time|contract|temporary|internship","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Natural language processing","Language barriers","Speech processing","Multilingual audio nuances"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Disfluencies","Accents","Prosodic features","Linguistics","Speech sciences","Cognitive science","Voice work","Voice acting","Voice recording","Podcasting","Independent judgment","Defensible annotation decisions"],"datePosted":"2026-04-18T15:21:26.846Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Natural language processing, Language barriers, Speech processing, Multilingual audio nuances, Exceptional attention to linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Disfluencies, Accents, Prosodic features, Linguistics, Speech sciences, Cognitive science, Voice work, Voice acting, Voice recording, Podcasting, Independent judgment, Defensible annotation decisions"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_5f816847-74d"},"title":"AI Tutor - Indonesian","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Indonesian with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<ul>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n</ul>\n<ul>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n</ul>\n<ul>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<ul>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n</ul>\n<ul>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_5f816847-74d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5095657007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Linguistic and prosodic details","Professional audio standards","Accent variation","Noise in real-world recordings","Multilingual audio processing","Data quality","Independent judgment","Audio-related feedback","Multilingual speech patterns","Speech/audio datasets","Annotation workflows","AI training data","Voice models","Data quality impacts model performance","Voice work","Voice acting","Voice recording","Podcasting","Audio production","Clearity and recording quality","Defensible annotation decisions","Portfolio","Voice samples","Annotated transcripts","Audio-related work"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Deep understanding and taste of what good/useful Audio data is","Strong command of advanced transcription and annotation practices","Handling disfluencies","Accents","Prosodic features","Intonation","Stress","Rhythm","Emotion","Background in linguistics","Phonetics","Phonology","Sociolinguistics","Speech sciences","Cognitive science","Equivalent practical experience","Analyzing accent variation","Pronunciation differences","Experience working with speech/audio datasets","Training voice models","Understanding data quality impacts model performance","Professional experience in voice work","Similar audio production","Attention to clarity and recording quality","Independent judgment in ambiguous audio scenarios"],"datePosted":"2026-04-18T15:21:16.886Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Linguistic and prosodic details, Professional audio standards, Accent variation, Noise in real-world recordings, Multilingual audio processing, Data quality, Independent judgment, Audio-related feedback, Multilingual speech patterns, Speech/audio datasets, Annotation workflows, AI training data, Voice models, Data quality impacts model performance, Voice work, Voice acting, Voice recording, Podcasting, Audio production, Clearity and recording quality, Defensible annotation decisions, Portfolio, Voice samples, Annotated transcripts, Audio-related work, Exceptional attention to linguistic nuance, Auditory detail, Deep understanding and taste of what good/useful Audio data is, Strong command of advanced transcription and annotation practices, Handling disfluencies, Accents, Prosodic features, Intonation, Stress, Rhythm, Emotion, Background in linguistics, Phonetics, Phonology, Sociolinguistics, Speech sciences, Cognitive science, Equivalent practical experience, Analyzing accent variation, Pronunciation differences, Experience working with speech/audio datasets, Training voice models, Understanding data quality impacts model performance, Professional experience in voice work, Similar audio production, Attention to clarity and recording quality, Independent judgment in ambiguous audio scenarios"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_047a7c93-c55"},"title":"AI Tutor - Hindi","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities: Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards. Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. Work with technical staff to improve annotation tools for efficient audio workflows.</p>\n<p>Basic Qualifications: Native proficiency in Hindi with exposure to diverse accents, dialects, or regional variations. Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form. Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality. Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages. Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech. Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively. Commitment to developing AI that masters sophisticated multilingual audio capabilities.</p>\n<p>Preferred Skills and Experience: Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work. Deep understanding and taste of what good/useful Audio data is. Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy. Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns. Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance. Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality. Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions. Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail. Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</p>\n<p>Location and Other Expectations: Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit. For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables. Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs. For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time. We are unable to provide visa sponsorship. For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</p>\n<p>Compensation and Benefits: US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process. Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_047a7c93-c55","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090207007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Diverse languages","Accents","Cultural contexts","High-quality audio data","Clear spoken output","Linguistic and prosodic details","Professional audio standards","Speech modulation","Accent variation","Noise in real-world recordings","Multilingual audio processing","Efficient audio workflows","Native proficiency in Hindi","English (minimum B2 level)","Strong auditory perception","Multilingual audio content","Speech accuracy","Cultural vocal expressions","Contextual interpretation","Transcription","Audio quality","Voice recordings","Feedback on audio samples","Independent judgments","Ambiguous audio material","Noisy or accented speech","Communication","Interpersonal","Analytical","Detail-oriented","Organizational","Independent judgment","Defensible annotation decisions","Voice samples","Annotated transcripts","Audio-related work","Quality","Methodology","Attention to detail"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Disfluencies","Prosodic features","Intonation","Stress","Rhythm","Emotion","Linguistics","Phonetics","Phonology","Sociolinguistics","Speech sciences","Cognitive science","Pronunciation differences","Multilingual speech patterns","Speech/audio datasets","Annotation workflows","AI training data","Training voice models","Data quality impacts model performance","Voice work","Voice acting","Voice recording","Podcasting","Measurable audience","Clarity and recording quality"],"datePosted":"2026-04-18T15:21:00.615Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Diverse languages, Accents, Cultural contexts, High-quality audio data, Clear spoken output, Linguistic and prosodic details, Professional audio standards, Speech modulation, Accent variation, Noise in real-world recordings, Multilingual audio processing, Efficient audio workflows, Native proficiency in Hindi, English (minimum B2 level), Strong auditory perception, Multilingual audio content, Speech accuracy, Cultural vocal expressions, Contextual interpretation, Transcription, Audio quality, Voice recordings, Feedback on audio samples, Independent judgments, Ambiguous audio material, Noisy or accented speech, Communication, Interpersonal, Analytical, Detail-oriented, Organizational, Independent judgment, Defensible annotation decisions, Voice samples, Annotated transcripts, Audio-related work, Quality, Methodology, Attention to detail, Exceptional attention to linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Disfluencies, Prosodic features, Intonation, Stress, Rhythm, Emotion, Linguistics, Phonetics, Phonology, Sociolinguistics, Speech sciences, Cognitive science, Pronunciation differences, Multilingual speech patterns, Speech/audio datasets, Annotation workflows, AI training data, Training voice models, Data quality impacts model performance, Voice work, Voice acting, Voice recording, Podcasting, Measurable audience, Clarity and recording quality"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_de2d9d14-939"},"title":"AI Tutor - Spanish","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p><strong>Responsibilities:</strong> Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards. Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. Work with technical staff to improve annotation tools for efficient audio workflows.</p>\n<p><strong>Basic Qualifications:</strong> Native proficiency in Spanish with exposure to diverse accents, dialects, or regional variations. Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form. Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality. Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages. Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech. Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively. Commitment to developing AI that masters sophisticated multilingual audio capabilities.</p>\n<p><strong>Preferred Skills and Experience:</strong> Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work. Deep understanding and taste of what good/useful Audio data is. Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy. Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns. Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance. Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality. Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions. Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail. Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_de2d9d14-939","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090264007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Spanish","English","Audio annotation","Speech recognition","Multilingual audio processing"],"x-skills-preferred":["Advanced transcription and annotation practices","Linguistics","Speech sciences","Cognitive science","Voice work"],"datePosted":"2026-04-18T15:20:57.454Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Spanish, English, Audio annotation, Speech recognition, Multilingual audio processing, Advanced transcription and annotation practices, Linguistics, Speech sciences, Cognitive science, Voice work"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_a15f16b0-f9b"},"title":"AI Tutor - Hebrew","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Hebrew with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<ul>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n</ul>\n<ul>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n</ul>\n<ul>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<ul>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n</ul>\n<ul>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_a15f16b0-f9b","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090206007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","High-quality audio data","Speech recognition","Auditory experiences","Diverse languages","Accents","Cultural contexts","Audio clips","Voice recordings","Speech samples","Auditory elements","Professional audio standards","Speech modulation","Accent variation","Noise in real-world recordings","Multilingual audio processing","Annotation tools","Efficient audio workflows","Native proficiency in Hebrew","Proficiency in English","Strong auditory perception","Multilingual audio content","Speech accuracy","Cultural vocal expressions","Contextual interpretation","Transcription","Audio quality","Comfort providing high-quality voice recordings","Feedback on audio samples","Strong comprehension skills","Independent judgments","Ambiguous audio material","Noisy or accented speech","Communication skills","Interpersonal skills","Analytical skills","Detail-oriented skills","Organizational skills","Commitment to developing AI","Sophisticated multilingual audio capabilities"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Auditory detail","Data quality","Advanced transcription and annotation practices","Handling disfluencies","Prosodic features","Intonation","Stress","Rhythm","Emotion","Background in linguistics","Speech sciences","Cognitive science","Linguistics","Phonetics","Phonology","Sociolinguistics","Pronunciation differences","Multilingual speech patterns","Experience working with speech/audio datasets","Annotation workflows","AI training data","Training voice models","Data quality impacts model performance","Professional experience in voice work","Voice acting","Voice recording","Podcasting","Audio production","Attention to clarity and recording quality","Independent judgment in ambiguous audio scenarios","Defensible annotation decisions","Portfolio","Voice samples","Annotated transcripts","Audio-related work","Quality","Methodology","Attention to detail"],"datePosted":"2026-04-18T15:20:42.554Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, High-quality audio data, Speech recognition, Auditory experiences, Diverse languages, Accents, Cultural contexts, Audio clips, Voice recordings, Speech samples, Auditory elements, Professional audio standards, Speech modulation, Accent variation, Noise in real-world recordings, Multilingual audio processing, Annotation tools, Efficient audio workflows, Native proficiency in Hebrew, Proficiency in English, Strong auditory perception, Multilingual audio content, Speech accuracy, Cultural vocal expressions, Contextual interpretation, Transcription, Audio quality, Comfort providing high-quality voice recordings, Feedback on audio samples, Strong comprehension skills, Independent judgments, Ambiguous audio material, Noisy or accented speech, Communication skills, Interpersonal skills, Analytical skills, Detail-oriented skills, Organizational skills, Commitment to developing AI, Sophisticated multilingual audio capabilities, Exceptional attention to linguistic nuance, Auditory detail, Data quality, Advanced transcription and annotation practices, Handling disfluencies, Prosodic features, Intonation, Stress, Rhythm, Emotion, Background in linguistics, Speech sciences, Cognitive science, Linguistics, Phonetics, Phonology, Sociolinguistics, Pronunciation differences, Multilingual speech patterns, Experience working with speech/audio datasets, Annotation workflows, AI training data, Training voice models, Data quality impacts model performance, Professional experience in voice work, Voice acting, Voice recording, Podcasting, Audio production, Attention to clarity and recording quality, Independent judgment in ambiguous audio scenarios, Defensible annotation decisions, Portfolio, Voice samples, Annotated transcripts, Audio-related work, Quality, Methodology, Attention to detail"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_9c65d655-aff"},"title":"AI Tutor - Finnish","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Finnish with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<ul>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n</ul>\n<ul>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n</ul>\n<ul>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<ul>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n</ul>\n<ul>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_9c65d655-aff","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090199007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract","x-salary-range":"$35/hour - $45/hour","x-skills-required":["multilingual audio capabilities","voice interactions","speech recognition","auditory experiences","linguistic and prosodic details","intonation","rhythm","accent","professional audio standards","speech modulation","accent variation","noise in real-world recordings","multilingual audio processing","annotation tools","efficient audio workflows","native proficiency in Finnish","English (minimum B2 level)","strong auditory perception","nuances in speech","accents","pronunciation","audio quality","multilingual audio content","speech accuracy","cultural vocal expressions","contextual interpretation","transcription","high accuracy","various audio quality","voice recordings","feedback on audio samples","independent judgments","ambiguous or varied audio material","noisy or accented speech","communication","interpersonal","analytical","detail-oriented","organizational","audio-related feedback","commitment to developing AI"],"x-skills-preferred":["exceptional attention to linguistic nuance","auditory detail","data quality","deep understanding and taste of what good/useful Audio data is","strong command of advanced transcription and annotation practices","handling disfluencies","prosodic features","background in linguistics","phonetics","phonology","sociolinguistics","speech sciences","cognitive science","equivalent practical experience","analysis of accent variation","pronunciation differences","multilingual speech patterns","experience working with speech/audio datasets","annotation workflows","AI training data","training voice models","understanding of how data quality impacts model performance","professional experience in voice work","voice acting","voice recording","podcasting","measurable audience","similar audio production","exercise independent judgment","defensible annotation decisions","portfolio","voice samples","annotated transcripts","audio-related work","quality","methodology","attention to detail"],"datePosted":"2026-04-18T15:19:34.287Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"multilingual audio capabilities, voice interactions, speech recognition, auditory experiences, linguistic and prosodic details, intonation, rhythm, accent, professional audio standards, speech modulation, accent variation, noise in real-world recordings, multilingual audio processing, annotation tools, efficient audio workflows, native proficiency in Finnish, English (minimum B2 level), strong auditory perception, nuances in speech, accents, pronunciation, audio quality, multilingual audio content, speech accuracy, cultural vocal expressions, contextual interpretation, transcription, high accuracy, various audio quality, voice recordings, feedback on audio samples, independent judgments, ambiguous or varied audio material, noisy or accented speech, communication, interpersonal, analytical, detail-oriented, organizational, audio-related feedback, commitment to developing AI, exceptional attention to linguistic nuance, auditory detail, data quality, deep understanding and taste of what good/useful Audio data is, strong command of advanced transcription and annotation practices, handling disfluencies, prosodic features, background in linguistics, phonetics, phonology, sociolinguistics, speech sciences, cognitive science, equivalent practical experience, analysis of accent variation, pronunciation differences, multilingual speech patterns, experience working with speech/audio datasets, annotation workflows, AI training data, training voice models, understanding of how data quality impacts model performance, professional experience in voice work, voice acting, voice recording, podcasting, measurable audience, similar audio production, exercise independent judgment, defensible annotation decisions, portfolio, voice samples, annotated transcripts, audio-related work, quality, methodology, attention to detail"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_02d3881d-a73"},"title":"AI Tutor - Dutch","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Dutch with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<ul>\n<li>Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.</li>\n</ul>\n<ul>\n<li>Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.</li>\n</ul>\n<ul>\n<li>Commitment to developing AI that masters sophisticated multilingual audio capabilities.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<ul>\n<li>Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.</li>\n</ul>\n<ul>\n<li>Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_02d3881d-a73","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090197007","x-work-arrangement":"remote","x-experience-level":"mid","x-job-type":"full-time|part-time|contract|temporary|internship","x-salary-range":"$35/hour - $45/hour","x-skills-required":["multilingual audio capabilities","voice interactions","speech recognition","auditory experiences","linguistic and prosodic details","intonation","rhythm","accent","professional audio standards","speech modulation","accent variation","noise in real-world recordings","multilingual audio processing","annotation tools","efficient audio workflows","native proficiency in Dutch","proficiency in English","auditory perception","nuances in speech","accents","pronunciation","audio quality","speech accuracy","cultural vocal expressions","contextual interpretation","transcription","high accuracy","vocal delivery","clarity","recording quality","independent judgments","ambiguous audio scenarios","defensible annotation decisions","portfolio","voice samples","annotated transcripts","audio-related work","quality","methodology","attention to detail","voice work","voice acting","voice recording","podcasting","measurable audience","speech sciences","cognitive science","linguistics","phonetics","phonology","sociolinguistics","pronunciation differences","multilingual speech patterns","speech/audio datasets","annotation workflows","AI training data","training voice models","data quality","model performance","independent judgment"],"x-skills-preferred":["exceptional attention to linguistic nuance","auditory detail","deep understanding","taste of what good/useful Audio data is","strong command of advanced transcription and annotation practices","handling disfluencies","prosodic features","high consistency and accuracy","background in linguistics","equivalent practical experience","analyzing accent variation","experienced working with speech/audio datasets","knowledge/experience with training voice models","understanding how data quality impacts model performance","professional experience in voice work","similar audio production","attention to clarity"],"datePosted":"2026-04-18T15:19:04.366Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"multilingual audio capabilities, voice interactions, speech recognition, auditory experiences, linguistic and prosodic details, intonation, rhythm, accent, professional audio standards, speech modulation, accent variation, noise in real-world recordings, multilingual audio processing, annotation tools, efficient audio workflows, native proficiency in Dutch, proficiency in English, auditory perception, nuances in speech, accents, pronunciation, audio quality, speech accuracy, cultural vocal expressions, contextual interpretation, transcription, high accuracy, vocal delivery, clarity, recording quality, independent judgments, ambiguous audio scenarios, defensible annotation decisions, portfolio, voice samples, annotated transcripts, audio-related work, quality, methodology, attention to detail, voice work, voice acting, voice recording, podcasting, measurable audience, speech sciences, cognitive science, linguistics, phonetics, phonology, sociolinguistics, pronunciation differences, multilingual speech patterns, speech/audio datasets, annotation workflows, AI training data, training voice models, data quality, model performance, independent judgment, exceptional attention to linguistic nuance, auditory detail, deep understanding, taste of what good/useful Audio data is, strong command of advanced transcription and annotation practices, handling disfluencies, prosodic features, high consistency and accuracy, background in linguistics, equivalent practical experience, analyzing accent variation, experienced working with speech/audio datasets, knowledge/experience with training voice models, understanding how data quality impacts model performance, professional experience in voice work, similar audio production, attention to clarity"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_5fc4a599-f52"},"title":"AI Tutor - Arabic","description":"<p>As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI&#39;s mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.</p>\n<p>Your work will focus on curating and annotating high-quality audio data to enhance Grok&#39;s global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI&#39;s handling of multilingual audio nuances.</p>\n<p>Responsibilities:</p>\n<ul>\n<li>Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.</li>\n</ul>\n<ul>\n<li>Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.</li>\n</ul>\n<ul>\n<li>Collaborate with technical staff to develop tasks that improve AI&#39;s ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.</li>\n</ul>\n<ul>\n<li>Work with technical staff to improve annotation tools for efficient audio workflows.</li>\n</ul>\n<p>Basic Qualifications:</p>\n<ul>\n<li>Native proficiency in Arabic with exposure to diverse accents, dialects, or regional variations.</li>\n</ul>\n<ul>\n<li>Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.</li>\n</ul>\n<ul>\n<li>Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.</li>\n</ul>\n<ul>\n<li>Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.</li>\n</ul>\n<p>Preferred Skills and Experience:</p>\n<ul>\n<li>Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.</li>\n</ul>\n<ul>\n<li>Deep understanding and taste of what good/useful Audio data is.</li>\n</ul>\n<ul>\n<li>Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc) with high consistency and accuracy.</li>\n</ul>\n<ul>\n<li>Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.</li>\n</ul>\n<ul>\n<li>Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.</li>\n</ul>\n<ul>\n<li>Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.</li>\n</ul>\n<ul>\n<li>Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.</li>\n</ul>\n<p>Location and Other Expectations:</p>\n<ul>\n<li>Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit.</li>\n</ul>\n<ul>\n<li>For contractor positions, hours will vary widely based on project scope and contractor availability, with no fixed commitments required. On average, most projects may require at least 10 hours per week to deliver effectively, though this is not a fixed commitment and depends on the scope of work. Contractors have full flexibility to set their own hours and determine the exact amount of time needed to complete deliverables.</li>\n</ul>\n<ul>\n<li>Tutor roles may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.</li>\n</ul>\n<ul>\n<li>For US-based candidates, please note that we are unable to hire in Wyoming and Illinois at this time.</li>\n</ul>\n<ul>\n<li>We are unable to provide visa sponsorship.</li>\n</ul>\n<ul>\n<li>For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or Windows 10 or later.</li>\n</ul>\n<p>Compensation and Benefits:</p>\n<p>US-based candidates: $35/hour - $45/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.</p>\n<p>Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_5fc4a599-f52","directApply":true,"hiringOrganization":{"@type":"Organization","name":"xAI","sameAs":"https://www.xai.com/","logo":"https://logos.yubhub.co/xai.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/xai/jobs/5090171007","x-work-arrangement":"Remote","x-experience-level":"Entry-level","x-job-type":"Full-time, Part-time, Contractor","x-salary-range":"$35/hour - $45/hour","x-skills-required":["Multilingual audio capabilities","Proprietary software","Audio data curation","Annotation tools","Speech recognition","Auditory experiences","Linguistic and prosodic details","Professional audio standards","Accent variation","Noise in real-world recordings","Multilingual audio processing","Independent judgment","Ambiguous audio scenarios","Defensible annotation decisions"],"x-skills-preferred":["Exceptional attention to linguistic nuance","Deep understanding of good/useful Audio data","Advanced transcription and annotation practices","Handling disfluencies, accents, and prosodic features","Background in linguistics, speech sciences, or cognitive science","Experience working with speech/audio datasets","Knowledge/experience with training voice models","Understanding of how data quality impacts model performance","Professional experience in voice work","Voice acting, voice recording, or podcasting"],"datePosted":"2026-04-18T15:17:39.431Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Remote"}},"jobLocationType":"TELECOMMUTE","employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"Multilingual audio capabilities, Proprietary software, Audio data curation, Annotation tools, Speech recognition, Auditory experiences, Linguistic and prosodic details, Professional audio standards, Accent variation, Noise in real-world recordings, Multilingual audio processing, Independent judgment, Ambiguous audio scenarios, Defensible annotation decisions, Exceptional attention to linguistic nuance, Deep understanding of good/useful Audio data, Advanced transcription and annotation practices, Handling disfluencies, accents, and prosodic features, Background in linguistics, speech sciences, or cognitive science, Experience working with speech/audio datasets, Knowledge/experience with training voice models, Understanding of how data quality impacts model performance, Professional experience in voice work, Voice acting, voice recording, or podcasting"},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_58928a28-64d"},"title":"Research Engineer/Research Scientist, Audio","description":"<p><strong>About Anthropic</strong></p>\n<p>Anthropic&#39;s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.</p>\n<p><strong>You may be a good fit if you:</strong></p>\n<ul>\n<li>Have hands-on experience with training audio models, whether that&#39;s conversational speech-to-speech, speech translation, speech recognition, text-to-speech, diarization, codecs, or generative audio models</li>\n<li>Genuinely enjoy both research and engineering work, and you&#39;d describe your ideal split as roughly 50/50 rather than heavily weighted toward one or the other</li>\n<li>Are comfortable working across abstraction levels, from signal processing fundamentals to large-scale model training and inference optimization</li>\n<li>Have deep expertise with JAX, PyTorch, or large-scale distributed training, and can debug performance issues across the full stack</li>\n<li>Thrive in fast-moving environments where the most important problem might shift as we learn more about what works</li>\n<li>Communicate clearly and collaborate effectively; audio touches many parts of our systems, so you&#39;ll work closely with teams across the company</li>\n<li>Are passionate about building conversational AI that feels natural, steerable, and safe</li>\n<li>Care about the societal impacts of voice AI and want to help shape how these systems are developed responsibly</li>\n</ul>\n<p><strong>Strong candidates may also have experience with:</strong></p>\n<ul>\n<li>Large language model pretraining and finetuning</li>\n<li>Training diffusion models for image and audio generation</li>\n<li>Reinforcement learning for large language models and diffusion models</li>\n<li>End-to-end system optimization, from performance benchmarking to kernel optimization</li>\n<li>GPUs, Kubernetes, PyTorch, or distributed training infrastructure</li>\n</ul>\n<p><strong>Representative projects:</strong></p>\n<ul>\n<li>Training state-of-the art neural audio codecs for 48 kHz stereo audio</li>\n<li>Developing novel algorithms for diffusion pretraining and reinforcement learning</li>\n<li>Scaling audio datasets to millions of hours of high quality audio</li>\n<li>Creating robust evaluation methodologies for hard-to-measure qualities such as naturalness or expressiveness</li>\n<li>Studying training dynamics of mixed audio-text language models</li>\n<li>Optimizing latency and inference throughput for deployed streaming audio systems</li>\n</ul>\n<p><strong>Logistics</strong></p>\n<p><strong>Education requirements:</strong> We require at least a Bachelor&#39;s degree in a related field or equivalent experience. <strong>Location-based hybrid policy:</strong> Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.</p>\n<p><strong>Visa sponsorship:</strong> We do sponsor visas! However, we aren&#39;t able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.</p>\n<p><strong>We encourage you to apply even if you do not believe you meet every single qualification.</strong> Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you&#39;re interested in this work.</p>\n<p><strong>Your safety matters to us.</strong> To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you&#39;re ever unsure about a communication, don&#39;t click any links—visit anthropic.com/careers directly for confirmed position openings.</p>\n<p><strong>How we&#39;re different</strong></p>\n<p>We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI systems that benefit society.</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_58928a28-64d","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Anthropic","sameAs":"https://job-boards.greenhouse.io","logo":"https://logos.yubhub.co/anthropic.com.png"},"x-apply-url":"https://job-boards.greenhouse.io/anthropic/jobs/5074815008","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$350,000 - $500,000 USD","x-skills-required":["audio models","speech-to-speech","speech translation","speech recognition","text-to-speech","diarization","codecs","generative audio models","JAX","PyTorch","large-scale distributed training"],"x-skills-preferred":["large language model pretraining","training diffusion models","reinforcement learning","end-to-end system optimization","GPUs","Kubernetes","PyTorch","distributed training infrastructure"],"datePosted":"2026-03-08T13:46:24.550Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Francisco, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"audio models, speech-to-speech, speech translation, speech recognition, text-to-speech, diarization, codecs, generative audio models, JAX, PyTorch, large-scale distributed training, large language model pretraining, training diffusion models, reinforcement learning, end-to-end system optimization, GPUs, Kubernetes, PyTorch, distributed training infrastructure","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":350000,"maxValue":500000,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_601ca6bf-9b1"},"title":"Senior Machine Learning Engineer, Natural Language Processing - PhD Early Career","description":"<p><strong>[2026] Senior Machine Learning Engineer, Natural Language Processing - PhD Early Career</strong></p>\n<p>San Mateo, CA, United States</p>\n<p>Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.</p>\n<p>At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.</p>\n<p>A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.</p>\n<p>Natural Language Processing (NLP) is central to enabling massive-scale communication, creation, and safety across the Roblox platform. This role offers the unique opportunity to build and deploy cutting-edge <strong>NLP, speech, and generative AI models</strong> that operate at an unprecedented scale, impacting hundreds of millions of daily users.</p>\n<p>You will solve an extremely diverse range of high-scale language-related problems—from <strong>real-time moderation of voice and text</strong> to <strong>automatically localizing experiences</strong> and empowering users through <strong>LLM-driven creation tools</strong>. We combine cutting-edge research with large-scale engineering to bridge experimentation and production, designing algorithms that shape the next generation of language services for our immersive, user-generated content platform.</p>\n<p><strong><strong>Teams Hiring for This Role</strong></strong></p>\n<ul>\n<li><strong>Safety AI Systems:</strong>Dedicated to building end-to-end ML systems for maintaining civility and safety across the platform, operating at massive scale. This includes:</li>\n</ul>\n<ul>\n<li><strong>Real-time Moderation:</strong> Building world-class NLP and speech models for <strong>real-time moderation of voice and text</strong> (processing over 6 billion messages daily) and advanced interventions that measurably improve user civility.</li>\n</ul>\n<ul>\n<li><strong>Critical Harms &amp; Advanced Detection:</strong> Developing specialized LLM agents, behavioral analysis, and graph systems for detecting and preventing rare, high-risk scenarios (e.g., child safety, terrorism), requiring adversarial thinking and multi-step reasoning.</li>\n</ul>\n<ul>\n<li><strong>Safety Data Quality:</strong> Ensuring all Safety AI systems are robust by managing the core data infrastructure, MLOps, and Active Learning initiatives for continuous model improvement.</li>\n</ul>\n<p><strong>You Will</strong></p>\n<ul>\n<li>Design and implement <strong>deep learning-based NLP and speech solutions</strong> that address problems across Roblox, from creation to safety.</li>\n</ul>\n<ul>\n<li>Develop advanced models, including <strong>Large Language Models (LLMs), machine translation, and generative AI</strong>, for user interactions, content creation, and moderation.</li>\n</ul>\n<ul>\n<li>Have the independence and <strong>end-to-end responsibility</strong> to develop NLP/ML-based services that are scalable and resilient.</li>\n</ul>\n<ul>\n<li>Be a <strong>technical bar-raiser</strong> for cutting-edge ML technology, high code quality, and architectural designs.</li>\n</ul>\n<ul>\n<li>Work backward from user and product needs to deliver ML solutions that drive engagement, safety, and ecosystem growth.</li>\n</ul>\n<p><strong>You Have</strong></p>\n<ul>\n<li>Possessing or pursuing a Ph.D. in Computer Science, Computer Engineering, Mathematics, Statistics, or a related technical field, with a thesis aligned to Roblox’s research areas.</li>\n</ul>\n<ul>\n<li>Expertise in one or more areas: NLP, Speech Models, Large Language Models, Machine Translation, or Generative AI (including diffusion models).</li>\n</ul>\n<ul>\n<li>Experience with transformer-based model design, training, serving, and product integration.</li>\n</ul>\n<ul>\n<li>A strong research track record, evidenced by multiple publications and presentations in top-tier, peer-reviewed venues (e.g., ACL, EMNLP, Interspeech, ICML, NeurIPS).</li>\n</ul>\n<ul>\n<li>Proficiency in one or more programming languages (e.g., Python, C++, Go, Java) and experience building and optimizing large-scale systems.</li>\n</ul>\n<p>You may redact age, date of birth, and dates of attendance/graduation from your resume if you prefer.</p>\n<p>As you apply, you can find more information about our process by signing up for Speak\\_. You&#39;ll gain access to our practice assessment, comprehensive guides, FAQs, and modules designed to help you ace the hiring process.</p>\n<p>For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on <strong>this page</strong>.</p>\n<p>Annual Salary Range</p>\n<p>$195,780—$242,100 USD</p>\n<p>Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).</p>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_601ca6bf-9b1","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Roblox","sameAs":"https://careers.roblox.com","logo":"https://logos.yubhub.co/careers.roblox.com.png"},"x-apply-url":"https://careers.roblox.com/jobs/7324377","x-work-arrangement":"hybrid","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$195,780—$242,100 USD","x-skills-required":["NLP","Speech Models","Large Language Models","Machine Translation","Generative AI","Python","C++","Go","Java","Transformer-based model design","Training","Serving","Product integration"],"x-skills-preferred":["Deep learning","Computer vision","Natural language processing","Speech recognition","Text analysis","Sentiment analysis","Named entity recognition","Part-of-speech tagging","Dependency parsing","Semantic role labeling"],"datePosted":"2026-03-06T14:18:50.958Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"San Mateo, CA"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"NLP, Speech Models, Large Language Models, Machine Translation, Generative AI, Python, C++, Go, Java, Transformer-based model design, Training, Serving, Product integration, Deep learning, Computer vision, Natural language processing, Speech recognition, Text analysis, Sentiment analysis, Named entity recognition, Part-of-speech tagging, Dependency parsing, Semantic role labeling","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":195780,"maxValue":242100,"unitText":"YEAR"}}},{"@context":"https://schema.org","@type":"JobPosting","identifier":{"@type":"PropertyValue","name":"YubHub","value":"job_c98c4304-e19"},"title":"Principal Research Engineer","description":"<p>The Epic Games Research &amp; Development team is searching for an experienced hands-on Research Programmer. The ideal candidate will bring a passion for digital humans and experience with machine learning and animation tech.</p>\n<p><strong>What you&#39;ll do</strong></p>\n<p>The Research Programmer will be responsible for researching and prototyping machine learning and generative approaches for game content creation, working with development programmers and product programmers to deploy and productize novel technology results in Epic products, and engaging with the scientific community to contribute domain knowledge and advice in support of strategically relevant R&amp;D efforts.</p>\n<p><strong>What you need</strong></p>\n<ul>\n<li>PhD degree in Computer Science, Machine Learning, Mathematics, Programming, or a related discipline</li>\n</ul>\n<p style=\"margin-top:24px;font-size:13px;color:#666;\">XML job scraping automation by <a href=\"https://yubhub.co\">YubHub</a></p>","url":"https://yubhub.co/jobs/job_c98c4304-e19","directApply":true,"hiringOrganization":{"@type":"Organization","name":"Epic Games","sameAs":"https://www.epicgames.com","logo":"https://logos.yubhub.co/epicgames.com.png"},"x-apply-url":"https://www.epicgames.com/en-US/careers/jobs/5500537004","x-work-arrangement":"onsite","x-experience-level":"senior","x-job-type":"full-time","x-salary-range":"$270,572—$396,838 USD (California Base Pay Range)","x-skills-required":["PhD degree in Computer Science, Machine Learning, Mathematics, Programming, or a related discipline","Proven applied research impact in shipping titles","Strong analytical and reasoning skills with an emphasis on innovative and practical solutions","Excellent programming skills with a preference for experience with Python and C++"],"x-skills-preferred":["Experience working in common ML frameworks such as PyTorch or Tensorflow, and rapid prototyping in Python","In-depth knowledge in one or more of the following areas: real-time graphics, computer vision, machine learning, animation, large-language models, speech recognition, speech synthesis, linguistics, performance-capture, 3D reconstruction"],"datePosted":"2026-01-08T03:13:29.121Z","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Multiple Locations"}},"employmentType":"FULL_TIME","occupationalCategory":"Engineering","industry":"Technology","skills":"PhD degree in Computer Science, Machine Learning, Mathematics, Programming, or a related discipline, Proven applied research impact in shipping titles, Strong analytical and reasoning skills with an emphasis on innovative and practical solutions, Excellent programming skills with a preference for experience with Python and C++, Experience working in common ML frameworks such as PyTorch or Tensorflow, and rapid prototyping in Python, In-depth knowledge in one or more of the following areas: real-time graphics, computer vision, machine learning, animation, large-language models, speech recognition, speech synthesis, linguistics, performance-capture, 3D reconstruction","baseSalary":{"@type":"MonetaryAmount","currency":"USD","value":{"@type":"QuantitativeValue","minValue":270572,"maxValue":396838,"unitText":"YEAR"}}}]}