AI Speech & Audio Projects

Looking for freelance AI Speech & Audio jobs and project work? PeoplePerHour has you covered.

3 results

by Marielle J.
$146
English (UK) Dialogue Recording Project
About This Opportunity: We are recruiting first-language speakers from the United Kingdom to participate in a speech data collection project. You will record natural conversations with a partner to help train advanced AI speech models. This is straightforward, flexible work that you can do from home.
Locations: Currently recruiting remote, first-language speakers of British English.
What You'll Do:
• Record dual-speaker conversations on 7 different topics with a conversation partner (your friend, co-worker, family member, etc.)
• Contribute approximately 2.5 hours of conversational audio
• Use a smartphone to capture clear, high-quality audio in a quiet environment
• Engage in natural, conversational dialogue on assigned topics
Who We're Looking For:
• First-language speaker proficiency in British English with clear, natural spoken delivery
• Ownership of a smartphone capable of recording audio
• Access to a quiet indoor space for recording (remote work-friendly)
• Reliable, attentive approach to following recording guidelines
• Comfort with natural, conversational dialogue; no professional acting experience needed
• Ability to deliver high-quality audio files on schedule
11 days ago · 6 proposals · Remote
by Shady M.
$180
Icelandic Speakers for Data Audio Collection Project
We’re currently inviting native Icelandic speakers from Iceland to join a paid, fully remote AI audio task. Who we’re looking for: Icelandic speakers from Iceland. Task details: 10 short phone calls with an AI bot; $18 per call ($180 total); around 30 minutes to complete; 100% remote; no experience needed. How to get started: Complete a quick 30-second Icelandic verification on Clickworker (required). Once approved, you’ll get access to the paid audio task.
a month ago · 0 proposals · Remote
by Danny P.
$146
AI assistant call handling
I’m looking for a developer to build an AI-powered call handling and dispatch system for emergency/facilities maintenance calls, similar to corecall.co.uk. I have an existing IP phone system this needs to integrate with (Hihi).
What it needs to do:
• Answer inbound calls via AI (voice agent), 24/7
• Have a natural conversation to capture caller name, location, fault description, and urgency
• Automatically match and dispatch the right engineer based on skill, location, and availability
• Send SMS/email notifications to both the engineer and the customer
• Provide a live dashboard showing incidents, statuses, and engineer availability
• Integrate with our existing systems
Ideal experience:
• Voice AI platforms (Vapi, Retell, Bland, or similar)
• SIP/VoIP and PBX integration (3CX experience a strong plus)
• Backend development (Node.js or similar): APIs, webhooks, databases
• Twilio or similar for SMS/email notifications
• Comfortable building a clean operational dashboard
Nice to have:
• Prior experience building call-handling/dispatch systems for trades, FM, or field service businesses
• Experience with Anthropic/OpenAI APIs for conversational logic
To apply, please include:
• Relevant past projects (especially anything involving voice AI or SIP/PBX integration)
• Your estimated hourly rate and roughly how many hours you’d expect this to take
• Any questions about scope
I already have a working architecture plan and some backend code scaffolded, so this isn’t starting from zero; happy to share on request.
PLEASE DO NOT ADD ME ON LINKEDIN, EMAIL ME, OR MAKE ANY CONTACT OUTSIDE OF THIS PLATFORM. I WILL REPORT AND BLOCK.
a month ago · 27 proposals · Remote
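
The dispatch requirement in the listing above (match an engineer on skill, location, and availability) could be sketched roughly as follows. This is only an illustration under stated assumptions: the `Engineer` fields, the `pick_engineer` function, and the choice of "nearest available engineer by straight-line distance" are hypothetical and are not taken from the client's architecture plan.

```python
from dataclasses import dataclass
from math import asin, cos, radians, sin, sqrt
from typing import List, Optional, Set


@dataclass
class Engineer:
    # Hypothetical record; a real system would load this from its own database.
    name: str
    skills: Set[str]
    lat: float
    lon: float
    available: bool


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance between two points in kilometres."""
    dlat = radians(lat2 - lat1)
    dlon = radians(lon2 - lon1)
    a = sin(dlat / 2) ** 2 + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(a))


def pick_engineer(engineers: List[Engineer], skill: str,
                  lat: float, lon: float) -> Optional[Engineer]:
    """Return the nearest available engineer who has the required skill, or None."""
    candidates = [e for e in engineers if e.available and skill in e.skills]
    return min(
        candidates,
        key=lambda e: haversine_km(lat, lon, e.lat, e.lon),
        default=None,
    )
```

Urgency, travel time, and workload are left out here; a production dispatcher would likely weigh those as well rather than distance alone.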

Past Projects

by Abhishek K.

$100

Native UK English and Polish Speakers for Voice Recording

We are looking for native speakers to participate in a paid voice recording assignment. Open Positions: • 2 Female Native UK English Speakers • 10 Native Polish Speakers (Male or Female) Task Overview: Participants will be required to record a series of provided scripts, prompts, and short passages using their natural speaking voice. Detailed instructions will be shared with selected applicants before the project begins. For transparency, sample scripts have been attached to this project listing for reference so applicants can review the nature of the recording task before applying. Requirements: • Native speaker of the required language • Clear and natural speaking voice • Ability to follow recording instructions accurately • Quiet environment suitable for recording • Reliable internet connection Project Details: • Remote work opportunity • Flexible schedule • Short-term assignment • Fixed-price compensation • Multiple openings available How to Apply: Please submit your application with the following information: • Native language • Current country of residence • Voice recording or voice-over experience (if any) • A brief introduction about yourself Relevant sample scripts are attached for reference and review. We look forward to hearing from qualified candidates.

by Labradr D.

$100

Train Voice AI with Native US English

Overview I need help training my voice AI agent model with native American English, necessary for developing AI HR and AI chief of staff capabilities. Scope of work - Train voice AI agent model using native American English inputs - Focus on AI development for HR and chief of staff roles - Collaborate with other native speakers to ensure diverse input quality Company details - AI development AI voice language English Ideal candidate Freelancer speaks: English: Native/bilingual only Location: United States

by Kumail R.

$146

AI Speech Expert Needed

Seeking an AI speech expert to analyze and optimize speech recognition and synthesis systems. Responsibilities include evaluating current models, improving transcription accuracy, reducing latency, refining language models for diverse accents and noisy environments, and recommending architecture or data augmentation strategies. Candidate should possess deep experience with ASR, TTS, neural architectures, transfer learning, and evaluation metrics. Deliverables: technical audit, actionable roadmap, and performance benchmarks to guide implementation.

by Cristobal O.

$50

Upgrade voice from theater play video 1 hr

I need to enhance and isolate the voice from a cellphone recording of a theater play, removing background noise. The video is one hour long.

by Sommer C.

$53

ACX audio book voice over

I am looking for a voiceover artist to narrate my short e-book of 10,000 words. The book would suit a motherly British/Londoner accent and someone who can relate to the content. It is also a guide book, so it needs touches of humour at some points and should not be monotone. The speaker will hold no rights to the book and will be required to sign an NDA. You will need to deliver the final copy in ACX format and therefore know what this is. A suggested AI tool is elevenlabs.io, as they provide a great range of voices and tones to suit the requirements of my e-book. Delivery of the book should take no more than 3 hours. I would ask that the hired freelancer provides an example of the chosen voice before moving forward with the whole project, as the voice and tone are really important for the book.

by Prashanta P.

$10

Audio Data Collection with Wireless Earbuds

I need two participants for an audio data collection project using wireless earbuds. The task involves recording natural conversation between two people in a quiet indoor environment using the Riverside platform.
Project Details:
* Total Sessions: 2 recordings (10 minutes each)
* Participants: Exactly 2 people
* Device Requirement: Wireless earbuds with microphone (e.g., AirPods or similar)
* Audio Format: WAV
* Recording Method: Audio must be captured only through the earbuds microphone (not phone/laptop mic)
Recording Process:
1. Session 1 (10 minutes):
   * Person A wears earbuds (primary speaker)
   * Person B sits 1–3 meters away (secondary speaker)
2. Session 2 (10 minutes):
   * Roles are switched
   * Person B wears earbuds (primary speaker)
   * Person A sits 1–3 meters away
Requirements:
* Continuous recording (no pauses, cuts, or edits)
* Natural conversation (no scripted reading)
* Distance must remain between 1 and 3 meters
* Both participants must be physically present in the same room
* Earbuds must remain in use throughout the session
Additional Requirements:
* Provide metadata including:
  * Earbud brand/model
  * Distance between participants
  * Ages of participants
  * Recording duration
  * Environment details (room setup, objects)
  * Background noise type and level
  * Room size category
  * Links to uploaded WAV files
Important Notes:
* Perform a short test recording before starting
* Ensure devices are fully charged
* Follow all instructions strictly to avoid rejection
Deliverables:
* Two 10-minute WAV audio files (one per primary speaker session)
* Completed metadata sheet with all required details
This is a simple task but requires strict adherence to guidelines and high-quality, natural audio recording.

by Gary D.

$333

VAPI AI Customer service agent development

Looking for somebody to create an AI agent for a client to provide customer service support for their customers over the phone.

by Yousuf A.

$12

Thai Voice Sentence Recording

I’m collecting a set of 300 short Thai sentences for speech-training research and I’d like a native speaker to record them directly in our mobile app. You’ll be working with a smartphone—any operating system is fine, even alternatives beyond iOS or Android—and you should be able to record in a completely quiet space so there’s no background noise on the clips. Once you accept, I’ll send login credentials, a step-by-step recording guideline, and a link to the app. The workflow is straightforward: open the script inside the app, tap to record each sentence, review the waveform for clarity, and save. The system automatically uploads every take, so there’s no post-processing required on your side. Deliverable • 300 clearly spoken Thai sentences, captured and uploaded through the app, passing the built-in silence and clipping checks. I release payment after the platform confirms that all 300 files meet the quality threshold for volume, pronunciation, and absence of background noise. If you’re a native speaker with a quiet room and a smartphone, this should take less than an hour. Feel free to apply and I’ll get you set up right away.

by Kathy V.

$146

Promo video from photos

## AI Promotional Video Needed for New Medical Lifting Aid Product (Healthcare / Assistive Device)

### Project Overview

We are launching a new healthcare lifting aid called Lone Raiser, designed to help elderly, disabled, and vulnerable individuals safely stand and transfer independently while reducing injury risk for carers. We are looking for a freelancer to create a professional AI-generated promotional video using product photos, branding, and a script. The video will be used for marketing to care homes, NHS, private carers, and families.

---

### What We Need

We require a 30–45 second promotional video that:

- Clearly explains the problem and solution
- Demonstrates the product visually using AI or animation
- Builds trust and credibility in a healthcare setting
- Looks modern, clean, and professional
- Includes motion graphics and text overlays
- Includes voiceover (UK English preferred)

We will provide:

- Product photos and concept visuals
- Logo and branding
- Key benefits and messaging
- Target audience information
- Guidance on tone and style

We are open to:

- AI video creation
- Motion graphics
- Medical explainer animation
- Product demo style
- AI avatar presenter (optional but welcome)

---

### Target Audience

- Care homes
- Healthcare providers
- Occupational therapists
- NHS and private healthcare
- Families caring for elderly relatives

The tone should feel:

- Professional
- Trustworthy
- Safe
- Healthcare focused
- Not overly “salesy”

---

### Key Product Benefits to Highlight

- Reduces risk of injury for carers
- Promotes patient independence and dignity
- Safe and easy lifting support
- Compact and discreet design
- Suitable for home and care environments

---

### Deliverables

- 1 main video (30–45 seconds)
- 1–2 short cutdowns for social media (optional)
- Voiceover and captions
- Editable source file preferred

---

### Budget

We are a startup, so we are looking for cost-effective options and potential long-term collaboration if results are strong.

Please include:

- Examples of healthcare or product videos
- AI or animation work
- Estimated turnaround time
- Suggested creative approach

---

### Future Opportunity

We plan to produce multiple marketing videos, so this could lead to ongoing work.

---

We look forward to working with someone creative, reliable, and experienced in healthcare or product marketing.

by Christian B.

$216

A.I Voiceover Creator

Overview We are looking for a talented AI Voiceover Creator to join our team on a part-time, remote basis. This role offers flexible working hours within the weekdays, allowing you to apply your creative skills in producing high-quality AI-generated voiceovers. You will use your expertise to craft engaging voice content that enhances our brand communications and marketing efforts. Your Role - Use our pre-built custom AI voice (in MiniMax – NOT Eleven Labs) - Generate, pace, and clean voiceover from finalized scripts - Apply editing to fix tone, speed, pauses etc (regenerations for certain parts will be needed) - Deliver clean, finished MP3 voiceovers - 3–4 voiceovers per week (approximately 10 minutes each) Requirements - Familiarity with AI tools like MiniMax (or willingness to learn) - AI audio editing experience (generations, cleanup, pacing, pauses) - Fluent English comprehension and pacing sense - VERY detail-oriented and consistent - The voiceover must sound 100 percent real. This is not a press-a-button-and-it’s-done job. Regeneration and editing are required. Trial - This would start as a short trial to make sure the workflow and quality are a good fit on both sides before committing longer-term. Future Work - The channel will eventually expand into courses and digital products, which may create additional work opportunities over time. Application - Please tell us a bit about yourself, your experience with sound artistry, and why we should choose you for the job. - Please send an A.I Voiceover sample you made.

by Phillip T.

$131

AI-Powered Personalized Audiobook System for Shopify

I am looking for one highly experienced developer to build a complete, clean setup for a personalized audiobook system on Shopify.
Scope (high level, simple for an expert):
• Custom Shopify frontend & backend for an audiobook configurator
• Claude for logic & personalization
• DeepSeek for story generation
• ElevenLabs for voice output
• Custom SFX / background sound integration
• Automated delivery via Klaviyo (or similar)
Customers should be able to configure a story (names, style, voice, mood, length) and receive a fully generated audiobook automatically after checkout. This is not a research project – I’m looking for someone who has already built similar AI pipelines and can implement this efficiently and pragmatically.

by Aryan Y.

$70

AI Speech & Audio Processing Project

We are seeking an experienced freelancer for an AI Speech and Audio Processing project that showcases advanced skills in artificial intelligence, machine learning, and audio manipulation techniques. The ideal candidate will possess a strong understanding of speech recognition, natural language processing, and audio signal processing. The project aims to develop innovative solutions that enhance audio quality, improve speech synthesis, and optimize voice recognition systems. We invite proposals from professionals who are eager to demonstrate their expertise and contribute to this cutting-edge initiative.

by Beverley S.

Help at town hall in cuavas

We have to go to town hall in cuavas on Monday, we are at the second stage of visa application and need the staff to generate a QR code

by Meriem G.

$15

Recording

I need people who can record for 30 minutes in an application.

by Ragnar S.

$27/hr

Vapi AI voice calling - Refine spoken and written Swedish

I need help with advanced VAPI configuration for voice and Swedish transcription, enhanced by instructions for better understanding of spoken Swedish. I'm currently using an ElevenLabs voice, and I'm looking for someone experienced. Scope of work - Assist with advanced VAPI configuration for Swedish voice using ElevenLabs: better pronunciation of spoken Swedish and better understanding of Swedish so the live transcription gets it right. We are currently testing both Deepgram and Speechmatics. Additional information - You don’t need to understand Swedish, as we will workshop this together. Once we get the foundations right, we would like to do structured outputs. Ideal candidate - Experienced with advanced VAPI configuration and transcription. - Skilled in using ElevenLabs voice technology, preferably with pronunciation files, the API, and advanced transcription to structured output fields. The JSON files and other outputs seem to be formatted differently depending on the transcription model? - Candidate must provide a written scope of work/suggestions, deliverables, and expected outcomes, in English. - Candidate needs to be fluent in English. - Keep reasonable service levels and attend agreed meetings and feedback sessions. - Start as soon as possible. - Open to ongoing improvements as soon as we can go live and actually get business value. Transcription language: Swedish - We will help out with the Swedish and identify needed improvements.

by Shalu K.

$20/hr

Multi-lingual Conversation Audio Collection Project (Canada)

TELUS Digital is seeking native-speaking individuals to participate in a conversational data collection project. The task involves recording real-world, two-party conversations to support AI model training. Contributors will work in pairs and generate conversations that sound natural, following strict guidelines for audio quality, content, and file format. This is a remote project. Conversations must be recorded in the same room, using a single microphone. Each pair will cover general and medical-related topics (medical background is preferred but not required). The role of one speaker must remain consistent across all recordings for each topic. The partner or friend who performs the task with you will also need to register on our TELUS Digital AI Community Platform using the same link and submit a separate application. Estimated time to complete the task: each speaker, up to 2 hours of recorded speech; each pair, up to 4 hours combined. A minimum of 1 hour of recorded speech is required to qualify for payment (after QA check). Each participant may only complete the project once. Pay Rate: Canadian French - $35 per hour. This is an Independent Contractor opportunity. Payments will be made via Hyperwallet, where you can choose PayPal or Bank Transfer as the payment method. Key Requirements: French (Canada) native speaker; willing to record in a pair, in the same room, on a single device; adherence to specific audio specifications (WAV, 16kHz, mono); ability to follow guidelines to ensure conversations sound natural and are not read from a script; device with voice recording capability; stable Internet connection for uploading files. Register here (both partners need to submit applications separately): https://www.telusinternational.ai/cmp/contributor/jobs/available/127938 Selected participants will be contacted by TELUS Digital with detailed guidelines. If you have questions, we will be happy to assist you!
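
The audio specification named in the listing above (WAV, 16kHz, mono) can be checked locally before uploading. The following is a minimal sketch using Python's standard-library `wave` module; the function name `check_wav_spec` and its return shape are hypothetical and are not part of any TELUS Digital tooling or guideline.

```python
import wave


def check_wav_spec(path, expected_rate=16000, expected_channels=1):
    """Compare a WAV file's header against an expected sample rate and channel count.

    Returns (problems, duration_seconds); an empty problems list means the header matches.
    """
    problems = []
    # wave.open reads only uncompressed PCM WAV files and raises wave.Error otherwise.
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
        frames = wav.getnframes()
    if rate != expected_rate:
        problems.append(f"sample rate is {rate} Hz, expected {expected_rate} Hz")
    if channels != expected_channels:
        problems.append(f"{channels} channel(s), expected {expected_channels}")
    duration_seconds = frames / rate if rate else 0.0
    return problems, duration_seconds
```

Calling `check_wav_spec("session1.wav")` (a placeholder file name) returns any header mismatches together with the recording length in seconds, which also helps when tracking a minimum-duration requirement. It inspects the header only and says nothing about background noise or how natural the conversation sounds.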

by Raphael C.

$57

Dubbing Viral Videos with Artificial Intelligence

I am looking for a professional to dub viral videos using artificial intelligence technology, keeping the result natural and synchronized with the original audio.