Audio Joiner

Free online audio joiner that merges MP3, WAV, M4A, OGG, OPUS, and WEBM files into a single MP3 using your browser's built-in audio engine. No upload, no account required.

S. Siddiqui

Edited by

S. SiddiquiFounder & Editor-in-Chief
Sources:MDN Web Audio APIW3CUpdated Jun 2026

Audio Joiner

Merge multiple audio files into one MP3 — works with WhatsApp voice notes, podcasts, and more. All processing happens in your browser. No files are uploaded to any server.

🎵

Drop audio files here or click to browse

MP3, WAV, M4A, AAC, OGG, OPUS, WebM, FLAC

The Audio Joiner is a free browser-based tool that merges multiple audio files into a single MP3. It supports MP3, WAV, M4A, AAC, OGG, OPUS, WEBM, and FLAC, including WhatsApp voice notes from both iOS and Android devices. All processing happens inside your browser, and no files are uploaded to any server.

What Is Audio Joining?

Audio joining is the process of taking two or more separate audio files and combining them end-to-end into a single continuous file. The result plays back as one uninterrupted recording, exactly as if everything had been recorded in one sitting. The operation is sometimes called audio merging or audio concatenation depending on the software context, but the outcome is always the same: multiple clips become one.

People need this for a wide range of everyday tasks. WhatsApp splits long voice messages into multiple recordings when sent in parts, and recipients often want to combine them into a single file before forwarding or archiving. Podcasters frequently record individual segments or interviews separately and need to join them before editing in a digital audio workstation. Students who record long lectures sometimes end up with several files because their recording app auto-splits at a size limit. Music fans who download clips from different sources want to stitch them into a seamless listening file.

This tool handles joining entirely inside your browser using the Web Audio API, which is built into every modern browser. Your audio files are decoded locally by the browser itself, merged in memory, and then encoded to MP3 using a JavaScript encoder. No files are transmitted to any server at any point, which means your recordings remain completely private regardless of what they contain. You can find this tool alongside other browser-based utilities in the Audio Tools collection.

How to Use This Tool

  1. Add your audio files. Drag and drop your files onto the drop zone, or click it to open the file picker. You can add as many files as you like in one go, or add them one at a time. Supported formats include MP3, WAV, M4A, AAC, OGG, OPUS, WEBM, and FLAC.
  2. Review the file list. The files appear in the order you added them. Check the list carefully from top to bottom before proceeding, as the tool joins files in exactly the sequence shown.
  3. Reorder the files if needed. If the listed order does not match the intended playback order, use the up and down arrows next to each file to rearrange them. This step is important for multi-part voice note sequences where clips may have arrived out of order.
  4. Click Join Audio. Once the file list looks correct, press the Join Audio button to start the merge process. The button will show live status updates as each file is decoded and the tracks are merged in memory.
  5. Wait for processing. The tool decodes each file using the browser's built-in audio engine, then merges them and encodes the result to MP3. This is fast for short voice notes and may take a few seconds for longer recordings. No internet connection is needed once the page has loaded.
  6. Preview the result. An audio player appears so you can listen to the joined file before downloading. Use this to confirm the order and continuity are correct before saving.
  7. Download your file. Click the Download MP3 button to save the merged result to your device as joined-audio.mp3. The file is ready to share, forward, or import into any audio editor.

Why Use This Tool

Complete Privacy: Your Files Never Leave Your Device

Most online audio tools work by uploading your files to a remote server, processing them there, and sending the result back. This means your voice recordings, interviews, or personal messages travel across the internet and sit temporarily on someone else's hardware. This tool works differently: the AudioContext.decodeAudioData() method built into your browser decodes every file locally, and encoding happens in the same browser tab. There is nothing to log, nothing to store, and no third party involved at any point during the join.

Broad Format Support Including WhatsApp Voice Notes

iOS WhatsApp voice notes are saved in M4A/AAC format. Android WhatsApp voice notes use OGG/OPUS. Both are fully supported here alongside the more common MP3 and WAV formats, as well as WEBM (used by browser-based recorders) and FLAC (lossless audio). Because the browser itself decodes the audio before merging, any format your browser can play is a format this tool can join. The output is always a standard MP3 file that plays on any device.

No Installation, No Account, No Arbitrary File Size Limit

There is nothing to install and no account to create. The tool runs directly in your browser tab and starts immediately with no loading delay. There is no enforced file size cap. The practical limit is determined by how much RAM your browser can allocate, which on a modern laptop or desktop is typically several gigabytes. You can comfortably join an hour or more of audio without hitting an artificial restriction.

Works Offline After the Page Loads

Once the tool page has fully loaded in your browser, no further internet connection is required to process your audio. All decoding, merging, and encoding runs inside the browser tab using locally loaded JavaScript. This makes it reliable on slow or intermittent connections, and suitable for use on mobile data where you do not want large file uploads consuming your allowance. It also means the tool continues to work even if your internet drops partway through a session, because all the heavy work is already running locally.

Consistent Output Format

Regardless of the input formats you combine, the output is always a standard MP3 file. This predictability matters when you are delivering files to a client, uploading to a platform with format requirements, or sharing through a messaging app that may not support every audio format. You do not have to think about whether your output will be compatible: MP3 plays on every device, every media player, and every browser without any additional conversion step.

Real-World Use Cases

Combining WhatsApp Voice Notes from a Business Call

A project manager received four separate WhatsApp voice messages from a client during a call recap. The client had hit the recording limit each time and sent multiple clips in sequence. The manager needed to forward one coherent summary to a colleague who had missed the call. By dropping all four OGG files into the joiner, reordering the third and fourth clips which had arrived out of sequence, and clicking Join Audio, she produced a single continuous MP3 that she shared directly in the team channel. No audio editor was needed and the task was complete in under a minute.

Assembling a Podcast Episode from Separate Recordings

A freelance podcast producer records interviews and solo segments on separate days, each saved as an individual WAV file. Before sending to a client for review, he needs a single combined file to verify the running order and approximate duration. The Audio Joiner lets him drop all five segments in, drag them into broadcast order, and download a preview MP3 in about thirty seconds. He uses the preview player to confirm the transitions sound correct before committing to a full edit in his digital audio workstation.

Merging a Split Lecture Recording

A university student recorded a three-hour seminar on her phone, but the recording app automatically split the file every hour, producing three separate M4A recordings. She needed to submit one continuous audio file to her study group's shared drive. Joining all three files using this tool took less than a minute and produced a single MP3 that retained all three hours of audio in the correct sequence, without requiring any specialist software on her laptop.

Building a Continuous Music Mix

A DJ preparing a promotional mix downloaded individual track exports as MP3 files from his digital audio workstation. To share a preview with a venue promoter, he needed one continuous file rather than a folder of individual tracks. He joined eight tracks in sequence using the Audio Joiner and sent the resulting MP3 as a single download link. The join preserved the transitions he had already edited into the individual files, and the whole process took less than two minutes.

Common Mistakes and Troubleshooting

Adding Files in the Wrong Order

The tool joins files exactly as listed. If your second clip was recorded before your first clip, you need to use the up and down arrow buttons to correct the order before clicking Join Audio. Always review the file list from top to bottom before starting the join. If you download the result and the order is wrong, click Start Over, rearrange the files, and re-join. This is the single most common mistake and takes only a few seconds to correct.

Joining Files with Very Different Volume Levels

If one voice note was recorded close to a microphone and another was recorded from a distance, the volume difference will be preserved in the joined output. The tool does not normalise audio levels between clips. If the volume jump between clips is noticeable in the preview, you will need to adjust the levels in an audio editor before or after joining. For WhatsApp voice notes recorded under similar conditions, this is rarely a problem. A list of browser-based options for adjusting audio levels is available in the Audio Utility Tools section.

Expecting Instant Results with Very Large Files

Decoding and encoding audio takes a proportional amount of time. Two short voice notes of thirty seconds each will join in a couple of seconds. An hour of WAV audio might take fifteen to thirty seconds because the browser has to decode the lossless data and then re-encode the merged result to MP3. If the button appears to do nothing, check whether the status message is updating, as processing is still running in the background. Do not close the tab or navigate away until the download button appears.

Using a Browser That Does Not Support the Web Audio API

This tool requires a modern browser that supports the Web Audio API, which includes all current versions of Chrome, Edge, Firefox, and Safari released after 2018. Very old browser versions or certain privacy-hardened browsers with JavaScript disabled will not work. If Join Audio does not respond at all, try opening the tool in a different browser or update your current browser to the latest version. The Web Audio API has been a recognised standard in all major browsers for several years, so any up-to-date browser will work correctly.

Downloading a File That Cuts Off Early

If the downloaded MP3 is shorter than expected, the most likely cause is that the browser ran low on memory during the encode step and wrote an incomplete file. This can happen with very long recordings on devices with limited RAM. To resolve it, close other browser tabs to free up memory, reload the tool, and re-join the files. If the problem persists, try splitting your files into two smaller batches and joining each batch separately before using the tool again on the two resulting files.

Last reviewed: June 7, 2026
Founder's Real-World Experience
S. Siddiqui

S. Siddiqui

Founder & Editor-in-Chief, YourToolsBase

How I combined 4 fragmented WhatsApp voice notes into one file in under a minute

A supplier I work with regularly sends voice notes instead of written messages, sometimes splitting a single update across three or four recordings because he records on the go. Last month he sent four voice notes explaining a delivery delay: two recorded while walking, one from a car, one as a quick follow-up. I needed to share the full explanation with my logistics manager, who was not on the original WhatsApp thread.

Forwarding four separate voice notes is awkward. The listener loses context between clips, especially if they play them out of order. I wanted one clean file I could send as a single audio attachment or upload to our shared drive. My first instinct was to find a desktop app, but every option I found either required installation, a subscription, or uploaded the files to a server. None of those options were acceptable for a private supplier conversation.

The Audio Joiner on YourToolsBase solved it in under 30 seconds. I dragged all four OGG files (the format WhatsApp uses on Android) directly into the tool, confirmed the order using the up and down arrows, and clicked Join Audio. The tool decoded them entirely in my browser using the Web Audio API, merged them into one continuous track, and encoded the result as an MP3 I could download immediately. Nothing left my device. The whole process took less time than it would have taken to explain the problem to a colleague.

The output was clean: no gaps, no clicks between clips, and the volume was consistent across all four recordings because they had all been captured at similar levels. I now keep the Audio Joiner bookmarked for exactly this scenario.

4 voice notes → 1 MP3 in 30 secondsZero files uploaded to any serverOGG WhatsApp format supported natively
Also used alongside: Audio Splitter

Frequently Asked Questions

What audio formats does this tool support?
The tool supports MP3, WAV, M4A, AAC, OGG, OPUS, WEBM, and FLAC. This covers the most common audio formats including iOS WhatsApp voice notes (M4A) and Android WhatsApp voice notes (OGG/OPUS). You can mix different formats in the same join and the output will always be a standard MP3 file.
Is my audio uploaded to a server?
No. Your audio files never leave your device. The tool decodes audio using the Web Audio API built into your browser and encodes the result to MP3 entirely inside the browser tab. Nothing is transmitted to any server, so your recordings remain completely private regardless of what they contain.
Why does the tool take a few seconds to process?
The browser needs to decode each input file from its compressed format into raw audio data, merge all the tracks in memory, and then encode the result to MP3. This takes time proportional to the total length of the audio. Short voice notes are processed in seconds. An hour of audio may take fifteen to thirty seconds.
Can I join WhatsApp voice notes?
Yes, this is one of the primary use cases for this tool. iOS WhatsApp voice notes are in M4A/AAC format and Android WhatsApp voice notes are in OGG/OPUS format. Both are fully supported. Save the voice notes from WhatsApp to your device, add them to the joiner in the order you want, and download the combined MP3.
Is there a file size limit?
There is no enforced file size limit in the tool itself. The practical limit is determined by how much RAM your browser can allocate for the operation. On a modern laptop or desktop, you can comfortably join an hour or more of audio. Very large files on low-memory devices may cause the browser tab to become slow.
Can I change the order of the files before joining?
Yes. After adding your files, they appear as a numbered list in the order they were added. Each file has up and down arrow buttons that let you move it to the correct position. Always review the order before clicking Join Audio, as the tool concatenates files exactly as listed.
What output format does the tool produce?
The output is always an MP3 file named joined-audio.mp3. MP3 is universally supported across all devices, media players, browsers, and messaging apps. If you need a different output format, you can convert the resulting MP3 using a separate audio converter tool after downloading.
Can I join more than 2 files?
Yes, you can join as many files as you like in a single operation. There is no upper limit on the number of files. Add all the files you need, arrange them in the correct order using the arrow buttons, and click Join Audio to merge them all into one continuous MP3.
Will the audio quality be reduced?
The output is encoded as a 128 kbps stereo MP3, which is good quality for voice recordings and general listening. Lossless source formats like WAV and FLAC will be re-encoded to MP3 in the output, which involves a quality trade-off. For most voice notes and podcasts the difference is not noticeable. If fidelity is critical, keep your original source files separately.
What browsers does this work in?
The tool works in any modern browser that supports the Web Audio API, which includes current versions of Chrome, Edge, Firefox, and Safari. Browser versions released before 2018 may not support the Web Audio API fully. If the Join Audio button does not respond, try updating your browser to the latest version.

Rate This Tool

Was this tool helpful?

Be the first to rate this tool

About the Author

S. Siddiqui

S. Siddiqui

Founder & Editor-in-Chief

LinkedIn Profile

S. Siddiqui is the founder and editor-in-chief of YourToolsBase, overseeing all content, tool accuracy, and editorial standards.

View full profile

Related Tools

Authoritative Sources

Formulas and data in this tool are based on guidelines from the above sources.