
JavaScript: Browser-based Audio Analysis Using AnalyserNode

Greetings. 🎩

I will demonstrate how to use AnalyserNode to compute the peaks and other metrics from an audio source. This demo only outputs the numbers, rather than an elaborate animation.

In this demo, we can choose between a sample audio file (hosted on cloud storage) and a local audio file.

Demonstration

An amazing user interface is ready to interface.


Application Flow

The overall flow of the application is as follows (short code sketches of the main steps come right after the list):

  1. User loads the page

    DOMContentLoaded

    ⬇️

    applicationUI() runs

    ⬇️

    Initial UI and variables are prepared.

  2. User selects a source (sample URL or local file)
    • Sample audio chosen

      Button clicked

      ⬇️

      stopStream() clears old state

      ⬇️

      File fetched from the network → Blob → Blob URL

      ⬇️

      setupAudioAnalysis(blobURL) builds a fresh audio element.


      Make sure the hosted file is served with a cross-origin response header, e.g.:

      Access-Control-Allow-Origin: *

      Or use your own CORS pattern that allows fetching from the URL where the application resides.

      This is needed because a browser fetch requires an explicit CORS policy on the resource being fetched; the response is then converted into a Blob URL for the audio src and analysed via createMediaElementSource.


      The demonstration accepts .mp3 files, but it also works with .mp4 (if you insist 😂).

    • Local file chosen

      File input changes

      ⬇️

      stopStream() clears old audio, context, analyser, RAF

      ⬇️

      File converted to Blob URL

      ⬇️

      setupAudioAnalysis(blobURL) builds a fresh audio element.

  3. Setting up the audio element

    setupAudioAnalysis(url)

    ⬇️

    Clears output/UI

    ⬇️

    Creates <audio controls> and a note ➡️ Sets src, sets crossOrigin = "anonymous"

    ⬇️

    Appends it into the DOM

    ⬇️

    Browser loads metadata

    ⬇️

    onloadedmetadata fires

    ⬇️

    queueMicrotask(...) builds the Web Audio graph.

  4. Building the Web Audio graph

    Inside the microtask:

    Create AudioContext

    ⬇️

    Create AnalyserNode (FFT size set)

    ⬇️

    Create MediaElementSource from the audio

    ⬇️

    Connect:

    sourceNode → analyser → destination (see the sketches after this list)

  5. User presses Play

    audioEl.onplay runs

    ⬇️

    audioContext.resume() summons the audio graph

    ⬇️

    Start the RAF loop:

    requestAnimationFrame(getAudioLevels)

  6. Continuous real-time analysis

    getAudioLevels() → every RAF tick

    ⬇️

    Reads time-domain data (waveform)

    ⬇️

    Reads frequency-domain data (FFT bins)

    ⬇️

    Computes:

    • Peak amplitude
    • RMS amplitude
    • Zero-crossing rate
    • Peak-hold
    • Spectral centroid
    • 10-band EQ levels (log-spaced)
    ⬇️

    Formats results into HTML

    ⬇️

    Displays live output

    ⬇️

    Schedules next RAF tick.

  7. User switches audio or presses "C L E A R"

    stopStream() fires

    ⬇️

    Cancels RAF

    ⬇️

    Stops input stream if any

    ⬇️

    Disconnects nodes

    ⬇️

    Closes AudioContext

    ⬇️

    Removes audio element

    ⬇️

    Resets UI

    ⬇️

    System returns to idle state.
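
Here's a minimal sketch of steps 2–4, stripped of the demo's UI plumbing. The sample URL is a placeholder, not the demo's actual file; the real page wires the same calls into its buttons:

let audioEl, audioContext, analyser, sourceNode, rafId;

// Step 2: fetch the hosted sample and convert it to a Blob URL.
async function loadSample() {
  stopStream(); // clear old audio, context, analyser, RAF (step 7, below)
  const response = await fetch("https://example.com/sample.mp3"); // placeholder URL
  const blob = await response.blob();
  setupAudioAnalysis(URL.createObjectURL(blob));
}

// Local file chosen: no network fetch needed, a File is already a Blob.
function onFilePicked(event) {
  stopStream();
  const file = event.target.files[0];
  if (file) setupAudioAnalysis(URL.createObjectURL(file));
}

// Steps 3–4: fresh <audio> element, then the Web Audio graph.
function setupAudioAnalysis(url) {
  audioEl = document.createElement("audio");
  audioEl.controls = true;
  audioEl.crossOrigin = "anonymous"; // needed for the hosted (CORS) source
  audioEl.src = url;
  document.body.appendChild(audioEl);

  audioEl.onloadedmetadata = () => {
    // Deferred to a microtask; see the queueMicrotask note below.
    queueMicrotask(() => {
      audioContext = new AudioContext();
      analyser = audioContext.createAnalyser();
      analyser.fftSize = 2048; // gives 1024 frequency bins
      sourceNode = audioContext.createMediaElementSource(audioEl);
      sourceNode.connect(analyser);
      analyser.connect(audioContext.destination); // keep it audible
    });
  };
}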
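
And steps 5–6, the per-frame loop. Only peak, RMS, and zero-crossing rate are computed here; the spectral centroid and the 10-band levels read the same freqData buffer (the band logic is sketched further down). The "output" element id is an assumption for this example:

// Step 5: in the demo this is attached inside setupAudioAnalysis.
function attachPlayHandler() {
  audioEl.onplay = () => {
    audioContext.resume(); // contexts start suspended until a user gesture
    rafId = requestAnimationFrame(getAudioLevels);
  };
}

// Step 6: one reading per animation frame.
function getAudioLevels() {
  const timeData = new Uint8Array(analyser.fftSize);
  const freqData = new Uint8Array(analyser.frequencyBinCount);
  analyser.getByteTimeDomainData(timeData); // waveform, centred on 128
  analyser.getByteFrequencyData(freqData);  // FFT magnitudes, 0–255 per bin

  let peak = 0, sumSquares = 0, crossings = 0;
  for (let i = 0; i < timeData.length; i++) {
    const v = (timeData[i] - 128) / 128; // normalise to -1..1
    peak = Math.max(peak, Math.abs(v));
    sumSquares += v * v;
    // A sign change around the midline is one zero crossing.
    if (i > 0 && (timeData[i - 1] - 128) * (timeData[i] - 128) < 0) crossings++;
  }
  const rms = Math.sqrt(sumSquares / timeData.length);

  document.getElementById("output").textContent =
    `peak ${peak.toFixed(3)} · rms ${rms.toFixed(3)} · zero crossings ${crossings}`;

  rafId = requestAnimationFrame(getAudioLevels); // schedule the next tick
}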
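
Finally, step 7, the stopStream() teardown in the same spirit:

// Step 7: tear everything down so a new source starts clean.
function stopStream() {
  if (rafId) cancelAnimationFrame(rafId);
  rafId = null;
  // A getUserMedia input stream, if one were in use, would be stopped here too.
  if (sourceNode) sourceNode.disconnect();
  if (analyser) analyser.disconnect();
  if (audioContext && audioContext.state !== "closed") audioContext.close();
  if (audioEl) {
    audioEl.pause();
    audioEl.remove();
  }
  audioEl = audioContext = analyser = sourceNode = null;
}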

Speaking of a long scroll, here's the condensed version.

Simplified Flow

Load page → pick a source → build the <audio> element → build the Web Audio graph → press play → analyse every frame → clear.


queueMicrotask

I use queueMicrotask to wrap the AudioContext creation inside the onloadedmetadata handler (as in the sketch above), to avoid Chrome's "long task violation" complaint.

queueMicrotask is quite new (2020), so I employ it partly to show you the queue method. But since AudioContext boot is synchronous and heavy, Chrome will still show the "long task violation" warning.

I tried it with setTimeout, requestAnimationFrame, and a bare Promise, and even moved the creation into onplay. Still... 😂

The heavy work is not in the handler at all; it's inside Chrome's internal AudioContext boot. AudioContext startup is synchronous, so it blocks the main thread. So. 🤷‍♂️

That warning only happens the very first time an AudioContext is created. After that, even across browser reloads, there should be no warning. Interesting, that. It means the real-time audio thread stays alive for some time. Reproducing the "very first time" (first boot) is a bit arbitrary at the moment; it's tied to when Chrome suspends its audio engine (the trigger) to save energy and CPU.


HTML of the Demonstration (Style-Element-Script)

This has more than 350 lines. 😶

You can use the button below to open it in a new tab:

Or skim the code below:


The 10-band Logic

The bands:

31, 62, 125, 250, 500, 1k, 2k, 4k, 8k, 16k (Hz)

It's the common web-EQ layout.

Each band:

  • Has a centre frequency (31 to 16k).
  • Uses a 1-octave window around each centre:
    • From centre ÷ √2.
    • To centre × √2.
  • Averages all FFT bins falling inside that range.
  • Outputs the strength of the band as an integer amplitude (0–255).

This is the same approach used by browser EQ visualisers and light JS music visualisers.
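
A sketch of that band logic, reusing the analyser and audioContext from the flow sketches above; the bin width comes from the context's sample rate and the analyser's FFT size:

const BAND_CENTRES = [31, 62, 125, 250, 500, 1000, 2000, 4000, 8000, 16000]; // Hz

function getBandLevels(analyser, audioContext) {
  const freqData = new Uint8Array(analyser.frequencyBinCount);
  analyser.getByteFrequencyData(freqData);

  // Each FFT bin spans sampleRate / fftSize Hz.
  const binWidth = audioContext.sampleRate / analyser.fftSize;

  return BAND_CENTRES.map((centre) => {
    // 1-octave window: centre ÷ √2 up to centre × √2.
    const lo = Math.max(0, Math.floor(centre / Math.SQRT2 / binWidth));
    const hi = Math.min(freqData.length - 1, Math.ceil(centre * Math.SQRT2 / binWidth));
    if (hi < lo) return 0; // band sits above Nyquist at this sample rate
    let sum = 0;
    for (let i = lo; i <= hi; i++) sum += freqData[i];
    return Math.round(sum / (hi - lo + 1)); // average magnitude, 0–255
  });
}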


Microphone

We can also use the microphone as the audio source, though it's not covered in the application above.
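
For reference, a hedged sketch of what that would look like. It reuses the globals and the getAudioLevels loop from the sketches above, swapping createMediaElementSource for createMediaStreamSource:

async function useMicrophone() {
  stopStream(); // same teardown as before
  const stream = await navigator.mediaDevices.getUserMedia({ audio: true });
  audioContext = new AudioContext();
  analyser = audioContext.createAnalyser();
  analyser.fftSize = 2048;
  sourceNode = audioContext.createMediaStreamSource(stream);
  sourceNode.connect(analyser);
  // Deliberately not connected to destination: live mic → speakers = feedback.
  rafId = requestAnimationFrame(getAudioLevels);
}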


Discrete Fourier Transform

The Fast Fourier Transform (FFT) itself is an algorithm — an optimisation — for computing the Discrete Fourier Transform (DFT) efficiently. So the discrete formulation starts with the DFT itself.

The Discrete Fourier Transform (DFT) is a mathematical method that converts a finite sequence of equally spaced samples from the time domain into a set of complex numbers representing the signal's amplitude and phase at specific frequencies in the frequency domain. It is essential in computing because it enables the conversion of time-based digital signals — like audio, images, or sensor data — into their frequency components, allowing computers to analyse, filter, compress, and visualise these signals more effectively by revealing patterns and structures that are not immediately visible in the time domain.
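
Written out: for N equally spaced samples x_0, x_1, …, x_(N−1), the DFT produces N complex values

  X_k = Σ_(n=0…N−1) x_n · e^(−2πikn/N),   k = 0, 1, …, N−1

where the magnitude of X_k gives the amplitude, and its angle the phase, of the k-th frequency component.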

There's also the continuous counterpart, simply called the Fourier Transform (FT), used in pure mathematics and physics to analyse continuous-time signals (theoretical waveforms, electromagnetic fields, and so on). The DFT is there to approximate the FT when things are sampled and digitised, within the digital realm.
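
To make that concrete, here's a naive O(N²) DFT in JavaScript. It computes exactly what an FFT computes, minus the cleverness; this is purely illustrative, not how AnalyserNode does it internally:

// Naive DFT: magnitude of each frequency bin for a real-valued signal.
// O(N²); the FFT gets the same result in O(N log N).
function dftMagnitudes(samples) {
  const N = samples.length;
  const magnitudes = new Array(N);
  for (let k = 0; k < N; k++) {
    let re = 0, im = 0;
    for (let n = 0; n < N; n++) {
      // e^(−2πikn/N) = cos(−2πkn/N) + i·sin(−2πkn/N)
      const angle = (-2 * Math.PI * k * n) / N;
      re += samples[n] * Math.cos(angle);
      im += samples[n] * Math.sin(angle);
    }
    magnitudes[k] = Math.hypot(re, im); // |X_k|
  }
  return magnitudes;
}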

I remember back in college, in the "Signals and Systems" class, because the professor was quite captivating, I got a "B". Not because I was any good at signal analysis, but because in my thoughts:

🧠 Interesting.

And I wrote plenty of swirling shenanigans using the formulation. More pages with ink, convincing. Ended with a double-line strike to underscore the final answers. I guess the TA (Teaching Assistant) was like, Oh, by golly. This lad. At least he wrote e^(−2πift). And bloody [S_H]_(u,r) = e^(−ikN)?! Isn't it [F_N]_(k,n) = e^(−2πikn/N)? 🤔 It does look like it. All right, let's count the pages.


Fourier

The name refers to Jean-Baptiste Joseph Fourier (1768–1830), a French mathematician and physicist who developed Fourier series and the Fourier transform. He served under Napoleon in Egypt, collecting scientific data and dabbling in administrative roles.

Fourier saw order where others saw noise.

And gave us the mathematical key to unlock the hidden frequencies of the universe.

He's the bloke who quite literally reshaped how we see waves.


Thanks for your visit. All the best. 🎩
