Every FluentPlay game shares the same browser-side audio engine. Your microphone feeds a real-time analysis pipeline that classifies every audio frame, about 60 per second, and emits the results as a feature stream. That stream feeds the PAD scorer. No audio is recorded. Nothing leaves the browser except one cloud call for phoneme-level accuracy.
Root-mean-square energy of the audio frame. Tracks how much acoustic energy the speaker is producing. Drops during blocks, spikes during forced articulation. A stable RMS across syllables is a fluency signal.
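As a rough illustration of the measure described above, RMS energy for one frame is the square root of the mean of the squared samples. The sketch below assumes the frame arrives as a `Float32Array` of samples in the range -1 to 1, which is how the Web Audio API exposes time-domain data. The function name `rmsEnergy` is hypothetical and is not taken from the FluentPlay codebase.

```typescript
// Hypothetical helper (not FluentPlay's actual code): RMS energy of one
// audio frame, assuming samples are floats in [-1, 1].
function rmsEnergy(frame: Float32Array): number {
  // An empty frame carries no energy; this also avoids dividing by zero.
  if (frame.length === 0) return 0;

  let sumSquares = 0;
  for (let i = 0; i < frame.length; i++) {
    sumSquares += frame[i] * frame[i];
  }
  // Mean of the squares, then square root.
  return Math.sqrt(sumSquares / frame.length);
}
```

A frame of silence yields 0, and a frame that alternates between 0.5 and -0.5 yields 0.5.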
Cumulative count of voiced onsets detected in the session. Each time the pipeline sees a transition from silent or building to voiced, the counter increments. Repeated onsets on the same syllable window signal repetition-type disfluency.
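The counting rule above can be sketched as a pass over per-frame state labels. This is a minimal sketch under stated assumptions: the three state names come from the description, but the `FrameState` type, the `countVoicedOnsets` function, and the choice to treat the session as starting in silence are all illustrative.

```typescript
// Assumed per-frame labels, named after the states the text mentions.
type FrameState = "silent" | "building" | "voiced";

// Hypothetical helper: count transitions from a non-voiced frame
// (silent or building) into a voiced frame.
function countVoicedOnsets(states: FrameState[]): number {
  let onsets = 0;
  // Assumption: the session starts in silence, so a voiced first frame
  // counts as an onset.
  let previous: FrameState = "silent";
  for (const state of states) {
    if (state === "voiced" && previous !== "voiced") {
      onsets++;
    }
    previous = state;
  }
  return onsets;
}
```

For the sequence silent, building, voiced, voiced, silent, voiced, the function counts two onsets. Detecting repetition on the same syllable window would need syllable boundaries as well, which this sketch does not model.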
Cumulative time spent in the voiced state, in milliseconds. The ratio of voiced to total time tracks productive speech output. Long stretches of building with no voiced frames indicate motor-planning stalls, the stage that precedes a block.
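One way to picture the voiced-time ratio is to accumulate frame durations by state. The sketch below assumes a fixed frame duration passed in by the caller (about 16.7 ms at roughly 60 frames per second). The `FrameState` type, the `VoicedStats` shape, and the `voicedStats` function are hypothetical names used only for illustration.

```typescript
// Assumed per-frame labels, repeated here so the example stands alone.
type FrameState = "silent" | "building" | "voiced";

interface VoicedStats {
  voicedMs: number;    // cumulative time in the voiced state
  totalMs: number;     // total session time covered by the frames
  voicedRatio: number; // voicedMs / totalMs, or 0 for an empty session
}

// Hypothetical helper: accumulate voiced time from per-frame labels,
// assuming every frame lasts frameMs milliseconds.
function voicedStats(states: FrameState[], frameMs: number): VoicedStats {
  let voicedFrames = 0;
  for (const state of states) {
    if (state === "voiced") voicedFrames++;
  }
  const voicedMs = voicedFrames * frameMs;
  const totalMs = states.length * frameMs;
  return {
    voicedMs,
    totalMs,
    voicedRatio: totalMs > 0 ? voicedMs / totalMs : 0,
  };
}
```

With four 10 ms frames, two of them voiced, the result is 20 ms voiced out of 40 ms total, a ratio of 0.5.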
Binary flag that fires when the pipeline detects a sustained building state exceeding a duration threshold. A block is not a silence — it is active motor effort without articulation. When the flag fires, the PAD scorer weights that syllable window accordingly.
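The flag logic described above amounts to timing each uninterrupted run of building frames and comparing it with a threshold. The following is a sketch only: the threshold value is not given in the text, so `thresholdMs` is a caller-supplied assumption, and `hasBlock` is a hypothetical name.

```typescript
// Assumed per-frame labels, repeated here so the example stands alone.
type FrameState = "silent" | "building" | "voiced";

// Hypothetical helper: returns true when any uninterrupted run of
// "building" frames lasts longer than thresholdMs.
// Assumptions: fixed frame duration (frameMs); the real threshold is unknown.
function hasBlock(
  states: FrameState[],
  frameMs: number,
  thresholdMs: number
): boolean {
  let runMs = 0;
  for (const state of states) {
    // Extend the current building run, or reset it on any other state.
    runMs = state === "building" ? runMs + frameMs : 0;
    if (runMs > thresholdMs) return true;
  }
  return false;
}
```

Silence never raises the flag in this sketch, which matches the distinction drawn above: only sustained building, meaning effort without articulation, counts as a block.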
Runs in any modern browser. No install, no plugin, no app store. Single-file HTML deployments via Netlify Drop.
Audio is analyzed in real time and discarded frame by frame. Nothing is stored. Nothing leaves the device except one cloud call for phoneme-level pronunciation assessment.
From speech onset to scored feature output in under 100 milliseconds. Frame rate of ~60 fps. Fast enough for real-time visual feedback during practice.
The audio pipeline and the PAD scorer are architecturally independent. License the pipeline, the scoring framework, or the full integrated stack.
The audio pipeline captures pre-articulatory timing instability in real time. PAD scores it per syllable. Every game in the FluentPlay library ships with both layers hardwired in. The pipeline and the scoring framework are architecturally independent — licensable separately or as an integrated stack. Patent pending under U.S. Provisional 64/016,001.
Whether you're evaluating the audio pipeline, the PAD scoring framework, or the full integrated stack — describe your use case and we'll schedule a founder call.