Sinewave Synthesis: Tone Combinations

The picture above is a display of the parameters used by the Haskins Laboratories SineWave Synthesizer (SWS) in experiments that study the spectro-temporal aspects of speech. The horizontal axis shows time in milliseconds; the vertical axis shows frequency in Hz. The pattern graphs the frequency and amplitude variations of three sinusoids: the height of each tracing in the plane indicates its frequency, and its thickness indicates its amplitude.

The properties of tonal analogs of speech vary over time. Accordingly, the tones rise and fall in frequency and amplitude in imitation of the frequency and amplitude variations of vocal resonances over the course of an utterance. Note, however, that sinewave speech lacks the normal acoustic structure of a natural speech signal: there are no broadband formants, there is no regularly pulsed source, and the normal short-time "cues" found in speech signals are missing. What remains are just three (or sometimes four) rapidly changing pure tones.
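As an illustration of what "three rapidly changing pure tones" means in signal terms, the following is a minimal sketch of summing time-varying sinusoids. It is not the SWS implementation. The breakpoint values in `TONES`, the function names `interpolate` and `synthesize`, the linear interpolation between breakpoints, and the 8000 Hz sample rate are all assumptions made for the example, and the tracks do not correspond to any real utterance.

```python
import math


def interpolate(track, t_ms):
    """Linearly interpolate a list of (time_ms, value) breakpoints at t_ms."""
    if t_ms <= track[0][0]:
        return track[0][1]
    for (t0, v0), (t1, v1) in zip(track, track[1:]):
        if t_ms <= t1:
            return v0 + (v1 - v0) * (t_ms - t0) / (t1 - t0)
    return track[-1][1]  # hold the last value past the final breakpoint


def synthesize(tones, duration_ms, sample_rate=8000):
    """Sum sinusoids whose frequency and amplitude vary over time.

    tones: list of (freq_track, amp_track) pairs; each track is a list of
    (time_ms, value) breakpoints. Returns a list of float samples.
    """
    n = int(duration_ms * sample_rate / 1000)
    samples = [0.0] * n
    for freq_track, amp_track in tones:
        phase = 0.0  # accumulated phase keeps the tone continuous as it glides
        for i in range(n):
            t_ms = 1000.0 * i / sample_rate
            freq = interpolate(freq_track, t_ms)
            amp = interpolate(amp_track, t_ms)
            samples[i] += amp * math.sin(phase)
            phase += 2.0 * math.pi * freq / sample_rate
    return samples


# Hypothetical tracks: (frequency breakpoints in Hz, amplitude breakpoints).
TONES = [
    ([(0, 300), (100, 700), (200, 400)], [(0, 0.5), (200, 0.5)]),
    ([(0, 1000), (100, 1800), (200, 1200)], [(0, 0.3), (200, 0.3)]),
    ([(0, 2500), (200, 2700)], [(0, 0.1), (200, 0.1)]),
]

signal = synthesize(TONES, duration_ms=200)
```

Accumulating phase sample by sample, rather than computing `sin(2*pi*f*t)` directly, avoids discontinuities when the frequency changes. The resulting `signal` contains nothing but the three tones: no pulsed source and no broadband energy.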

Sinewave and Natural Speech

For most listeners, these signals are sufficient to convey a phonetic message: listeners hear them as speech and can identify the individual speech sounds. In this case, they usually hear the sentence “Where were you a year ago?”, sometimes after a little practice. Why? The pattern of variation imposed on the sinusoidal carriers provides sufficient information for the perception of phonetic attributes, despite the elimination of natural acoustic elements. This shows that perception is sensitive to information carried by patterns of stimulation, independent of the individual elements composing the pattern.
