—> To Chapter 4

Some Alternatives to the FFT

We know that in previous sections we've made it sound (oops, bad pun) like the FFT is the "only game in town." But that's not entirely true. Historically,the FFT was the first, most important and most widely used algorithm for frequency domain analysis. Musicians, especially, have taken a liking to it because there are some well-known references on its implementation, and computer musicians especially have gained a lot of experience in its nuances. It's fairly simple to program, and there are a lot of existing software programs to use if you get lazy. Also, it works pretty well (emphasis on the word "pretty").

The FFT breaks a sound up into sinusoids, but there may be reasons that you'd like to break a sound into other sorts of basic sounds. These alternate transforms (a transform is just something that takes a list of the sample values and turns them into a new list of numbers that describes a possible new way to add up different basic sounds — whatever they are) can work better for some purposes than the FFT.

For example, wavelet transforms (sometimes called dynamic base transforms) can modify the resolution for different frequency ranges, unlike the FFT, which has a constant bandwidth (and thus is, by definition, less sensitive to lower frequencies — the important ones— than to higher ones!). Wavelets also use a variety of different analysis waveforms (as opposed to the FFT which only uses sinusoids) to get a better representation of a signal. Unfortunately, wavelet transforms are still a bit uncommon in computer software, since they are in general, harder to implement than FFTs. One of the big problems is deciding which wavelet to use, and in what frequency bandwidth. Often, that decision making process can be more important (and time-consuming) than the actual transform!

Wavelets are a pretty hot topic though, so we’ll probably be seeing some wavelet-based techniques emerge in the near future.

Figure .x Some common wavelet analysis waveforms. Each of these is a prototype, or "mother wavelet", which provides a basic template for all the related analysis waveforms.

A wavelet analysis will start with one of these waveforms and break up the sound into a sum of translated and stretched (dilated) versions of these.


McAulay-Quatieri (MQ) Analysis

Another interesting approach to a more organized, information-rich set of transforms is the extended McAulay-Quatieri (MQ) analysis algorithm. This is used in the popular Macintosh program, Lemur Pro, written by the brilliant computer music researcher Kelly Fitz.

MQ analysis works a lot like normal FFTs — it tries to figure out how a sound can be recreated using a number of sinewaves. However, Lemur Pro is an enhancement of more traditional FFT-based software in that it uses the resulting sine waves to extract frequency tracks from a sound. These tracks attempt to follow particular components of the sound as they change over time. That is, MQ not only represents the amplitude trajectories of partials, but tries to describe the ways that spectral components change in frequency over time as well. This is a fundamentally different way of describing spectra, and a powerful one.

This is an excellent idea, as it allows the analyst a much higher level of data representation, and represents a more perceptually signifcant view of the sound. A number of more advanced FFT-based programs also implement some form of frequency tracking in order to improve the results of the analysis, similar to that of MQ analysis.

For example, even though a sinewave might sweep from 1kHz to 2kHz, it would still be contained in one MQ track (rather than fade in and out of FFT bins, forming what are called lobes).

This type of analysis is often more useful for sound work than regular FFT analysis, since it has sonic cognition ideas built into it. For example, the ear tends to follow frequencies in time, much like an MQ analysis, and loud frequencies tend to mask, in our perception, neighboring quieter frequencies (as do MQ analyses). These kinds of sophisticated analysis techniques, where frequency and amplitude are isolated, suggest purely graphical manipulations of sounds, and we'll show some examples of that in Section 5.7.

Figure .x Frequency tracks in Lemur Pro.The information looks much like one of our old sonograms, but in fact it is the result of a great deal of high-level processing of FFT-type data.

Figure .x MQ plot of 0'00" - 5'49" of Movement II: from the electro-acoustic work Sud by the great French composer and computer music pioneer Jean-Claude Risset . Photo by David Hirst, an innovative composer and computer music researcher from Australia.

—> To Chapter 4

<— Back to 3.6

<— To Table of Contents