Ooura vs FFTW3, Re: [linux-audio-dev] DRC 2.4.0

Tim Goetze tim at quitte.de
Sat Feb 28 15:47:52 UTC 2004


[Steve Harris]

>On Sat, Feb 28, 2004 at 02:43:18PM +0100, Tim Goetze wrote:
>> [Jussi Laako]
>>
>> >Here are some numbers from my benchmark... On 2.4 GHz Celeron.
>> >
>> >Ooura:
>> >59 us / 1024 point complex FFT (single), 16949 FFTs/s, 17.356 MS/s
>> >86 us / 1024 point complex FFT (double), 11628 FFTs/s, 11.907 MS/s
>> >32 us / 1024 point real FFT (single), 31250 FFTs/s, 32.000 MS/s
>> >365 us / 8192 point real FFT (single), 2740 FFTs/s, 22.444 MS/s
>> >8350 us / 65536 point real FFT (single), 120 FFTs/s, 7.849 MS/s
>> >
>> >FFTW3:
>> >43 us / 1024 point complex FFT (single), 23256 FFTs/s, 23.814 MS/s
>> >70 us / 1024 point complex FFT (double), 14286 FFTs/s, 14.629 MS/s
>> >42 us / 1024 point real FFT (single), 23810 FFTs/s, 24.381 MS/s
>> >490 us / 8192 point real FFT (single), 2041 FFTs/s, 16.718 MS/s
>> >30420 us / 65536 point real FFT (single), 33 FFTs/s, 2.154 MS/s
>>
>> seems Ooura is quicker than FFTW3 for real DFT, surprises me.
>
>Do you have the SIMD optimisations on, and have you tried the
>"halfcomplex" mode of FFTW, thats fine for audio, and saves some memory
>i/o.

i assume 'halfcomplex' is what is called 'real' in Jussi's benchmark
output. one thing that makes me wonder is that my copy of Ooura's
routines seems to work with double precision only.

btw, http://www.fftw.org/speed/ paints an altogether different
picture.

tim



More information about the Linux-audio-dev mailing list