On Sat, Jan 22, 2005 at 03:17:28PM +0100, Christian Schoenebeck wrote:
I think for such an important piece like freeverb it
might be worth to add
architecture dependant assembly optimizations. E.g. with SSE(1) you simply
have to change the SSE control register to get rid of the denormal problem.
With that no denormal fixes for individual sample points are necessary
anymore which definitely pays off in performance.
Agreed, but I'm not sure how doing that will affect the host - ie does
-mfpumath=SSE (or whatever the option is) always leave the FPU stack in a
sane state when control goes back to the host? I dont think it would
normally return the FPU to 387 mode, which might do bad things to the
host occasionally.
OTOH it might be fine, I dont understand the interaction with the 387 and
SSE.
- Steve