Malte Steiner wrote:
typedef float v4sf __attribute__ ((vector_size(16)));
These attributes are mostly useful to ensure alignment etc. properties
of variables in stack.
On AMD 64bit Turion (single core) on 64 Studio in
64bit mode this doesnt
improve performance at all, actually it even get worse. Is GCC that good
at optimizing on its own?
Currently compilers are not very good at vectorization of all algorithm
types. Still, best results can be reached with handwritten assembler...
- Jussi