Re: [LAD] GCC Vector extensions

20 Jul 2011

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On 07/20/2011 10:27 AM, Maurizio De Cecco wrote:
...
  I am playing around with GCC and Clang vector
extensions, on Linux and
 Mac OS X, and i am getting some strange behaviour.
 I am working on jMax Phoenix, and its dsp engine, in its current state,
 is very memory bound; it is based on the aggregation of very small
 granularity operations, like vector sum or multiply, each of them
 executed independently from and to memory.
 I tried to implements all this 'primitive' operations using the vector
 types.
 On clang/MacOSX i get an impressive improvement in performance,
 around 4x on the operations, even just using the vector types for
 copying data; my impression is that the compiler use some kind of vector
 load/store instruction that properly use the available memory bandwidth,
 but unfortunately i do not know more about the x86 architecture.
 On gcc/Linux, (gcc 4.5.2) the same code produce a *slow down* of around
 2.5x.
 Well, anybody have an idea of why ?
 I am actually running linux (Ubuntu 11.04) under a VMWare virtual
 machine, i do not know is this may have any implications. 
Maybe. A better comparison would be: clang/Linux vs. gcc/Linux and
clang/MacOSX vs gcc/MacOSX compiled binaries.
Also as Dan already pointed out: gcc has a whole lot of optimization
flags which are not enabled by default. try '-O3 -msse2 -ffast-math'.
 '-ftree-vectorizer-verbose=2' is handy while optimizing code.
have fun,
robin
...PGP SIGNATURE...
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)
iEYEARECAAYFAk4m+HsACgkQeVUk8U+VK0LgtQCfcL9Jvrtf4kN9iPDMrpKQ/1M7
spwAni358xLLAb4n6LeOEMhiqS/kWnYb
=DmSC
-----END PGP SIGNATURE-----

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [LAD] GCC Vector extensions