[ANN] macstl 0.2.1 -- portable SIMD toolkit beats the autovectorizing Intel ICC 8.1

G

Glen Low

Thanks for all your patience and help with debugging macstl 0.2 --
especially Paul Baxter and Derek Piasecki who helped with AMD64
issues. I'm happy to announce the immediate availability of macstl
0.2.1, which features support for Intel ICC 8.1 on Windows and partial
support for IBM XLC++ 6.0 on Mac OS X.

http://www.pixelglow.com/macstl/

macstl is a portable SIMD (single instruction multiple data) toolkit
featuring fast transcendental and integer division functions, complex
number arithmetic and cross-platform programming, all in an
easy-to-use syntax. macstl requires Mac OS X 10.3 or Windows 2000, XP
or Server 2003. The library is open-source and free when derived code
is reciprocated, otherwise it is $99 for a Personal license and $499
for a Corporate License.

Here's the list of changes:

* Fixed member and binary min and max for vec <unsigned short, 4>
[PBa].
* Fixed #include error with own projects [DCh].
* Added support for Intel ICC 8.1 [ACu].
* Fixed truncation of signed constants in unsigned parameters [DPi].
* Added partial support for IBM XLC 6.0.
* Fixed header access paths and missing functions malloc, free,
vm_allocate, vm_copy, vm_deallocate for Codewarrior.
* Fixed #include <sys/mman.h> error, domain in trigonometric test for
VC++. Improved inlining for ICC.

I haven't got the latest benchmarks up yet, but so far macstl 0.2.1
beats the autovectorizing Intel ICC -- some 2x to 16x faster than
autovectorized code!!


Cheers, Glen Low
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
474,444
Messages
2,571,709
Members
48,796
Latest member
Greg L.
Top