On Apr 12, 2010, at 1:17 AM, Stefan Fuhrmann wrote: > Throughput: > SSSE3: ~3.5 bytes / clock tick > SSE2: ~3 bytes / clock tick > MMX: ~1.5 bytes / clock tick > C-Code: ~1 byte / clock tick Stefan, Good. That information should also be in the code comments. Mark