mirror of
https://git.ffmpeg.org/ffmpeg.git
synced 2025-12-23 23:40:15 +01:00
233 to 105 cycles on Arrandale and Win64. Replacing the multiplication by s_m[m] by a pand and a pxor with appropriate vectors is slower. Unrolling is a 15 cycles win. A SSE version was 4 cycles slower. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
37 KiB
37 KiB