Agner's CPU blog

Test results for Intel's Sandy Bridge processor

Author: anon

Date: 2013-08-09 04:50

Interesting. So it sounds like the odd rule also exists in the uop cache territory?

Here is another example:

or rax, 1
or rdx, 1
or rsi, 1
movaps xmm0, [r10]
or rdi, 1
or r8, 1
movaps xmm1, [r11]
or r9, 1

This runs at 2 clocks / 8 instructions regardless of whether it hits or misses the uop cache. But if all the ORs are changed to ANDs, it slows to 2.45 clocks / 8 instructions when the code does not fit into the uop cache.

Of course, with the last AND moved ahead of the second MOVAPS:

and rax, 1
and rdx, 1
and rsi, 1
movaps xmm0, [r10]
and rdi, 1
and r8, 1
and r9, 1
movaps xmm1, [r11]

This runs at 2 clocks / 8 instructions without any problem.

The result means not only that the decode throughput of the AND instruction is limited to 3 per cycle, but also that the 4-1-1-1 pattern rule applies to it. This makes me believe that macro-fusible instructions are handled only by the simple decoders.
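For comparison with the 3-per-cycle and 4-per-cycle figures, the timings quoted above can be converted to instructions per clock. This is only a small arithmetic sketch; the function name `instructions_per_clock` is made up for illustration, and the inputs are the two timings reported in this post.

```python
def instructions_per_clock(instructions, clocks):
    """Return the average throughput in instructions per clock cycle."""
    return instructions / clocks

# Timings quoted in the post, each for one pass over the 8-instruction block.
fast = instructions_per_clock(8, 2.0)    # OR version, and the reordered AND version
slow = instructions_per_clock(8, 2.45)   # first AND version when it misses the uop cache

print(f"fast case: {fast:.2f} instructions/clock")
print(f"slow case: {slow:.2f} instructions/clock")
print(f"extra time per block: {2.45 / 2.0 - 1:.1%}")
```

The slow case works out to a little under 3.3 instructions per clock, which lies between the 3-per-cycle limit suggested for AND and the 4-per-cycle rate seen in the fast case.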
