Search found 3 matches
- 2023-06-04, 15:29:08
- Forum: Agner's CPU blog
- Topic: VZEROUPPER issue with Zen4 in 32-bit mode?
- Replies: 3
- Views: 83867
Re: VZEROUPPER issue with Zen4 in 32-bit mode?
I tried replacing all "movdqa reg1, reg2" with "pshufd $0xE4, reg1, reg2" and I'm still getting same results. I also tried to see if forced context switching would allow performance to recover. I modified program to do the YMM instruction + VZEROUPPER just at the beginning of the program, before tes...
- 2023-06-03, 16:10:59
- Forum: Agner's CPU blog
- Topic: VZEROUPPER issue with Zen4 in 32-bit mode?
- Replies: 3
- Views: 83867
VZEROUPPER issue with Zen4 in 32-bit mode?
I've ran into interesting behaviour on Zen4 (7900X) with mixed VEX and non-VEX code. It seems that, in 32-bit mode, VZEROUPPER is not fully clearing some state for YMM/XMM registers which causes non-VEX code run at slower speed after YMM usage. I'm running following code in 32-bit mode and 64-bit mo...
- 2020-08-09, 19:28:30
- Forum: Agner's CPU blog
- Topic: Newer Ryzen
- Replies: 4
- Views: 43212