Uses arithmetic that compilers can more easily recognize as a byte swap
and optimize. For example, rather than shifting the halves of the value
and then swapping and combining them, we can swap the bytes in place.
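
As a rough illustration of the two shapes described above, here is a minimal
sketch. The function names and exact expressions are assumptions made for
this example and are not taken from the actual change: swap32_halves stands
in for the "swap the halves, then combine" form, and swap32_masked for the
"swap in place" form.

```cpp
#include <cstdint>

// Hypothetical "before" shape: byte-swap each 16-bit half, then exchange
// the two halves while combining them.
constexpr std::uint16_t swap16_halves(std::uint16_t data) {
    return static_cast<std::uint16_t>((data >> 8) | (data << 8));
}

constexpr std::uint32_t swap32_halves(std::uint32_t data) {
    return (static_cast<std::uint32_t>(swap16_halves(static_cast<std::uint16_t>(data))) << 16) |
           swap16_halves(static_cast<std::uint16_t>(data >> 16));
}

// Hypothetical "after" shape: mask out each byte and move it straight to
// its final position in a single expression.
constexpr std::uint32_t swap32_masked(std::uint32_t data) {
    return ((data & 0xFF000000U) >> 24) | ((data & 0x00FF0000U) >> 8) |
           ((data & 0x0000FF00U) << 8) | ((data & 0x000000FFU) << 24);
}

// Both forms compute the same value; only the shape seen by the optimizer differs.
static_assert(swap32_halves(0x11223344U) == 0x44332211U, "halves form swaps bytes");
static_assert(swap32_masked(0x11223344U) == 0x44332211U, "masked form swaps bytes");
```

Whether a given compiler folds either form into a single bswap depends on
its version and flags, so checking the generated assembly is the only
reliable confirmation.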
For the original swap32 code on x86-64, clang 8.0 would generate:
mov ecx, edi
rol cx, 8
shl ecx, 16
shr edi, 16
rol di, 8
movzx eax, di
or eax, ecx
ret
while GCC 8.3 would generate the ideal:
mov eax, edi
bswap eax
ret
Now both generate the same optimal output.
MSVC used to generate the following with the old code:
mov eax, ecx
rol cx, 8
shr eax, 16
rol ax, 8
movzx ecx, cx
movzx eax, ax
shl ecx, 16
or eax, ecx
ret 0
Now MSVC also generates a result that is similar to, and just as optimal
as, that of clang/GCC:
bswap ecx
mov eax, ecx
ret 0
====
In the swap64 case, for the original code, clang 8.0 would generate:
mov eax, edi
bswap eax
shl rax, 32
shr rdi, 32
bswap edi
or rax, rdi
ret
(almost there, but still missing the mark)
while, again, GCC 8.3 would generate the ideal:
mov rax, rdi
bswap rax
ret
Now clang also generates the optimal sequence for this fallback.
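
For reference, a 64-bit fallback written in the "swap in place" style could
look like the sketch below. The name swap64_masked and the exact expression
are assumptions for illustration, not a copy of the actual code.

```cpp
#include <cstdint>

// Hypothetical 64-bit fallback: each byte is masked out and shifted directly
// to its mirrored position, with no intermediate 32-bit swaps.
constexpr std::uint64_t swap64_masked(std::uint64_t data) {
    return ((data & 0xFF00000000000000ULL) >> 56) |
           ((data & 0x00FF000000000000ULL) >> 40) |
           ((data & 0x0000FF0000000000ULL) >> 24) |
           ((data & 0x000000FF00000000ULL) >> 8) |
           ((data & 0x00000000FF000000ULL) << 8) |
           ((data & 0x0000000000FF0000ULL) << 24) |
           ((data & 0x000000000000FF00ULL) << 40) |
           ((data & 0x00000000000000FFULL) << 56);
}

static_assert(swap64_masked(0x0102030405060708ULL) == 0x0807060504030201ULL,
              "bytes are mirrored end to end");
```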
This is a case where MSVC unfortunately falls short: even with the new
code, it still generates a doozy of an output:
mov r8, rcx
mov r9, rcx
mov rax, 71776119061217280
mov rdx, r8
and r9, rax
and edx, 65280
mov rax, rcx
shr rax, 16
or r9, rax
mov rax, rcx
shr r9, 16
mov rcx, 280375465082880
and rax, rcx
mov rcx, 1095216660480
or r9, rax
mov rax, r8
and rax, rcx
shr r9, 16
or r9, rax
mov rcx, r8
mov rax, r8
shr r9, 8
shl rax, 16
and ecx, 16711680
or rdx, rax
mov eax, -16777216
and rax, r8
shl rdx, 16
or rdx, rcx
shl rdx, 16
or rax, rdx
shl rax, 8
or rax, r9
ret 0
which is pretty unfortunate.
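
The decimal immediates in the MSVC listing above are easier to read in hex:
each one is 0xFF placed in a single byte lane. The sketch below checks that
arithmetic. The helper name is_byte_lane_mask is made up for this example.

```cpp
#include <cstdint>

// True if `mask` is exactly 0xFF shifted into byte lane `lane` (0 = lowest byte).
constexpr bool is_byte_lane_mask(std::uint64_t mask, unsigned lane) {
    return mask == (0xFFULL << (8 * lane));
}

static_assert(is_byte_lane_mask(71776119061217280ULL, 6), "0x00FF000000000000");
static_assert(is_byte_lane_mask(280375465082880ULL, 5), "0x0000FF0000000000");
static_assert(is_byte_lane_mask(1095216660480ULL, 4), "0x000000FF00000000");
// `mov eax, -16777216` writes the 32-bit pattern 0xFF000000; on x86-64 a
// write to eax zero-extends into rax, so this is the lane-3 mask.
static_assert(is_byte_lane_mask(static_cast<std::uint32_t>(-16777216), 3), "0xFF000000");
static_assert(is_byte_lane_mask(16711680ULL, 2), "0x00FF0000");
static_assert(is_byte_lane_mask(65280ULL, 1), "0x0000FF00");
```

In other words, MSVC keeps the per-byte masks and shifts as written instead
of collapsing them into a single bswap.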