ameerj
90a0506d56
lower_int64_to_int32: Add 64-bit atomic fallbacks
4 years ago
ameerj
ad58d7eae7
shaders: Add U64->U32x2 Atomic fallback functions
4 years ago
ameerj
4790ba7839
spirv_atomic: Define U32x2 storage buffers for 64-bit storage atomics
Some drivers do not support 64-bit atomics, and fallback to atomically modifying U32x2 vectors. This change ensures that U32x2 storage vectors are defined in the spir-v shader when 64-bit atomics are used.
Fixes a hang on some devices, notably Intel GPUs, when booting Pokemon Legends Arceus
4 years ago
ameerj
14ac0c2923
shader: Add integer attribute get optimization pass
Works around an nvidia driver bug, where casting the integer attributes to float and back to an integer always returned 0.
4 years ago
Fernando Sahmkow
6f98690963
ShaderCache: Better fix for Shuffling gl_FragCoord
5 years ago
FernandoS27
de1c8c5c2c
Texture Cahe/Shader decompiler: Resize PointSize on rescaling, refactor and make reaper more agressive on 4Gb GPUs.
5 years ago
ameerj
581ea90062
rescaling_pass: Fix IR errors when unscalable texture types are encountered
5 years ago
ameerj
99eec162da
rescaling_pass: Logic simplification and minor style cleanup
5 years ago
ameerj
276565973f
rescaling_pass: Scale ImageFetch offset if it exists
Plus some code deduplication
5 years ago
ameerj
dd66384451
rescaling_pass: Enable PatchImageQueryDimensions on fragment stages
5 years ago
ameerj
b027fac794
gl_texture_cache/rescaling_pass: minor cleanup
5 years ago
ameerj
edb5844240
rescaling_pass: Fix and simplify shuffle/fragcoord pass
5 years ago
Fernando Sahmkow
b3a9c8f108
Shader: Don't rescale FragCoord if used by Shuffle
5 years ago
Fernando Sahmkow
dfa8291526
RescalingPass: Agregate pixels on texelFetch while on Fragment Shader
5 years ago
Fernando Sahmkow
8f78444de3
shader: Fix TextureSize check on rescaling.
5 years ago
ReinUsesLisp
e66d5b88a6
shader: Properly scale image reads and add GL SPIR-V support
Thanks for everything!
5 years ago
ReinUsesLisp
fc9bb3c3fe
shader: Properly blacklist and scale image loads
5 years ago
ReinUsesLisp
01379c5e3c
shader/rescaling_pass: Patch more instructions
5 years ago
ReinUsesLisp
c15332c44f
shader: Add IsTextureScaled opcode
5 years ago
ReinUsesLisp
e580299467
shader: Fix rescaling pass
5 years ago
ReinUsesLisp
1672e9ba09
shader: Fix resolution scaling pass
5 years ago
Fernando Sahmkow
360e897ccd
ShaderDecompiler: Add initial support for rescaling.
5 years ago
Fernando Sahmkow
194579bc4f
ShaderCache: Fix Phi Nodes Type on OGL.
5 years ago
Fernando Sahmkow
c50ad56bf5
ShaderCache: Order Phi Arguments from farthest away to nearest.
5 years ago
Fernando Sahmkow
e5291e2031
TexturePass: Fix clamping of images as this allowed negative indices.
5 years ago
Fernando Sahmkow
3f4444b552
Shader Compiler: avoid overflowed indices on indixed samplers.
5 years ago
ReinUsesLisp
8c9febe8f7
shader: Fold UnpackFloat2x16 and PackFloat2x16
Simplifies the code a bit when possible. These instructions should be
no-ops codegen wise.
5 years ago
ReinUsesLisp
1bb46b7d64
shader: Mark ConvertF16F32 and ConvertF32F16 as fp16 instructions
Fixes instances where fp16 types are not declared on SPIR-V but they are
used. This shouldn't happen on master, as it's been uncovered by an
additional optimization pass.
5 years ago
ReinUsesLisp
66a0cedba3
shader: Fold integer FMA from Nvidia's pattern
Fold shaders doing "a * b + c" on integers from the pattern generated by
Nvidia's GL compiler.
On a somewhat complex compute shader it reduces the code size by 16
instructions from 2 matches on Turing GPUs.
On Intel as extracted from KHR_pipeline_executable_properties:
Before the optimization:
```
Instruction Count: 2057
Basic Block Count: 45
Scratch Memory Size: 14752
Spill Count: 232
Fill Count: 261
SEND Count: 610
Cycle Count: 11325
```
After the optimization:
```
Instruction Count: 2046
Basic Block Count: 44
Scratch Memory Size: 13728
Spill Count: 219
Fill Count: 268
SEND Count: 604
Cycle Count: 11367
```
5 years ago
ReinUsesLisp
09fb41dc63
shader: Use TryInstRecursive on XMAD multiply folding
Simplify a bit the logic.
5 years ago
ReinUsesLisp
bf2956d77a
shader: Avoid usage of C++20 ranges to build in clang
5 years ago
lat9nq
49946cf780
shader_recompiler, video_core: Resolve clang errors
Silences the following warnings-turned-errors:
-Wsign-conversion
-Wunused-private-field
-Wbraced-scalar-init
-Wunused-variable
And some other errors
5 years ago
ameerj
41c6cb70f9
glsl: Fix tracking of info.uses_shadow_lod
5 years ago
ameerj
57f222c56e
dual_vertex_pass: Clang format
5 years ago
ReinUsesLisp
7dafa96ab5
shader: Rework varyings and implement passthrough geometry shaders
Put all varyings into a single std::bitset with helpers to access it.
Implement passthrough geometry shaders using host's.
5 years ago
lat9nq
257d2aab74
lower_int64_to_int32: Add missing include
5 years ago
ReinUsesLisp
d8d5501459
shader: Add int64 to int32 lowering pass
5 years ago
ReinUsesLisp
04ef2160f9
shader: Teach global memory base tracker to follow vectors
5 years ago
ReinUsesLisp
97e80dda55
shader: Add constant propagation to integer vectors
5 years ago
ReinUsesLisp
808ef97a08
shader: Move loop safety tests to code emission
5 years ago
ameerj
a0365217f5
texture_pass: Fix is_read image qualification
Atomic operations are considered to have both read and write access. This was not being accounted for.
5 years ago
ReinUsesLisp
0cd08b3e72
shader: Align constant buffer sizes to 16 bytes
WAR for AMD reading zeroes on uniform buffers of size 2.
5 years ago
ReinUsesLisp
374eeda1a3
shader: Properly manage attributes not written from previous stages
5 years ago
ameerj
d36f667bc0
glsl: Address rest of feedback
5 years ago
ameerj
a0d0704aff
glsl: Conditionally add EXT_texture_shadow_lod
5 years ago
ameerj
6aa1bf7b6f
glsl: Implement legacy varyings
5 years ago
ameerj
9ccbd74991
glsl: Fix ATOM and implement ATOMS
5 years ago
ameerj
5399906c26
glsl: Track S32 atomics
5 years ago
ameerj
11ba190462
glsl: Revert ssbo aliasing. Storage Atomics impl
5 years ago
ameerj
3d9ecbe998
glsl: Wip storage atomic ops
5 years ago