Rodrigo Locatti
4d4f9cc104
video_core: Silent miscellaneous warnings ( #2820 )
* texture_cache/surface_params: Remove unused local variable
* rasterizer_interface: Add missing documentation commentary
* maxwell_dma: Remove unused rasterizer reference
* video_core/gpu: Sort member declaration order to silent -Wreorder warning
* fermi_2d: Remove unused MemoryManager reference
* video_core: Silent unused variable warnings
* buffer_cache: Silent -Wreorder warnings
* kepler_memory: Remove unused MemoryManager reference
* gl_texture_cache: Add missing override
* buffer_cache: Add missing include
* shader/decode: Remove unused variables
6 years ago
ReinUsesLisp
878adee0a3
gl_buffer_cache: Add missing include
RasterizerInterface was considered an incomplete object by clang.
6 years ago
ReinUsesLisp
e3534700d7
shader_ir/conversion: Split int and float selector and implement F2F H1
6 years ago
ReinUsesLisp
b13fbc25b8
shader_ir/conversion: Implement F2I F16 Ra.H1
6 years ago
ReinUsesLisp
6207751b00
float_set_predicate: Add missing negation bit for the second operand
6 years ago
ReinUsesLisp
4e35177e23
shader_ir: Implement VOTE
Implement VOTE using Nvidia's intrinsics. Documentation about these can
be found here
https://developer.nvidia.com/reading-between-threads-shader-intrinsics
Instead of using portable ARB instructions I opted to use Nvidia
intrinsics because these are the closest we have to how Tegra X1
hardware renders.
To stub VOTE on non-Nvidia drivers (including nouveau) this commit
simulates a GPU with a warp size of one, returning what is meaningful
for the instruction being emulated:
* anyThreadNV(value) -> value
* allThreadsNV(value) -> value
* allThreadsEqualNV(value) -> true
ballotARB, also known as "uint64_t(activeThreadsNV())", emits
VOTE.ANY Rd, PT, PT;
on nouveau's compiler. This doesn't match exactly to Nvidia's code
VOTE.ALL Rd, PT, PT;
Which is emulated with activeThreadsNV() by this commit. In theory this
shouldn't really matter since .ANY, .ALL and .EQ affect the predicates
(set to PT on those cases) and not the registers.
7 years ago
Fernando Sahmkow
83ec2091c1
Buffer Cache: Adress Feedback.
7 years ago
Fernando Sahmkow
6ce2c85047
Buffer_Cache: Implement flushing.
7 years ago
Fernando Sahmkow
de8ff8a1c6
Buffer_Cache: Implement barriers.
7 years ago
Fernando Sahmkow
286f4c446a
Buffer_Cache: Optimize and track written areas.
7 years ago
Fernando Sahmkow
5f4b746a1e
BufferCache: Rework mapping caching.
7 years ago
Fernando Sahmkow
86d8563314
Buffer_Cache: Fixes and optimizations.
7 years ago
Fernando Sahmkow
862bec001b
Video_Core: Implement a new Buffer Cache
7 years ago
ReinUsesLisp
80702aa88f
renderer_opengl: Implement RGB565 framebuffer format
7 years ago
ReinUsesLisp
9cdf5c6c31
renderer_opengl: Use block linear swizzling for CPU framebuffers
7 years ago
ReinUsesLisp
8ad7268c75
renderer_opengl: Use VideoCore pixel format
7 years ago
ReinUsesLisp
9a76e94b3d
gpu: Change optional<reference_wrapper<T>> to T* for FramebufferConfig
7 years ago
ReinUsesLisp
2ff8044806
shader_ir: Implement NOP
7 years ago
ReinUsesLisp
ec0da3ef64
half_set_predicate: Fix HSETP2_C constant buffer offset
7 years ago
Fernando Sahmkow
e52c895559
GPU: Flush commands on every dma pusher step.
This commit ensures that the host gpu is constantly fed with commands to
work with, while the guest gpu keeps producing the rest of the commands.
This reduces syncing time between host and guest gpu.
7 years ago
ReinUsesLisp
77f1a676a1
decode/half_set_predicate: Fix predicates
7 years ago
Fernando Sahmkow
a452ff983d
MaxwellDMA: Fixes, corrections and relaxations.
This commit fixes offsets on Linear -> Tiled copies, corrects z pos
fortiled->linear copies, corrects bytes_per_pixel calculation in tiled
-> linear copies and relaxes some limitations set by latest dma fixes
refactors.
7 years ago
ReinUsesLisp
104641db07
shader/decode: Implement S2R Tic
7 years ago
Fernando Sahmkow
11f4e739bd
Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.
This commit takes care of implementing the F16 Variants of the
conversion instructions and makes sure conversions are done.
7 years ago
Fernando Sahmkow
7a35178ee2
Maxwell3D: Reorganize and address feedback
7 years ago
Fernando Sahmkow
1158777737
Shader_Ir: Change Debug Asserts for Log Warnings
7 years ago
ReinUsesLisp
45c162444d
shader/half_set_predicate: Fix HSETP2 implementation
7 years ago
ReinUsesLisp
6c4985edc9
shader/half_set_predicate: Implement missing HSETP2 variants
7 years ago
Lioncash
c1c89411da
video_core/control_flow: Provide operator!= for types with operator==
Provides operational symmetry for the respective structures.
7 years ago
Lioncash
1780e0e3d0
video_core/control_flow: Prevent sign conversion in TryGetBlock()
The return value is a u32, not an s32, so this would result in an
implicit signedness conversion.
7 years ago
Lioncash
a162a844d2
video_core/control_flow: Remove unnecessary BlockStack copy constructor
This is the default behavior of the copy constructor, so it doesn't need
to be specified.
While we're at it we can make the other non-default constructor
explicit.
7 years ago
Lioncash
56bc11d952
video_core/control_flow: Use std::move where applicable
Results in less work being done where avoidable.
7 years ago
Lioncash
e7b39f47f8
video_core/control_flow: Use the prefix variant of operator++ for iterators
Same thing, but potentially allows a standard library implementation to
pick a more efficient codepath.
7 years ago
Lioncash
6885e7e7ec
video_core/control_flow: Use empty() member function for checking emptiness
It's what it's there for.
7 years ago
Lioncash
45fa12a05c
video_core: Resolve -Wreorder warnings
Ensures that the constructor members are always initialized in the order
that they're declared in.
7 years ago
Lioncash
47df844338
video_core/control_flow: Make program_size for ScanFlow() a std::size_t
Prevents a truncation warning from occurring with MSVC. Also the
internal data structures already treat it as a size_t, so this is just a
discrepancy in the interface.
7 years ago
Lioncash
3df9558593
video_core/control_flow: Place all internally linked types/functions within an anonymous namespace
Previously, quite a few functions were being linked with external
linkage.
7 years ago
Lioncash
1109db86b7
video_core/shader/decode: Prevent sign-conversion warnings
Makes it explicit that the conversions here are intentional.
7 years ago
Fernando Sahmkow
5a06e33859
Shader_Ir: correct clang format
7 years ago
Fernando Sahmkow
43f57d668c
GPU: Add missing puller methods.
This adds some missing puller methods. We don't assert them as these are
nop operations for us.
7 years ago
Fernando Sahmkow
3a3fee5abf
MaxwellDMA/KeplerCopy: Downgrade DMA log message to Trace.
This log was just to know which games used DMA. It's no longer
important.
7 years ago
Fernando Sahmkow
d3b71ff80d
Gl_Texture_Cache: Remove assert on component type in GetFormatTuple
Textures can have different components types in different orders. This
assert was completely inprecise and the effectiveness of such is better
handled by case and within the texture cache.
7 years ago
Fernando Sahmkow
0b65e9335e
Shader_Ir: Downgrade precision and rounding asserts to debug asserts.
This commit reduces the sevirity of asserts for FP precision and
rounding as this are well known and have little to no consequences in
gpu's accuracy.
7 years ago
ReinUsesLisp
74632c76ce
gl_shader_decompiler: Rename bufferImage to imageBuffer
The online OpenGL documentation is wrong. The type definition is
imageBuffer.
7 years ago
ReinUsesLisp
87909d327f
gl_shader_cache: Fix newline on buffer preprocessor definitions
7 years ago
ReinUsesLisp
e7bdf8b22a
textures: Fix texture buffer size calculation
7 years ago
ReinUsesLisp
84027f4808
gl_texture_cache: Do not set texture parameters to buffers
7 years ago
ReinUsesLisp
73b2dc6d4f
gl_texture_cache: Add missing break in CreateTexture
7 years ago
Fernando Sahmkow
4be61013a1
GL_State: Feedback and fixes
7 years ago
Fernando Sahmkow
5ad889f6fd
Maxwell3D: Address Feedback
7 years ago