yuzu-mainline - A backup of the Yuzu mainline repo. Only includes the master branch, nothing else.

Age	Commit message (Collapse)	Author
2020-06-26	Merge pull request #4147 from ReinUsesLisp/hset2-imm	bunnei
	shader/half_set: Implement HSET2_IMM
2020-06-22	shader/half_set: Implement HSET2_IMM	ReinUsesLisp
	Add HSET2_IMM. Due to the complexity of the encoding avoid using BitField unions and read the relevant bits from the code itself. This is less error prone.
2020-06-20	decode/image: Implement B10G11R11F	Morph
	- Used by Kirby Star Allies
2020-06-05	shader/texture: Join separate image and sampler pairs offline	ReinUsesLisp
	Games using D3D idioms can join images and samplers when a shader executes, instead of baking them into a combined sampler image. This is also possible on Vulkan. One approach to this solution would be to use separate samplers on Vulkan and leave this unimplemented on OpenGL, but we can't do this because there's no consistent way of determining which constant buffer holds a sampler and which one an image. We could in theory find the first bit and if it's in the TIC area, it's an image; but this falls apart when an image or sampler handle use an index of zero. The used approach is to track for a LOP.OR operation (this is done at an IR level, not at an ISA level), track again the constant buffers used as source and store this pair. Then, outside of shader execution, join the sample and image pair with a bitwise or operation. This approach won't work on games that truly use separate samplers in a meaningful way. For example, pooling textures in a 2D array and determining at runtime what sampler to use. This invalidates OpenGL's disk shader cache :) - Used mostly by D3D ports to Switch
2020-06-02	Merge pull request #4016 from ReinUsesLisp/invocation-info	LC
	shader/other: Fix hardcoded value in S2R INVOCATION_INFO
2020-05-30	shader/other: Fix hardcoded value in S2R INVOCATION_INFO	ReinUsesLisp
	Geometry shaders built from Nvidia's compiler check for bits[16:23] to be less than or equal to 0 with VSETP to default to a "safe" value of 0x8000'0000 (safe from hardware's perspective). To avoid hitting this path in the shader, return 0x00ff'0000 from S2R INVOCATION_INFO. This seems to be the maximum number of vertices a geometry shader can emit in a primitive.
2020-05-27	shader/other: Implement MEMBAR.CTS	ReinUsesLisp
	This silences an assertion we were hitting and uses workgroup memory barriers when the game requests it.
2020-05-26	Merge pull request #3981 from ReinUsesLisp/bar	bunnei
	shader/other: Implement BAR.SYNC 0x0
2020-05-26	Merge pull request #3980 from ReinUsesLisp/red-op	bunnei
	shader/memory: Implement non-addition operations in RED
2020-05-21	shader/other: Implement BAR.SYNC 0x0	ReinUsesLisp
	Trivially implement this particular case of BAR. Unless games use OpenCL or CUDA barriers, we shouldn't hit any other case here.
2020-05-21	shader/memory: Implement non-addition operations in RED	ReinUsesLisp
	Trivially implement these instructions. They are used in Astral Chain.
2020-05-21	shader/other: Implement thread comparisons (NV_shader_thread_group)	ReinUsesLisp
	Hardware S2R special registers match gl_Thread*MaskNV. We can trivially implement these using Nvidia's extension on OpenGL or naively stubbing them with the ARB instructions to match. This might cause issues if the host device warp size doesn't match Nvidia's. That said, this is unlikely on proper shaders. Refer to the attached url for more documentation about these flags. https://www.khronos.org/registry/OpenGL/extensions/NV/NV_shader_thread_group.txt
2020-05-09	shader_ir: Separate float-point comparisons in ordered and unordered	ReinUsesLisp
	This allows us to use native SPIR-V instructions without having to manually check for NAN.
2020-05-02	Merge pull request #3693 from ReinUsesLisp/clean-samplers	bunnei
	shader/texture: Support multiple unknown sampler properties
2020-04-28	shader/arithmetic_integer: Fix tracking issue in temporary	ReinUsesLisp
	This temporary is not needed as we mark Rd.CC + IADD.X as unimplemented. It caused issues when tracking global buffers.
2020-04-25	shader/arithmetic_integer: Fix edge case and mark IADD.X Rd.CC as unimplemented	ReinUsesLisp
	IADD.X Rd.CC requires some extra logic that is not currently implemented. Abort when this is hit.
2020-04-25	shader/arithmetic_integer: Change IAdd to UAdd to avoid signed overflow	ReinUsesLisp
	Signed integer addition overflow might be undefined behavior. It's free to change operations to UAdd and use unsigned integers to avoid potential bugs.
2020-04-25	shader/arithmetic_integer: Implement IADD.X	ReinUsesLisp
	IADD.X takes the carry flag and adds it to the result. This is generally used to emulate 64-bit operations with 32-bit registers.
2020-04-25	shader/arithmetic_integer: Implement CC for IADD	ReinUsesLisp

2020-04-25	decode/register_set_predicate: Implement CC	ReinUsesLisp
	P2R CC takes the state of condition codes and puts them into a register. We already have this implemented for PR (predicates). This commit implements CC over that.
2020-04-25	decode/register_set_predicate: Use move for shared pointers	ReinUsesLisp
	Avoid atomic counters used by shared pointers.
2020-04-25	Merge pull request #3734 from ReinUsesLisp/half-float-mods	bunnei
	decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits
2020-04-24	Merge pull request #3749 from ReinUsesLisp/lea-imm	bunnei
	shader/arithmetic_integer: Fix LEA_IMM encoding
2020-04-23	decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits	ReinUsesLisp
	The encoding for negation and absolute value was wrong. Extracting is now done manually. Similar instructions having different encodings is the rule, not the exception. To keep sanity and readability I preferred to extract the desired bit manually. This is implemented against nxas: https://github.com/ReinUsesLisp/nxas/blob/8dbc38995711cc12206aa370145a3a02665fd989/table.h#L68 That is itself tested against nvdisasm (Nvidia's official disassembler).
2020-04-23	shader/texture: Support multiple unknown sampler properties	ReinUsesLisp
	This allows deducing some properties from the texture instruction before asking the runtime. By doing this we can handle type mismatches in some instructions from the renderer instead of the shader decoder. Fixes texelFetch issues with games using 2D texture instructions on a 1D sampler.
2020-04-23	shader_ir: Turn classes into data structures	ReinUsesLisp

2020-04-20	shader/arithmetic_integer: Fix LEA_IMM encoding	ReinUsesLisp
	The operand order in LEA_IMM was flipped compared to nvdisasm. Fix that using nxas as reference: https://github.com/ReinUsesLisp/nxas/blob/8dbc38995711cc12206aa370145a3a02665fd989/table.h#L122
2020-04-16	decode/memory: Resolve unused variable warning	Lioncash
	Only the first element of the returned pair is ever used.
2020-04-16	decode/texture: Resolve unused variable warnings.	Lioncash
	Some variables aren't used, so we can remove these. Unfortunately, diagnostics are still reported on structured bindings even when annotated with [[maybe_unused]], so we need to unpack the elements that we want to use manually.
2020-04-16	decode/texture: Collapse loop down into std::generate	Lioncash
	Same behavior, less code.
2020-04-16	decode/texture: Eliminate trivial missing field initializer warnings	Lioncash
	We can just specify the initializers.
2020-04-16	Merge pull request #3673 from lioncash/extra	bunnei
	CMakeLists: Specify -Wextra on linux builds
2020-04-16	decode/shift: Remove unused variable within Shift()	Lioncash
	Removes a redundant variable that is already satisfied by the IsFull() utility function.
2020-04-15	CMakeLists: Specify -Wextra on linux builds	Lioncash
	Allows reporting more cases where logic errors may exist, such as implicit fallthrough cases, etc. We currently ignore unused parameters, since we currently have many cases where this is intentional (virtual interfaces). While we're at it, we can also tidy up any existing code that causes warnings. This also uncovered a few bugs as well.
2020-04-15	Merge pull request #3612 from ReinUsesLisp/red	Fernando Sahmkow
	shader/memory: Implement RED.E.ADD and minor changes to ATOM
2020-04-14	shader/arithmetic: Add FCMP_CR variant	ReinUsesLisp
	Adds another variant of FCMP.
2020-04-13	Merge pull request #3619 from ReinUsesLisp/i2i	Mat M
	shader/conversion: Implement I2I sign extension, saturation and selection
2020-04-13	Merge pull request #3633 from ReinUsesLisp/clean-texdec	Mat M
	shader/texture: Remove type mismatches management from shader decoder
2020-04-12	Merge pull request #3578 from ReinUsesLisp/vmnmx	Fernando Sahmkow
	shader/video: Partially implement VMNMX
2020-04-12	shader/video: Partially implement VMNMX	ReinUsesLisp
	Implements the common usages for VMNMX. Inputs with a different size than 32 bits are not supported and sign mismatches aren't supported either. VMNMX works as follows: It grabs Ra and Rb and applies a maximum/minimum on them (this is defined by .MX), having in mind the input sign. This result can then be saturated. After the intermediate result is calculated, it applies another operation on it using Rc. These operations are merges, accumulations or another min/max pass. This instruction allows to implement with a more flexible approach GCN's min3 and max3 instructions (for instance).
2020-04-10	shader/texture: Remove type mismatches management from shader decoder	ReinUsesLisp
	Since commit e22816a5bb we handle type mismatches from the CPU. We don't need to hack our shader decoder due to game bugs anymore. Removed in this commit.
2020-04-09	Merge pull request #3601 from ReinUsesLisp/some-shader-encodings	bunnei
	video_core/shader: Add some instruction and S2R encodings
2020-04-07	Merge pull request #3489 from namkazt/patch-2	Rodrigo Locatti
	shader: implement SULD.D bits32/64
2020-04-07	address nit.	Nguyen Dac Nam

2020-04-07	shader/conversion: Implement I2I sign extension, saturation and selection	ReinUsesLisp
	Reimplements I2I adding sign extension, saturation (clamp source value to the destination), selection and destination sizes that are not 32 bits wide. It doesn't implement CC yet.
2020-04-07	Apply suggestions from code review	Nguyen Dac Nam
	Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc>
2020-04-06	shader_decode: SULD.D using std::pair instead of out parameter	namkazy

2020-04-06	shader_decode: SULD.D avoid duplicate code block.	namkazy

2020-04-06	shader_decode: SULD.D fix conversion error.	namkazy

2020-04-06	shader_decode: SULD.D implement bits64 and reverse shader ir init method to ↵	namkazy
	removed shader stage.