yuzu-mainline - A backup of the Yuzu mainline repo. Only includes the master branch, nothing else.

Age	Commit message (Collapse)	Author
2021-02-13	video_core: Reimplement the buffer cache	ReinUsesLisp
	Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.
2021-02-13	gpu: Report renderer errors with exceptions	ReinUsesLisp
	Instead of using a two step initialization to report errors, initialize the GPU renderer and rasterizer on the constructor and report errors through std::runtime_error.
2021-01-24	maxwell_3d: Silence array bounds warnings	ReinUsesLisp

2021-01-15	common/common_funcs: Rename INSERT_UNION_PADDING_{BYTES,WORDS} to _NOINIT	ReinUsesLisp
	INSERT_PADDING_BYTES_NOINIT is more descriptive of the underlying behavior.
2020-12-30	video_core: Rewrite the texture cache	ReinUsesLisp
	The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage.The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage. This commit aims to address those issues.
2020-12-15	Merge pull request #5157 from lioncash/array-dirty	bunnei
	maxwell_3d: Remove unused dirty_pointer array
2020-12-07	video_core: Remove unnecessary enum class casting in logging messages	Lioncash
	fmt now automatically prints the numeric value of an enum class member by default, so we don't need to use casts any more. Reduces the line noise a bit.
2020-12-06	maxwell_3d: Move member variables to end of class	Lioncash
	Follows our established coding style.
2020-12-06	maxwell_3d: Resolve -Wdocumentation warning	Lioncash
	Removes a documentation comment for a non-existent member.
2020-12-06	maxwell_3d: Remove unused dirty_pointer array	Lioncash
	This is unused and removing it shrinks the structure by 3584 bytes.
2020-12-05	maxwell_dma: Rename RenderEnable::Mode::FALSE and TRUE to avoid name conflict	comex
	On Apple platforms, FALSE and TRUE are defined as macros by <mach/boolean.h>, which is included by various system headers. Note that there appear to be no actual users of the names to fix up.
2020-12-04	video_core: Resolve more variable shadowing scenarios	Lioncash
	Resolves variable shadowing scenarios up to the end of the OpenGL code to make it nicer to review. The rest will be resolved in a following commit.
2020-11-26	vk_shader_decompiler: Implement force early fragment tests	ReinUsesLisp
	Force early fragment tests when the 3D method is enabled. The established pipeline cache takes care of recompiling if needed. This is implemented only on Vulkan to avoid invalidating the shader cache on OpenGL.
2020-11-20	Merge pull request #4953 from lioncash/shader-shadow	bunnei
	shader_bytecode: Eliminate variable shadowing
2020-11-20	shader_bytecode: Make use of [[nodiscard]] where applicable	Lioncash
	Ensures that all queried values are made use of.
2020-11-20	shader_bytecode: Eliminate variable shadowing	Lioncash

2020-11-11	maxwell_3d: Use insert instead of loop push_back	ReinUsesLisp
	This reduces the overhead of bounds checking on each element. It won't reduce the cost of allocation because usually this vector's capacity is usually large enough to hold whatever we push to it.
2020-11-11	maxwell_3d: Move code to separate functions	ReinUsesLisp
	Deduplicate some code and put it in separate functions so it's easier to understand and profile.
2020-10-28	shader/arithmetic: Implement FCMP immediate + register variant	ReinUsesLisp
	Trivially add the encoding for this.
2020-10-09	video_core: Enforce -Wclass-memaccess	ReinUsesLisp

2020-10-02	video_core: Enforce -Wunused-variable and -Wunused-but-set-variable	ReinUsesLisp

2020-09-22	General: Make use of std::nullopt where applicable	Lioncash
	Allows some implementations to avoid completely zeroing out the internal buffer of the optional, and instead only set the validity byte within the structure. This also makes it consistent how we return empty optionals.
2020-09-18	fermi_2d: Make use of designated initializers	Lioncash
	Same behavior, less repetition. We can also ensure all members of Config are initialized.
2020-08-22	video_core: Initialize renderer with a GPU	ReinUsesLisp
	Add an extra step in GPU initialization to be able to initialize render backends with a valid GPU instance.
2020-08-16	Merge pull request #4519 from lioncash/semi	bunnei
	maxwell_3d: Resolve -Wextra-semi warning
2020-08-14	maxwell_3d: Resolve -Wextra-semi warning	Lioncash
	Semicolons after a function definition aren't necessary.
2020-08-10	textures/decoders: Fix block linear to pitch copies	ReinUsesLisp
	There were two issues with block linear copies. First the swizzling was wrong and this commit reimplements them. The other issue was that these copies are generally used to download render targets from the GPU and yuzu was not downloading them from host GPU memory unless the extreme GPU accuracy setting was selected. This commit enables cached memory reads for all accuracy levels. - Fixes level thumbnails in Super Mario Maker 2.
2020-07-10	video_core/textures: Add and use SwizzleSliceToVoxel, and minor style changes	ReinUsesLisp
	Change GOB sizes from free-functions to constexpr constants. Add SwizzleSliceToVoxel, a function that swizzles a 2D array of pixels into a 3D texture and use it for 3D copies.
2020-07-07	maxwell_dma: Rename registers to match official docs and reorder	ReinUsesLisp
	Rename registers in the MaxwellDMA class to match Nvidia's official documentation. This one can be found here: https://github.com/NVIDIA/open-gpu-doc/blob/master/classes/dma-copy/clb0b5.h While we are at it, reorganize the code in MaxwellDMA to be separated in different functions.
2020-06-26	Merge pull request #4147 from ReinUsesLisp/hset2-imm	bunnei
	shader/half_set: Implement HSET2_IMM
2020-06-24	Addressed issues	David Marcec

2020-06-24	Macro HLE support	David Marcec

2020-06-22	shader/half_set: Implement HSET2_IMM	ReinUsesLisp
	Add HSET2_IMM. Due to the complexity of the encoding avoid using BitField unions and read the relevant bits from the code itself. This is less error prone.
2020-06-13	Merge pull request #4049 from ReinUsesLisp/separate-samplers	bunnei
	shader/texture: Join separate image and sampler pairs offline
2020-06-08	texture_cache: Implement rendering to 3D textures	ReinUsesLisp
	This allows rendering to 3D textures with more than one slice. Applications are allowed to render to more than one slice of a texture using gl_Layer from a VTG shader. This also requires reworking how 3D texture collisions are handled, for now, this commit allows rendering to slices but not to miplevels. When a render target attempts to write to a mipmap, we fallback to the previous implementation (copying or flushing as needed). - Fixes color correction 3D textures on UE4 games (rainbow effects). - Allows Xenoblade games to render to 3D textures directly.
2020-06-05	shader/texture: Join separate image and sampler pairs offline	ReinUsesLisp
	Games using D3D idioms can join images and samplers when a shader executes, instead of baking them into a combined sampler image. This is also possible on Vulkan. One approach to this solution would be to use separate samplers on Vulkan and leave this unimplemented on OpenGL, but we can't do this because there's no consistent way of determining which constant buffer holds a sampler and which one an image. We could in theory find the first bit and if it's in the TIC area, it's an image; but this falls apart when an image or sampler handle use an index of zero. The used approach is to track for a LOP.OR operation (this is done at an IR level, not at an ISA level), track again the constant buffers used as source and store this pair. Then, outside of shader execution, join the sample and image pair with a bitwise or operation. This approach won't work on games that truly use separate samplers in a meaningful way. For example, pooling textures in a 2D array and determining at runtime what sampler to use. This invalidates OpenGL's disk shader cache :) - Used mostly by D3D ports to Switch
2020-06-04	Merge pull request #4009 from ogniK5377/macro-jit-prod	bunnei
	video_core: Implement Macro JIT
2020-06-04	Default init labels and use initializer list for macro engine	David Marcec

2020-06-03	Mark parameters as const	David Marcec

2020-06-02	Pass by reference instead of copying parameters	David Marcec

2020-06-01	Merge pull request #3998 from ReinUsesLisp/init-3d	bunnei
	maxwell_3d: Initialize more registers to their expected value
2020-05-30	Implement macro JIT	David Marcec

2020-05-28	maxwell_3d: Reduce severity of logs that can be spammed	ReinUsesLisp
	These logs were killing performance on some games when they were spammed. Reduce them to Debug severity.
2020-05-27	maxwell_3d: Initialize line widths	ReinUsesLisp
	Initialize line widths to avoid setting a line width of zero.
2020-05-27	maxwell_3d: Initialize polygon modes	ReinUsesLisp
	NVN expects this to be initialized as Fill, otherwise games that never bind a rasterizer state will log an invalid polygon mode.
2020-05-13	Merge pull request #3899 from ReinUsesLisp/float-comparisons	bunnei
	shader_ir: Add separate instructions for ordered and unordered comparisons and fix NE on GLSL
2020-05-09	shader_ir: Separate float-point comparisons in ordered and unordered	ReinUsesLisp
	This allows us to use native SPIR-V instructions without having to manually check for NAN.
2020-05-08	Merge pull request #3885 from ReinUsesLisp/viewport-swizzles	bunnei
	video_core: Implement viewport swizzles with NV_viewport_swizzle
2020-05-05	Merge pull request #3815 from FernandoS27/command-list-2	bunnei
	GPU: More optimizations to GPU Command List Processing and DMA Copy Optimizations
2020-05-04	vk_graphics_pipeline: Implement viewport swizzles with NV_viewport_swizzle	ReinUsesLisp