| Age | Commit message (Collapse) | Author | |
|---|---|---|---|
| 2019-08-27 | shader_ir/conversion: Implement F2I F16 Ra.H1 | ReinUsesLisp | |
| 2019-08-27 | float_set_predicate: Add missing negation bit for the second operand | ReinUsesLisp | |
| 2019-08-21 | shader_ir: Implement VOTE | ReinUsesLisp | |
| Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers. | |||
| 2019-08-21 | Buffer Cache: Adress Feedback. | Fernando Sahmkow | |
| 2019-08-21 | Buffer_Cache: Implement flushing. | Fernando Sahmkow | |
| 2019-08-21 | Buffer_Cache: Implement barriers. | Fernando Sahmkow | |
| 2019-08-21 | Buffer_Cache: Optimize and track written areas. | Fernando Sahmkow | |
| 2019-08-21 | BufferCache: Rework mapping caching. | Fernando Sahmkow | |
| 2019-08-21 | Buffer_Cache: Fixes and optimizations. | Fernando Sahmkow | |
| 2019-08-21 | Video_Core: Implement a new Buffer Cache | Fernando Sahmkow | |
| 2019-08-21 | Merge pull request #2769 from FernandoS27/commands-flush | bunnei | |
| GPU: Flush commands on every dma pusher step. | |||
| 2019-08-21 | Merge pull request #2777 from ReinUsesLisp/hsetp2-fe3h-fix | bunnei | |
| half_set_predicate: Fix HSETP2_C constant buffer offset | |||
| 2019-08-21 | Merge pull request #2753 from FernandoS27/float-convert | bunnei | |
| Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. | |||
| 2019-08-21 | renderer_opengl: Implement RGB565 framebuffer format | ReinUsesLisp | |
| 2019-08-21 | renderer_opengl: Use block linear swizzling for CPU framebuffers | ReinUsesLisp | |
| 2019-08-21 | renderer_opengl: Use VideoCore pixel format | ReinUsesLisp | |
| 2019-08-21 | gpu: Change optional<reference_wrapper<T>> to T* for FramebufferConfig | ReinUsesLisp | |
| 2019-08-18 | Merge pull request #2778 from ReinUsesLisp/nop | bunnei | |
| shader_ir: Implement NOP | |||
| 2019-08-18 | Merge pull request #2768 from ReinUsesLisp/hsetp2-fix | bunnei | |
| decode/half_set_predicate: Fix predicates | |||
| 2019-08-04 | shader_ir: Implement NOP | ReinUsesLisp | |
| 2019-08-04 | half_set_predicate: Fix HSETP2_C constant buffer offset | ReinUsesLisp | |
| 2019-07-26 | GPU: Flush commands on every dma pusher step. | Fernando Sahmkow | |
| This commit ensures that the host gpu is constantly fed with commands to work with, while the guest gpu keeps producing the rest of the commands. This reduces syncing time between host and guest gpu. | |||
| 2019-07-26 | Merge pull request #2592 from FernandoS27/sync1 | bunnei | |
| Implement GPU Synchronization Mechanisms & Correct NVFlinger | |||
| 2019-07-26 | decode/half_set_predicate: Fix predicates | ReinUsesLisp | |
| 2019-07-25 | MaxwellDMA: Fixes, corrections and relaxations. | Fernando Sahmkow | |
| This commit fixes offsets on Linear -> Tiled copies, corrects z pos fortiled->linear copies, corrects bytes_per_pixel calculation in tiled -> linear copies and relaxes some limitations set by latest dma fixes refactors. | |||
| 2019-07-25 | Merge pull request #2739 from lioncash/cflow | bunnei | |
| video_core/control_flow: Minor changes/warning cleanup | |||
| 2019-07-25 | Merge pull request #2737 from FernandoS27/track-fix | bunnei | |
| Shader_Ir: Correct tracking to track from right to left | |||
| 2019-07-25 | Merge pull request #2743 from FernandoS27/surpress-assert | bunnei | |
| Downgrade and suppress a series of GPU asserts and debug messages. | |||
| 2019-07-24 | Merge pull request #2704 from FernandoS27/conditional | bunnei | |
| maxwell3d: Implement Conditional Rendering | |||
| 2019-07-22 | shader/decode: Implement S2R Tic | ReinUsesLisp | |
| 2019-07-22 | Merge pull request #2734 from ReinUsesLisp/compute-shaders | bunnei | |
| gl_rasterizer: Implement compute shaders | |||
| 2019-07-21 | Merge pull request #2735 from FernandoS27/pipeline-rework | bunnei | |
| Rework Dirty Flags in GPU Pipeline, Optimize CBData and Redo Clearing mechanism | |||
| 2019-07-20 | Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. | Fernando Sahmkow | |
| This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done. | |||
| 2019-07-20 | Maxwell3D: Reorganize and address feedback | Fernando Sahmkow | |
| 2019-07-19 | Shader_Ir: Change Debug Asserts for Log Warnings | Fernando Sahmkow | |
| 2019-07-19 | shader/half_set_predicate: Fix HSETP2 implementation | ReinUsesLisp | |
| 2019-07-19 | shader/half_set_predicate: Implement missing HSETP2 variants | ReinUsesLisp | |
| 2019-07-18 | video_core/control_flow: Provide operator!= for types with operator== | Lioncash | |
| Provides operational symmetry for the respective structures. | |||
| 2019-07-18 | video_core/control_flow: Prevent sign conversion in TryGetBlock() | Lioncash | |
| The return value is a u32, not an s32, so this would result in an implicit signedness conversion. | |||
| 2019-07-18 | video_core/control_flow: Remove unnecessary BlockStack copy constructor | Lioncash | |
| This is the default behavior of the copy constructor, so it doesn't need to be specified. While we're at it we can make the other non-default constructor explicit. | |||
| 2019-07-18 | video_core/control_flow: Use std::move where applicable | Lioncash | |
| Results in less work being done where avoidable. | |||
| 2019-07-18 | video_core/control_flow: Use the prefix variant of operator++ for iterators | Lioncash | |
| Same thing, but potentially allows a standard library implementation to pick a more efficient codepath. | |||
| 2019-07-18 | video_core/control_flow: Use empty() member function for checking emptiness | Lioncash | |
| It's what it's there for. | |||
| 2019-07-18 | video_core: Resolve -Wreorder warnings | Lioncash | |
| Ensures that the constructor members are always initialized in the order that they're declared in. | |||
| 2019-07-18 | video_core/control_flow: Make program_size for ScanFlow() a std::size_t | Lioncash | |
| Prevents a truncation warning from occurring with MSVC. Also the internal data structures already treat it as a size_t, so this is just a discrepancy in the interface. | |||
| 2019-07-18 | video_core/control_flow: Place all internally linked types/functions within ↵ | Lioncash | |
| an anonymous namespace Previously, quite a few functions were being linked with external linkage. | |||
| 2019-07-18 | video_core/shader/decode: Prevent sign-conversion warnings | Lioncash | |
| Makes it explicit that the conversions here are intentional. | |||
| 2019-07-18 | Merge pull request #2738 from lioncash/shader-ir | bunnei | |
| shader-ir: Minor cleanup-related changes | |||
| 2019-07-18 | Shader_Ir: correct clang format | Fernando Sahmkow | |
| 2019-07-18 | GPU: Add missing puller methods. | Fernando Sahmkow | |
| This adds some missing puller methods. We don't assert them as these are nop operations for us. | |||
