aboutsummaryrefslogtreecommitdiff
path: root/src/video_core
AgeCommit message (Collapse)Author
2019-08-27shader_ir/conversion: Implement F2I F16 Ra.H1ReinUsesLisp
2019-08-27float_set_predicate: Add missing negation bit for the second operandReinUsesLisp
2019-08-21shader_ir: Implement VOTEReinUsesLisp
Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.
2019-08-21Buffer Cache: Adress Feedback.Fernando Sahmkow
2019-08-21Buffer_Cache: Implement flushing.Fernando Sahmkow
2019-08-21Buffer_Cache: Implement barriers.Fernando Sahmkow
2019-08-21Buffer_Cache: Optimize and track written areas.Fernando Sahmkow
2019-08-21BufferCache: Rework mapping caching.Fernando Sahmkow
2019-08-21Buffer_Cache: Fixes and optimizations.Fernando Sahmkow
2019-08-21Video_Core: Implement a new Buffer CacheFernando Sahmkow
2019-08-21Merge pull request #2769 from FernandoS27/commands-flushbunnei
GPU: Flush commands on every dma pusher step.
2019-08-21Merge pull request #2777 from ReinUsesLisp/hsetp2-fe3h-fixbunnei
half_set_predicate: Fix HSETP2_C constant buffer offset
2019-08-21Merge pull request #2753 from FernandoS27/float-convertbunnei
Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.
2019-08-21renderer_opengl: Implement RGB565 framebuffer formatReinUsesLisp
2019-08-21renderer_opengl: Use block linear swizzling for CPU framebuffersReinUsesLisp
2019-08-21renderer_opengl: Use VideoCore pixel formatReinUsesLisp
2019-08-21gpu: Change optional<reference_wrapper<T>> to T* for FramebufferConfigReinUsesLisp
2019-08-18Merge pull request #2778 from ReinUsesLisp/nopbunnei
shader_ir: Implement NOP
2019-08-18Merge pull request #2768 from ReinUsesLisp/hsetp2-fixbunnei
decode/half_set_predicate: Fix predicates
2019-08-04shader_ir: Implement NOPReinUsesLisp
2019-08-04half_set_predicate: Fix HSETP2_C constant buffer offsetReinUsesLisp
2019-07-26GPU: Flush commands on every dma pusher step.Fernando Sahmkow
This commit ensures that the host gpu is constantly fed with commands to work with, while the guest gpu keeps producing the rest of the commands. This reduces syncing time between host and guest gpu.
2019-07-26Merge pull request #2592 from FernandoS27/sync1bunnei
Implement GPU Synchronization Mechanisms & Correct NVFlinger
2019-07-26decode/half_set_predicate: Fix predicatesReinUsesLisp
2019-07-25MaxwellDMA: Fixes, corrections and relaxations.Fernando Sahmkow
This commit fixes offsets on Linear -> Tiled copies, corrects z pos fortiled->linear copies, corrects bytes_per_pixel calculation in tiled -> linear copies and relaxes some limitations set by latest dma fixes refactors.
2019-07-25Merge pull request #2739 from lioncash/cflowbunnei
video_core/control_flow: Minor changes/warning cleanup
2019-07-25Merge pull request #2737 from FernandoS27/track-fixbunnei
Shader_Ir: Correct tracking to track from right to left
2019-07-25Merge pull request #2743 from FernandoS27/surpress-assertbunnei
Downgrade and suppress a series of GPU asserts and debug messages.
2019-07-24Merge pull request #2704 from FernandoS27/conditionalbunnei
maxwell3d: Implement Conditional Rendering
2019-07-22shader/decode: Implement S2R TicReinUsesLisp
2019-07-22Merge pull request #2734 from ReinUsesLisp/compute-shadersbunnei
gl_rasterizer: Implement compute shaders
2019-07-21Merge pull request #2735 from FernandoS27/pipeline-reworkbunnei
Rework Dirty Flags in GPU Pipeline, Optimize CBData and Redo Clearing mechanism
2019-07-20Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.Fernando Sahmkow
This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done.
2019-07-20Maxwell3D: Reorganize and address feedbackFernando Sahmkow
2019-07-19Shader_Ir: Change Debug Asserts for Log WarningsFernando Sahmkow
2019-07-19shader/half_set_predicate: Fix HSETP2 implementationReinUsesLisp
2019-07-19shader/half_set_predicate: Implement missing HSETP2 variantsReinUsesLisp
2019-07-18video_core/control_flow: Provide operator!= for types with operator==Lioncash
Provides operational symmetry for the respective structures.
2019-07-18video_core/control_flow: Prevent sign conversion in TryGetBlock()Lioncash
The return value is a u32, not an s32, so this would result in an implicit signedness conversion.
2019-07-18video_core/control_flow: Remove unnecessary BlockStack copy constructorLioncash
This is the default behavior of the copy constructor, so it doesn't need to be specified. While we're at it we can make the other non-default constructor explicit.
2019-07-18video_core/control_flow: Use std::move where applicableLioncash
Results in less work being done where avoidable.
2019-07-18video_core/control_flow: Use the prefix variant of operator++ for iteratorsLioncash
Same thing, but potentially allows a standard library implementation to pick a more efficient codepath.
2019-07-18video_core/control_flow: Use empty() member function for checking emptinessLioncash
It's what it's there for.
2019-07-18video_core: Resolve -Wreorder warningsLioncash
Ensures that the constructor members are always initialized in the order that they're declared in.
2019-07-18video_core/control_flow: Make program_size for ScanFlow() a std::size_tLioncash
Prevents a truncation warning from occurring with MSVC. Also the internal data structures already treat it as a size_t, so this is just a discrepancy in the interface.
2019-07-18video_core/control_flow: Place all internally linked types/functions within ↵Lioncash
an anonymous namespace Previously, quite a few functions were being linked with external linkage.
2019-07-18video_core/shader/decode: Prevent sign-conversion warningsLioncash
Makes it explicit that the conversions here are intentional.
2019-07-18Merge pull request #2738 from lioncash/shader-irbunnei
shader-ir: Minor cleanup-related changes
2019-07-18Shader_Ir: correct clang formatFernando Sahmkow
2019-07-18GPU: Add missing puller methods.Fernando Sahmkow
This adds some missing puller methods. We don't assert them as these are nop operations for us.