artemist/yuzu - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
Fernando Sahmkow	0f89828073	MacroHLE: Implement DrawIndexedIndirect & DrawArraysIndirect.	2023-01-01 16:43:57 -05:00
Fernando Sahmkow	a5a94f52ff	MacroHLE: Add MultidrawIndirect HLE Macro.	2023-01-01 16:43:57 -05:00
ameerj	c6590ad07b	scratch_buffer: Explicitly defing resize and resize_destructive functions resize keeps previous data intact when the buffer grows resize_destructive destroys the previous data when the buffer grows	2022-12-19 22:40:50 -05:00
ameerj	61e4f2d931	dma_pusher: Rework command_headers usage Uses ScratchBuffer and avoids overwriting the command_headers buffer with the prefetch_command_list	2022-12-19 18:08:04 -05:00
Fernando Sahmkow	bc8b3d225e	VideoCore: Refactor fencing system.	2022-10-06 21:00:52 +02:00
Fernando Sahmkow	139ea93512	VideoCore: implement channels on gpu caches.	2022-10-06 21:00:51 +02:00
Morph	99ceb03a1c	general: Convert source file copyright comments over to SPDX This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later.	2022-04-23 05:55:32 -04:00
ameerj	1bc7d61b57	video_core: Reduce unused includes	2022-03-19 15:01:31 -04:00
Fernando Sahmkow	2c8f4ed27f	BufferCache: Additional download fixes.	2021-07-09 22:20:36 +02:00
ReinUsesLisp	82c2601555	video_core: Reimplement the buffer cache Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.	2021-02-13 02:17:22 -03:00
Lioncash	677a8b208d	video_core: Resolve more variable shadowing scenarios Resolves variable shadowing scenarios up to the end of the OpenGL code to make it nicer to review. The rest will be resolved in a following commit.	2020-12-04 16:19:09 -05:00
bunnei	dc5396a466	video_core: dma_pusher: Remove integrity check on command lists. - This seems to cause softlocks in Breath of the Wild.	2020-11-07 00:08:19 -08:00
bunnei	c6e1c46ac7	video_core: dma_pusher: Add support for integrity checks. - Log corrupted command lists, rather than crash.	2020-11-01 01:52:38 -07:00
bunnei	c64545d07a	video_core: dma_pusher: Add support for prefetched command lists.	2020-11-01 01:52:38 -07:00
David Marcec	4b9504028d	DmaPusher: Remove dead code in step	2020-05-16 12:42:27 +10:00
Fernando Sahmkow	b87422a86f	VideoCore/GPU: Delegate subchannel engines to the dma pusher.	2020-04-27 22:07:21 -04:00
Fernando Sahmkow	3fedcc2f6e	DMAPusher: Propagate multimethod writes into the engines.	2020-04-23 08:52:55 -04:00
Fernando Sahmkow	339d0d9d6c	GPU: Delay Fences.	2020-04-22 11:36:08 -04:00
Fernando Sahmkow	8b1eb44b3e	BufferCache: Implement OnCPUWrite and SyncGuestHost	2020-04-22 11:36:07 -04:00
Fernando Sahmkow	da8f17715d	GPU: Refactor synchronization on Async GPU	2020-04-22 11:36:06 -04:00
Lioncash	44e959157b	dma_pusher: Remove reliance on the global system instance With this, the video core is now has no calls to the global system instance at all.	2020-04-19 16:12:08 -04:00
ReinUsesLisp	eed789d0d1	video_core: Reintroduce dirty flags infrastructure	2020-02-28 17:56:41 -03:00
ReinUsesLisp	96ac3d518a	gl_rasterizer: Remove dirty flags	2020-02-28 16:39:27 -03:00
Fernando Sahmkow	e52c895559	GPU: Flush commands on every dma pusher step. This commit ensures that the host gpu is constantly fed with commands to work with, while the guest gpu keeps producing the rest of the commands. This reduces syncing time between host and guest gpu.	2019-07-26 16:54:22 -04:00
Fernando Sahmkow	f2e7b29c14	Maxwell3D: Rework the dirty system to be more consistant and scaleable	2019-07-17 17:29:49 -04:00
Fernando Sahmkow	9e98100c94	Dma_pusher: ASSERT on empty command_list This is a measure to avoid crashes on command list reading as an empty command_list is considered a NOP.	2019-05-19 10:48:31 -04:00
bunnei	c52233ec8b	Merge pull request #2322 from ReinUsesLisp/wswitch video_core: Silent -Wswitch warnings	2019-04-28 22:24:58 -04:00
ReinUsesLisp	fbe8d1ceaa	video_core: Silent -Wswitch warnings	2019-04-18 15:54:39 -03:00
Fernando Sahmkow	13d626fc21	Use ReadBlockUnsafe for fetyching DMA CommandLists	2019-04-16 11:22:34 -04:00
Lioncash	fbf452ab0e	video_core/texures/texture: Remove unnecessary includes Nothing in this header relies on common_funcs or the memory manager. This gets rid of reliance on indirect inclusions in the OpenGL caches.	2019-04-06 00:03:35 -04:00
bunnei	574e89d924	video_core: Refactor to use MemoryManager interface for all memory access. # Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp	2019-03-16 00:38:48 -04:00
ReinUsesLisp	e7ac5a6adf	dma_pusher: Store command_list_header by copy Instead of holding a reference that will get invalidated by dma_pushbuffer.pop(), hold it as a copy. This doesn't have any performance cost since CommandListHeader is 8 bytes long.	2019-03-08 04:06:54 -03:00
Markus Wick	6dd40976d0	video_core/dma_pusher: Simplyfy Step() logic. As fetching command list headers and and the list of command headers is a fixed 1:1 relation now, they can be implemented within a single call. This cleans up the Step() logic quite a bit.	2019-02-19 10:28:42 +01:00
Markus Wick	717394c980	video_core/dma_pusher: The full list of headers at once. Fetching every u32 from memory leads to a big overhead. So let's fetch all of them as a block if possible. This reduces the Memory::* calls by the dma_pusher by a factor of 10.	2019-02-19 09:58:38 +01:00
ReinUsesLisp	2bdbb90af7	video_core: Assert on invalid GPU to CPU address queries	2019-02-03 04:58:40 -03:00
bunnei	ac74b71d75	dma_pushbuffer: Optimize to avoid loop and copy on Push.	2018-11-27 19:17:33 -05:00
bunnei	c568f5cea7	gpu: Move command list profiling to DmaPusher::DispatchCalls.	2018-11-27 18:42:21 -05:00
bunnei	abea6fa90c	gpu: Rewrite GPU command list processing with DmaPusher class. - More accurate impl., fixes Undertale (among other games).	2018-11-26 23:14:01 -05:00

38 commits