artemist/yuzu - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
Lioncash	fb563e75e9	vk_graphics_pipeline: Resolve narrowing warnings For whatever reason, VK_TRUE and VK_FALSE aren't defined as having a VkBool32 type, so we need to cast to it explicitly.	2020-07-16 18:13:49 -04:00
Lioncash	5330ca396d	vk_compute_pipeline: Make use of designated initializers where applicable	2020-07-16 17:32:12 -04:00
Lioncash	757ddd8158	vk_compute_pass: Make use of designated initializers where applicable Note: Some barriers can't be converted over yet, as they ICE MSVC.	2020-07-16 17:23:56 -04:00
Lioncash	a66a0a6a53	vk_buffer_cache: Make use of designated initializers where applicable Note: An array within CopyFrom() cannot be converted over yet, as it ICEs MSVC when converted over.	2020-07-16 16:59:39 -04:00
Rodrigo Locatti	be68ee88c2	Merge pull request #4333 from lioncash/desig3 vk_graphics_pipeline: Make use of designated initializers where applicable	2020-07-16 17:41:45 -03:00
Rodrigo Locatti	b6d73ec9c2	Merge pull request #4332 from lioncash/vkdev vk_device: Make use of designated initializers where applicable	2020-07-16 17:41:20 -03:00
ReinUsesLisp	210cc0204d	decode/other: Implement S2R.LaneId This maps to host's thread id. - Fixes graphical issues on Paper Mario.	2020-07-16 16:09:39 -03:00
ReinUsesLisp	88e57b13e0	gl_arb_decompiler: Execute BAR even when inside control flow Unlike GLSL, GLASM allows us to call BAR inside control flow. - Fixes graphical artifacts in Paper Mario.	2020-07-16 16:05:52 -03:00
ReinUsesLisp	a5a72cbd20	renderer_{opengl,vulkan}: Clamp shared memory to host's limit This stops shaders from failing to build when the exceed host's shared memory size limit. An error is logged.	2020-07-16 16:02:46 -03:00
bunnei	98b36625fa	Merge pull request #4321 from lioncash/desig vk_blit_screen: Make use of designated initializers where applicable	2020-07-16 14:55:36 -04:00
Lioncash	969100d41a	shader_cache: Make use of std::erase_if Now that we use C++20, we can also make use of std::erase_if instead of needing to do the erase-remove idiom.	2020-07-14 15:49:15 -04:00
bunnei	666b37ad56	Merge pull request #4242 from ReinUsesLisp/maxwell-dma maxwell_dma: Match official doc and support pitch->voxel copies	2020-07-14 14:04:16 -04:00
Lioncash	0f8b977663	vk_device: Make use of designated initializers where applicable Avoids redundant repetitions of variable names, and allows assignment all in one statement.	2020-07-13 22:24:01 -04:00
Lioncash	0475a167f8	vk_graphics_pipeline: Make use of designated initializers where applicable Avoids redundant variable name repetitions.	2020-07-13 21:07:56 -04:00
ReinUsesLisp	fbc232426d	video_core: Rearrange pixel format names Normalizes pixel format names to match Vulkan names. Previous to this commit pixel formats had no convention, leading to confusion and potential bugs.	2020-07-13 01:44:23 -03:00
ReinUsesLisp	eda37ff26b	video_core: Fix DXT4 and RGB565	2020-07-13 01:01:09 -03:00
ReinUsesLisp	a8dab2ffb3	video_core/format_lookup_table: Add formats with existing PixelFormat	2020-07-13 01:01:09 -03:00
ReinUsesLisp	480850ffe7	video_core: Fix B5G6R5_UNORM render target format	2020-07-13 01:01:09 -03:00
ReinUsesLisp	990b14f181	video_core: Fix B5G6R5U	2020-07-13 01:01:09 -03:00
ReinUsesLisp	1d20aac795	video_core: Implement RGBA32_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	9338599d72	video_core: Implement RGBA32_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	95c0f5afe5	video_core: Implement RGBA16_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	977d6c46f3	video_core: Implement RGBA8_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	50c6030a8d	video_core: Implement RG32_SINT render target	2020-07-13 01:01:09 -03:00
ReinUsesLisp	e849d68048	video_core: Implement RG8_SINT render target and fix RG8_UINT	2020-07-13 01:01:09 -03:00
ReinUsesLisp	f29fede49c	video_core: Implement R8_SINT render target	2020-07-13 01:01:08 -03:00
ReinUsesLisp	fd33e996e0	video_core: Implement R8_SNORM render target	2020-07-13 01:01:08 -03:00
ReinUsesLisp	505c206eb8	video_core/surface: Remove explicit values on PixelFormat's definition	2020-07-13 01:01:08 -03:00
ReinUsesLisp	143662118c	video_core/surface: Reorder render target to pixel format switch	2020-07-13 01:01:08 -03:00
Lioncash	db6fbd5894	vk_blit_screen: Make use of designated initializers where applicable Now that we make use of C++20, we can use designated initializers to make things a little nicer to read.	2020-07-12 19:45:30 -04:00
ReinUsesLisp	0fe09df386	vk_state_tracker: Fix dirty flags for stencil_enable on VK_EXT_extended_dynamic_state Fixes a regression on any game using stencil on devices with VK_EXT_extended_dynamic_state.	2020-07-12 20:43:42 -03:00
ReinUsesLisp	fca26980a2	vk_rasterizer: Pass <pSizes> to CmdBindVertexBuffers2EXT This has been fixed in Nvidia's public beta driver 451.74. The previous beta driver will be broken, people using these will have to update.	2020-07-10 18:15:32 -03:00
ReinUsesLisp	c574ab5aa1	video_core/textures: Add and use SwizzleSliceToVoxel, and minor style changes Change GOB sizes from free-functions to constexpr constants. Add SwizzleSliceToVoxel, a function that swizzles a 2D array of pixels into a 3D texture and use it for 3D copies.	2020-07-10 04:09:32 -03:00
Rodrigo Locatti	e73c53fad1	Merge pull request #4283 from lat9nq/fix-linux-nvidia-vulkan vk_stream_buffer: Prevent Vulkan crash in Linux on recent NVIDIA driver	2020-07-10 00:18:44 -03:00
lat9nq	63d23835ef	configuration: implement per-game configurations (#4098 ) * Switch game settings to use a pointer In order to add full per-game settings, we need to be able to tell yuzu to switch to using either the global or game configuration. Using a pointer makes it easier to switch. * configuration: add new UI without changing existing funcitonality The new UI also adds General, System, Graphics, Advanced Graphics, and Audio tabs, but as yet they do nothing. This commit keeps yuzu to the same functionality as originally branched. * configuration: Rename files These weren't included in the last commit. Now they are. * configuration: setup global configuration checkbox Global config checkbox now enables/disables the appropriate tabs in the game properties dialog. The use global configuration setting is now saved to the config, defaulting to true. This also addresses some changes requested in the PR. * configuration: swap to per-game config memory for properties dialog Does not set memory going in-game. Swaps to game values when opening the properties dialog, then swaps back when closing it. Uses a `memcpy` to swap. Also implements saving config files, limited to certain groups of configurations so as to not risk setting unsafe configurations. * configuration: change config interfaces to use config-specific pointers When a game is booted, we need to be able to open the configuration dialogs without changing the settings pointer in the game's emualtion. A new pointer specific to just the configuration dialogs can be used to separate changes to just those config dialogs without affecting the emulation. * configuration: boot a game using per-game settings Swaps values where needed to boot a game. * configuration: user correct config during emulation Creates a new pointer specifically for modifying the configuration while emulation is in progress. Both the regular configuration dialog and the game properties dialog now use the pointer Settings::config_values to focus edits to the correct struct. * settings: split Settings::values into two different structs By splitting the settings into two mutually exclusive structs, it becomes easier, as a developer, to determine how to use the Settings structs after per-game configurations is merged. Other benefits include only duplicating the required settings in memory. * settings: move use_docked_mode to Controls group `use_docked_mode` is set in the input settings and cannot be accessed from the system settings. Grouping it with system settings causes it to be saved with per-game settings, which may make transferring configs more difficult later on, especially since docked mode cannot be set from within the game properties dialog. * configuration: Fix the other yuzu executables and a regression In main.cpp, we have to get the title ID before the ROM is loaded, else the renderer will reflect only the global settings and now the user's game specific settings. * settings: use a template to duplicate memory for each setting Replaces the type of each variable in the Settings::Values struct with a new class that allows basic data reading and writing. The new struct Settings::Setting duplicates the data in memory and can manage global overrides per each setting. * configuration: correct add-ons config and swap settings when apropriate Any add-ons interaction happens directly through the global values struct. Swapping bewteen structs now also includes copying the necessary global configs that cannot be changed nor saved in per-game settings. General and System config menus now update based on whether it is viewing the global or per-game settings. * settings: restore old values struct No longer needed with the Settings::Setting class template. * configuration: implement hierarchical game properties dialog This sets the apropriate global or local data in each setting. * clang format * clang format take 2 can the docker container save this? * address comments and style issues * config: read and write settings with global awareness Adds new functions to read and write settings while keeping the global state in focus. Files now generated per-game are much smaller since often they only need address the global state. * settings: restore global state when necessary Upon closing a game or the game properties dialog, we need to restore all global settings to the original global state so that we can properly open the configuration dialog or boot a different game. * configuration: guard setting values incorrectly This disables setting values while a game is running if the setting is overwritten by a per game setting. * config: don't write local settings in the global config Simple guards to prevent writing the wrong settings in the wrong files. * configuration: add comments, assume less, and clang format No longer assumes that a disabled UI element means the global state is turned off, instead opting to directly answer that question. Still however assumes a game is running if it is in that state. * configuration: fix a logic error Should not be negated * restore settings' global state regardless of accept/cancel Fixes loading a properties dialog and causing the global config dialog to show local settings. * fix more logic errors Fixed the frame limit would set the global setting from the game properties dialog. Also strengthened the Settings::Setting member variables and simplified the logic in config reading (ReadSettingGlobal). * fix another logic error In my efforts to guard RestoreGlobalState, I accidentally negated the IsPowered condition. * configure_audio: set toggle_stretched_audio to tristate * fixed custom rtc and rng seed overwriting the global value * clang format * rebased * clang format take 4 * address my own review Basically revert unintended changes * settings: literal instead of casting "No need to cast, use 1U instead" Thanks, Morph! Co-authored-by: Morph <39850852+Morph1984@users.noreply.github.com> * Revert "settings: literal instead of casting " This reverts commit 95e992a87c898f3e882ffdb415bb0ef9f80f613f. * main: fix status buttons reporting wrong settings after stop emulation * settings: Log UseDockedMode in the Controls group This should have happened when use_docked_mode was moved over to the controls group internally. This just reflects this in the log. * main: load settings if the file has a title id In other words, don't exit if the loader has trouble getting a title id. * use a zero * settings: initalize resolution factor with constructor instead of casting * Revert "settings: initalize resolution factor with constructor instead of casting" This reverts commit 54c35ecb46a29953842614620f9b7de1aa9d5dc8. * configure_graphics: guard device selector when Vulkan is global Prevents the user from editing the device selector if Vulkan is the global renderer backend. Also resets the vulkan_device variable when the users switches back-and-forth between global and Vulkan. * address reviewer concerns Changes function variables to const wherever they don't need to be changed. Sets Settings::Setting to final as it should not be inherited from. Sets ConfigurationShared::use_global_text to static. Co-Authored-By: VolcaEM <volcaem@users.noreply.github.com> * main: load per-game settings after LoadROM This prevents `Restart Emulation` from restoring the global settings after the per-game settings were applied. Thanks to BSoDGamingYT for finding this bug. * Revert "main: load per-game settings after LoadROM" This reverts commit 9d0d48c52d2dcf3bfb1806cc8fa7d5a271a8a804. * main: only restore global settings when necessary Loading the per-game settings cannot happen after the ROM is loaded, so we have to specify when to restore the global state. Again thanks to BSoD for finding the bug. * configuration_shared: address reviewer concerns except operator overrides Dropping operator override usage in next commit. Co-Authored-By: LC <lioncash@users.noreply.github.com> * settings: Drop operator overrides from Setting template Requires using GetValue and SetValue explicitly. Also reverts a change that broke title ID formatting in the game properties dialog. * complete rebase * configuration_shared: translate "Use global configuration" Uses ConfigurePerGame to do so, since its usage, at least as of now, corresponds with ConfigurationShared. * configure_per_game: address reviewer concern As far as I understand, it prevents the program from unnecessarily copying strings. Co-Authored-By: LC <lioncash@users.noreply.github.com> Co-authored-by: Morph <39850852+Morph1984@users.noreply.github.com> Co-authored-by: VolcaEM <volcaem@users.noreply.github.com> Co-authored-by: LC <lioncash@users.noreply.github.com>	2020-07-09 22:42:09 -04:00
lat9nq	1c7d106aac	vk_stream_buffer: set allocable_size to 9 MiB This solves the crash on Linux systems running the current Linux Long Lived branch nVidia driver.	2020-07-09 21:28:32 -04:00
ReinUsesLisp	2a9d17b7e7	maxwell_dma: Rename registers to match official docs and reorder Rename registers in the MaxwellDMA class to match Nvidia's official documentation. This one can be found here: https://github.com/NVIDIA/open-gpu-doc/blob/master/classes/dma-copy/clb0b5.h While we are at it, reorganize the code in MaxwellDMA to be separated in different functions.	2020-07-07 19:19:33 -03:00
bunnei	35f7740b6c	Merge pull request #4150 from ReinUsesLisp/dynamic-state-impl vulkan: Use VK_EXT_extended_dynamic_state when available	2020-07-07 10:58:09 -04:00
Fernando Sahmkow	52882a93a5	Merge pull request #4194 from ReinUsesLisp/fix-shader-cache shader_cache: Fix use-after-free and orphan invalidation cache entries	2020-07-04 20:49:00 -04:00
bunnei	41a333321a	Merge pull request #4175 from ReinUsesLisp/read-buffer gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading	2020-07-02 23:30:08 -04:00
Rodrigo Locatti	c58e21cd76	Merge pull request #4082 from Morph1984/mirror-once-clamp maxwell_to_gl: Implement MirrorOnceClampOGL wrap mode using GL_MIRROR_CLAMP_EXT	2020-07-02 04:57:40 -03:00
ReinUsesLisp	f6cb128eac	shader_cache: Fix use-after-free and orphan invalidation cache entries This fixes some cases where entries could have been removed multiple times reading freed memory. To address this issue this commit removes duplicates from entries marked for removal and sorts out the removal process to fix another use-after-free situation. Another issue fixed in this commit is orphan invalidation cache entries. Previously only the entries that were invalidated in the current operations had its entries removed. This led to more use-after-free situations when these entries were actually invalidated but referenced an object that didn't exist.	2020-07-01 18:16:53 -03:00
Fernando Sahmkow	a4f48efea4	Merge pull request #4176 from ReinUsesLisp/compatible-formats texture_cache: Check format compatibility before copying	2020-06-30 15:36:13 -04:00
Fernando Sahmkow	977a3ab352	Merge pull request #4157 from ReinUsesLisp/unified-turing gl_device: Enable NV_vertex_buffer_unified_memory on Turing devices	2020-06-30 14:36:51 -04:00
Morph	1b31755ba6	maxwell_to_gl: Implement MirrorOnceClampOGL using GL_MIRROR_CLAMP_EXT Like MirrorOnceBorder, this requires the GL_EXT_texture_mirror_clamp extension. This extension is unfortunately not available on Intel's drivers (both Windows proprietary and Linux Mesa). Use GL_MIRROR_CLAMP_TO_EDGE as a fallback if the extension is unavailable.	2020-06-30 02:40:14 -04:00
Rodrigo Locatti	d217017c9e	Merge pull request #4191 from Morph1984/vertex-formats maxwell_to_gl/vk: Reorder vertex formats	2020-06-30 03:30:00 -03:00
David	7c970132b5	macro: Add support for "middle methods" on the code cache (#4112 ) Macro code is just uploaded sequentially from a starting address, however that does not mean the entry point for the macro is at that address. This PR adds preliminary support for executing macros in the middle of our cached code.	2020-06-30 02:32:24 -03:00
Morph	10eca7f651	maxwell_to_gl: Rename VertexType() to VertexFormat()	2020-06-29 11:48:38 -04:00
Rodrigo Locatti	f84cbf6429	Merge pull request #4140 from ReinUsesLisp/validation-layers renderer_vulkan: Update validation layer name and test before enabling	2020-06-29 02:12:38 -03:00
Morph	4a35df337b	maxwell_to_vk: Reorder vertex formats and add A2B10G10R10 for all types except float	2020-06-28 02:57:10 -04:00
Morph	78d80d99a0	maxwell_to_gl: Add 32 bit component sizes to (un)signed scaled formats Add 32 bit component sizes to (un)signed scaled formats and group (un)signed normalized, scaled, and integer formats together.	2020-06-28 02:51:13 -04:00
Fernando Sahmkow	528b19a842	General: Tune the priority of main emulation threads so they have higher priority than less important helper threads.	2020-06-27 11:36:09 -04:00
Fernando Sahmkow	ad92865497	General: Correct rebase, sync gpu and context management.	2020-06-27 11:36:08 -04:00
Fernando Sahmkow	dc58058203	General: Setup yuzu threads' microprofile, naming and registry.	2020-06-27 11:35:09 -04:00
Fernando Sahmkow	e31425df38	General: Recover Prometheus project from harddrive failure This commit: Implements CPU Interrupts, Replaces Cycle Timing for Host Timing, Reworks the Kernel's Scheduler, Introduce Idle State and Suspended State, Recreates the bootmanager, Initializes Multicore system.	2020-06-27 11:35:06 -04:00
bunnei	efef7b1517	Merge pull request #4147 from ReinUsesLisp/hset2-imm shader/half_set: Implement HSET2_IMM	2020-06-26 23:14:56 -04:00
ReinUsesLisp	9d55e5586f	vk_rasterizer: Use nullptr for <pSizes> in CmdBindVertexBuffers2EXT Disable this temporarily.	2020-06-26 20:57:22 -03:00
ReinUsesLisp	8584a77eb2	vk_pipeline_cache: Avoid hashing and comparing dynamic state when possible With extended dynamic states, some bytes don't have to be collected from the pipeline key, hence we can avoid hashing and comparing them on lookups.	2020-06-26 20:57:22 -03:00
ReinUsesLisp	1a84209418	vulkan/fixed_pipeline_state: Move state out of individual structures	2020-06-26 20:57:22 -03:00
ReinUsesLisp	c94b398f14	vk_rasterizer: Use VK_EXT_extended_dynamic_state	2020-06-26 20:57:22 -03:00
ReinUsesLisp	a6db8e5f4d	renderer_vulkan/wrapper: Add VK_EXT_extended_dynamic_state functions	2020-06-26 20:55:15 -03:00
ReinUsesLisp	c387a72c76	fixed_pipeline_state: Add requirements for VK_EXT_extended_dynamic_state This moves dynamic state present in VK_EXT_extended_dynamic_state to a separate structure in FixedPipelineState. This is structure is at the bottom allowing us to hash and memcmp only when the extension is not supported.	2020-06-26 20:55:15 -03:00
ReinUsesLisp	7527402a46	vk_device: Enable VK_EXT_extended_dynamic_state when available	2020-06-26 20:55:15 -03:00
ReinUsesLisp	bb2cbdf704	texture_cache: Test format compatibility before copying Avoid illegal copies. This intercepts the last step of a copy to avoid generating validation errors or corrupting the driver on some instances. We can create views and emit copies accordingly in future commits and remove this last-step validation.	2020-06-26 20:52:22 -03:00
bunnei	3579db425e	Merge pull request #4144 from FernandoS27/tt-fix TextureCache: Fix case where layer goes off bound.	2020-06-26 19:02:39 -04:00
bunnei	78d3b54ea7	Merge pull request #4111 from ReinUsesLisp/preserve-contents-vk vk_rasterizer: Don't preserve contents on full screen clears	2020-06-26 18:48:12 -04:00
ReinUsesLisp	1d6be9febf	video_core/compatible_formats: Table to test if two formats are legal to view or copy Add a flat table to test if it's legal to create a texture view between two formats or copy betweem them. This table is based on ARB_copy_image and ARB_texture_view. Copies are more permissive than views.	2020-06-26 19:28:11 -03:00
ReinUsesLisp	6481d91e4a	gl_buffer_cache: Copy to buffers created as STREAM_READ before downloading After marking buffers as resident, Nvidia's driver seems to take a slow path. To workaround this issue, copy to a STREAM_READ buffer and then call GetNamedBufferSubData on it. This is a temporary solution until we have asynchronous flushing.	2020-06-26 16:58:40 -03:00
Rodrigo Locatti	5872fc21fe	Merge pull request #4151 from ReinUsesLisp/gl-invalidations gl_shader_cache: Avoid use after move for program size	2020-06-25 21:05:27 -03:00
David Marcec	a927d8be52	gl_device: Fix IsASTCSupported Other targets were never actually checked	2020-06-25 19:12:56 +10:00
ReinUsesLisp	bc8d3b8f82	gl_device: Enable NV_vertex_buffer_unified_memory on Turing devices Once we make sure not to corrupt Nvidia's driver, we can safely use resident buffers on Turing devices. See GitHub pull request #4156	2020-06-25 01:28:47 -03:00
bunnei	0e1268e507	Merge pull request #4105 from ReinUsesLisp/resident-buffers gl_rasterizer: Use NV_vertex_buffer_unified_memory for vertex buffer robustness	2020-06-24 11:40:30 -04:00
bunnei	2f2df9a4a7	Merge pull request #4083 from Morph1984/B10G11R11F decode/image: Implement B10G11R11F	2020-06-24 11:02:38 -04:00
Fernando Sahmkow	32343d820d	Merge pull request #4046 from ogniK5377/macro-hle-prod Add support for HLEing Macros	2020-06-24 09:01:00 -04:00
ReinUsesLisp	32a2dcd415	buffer_cache: Use buffer methods instead of cache virtual methods	2020-06-24 02:36:14 -03:00
ReinUsesLisp	39c97f1b65	gl_stream_buffer: Use InvalidateBufferData instead unmap and map Making the stream buffer resident increases GPU usage significantly on some games. This seems to be addressed invalidating the stream buffer with InvalidateBufferData instead of using a Unmap + Map (with invalidation flags).	2020-06-24 02:36:14 -03:00
ReinUsesLisp	41a4090320	gl_rasterizer: Use NV_vertex_buffer_unified_memory for vertex buffer robustness Switch games are allowed to bind less data than what they use in a vertex buffer, the expected behavior here is that these values are read as zero. At the moment of writing this only D3D12, OpenGL and NVN through NV_vertex_buffer_unified_memory support vertex buffer with a size limit. In theory this could be emulated on Vulkan creating a new VkBuffer for each (handle, offset, length) tuple and binding the expected data to it. This is likely going to be slow and memory expensive when used on the vertex buffer and we have to do it on all draws because we can't know without analyzing indices when a game is going to read vertex data out of bounds. This is not a problem on OpenGL's BufferAddressRangeNV because it takes a length parameter, unlike Vulkan's CmdBindVertexBuffers that only takes buffers and offsets (the length is implicit in VkBuffer). It isn't a problem on D3D12 either, because D3D12_VERTEX_BUFFER_VIEW on IASetVertexBuffers takes SizeInBytes as a parameter (although I am not familiar with robustness on D3D12). Currently this only implements buffer ranges for vertex buffers, although indices can also be affected. A KHR_robustness profile is not created, but Nvidia's driver reads out of bound vertex data as zero anyway, this might have to be changed in the future. - Fixes SMO random triangles when capturing an enemy, getting hit, or looking at the environment on certain maps.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	32485917ba	gl_buffer_cache: Mark buffers as resident Make stream buffer and cached buffers as resident and query their address. This allows us to use GPU addresses for several proprietary Nvidia extensions.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	73fb3a304b	gl_device: Expose NV_vertex_buffer_unified_memory except on Turing Expose NV_vertex_buffer_unified_memory when the driver supports it. This commit adds a function the determine if a GL_RENDERER is a Turing GPU. This is required because on Turing GPUs Nvidia's driver crashes when the buffer is marked as resident or on DeleteBuffers. Without a synchronous debug output (single threaded driver), it's likely that the driver will crash in the first blocking call.	2020-06-24 02:36:14 -03:00
ReinUsesLisp	00c66a7289	gl_stream_buffer: Always use a non-coherent buffer	2020-06-24 02:35:33 -03:00
ReinUsesLisp	da79ec9565	gl_stream_buffer: Always use persistent memory maps yuzu no longer supports platforms without persistent maps.	2020-06-24 02:35:33 -03:00
Rodrigo Locatti	b66ccaa376	Merge pull request #4129 from Morph1984/texture-shadow-lod-workaround gl_shader_decompiler: Workaround textureLod when GL_EXT_texture_shadow_lod is not available	2020-06-24 01:51:15 -03:00
David Marcec	f5e2aec422	addressed issues	2020-06-24 12:18:33 +10:00
David Marcec	52340e94ac	clear mme draw mode We already draw, so we can clear it	2020-06-24 12:09:04 +10:00
David Marcec	fabdf5d385	Addressed issues	2020-06-24 12:09:03 +10:00
David Marcec	74b4334d51	Fix constbuffer for 0217920100488FF7	2020-06-24 12:09:02 +10:00
David Marcec	6ce5f3120b	Macro HLE support	2020-06-24 12:09:01 +10:00
ReinUsesLisp	9f54cd4dad	gl_shader_cache: Avoid use after move for program size All programs had a size of zero due to this bug, skipping invalidations. While we are at it, remove some unused forward declarations.	2020-06-23 22:54:42 -03:00
bunnei	15aeae3dd3	Merge pull request #4127 from lioncash/dst-typo texture_cache: Fix incorrect address used in a DeduceSurface() call	2020-06-23 15:59:37 -04:00
ReinUsesLisp	39ab33ee1c	shader/half_set: Implement HSET2_IMM Add HSET2_IMM. Due to the complexity of the encoding avoid using BitField unions and read the relevant bits from the code itself. This is less error prone.	2020-06-22 20:51:18 -03:00
Fernando Sahmkow	544b15e8e4	TextureCache: Fix case where layer goes off bound. The returned layer is expected to be between 0 and the depth of the surface, anything larger is off bounds.	2020-06-22 11:37:40 -04:00
Rodrigo Locatti	406d298457	Merge pull request #4110 from ReinUsesLisp/direct-upload-sets vk_update_descriptor: Upload descriptor sets data directly	2020-06-22 05:02:13 -03:00
ReinUsesLisp	2f09c7ddd3	renderer_vulkan: Update validation layer name and test before enabling Update validation layer string to VK_LAYER_KHRONOS_validation. While we are at it, properly check for available validation layers before enabling them.	2020-06-22 04:10:45 -03:00
bunnei	14a1181a97	Merge pull request #4122 from lioncash/hide video_core: Eliminate some variable shadowing	2020-06-21 22:38:04 -04:00
bunnei	c27c76ed43	Merge pull request #4126 from lioncash/noexcept vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR()	2020-06-21 22:36:14 -04:00
Morph	f77c897b8d	gl_shader_decompiler: Enable GL_EXT_texture_shadow_lod if available Enable GL_EXT_texture_shadow_lod if available. If this extension is not available, such as on Intel/AMD proprietary drivers, use textureGrad as a workaround.	2020-06-20 23:02:29 -04:00
Morph	1e65da971b	gl_device: Check for GL_EXT_texture_shadow_lod	2020-06-20 22:14:32 -04:00
bunnei	f98bf1025f	Merge pull request #4120 from lioncash/arb gl_arb_decompiler: Avoid several string copies	2020-06-20 22:11:49 -04:00
MerryMage	c12eb814b4	macro_jit_x64: Use ecx for shift register shl/shr only accept cl as their second argument	2020-06-20 22:24:05 +01:00
Lioncash	ef53b2fd08	texture_cache: Fix incorrect address used in a DeduceSurface() call Previously the source was being deduced twice in a row.	2020-06-20 14:11:28 -04:00
merry	928e9c09aa	Merge pull request #4125 from lioncash/macro-shift macro_jit_x64: Amend readability of Compile_ExtractShiftLeftRegister()	2020-06-20 16:08:23 +01:00
merry	2bd903e021	Merge pull request #4123 from lioncash/unused-var macro_jit_x64: Remove unused variable	2020-06-20 16:07:58 +01:00
Morph	480e1fa987	decode/image: Implement B10G11R11F - Used by Kirby Star Allies	2020-06-20 00:28:30 -04:00
bunnei	7d1dca4c98	Merge pull request #4099 from MerryMage/macOS-build Fix compilation on macOS	2020-06-19 23:31:04 -04:00
Lioncash	5865a10885	gl_arb_decompiler: Avoid several string copies Variables that are marked as const cannot have the move constructor invoked when returning from a function (the move constructor requires a non-const variable so it can "steal" the resources from it.	2020-06-19 23:09:16 -04:00
Lioncash	a6e5b84d1f	vulkan/wrapper: Remove noexcept from GetSurfaceCapabilitiesKHR() Check() can throw an exception if the Vulkan result isn't successful. We remove the check so that std::terminate isn't outright called and allows for better debugging (should it ever actually fail).	2020-06-19 23:01:59 -04:00
Lioncash	5a4e89b901	macro_jit_x64: Correct readability of Compile_ExtractShiftLeftImmediate() Previously dst wasn't being used.	2020-06-19 22:57:23 -04:00
Lioncash	140f953b6a	macro_jit_x64: Correct readability of Compile_ExtractShiftLeftRegister() Previously dst wasn't being used.	2020-06-19 22:56:55 -04:00
Lioncash	8ea749c1ca	macro_jit_x64: Remove unused variable Removes a completely unused label and marks another variable as unused, given it seems like it has potential uses in the future.	2020-06-19 22:10:45 -04:00
Lioncash	479605b3e5	memory_manager: Eliminate variable shadowing Renames some variables to prevent ones in inner scopes from shadowing outer-scoped variables. The Copy* functions have no shadowing, but we rename them anyways to remain consistent with the other functions.	2020-06-19 22:02:58 -04:00
Lioncash	811bff009e	macro_jit_x64: Eliminate variable shadowing in Compile_ProcessResult() We can reduce the capture scope so that it's not possible for both "reg" variables to clash with one another. While we're at it, we can prevent unnecessary copies while we're at it.	2020-06-19 21:57:44 -04:00
Lioncash	4514b80b3e	buffer_cache: Eliminate local variable shadowing We can just make use of the instance in the scope above this one.	2020-06-19 21:55:02 -04:00
bunnei	7daea551c0	Merge pull request #4087 from MerryMage/macrojit-inline-Read macro_jit_x64: Inline Engines::Maxwell3D::GetRegisterValue	2020-06-19 21:32:07 -04:00
MerryMage	977ceb4056	macro_jit_x64: Remove unused function Read	2020-06-19 11:39:41 +01:00
bunnei	5a092fb61e	Merge pull request #4090 from MerryMage/macrojit-bugs macro_jit_x64: Optimization correctness	2020-06-18 22:28:17 -04:00
ReinUsesLisp	cf137ea40b	vk_rasterizer: Don't preserve contents on full screen clears There's no need to load contents from the CPU when a clear resets all the contents of the underlying memory. This is already implemented on OpenGL and the texture cache.	2020-06-18 18:18:33 -03:00
ReinUsesLisp	7d763f060e	vk_update_descriptor: Upload descriptor sets data directly Instead of copying to a temporary payload before sending the update task to the worker thread, insert elements to the payload directly.	2020-06-18 17:47:19 -03:00
MerryMage	69f38355ed	vk_rasterizer: BindTransformFeedbackBuffersEXT accepts a size of type VkDeviceSize	2020-06-18 15:47:44 +01:00
MerryMage	b1eada6079	renderer_vulkan: Fix macOS GetBundleDirectory reference	2020-06-18 15:47:44 +01:00
MerryMage	442e48ef4c	memory_util: boost hashes are size_t * boost::hash_value returns a size_t * boost::hash_combine takes a size_t& argument	2020-06-18 15:47:43 +01:00
MerryMage	8ae7154541	Rename PAGE_SHIFT to PAGE_BITS macOS header files #define PAGE_SHIFT	2020-06-18 15:47:43 +01:00
Morph	2f420618ea	vk_sampler_cache: Emulate GL_LINEAR/NEAREST minification filters Emulate GL_LINEAR/NEAREST minification filters using minLod = 0 and maxLod = 0.25 during sampler creation	2020-06-18 04:56:31 -04:00
Morph	be660e7749	maxwell_to_vk: Reorder filter cases and correct mipmap_filter=None maxwell_to_vk: Reorder filtering modes to start with None, then Nearest, then Linear. maxwell_to_vk: Logs filter modes under UNREACHABLE_MSG instead of UNIMPLEMENTED_MSG, since any unknown filter modes are invalid and not unimplemented. maxwell_to_vk: Return VK_SAMPLER_MIPMAP_MODE_NEAREST instead of VK_SAMPLER_MIPMAP_MODE_LINEAR when mipmap_filter is None with the description from the VkSamplerCreateInfo(3) man page.	2020-06-18 04:56:31 -04:00
Morph	8868fb745f	maxwell_to_gl: Miscellaneous changes maxwell_to_gl: Log unimplemented features under UNIMPLEMENTED_MSG instead of LOG_ERROR to bring into parity with maxwell_to_vk maxwell_to_gl: Deduplicate logging in VertexType(), merging them into one. maxwell_to_gl: Return GL_NEAREST instead of GL_LINEAR if an unknown texture filter mode is encountered. maxwell_to_gl: Log the mipmap filter mode if an unknown value is passed in. maxwell_to_gl: Reorder filtering modes to start with None, then Nearest, then Linear.	2020-06-18 04:56:31 -04:00
Rodrigo Locatti	edb2114bac	Merge pull request #4092 from Morph1984/image-bindings gl_device: Reserve 4 image bindings for fragment stage	2020-06-18 04:59:48 -03:00
MerryMage	44f10d9b9f	macro_jit_x64: Inline Engines::Maxwell3D::GetRegisterValue	2020-06-17 17:17:08 +01:00
bunnei	a8ac99b619	Merge pull request #4086 from MerryMage/abi xbyak_abi: Cleanup	2020-06-17 11:20:52 -04:00
MerryMage	c409722435	macro_jit_x64: Optimization implicitly assumes same destination	2020-06-17 10:36:36 +01:00
MerryMage	a6ddd7c382	macro_jit_x64: Should not skip zero registers for certain ALU ops The code generated for these ALU ops assume src_a and src_b are always valid.	2020-06-17 10:36:34 +01:00
bunnei	b660ef6c8a	Merge pull request #4089 from MerryMage/macrojit-cleanup-1 macro_jit_x64: Cleanup	2020-06-16 23:44:48 -04:00
bunnei	798ec003ce	Merge pull request #4041 from ReinUsesLisp/arb-decomp gl_arb_decompiler: Implement an assembly shader decompiler	2020-06-16 14:56:23 -04:00
Morph	e2f5d16540	gl_device: Reserve at least 4 image bindings for fragment stage Due to the limitation of GL_MAX_IMAGE_UNITS being low (8) on Intel's and Nvidia's proprietary drivers, we have to reserve an appropriate amount of image bindings for each of the stages. So far games have been observed to use 4 image bindings on the fragment stage (Kirby Star Allies) and 1 on the vertex stage (TWD series). No games thus far in my limited testing used more than 4 images concurrently and across all currently active programs. This fixes shader compilation errors on Kirby Star Allies on OpenGL (GLSL/GLASM)	2020-06-16 03:03:07 -04:00
Rodrigo Locatti	0bd9bc7201	Merge pull request #4066 from ReinUsesLisp/shared-ptr-buf buffer_cache: Avoid passing references of shared pointers and misc style changes	2020-06-15 22:29:32 -03:00
MerryMage	cf0aad7d6a	macro_jit_x64: Remove NEXT_PARAMETER Not required, as PARAMETERS can just be incremented directly.	2020-06-15 21:19:38 +01:00
MerryMage	1799f4e774	macro_jit_x64: Remove unused function Compile_WriteCarry	2020-06-15 21:19:38 +01:00
MerryMage	c09a9e5cc7	macro_jit_x64: Select better registers All registers are now callee-save registers. RBX and RBP selected for STATE and RESULT because these are most commonly accessed; this is to avoid the REX prefix. RBP not used for STATE because there are some SIB restrictions, RBX emits smaller code.	2020-06-15 21:19:38 +01:00
MerryMage	79aa7b3ace	macro_jit_x64: Remove REGISTERS Unnecessary since this is just an offset from STATE.	2020-06-15 21:00:59 +01:00
MerryMage	35db6e1c68	macro_jit_x64: Remove JITState::parameters This can be passed in as an argument instead.	2020-06-15 20:55:02 +01:00
MerryMage	389549b80d	macro_jit_x64: Remove METHOD_ADDRESS_64 Unnecessary variable.	2020-06-15 20:51:33 +01:00
MerryMage	a6a43a5ae0	macro_jit_x64: Remove RESULT_64 This Reg64 codepath has the exact same behaviour as the Reg32 one.	2020-06-15 20:35:08 +01:00
MerryMage	d563017dfe	xbyak_abi: Remove *GPS variants of stack manipulation functions	2020-06-15 18:59:54 +01:00
ReinUsesLisp	6e5d8aac4d	video_core/macro_jit_x64: Remove initializer in member variable Fix build time issues on gcc. Confirmed through asan that avoiding this initialization is safe.	2020-06-15 05:17:55 -03:00
bunnei	92021a344c	Merge pull request #4064 from ReinUsesLisp/invalidate-buffers gl_rasterizer: Mark vertex buffers as dirty after buffer cache invalidation	2020-06-14 00:29:16 -04:00
bunnei	c2ea1e1bcb	Merge pull request #4049 from ReinUsesLisp/separate-samplers shader/texture: Join separate image and sampler pairs offline	2020-06-13 13:48:27 -04:00
bunnei	5633887569	Merge pull request #3986 from ReinUsesLisp/shader-cache shader_cache: Implement a generic runtime shader cache	2020-06-12 23:14:48 -04:00
ReinUsesLisp	87011a97f9	gl_arb_decompiler: Implement FSwizzleAdd	2020-06-11 22:12:07 -03:00
ReinUsesLisp	a63a0daa5e	gl_arb_decompiler: Implement an assembly shader decompiler Emit code compatible with NV_gpu_program5. This should emit code compatible with Fermi, but it wasn't tested on that architecture. Pascal has some issues not present on Turing GPUs.	2020-06-11 22:12:07 -03:00
bunnei	83e3b77ed7	Merge pull request #4027 from ReinUsesLisp/3d-slices texture_cache: Implement rendering to 3D textures	2020-06-09 21:52:15 -04:00
ReinUsesLisp	6508cdd003	buffer_cache: Avoid passing references of shared pointers and misc style changes Instead of using as template argument a shared pointer, use the underlying type and manage shared pointers explicitly. This can make removing shared pointers from the cache more easy. While we are at it, make some misc style changes and general improvements (like insert_or_assign instead of operator[] + operator=).	2020-06-09 18:30:49 -03:00
ReinUsesLisp	7646f2c21d	gl_rasterizer: Mark vertex buffers as dirty after buffer cache invalidation Vertex buffers bindings become invalid after the stream buffer is invalidated. We were originally doing this, but it got lost at some point. - Fixes Animal Crossing: New Horizons, but it affects everything.	2020-06-08 20:24:16 -03:00
ReinUsesLisp	6e122f0b2c	buffer_cache: Return stream buffer invalidation in Map instead of Unmap We have to invalidate whatever cache is being used before uploading the data, hence it makes more sense to return this on Map instead of Unmap.	2020-06-08 20:22:31 -03:00
bunnei	3626254f48	Merge pull request #4040 from ReinUsesLisp/nv-transform-feedback gl_rasterizer: Use NV_transform_feedback for XFB on assembly shaders	2020-06-08 16:18:33 -04:00
bunnei	98d2461529	Merge pull request #4052 from ReinUsesLisp/debug-output renderer_opengl: Only enable DEBUG_OUTPUT when graphics debugging is enabled	2020-06-08 10:16:41 -04:00
ReinUsesLisp	bd43c05470	texture_cache: Port original code management for 2D vs 3D textures Handle blits to images as 2D, even when they have block depth. - Fixes rendering issues on Luigi's Mansion 3	2020-06-08 05:02:22 -03:00
ReinUsesLisp	c99f5d405b	texture_cache: Simplify blit code	2020-06-08 05:01:44 -03:00
ReinUsesLisp	3c2ae53b4c	texture_cache: Handle 3D texture blits with one layer	2020-06-08 05:01:00 -03:00
ReinUsesLisp	c95c254f3e	texture_cache: Implement rendering to 3D textures This allows rendering to 3D textures with more than one slice. Applications are allowed to render to more than one slice of a texture using gl_Layer from a VTG shader. This also requires reworking how 3D texture collisions are handled, for now, this commit allows rendering to slices but not to miplevels. When a render target attempts to write to a mipmap, we fallback to the previous implementation (copying or flushing as needed). - Fixes color correction 3D textures on UE4 games (rainbow effects). - Allows Xenoblade games to render to 3D textures directly.	2020-06-08 05:01:00 -03:00
Rodrigo Locatti	2293e8a11a	Merge pull request #4034 from ReinUsesLisp/storage-texels vk_rasterizer: Implement storage texels and atomic image operations	2020-06-07 18:43:24 -03:00
ReinUsesLisp	abcea1bb18	rasterizer_cache: Remove files and includes The rasterizer cache is no longer used. Each cache has its own generic implementation optimized for the cached data.	2020-06-07 04:32:57 -03:00
ReinUsesLisp	678f95e4f8	vk_pipeline_cache: Use generic shader cache Trivial port the generic shader cache to Vulkan.	2020-06-07 04:32:57 -03:00
ReinUsesLisp	b96f65b62b	gl_shader_cache: Use generic shader cache Trivially port the generic shader cache to OpenGL.	2020-06-07 04:32:57 -03:00
ReinUsesLisp	dc27252352	shader_cache: Implement a generic shader cache Implement a generic shader cache for fast lookups and invalidations. Invalidations are cheap but expensive when a shader is invalidated. Use two mutexes instead of one to avoid locking invalidations for lookups and vice versa. When a shader has to be removed, lookups are locked as expected.	2020-06-07 04:32:32 -03:00
ReinUsesLisp	e78d681a6c	gl_device: Black list NVIDIA 443.24 for fast buffer uploads Skip fast buffer uploads on Nvidia 443.24 Vulkan beta driver on OpenGL. This driver throws the following error when calling BufferSubData or BufferData on buffers that are candidates for fast constant buffer uploads. This is the equivalens to push constants on Vulkan, except that they can access the full buffer. The error: Unknown internal debug message. The NVIDIA OpenGL driver has encountered an out of memory error. This application might behave inconsistently and fail. If this error persists on future drivers, we might have to look deeper into this issue. For now, we can black list it and log it as a temporary solution.	2020-06-06 02:56:42 -03:00
ReinUsesLisp	354fbe701e	renderer_opengl: Only enable DEBUG_OUTPUT when graphics debugging is enabled Avoids logging when it's not relevant. This can potentially reduce driver's internal thread overhead.	2020-06-05 21:21:12 -03:00
bunnei	98671b4cfe	Merge pull request #4013 from ReinUsesLisp/skip-no-xfb vk_rasterizer: Skip transform feedbacks when extension is unavailable	2020-06-05 11:14:36 -04:00
ReinUsesLisp	5b2b6d594c	shader/texture: Join separate image and sampler pairs offline Games using D3D idioms can join images and samplers when a shader executes, instead of baking them into a combined sampler image. This is also possible on Vulkan. One approach to this solution would be to use separate samplers on Vulkan and leave this unimplemented on OpenGL, but we can't do this because there's no consistent way of determining which constant buffer holds a sampler and which one an image. We could in theory find the first bit and if it's in the TIC area, it's an image; but this falls apart when an image or sampler handle use an index of zero. The used approach is to track for a LOP.OR operation (this is done at an IR level, not at an ISA level), track again the constant buffers used as source and store this pair. Then, outside of shader execution, join the sample and image pair with a bitwise or operation. This approach won't work on games that truly use separate samplers in a meaningful way. For example, pooling textures in a 2D array and determining at runtime what sampler to use. This invalidates OpenGL's disk shader cache :) - Used mostly by D3D ports to Switch	2020-06-05 00:24:51 -03:00
ReinUsesLisp	e1438f8e91	shader/track: Move bindless tracking to a separate function	2020-06-04 23:02:55 -03:00
bunnei	22369df357	Merge pull request #4031 from Morph1984/fix-gs-outputs gl_shader_decompiler: Fix geometry shader outputs on Intel drivers	2020-06-04 15:18:51 -04:00
bunnei	34d4abc4f9	Merge pull request #4009 from ogniK5377/macro-jit-prod video_core: Implement Macro JIT	2020-06-04 11:40:52 -04:00
David Marcec	eca3d16e54	Default init labels and use initializer list for macro engine	2020-06-04 22:23:07 +10:00
ReinUsesLisp	3d99b449d3	gl_rasterizer: Use NV_transform_feedback for XFB on assembly shaders NV_transform_feedback, NV_transform_feedback2 and ARB_transform_feedback3 with NV_transform_feedback interactions allows implementing transform feedbacks as dynamic state. Maxwell implements transform feedbacks as dynamic state, so using these extensions with TransformFeedbackStreamAttribsNV allows us to properly emulate transform feedbacks without having to recompile shaders when the state changes.	2020-06-03 20:22:12 -03:00
bunnei	c647999c61	Merge pull request #4012 from ReinUsesLisp/mipmap-overlaps texture_cache: Handle overlaps with multiple subresources	2020-06-03 12:17:25 -04:00
David Marcec	411f5527d4	Mark parameters as const	2020-06-03 16:33:38 +10:00
bunnei	623b93a2b3	Merge pull request #4014 from ReinUsesLisp/astc-nvidia gl_device: Avoid devices with CAVEAT_SUPPORT on ASTC	2020-06-02 17:43:33 -04:00
bunnei	597d8b4bd4	Merge pull request #4006 from ReinUsesLisp/squash-ubos glsl: Squash constant buffers into a single SSBO when we hit the limit	2020-06-02 14:58:50 -04:00
LC	9a0c1456e3	Merge pull request #4016 from ReinUsesLisp/invocation-info shader/other: Fix hardcoded value in S2R INVOCATION_INFO	2020-06-02 09:47:53 -04:00
LC	c5de3c1059	Merge pull request #4033 from ReinUsesLisp/vk-r16ui maxwell_to_vk: Add R16UI image format	2020-06-02 09:42:49 -04:00
David Marcec	3a20e74f40	Pass by reference instead of copying parameters	2020-06-02 16:37:06 +10:00
ReinUsesLisp	866c1165af	vk_shader_decompiler: Implement atomic image operations Implement atomic operations on images. On GLSL these are atomicImage* functions (e.g. atomicImageAdd).	2020-06-02 02:20:02 -03:00
ReinUsesLisp	4a6b9a1a71	vk_rasterizer: Implement storage texels This is the equivalent of an image buffer on OpenGL. - Used by Octopath Traveler	2020-06-02 02:16:33 -03:00
ReinUsesLisp	3a59e724c9	maxwell_to_vk: Add R16UI image format - Used by Octopath Traveler	2020-06-02 02:15:20 -03:00
bunnei	4511502ca6	Merge pull request #4001 from ReinUsesLisp/avoid-copies buffer_cache: Avoid copying twice on certain cases	2020-06-01 16:59:17 -04:00
bunnei	bb6d93630f	Merge pull request #3998 from ReinUsesLisp/init-3d maxwell_3d: Initialize more registers to their expected value	2020-06-01 16:11:56 -04:00
Morph	74f2e5f1a4	gl_shader_decompiler: Declare gl_Layer and gl_ViewportIndex within gl_PerVertex for vertex and tessellation shaders	2020-06-01 15:35:44 -04:00
Morph	70188d69b0	gl_shader_decompiler: Fix geometry shader outputs for Intel drivers On Intel's proprietary drivers, gl_Layer and gl_ViewportIndex are not allowed members of gl_PerVertex block, causing the shader to fail to compile. Fix this by declaring these variables outside of gl_PerVertex.	2020-06-01 15:34:05 -04:00
Rodrigo Locatti	3a6714ab7f	Merge pull request #4005 from ReinUsesLisp/g24r8 format_lookup_table: Implement G24S8 format as S8Z24	2020-06-01 16:07:58 -03:00
bunnei	6c0b1a9ee2	Merge pull request #3996 from ReinUsesLisp/front-faces fixed_pipeline_state,gl_rasterizer: Swap negative viewport checks for front faces	2020-06-01 14:04:35 -04:00
ReinUsesLisp	0ee310ebdc	gl_device: Avoid devices with CAVEAT_SUPPORT on ASTC This avoids using Nvidia's ASTC decoder on OpenGL. The last time it was profiled, it was slower than yuzu's decoder. While we are at it, fix a bug in the texture cache when native ASTC is not supported.	2020-05-31 21:34:34 -03:00
ReinUsesLisp	ee21e4ecd3	glsl: Squash constant buffers into a single SSBO when we hit the limit Avoids compilation errors at the cost of shader build times and runtime performance when a game hits the limit of uniform buffers we can use.	2020-05-31 21:33:49 -03:00
bunnei	e68ee43a1a	Merge pull request #3930 from ReinUsesLisp/animal-borders vk_rasterizer: Implement constant attributes	2020-05-31 18:40:17 -04:00
bunnei	edbf3144d2	Merge pull request #3958 from FernandoS27/gl-debug OpenGL: Enable Debug Context and Synchronous debugging when graphics debugging is enabled	2020-05-31 17:04:27 -04:00
bunnei	f7debcaa04	Merge pull request #3999 from ReinUsesLisp/opt-tex-cache texture_cache: Optimize GetSurfacesInRegion	2020-05-31 17:02:29 -04:00
Morph	bb8ef38152	gl_device: Enable compute shaders for Intel proprietary drivers Previously we were disabling compute shaders on Intel's proprietary driver due to broken compute. This has been fixed in the latest Intel drivers. Re-enable compute for Intel proprietary drivers and remove the check for broken compute.	2020-05-31 03:21:07 -04:00
bunnei	058ec22787	Merge pull request #3982 from ReinUsesLisp/membar-cts shader/other: Implement MEMBAR.CTS	2020-05-30 11:51:42 -04:00
ReinUsesLisp	f2d1aa97ad	shader/other: Fix hardcoded value in S2R INVOCATION_INFO Geometry shaders built from Nvidia's compiler check for bits[16:23] to be less than or equal to 0 with VSETP to default to a "safe" value of 0x8000'0000 (safe from hardware's perspective). To avoid hitting this path in the shader, return 0x00ff'0000 from S2R INVOCATION_INFO. This seems to be the maximum number of vertices a geometry shader can emit in a primitive.	2020-05-30 01:49:14 -03:00
ReinUsesLisp	1ee1a5d3d6	texture_cache: More relaxed reconstruction Only reupload textures when they've not been modified from the GPU.	2020-05-29 23:56:52 -03:00
David Marcec	8118ea160b	Favor switch case over jump table Easier to read and will emit a jump table automatically.	2020-05-30 12:23:58 +10:00
David Marcec	b032ebdfee	Implement macro JIT	2020-05-30 11:40:04 +10:00
David Marcec	d0bdd26c26	Add xbyak external	2020-05-30 10:55:27 +10:00
ReinUsesLisp	e454f7e7a7	texture_cache: Only copy textures that were modified from host	2020-05-29 20:12:46 -03:00
ReinUsesLisp	dd70e097cc	texture_cache: Reload textures when number of resources mismatch	2020-05-29 20:10:58 -03:00
bunnei	87b272699f	Merge pull request #4007 from ReinUsesLisp/reduce-logs maxwell_3d: Reduce severity of logs that can be spammed	2020-05-29 17:29:17 -04:00
ReinUsesLisp	5616be12be	vk_rasterizer: Skip transform feedbacks when extension is unavailable Avoids calling transform feedback procedures when VK_EXT_transform_feedback is not available.	2020-05-29 03:05:29 -03:00
ReinUsesLisp	5b37cecd76	texture_cache: Handle overlaps with multiple subresources Implement more surface reconstruct cases. Allow overlaps with more than one layer and mipmap and copies all of them to the new texture. - Fixes textures moving around objects on Xenoblade games	2020-05-29 02:57:30 -03:00
bunnei	1bb3122c1f	Merge pull request #3991 from ReinUsesLisp/depth-sampling texture_cache: Implement depth stencil texture swizzles	2020-05-28 23:33:38 -04:00
ReinUsesLisp	9b06e823ee	maxwell_3d: Reduce severity of logs that can be spammed These logs were killing performance on some games when they were spammed. Reduce them to Debug severity.	2020-05-28 18:23:25 -03:00
ReinUsesLisp	fc153f6bcd	format_lookup_table: Implement G24S8 format as S8Z24	2020-05-28 17:16:07 -03:00
bunnei	099ac9c2a8	Merge pull request #3993 from ReinUsesLisp/fix-zla gl_shader_manager: Unbind GLSL program when binding a host pipeline	2020-05-28 12:15:22 -04:00
ReinUsesLisp	3b2dee88e6	buffer_cache: Avoid copying twice on certain cases Avoid copying to a staging buffer on non-granular memory addresses. Add a callable argument to StreamBufferUpload to be able to copy to the staging buffer directly from ReadBlockUnsafe.	2020-05-27 23:05:50 -03:00
ReinUsesLisp	b8b6f94ba9	texture_cache: Use unordered_map::find instead of operator[] on hot code	2020-05-27 17:59:04 -03:00
bunnei	630fc12d4e	Merge pull request #3961 from Morph1984/bgra8_srgb maxwell_to_vk: Add format B8G8R8A8_SRGB and add Attachable capability for B8G8R8A8_UNORM	2020-05-27 16:44:22 -04:00
ReinUsesLisp	d2b2557542	texture_cache: Use small vector for surface vectors This avoids most heap allocations when collecting surfaces into a vector.	2020-05-27 17:31:14 -03:00
ReinUsesLisp	f3f056c3b6	maxwell_3d: Initialize line widths Initialize line widths to avoid setting a line width of zero.	2020-05-27 16:53:43 -03:00
ReinUsesLisp	31eb658fea	maxwell_3d: Initialize polygon modes NVN expects this to be initialized as Fill, otherwise games that never bind a rasterizer state will log an invalid polygon mode.	2020-05-27 16:52:52 -03:00
ReinUsesLisp	32e6727dae	shader/other: Implement MEMBAR.CTS This silences an assertion we were hitting and uses workgroup memory barriers when the game requests it.	2020-05-27 00:19:45 -03:00
ReinUsesLisp	b2c4521a91	texture_cache: Fix layered null surfaces Null texture cubes were not considered arrays, causing issues on Vulkan and OpenGL when creating views.	2020-05-26 17:50:08 -03:00
ReinUsesLisp	b17fe82973	gl_texture_cache: Implement small texture view cache for swizzles This fixes cases where the texture swizzle was applied twice on the same draw to a texture bound to two different slots.	2020-05-26 17:50:08 -03:00
ReinUsesLisp	8bba84a401	texture_cache: Implement depth stencil texture swizzles Stop ignoring image swizzles on depth and stencil images. This doesn't fix a known issue on Xenoblade Chronicles 2 where an OpenGL texture changes swizzles twice before being used. A proper fix would be having a small texture view cache for this like we do on Vulkan.	2020-05-26 17:44:50 -03:00
ReinUsesLisp	606a62d4c7	gl_rasterizer: Port front face flip check from Vulkan While Vulkan was assuming we had no negative viewports, OpenGL code was assuming we had them. Port the old code from Vulkan to OpenGL, checking if the first viewport is negative before flipping faces. This is not a complete implementation since we only check for the first viewport to be negative. That said, unless a game is using Vulkan, OpenGL and NVN games should be fine here, and we can always compare with our Vulkan backend to see if there's a difference.	2020-05-26 16:33:50 -03:00
ReinUsesLisp	efe7b7483b	fixed_pipeline_state: Remove unnecessary check for front faces flip The check to flip faces when viewports are negative were a left over from the old OpenGL code. This is not required on Vulkan where we have negative viewports.	2020-05-26 16:32:27 -03:00
bunnei	508242c267	Merge pull request #3981 from ReinUsesLisp/bar shader/other: Implement BAR.SYNC 0x0	2020-05-26 14:40:13 -04:00
bunnei	623d9c47a2	Merge pull request #3980 from ReinUsesLisp/red-op shader/memory: Implement non-addition operations in RED	2020-05-26 12:50:41 -04:00
ReinUsesLisp	c13e2f1b75	gl_shader_manager: Unbind GLSL program when binding a host pipeline Fixes regression in Link's Awakening caused by `420cc13248`	2020-05-26 04:20:39 -03:00
bunnei	86345c126a	Merge pull request #3978 from ReinUsesLisp/write-rz shader_decompiler: Visit source nodes even when they assign to RZ	2020-05-25 21:31:33 -04:00
bunnei	1adabdac7f	Merge pull request #3905 from FernandoS27/vulkan-fix Correct a series of crashes and intructions on Async GPU and Vulkan Pipeline	2020-05-24 15:23:38 -04:00
bunnei	325e7eed3c	Merge pull request #3964 from ReinUsesLisp/arb-integration renderer_opengl: Add assembly program code paths	2020-05-24 00:34:12 -04:00
bunnei	487dd05170	Merge pull request #3979 from ReinUsesLisp/thread-group shader/other: Implement thread comparisons (NV_shader_thread_group)	2020-05-24 00:33:06 -04:00
ReinUsesLisp	5d0986a53b	shader/other: Implement BAR.SYNC 0x0 Trivially implement this particular case of BAR. Unless games use OpenCL or CUDA barriers, we shouldn't hit any other case here.	2020-05-21 23:20:43 -03:00
ReinUsesLisp	103809a0ca	shader/memory: Implement non-addition operations in RED Trivially implement these instructions. They are used in Astral Chain.	2020-05-21 23:19:46 -03:00
ReinUsesLisp	e2b67a868b	shader/other: Implement thread comparisons (NV_shader_thread_group) Hardware S2R special registers match gl_Thread*MaskNV. We can trivially implement these using Nvidia's extension on OpenGL or naively stubbing them with the ARB instructions to match. This might cause issues if the host device warp size doesn't match Nvidia's. That said, this is unlikely on proper shaders. Refer to the attached url for more documentation about these flags. https://www.khronos.org/registry/OpenGL/extensions/NV/NV_shader_thread_group.txt	2020-05-21 23:18:37 -03:00
ReinUsesLisp	ed4e324991	shader_decompiler: Visit source nodes even when they assign to RZ Some operations like atomicMin were ignored because they returned were being stored to RZ. This operations have a side effect and it was being ignored.	2020-05-21 23:16:03 -03:00
ReinUsesLisp	434856c636	vk_shader_decompiler: Don't assert for void returns Atomic instructions can be used without returning anything and this is valid code. Remove the assert.	2020-05-21 23:16:03 -03:00
ReinUsesLisp	ebaace294f	buffer_cache: Remove unused boost headers	2020-05-21 16:44:00 -03:00
ReinUsesLisp	a2dcc642c1	map_interval: Add interval allocator and drop hack Drop the std::list hack to allocate memory indefinitely. Instead use a custom allocator that keeps references valid until destruction. This allocates fixed chunks of memory and puts pointers in a free list. When an allocation is no longer used put it back to the free list, this doesn't heap allocate because std::vector doesn't change the capacity. If the free list is empty, allocate a new chunk.	2020-05-21 16:44:00 -03:00
ReinUsesLisp	19d4f28001	buffer_cache: Use boost::container::small_vector for maps in range Most overlaps in the buffer cache only contain one mapped address. We can avoid close to all heap allocations once the buffer cache is warmed up by using a small_vector with a stack size of one.	2020-05-21 16:44:00 -03:00
ReinUsesLisp	891236124c	buffer_cache: Use boost::intrusive::set for caching Instead of using boost::icl::interval_map for caching, use boost::intrusive::set. interval_map is intended as a container where the keys can overlap with one another; we don't need this for caching buffers and a std::set-like data structure that allows us to search with lower_bound is enough.	2020-05-21 16:44:00 -03:00
ReinUsesLisp	3b0baf746e	buffer_cache: Remove shared pointers Removing shared pointers is a first step to be able to use intrusive objects and keep allocations close to one another in memory.	2020-05-21 16:02:54 -03:00
ReinUsesLisp	599274e3f0	buffer_cache: Minor style changes Minor style changes. Mostly done so I avoid editing it while doing other changes.	2020-05-21 16:02:20 -03:00
ReinUsesLisp	420cc13248	renderer_opengl: Add assembly program code paths Add code required to use OpenGL assembly programs based on NV_gpu_program5. Decompilation for ARB programs is intended to be added in a follow up commit. This does not include ARB decompilation and it's not in an usable state. The intention behind assembly programs is to reduce shader stutter significantly on drivers supporting NV_gpu_program5 (and other required extensions). Currently only Nvidia's proprietary driver supports these extensions. Add a UI option hidden for now to avoid people enabling this option accidentally. This code path has some limitations that OpenGL compatibility doesn't have: - NV_shader_storage_buffer_object is limited to 16 entries for a single OpenGL context state (I don't know if this is an intended limitation, an specification issue or I am missing something). Currently causes issues on The Legend of Zelda: Link's Awakening. - NV_parameter_buffer_object can't bind buffers using an offset different to zero. The used workaround is to copy to a temporary buffer (this doesn't happen often so it's not an issue). On the other hand, it has the following advantages: - Shaders build a lot faster. - We have control over how floating point rounding is done over individual instructions (SPIR-V on Vulkan can't do this). - Operations on shared memory can be unsigned and signed. - Transform feedbacks are dynamic state (not yet implemented). - Parameter buffers (uniform buffers) are per stage, matching NVN and hardware's behavior. - The API to bind and create assembly programs makes sense, unlike ARB_separate_shader_objects.	2020-05-19 18:00:04 -03:00
Morph	d0fc12684a	maxwell_to_vk: Add format B8G8R8A8_SRGB Add format B8G8R8A8_SRGB and add Attachable capability for B8G8R8A8_UNORM Used by Bravely Default II	2020-05-18 13:02:09 -04:00
Fernando Sahmkow	4cff5dd194	OpenGL: Enable Debug Context and Synchronous debugging when graphics debugging is enabled. This commit aims to help easing debugging of driver crashes without having to modify existing code.	2020-05-17 21:45:09 -04:00
David Marcec	4b9504028d	DmaPusher: Remove dead code in step	2020-05-16 12:42:27 +10:00
ReinUsesLisp	7a27b7f3a3	vk_rasterizer: Match OpenGL's FlushAndInvalidate behavior Match OpenGL's behavior. This can fix or simplify bisecting issues on Vulkan.	2020-05-15 20:40:08 -03:00
bunnei	b1a1bd12ca	Merge pull request #3899 from ReinUsesLisp/float-comparisons shader_ir: Add separate instructions for ordered and unordered comparisons and fix NE on GLSL	2020-05-13 09:51:14 -04:00
ReinUsesLisp	91dddca26e	vk_rasterizer: Implement constant attributes Constant attributes (in OpenGL known disabled attributes) are not supported on Vulkan, even with extensions. To emulate this behavior we return zero on reads from disabled vertex attributes in shader code. This has no caching cost because attribute formats are not dynamic state on Vulkan and we have to store it in the pipeline cache anyway. - Fixes Animal Crossing: New Horizons terrain borders	2020-05-13 04:36:47 -03:00
ReinUsesLisp	cf6a40fc12	vk_rasterizer: Remove buffer check in attribute selection This was a left over from OpenGL when disabled buffers where not properly emulated. We no longer have to assert this as it is checked in vertex buffer initialization.	2020-05-13 04:36:47 -03:00
bunnei	1beaebe666	Merge pull request #3816 from ReinUsesLisp/vk-rasterizer-enable vk_graphics_pipeline: Implement rasterizer_enable on Vulkan	2020-05-11 18:22:51 -04:00
ReinUsesLisp	8b329ddcc9	gl_shader_decompiler: Properly emulate NaN behaviour on NE "Not equal" operators on GLSL seem to behave as unordered when we expect an ordered comparison. Manually emulate this checking for LGE values (numbers, not-NaNs).	2020-05-10 02:59:33 -03:00
Fernando Sahmkow	1887afaf9e	RasterizerCache: Correct documentation.	2020-05-09 21:03:39 -04:00
Fernando Sahmkow	8d15f8b28e	VkPipelineCache: Use a null shader on invalid address.	2020-05-09 20:51:34 -04:00
Fernando Sahmkow	0a4be73b9b	VideoCore: Use SyncGuestMemory mechanism for Shader/Pipeline Cache invalidation.	2020-05-09 19:25:29 -04:00
Rodrigo Locatti	7e376af8fc	Merge pull request #3839 from Morph1984/r8g8ui texture: Implement R8G8UI	2020-05-09 05:28:55 -03:00
ReinUsesLisp	4e57f9d5cf	shader_ir: Separate float-point comparisons in ordered and unordered This allows us to use native SPIR-V instructions without having to manually check for NAN.	2020-05-09 04:55:15 -03:00
bunnei	a9ee6e346b	Merge pull request #3842 from makigumo/maxwell_to_vk_vertexattribute_signed_int maxwell_to_vk: implement missing signed int formats	2020-05-09 00:36:09 -04:00
bunnei	50c27d5ae1	Merge pull request #3885 from ReinUsesLisp/viewport-swizzles video_core: Implement viewport swizzles with NV_viewport_swizzle	2020-05-08 15:16:53 -04:00
bunnei	028f6fdbf6	Merge pull request #3884 from ReinUsesLisp/border-colors vk_sampler_cache: Use VK_EXT_custom_border_color when available	2020-05-07 12:18:53 -04:00
bunnei	41682e0888	Merge pull request #3815 from FernandoS27/command-list-2 GPU: More optimizations to GPU Command List Processing and DMA Copy Optimizations	2020-05-05 17:12:42 -04:00
bunnei	eb2c50c5e6	Update src/video_core/gpu.cpp Co-authored-by: David <25727384+ogniK5377@users.noreply.github.com>	2020-05-05 15:39:44 -04:00
bunnei	ea09930196	Update src/video_core/gpu.cpp Co-authored-by: David <25727384+ogniK5377@users.noreply.github.com>	2020-05-05 15:39:37 -04:00
ReinUsesLisp	227278098a	vk_sampler_cache: Use VK_EXT_custom_border_color when available This should fix grass interactions on Breath of the Wild on Vulkan. It is currently untested against validation layers. Nvidia's Windows 443.09 beta driver or Linux 440.66.12 is required for now.	2020-05-04 20:49:23 -03:00
ReinUsesLisp	2dbf5290f2	vk_graphics_pipeline: Implement viewport swizzles with NV_viewport_swizzle	2020-05-04 18:31:17 -03:00
ReinUsesLisp	f813cd3ff7	gl_rasterizer: Implement viewport swizzles with NV_viewport_swizzle	2020-05-04 17:51:30 -03:00
ReinUsesLisp	9b8e962368	maxwell_3d: Add viewport swizzles	2020-05-04 17:50:59 -03:00
bunnei	2aff0b4733	Merge pull request #3808 from ReinUsesLisp/wait-for-idle {maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers	2020-05-03 02:43:18 -04:00
bunnei	f4ca8e0d3e	Merge pull request #3732 from lioncash/header vulkan: Remove unnecessary includes	2020-05-02 01:36:57 -04:00
bunnei	0128901102	Merge pull request #3809 from ReinUsesLisp/empty-index vk_rasterizer: Skip index buffer setup when vertices are zero	2020-05-02 01:21:57 -04:00
ReinUsesLisp	3b668e1210	vk_graphics_pipeline: Implement rasterizer_enable on Vulkan We can simply enable rasterizer discard matching the current pipeline key.	2020-05-02 01:47:25 -03:00
bunnei	e6b4311178	Merge pull request #3693 from ReinUsesLisp/clean-samplers shader/texture: Support multiple unknown sampler properties	2020-05-02 00:45:41 -04:00
Jan Beich	b4d0724a63	fixed_pipeline_state: explicitly use template keyword after `1f345ebe3a` In file included from src/video_core/renderer_opengl/renderer_opengl.cpp:25: In file included from src/./video_core/renderer_opengl/gl_rasterizer.h:26: In file included from src/./video_core/renderer_opengl/gl_fence_manager.h:11: src/./video_core/fence_manager.h:91:32: error: use 'template' keyword to treat 'Write' as a dependent template name memory_manager.Write<u32>(current_fence->GetAddress(), current_fence->GetPayload()); ^ template src/./video_core/fence_manager.h:137:32: error: use 'template' keyword to treat 'Write' as a dependent template name memory_manager.Write<u32>(current_fence->GetAddress(), current_fence->GetPayload()); ^ template	2020-05-01 23:38:23 +00:00
Dan	96ee1b42bc	maxwell_to_vk: implement missing signed int formats	2020-04-30 23:39:16 +02:00
Morph	7909860d16	texture: Implement R8G8UI - Used by The Walking Dead: The Final Season	2020-04-30 13:19:36 -04:00
bunnei	bf3f030a0d	Merge pull request #3807 from ReinUsesLisp/fix-depth-clamp maxwell_3d: Fix depth clamping register	2020-04-30 13:07:31 -04:00
bunnei	c7b5a87c90	Merge pull request #3799 from ReinUsesLisp/iadd-cc shader: Implement P2R CC, IADD Rd.CC and IADD.X	2020-04-30 12:56:36 -04:00
bunnei	da2b8295e1	Merge pull request #3805 from ReinUsesLisp/preserve-contents texture_cache: Reintroduce preserve_contents accurately	2020-04-30 12:56:19 -04:00
bunnei	6572660fde	Merge pull request #3788 from FernandoS27/revert Revert: shader_decode: Fix LD, LDG when track constant buffer.	2020-04-30 12:55:39 -04:00
Lioncash	6c53edd4d3	vulkan: Remove unnecessary includes Reduces some header churn and reduces rebuilds when some header internals change. While we're at it we can also resolve a missing include in buffer_cache.	2020-04-28 21:54:46 -04:00
ReinUsesLisp	871aadbe36	shader/arithmetic_integer: Fix tracking issue in temporary This temporary is not needed as we mark Rd.CC + IADD.X as unimplemented. It caused issues when tracking global buffers.	2020-04-28 17:14:53 -03:00
Fernando Sahmkow	9df67b2095	Clang Format and Documentation.	2020-04-28 14:02:51 -04:00
Fernando Sahmkow	37c690576f	MaxwellDMA: Optimize micro copies.	2020-04-28 13:44:14 -04:00
bunnei	72b73d22ab	Merge pull request #3784 from ReinUsesLisp/shader-memory-util shader/memory_util: Deduplicate code	2020-04-28 12:05:50 -04:00
ReinUsesLisp	d6a24b4a5b	vk_rasterizer: Skip index buffer setup when vertices are zero Xenoblade 2 invokes a draw call with zero vertices. This is likely due to indirect drawing (glDrawArraysIndirect). This causes a crash in the staging buffer pool when trying to create a buffer with a size of zero. To workaround this, skip index buffer setup entirely when the number of indices is zero.	2020-04-28 02:24:33 -03:00
ReinUsesLisp	fe931ac976	{maxwell_3d,buffer_cache}: Implement memory barriers using 3D registers Drop MemoryBarrier from the buffer cache and use Maxwell3D's register WaitForIdle. To implement this on OpenGL we just call glMemoryBarrier with the necessary bits. Vulkan lacks this synchronization primitive, so we set an event and immediately wait for it. This is not a pretty solution, but it's what Vulkan can do without submitting the current command buffer to the queue (which ends up being more expensive on the CPU).	2020-04-28 02:18:12 -03:00
Fernando Sahmkow	b87422a86f	VideoCore/GPU: Delegate subchannel engines to the dma pusher.	2020-04-27 22:07:21 -04:00
Fernando Sahmkow	90e5694230	VideoCore/Engines: Refactor Engines CallMethod.	2020-04-27 21:47:58 -04:00
ReinUsesLisp	bb1ed66d99	maxwell_3d: Fix depth clamping register Using deko3d as reference: `4e47ba0013/source/maxwell/gpu_3d_state.cpp (L42)` We were using bits 3 and 4 to determine depth clamping, but these are the same both enabled and disabled: state->depthClampEnable ? 0x101A : 0x181D The same happens on Nvidia's OpenGL driver, where they do something like this (default capabilities, GL 4.5 compatibility): (state & DEPTH_CLAMP) != 0 ? 0x201a : 0x281c There's always a difference between the first bits in this register, but bit 11 is consistently disabled on both deko3d/NVN and OpenGL. This commit changes yuzu's behaviour to use bit 11 to determine depth clamping. - Fixes depth issues on Super Mario Odyssey's intro.	2020-04-27 20:50:14 -03:00
Fernando Sahmkow	1517cba8ca	Merge pull request #3766 from ReinUsesLisp/renderpass-cache-key vk_renderpass_cache: Pack renderpass cache key and unify keys	2020-04-27 16:05:14 -04:00
Fernando Sahmkow	a65e9ad552	Merge pull request #3756 from ReinUsesLisp/integrated-devices vk_memory_manager: Remove unified memory model flag	2020-04-27 16:04:22 -04:00
bunnei	6c7d8073be	Merge pull request #3742 from FernandoS27/command-list Optimize GPU Command Lists and Introduce Fast GPU Time Option	2020-04-27 00:18:46 -04:00
ReinUsesLisp	8da16cf9fb	texture_cache: Reintroduce preserve_contents accurately This reverts commit `94b0e2e5da`. preserve_contents proved to be a meaningful optimization. This commit reintroduces it but properly implemented on OpenGL. We have to make sure the clear removes all the previous contents of the image. It's not currently implemented on Vulkan because we can do smart things there that's preferred to be introduced in a separate commit.	2020-04-26 19:53:02 -03:00
Rodrigo Locatti	7e38dd580f	Merge pull request #3753 from ReinUsesLisp/ac-vulkan {gl,vk}_rasterizer: Add lazy default buffer maker and use it for empty buffers	2020-04-26 01:55:43 -03:00
ReinUsesLisp	ddd82ef42b	shader/memory_util: Deduplicate code Deduplicate code shared between vk_pipeline_cache and gl_shader_cache as well as shader decoder code. While we are at it, fix a bug in gl_shader_cache where compute shaders had an start offset of a stage shader.	2020-04-26 01:38:51 -03:00
ReinUsesLisp	e895a4e2d7	shader/arithmetic_integer: Fix edge case and mark IADD.X Rd.CC as unimplemented IADD.X Rd.CC requires some extra logic that is not currently implemented. Abort when this is hit.	2020-04-25 22:58:33 -03:00
ReinUsesLisp	2a96bea6a7	shader/arithmetic_integer: Change IAdd to UAdd to avoid signed overflow Signed integer addition overflow might be undefined behavior. It's free to change operations to UAdd and use unsigned integers to avoid potential bugs.	2020-04-25 22:57:54 -03:00
ReinUsesLisp	c788f9c0bd	shader/arithmetic_integer: Implement IADD.X IADD.X takes the carry flag and adds it to the result. This is generally used to emulate 64-bit operations with 32-bit registers.	2020-04-25 22:56:11 -03:00
ReinUsesLisp	255197e643	shader/arithmetic_integer: Implement CC for IADD	2020-04-25 22:55:26 -03:00
ReinUsesLisp	ffc5ec6fa8	decode/register_set_predicate: Implement CC P2R CC takes the state of condition codes and puts them into a register. We already have this implemented for PR (predicates). This commit implements CC over that.	2020-04-25 22:54:42 -03:00
ReinUsesLisp	d523734266	decode/register_set_predicate: Use move for shared pointers Avoid atomic counters used by shared pointers.	2020-04-25 22:54:14 -03:00
bunnei	c5bf693882	Merge pull request #3721 from ReinUsesLisp/sort-devices vulkan/wrapper: Sort physical devices	2020-04-25 03:27:40 -04:00
bunnei	4e37825dab	Merge pull request #3734 from ReinUsesLisp/half-float-mods decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits	2020-04-25 00:41:43 -04:00
ReinUsesLisp	527a1574c3	vk_rasterizer: Pack texceptions and color formats on invalid formats Sometimes for unknown reasons NVN games can bind a render target format of 0. This may be a yuzu bug. With the commits before this the formats were specified without being "packed", assuming all formats and texceptions will be written like in the color_attachments vector. To address this issue, iterate all render targets and pack them as they are valid. This way they will match color_attachments. - Fixes validation errors and graphical issues on Breath of the Wild.	2020-04-24 22:21:29 -03:00
bunnei	7c8acb0025	Merge pull request #3749 from ReinUsesLisp/lea-imm shader/arithmetic_integer: Fix LEA_IMM encoding	2020-04-24 14:30:13 -04:00
Fernando Sahmkow	d8a961cd6c	Revert: shader_decode: Fix LD, LDG when track constant buffer.	2020-04-24 11:00:54 -04:00
Markus Wick	e717a1df20	Fix -Wdeprecated-copy warning.	2020-04-24 09:33:04 +02:00
Markus Wick	c499c22cf7	Fix -Werror=conversion error.	2020-04-24 09:33:04 +02:00
ReinUsesLisp	dbaebd8582	decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits The encoding for negation and absolute value was wrong. Extracting is now done manually. Similar instructions having different encodings is the rule, not the exception. To keep sanity and readability I preferred to extract the desired bit manually. This is implemented against nxas: `8dbc389957/table.h (L68)` That is itself tested against nvdisasm (Nvidia's official disassembler).	2020-04-23 18:29:38 -03:00
ReinUsesLisp	4fb921ff6b	shader/texture: Support multiple unknown sampler properties This allows deducing some properties from the texture instruction before asking the runtime. By doing this we can handle type mismatches in some instructions from the renderer instead of the shader decoder. Fixes texelFetch issues with games using 2D texture instructions on a 1D sampler.	2020-04-23 18:04:13 -03:00
ReinUsesLisp	72deb773fd	shader_ir: Turn classes into data structures	2020-04-23 18:00:06 -03:00
ReinUsesLisp	3e35101895	vk_rasterizer: Fix framebuffer creation validation errors Framebuffer creation was ignoring the number of color attachments.	2020-04-23 17:34:16 -03:00
ReinUsesLisp	8c37cd1af6	vk_pipeline_cache: Unify pipeline cache keys into a single operation This allows us to call Common::CityHash and std::memcmp only once for GraphicsPipelineCacheKey. While we are at it, do the same for compute.	2020-04-23 17:34:16 -03:00
ReinUsesLisp	f665c92114	vk_renderpass_cache: Pack renderpass cache key to 12 bytes	2020-04-23 17:34:16 -03:00
bunnei	ff0c49e1ce	kernel: memory: Improve implementation of device shared memory. (#3707 ) * kernel: memory: Improve implementation of device shared memory. * fixup! kernel: memory: Improve implementation of device shared memory. * fixup! kernel: memory: Improve implementation of device shared memory.	2020-04-23 11:37:12 -04:00
Fernando Sahmkow	5c9feaebb6	Clang Format.	2020-04-23 08:52:58 -04:00
Fernando Sahmkow	b8aef40c56	GPU: Add Fast GPU Time Option.	2020-04-23 08:52:57 -04:00
Fernando Sahmkow	18a88d19dc	Maxwell3D: Process Macros on MultiMethod.	2020-04-23 08:52:56 -04:00
Fernando Sahmkow	3fedcc2f6e	DMAPusher: Propagate multimethod writes into the engines.	2020-04-23 08:52:55 -04:00
bunnei	2409fedacf	Merge pull request #3697 from lioncash/declarations CMakeLists: Enable -Wmissing-declarations on Linux builds	2020-04-23 02:18:52 -04:00
bunnei	bf2ddb8fd5	Merge pull request #3677 from FernandoS27/better-sync Introduce Predictive Flushing and Improve ASYNC GPU	2020-04-22 22:09:38 -04:00
ReinUsesLisp	d9463f4562	vk_pipeline_cache: Fix unintentional memcpy into optional The intention behind this was to assign a float to from an uint32_t, but it was unintentionally being copied directly into the std::optional. Copy to a temporary and assign that temporary to std::optional. This can be replaced with std::bit_cast<float> once we are in C++20.	2020-04-22 21:36:05 -03:00
Fernando Sahmkow	c043ac4f13	GL_Fence_Manager: use GL_TIMEOUT_IGNORED instead of a loop,	2020-04-22 20:34:32 -04:00
Fernando Sahmkow	afae40a99e	Merge pull request #3653 from ReinUsesLisp/nsight-aftermath renderer_vulkan: Integrate Nvidia Nsight Aftermath on Windows	2020-04-22 11:39:01 -04:00
Fernando Sahmkow	4e37f1b113	Address Feedback.	2020-04-22 11:36:27 -04:00
Fernando Sahmkow	39e5b72948	Async GPU: Correct flushing behavior to be similar to old async GPU behavior.	2020-04-22 11:36:26 -04:00
Fernando Sahmkow	1b3be8a8f8	MaxwellDMA: Correct copying on accuracy level.	2020-04-22 11:36:25 -04:00
Fernando Sahmkow	644588fd88	ShaderCache/PipelineCache: Cache null shaders.	2020-04-22 11:36:25 -04:00
Fernando Sahmkow	f616dc0b59	Address Feedback.	2020-04-22 11:36:24 -04:00
Fernando Sahmkow	ec2f3e48e1	Fix GCC error.	2020-04-22 11:36:23 -04:00
Fernando Sahmkow	b3e5f177ba	QueryCache: Only do async flushes on async gpu.	2020-04-22 11:36:21 -04:00
Fernando Sahmkow	f4ab223ef0	Async GPU: Only do reactive flushing on Extreme Level.	2020-04-22 11:36:20 -04:00
ReinUsesLisp	b752faf2d3	vk_fence_manager: Initial implementation	2020-04-22 11:36:19 -04:00
Fernando Sahmkow	0649f05900	QueryCache: Implement Async Flushes.	2020-04-22 11:36:18 -04:00
Fernando Sahmkow	131b342130	OpenGL: Guarantee writes to Buffers.	2020-04-22 11:36:18 -04:00
Fernando Sahmkow	1fb516cd97	GPU: Implement Flush Requests for Async mode.	2020-04-22 11:36:17 -04:00
Fernando Sahmkow	b7bc3c2549	FenceManager: Manage syncpoints and rename fences to semaphores.	2020-04-22 11:36:16 -04:00
Fernando Sahmkow	96bb961a64	BufferCache: Refactor async managing.	2020-04-22 11:36:15 -04:00
Fernando Sahmkow	b10db7e4a5	FenceManager: Implement async buffer cache flushes on High settings	2020-04-22 11:36:15 -04:00
Fernando Sahmkow	4adfc9bb08	Rasterizer: Document SignalFence & ReleaseFences and setup skeletons on Vulkan.	2020-04-22 11:36:14 -04:00
Fernando Sahmkow	a081a7c855	GPU: Fix rebase errors.	2020-04-22 11:36:13 -04:00
Fernando Sahmkow	e84eb64e51	Rasterizer: Disable fence managing in synchronous gpu.	2020-04-22 11:36:12 -04:00
Fernando Sahmkow	165ae823f5	ThreadManager: Sync async reads on accurate gpu.	2020-04-22 11:36:12 -04:00
Fernando Sahmkow	57fdbd9b89	FenceManager: Implement should wait.	2020-04-22 11:36:11 -04:00
Fernando Sahmkow	1f345ebe3a	GPU: Implement a Fence Manager.	2020-04-22 11:36:10 -04:00
Fernando Sahmkow	487379c593	OpenGL: Implement Fencing backend.	2020-04-22 11:36:10 -04:00
Fernando Sahmkow	ed7e965712	TextureCache: Flush linear textures after finishing rendering.	2020-04-22 11:36:09 -04:00
Fernando Sahmkow	339d0d9d6c	GPU: Delay Fences.	2020-04-22 11:36:08 -04:00
Fernando Sahmkow	8b1eb44b3e	BufferCache: Implement OnCPUWrite and SyncGuestHost	2020-04-22 11:36:07 -04:00
Fernando Sahmkow	da8f17715d	GPU: Refactor synchronization on Async GPU	2020-04-22 11:36:06 -04:00
Fernando Sahmkow	a60a22d9c2	Texture Cache: Implement OnCPUWrite and SyncGuestHost	2020-04-22 11:36:05 -04:00
Fernando Sahmkow	084ceb925a	UI: Replasce accurate GPU option for GPU Accuracy Level	2020-04-22 11:36:04 -04:00
ReinUsesLisp	6f47bd9641	vk_memory_manager: Remove unified memory model flag All drivers (even Intel) seem to have a device local memory type that is not host visible. Remove this flag so all devices follow the same path. This fixes a crash when trying to map to host device local memory on integrated devices.	2020-04-21 22:06:38 -03:00
bunnei	d64290884a	Merge pull request #3714 from lioncash/copies gl_shader_decompiler: Avoid copies where applicable	2020-04-21 20:16:02 -04:00

... 5 6 7 8 9 ...

4889 commits