artemist/yuzu - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
ReinUsesLisp	6a0220b2e1	texture_cache: Implement layered framebuffer attachments Layered framebuffer attachments is a feature that allows applications to write attach layered textures to a single attachment. What layer the fragments are written to is decided from the shader using gl_Layer.	2020-02-16 04:19:32 -03:00
ReinUsesLisp	1caf3f11c8	vk_shader_decompiler: Implement Layer output attribute SPIR-V's Layer is GLSL's gl_Layer. It lets the application choose from a shader stage (vertex, tessellation or geometry) which framebuffer layer write the output fragments to.	2020-02-16 04:17:37 -03:00
ReinUsesLisp	bfda5ff3f6	texture_cache: Avoid matches in 3D textures Code before this commit was trying to match 3D textures with another target. Fix that.	2020-02-16 04:15:42 -03:00
ReinUsesLisp	fd62bdf377	surface_base: Implement texture buffer flushes Implement downloads to guest memory from texture buffers on the generic cache and OpenGL.	2020-02-16 04:13:27 -03:00
bunnei	0f70f68fb3	Revert "video_core: memory_manager: Use GPU interface for cache functions."	2020-02-15 17:47:15 -05:00
ReinUsesLisp	14c2a4a2ec	texture: Implement R32I	2020-02-15 16:26:50 -03:00
ReinUsesLisp	6910ade146	shader/texture: Allow 2D shadow arrays and simplify code Shadow sampler 2D arrays are supported on OpenGL, so there's no reason to forbid these. Enable textureLod usage on these. Minor style changes.	2020-02-15 02:36:28 -03:00
ReinUsesLisp	91aa58e410	maxwell_3d: Unify draw methods Pass instanced state of a draw invocation as an argument instead of having two separate virtual methods.	2020-02-14 18:09:40 -03:00
ReinUsesLisp	6d3a046caa	query_cache: Address feedback	2020-02-14 17:38:27 -03:00
ReinUsesLisp	54a00ee4cf	query_cache: Fix ambiguity in CacheAddr getter	2020-02-14 17:38:27 -03:00
ReinUsesLisp	cc0694559f	query_cache: Add a recursive mutex for concurrent usage	2020-02-14 17:38:27 -03:00
ReinUsesLisp	bcd348f238	vk_query_cache: Implement generic query cache on Vulkan	2020-02-14 17:38:27 -03:00
ReinUsesLisp	c31382ced5	query_cache: Abstract OpenGL implementation Abstract the current OpenGL implementation into the VideoCommon namespace and reimplement it on top of that. Doing this avoids repeating code and logic in the Vulkan implementation.	2020-02-14 17:38:27 -03:00
ReinUsesLisp	73d2d3342d	gl_query_cache: Optimize query cache Use a custom cache instead of relying on a ranged cache.	2020-02-14 17:38:27 -03:00
ReinUsesLisp	aae8c180cb	gl_query_cache: Implement host queries using a deferred cache Instead of waiting immediately for executed commands, defer the query until the guest CPU reads it. This way we get closer to what the guest program is doing. To archive this we have to build a dependency queue, because host APIs (like OpenGL and Vulkan) use ranged queries instead of counters like NVN. Waiting for queries implicitly uses fences and this requires a command being queued, otherwise the driver will lock waiting until a timeout. To fix this when there are no commands queued, we explicitly call glFlush.	2020-02-14 17:33:13 -03:00
ReinUsesLisp	ef9920e164	gl_rasterizer: Sort method declarations	2020-02-14 17:27:17 -03:00
ReinUsesLisp	fe1238be7a	gl_rasterizer: Add queued commands counter Keep track of the queued OpenGL commands that can signal a fence if waited on. As a side effect, we avoid calls to glFlush when no commands are queued.	2020-02-14 17:27:17 -03:00
ReinUsesLisp	2b58652f08	maxwell_3d: Slow implementation of passed samples (query 21) Implements GL_SAMPLES_PASSED by waiting immediately for queries.	2020-02-14 17:27:17 -03:00
bunnei	63a59b9935	Merge pull request #3379 from ReinUsesLisp/cbuf-offset shader/decode: Fix constant buffer offsets	2020-02-14 13:22:53 -05:00
ReinUsesLisp	3217400dd1	gl_resource_manager: Add managed query class	2020-02-13 22:25:55 -03:00
bunnei	3563af2364	Merge pull request #3395 from FernandoS27/queries GPU: Refactor queries implementation and correct GPU Clock.	2020-02-13 20:18:26 -05:00
ReinUsesLisp	336a4f8e99	gl_rasterizer: Use the least generic OpenGL draw function possible This may help some implementations.	2020-02-13 21:55:21 -03:00
ReinUsesLisp	cbea8c74de	vk_shader_decompiler: Fix vertex id and instance id Vulkan's VertexIndex and InstanceIndex don't match with hardware. This is because Nvidia implements gl_VertexID and gl_InstanceID. The math that relates these is: gl_VertexIndex = gl_BaseVertex + gl_VertexID gl_InstanceIndex = gl_InstanceIndex + gl_InstanceID To emulate it using what Vulkan's SPIR-V offers (the Index variants) this commit substracts gl_Base from gl_*Index to obtain the OpenGL and hardware's equivalent.	2020-02-13 20:25:28 -03:00
Fernando Sahmkow	d6ed31b9fa	GPU: Address Feedback.	2020-02-13 18:16:07 -04:00
bunnei	37f1cf8cbd	Merge pull request #3376 from ReinUsesLisp/point-sprite gl_rasterizer: Implement GL_POINT_SPRITE	2020-02-11 08:26:07 -05:00
Fernando Sahmkow	8e9a4944db	GPU: Implement GPU Clock correctly.	2020-02-10 10:44:54 -04:00
Fernando Sahmkow	0cb3bcfbb7	Maxwell3D: Correct query reporting.	2020-02-10 10:41:43 -04:00
bunnei	84ea9c2b42	Merge pull request #3372 from ReinUsesLisp/fix-back-stencil maxwell_3d: Fix stencil back mask	2020-02-09 22:29:28 -05:00
bunnei	e210835dd0	Merge pull request #3387 from bunnei/gpu-mpscqueue gpu_thread: Use MPSCQueue for GPU commands.	2020-02-08 21:15:48 -05:00
bunnei	b5c13ee0eb	gpu_thread: Use MPSCQueue for GPU commands. - Necessary for multiple service threads.	2020-02-07 23:01:23 -05:00
bunnei	7cacb08cdf	video_core: memory_manager: Use GPU interface for cache functions.	2020-02-07 22:59:35 -05:00
bunnei	90bda66028	Merge pull request #3378 from ReinUsesLisp/uscaled maxwell_to_gl: Implement R8G8_USCALED	2020-02-07 22:55:52 -05:00
bunnei	90df4b8e2b	Merge pull request #3369 from ReinUsesLisp/shf shader/shift: Implement SHF	2020-02-07 22:06:57 -05:00
bunnei	09d766d357	Merge pull request #3362 from ReinUsesLisp/fix-instanced gl_rasterizer: Fix instanced draw arrays	2020-02-06 21:39:59 -05:00
ReinUsesLisp	bf9a822b87	shader/decode: Fix constant buffer offsets Some instances were using cbuf34.offset instead of cbuf34.GetOffset(). This returned the an invalid offset. Address those instances and rename offset to "shifted_offset" to avoid future bugs.	2020-02-05 12:19:09 -03:00
ReinUsesLisp	8bb9eef97b	maxwell_to_gl: Implement R8G8_USCALED	2020-02-04 21:32:36 -03:00
ReinUsesLisp	c81c361e82	maxwell_to_gl: Reduce unimplemented formats to LOG_ERROR	2020-02-04 21:32:08 -03:00
ReinUsesLisp	0eb36c90f4	vk_rasterizer: Use noexcept variants of std::bitset Removes bounds checking from "texceptions" instances.	2020-02-04 18:04:24 -03:00
bunnei	08c508b1c4	Merge pull request #3357 from ReinUsesLisp/bfi-rc shader/bfi: Implement register-constant buffer variant	2020-02-04 15:14:13 -05:00
ReinUsesLisp	7da52673d0	gl_rasterizer: Implement GL_POINT_SPRITE OpenGL core defaults to GL_POINT_SPRITE, meanwhile on OpenGL compatibility we have to explicitly enable it. This fixes gl_PointCoord's behaviour.	2020-02-04 15:19:45 -03:00
bunnei	bf21aacc74	Merge pull request #3356 from ReinUsesLisp/fcmp shader/arithmetic: Implement FCMP	2020-02-04 11:36:59 -05:00
bunnei	c31ec00d67	Merge pull request #3337 from ReinUsesLisp/vulkan-staged yuzu: Implement Vulkan frontend	2020-02-03 16:56:25 -05:00
ReinUsesLisp	4eed744277	maxwell_3d: Fix stencil back mask	2020-02-02 17:50:46 -03:00
ReinUsesLisp	223a89a19f	shader: Remove curly braces initializers on shared pointers	2020-02-01 22:52:10 -03:00
bunnei	b5bbe7e752	Merge pull request #3282 from FernandoS27/indexed-samplers Partially implement Indexed samplers in general and specific code in GLSL	2020-02-01 20:41:40 -05:00
ReinUsesLisp	729ca120e3	shader/shift: Implement SHIFT_RIGHT_{IMM,R} Shifts a pair of registers to the right and returns the low register.	2020-02-01 21:20:02 -03:00
ReinUsesLisp	017474c3f8	shader/shift: Implement SHF_LEFT_{IMM,R} Shifts a pair of registers to the left and returns the high register.	2020-02-01 21:19:44 -03:00
bunnei	c593e45dbd	Merge pull request #3347 from ReinUsesLisp/local-mem shader/memory: Implement LDL.S16, LDS.S16, STL.S16 and STS.S16	2020-01-30 10:59:52 -05:00
ReinUsesLisp	b69321650e	gl_rasterizer: Fix instanced draw arrays glDrawArrays was being used when the draw had a base instance specified. This commit removes the draw parameters abstraction and fixes the mentioned issue.	2020-01-30 02:22:00 -03:00
bunnei	2db7adc42a	Merge pull request #3350 from ReinUsesLisp/atom shader/memory: Implement ATOM.ADD	2020-01-29 16:49:54 -05:00
ReinUsesLisp	f92cbc5501	yuzu: Implement Vulkan frontend Adds a Qt and SDL2 frontend for Vulkan. It also finishes the missing bits on Vulkan initialization.	2020-01-29 17:53:11 -03:00
ReinUsesLisp	788d57d723	settings: Add settings for graphics backend	2020-01-29 17:53:11 -03:00
ReinUsesLisp	9f0162e4b5	shader/other: Fix skips for SYNC and BRK	2020-01-29 17:53:11 -03:00
ReinUsesLisp	270177f38a	shader/other: Stub S2R LaneId	2020-01-29 17:53:11 -03:00
ReinUsesLisp	b35449c85d	buffer_cache: Delay buffer destructions Delay buffer destruction some extra frames to avoid destroying buffers that are still being used from older frames. This happens on Nvidia's driver with mailbox.	2020-01-29 17:53:11 -03:00
bunnei	b11aeced18	Merge pull request #3355 from ReinUsesLisp/break-down texture_cache/surface_base: Fix layered break down	2020-01-29 12:29:56 -05:00
bunnei	91f79225e7	Merge pull request #3358 from ReinUsesLisp/implicit-texture-cache gl_texture_cache: Silence implicit sign cast warnings	2020-01-29 11:23:50 -05:00
bunnei	c457e47297	Merge pull request #3359 from ReinUsesLisp/assert-point-size gl_shader_decompiler: Remove UNIMPLEMENTED for gl_PointSize	2020-01-28 15:19:51 -05:00
ReinUsesLisp	8178fe8960	gl_shader_decompiler: Remove UNIMPLEMENTED for gl_PointSize This was implemented by a previous commit and it's no longer required.	2020-01-28 16:32:30 -03:00
ReinUsesLisp	abae795986	gl_texture_cache: Silence implicit sign cast warnings	2020-01-27 20:59:11 -03:00
ReinUsesLisp	137a8aa55c	shader/bfi: Implement register-constant buffer variant It's the same as the variant that was implemented, but it takes the operands from another source.	2020-01-27 01:20:38 -03:00
ReinUsesLisp	e3fc3459c8	shader/arithmetic: Implement FCMP Compares the third operand with zero, then selects between the first and second.	2020-01-27 01:15:44 -03:00
ReinUsesLisp	f55f6ff9bb	texture_cache/surface_base: Fix layered break down Layered break downs was passing "layer" as a "depth" parameter. This commit addresses that.	2020-01-26 21:48:07 -03:00
ReinUsesLisp	d17dfa6104	gl_texture_cache: Properly implement depth/stencil sampling This addresses the long standing issue of compatibility vs. core profiles on OpenGL, properly implementing depth vs. stencil sampling depending on the texture swizzle.	2020-01-26 21:44:08 -03:00
ReinUsesLisp	d95d4ac843	shader/memory: Implement ATOM.ADD ATOM operates atomically on global memory. For now only add ATOM.ADD since that's what was found in commercial games. This asserts for ATOM.ADD.S32 (handling the others as unimplemented), although ATOM.ADD.U32 shouldn't be any different. This change forces us to change the default type on SPIR-V storage buffers from float to uint. We could also alias the buffers, but it's simpler for now to just use uint. While we are at it, abstract the code to avoid repetition.	2020-01-26 01:54:24 -03:00
Fernando Sahmkow	bb8eb15d39	Shader_IR: Address feedback.	2020-01-25 09:04:59 -04:00
ReinUsesLisp	d26e74f0a3	shader/memory: Implement STL.S16 and STS.S16	2020-01-25 03:16:10 -03:00
ReinUsesLisp	9a2cdf8520	shader/memory: Implement unaligned LDL.S16 and LDS.S16	2020-01-25 03:16:10 -03:00
ReinUsesLisp	531f25a037	shader/memory: Move unaligned load/store to functions	2020-01-25 03:16:10 -03:00
ReinUsesLisp	96638f57c9	shader/memory: Implement LDL.S16 and LDS.S16	2020-01-25 03:15:55 -03:00
bunnei	dfd998216c	Merge pull request #3344 from ReinUsesLisp/vk-botw vk_shader_decompiler: Disable default values on unwritten render targets	2020-01-24 17:31:55 -05:00
Fernando Sahmkow	806f569143	Shader_IR: Change name of TrackSampler function so it does not confuse with the type.	2020-01-24 16:44:48 -04:00
Fernando Sahmkow	3919b7b8a9	Shader_IR: Corrections, styling and extras.	2020-01-24 16:44:48 -04:00
Fernando Sahmkow	37b8504faa	Shader_IR: Correct Custom Variable assignment.	2020-01-24 16:44:47 -04:00
Fernando Sahmkow	7c530e0666	Shader_IR: Propagate bindless index into the GL compiler.	2020-01-24 16:44:47 -04:00
Fernando Sahmkow	3c34678627	Shader_IR: Implement Injectable Custom Variables to the IR.	2020-01-24 16:43:31 -04:00
Fernando Sahmkow	2b02f29a2d	GL Backend: Introduce indexed samplers into the GL backend	2020-01-24 16:43:31 -04:00
Fernando Sahmkow	037ea431ce	Shader_IR: deduce size of indexed samplers	2020-01-24 16:43:31 -04:00
Fernando Sahmkow	f4603d23c5	Shader_IR: Setup Indexed Samplers on the IR	2020-01-24 16:43:30 -04:00
Fernando Sahmkow	603c861532	Shader_IR: Implement initial code for tracking indexed samplers.	2020-01-24 16:43:30 -04:00
Fernando Sahmkow	64496f2456	Shader_IR: Address Feedback	2020-01-24 16:43:30 -04:00
Fernando Sahmkow	b97608ca64	Shader_IR: Allow constant access of guest driver.	2020-01-24 16:43:30 -04:00
Fernando Sahmkow	dc5cfa8d28	Shader_IR: Address Feedback	2020-01-24 16:43:29 -04:00
Fernando Sahmkow	74aa7de5e3	Guest_driver: Correct compiling errors in GCC.	2020-01-24 16:43:29 -04:00
Fernando Sahmkow	1e4b6bef6f	Shader_IR: Store Bound buffer on Shader Usage	2020-01-24 16:43:29 -04:00
Fernando Sahmkow	c921e496eb	GPU: Implement guest driver profile and deduce texture handler sizes.	2020-01-24 16:43:29 -04:00
bunnei	a104b985a8	Merge pull request #3273 from FernandoS27/txd-array Shader_IR: Implement TXD Array.	2020-01-24 14:02:40 -05:00
ReinUsesLisp	1690f1adba	vk_shader_decompiler: Disable default values on unwritten render targets Some games like The Legend of Zelda: Breath of the Wild assign render targets without writing them from the fragment shader. This generates Vulkan validation errors, so silence these I previously introduced a commit to set "vec4(0, 0, 0, 1)" for these attachments. The problem is that this is not what games expect. This commit reverts that change.	2020-01-24 01:16:21 -03:00
ReinUsesLisp	3ce28342a2	gl_shader_cache: Disable fastmath on Nvidia	2020-01-21 19:08:08 -03:00
Fernando Sahmkow	79e0991d9b	Merge pull request #3330 from ReinUsesLisp/vk-blit-screen vk_blit_screen: Initial implementation	2020-01-20 22:32:16 -04:00
ReinUsesLisp	a665581684	vk_blit_screen: Address feedback	2020-01-20 18:43:11 -03:00
bunnei	69b44392a7	Merge pull request #3328 from ReinUsesLisp/vulkan-atoms vk_shader_decompiler: Implement UAtomicAdd (ATOMS) on SPIR-V	2020-01-20 00:01:52 -05:00
bunnei	5a077c95ce	Merge pull request #3322 from ReinUsesLisp/vk-front-face vk_graphics_pipeline: Set front facing properly	2020-01-19 23:22:34 -05:00
ReinUsesLisp	f5dfe68a94	vk_blit_screen: Initial implementation This abstraction takes care of presenting accelerated and non-accelerated or "framebuffer" images to the Vulkan swapchain.	2020-01-19 21:12:43 -03:00
bunnei	41373d212e	Merge pull request #3313 from ReinUsesLisp/vk-rasterizer vk_rasterizer: Implement Vulkan's rasterizer	2020-01-19 18:09:01 -05:00
ReinUsesLisp	b2c976ad0e	vk_shader_decompiler: Implement UAtomicAdd (ATOMS) on SPIR-V Also updates sirit to include atomic instructions.	2020-01-19 16:40:31 -03:00
Fernando Sahmkow	51c8aea979	Merge pull request #3317 from ReinUsesLisp/gl-decomp-cc-decomp gl_shader_decompiler: Fix decompilation of condition codes	2020-01-18 19:56:55 -04:00
ReinUsesLisp	d110a371bb	gl_state: Use bool instead of GLboolean This fixes template resolution considering GLboolean an integer instead of a bool.	2020-01-18 19:10:34 -03:00
ReinUsesLisp	94915d4ea1	vk_graphics_pipeline: Set front facing properly Front face was being forced to a certain value when cull face is disabled. Set a default value on initialization and drop the forcefully set front facing value with culling disabled.	2020-01-18 18:50:47 -03:00
bunnei	9bf4850f74	Merge pull request #3305 from ReinUsesLisp/point-size-program gl_state: Implement PROGRAM_POINT_SIZE	2020-01-18 01:56:32 -05:00
bunnei	15163edaaa	Merge pull request #3312 from ReinUsesLisp/atoms-u32 shader/memory: Implement ATOMS.ADD.U32	2020-01-18 00:54:07 -05:00
ReinUsesLisp	09b1d762d7	vk_rasterizer: Address feedback	2020-01-17 21:40:01 -03:00
ReinUsesLisp	f34e519da3	gl_shader_decompiler: Fix decompilation of condition codes Use Visit instead of reimplementing it. Fixes unimplemented negations for condition codes.	2020-01-17 21:23:01 -03:00
bunnei	48863afb65	Merge pull request #3306 from ReinUsesLisp/gl-texture gl_texture_cache: Minor fixes and style changes	2020-01-17 15:44:02 -05:00
bunnei	657b3a366e	Merge pull request #3311 from ReinUsesLisp/z32fx24s8 format_lookup_table: Fix ZF32_X24S8 component types	2020-01-17 08:22:32 -05:00
ReinUsesLisp	fe5356d223	vk_rasterizer: Implement Vulkan's rasterizer This abstraction is Vulkan's equivalent to OpenGL's rasterizer. It takes care of joining all parts of the backend and rendering accordingly on demand.	2020-01-16 23:05:15 -03:00
ReinUsesLisp	38e789c761	renderer_vulkan: Add header as placeholder	2020-01-16 22:54:15 -03:00
bunnei	e041f33569	Merge pull request #3300 from ReinUsesLisp/vk-texture-cache vk_texture_cache: Implement generic texture cache on Vulkan	2020-01-16 19:19:26 -05:00
ReinUsesLisp	f09cd52980	vk_texture_cache: Address feedback	2020-01-16 18:23:10 -03:00
ReinUsesLisp	63ba41a26d	shader/memory: Implement ATOMS.ADD.U32	2020-01-16 17:30:55 -03:00
ReinUsesLisp	0caab54b5d	format_lookup_table: Fix ZF32_X24S8 component types Component types for ZF32_X24S8 were using UNORM. Drivers will set FLOAT, UINT, UNORM, UNORM; causing a format mismatch. This commit addresses that.	2020-01-16 17:29:13 -03:00
Rodrigo Locatti	82e1285c1e	vk_texture_cache: Fix typo in commentary Co-Authored-By: MysticExile <30736337+MysticExile@users.noreply.github.com>	2020-01-16 16:59:46 -03:00
bunnei	30faf6a964	Merge pull request #3308 from lioncash/private maxwell_3d: Make dirty_pointers private	2020-01-16 13:26:35 -05:00
bunnei	d23869811d	Merge pull request #3304 from lioncash/fwd-decl renderer_opengl/utils: Forward declare private structs	2020-01-16 11:21:18 -05:00
Lioncash	9e874898f5	maxwell_3d: Make dirty_pointers private This isn't used outside of the class itself, so we can make it private for the time being.	2020-01-16 04:07:15 -05:00
ReinUsesLisp	c375d735e6	gl_state: Implement PROGRAM_POINT_SIZE For gl_PointSize to have effect we have to activate GL_PROGRAM_POINT_SIZE.	2020-01-15 16:14:17 -03:00
Lioncash	7af56dfa76	renderer_opengl/utils: Remove unused header inclusions Nothing from these headers are used, so they can be removed.	2020-01-15 06:31:23 -05:00
Lioncash	06d30fbcca	renderer_opengl/utils: Forward declare private structs Keeps the definitions hidden and allows changes to the structs without needing to recompile all users of classes containing said structs.	2020-01-15 06:30:01 -05:00
ReinUsesLisp	66a1c777c9	gl_texture_cache: Use local variables to simplify DownloadTexture	2020-01-14 17:39:48 -03:00
ReinUsesLisp	cdb00546f0	gl_texture_cache: Fix format for RGBX16F	2020-01-14 17:38:33 -03:00
ReinUsesLisp	2d09467f6f	gl_texture_cache: Use Snorm internal format for RG8S	2020-01-14 17:37:58 -03:00
ReinUsesLisp	02624c35ec	gl_texture_cache: Use Snorm internal format for ABGR8S	2020-01-14 17:37:23 -03:00
Rodrigo Locatti	64cd46579b	Merge pull request #3303 from lioncash/reorder control_flow: Silence -Wreorder warning for CFGRebuildState	2020-01-14 16:15:18 -03:00
Lioncash	a1eee1749e	control_flow: Silence -Wreorder warning for CFGRebuildState Organizes the initializer list in the same order that the variables would actually be initialized in.	2020-01-14 13:28:48 -05:00
Lioncash	f10ea944e0	gl_shader_cache: Remove unused STAGE_RESERVED_UBOS constant Given this isn't used, this can be removed entirely.	2020-01-14 13:16:52 -05:00
Lioncash	4cd5ad90f3	gl_shader_cache: std::move entries in CachedShader constructor Avoids several reallocations of std::vector instances where applicable.	2020-01-14 13:14:16 -05:00
Lioncash	15a6840e7a	gl_shader_cache: Remove unused entries variable in BuildShader() Eliminates a few unnecessary constructions of std::vectors.	2020-01-14 13:11:49 -05:00
bunnei	55f95e7f26	Merge pull request #3287 from ReinUsesLisp/ldg-stg-16 shader_ir/memory: Implement u16 and u8 for STG and LDG	2020-01-14 09:57:08 -05:00
bunnei	15788ffcde	Merge pull request #3288 from ReinUsesLisp/uncurse-aoffi shader_ir/texture: Simplify AOFFI code	2020-01-13 23:52:12 -05:00
bunnei	6985eea519	Merge pull request #3290 from ReinUsesLisp/gl-clamp maxwell_to_vk: Implement GL_CLAMP hacking Nvidia's driver	2020-01-13 19:16:06 -05:00
ReinUsesLisp	09e17fbb0f	vk_texture_cache: Implement generic texture cache on Vulkan It currently ignores PBO linearizations since these should be dropped as soon as possible on OpenGL.	2020-01-13 20:37:50 -03:00
ReinUsesLisp	2b2712fa95	texture_cache/surface_params: Make GetNumLayers public	2020-01-13 20:35:43 -03:00
Rodrigo Locatti	b1138e5ea1	vk_compute_pass: Address feedback Comment hardcoded SPIR-V modules.	2020-01-10 22:46:34 -03:00
ReinUsesLisp	3d46709b7f	maxwell_to_vk: Implement GL_CLAMP hacking Nvidia's driver Nvidia's driver defaults invalid enumerations to GL_CLAMP. Vulkan doesn't expose GL_CLAMP through its API, but we can hack it on Nvidia's driver using the internal driver defaults.	2020-01-10 17:12:50 -03:00
ReinUsesLisp	13021b534c	shader_ir/texture: Simplify AOFFI code	2020-01-09 03:50:37 -03:00
ReinUsesLisp	e2a2a556b9	shader_ir/memory: Implement u16 and u8 for STG and LDG Using the same technique we used for u8 on LDG, implement u16. In the case of STG, load memory and insert the value we want to set into it with bitfieldInsert. Then set that value.	2020-01-09 02:12:29 -03:00
ReinUsesLisp	908e085d02	vk_compute_pass: Add compute passes to emulate missing Vulkan features This currently only supports quad arrays and u8 indices. In the future we can remove quad arrays with a table written from the CPU, but this was used to bootstrap the other passes helpers and it was left in the code. The blob code is generated from the "shaders/" directory. Read the instructions there to know how to generate the SPIR-V.	2020-01-08 19:24:26 -03:00
ReinUsesLisp	82a64da077	vk_shader_util: Add helper to build SPIR-V shaders	2020-01-08 19:22:20 -03:00
ReinUsesLisp	6888d776ff	vk_pipeline_cache: Initial implementation Given a pipeline key, this cache returns a pipeline abstraction (for graphics or compute).	2020-01-06 22:02:26 -03:00
ReinUsesLisp	2effdeb924	vk_graphics_pipeline: Initial implementation This abstractio represents the state of the 3D engine at a given draw. Instead of changing individual bits of the pipeline how it's done in APIs like D3D11, OpenGL and NVN; on Vulkan we are forced to put everything together into a single, immutable object. It takes advantage of the few dynamic states Vulkan offers.	2020-01-06 22:02:26 -03:00
ReinUsesLisp	dc96a59fa0	vk_compute_pipeline: Initial implementation This abstraction represents a Vulkan compute pipeline.	2020-01-06 22:02:26 -03:00
ReinUsesLisp	b392a5986e	vk_pipeline_cache: Add file and define descriptor update template filler This function allows us to share code between compute and graphics pipelines compilation.	2020-01-06 22:02:26 -03:00
ReinUsesLisp	3142f1b597	fixed_pipeline_state: Add depth clamp	2020-01-06 22:02:26 -03:00
ReinUsesLisp	9c548146ca	vk_rasterizer: Add placeholder	2020-01-06 22:02:26 -03:00
bunnei	5be00cba15	Merge pull request #3276 from ReinUsesLisp/pipeline-reqs vk_update_descriptor/vk_renderpass_cache: Add pipeline cache dependencies	2020-01-06 17:03:34 -05:00
ReinUsesLisp	5aeff9aff5	vk_renderpass_cache: Initial implementation The renderpass cache is used to avoid creating renderpasses on each draw. The hashed structure is not currently optimized.	2020-01-06 18:28:32 -03:00
ReinUsesLisp	322d6a0311	vk_update_descriptor: Initial implementation The update descriptor is used to store in flat memory a large chunk of staging data used to update descriptor sets through templates. It provides a push interface to easily insert descriptors following the current pipeline. The order used in the descriptor update template has to be implicitly followed. We can catch bugs here using validation layers.	2020-01-06 18:28:32 -03:00
ReinUsesLisp	5b01f80a12	vk_stream_buffer/vk_buffer_cache: Avoid halting and use generic cache The stream buffer before this commit once it was full (no more bytes to write before looping) waiting for all previous operations to finish. This was a temporary solution and had a noticeable performance penalty in performance (from what a profiler showed). To avoid this mark with fences usages of the stream buffer and once it loops wait for them to be signaled. On average this will never wait. Each fence knows where its usage finishes, resulting in a non-paged stream buffer. On the other side, the buffer cache is reimplemented using the generic buffer cache. It makes use of the staging buffer pool and the new stream buffer.	2020-01-06 18:13:41 -03:00
ReinUsesLisp	ceb851b590	vk_memory_manager: Misc changes * Allocate memory in discrete exponentially increasing chunks until the 128 MiB threshold. Allocations larger thant that increase linearly by 256 MiB (depending on the required size). This allows to use small allocations for small resources. * Move memory maps to a RAII abstraction. To optimize for debugging tools (like RenderDoc) users will map/unmap on usage. If this ever becomes a noticeable overhead (from my profiling it doesn't) we can transparently move to persistent memory maps without harming the API, getting optimal performance for both gameplay and debugging. * Improve messages on exceptional situations. * Fix typos "requeriments" -> "requirements". * Small style changes.	2020-01-06 18:13:41 -03:00
ReinUsesLisp	85bb6a6f08	vk_buffer_cache: Temporarily remove buffer cache This is intended for a follow up commit to avoid circular dependencies.	2020-01-06 17:58:46 -03:00
bunnei	89fc75d769	Merge pull request #3257 from degasus/no_busy_loops video_core: Block in WaitFence.	2020-01-06 00:09:57 -05:00
Fernando Sahmkow	56e450a3f7	Merge pull request #3264 from ReinUsesLisp/vk-descriptor-pool vk_descriptor_pool: Initial implementation	2020-01-05 15:54:41 -04:00
bunnei	cd0a7dfdbc	Merge pull request #3258 from FernandoS27/shader-amend Shader_IR: add the ability to amend code in the shader ir.	2020-01-04 14:05:17 -05:00
Fernando Sahmkow	3dd6b55851	Shader_IR: Address Feedback	2020-01-04 14:40:57 -04:00
Fernando Sahmkow	a1667a7b46	Shader_IR: Implement TXD Array. This commit extends the compilation of TXD to support array samplers on TXD.	2020-01-04 13:28:02 -04:00
Rodrigo Locatti	6e347d8d1b	Update src/video_core/renderer_vulkan/vk_descriptor_pool.cpp Co-Authored-By: Mat M. <mathew1800@gmail.com>	2020-01-03 17:34:30 -03:00
ReinUsesLisp	0d6d8129c4	yuzu: Remove Maxwell debugger This was carried from Citra and wasn't really used on yuzu. It also adds some runtime overhead. This commit removes it from yuzu's codebase.	2020-01-02 23:09:44 -03:00
bunnei	ae0e481677	Merge pull request #3243 from ReinUsesLisp/topologies maxwell_to_gl: Implement missing primitive topologies	2020-01-01 20:33:33 -05:00
ReinUsesLisp	1fe7df4517	vk_descriptor_pool: Initial implementation Create a large descriptor pool where we allocate all our descriptors from. It has to be wide enough to support any pipeline, hence its large numbers. If the descritor pool is filled, we allocate more memory at that moment. This way we can take advantage of permissive drivers like Nvidia's that allocate more descriptors than what the spec requires.	2020-01-01 16:44:06 -03:00
bunnei	028b2718ed	Merge pull request #3239 from ReinUsesLisp/p2r shader/p2r: Implement P2R Pr	2019-12-31 20:37:16 -05:00
Fernando Sahmkow	b3371ed09e	Shader_IR: add the ability to amend code in the shader ir. This commit introduces a mechanism by which shader IR code can be amended and extended. This useful for track algorithms where certain information can derived from before the track such as indexes to array samplers.	2019-12-30 15:31:48 -04:00
Fernando Sahmkow	7bd447355f	Merge pull request #3248 from ReinUsesLisp/vk-image vk_image: Add an image object abstraction	2019-12-30 14:25:14 -04:00
Rodrigo Locatti	4cbb363d3f	vk_image: Avoid unnecesary equals	2019-12-30 13:28:23 -03:00
Fernando Sahmkow	287d5921cf	Merge pull request #3249 from ReinUsesLisp/vk-staging-buffer-pool vk_staging_buffer_pool: Add a staging pool for temporary operations	2019-12-30 12:25:59 -04:00
Markus Wick	cb9dd01ffd	video_core: Block in WaitFence. This function is called rarely and blocks quite often for a long time. So don't waste power and let the CPU sleep. This might also increase the performance as the other cores might be allowed to clock higher.	2019-12-30 13:04:53 +01:00
Rodrigo Locatti	f2c61bbe13	vk_staging_buffer_pool: Initialize last epoch to zero	2019-12-29 19:19:43 -03:00
Fernando Sahmkow	f846e3d6d0	Merge pull request #3250 from ReinUsesLisp/empty-fragment gl_rasterizer: Allow rendering without fragment shader	2019-12-28 14:33:53 -04:00
bunnei	8a76f816a4	Merge pull request #3228 from ReinUsesLisp/ptp shader/texture: Implement AOFFI and PTP for TLD4 and TLD4S	2019-12-26 21:43:44 -05:00
ReinUsesLisp	5b989f189f	gl_rasterizer: Allow rendering without fragment shader Rendering without a fragment shader is usually used in depth-only passes.	2019-12-26 16:38:49 -03:00
ReinUsesLisp	3813af2f3c	vk_staging_buffer_pool: Add a staging pool for temporary operations The job of this abstraction is to provide staging buffers for temporary operations. Think of image uploads or buffer uploads to device memory. It automatically deletes unused buffers.	2019-12-25 18:12:17 -03:00
ReinUsesLisp	c83bf7cd1e	vk_image: Add an image object abstraction This object's job is to contain an image and manage its transitions. Since Nvidia hardware doesn't know what a transition is but Vulkan requires them anyway, we have to state track image subresources individually. To avoid the overhead of tracking each subresource in images with many subresources (think of cubemap arrays with several mipmaps), this commit tracks when subresources have diverged. As long as this doesn't happen we can check the state of the first subresource (that will be shared with all subresources) and update accordingly. Image transitions are deferred to the scheduler command buffer.	2019-12-25 18:00:16 -03:00
Fernando Sahmkow	5619d24377	Merge pull request #3244 from ReinUsesLisp/vk-fps fixed_pipeline_state: Define structure and loaders	2019-12-25 14:31:29 -04:00
bunnei	4af569ee47	Merge pull request #3236 from ReinUsesLisp/rasterize-enable gl_rasterizer: Implement RASTERIZE_ENABLE	2019-12-24 22:54:10 -05:00
ReinUsesLisp	b9e3f5eb36	fixed_pipeline_state: Define symetric operator!= and mark as noexcept Marks as noexcept Hash, operator== and operator!= for consistency.	2019-12-24 18:24:08 -03:00
ReinUsesLisp	4a3026b16b	fixed_pipeline_state: Define structure and loaders The intention behind this hasheable structure is to describe the state of fixed function pipeline state that gets compiled to a single graphics pipeline state object. This is all dynamic state in OpenGL but Vulkan wants it in an immutable state, even if hardware can edit it freely. In this commit the structure is defined in an optimized state (it uses booleans, has paddings and many data entries that can be packed to single integers). This is intentional as an initial implementation that is easier to debug, implement and review. It will be optimized in later stages, or it might change if Vulkan gets more dynamic states.	2019-12-22 22:59:11 -03:00
ReinUsesLisp	5770418fb3	maxwell_3d: Add depth bounds registers	2019-12-22 22:55:06 -03:00
ReinUsesLisp	91d35559e5	maxwell_to_gl: Implement missing primitive topologies Many of these topologies are exclusively available in OpenGL.	2019-12-22 22:33:01 -03:00
bunnei	e976d0e924	Merge pull request #3241 from ReinUsesLisp/gl-shader-cache gl_shader_cache: Style changes	2019-12-22 16:23:46 -05:00
bunnei	1e76655f83	Merge pull request #3238 from ReinUsesLisp/vk-resource-manager vk_resource_manager: Catch device losses and other changes	2019-12-22 15:57:16 -05:00
bunnei	0f3ac9cfeb	Merge pull request #3203 from FernandoS27/tex-cache-fixes Texture Cache: Add HLE methods for building 3D textures	2019-12-22 14:25:13 -05:00
Fernando Sahmkow	3dc585d011	Merge pull request #3237 from ReinUsesLisp/vk-shader-decompiler vk_shader_decompiler: Misc changes	2019-12-22 12:36:56 -04:00
Fernando Sahmkow	218ee18417	Texture Cache: Improve documentation	2019-12-22 12:29:23 -04:00
Fernando Sahmkow	a3916588b6	Texture Cache: Address Feedback	2019-12-22 12:24:34 -04:00
Fernando Sahmkow	51c9e98677	Texture Cache: Add HLE methods for building 3D textures within the GPU in certain scenarios. This commit adds a series of HLE methods for handling 3D textures in general. This helps games that generate 3D textures on every frame and may reduce loading times for certain games.	2019-12-22 12:24:34 -04:00
Fernando Sahmkow	aea978e037	Merge pull request #3230 from ReinUsesLisp/vk-emu-shaders renderer_vulkan/shader: Add helper GLSL shaders	2019-12-22 11:23:09 -04:00
Fernando Sahmkow	27efcc15e9	Merge pull request #3240 from ReinUsesLisp/decomp-cond-code vk_shader_decompiler: Use Visit instead of reimplementing it	2019-12-22 11:20:55 -04:00
bunnei	16dcfacbfc	Merge pull request #3235 from ReinUsesLisp/ldg-u8 shader/memory: Implement LDG.U8 and unaligned U8 loads	2019-12-21 22:50:28 -05:00
ReinUsesLisp	1e16023d60	gl_shader_cache: Update commentary for shared memory Remove false commentary. Not dividing by 4 the size of shared memory is not a hack; it describes the number of integers, not bytes. While we are at it sort the generated code to put preprocessor lines on the top.	2019-12-20 22:51:21 -03:00
ReinUsesLisp	486c6a5316	gl_shader_cache: Remove unused entry in GetPrimitiveDescription	2019-12-20 22:49:30 -03:00
ReinUsesLisp	af93909c9c	vk_shader_decompiler: Use Visit instead of reimplementing it ExprCondCode visit implements the generic Visit. Use this instead of that one. As an intended side effect this fixes unwritten memory usages in cases when a negation of a condition code is used.	2019-12-20 21:36:25 -03:00
ReinUsesLisp	38d3a48873	shader/p2r: Implement P2R Pr P2R dumps predicate or condition codes state to a register. This is useful for unit testing.	2019-12-20 18:02:41 -03:00
ReinUsesLisp	cf27b59493	shader/r2p: Refactor P2R to support P2R	2019-12-20 17:55:42 -03:00
bunnei	7be65c6a68	Merge pull request #3234 from ReinUsesLisp/i2f-u8-selector shader/conversion: Implement byte selector in I2F	2019-12-19 22:36:26 -05:00
bunnei	6d55b14cc0	Merge pull request #3233 from ReinUsesLisp/mismatch-sizes shader/texture: Properly shrink unused entries in size mismatches	2019-12-19 20:40:27 -05:00
ReinUsesLisp	e41da22c8d	vk_resource_manager: Add entry to VKFence to test its usage	2019-12-19 16:31:34 -03:00
ReinUsesLisp	ec983a2451	vk_reosurce_manager: Add assert for releasing fences Notify the programmer when a request to release a fence is invalid because the fence is already free.	2019-12-19 16:31:34 -03:00
ReinUsesLisp	6ddffa010a	vk_resource_manager: Implement VKFenceWatch move constructor This allows us to put VKFenceWatch inside a std::vector without storing it in heap. On move we have to signal the fences where the new protected resource is, adding some overhead.	2019-12-19 16:31:34 -03:00
ReinUsesLisp	54747d60bc	vk_device: Add entry to catch device losses VK_NV_device_diagnostic_checkpoints allows us to push data to a Vulkan queue and then query it even after a device loss. This allows us to push the current pipeline object and see what was the call that killed the device.	2019-12-19 16:31:33 -03:00
ReinUsesLisp	2a63b3bdb9	vk_shader_decompiler: Fix full decompilation When full decompilation was enabled, labels were not being inserted and instructions were misused. Fix these bugs.	2019-12-19 16:24:45 -03:00
ReinUsesLisp	de918ebeb0	vk_shader_decompiler: Skip NDC correction when it is native Avoid changing gl_Position when the NDC used by the game is [0, 1] (Vulkan's native).	2019-12-19 16:24:45 -03:00

... 2 3 4 5 6 ...

3905 commits