sneedmca/suyu - sneedGit: git for sneed group

mirror of https://git.suyu.dev/suyu/suyu synced 2024-12-27 03:42:51 -06:00

Author	SHA1	Message	Date
bunnei	c7b5a87c90	Merge pull request #3799 from ReinUsesLisp/iadd-cc shader: Implement P2R CC, IADD Rd.CC and IADD.X	2020-04-30 12:56:36 -04:00
bunnei	6572660fde	Merge pull request #3788 from FernandoS27/revert Revert: shader_decode: Fix LD, LDG when track constant buffer.	2020-04-30 12:55:39 -04:00
ReinUsesLisp	871aadbe36	shader/arithmetic_integer: Fix tracking issue in temporary This temporary is not needed as we mark Rd.CC + IADD.X as unimplemented. It caused issues when tracking global buffers.	2020-04-28 17:14:53 -03:00
bunnei	72b73d22ab	Merge pull request #3784 from ReinUsesLisp/shader-memory-util shader/memory_util: Deduplicate code	2020-04-28 12:05:50 -04:00
ReinUsesLisp	ddd82ef42b	shader/memory_util: Deduplicate code Deduplicate code shared between vk_pipeline_cache and gl_shader_cache as well as shader decoder code. While we are at it, fix a bug in gl_shader_cache where compute shaders had an start offset of a stage shader.	2020-04-26 01:38:51 -03:00
ReinUsesLisp	e895a4e2d7	shader/arithmetic_integer: Fix edge case and mark IADD.X Rd.CC as unimplemented IADD.X Rd.CC requires some extra logic that is not currently implemented. Abort when this is hit.	2020-04-25 22:58:33 -03:00
ReinUsesLisp	2a96bea6a7	shader/arithmetic_integer: Change IAdd to UAdd to avoid signed overflow Signed integer addition overflow might be undefined behavior. It's free to change operations to UAdd and use unsigned integers to avoid potential bugs.	2020-04-25 22:57:54 -03:00
ReinUsesLisp	c788f9c0bd	shader/arithmetic_integer: Implement IADD.X IADD.X takes the carry flag and adds it to the result. This is generally used to emulate 64-bit operations with 32-bit registers.	2020-04-25 22:56:11 -03:00
ReinUsesLisp	255197e643	shader/arithmetic_integer: Implement CC for IADD	2020-04-25 22:55:26 -03:00
ReinUsesLisp	ffc5ec6fa8	decode/register_set_predicate: Implement CC P2R CC takes the state of condition codes and puts them into a register. We already have this implemented for PR (predicates). This commit implements CC over that.	2020-04-25 22:54:42 -03:00
ReinUsesLisp	d523734266	decode/register_set_predicate: Use move for shared pointers Avoid atomic counters used by shared pointers.	2020-04-25 22:54:14 -03:00
bunnei	4e37825dab	Merge pull request #3734 from ReinUsesLisp/half-float-mods decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits	2020-04-25 00:41:43 -04:00
bunnei	7c8acb0025	Merge pull request #3749 from ReinUsesLisp/lea-imm shader/arithmetic_integer: Fix LEA_IMM encoding	2020-04-24 14:30:13 -04:00
Fernando Sahmkow	d8a961cd6c	Revert: shader_decode: Fix LD, LDG when track constant buffer.	2020-04-24 11:00:54 -04:00
ReinUsesLisp	dbaebd8582	decode/arithmetic_half: Fix HADD2 and HMUL2 absolute and negation bits The encoding for negation and absolute value was wrong. Extracting is now done manually. Similar instructions having different encodings is the rule, not the exception. To keep sanity and readability I preferred to extract the desired bit manually. This is implemented against nxas: `8dbc389957/table.h (L68)` That is itself tested against nvdisasm (Nvidia's official disassembler).	2020-04-23 18:29:38 -03:00
ReinUsesLisp	4fb921ff6b	shader/texture: Support multiple unknown sampler properties This allows deducing some properties from the texture instruction before asking the runtime. By doing this we can handle type mismatches in some instructions from the renderer instead of the shader decoder. Fixes texelFetch issues with games using 2D texture instructions on a 1D sampler.	2020-04-23 18:04:13 -03:00
ReinUsesLisp	72deb773fd	shader_ir: Turn classes into data structures	2020-04-23 18:00:06 -03:00
bunnei	2409fedacf	Merge pull request #3697 from lioncash/declarations CMakeLists: Enable -Wmissing-declarations on Linux builds	2020-04-23 02:18:52 -04:00
bunnei	9bf3abcb63	Merge pull request #3698 from lioncash/warning General: Resolve minor assorted warnings	2020-04-21 14:11:18 -04:00
ReinUsesLisp	8734ccb0cb	shader/arithmetic_integer: Fix LEA_IMM encoding The operand order in LEA_IMM was flipped compared to nvdisasm. Fix that using nxas as reference: `8dbc389957/table.h (L122)`	2020-04-20 21:54:59 -03:00
bunnei	73db83c0ab	Merge pull request #3679 from lioncash/track track: Eliminate redundant copies	2020-04-19 01:22:47 -04:00
Lioncash	e2d8be1ca2	General: Resolve warnings related to missing declarations	2020-04-16 23:43:34 -04:00
Lioncash	678ac54749	decode/memory: Resolve unused variable warning Only the first element of the returned pair is ever used.	2020-04-16 22:45:44 -04:00
Lioncash	d159643fd7	decode/texture: Resolve unused variable warnings. Some variables aren't used, so we can remove these. Unfortunately, diagnostics are still reported on structured bindings even when annotated with [[maybe_unused]], so we need to unpack the elements that we want to use manually.	2020-04-16 22:45:41 -04:00
Lioncash	f522abd8ab	decode/texture: Collapse loop down into std::generate Same behavior, less code.	2020-04-16 22:29:07 -04:00
Lioncash	7e2d60de26	decode/texture: Eliminate trivial missing field initializer warnings We can just specify the initializers.	2020-04-16 22:27:21 -04:00
bunnei	79c1269f0f	Merge pull request #3673 from lioncash/extra CMakeLists: Specify -Wextra on linux builds	2020-04-16 21:12:33 -04:00
Rodrigo Locatti	a5a2ee8766	Merge pull request #3689 from lioncash/unused-var decode/shift: Remove unused variable within Shift()	2020-04-16 02:05:54 -03:00
Lioncash	cd2a12e78f	decode/shift: Remove unused variable within Shift() Removes a redundant variable that is already satisfied by the IsFull() utility function.	2020-04-16 00:16:06 -04:00
Lioncash	72a224d3fc	control_flow: Make use of std::move in TryInspectAddress() Eliminates redundant atomic reference count increments and decrements.	2020-04-15 23:31:22 -04:00
Lioncash	e15ec2705c	track: Eliminate redundant copies Two variables can be references, while two others can be std::moved. Makes for 4 less atomic reference count increments and decrements.	2020-04-15 21:50:09 -04:00
Lioncash	1c340c6efa	CMakeLists: Specify -Wextra on linux builds Allows reporting more cases where logic errors may exist, such as implicit fallthrough cases, etc. We currently ignore unused parameters, since we currently have many cases where this is intentional (virtual interfaces). While we're at it, we can also tidy up any existing code that causes warnings. This also uncovered a few bugs as well.	2020-04-15 21:33:46 -04:00
Fernando Sahmkow	e33196d4e7	Merge pull request #3612 from ReinUsesLisp/red shader/memory: Implement RED.E.ADD and minor changes to ATOM	2020-04-15 15:03:49 -04:00
ReinUsesLisp	fefe7f18f9	shader/arithmetic: Add FCMP_CR variant Adds another variant of FCMP.	2020-04-14 19:11:04 -03:00
Mat M	7b62212461	Merge pull request #3619 from ReinUsesLisp/i2i shader/conversion: Implement I2I sign extension, saturation and selection	2020-04-13 10:17:07 -04:00
Mat M	47036859eb	Merge pull request #3633 from ReinUsesLisp/clean-texdec shader/texture: Remove type mismatches management from shader decoder	2020-04-13 10:13:05 -04:00
Fernando Sahmkow	3d91dbb21d	Merge pull request #3578 from ReinUsesLisp/vmnmx shader/video: Partially implement VMNMX	2020-04-12 10:44:03 -04:00
ReinUsesLisp	76f178ba6e	shader/video: Partially implement VMNMX Implements the common usages for VMNMX. Inputs with a different size than 32 bits are not supported and sign mismatches aren't supported either. VMNMX works as follows: It grabs Ra and Rb and applies a maximum/minimum on them (this is defined by .MX), having in mind the input sign. This result can then be saturated. After the intermediate result is calculated, it applies another operation on it using Rc. These operations are merges, accumulations or another min/max pass. This instruction allows to implement with a more flexible approach GCN's min3 and max3 instructions (for instance).	2020-04-12 00:34:42 -03:00
ReinUsesLisp	a87b16da9a	shader/texture: Remove type mismatches management from shader decoder Since commit `e22816a5bb` we handle type mismatches from the CPU. We don't need to hack our shader decoder due to game bugs anymore. Removed in this commit.	2020-04-10 00:57:32 -03:00
bunnei	b96fd0bd0e	Merge pull request #3601 from ReinUsesLisp/some-shader-encodings video_core/shader: Add some instruction and S2R encodings	2020-04-09 00:17:39 -04:00
Rodrigo Locatti	487f9ba525	Merge pull request #3489 from namkazt/patch-2 shader: implement SULD.D bits32/64	2020-04-07 16:21:09 -03:00
Nguyen Dac Nam	935648ffa9	address nit.	2020-04-07 18:29:30 +07:00
ReinUsesLisp	da706cad25	shader/conversion: Implement I2I sign extension, saturation and selection Reimplements I2I adding sign extension, saturation (clamp source value to the destination), selection and destination sizes that are not 32 bits wide. It doesn't implement CC yet.	2020-04-07 02:19:44 -03:00
Nguyen Dac Nam	bf1174c114	Apply suggestions from code review Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc>	2020-04-07 07:55:49 +07:00
namkazy	2c98e14d13	shader_decode: SULD.D using std::pair instead of out parameter	2020-04-06 13:46:55 +07:00
namkazy	9efa51311f	shader_decode: SULD.D avoid duplicate code block.	2020-04-06 13:34:06 +07:00
namkazy	7f5696513f	shader_decode: SULD.D fix conversion error.	2020-04-06 13:26:58 +07:00
namkazy	2906372ba1	shader_decode: SULD.D implement bits64 and reverse shader ir init method to removed shader stage.	2020-04-06 13:09:19 +07:00
ReinUsesLisp	3185245845	shader/memory: Implement RED.E.ADD Implements a reduction operation. It's an atomic operation that doesn't return a value. This commit introduces another primitive because some shading languages might have a primitive for reduction operations.	2020-04-06 02:24:47 -03:00
ReinUsesLisp	fd0a2b5151	shader/memory: Add "using std::move"	2020-04-06 02:18:14 -03:00
ReinUsesLisp	79970c9174	shader/memory: Minor fixes in ATOM	2020-04-06 00:54:22 -03:00
Fernando Sahmkow	69277de29d	Merge pull request #3592 from ReinUsesLisp/ipa shader_decompiler: Remove FragCoord.w hack and change IPA implementation	2020-04-05 19:29:40 -04:00
namkazy	730f9b55b3	silent warning (conversion error)	2020-04-05 16:02:07 +07:00
namkazy	9f6ebccf06	shader_decode: SULD.D -> SINT actually same as UNORM.	2020-04-05 15:18:42 +07:00
namkazy	6f2b7087c2	shader_decode: SULD.D fix decode SNORM component	2020-04-05 14:46:43 +07:00
namkazy	69657ff19c	clang-format	2020-04-05 12:57:50 +07:00
namkazy	24cc64c5b3	shader_decode: get sampler descriptor from registry.	2020-04-05 12:54:48 +07:00
namkazy	acd3f0ab37	tweaking.	2020-04-05 10:31:32 +07:00
namkazy	3e3afa9be6	cleanup unuse params	2020-04-05 10:31:31 +07:00
namkazy	5cd5857000	cleanup debug code.	2020-04-05 10:31:30 +07:00
namkazy	658112783d	reimplement get component type, uncomment mistaken code	2020-04-05 10:31:30 +07:00
namkazy	3ad06e9b2b	remove disable optimize	2020-04-05 10:31:30 +07:00
namkazy	f24c2e1103	[wip] reimplement SULD.D	2020-04-05 10:31:29 +07:00
namkazy	58bcb86af5	add shader stage when init shader ir	2020-04-05 10:31:29 +07:00
Nguyen Dac Nam	2cefdd92bd	clang-fix	2020-04-05 10:31:28 +07:00
Nguyen Dac Nam	1f3d142875	shader: image - import PredCondition	2020-04-05 10:31:27 +07:00
Nguyen Dac Nam	08db60392d	shader: SULD.D bits32 implement more complexer method.	2020-04-05 10:31:27 +07:00
Nguyen Dac Nam	ed1d8beb13	shader: SULD.D import StoreType	2020-04-05 10:31:26 +07:00
Nguyen Dac Nam	6d235b8631	shader: implement SULD.D bits32	2020-04-05 10:31:26 +07:00
ReinUsesLisp	60106531b4	shader/other: Add error message for some S2R registers	2020-04-04 03:46:07 -03:00
ReinUsesLisp	8b719e9e1d	shader_bytecode: Rename MOV_SYS to S2R	2020-04-04 03:37:51 -03:00
ReinUsesLisp	16ae98dbb3	shader_ir: Add error message for EXIT.FCSM_TR	2020-04-04 03:34:08 -03:00
ReinUsesLisp	e1bd89e1c2	shader/memory: Silence no return value warning Silences a warning about control paths not all returning a value.	2020-04-02 03:34:27 -03:00
ReinUsesLisp	2339fe199f	shader_decompiler: Remove FragCoord.w hack and change IPA implementation Credits go to gdkchan and Ryujinx. The pull request used for this can be found here: https://github.com/Ryujinx/Ryujinx/pull/1082 yuzu was already using the header for interpolation, but it was missing the FragCoord.w multiplication described in the linked pull request. This commit finally removes the FragCoord.w == 1.0f hack from the shader decompiler. While we are at it, this commit renames some enumerations to match Nvidia's documentation (linked below) and fixes component declaration order in the shader program header (z and w were swapped). https://github.com/NVIDIA/open-gpu-doc/blob/master/Shader-Program-Header/Shader-Program-Header.html	2020-04-01 21:48:55 -03:00
Fernando Sahmkow	b03c0536ce	Merge pull request #3561 from ReinUsesLisp/f2f-conversion shader/conversion: Fix F2F rounding operations with different sizes	2020-03-31 14:45:02 -04:00
Fernando Sahmkow	5b95a01463	Merge pull request #3577 from ReinUsesLisp/lea shader/lea: Fix LEA implementation	2020-03-31 14:36:07 -04:00
Nguyen Dac Nam	238c35b2c9	clang-format	2020-03-31 08:08:06 +07:00
Nguyen Dac Nam	defb9642da	shader_decode: fix by suggestion	2020-03-31 08:02:44 +07:00
namkazy	cb0a4151f8	clang-format	2020-03-30 20:46:21 +07:00
namkazy	4f7bea403a	shader_decode: ATOM/ATOMS: add function to avoid code repetition	2020-03-30 18:47:50 +07:00
Nguyen Dac Nam	972485ff18	shader_decode: implement ATOM operation for S32 and U32	2020-03-30 17:44:48 +07:00
namkazy	93cac0d294	clang-format	2020-03-30 17:44:48 +07:00
Nguyen Dac Nam	3dc09a6250	shader_decode: implement ATOMS instr partial.	2020-03-30 17:44:46 +07:00
Nguyen Dac Nam	2c780db5b9	shader: node - update correct comment	2020-03-30 17:44:44 +07:00
Nguyen Dac Nam	c119473c40	shader_decode: add Atomic op for common usage	2020-03-30 17:44:44 +07:00
ReinUsesLisp	5300a918c6	shader/lea: Simplify generated LEA code	2020-03-28 03:55:04 -03:00
ReinUsesLisp	523a709bf1	shader/lea: Fix op_a and op_b usages They were swapped.	2020-03-27 18:37:20 -03:00
ReinUsesLisp	796b3319e6	shader/lea: Remove const and use move when possible	2020-03-27 18:36:38 -03:00
ReinUsesLisp	46791c464a	shader/conversion: Fix F2F rounding operations with different sizes Rounding operations only matter when the conversion size of source and destination is the same, i.e. .F16.F16, .F32.F32 and .F64.F64. When there is a mismatch (.F16.F32), these bits are used for IEEE rounding, we don't emulate this because GLSL and SPIR-V don't support configuring it per operation.	2020-03-26 01:58:49 -03:00
bunnei	23c7dda710	Merge pull request #3544 from makigumo/myfork/patch-2 xmad: fix clang build error	2020-03-25 19:29:16 -04:00
bunnei	e6aff11057	Merge pull request #3520 from ReinUsesLisp/legacy-varyings gl_shader_decompiler: Implement legacy varyings	2020-03-25 19:27:51 -04:00
makigumo	5a5c6d4ed8	xmad: fix clang build error	2020-03-23 00:09:31 +01:00
bunnei	bdddbe2daa	Merge pull request #3505 from namkazt/patch-8 shader_decode: implement XMAD mode CSfu	2020-03-19 17:41:01 -04:00
Rodrigo Locatti	ddafc99776	Merge pull request #3502 from namkazt/patch-3 shader_decode: Reimplement BFE instructions	2020-03-15 21:23:04 -03:00
ReinUsesLisp	6442e02c5d	shader/shader_ir: Track usage in input attribute and of legacy varyings	2020-03-15 21:01:52 -03:00
ReinUsesLisp	8e6e55d6f8	shader/shader_ir: Fix clip distance usage stores	2020-03-15 20:53:14 -03:00
ReinUsesLisp	464bd5fad7	shader/shader_ir: Change declare output attribute to a switch	2020-03-15 20:49:35 -03:00
Nguyen Dac Nam	3287b1247d	clang-format	2020-03-14 10:07:40 +07:00
Nguyen Dac Nam	240d45830d	nit	2020-03-14 09:57:24 +07:00
ReinUsesLisp	ae6189d7c2	shader/transform_feedback: Expose buffer stride	2020-03-13 18:33:05 -03:00

1 2 3 4 5 ...

766 commits