Age | Commit message (Collapse) | Author | Files | Lines |
|
This reverts commit a3e1937719aad1d542680da3a0cfdc8367558182.
|
|
|
|
|
|
|
|
|
|
|
|
This intrinsic is translated to ALLOC_EXPORT_WORD1_SWIZ, hence its
name. It is used to store vs/fs outputs
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Remove Cxxx registers, add new special register - "ALU_CONST" and new
operand for each alu src - "sel". ALU_CONST is used to designate that the
new operand contains the value to override src.sel, src.kc_bank, src.chan
for constants in the driver.
|
|
|
|
DAGCombiner::reduceBuildVecConvertToConvertBuildVec() was making two
mistakes:
1. It was checking the legality of scalar INT_TO_FP nodes and then generating
vector nodes.
2. It was passing the result value type to
TargetLoweringInfo::getOperationAction() when it should have been
passing the value type of the first operand.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@171420 91177308-0d34-0410-b5e6-96231b3b80d8
|
|
|
|
No functionality change.
|
|
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
|
|
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
|
|
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@170901 91177308-0d34-0410-b5e6-96231b3b80d8
|
|
Unlike SGPRs VGPRs doesn't need to be aligned.
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Christian König <deathsimple@vodafone.de>
|
|
Branch if we have enough instructions so that it makes sense.
Also remove branches if they don't make sense.
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Christian König <deathsimple@vodafone.de>
|
|
This patch replaces the control flow handling with a new
pass which structurize the graph before transforming it to
machine instruction. This has a couple of different advantages
and currently fixes 20 piglit tests without a single regression.
It is now a general purpose transformation that could be not
only be used for SI/R6xx, but also for other hardware
implementations that use a form of structurized control flow.
v2: further cleanup, fixes and documentation
Signed-off-by: Christian König <deathsimple@vodafone.de>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
|
|
They seem to work fine.
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Christian König <deathsimple@vodafone.de>
|
|
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Christian König <deathsimple@vodafone.de>
|
|
The Align parameter is a power of two, so 16 results in 64K
alignment. Additional to that even 16 byte alignment doesn't
make any sense, so just remove it.
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Christian König <deathsimple@vodafone.de>
|
|
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
|
|
store intrinsic
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
|
|
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
|
|
VALU instructions can only read from one SGPR, and that's the condition code
mask in this case.
Fixes a number of radeonsi piglit regressions from Vincent's max/min changes
(which means the matching to AMDGPUfmax/min doesn't work for some reason).
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
|
|
Fixes hundreds of radeonsi piglit regressions from commit
62c8e1ec4a93b28c55c1c7accb6cbd64952e59c2 ('AMDGPU: replace int_AMDGPU_rcp by
fdiv (1.0, x) in RECIP pattern').
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
|
|
|
|
|
|
This will reduce the number of tablegen patterns we need.
|
|
|
|
|
|
Reviewed-by: Vincent Lejeune <vljn at ovi.com>
|
|
The use list may change during the execution of the loop, so we
need to manually keep track of the next item in the list.
|
|
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
|
|
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
|
|
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Reviewed by: Christian König <christian.koenig@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
|
|
- Add \file to every file
- Use \brief, \returns, \param, etc. where necessary
- Remove duplicate function / class names from doxygen comments
|
|
- Fix more coding style errors with braces on their own line
Found using: grep -rn '^[ ]*{$'
- Remove underscore from start of macro defs
- Remove all tabs
- Remove "end of" comments that don't refer to namespaces.
- Fix an issue with parens being on the wrong line.
- Fix warnings from building with clang
|
|
v2:
- Fold the immediates using the SelectionDAG
Reviewed-by: Vincent Lejeune <vljn at ovi.com>
|
|
Reviewed-by: Vincent Lejeune <vljn at ovi.com>
|
|
Reviewed-by: Vincent Lejeune <vljn at ovi.com>
|
|
v2:
- Fix alignment of case statements
Reviewed-by: Vincent Lejeune <vljn at ovi.com>
|
|
Reviewed-by: Vincent Lejeune <vljn at ovi.com>
|
|
By using the S_*_SAVEEXEC_b64 instructions.
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Reviewed-and-tested-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Christian König <deathsimple@vodafone.de>
|