path: root/src/gallium/drivers/cell/ppu
Age | Commit message | Author
2008-09-26cell: checkpoint: more work in emit_function_call()Brian Paul
Simple function call works now, but we don't save/restore the caller's registers yet.
2008-09-26cell: asst clean-up, var renamingBrian Paul
2008-09-26cell: remove unneeded blend/depth_stencil subclassesBrian Paul
2008-09-26cell: checkpoint: support for function calls in SPU shadersBrian Paul
Will be used for instructions like SIN/COS/POW/TEX/etc. The PPU needs to know the address of some functions in the SPU address space. Send that info to the PPU/main memory rather than patch up shaders on the SPU side. Not finished/tested yet...
2008-09-26cell: inst reorder to save a cycleBrian Paul
2008-09-23CELL: fix colormask code generationRobert Ellison
The colormask code generation had assumed that its input packed pixels were in RGBA format. In fact, the format they're in is dependent on the pipe color format. Now the color format is passed in to gen_colormask(), and proper color format-dependent SPU code is generated.
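Why the mask depends on the color format can be sketched in plain C. This is only an illustration: the `packed_layout` enum, the `MASK_*` bits, and both helper functions are hypothetical names, not the pipe format enums or the real gen_colormask() interface, and the two byte layouts are assumed for the example.

```c
#include <stdint.h>

/* Hypothetical packed-pixel layouts, assumed for illustration only. */
enum packed_layout {
   LAYOUT_ARGB,   /* A in bits 31..24, R 23..16, G 15..8, B 7..0 */
   LAYOUT_BGRA    /* B in bits 31..24, G 23..16, R 15..8, A 7..0 */
};

/* Hypothetical per-channel write-enable bits. */
#define MASK_R 0x1u
#define MASK_G 0x2u
#define MASK_B 0x4u
#define MASK_A 0x8u

/* Expand a 4-bit channel mask into a 32-bit pixel mask for one layout. */
static uint32_t
colormask_bits(enum packed_layout layout, unsigned colormask)
{
   unsigned r_shift, g_shift, b_shift, a_shift;
   uint32_t bits = 0;

   switch (layout) {
   case LAYOUT_ARGB:
      a_shift = 24; r_shift = 16; g_shift = 8; b_shift = 0;
      break;
   case LAYOUT_BGRA:
   default:
      b_shift = 24; g_shift = 16; r_shift = 8; a_shift = 0;
      break;
   }

   if (colormask & MASK_R) bits |= 0xffu << r_shift;
   if (colormask & MASK_G) bits |= 0xffu << g_shift;
   if (colormask & MASK_B) bits |= 0xffu << b_shift;
   if (colormask & MASK_A) bits |= 0xffu << a_shift;
   return bits;
}

/* Take the new color where the mask is set, keep the old pixel elsewhere. */
static uint32_t
apply_colormask(uint32_t src, uint32_t dst, uint32_t bits)
{
   return (src & bits) | (dst & ~bits);
}
```

The same "write red only" request yields a different 32-bit mask for each layout, which is the bug class the commit describes.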
2008-09-22cell: Fixed bug with absolute, negate, set-negative logic in source fetch for TGSI instructions.Jonathan White
The logic should operate on the original channel, not the swizzled channel.
2008-09-22cell: Added TRUNC, SWZ (extended) and XPD instructions, verified against softpipe.Jonathan White
Optimized FLR and FRC. Fixed writeback logic for DP3, DP4 and DPH.
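For readers unfamiliar with two of the opcodes named above, here is a scalar reference sketch of DPH (homogeneous dot product) and the three-component cross product behind XPD. The function names are hypothetical, the definitions follow the usual meaning of these operations rather than anything stated in the commit, and the handling of the w component in XPD is deliberately left out.

```c
/* DPH: 3-component dot product plus the second operand's w. */
static float
ref_dph(const float a[4], const float b[4])
{
   return a[0] * b[0] + a[1] * b[1] + a[2] * b[2] + b[3];
}

/* Cross product of the xyz parts; dst[3] is not written here. */
static void
ref_xpd_xyz(const float a[4], const float b[4], float dst[4])
{
   dst[0] = a[1] * b[2] - a[2] * b[1];
   dst[1] = a[2] * b[0] - a[0] * b[2];
   dst[2] = a[0] * b[1] - a[1] * b[0];
}
```

A reference like this is the kind of result a generated SPU sequence would be compared against when checking a driver against softpipe.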
2008-09-22cell: Added DPH instruction and verified against softpipe.Jonathan White
2008-09-19cell: Added FRC instructionJonathan White
2008-09-19cell: Added FLR instruction.Jonathan White
Verified the following instructions match softpipe: MOV, ADD, MUL, SGE, SUB, MAD, ABS, SLT, MIN, MAX, LRP, DP3, DP4, CMP, FLR
2008-09-19cell: Fixed bugs with DP3 and DP4, they match softpipe results now.Jonathan White
2008-09-19cell: flesh out support for other Z/stencil formatBrian Paul
Also: improve float/int Z conversion. Use clgt instead of cgt in depth test since we're comparing unsigned values.
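The difference between a signed and an unsigned word compare can be shown in a few lines of C. The two helpers below are hypothetical stand-ins that mimic the all-ones/all-zeros result of a compare instruction; they are not the emitter's functions.

```c
#include <stdint.h>

/* Signed compare, in the spirit of cgt: all ones if a > b as signed ints.
 * The cast assumes the usual two's-complement representation. */
static uint32_t
cmp_gt_signed(uint32_t a, uint32_t b)
{
   return ((int32_t) a > (int32_t) b) ? 0xffffffffu : 0u;
}

/* Unsigned (logical) compare, in the spirit of clgt. */
static uint32_t
cmp_gt_unsigned(uint32_t a, uint32_t b)
{
   return (a > b) ? 0xffffffffu : 0u;
}
```

With a Z value of 0x80000000 tested against 1, the signed form treats the first operand as negative and reports "not greater", while the unsigned form gives the answer a depth test expects.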
2008-09-19cell: fix a commentBrian Paul
2008-09-19cell: change spe_complement() to take a src and dst reg, like other instructionsBrian Paul
2008-09-19CELL: add codegen for logic op, color maskRobert Ellison
- rtasm_ppc_spe.c, rtasm_ppc_spe.h: added a new macro function "spe_load_uint" for loading and splatting unsigned integers in a register; it will use "ila" for values 18 bits or less, "ilh" for word values that are symmetric across halfwords, "ilhu" for values that have zeroes in their bottom halfwords, or "ilhu" followed by "iohl" for general 32-bit values. Of the 15 color masks of interest, 4 are 18 bits or less, 2 are symmetric across halfwords, 3 are zero in the bottom halfword, and 6 require two instructions to load.
- cell_gen_fragment.c: added full codegen for logic op and color mask.
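The instruction selection described for spe_load_uint can be sketched as a small classifier. This is a minimal sketch, assuming the checks run in the order the commit message lists them; `load_strategy`, `choose_load`, and `instruction_count` are hypothetical names and do not come from rtasm_ppc_spe.c.

```c
#include <stdint.h>

/* Hypothetical labels for the four cases named in the commit message. */
enum load_strategy {
   LOAD_ILA,        /* value fits in 18 bits */
   LOAD_ILH,        /* upper and lower halfwords are identical */
   LOAD_ILHU,       /* lower halfword is zero */
   LOAD_ILHU_IOHL   /* general 32-bit value, two instructions */
};

/* Pick a load sequence for a 32-bit constant (assumed check order). */
static enum load_strategy
choose_load(uint32_t value)
{
   if ((value & 0x0003ffffu) == value)
      return LOAD_ILA;
   if ((value >> 16) == (value & 0xffffu))
      return LOAD_ILH;
   if ((value & 0xffffu) == 0)
      return LOAD_ILHU;
   return LOAD_ILHU_IOHL;
}

/* Number of instructions the chosen sequence needs. */
static unsigned
instruction_count(enum load_strategy s)
{
   return (s == LOAD_ILHU_IOHL) ? 2u : 1u;
}
```

For example, `choose_load(0x00ff00ff)` selects the halfword form because both halves are 0x00ff, while `choose_load(0x00ffffff)` falls through to the two-instruction case.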
2008-09-18cell: Added CMP instructionJonathan White
2008-09-18cell: Fix bug with complement logic for SGE and SLEJonathan White
2008-09-18cell: Added SGE and SLE instructions to dispatch functionJonathan White
2008-09-18cell: Added SGE and SLE instructionsJonathan White
2008-09-18CELL: finish fragment ops blending (except for unusual D3D modes)Robert Ellison
- Added new "macro" functions spe_float_min() and spe_float_max() to rtasm_ppc_spe.{ch}. These emit instructions that cause the minimum or maximum of each element in a vector of floats to be saved in the destination register.
- Major changes to cell_gen_fragment.c to implement all the blending modes (except for the mysterious D3D-based PIPE_BLENDFACTOR_SRC1_COLOR, PIPE_BLENDFACTOR_SRC1_ALPHA, PIPE_BLENDFACTOR_INV_SRC1_COLOR, and PIPE_BLENDFACTOR_INV_SRC1_ALPHA).
- Some revamping of code in cell_gen_fragment.c: use the new spe_float_min() and spe_float_max() functions (instead of expanding these calculations inline via macros); create and use an inline utility function for handling "optional" register allocation (for the {1,1,1,1} vector, and the blend color vectors) instead of expanding with macros; use the Floating Negative Multiply and Subtract (fnms) instruction to simplify and optimize many blending calculations.
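One common way to get a per-element minimum or maximum on a SIMD unit that has compare and bit-select instructions is "compare, then select". The commit does not say which sequence spe_float_min() and spe_float_max() actually emit, so the following is only a sketch of that general technique, with a hypothetical `vec4` type and function names.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical 4-wide float vector for illustration. */
typedef struct { float f[4]; } vec4;

/* Compare-and-select core: mask is all ones where a > b. When
 * take_a_if_greater is nonzero the greater element is kept (max),
 * otherwise the smaller one is kept (min). */
static vec4
select_by_gt(vec4 a, vec4 b, int take_a_if_greater)
{
   vec4 r;
   int i;
   for (i = 0; i < 4; i++) {
      uint32_t ua, ub, ur, mask;
      memcpy(&ua, &a.f[i], sizeof ua);
      memcpy(&ub, &b.f[i], sizeof ub);
      mask = (a.f[i] > b.f[i]) ? 0xffffffffu : 0u;   /* compare step */
      if (take_a_if_greater)
         ur = (ua & mask) | (ub & ~mask);            /* select step */
      else
         ur = (ub & mask) | (ua & ~mask);
      memcpy(&r.f[i], &ur, sizeof ur);
   }
   return r;
}

static vec4 float_min4(vec4 a, vec4 b) { return select_by_gt(a, b, 0); }
static vec4 float_max4(vec4 a, vec4 b) { return select_by_gt(a, b, 1); }
```

Keeping the select as a bitwise operation mirrors how a mask-driven select instruction behaves, which is why the floats are moved through integer bit patterns here.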
2008-09-17cell: example of doing fs/fm sequence with fnms in blendingBrian Paul
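The fnms instruction computes c - a * b, which lets a subtract-then-multiply pair collapse into one operation. The sketch below shows the algebra for a blend term of the form (1 - alpha) * dst; which exact sequences the commit rewrote is not stated, and the function names are hypothetical scalar stand-ins for the vector instructions.

```c
/* Scalar model of fnms: result = c - a * b. */
static float
fnms(float a, float b, float c)
{
   return c - a * b;
}

/* Two-step form: a subtract (fs) followed by a multiply (fm). */
static float
inv_alpha_times_dst_two_step(float alpha, float dst)
{
   float one_minus_alpha = 1.0f - alpha;
   return dst * one_minus_alpha;
}

/* One-step form: (1 - alpha) * dst == dst - alpha * dst. */
static float
inv_alpha_times_dst_fnms(float alpha, float dst)
{
   return fnms(alpha, dst, dst);
}
```

Besides saving an instruction, the one-step form does not need a register holding the constant 1.0.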
2008-09-17cell: dump generated code if CELL_DEBUG=asmBrian Paul
2008-09-17CELL: fleshing out the blending fragment opsRobert Ellison
- Added two new debug flags (to be used with the CELL_DEBUG environment variable). The first, "CELL_DEBUG=fragops", activates SPE fragment ops debug messages. The second, "CELL_DEBUG=fragopfallback", will eventually be used to disable the use of generated SPE code for fragment ops in favor of the default fallback reference routine. (During development, though, the parity of this flag is reversed: all users will get the reference code *unless* CELL_DEBUG=fragopfallback is set. This will prevent hiccups in code generation from affecting the other developers.)
- Formalized debug message usage and macros in spu/spu_main.c.
- Added lots of new code to ppu/cell_gen_fragment.c to extend the number of supported source RGB factors from 4 to 15, and to complete the list of supported blend equations. More coming, to complete the source and destination RGB and alpha factors, and to complete the rest of the fragment operations...
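How a CELL_DEBUG value might map to flag bits, including the temporarily reversed meaning of "fragopfallback", can be sketched as follows. The flag names "asm", "fragops", and "fragopfallback" come from the log; the bit values, the comma-separated syntax, and both function names are assumptions made for this example and are not the driver's actual parser.

```c
#include <string.h>

/* Hypothetical bit assignments for illustration. */
#define DBG_ASM             0x1u
#define DBG_FRAGOPS         0x2u
#define DBG_FRAGOP_FALLBACK 0x4u

/* Parse a comma-separated flag string (assumed syntax) into a bitmask. */
static unsigned
parse_debug_flags(const char *str)
{
   static const struct { const char *name; unsigned bit; } table[] = {
      { "asm",            DBG_ASM },
      { "fragops",        DBG_FRAGOPS },
      { "fragopfallback", DBG_FRAGOP_FALLBACK }
   };
   unsigned flags = 0;

   if (!str)
      return 0;

   while (*str) {
      size_t len = strcspn(str, ",");   /* length of the current token */
      size_t i;
      for (i = 0; i < sizeof table / sizeof table[0]; i++) {
         if (strlen(table[i].name) == len &&
             strncmp(str, table[i].name, len) == 0)
            flags |= table[i].bit;
      }
      str += len;
      if (*str == ',')
         str++;
   }
   return flags;
}

/* Development-time parity described in the commit: generated fragment-op
 * code is used only when "fragopfallback" IS set. */
static int
use_generated_fragment_ops(unsigned flags)
{
   return (flags & DBG_FRAGOP_FALLBACK) != 0;
}
```

A caller would typically pass `getenv("CELL_DEBUG")` (from `<stdlib.h>`) to `parse_debug_flags()` once at startup and keep the resulting mask.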
2008-09-16cell: Added RCP and RSQ instruction support.Jonathan White
2008-09-16cell: Added DP3 and DP4 instructionsJonathan White
2008-09-16cell: Optimized LERP with fmaJonathan White
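A linear interpolation can be rearranged so that it needs one subtract and one fused multiply-add. The log does not show the instruction sequence this commit produced, so the following is only the algebraic idea, written with hypothetical scalar helpers and the operand order t, a, b assumed for illustration.

```c
/* Direct form: t*a + (1 - t)*b, which needs a subtract, two
 * multiplies, and an add. */
static float
lrp_reference(float t, float a, float b)
{
   return t * a + (1.0f - t) * b;
}

/* Rearranged form: t*(a - b) + b, i.e. one subtract followed by a
 * single multiply-add. */
static float
lrp_fma_form(float t, float a, float b)
{
   float diff = a - b;
   return t * diff + b;
}
```

The two forms are algebraically equal; in floating point they can differ in the last bits, which is one reason such changes were verified against softpipe.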
2008-09-16cell: Fixed MIN/MAX algorithmJonathan White
2008-09-15cell: export CELL_DEBUG=asm to dump SPU assembly codeBrian Paul
2008-09-15cell: Added LERP instructionJonathan White
2008-09-15cell: Added support for SLT, SEQ and SNE instructionsJonathan White
2008-09-15cell: Added support for ABS instructionJonathan White
2008-09-15Added support for SUB and MAD instructionsJonathan White
2008-09-13cell: implement negation, absolute value and set-sign for src regs in code genBrian Paul
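For IEEE single-precision values, negation, absolute value, and set-sign all reduce to operations on the sign bit. The sketch below shows that bit-level view; the function names are hypothetical, and the commit does not state which SPE instructions the code generator uses for these modifiers.

```c
#include <stdint.h>
#include <string.h>

#define SIGN_BIT 0x80000000u

static uint32_t
float_bits(float x)
{
   uint32_t u;
   memcpy(&u, &x, sizeof u);
   return u;
}

static float
bits_float(uint32_t u)
{
   float x;
   memcpy(&x, &u, sizeof x);
   return x;
}

/* Negate: flip the sign bit. */
static float src_negate(float x)       { return bits_float(float_bits(x) ^ SIGN_BIT); }

/* Absolute value: clear the sign bit. */
static float src_abs(float x)          { return bits_float(float_bits(x) & ~SIGN_BIT); }

/* Set-sign: force the sign bit on, giving -|x|. */
static float src_set_negative(float x) { return bits_float(float_bits(x) | SIGN_BIT); }
```

Because each modifier is a single bitwise operation, it can be applied to all four channels of a register at once with one vector instruction and a splatted constant.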
2008-09-12cell: remove old disassembly/dump code; use dumper code in SPE emitter.Brian Paul
2008-09-12cell: implement DDX/DDY codegen (untested)Brian Paul
2008-09-12cell: implement TGSI immediates in SPE code generatorBrian Paul
2008-09-12cell: initial support for IF/ELSE/ENDIF in fragment shader codegenBrian Paul
Only one level of if/else/endif nesting is currently working.
2008-09-12cell: disable invalid spe_release_func() calls, fixes crash on exitBrian Paul
2008-09-12cell: fix non-debug build errorBrian Paul
2008-09-12cell: implement swizzling for src regsBrian Paul
2008-09-12cell: disable calls to old gen codeBrian Paul
2008-09-11cell: initial support for fragment shader code generation.Brian Paul
TGSI shaders are translated into SPE instructions which are then sent to the SPEs for execution. Only a few opcodes work, no swizzling yet, no support for constants/immediates, etc.
2008-09-11cell: minor improvements to fragment code-genBrian Paul
2008-09-11cell: put cell_ prefix on gen_fragment_function()Brian Paul
2008-09-11cell: fix typos in blend code-genBrian Paul
2008-09-11cell: disable NEW_VS emitBrian Paul
2008-09-11cell: checkpoint: remove more of the old per-fragment codeBrian Paul
2008-09-11cell: begin new blending code (both codegen and fallback paths)Brian Paul
2008-09-11cell: checkpoint commit of new per-fragment processingBrian Paul
Do code generation for alpha test, z test, stencil, blend, colormask and framebuffer/tile read/write as a single code block. Ian's previous blend/z/stencil test code is still there but mostly disabled and will be removed soon.