summaryrefslogtreecommitdiff
path: root/src/gallium/drivers/cell
AgeCommit message (Collapse)Author
2008-10-10cell: pass texture unit (sampler number) to txp() functionBrian Paul
The glsl/multitex demo runs now.
2008-10-10cell: fix function prologue/epilogue code for large stack framesBrian Paul
The ai instruction is limited to a 10-bit signed immediate value.
2008-10-10CELL: fixing stencil bugsRobert Ellison
These are the defects found and fixed so far. Several more have been observed; I'm working on them. - Fixed an error in spe_load_uint() that caused incorrect values to be loaded if the given unsigned value had the low 18 bits as 0, and that caused inefficient code to be emitted if the given value had the high 14 bits as 0. - Fixed a problem in stencil code generation where optional registers weren't tracked correctly. - Fixed a problem that the stencil function NEVER was acting as ALWAYS. - Fixed several problems that could occur if stenciling were enabled but depth was disabled. - Fixed a problem with two-sided stencil writemask handling that could cause a stencil writemask to not be applied. - Fixed several state permutations that were incorrectly flagged as not requiring stencil values to be calculated.
2008-10-10cell: call cell_flush_int() at end of cell_create_context()Brian Paul
Ensures that SPUs are initialized/ready before proceeding. This fixes a spurious assertion failure when the SPU-side shader function info hasn't been returned to the PPU before shader codegen.
2008-10-10cell: fix LERP when dst reg is a src regBrian Paul
Also, bump up frame size and fix some assertions.
2008-10-10cell: fix fm/fs copy & paste bug from a few commits agoBrian Paul
2008-10-10cell: fix bug in emit_FLR() when src reg == dst regBrian Paul
2008-10-10cell: fix bug in emit_FRC() when src register == dst register.Brian Paul
With this fix, the glsl/brick demo runs.
2008-10-10cell: updates in response to draw's struct vertex_info changesBrian Paul
2008-10-09cell: implement basic TXP instruction in fragment shadersBrian Paul
Lots of restrictions for now (one 2D texture, no mipmaps, etc.) for now but basic texture demos work. TEX, TXD, TXP do the same thing for the time being.
2008-10-09cell: better immediate value allocation, better commentsBrian Paul
2008-10-09cell: massage the emit functions to get better instruction schedulingBrian Paul
2008-10-09cell: more accurate commentsBrian Paul
2008-10-08cell: implement function calls from shader code. fslight demo runs now.Brian Paul
Used for SIN, COS, EXP2, LOG2, POW instructions. TEX next. Fixed some bugs in MIN, MAX, DP3, DP4, DPH instructions. In rtasm code: Special-case spe_lqd(), spe_stqd() functions so they take byte offsets but low-order 4 bits are shifted out. This makes things consistant with SPU assembly language conventions. Added spe_get_registers_used() function.
2008-10-08cell: implement more built-in shader functions, link spu code with -lmBrian Paul
2008-10-08cell: increase SPU_MAX_FRAGMENT_PROGRAM_INSTSBrian Paul
2008-10-07cell: add support for fragment shader constant buffersBrian Paul
2008-10-07cell: fix incorrect extended swizzle term code in get_src_reg()Brian Paul
2008-10-07cell: fix formattingBrian Paul
2008-10-07cell: remove old codeBrian Paul
2008-10-07cell: memset() key to zeroBrian Paul
2008-10-07cell: use new keymap to save/re-use fragment ops codeBrian Paul
2008-10-03CELL: changes to generate SPU code for stencilingRobert Ellison
This set of code changes are for stencil code generation support. Both one-sided and two-sided stenciling are supported. In addition to the raw code generation changes, these changes had to be made elsewhere in the system: - Added new "register set" feature to the SPE assembly generation. A "register set" is a way to allocate multiple registers and free them all at the same time, delegating register allocation management to the spe_function unit. It's quite useful in complex register allocation schemes (like stenciling). - Added and improved SPE macro calculations. These are operations between registers and unsigned integer immediates. In many cases, the calculation can be performed with a single instruction; the macros will generate the single instruction if possible, or generate a register load and register-to-register operation if not. These macro functions are: spe_load_uint() (which has new ways to load a value in a single instruction), spe_and_uint(), spe_xor_uint(), spe_compare_equal_uint(), and spe_compare_greater_uint(). - Added facing to fragment generation. While rendering, the rasterizer needs to be able to determine front- and back-facing fragments, in order to correctly apply two-sided stencil. That requires these changes: - Added front_winding field to the cell_command_render block, so that the state tracker could communicate to the rasterizer what it considered to be the front-facing direction. - Added fragment facing as an input to the fragment function. - Calculated facing is passed during emit_quad().
2008-09-26cell: checkpoint: more work in emit_function_call()Brian Paul
Simple function call works now, but we don't save/restore the caller's registers yet.
2008-09-26cell: stub-out sin/cos function bodies to avoid trashing caller's stack for nowBrian Paul
2008-09-26cell: move command processing code into new spu_command.c fileBrian Paul
2008-09-26cell: move debug-related declarationsBrian Paul
2008-09-26cell: move debug macros into new spu_debug.hBrian Paul
2008-09-26cell: move really_clear_tiles()Brian Paul
2008-09-26cell: align instruction buffers to 8-byte, not 32-byte boundaryBrian Paul
2008-09-26cell: asst clean-up, var renamingBrian Paul
2008-09-26cell: remove unneeded blend/depth_stencil subclassesBrian Paul
2008-09-26cell: checkpoint: support for function calls in SPU shadersBrian Paul
Will be used for instructions like SIN/COS/POW/TEX/etc. The PPU needs to know the address of some functions in the SPU address space. Send that info to the PPU/main memory rather than patch up shaders on the SPU side. Not finished/tested yet...
2008-09-26cell: inst reorder to save a cycleBrian Paul
2008-09-23CELL: fix colormask code generationRobert Ellison
The colormask code generation had assumed that its input packed pixels were in RGBA format. In fact, the format they're in is dependent on the pipe color format. Now the color format is passed in to gen_colormask(), and proper color format-dependent SPU code is generated.
2008-09-22cell: Fixed bug with absolute, negate, set-negative logic in source fetch ↵Jonathan White
for TGSI instructions. The logic should operate on the origin channel not the swizzled channel. Please enter the commit message for your changes.
2008-09-22cell: Added TRUNC, SWZ (extended) and XPD instructions, verified against ↵Jonathan White
softpipe. Optimized FLR and FRC. Fixed writeback logic for DP3, DP4 and DPH.
2008-09-22cell: Added DPH instruction and verified against softpipe.Jonathan White
2008-09-19cell: make sure the fragment ops and fragment shader code buffer is at a ↵Brian Paul
32-byte boundary To make sure even/odd instructions hit the right pipes.
2008-09-19cell: Added FRC instructionJonathan White
2008-09-19cell: Added FLR instruction. Verified the following instructions match ↵Jonathan White
softpipe: MOV, ADD, MUL, SGE, SUB, MAD, ABS, SLT, MIN, MAX, LRP, DP3, DP4, CMP, FLR
2008-09-19cell: Fixed bugs with DP3 and DP4, they match softpipe results now.Jonathan White
2008-09-19cell: flesh out support for other Z/stencil formatBrian Paul
Also: improve float/int Z conversion. Use clgt instead of cgt in depth test since we're comparing unsigned values.
2008-09-19cell: issue warning to stderr when using fallback fragment opsBrian Paul
2008-09-19cell: fix a commentBrian Paul
2008-09-19cell: the test for CELL_DEBUG_FRAGMENT_OP_FALLBACK in ↵Brian Paul
cmd_state_fragment_ops() was inverted
2008-09-19cell: change spe_complement() to take a src and dst reg, like other instructionsBrian Paul
2008-09-19CELL: add codegen for logic op, color maskRobert Ellison
- rtasm_ppc_spe.c, rtasm_ppc_spe.h: added a new macro function "spe_load_uint" for loading and splatting unsigned integers in a register; it will use "ila" for values 18 bits or less, "ilh" for word values that are symmetric across halfwords, "ilhu" for values that have zeroes in their bottom halfwords, or "ilhu" followed by "iohl" for general 32-bit values. Of the 15 color masks of interest, 4 are 18 bits or less, 2 are symmetric across halfwords, 3 are zero in the bottom halfword, and 6 require two instructions to load. - cell_gen_fragment.c: added full codegen for logic op and color mask.
2008-09-18CELL: mark several transient files as .gitignoreRobert Ellison
progs/demos: added new demo "fbo_firecube" progs/glsl: added new demo "pointcoord" src/gallium/drivers/cell/spu: added the g3d_spu executable, a Cell SPU executable file, which seems to be occasionally built as part of the cell driver src/glu/sgi: added "exptmp", a byproduct of the "mklib" process that sometimes gets deleted and sometimes not.
2008-09-18cell: Added CMP instructionJonathan White