summaryrefslogtreecommitdiff
path: root/src/gallium/auxiliary/rtasm
AgeCommit message (Collapse)Author
2008-10-10cell: add emit_RI10s() which does range checking on the 10-bit signed ↵Brian Paul
immediate field This type of checking should be expanded to cover more instructions...
2008-10-10cell: additional 'offset' checking in spe_lqd(), spe_stqd()Brian Paul
2008-10-10cell: fix assertions in spe_lqd(), spe_stqd()Brian Paul
2008-10-10CELL: fixing stencil bugsRobert Ellison
These are the defects found and fixed so far. Several more have been observed; I'm working on them. - Fixed an error in spe_load_uint() that caused incorrect values to be loaded if the given unsigned value had the low 18 bits as 0, and that caused inefficient code to be emitted if the given value had the high 14 bits as 0. - Fixed a problem in stencil code generation where optional registers weren't tracked correctly. - Fixed a problem that the stencil function NEVER was acting as ALWAYS. - Fixed several problems that could occur if stenciling were enabled but depth was disabled. - Fixed a problem with two-sided stencil writemask handling that could cause a stencil writemask to not be applied. - Fixed several state permutations that were incorrectly flagged as not requiring stencil values to be calculated.
2008-10-10Merge commit 'origin/gallium-0.1' into gallium-0.2Keith Whitwell
Conflicts: src/gallium/auxiliary/gallivm/instructionssoa.cpp src/gallium/auxiliary/gallivm/soabuiltins.c src/gallium/auxiliary/rtasm/rtasm_x86sse.c src/gallium/auxiliary/rtasm/rtasm_x86sse.h src/mesa/main/texenvprogram.c src/mesa/shader/arbprogparse.c src/mesa/shader/prog_statevars.c src/mesa/state_tracker/st_draw.c src/mesa/vbo/vbo_exec_draw.c
2008-10-09cell: fix incorrect bitmask in spe_load_uint()Brian Paul
2008-10-08cell: implement function calls from shader code. fslight demo runs now.Brian Paul
Used for SIN, COS, EXP2, LOG2, POW instructions. TEX next. Fixed some bugs in MIN, MAX, DP3, DP4, DPH instructions. In rtasm code: Special-case spe_lqd(), spe_stqd() functions so they take byte offsets but low-order 4 bits are shifted out. This makes things consistant with SPU assembly language conventions. Added spe_get_registers_used() function.
2008-10-08gallium: asst. clean-upsBrian Paul
Don't use register qualifier. Doxygen-ize comments. Remove 'extern'.
2008-10-08gallium: better instruction printing for SPE codeBrian Paul
2008-10-06gallium: replace assertion with conditional/recovery codeBrian
The assertion failed when we ran out of exec memory. Found with conform texcombine test.
2008-10-06rtasm: fix debug buildKeith Whitwell
2008-10-03CELL: changes to generate SPU code for stencilingRobert Ellison
This set of code changes are for stencil code generation support. Both one-sided and two-sided stenciling are supported. In addition to the raw code generation changes, these changes had to be made elsewhere in the system: - Added new "register set" feature to the SPE assembly generation. A "register set" is a way to allocate multiple registers and free them all at the same time, delegating register allocation management to the spe_function unit. It's quite useful in complex register allocation schemes (like stenciling). - Added and improved SPE macro calculations. These are operations between registers and unsigned integer immediates. In many cases, the calculation can be performed with a single instruction; the macros will generate the single instruction if possible, or generate a register load and register-to-register operation if not. These macro functions are: spe_load_uint() (which has new ways to load a value in a single instruction), spe_and_uint(), spe_xor_uint(), spe_compare_equal_uint(), and spe_compare_greater_uint(). - Added facing to fragment generation. While rendering, the rasterizer needs to be able to determine front- and back-facing fragments, in order to correctly apply two-sided stencil. That requires these changes: - Added front_winding field to the cell_command_render block, so that the state tracker could communicate to the rasterizer what it considered to be the front-facing direction. - Added fragment facing as an input to the fragment function. - Calculated facing is passed during emit_quad().
2008-10-03rtasm: add sse_movntpsKeith Whitwell
2008-10-02rtasm: add prefetch instructionsKeith Whitwell
2008-10-02rtasm: add prefetch instructionsKeith Whitwell
2008-09-29rtasm: Implement immediate group 1 instructions. Fix SIB emition.José Fonseca
2008-09-26gallium: SPU register commentsBrian Paul
2008-09-19cell: use different opcodes for spe_move() depending on even/odd addressBrian Paul
2008-09-19gallium: added spe_code_size()Brian Paul
2008-09-19cell: change spe_complement() to take a src and dst reg, like other instructionsBrian Paul
2008-09-19CELL: add codegen for logic op, color maskRobert Ellison
- rtasm_ppc_spe.c, rtasm_ppc_spe.h: added a new macro function "spe_load_uint" for loading and splatting unsigned integers in a register; it will use "ila" for values 18 bits or less, "ilh" for word values that are symmetric across halfwords, "ilhu" for values that have zeroes in their bottom halfwords, or "ilhu" followed by "iohl" for general 32-bit values. Of the 15 color masks of interest, 4 are 18 bits or less, 2 are symmetric across halfwords, 3 are zero in the bottom halfword, and 6 require two instructions to load. - cell_gen_fragment.c: added full codegen for logic op and color mask.
2008-09-18CELL: finish fragment ops blending (except for unusual D3D modes)Robert Ellison
- Added new "macro" functions spe_float_min() and spe_float_max() to rtasm_ppc_spe.{ch}. These emit instructions that cause the minimum or maximum of each element in a vector of floats to be saved in the destination register. - Major changes to cell_gen_fragment.c to implement all the blending modes (except for the mysterious D3D-based PIPE_BLENDFACTOR_SRC1_COLOR, PIPE_BLENDFACTOR_SRC1_ALPHA, PIPE_BLENDFACTOR_INV_SRC1_COLOR, and PIPE_BLENDFACTOR_INV_SRC1_ALPHA). - Some revamping of code in cell_gen_fragment.c: use the new spe_float_min() and spe_float_max() functions (instead of expanding these calculations inline via macros); create and use an inline utility function for handling "optional" register allocation (for the {1,1,1,1} vector, and the blend color vectors) instead of expanding with macros; use the Float Multiply and Subtract (fnms) instruction to simplify and optimize many blending calculations.
2008-09-15gallium: emit SPU instructions in assembler-compatible syntaxBrian Paul
2008-09-15Fixed emit_RRRJonathan White
2008-09-12gallium: added print/dump code to SPE code emitterBrian Paul
2008-09-12gallium: added spe_splat_word()Brian Paul
2008-09-12gallium: minor optimization to spe_load_int()Brian Paul
2008-09-12gallium: initial PPC/Altivec codegenBrian Paul
2008-09-11gallium: add special cases in spe_load_float(), spe_load_int(), added ↵Brian Paul
spe_splat()
2008-09-11gallium: assorted additions and fixes to Cell SPE rtasm codeBrian Paul
Fix incorrect opcode for fsmbi. Added "macro" functions for loading floats/ints, register complement, zero, move. Added #defines for return address and stack pointer registers. Added assertions to check that the instruction buffer doesn't overflow.
2008-09-11gallium: comments, assertions, etcBrian Paul
2008-09-01gallium: include u_pointer,h, not p_pointer.hBrian Paul
2008-08-26gallium: thread wrapper clean-upBrian Paul
In p_thread.h replace _glthread_* functions with new pipe_* functions. Remove other old cruft.
2008-08-24gallium: refactor/replace p_util.h with util/u_memory.h and util/u_math.hBrian Paul
Also, rename p_tile.[ch] to u_tile.[ch]
2008-06-02rtasm: Use enum sse_cc in sse_cmpps.José Fonseca
2008-05-30Merge branch 'gallium-tex-surfaces' into gallium-0.1Jakob Bornecrantz
Conflicts: src/gallium/drivers/i915simple/i915_context.h
2008-05-28rtasm: special case for [ebp]Keith Whitwell
2008-05-28Merge branch 'gallium-vertex-linear' into gallium-0.1Jakob Bornecrantz
Conflicts: src/gallium/auxiliary/draw/draw_pt_varray.c
2008-05-28gallium: Windows CE portability fixes.José Fonseca
2008-05-24rtasm: add some helpers for calling out from generated codeKeith Whitwell
2008-05-23rtasm: remove debugKeith Whitwell
2008-05-23rtasm: add x87 instructions and debug-check for x87 stack usageKeith Whitwell
2008-05-23rtasm: export debug reg print functionKeith Whitwell
2008-05-23rtasm: remove unused struct memberKeith Whitwell
2008-05-01rtasm: learn another version of pushKeith Whitwell
2008-05-01rtasm: fix labels after (not so) recent change to allow dynamic fn growthKeith Whitwell
Using char * for labels doesn't work if you realloc the function during assembly and free the old storage...
2008-05-01rtasm: assert stack is fully popped in returnKeith Whitwell
2008-04-28gallium: Generate SSE code to swizzle and unswizzle vs inputs and outputs.Michal Krol
Change SSE_SWIZZLES #define to 0 to disable it.
2008-04-27rtasm: Implement x86_retw.José Fonseca
2008-04-21rtasm: debug some missing funcsKeith Whitwell