summaryrefslogtreecommitdiff
path: root/src/gallium
AgeCommit message (Collapse)Author
2008-09-30Gallivm: port to llvm 2.4.Stephane Marchesin
2008-10-01util: Fix util_fast_pow/exp2/log2.José Fonseca
- Use a lookup table for log2. - Compute (float) (1 << ipart) by tweaking with the exponent directly to avoid integer overflow and float conversion. - Also table negative exponents to avoid float division and branching. - Implement util_fast_exp as function of util_fast_exp2.
2008-09-30tgsi: SSE2 optimized exp2, log2 and pow implementations.José Fonseca
Special care must be taken when calling compiler generated SSE2 functions from the runtime generated SSE2: saving the xmm registers, and notify gcc the stack is not 16byte aligned. It would be more efficient to keep the stack pointer 16byte aligned, but too hairy, and not consistent in all x86 architectures. This has been tested in linux x86 and windows x86 userspace. Not tested on x86-64 because it is broken for other reasons (even without this change).
2008-09-30util: Header for SSE2 intrinsics portability.José Fonseca
2008-09-29rtasm: Implement immediate group 1 instructions. Fix SIB emition.José Fonseca
2008-09-28Gallivm: fix off-by-one.Stephane Marchesin
2008-09-28Gallivm: more instructions.Stephane Marchesin
2008-09-28Gallivm: make it compile again, add some opcodes.Stephane Marchesin
2008-09-26cell: checkpoint: more work in emit_function_call()Brian Paul
Simple function call works now, but we don't save/restore the caller's registers yet.
2008-09-26cell: stub-out sin/cos function bodies to avoid trashing caller's stack for nowBrian Paul
2008-09-26gallium: SPU register commentsBrian Paul
2008-09-26cell: move command processing code into new spu_command.c fileBrian Paul
2008-09-26cell: move debug-related declarationsBrian Paul
2008-09-26cell: move debug macros into new spu_debug.hBrian Paul
2008-09-26cell: move really_clear_tiles()Brian Paul
2008-09-26cell: align instruction buffers to 8-byte, not 32-byte boundaryBrian Paul
2008-09-26cell: asst clean-up, var renamingBrian Paul
2008-09-26cell: remove unneeded blend/depth_stencil subclassesBrian Paul
2008-09-26cell: checkpoint: support for function calls in SPU shadersBrian Paul
Will be used for instructions like SIN/COS/POW/TEX/etc. The PPU needs to know the address of some functions in the SPU address space. Send that info to the PPU/main memory rather than patch up shaders on the SPU side. Not finished/tested yet...
2008-09-26cell: inst reorder to save a cycleBrian Paul
2008-09-26st: change from ** to * for st_unreference_framebuffer()Alan Hourihane
2008-09-26util: Update fast_log2 article url.José Fonseca
2008-09-24add cso_hash_contains() functionAlan Hourihane
2008-09-23CELL: fix colormask code generationRobert Ellison
The colormask code generation had assumed that its input packed pixels were in RGBA format. In fact, the format they're in is dependent on the pipe color format. Now the color format is passed in to gen_colormask(), and proper color format-dependent SPU code is generated.
2008-09-23CELL: improve legibility of CELL_DEBUG environment variable outputRobert Ellison
2008-09-22cell: Fixed bug with absolute, negate, set-negative logic in source fetch ↵Jonathan White
for TGSI instructions. The logic should operate on the origin channel not the swizzled channel. Please enter the commit message for your changes.
2008-09-22cell: Added TRUNC, SWZ (extended) and XPD instructions, verified against ↵Jonathan White
softpipe. Optimized FLR and FRC. Fixed writeback logic for DP3, DP4 and DPH.
2008-09-22cell: Added DPH instruction and verified against softpipe.Jonathan White
2008-09-19cell: use different opcodes for spe_move() depending on even/odd addressBrian Paul
2008-09-19cell: make sure the fragment ops and fragment shader code buffer is at a ↵Brian Paul
32-byte boundary To make sure even/odd instructions hit the right pipes.
2008-09-19gallium: added ALIGN32_ATTRIBBrian Paul
2008-09-19gallium: added spe_code_size()Brian Paul
2008-09-19cell: Added FRC instructionJonathan White
2008-09-19cell: Added FLR instruction. Verified the following instructions match ↵Jonathan White
softpipe: MOV, ADD, MUL, SGE, SUB, MAD, ABS, SLT, MIN, MAX, LRP, DP3, DP4, CMP, FLR
2008-09-19cell: Fixed bugs with DP3 and DP4, they match softpipe results now.Jonathan White
2008-09-19cell: flesh out support for other Z/stencil formatBrian Paul
Also: improve float/int Z conversion. Use clgt instead of cgt in depth test since we're comparing unsigned values.
2008-09-19cell: issue warning to stderr when using fallback fragment opsBrian Paul
2008-09-19cell: fix a commentBrian Paul
2008-09-19cell: the test for CELL_DEBUG_FRAGMENT_OP_FALLBACK in ↵Brian Paul
cmd_state_fragment_ops() was inverted
2008-09-19cell: disable XShmPutImage for tiled surface for nowBrian Paul
Multiple displays of same surface data causes pixels to get scrambled.
2008-09-19cell: change spe_complement() to take a src and dst reg, like other instructionsBrian Paul
2008-09-19Merge branch 'gallium-0.2' of ssh+git://git.freedesktop.org/git/mesa/mesa ↵José Fonseca
into gallium-0.2
2008-09-19util: Use OpenGL rasterization rules in blits and mipmap generation.José Fonseca
2008-09-19i915simple: Use defined MSAA arrayJakob Bornecrantz
2008-09-19fix for MSAAAlan Hourihane
2008-09-19i915simple: Front flushing does workJakob Bornecrantz
2008-09-19i915simple: Fix extentions being loadedJakob Bornecrantz
2008-09-19CELL: add codegen for logic op, color maskRobert Ellison
- rtasm_ppc_spe.c, rtasm_ppc_spe.h: added a new macro function "spe_load_uint" for loading and splatting unsigned integers in a register; it will use "ila" for values 18 bits or less, "ilh" for word values that are symmetric across halfwords, "ilhu" for values that have zeroes in their bottom halfwords, or "ilhu" followed by "iohl" for general 32-bit values. Of the 15 color masks of interest, 4 are 18 bits or less, 2 are symmetric across halfwords, 3 are zero in the bottom halfword, and 6 require two instructions to load. - cell_gen_fragment.c: added full codegen for logic op and color mask.
2008-09-19gallium: Fix doxygen comments.José Fonseca
2008-09-19softpipe: Obey const qualifier.José Fonseca