summaryrefslogtreecommitdiff
path: root/src/gallium/auxiliary/rtasm
AgeCommit message (Collapse)Author
2010-08-19translate_sse: try to fix Win64Luca Barbieri
Not sure whether it works now (it is still disabled).
2010-08-16rtasm: add minimal x86-64 support and new instructions (v5)Luca Barbieri
Changes in v5: - Add sse2_movdqa Changes in v4: - Use _WIN64 instead of WIN64 Changes in v3: - Add target and target caps functions, so that they could be different in principle from the current CPU and they don't need #ifs to check Changes in v2: - Win64 support (untested) - Use u_cpu_detect.h constants instead of #ifs This commit adds minimal x86-64 support: only movs between registers are supported for r8-r15, and x64_rexw() must be used to ask for 64-bit operations. It also adds several new instructions for the new translate_sse code. movdqa
2010-08-16translate: add support for 8/16-bit indicesLuca Barbieri
Currently, only 32-bit indices are supported, but some use cases translate needs support for all types.
2010-05-13rtasm: fix void * / func * casting warningsBrian Paul
2010-03-26rtasm: Silence gnu_printf format warnings.Vinson Lee
2010-03-25rtasm: Fix typos in comments.Vinson Lee
2010-02-02gallium: Move p_thread.h and p_atomic.h out of gallium interfaces.José Fonseca
Into os/os_thread.h and util/u_atomic.h respectively.
2010-01-05Merge branch 'master' into instanced-arraysMichal Krol
Conflicts: src/gallium/auxiliary/tgsi/tgsi_dump.c src/gallium/include/pipe/p_shader_tokens.h
2010-01-04Merge branch 'mesa_7_7_branch'Brian Paul
Conflicts: docs/relnotes.html src/gallium/drivers/llvmpipe/lp_tex_sample_c.c src/gallium/drivers/r300/r300_cs.h src/mesa/drivers/dri/i965/brw_wm_surface_state.c src/mesa/main/enums.c
2010-01-03rtasm: Play nicely with Windows's Data Execution Prevention.José Fonseca
2010-01-01gallium: Generate a single library for auxiliaries with Make too.José Fonseca
2010-01-01scons: Aggregate all tiny libraries in a single library.José Fonseca
Makes integration of gallium into out of tree components much easier. No pratical change for components in this tree,
2009-12-30rtasm: Add x86_div().Michal Krol
2009-07-16rtasm: export sse_movmskpsKeith Whitwell
2009-06-15rtasm: Use 32bit constant.José Fonseca
As we're only using 32bit bitmasks.
2009-04-10rtasm: Silence `static function not used' warning.Michal Krol
2009-03-25Add #ifdefs needed to compile Gallium on Solaris with gcc or Sun ccAlan Coopersmith
Signed-off-by: Alan Coopersmith <alan.coopersmith@sun.com>
2009-02-25gallium: Add support for BSD operating systems, tested with FreeBSDBenjamin Close
BSD supports pipe in the same way as linux hence options which are safe for linux are also safe for BSD. Define PIPE_OS_BSD in include/pipe/p_config.h and adjust the defines to make use of it. Also define MAP_ANONYMOUS for BSD systems which use MAP_ANON Signed-off-by: Benjamin Close <Benjamin.Close@clearchain.com>
2009-02-20gallium: Improve makefiles for librariesJakob Bornecrantz
The template makefile that most libraries in gallium included was based on dri and had a bunch unrelevant junk in it. Update it and improve the depending makefiles.
2009-02-18util: Move p_debug.h into util module.José Fonseca
The debug functions depend on several util function for os abstractions, and these depend on debug functions, so a seperate module is not possible.
2009-01-11cell: datatype clean-ups in SPE rtasmBrian Paul
2009-01-10gallium: added comment/annotation support to PPC rtasmBrian Paul
2009-01-10gallium: s/false/FALSE/Brian Paul
2008-12-30rtasm: Remove spurious semi-colons after function bodies.José Fonseca
2008-11-21CELL: use variant-length fragment ops programsRobert Ellison
This is a set of changes that optimizes the memory use of fragment operation programs (by using and transmitting only as much memory as is needed for the fragment ops programs, instead of maximal sizes), as well as eliminate the dependency on hard-coded maximal program sizes. State that is not dependent on fragment facing (i.e. that isn't using two-sided stenciling) will only save and transmit a single fragment operation program, instead of two identical programs. - Added the ability to emit a LNOP (No Operation (Load)) instruction. This is used to pad the generated fragment operations programs to a multiple of 8 bytes, which is necessary for proper operation of the dual instruction pipeline, and also required for proper SPU-side decoding. - Added the ability to allocate and manage a variant-length struct cell_command_fragment_ops. This structure now puts the generated function field at the end, where it can be as large as necessary. - On the PPU side, we now combine the generated front-facing and back-facing code into a single variant-length buffer (and only use one if the two sets of code are identical) for transmission to the SPU. - On the SPU side, we pull the correct sizes out of the buffer, allocate a new code buffer if the one we have isn't large enough, and save the code to that buffer. The buffer is deallocated when the SPU exits. - Commented out the emit_fetch() static function, which was not being used.
2008-11-13CELL: fix stencil twiddling, stencil invertRobert Ellison
Many stencil tests were failing because of a failure to read the stencil buffer, due to "twiddling" (or "untwiddling") "an unsupported texture format". This is fixed for the case of a stencil/Z S824Z format (which twiddles just like the 32-bit color formats). tests/stencilwrap.c was failing on the GL_INVERT test, because the emitted code for "spe_xori" turned out not to be an actual "xori" instruction, but rather a "stqd" instruction, because of a typo in the rtasm code. This is now fixed, and tests/stencil_wrap now works.
2008-11-12gallium: add missing prototypesBrian Paul
2008-11-12cell: move semicolons to silence warnings w/ other compilersBrian Paul
2008-11-12cell: fix typo in EMIT_ macroBrian Paul
2008-11-12rtasm: Use INLINE keyword. Compile for all platforms, not only GALLIUM_CELL.Michal Krol
2008-11-12rtasm: Compile only for GALLIUM_CELL.Michal Krol
2008-11-11CELL: two-sided stencil fixesRobert Ellison
With these changes, the tests/stencil_twoside test now works. - Eliminate blending from the stencil_twoside test, as it produces an unneeded dependency on having blending working - The spe_splat() function will now work if the register being splatted and the destination register are the same - Separate fragment code generated for front-facing and back-facing fragments. Often these are the same; if two-sided stenciling is on, they can be different. This is easier and faster than generating code that does both tests and merges the results. - Fixed a cut/paste bug where if the back Z-pass stencil operation were different from all the other operations, the back Z-fail results were incorrect.
2008-10-29gallium: grow SPE instruction buffer as neededBrian Paul
2008-10-29gallium: no longer pass max_inst to ppc_init_func()Brian Paul
2008-10-29gallium: use execmem for PPC code, grow instruction buffer as neededBrian Paul
2008-10-29gallium: fix alignment parameter passed to u_mmAllocMem()Brian Paul
Was 32, now 5. The param is expressed as a power of two exponent. The net effect is that the alignment was a no-op on X86 but on PPC we always got the same memory address everytime rtasm_exec_malloc() was called.
2008-10-29gallium: prefix memory manager functions with u_ to differentiate from ↵Brian Paul
functions in mesa/main/mm.c
2008-10-29gallium: test for PIPE_OS_LINUX instead of __linux__Brian Paul
2008-10-29gallium: added ppc_vnmsubfp()Brian Paul
2008-10-23scons: ppc support.Michel Dänzer
2008-10-22gallium: remove ppc_vload_float(), rename ppc_vecmove() -> ppc_vmove().Brian Paul
2008-10-22gallium: added ppc_vzero()Brian Paul
2008-10-22gallium: added ppc_vload_float(), for limited casesBrian Paul
2008-10-22gallium: fix-up confusing register allocation masks in rtasm_ppc.cBrian Paul
Plus, add ppc_reserve_register() func.
2008-10-22gallium: added ppc_lvewx()Brian Paul
2008-10-22cell: implement many more PPC instructions for code genBrian Paul
2008-10-10cell: add emit_RI10s() which does range checking on the 10-bit signed ↵Brian Paul
immediate field This type of checking should be expanded to cover more instructions...
2008-10-10cell: additional 'offset' checking in spe_lqd(), spe_stqd()Brian Paul
2008-10-10cell: fix assertions in spe_lqd(), spe_stqd()Brian Paul
2008-10-10CELL: fixing stencil bugsRobert Ellison
These are the defects found and fixed so far. Several more have been observed; I'm working on them. - Fixed an error in spe_load_uint() that caused incorrect values to be loaded if the given unsigned value had the low 18 bits as 0, and that caused inefficient code to be emitted if the given value had the high 14 bits as 0. - Fixed a problem in stencil code generation where optional registers weren't tracked correctly. - Fixed a problem that the stencil function NEVER was acting as ALWAYS. - Fixed several problems that could occur if stenciling were enabled but depth was disabled. - Fixed a problem with two-sided stencil writemask handling that could cause a stencil writemask to not be applied. - Fixed several state permutations that were incorrectly flagged as not requiring stencil values to be calculated.