android-x86-mesa.git - Androïd/x86 port of Mesa drivers

Age	Commit message (Collapse)	Author
2010-01-12	gallium: Simplify PIPE_ALIGN_VAR.	José Fonseca
	gcc allows pre-fix variable attributes. Suggested by Ian Romanick.
2010-01-12	gallium: Generalize the alignment macros to other compilers and any alignment.	José Fonseca

2010-01-06	gallium: remove PIPE_TEX_FILTER_ANISO	Luca Barbieri
	This patch removes PIPE_TEX_FILTER_ANISO. Anisotropic filtering is enabled if and only if max_anisotropy > 1.0. Values between 0.0 and 1.0, inclusive, of max_anisotropy are to be considered equivalent, and meaning to turn off anisotropic filtering. This approach has the small drawback of eliminating the possibility of enabling anisotropic filter on either minification or magnification separately, which Radeon hardware seems to support, is currently support by Gallium but not exposed to OpenGL. If this is actually useful it could be handled by splitting max_anisotropy in two values and adding an appropriate OpenGL extension. NOTE: some fiddling & reformatting by keithw to get this patch to apply. Hopefully nothing broken in the process.
2010-01-05	Remove TGSI_OPCODE_SHR, map existing usage to TGSI_OPCODE_ISHR.	Michal Krol
	This is to differentiate it from its unsigned version, TGSI_OPCODE_USHR.
2009-11-24	tgsi: rename fields of tgsi_full_src_register to reduce verbosity	Keith Whitwell
	SrcRegister -> Register SrcRegisterInd -> Indirect SrcRegisterDim -> Dimension SrcRegisterDimInd -> DimIndirect
2009-11-24	tgsi: rename fields of tgsi_full_dst_register to reduce verbosity	Keith Whitwell
	DstRegister -> Register DstRegisterInd -> Indirect
2009-11-24	tgsi: rename fields of tgsi_full_declaration to reduce verbosity	Keith Whitwell
	DeclarationRange -> Range
2009-11-24	tgsi: rename fields of tgsi_full_instruction to avoid excessive verbosity	Keith Whitwell
	InstructionPredicate -> Predicate InstructionLabel -> Label InstructionTexture -> Texture FullSrcRegisters -> Src FullDstRegisters -> Dst
2009-10-23	gallium: remove extended negate also, and also the ExtSwz token	Keith Whitwell
	Likewise, the extended negate functionality hasn't been used since mesa switched to using tgsi_ureg to build programs, and has been translating the SWZ opcode internally to a single MAD.
2009-10-23	cell: typo from ExtSwizzle commit	Keith Whitwell

2009-10-23	gallium: remove the swizzling parts of ExtSwizzle	Keith Whitwell
	These haven't been used by the mesa state tracker since the conversion to tgsi_ureg, and it seems that none of the other state trackers are using it either. This helps simplify one of the biggest suprises when starting off with TGSI shaders.
2009-10-23	gallium: remove noise opcodes	Keith Whitwell
	Provide a dummy implementation in the GL state tracker (move 0.5 to the destination regs). At some point, a motivated person could add a better implementation of noise. Currently not even the nvidia binary drivers do anything more than this. In any case, the place to do this is in the GL state tracker, not the poor driver.
2009-09-01	tgsi: remove redundant CND0 opcode	Keith Whitwell
	Can be implemented with CMP src2, src1, src0
2009-07-31	Rename TGSI LOOP instruction to better match theri usage.	Michal Krol
	The LOOP/ENDLOOP pair is renamed to BGNFOR/ENDFOR as its behaviour is similar to a C language for-loop. The BGNLOOP2/ENDLOOP2 pair is renamed to BGNLOOP/ENDLOOP as now there is no name collision.
2009-07-23	gallium: remove deprecated TGSI opcodes	Keith Whitwell
	Various opcodes which can be implemented trivially with other TGSI opcodes, such as matrix multiplication and negation. These were not used by any state tracker or implemented by any of the drivers.
2009-07-22	cell: update TGSI_OPCODE_ cases	Brian Paul

2009-05-21	cell: perform triangle cull a little earlier	Jonathan Adamczewski
	In spu_tri.c:setup_sort_vertices() triangles are culled after the vertices are sorted. This patch moves the check a little earlier and performs the actual check a little faster through intrinsics and a little trickery. Reduced code size and less work is done before a triangle is deemed OK to skip.
2009-05-21	cell: unroll inner loop of spu_render.c:cmd_render()	Jonathan Adamczewski
	It was taking approximately 50 cycles to extract the vertex indices, calculate the vertex_header pointers and call tri_draw() for each three vertices - . Unrolled, it takes less than 100 cycles to extract, unpack, calculate pointers and call tri_draw() eight times. It does have a nasty jump-tabled switch. I'm sure that there's a better way... Code size of spu_render.o gets larger due to the extra constants and work in the inner loop, there are extra stack saves and loads because there are more registers in use, and an assert. spu_tri.o gets a little smaller.
2009-02-18	util: Move p_debug.h into util module.	José Fonseca
	The debug functions depend on several util function for os abstractions, and these depend on debug functions, so a seperate module is not possible.
2009-02-16	cell: use some SPU intrinsics to get slightly better code in eval_inputs()	Brian Paul
	Suggested by Jonathan Adamczewski. There may be more places to do this...
2009-02-15	cell: minor Makefile clean-up	Brian Paul

2009-02-15	cell: new/tighter code for computing fragment program inputs	Brian Paul

2009-02-15	cell: combine eval_z(), eval_w() functions	Brian Paul

2009-02-07	cell: compile fix: alpha.ref is now alpha.ref_value	Brian Paul

2009-01-14	cell: Specify constant as float for CEILF().	Jonathan Adamczewski
	Without the f, the constant is treated as a double, resulting in slower arithmetic and libgcc conversion calls each time CEILF() is used.
2009-01-13	cell: Add missing suffix to SHUFFLE macro	Jonathan Adamczewski

2009-01-12	cell: allocate batch buffers w/ 16-byte alignment	Jonathan Adamczewski
	Replace cell_batch{align,alloc)*() with cell_batch_alloc16(), allocating multiples of 16 bytes that are 16 byte aligned. Opcodes are stored in preferred slot of SPU machine word. Various structures are explicitly padded to 16 byte multiples. Added STATIC_ASSERT().
2009-01-05	cell: SIMDize sorting in setup_sort_vertices()	Jonathan Adamczewski
	Put setup.v{min,mid,max,provoke} into a union with qword vertex_headers. Rewrite vertex sorting to more efficiently handle the packed data items. Reduces spu_tri.o by ~128 bytes.
2009-01-05	cell: SIMDize some subtractions	Jonathan Adamczewski
	Put edge.{dx,dy} into a union with a vector and perform subtractions in setup_sort_vertices() on vectors. Reduces spu_tri.o by ~300 bytes.
2009-01-04	cell: improvements to spu_tri.c	Jonathan Adamczewski
	Replace int setup.span{left,right}[2] with vec_uint4 setup.span.quad SIMDize calculate_mask() and inline into into flush_spans() Set setup.span.quad members using spu_shuffle() or spu_sel(). Reduces spu_tri.o by ~116 bytes.
2009-01-04	cell: new spu_shuffle.h header	Jonathan Adamczewski
	Facilitates creation of shuffle patterns for use with spu_shuffle() and si_shufb() intrinsics. To be used by subsequent patches.
2008-11-21	CELL: use variant-length fragment ops programs	Robert Ellison
	This is a set of changes that optimizes the memory use of fragment operation programs (by using and transmitting only as much memory as is needed for the fragment ops programs, instead of maximal sizes), as well as eliminate the dependency on hard-coded maximal program sizes. State that is not dependent on fragment facing (i.e. that isn't using two-sided stenciling) will only save and transmit a single fragment operation program, instead of two identical programs. - Added the ability to emit a LNOP (No Operation (Load)) instruction. This is used to pad the generated fragment operations programs to a multiple of 8 bytes, which is necessary for proper operation of the dual instruction pipeline, and also required for proper SPU-side decoding. - Added the ability to allocate and manage a variant-length struct cell_command_fragment_ops. This structure now puts the generated function field at the end, where it can be as large as necessary. - On the PPU side, we now combine the generated front-facing and back-facing code into a single variant-length buffer (and only use one if the two sets of code are identical) for transmission to the SPU. - On the SPU side, we pull the correct sizes out of the buffer, allocate a new code buffer if the one we have isn't large enough, and save the code to that buffer. The buffer is deallocated when the SPU exits. - Commented out the emit_fetch() static function, which was not being used.
2008-11-11	CELL: two-sided stencil fixes	Robert Ellison
	With these changes, the tests/stencil_twoside test now works. - Eliminate blending from the stencil_twoside test, as it produces an unneeded dependency on having blending working - The spe_splat() function will now work if the register being splatted and the destination register are the same - Separate fragment code generated for front-facing and back-facing fragments. Often these are the same; if two-sided stenciling is on, they can be different. This is easier and faster than generating code that does both tests and merges the results. - Fixed a cut/paste bug where if the back Z-pass stencil operation were different from all the other operations, the back Z-fail results were incorrect.
2008-10-30	CELL: stencil bug fixes	Robert Ellison
	Two definitive bugs in stenciling were fixed. The first, reversed registers in the generated Select Bytes (selb) instruction, caused the stenciling INCR and DECR operations to fail dramatically, putting new values in where old values were supposed to be and vice versa. The second caused stencil tiles to not be read and written from main memory by the SPUs. A per-spu flag, spu.read_depth, was used to indicate whether the SPU should be reading depth tiles, and was set only when depth was enabled. A second flag, spu.read_stencil, was set when stenciling was enabled, but never referenced. As stenciling and depth are in the same tiles on the Cell, and there is no corresponding TAG_WRITE_TILE_STENCIL to complement TAG_WRITE_TILE_COLOR and TAG_WRITE_TILE_Z, I fixed this by eliminating the unused "spu.read_stencil", renaming "spu.read_depth" to "spu.read_depth_stencil", and setting it if either stenciling or depth is enabled. I also added an optimization to the fragment ops generation code, that avoids calculating stencil values and/or stencil writemask when the stencil operations are all KEEP.
2008-10-29	cell: use simd utilities for pow, exp2, log2	Brian Paul

2008-10-28	cell: fix a number of fence issues	Brian Paul
	Plus add assertions to check status, alignment, etc.
2008-10-22	cell: implement fencing for texture buffers	Brian Paul
	If we delete a texture, we need to keep the underlying tiled data buffer around until any rendering that references it has completed. Keep a list of buffers referenced by a rendering batch. Unref/free them when the associated batch's fence is executed/signalled.
2008-10-17	cell: use an approximation in compute_lambda_2d() to avoid sqrt	Brian Paul
	Though, the logf() call still needs attention.
2008-10-17	cell: add new debug flag (cache) to report texture cache stats on exit	Brian Paul

2008-10-17	cell: use 7-bit weights in sample_texture_2d_bilinear_int()	Brian Paul
	This allows us to use 16-bit signed mul/add instructions. Had to used unsigned mul before and there's no unsigned mul/add instruction.
2008-10-16	cell: pass spu_texture_level ptr to get_four_texels()	Brian Paul

2008-10-16	cell: implement KIL instruction	Brian Paul

2008-10-16	cell: trilinear mipmap interpolation	Brian Paul

2008-10-16	cell: update comments	Brian Paul

2008-10-16	cell: call proper sampler function in sample_texture_cube()	Brian Paul

2008-10-16	cell: clean up various texture-related things	Brian Paul
	Distinguish among texture targets in codegen. progs/demos/cubemap.c runs correctly now too.
2008-10-15	cell: start some performance measurements	Brian Paul
	Use the spu_write_decrementer() and spu_read_decrementer() functions to measure time. Convert to milliseconds according to the system timebase value.
2008-10-15	cell: updated debug code	Brian Paul

2008-10-15	cell: get rid of last usage of float4 union/typedef	Brian Paul
	Results in slightly tighter code.
2008-10-15	cell: simplify triangle front/back face determination	Brian Paul