Christophe Gisquet 
							
						 
					 
					
						
						
						
						
							
						
						
							996697e266 
							
						 
					 
					
						
						
							
							x86: float dsp: unroll SSE versions  
						
						... 
						
						
						
						vector_fmul and vector_fmac_scalar are guaranteed that they can process in
batch of 16 elements, but their SSE versions only does 8 at a time.
Therefore, unroll them a bit.
299 to 261c for 256 elements in vector_fmac_scalar on Arrandale/Win64.
Signed-off-by: Janne Grunau <janne-libav@jannau.net> 
						
						
					 
					
						2014-02-20 14:18:05 +01:00 
						 
				 
			
				
					
						
							
							
								Christophe Gisquet 
							
						 
					 
					
						
						
						
						
							
						
						
							566b7a20fd 
							
						 
					 
					
						
						
							
							x86: float dsp: butterflies_float SSE  
						
						... 
						
						
						
						97c -> 49c
Some codecs could benefit from more unrolling, but AAC doesn't. 
						
						
					 
					
						2013-05-03 08:08:02 +02:00 
						 
				 
			
				
					
						
							
							
								Ronald S. Bultje 
							
						 
					 
					
						
						
						
						
							
						
						
							d56668bd80 
							
						 
					 
					
						
						
							
							floatdsp: move scalarproduct_float from dsputil to avfloatdsp.  
						
						... 
						
						
						
						This makes the aac decoder and all voice codecs independent of dsputil. 
						
						
					 
					
						2013-01-22 11:55:42 -08:00 
						 
				 
			
				
					
						
							
							
								Ronald S. Bultje 
							
						 
					 
					
						
						
						
						
							
						
						
							42d3246948 
							
						 
					 
					
						
						
							
							floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp.  
						
						... 
						
						
						
						Now, nellymoserenc and aacenc no longer depends on dsputil. Independent
of this patch, wmaprodec also does not depend on dsputil, so I removed
it from there also. 
						
						
					 
					
						2013-01-22 11:55:42 -08:00 
						 
				 
			
				
					
						
							
							
								Ronald S. Bultje 
							
						 
					 
					
						
						
						
						
							
						
						
							55aa03b9f8 
							
						 
					 
					
						
						
							
							floatdsp: move vector_fmul_add from dsputil to avfloatdsp.  
						
						
						
						
					 
					
						2013-01-22 11:55:42 -08:00 
						 
				 
			
				
					
						
							
							
								Justin Ruggles 
							
						 
					 
					
						
						
						
						
							
						
						
							1c012e6bfb 
							
						 
					 
					
						
						
							
							x86: float_dsp: fix loading of the len parameter on x86-32  
						
						
						
						
					 
					
						2012-12-07 21:19:29 -05:00 
						 
				 
			
				
					
						
							
							
								Justin Ruggles 
							
						 
					 
					
						
						
						
						
							
						
						
							ecc8b02194 
							
						 
					 
					
						
						
							
							x86: float_dsp: fix compilation of ff_vector_dmul_scalar_avx() on x86-32  
						
						... 
						
						
						
						Signed-off-by: Janne Grunau <janne-libav@jannau.net> 
						
						
					 
					
						2012-12-06 14:11:15 +01:00 
						 
				 
			
				
					
						
							
							
								Justin Ruggles 
							
						 
					 
					
						
						
						
						
							
						
						
							ac7eb4cb20 
							
						 
					 
					
						
						
							
							float_dsp: add vector_dmul_scalar() to multiply a vector of doubles  
						
						... 
						
						
						
						Include x86-optimized versions for SSE2 and AVX. 
						
						
					 
					
						2012-12-05 11:23:36 -05:00 
						 
				 
			
				
					
						
							
							
								Justin Ruggles 
							
						 
					 
					
						
						
						
						
							
						
						
							947f933687 
							
						 
					 
					
						
						
							
							x86: float_dsp: add SSE version of vector_fmul_scalar()  
						
						
						
						
					 
					
						2012-11-26 11:30:19 -05:00 
						 
				 
			
				
					
						
							
							
								Diego Biurrun 
							
						 
					 
					
						
						
						
						
							
						
						
							2b479bcab0 
							
						 
					 
					
						
						
							
							build: Drop AVX assembly ifdefs  
						
						... 
						
						
						
						An assembler able to cope with AVX instructions is now required. 
						
						
					 
					
						2012-11-11 20:43:28 +01:00 
						 
				 
			
				
					
						
							
							
								Diego Biurrun 
							
						 
					 
					
						
						
						
						
							
						
						
							6860b4081d 
							
						 
					 
					
						
						
							
							x86: include x86inc.asm in x86util.asm  
						
						... 
						
						
						
						This is necessary to allow refactoring some x86util macros with cpuflags. 
						
						
					 
					
						2012-10-31 00:37:42 +01:00 
						 
				 
			
				
					
						
							
							
								Justin Ruggles 
							
						 
					 
					
						
						
						
						
							
						
						
							7327525997 
							
						 
					 
					
						
						
							
							x86: float_dsp: fix ff_vector_fmac_scalar_avx() on Win64  
						
						... 
						
						
						
						The SWAP macro does not work for explicit xmm/ymm usage, so instead just move
the scalar value from xmm2 to xmm0. 
						
						
					 
					
						2012-09-07 14:49:10 -04:00 
						 
				 
			
				
					
						
							
							
								Diego Biurrun 
							
						 
					 
					
						
						
						
						
							
						
						
							17337f54c0 
							
						 
					 
					
						
						
							
							x86: Split inline and external assembly #ifdefs  
						
						
						
						
					 
					
						2012-08-31 01:53:25 +02:00 
						 
				 
			
				
					
						
							
							
								Mans Rullgard 
							
						 
					 
					
						
						
						
						
							
						
						
							a3df4781f4 
							
						 
					 
					
						
						
							
							x86: add colons after labels  
						
						... 
						
						
						
						nasm prints a warning if the colon is missing.
Signed-off-by: Mans Rullgard <mans@mansr.com> 
						
						
					 
					
						2012-08-07 15:20:56 +01:00 
						 
				 
			
				
					
						
							
							
								Ronald S. Bultje 
							
						 
					 
					
						
						
						
						
							
						
						
							30b45d9c38 
							
						 
					 
					
						
						
							
							x86inc: automatically insert vzeroupper for YMM functions.  
						
						
						
						
					 
					
						2012-07-26 13:43:16 -07:00 
						 
				 
			
				
					
						
							
							
								Justin Ruggles 
							
						 
					 
					
						
						
						
						
							
						
						
							82b2df9790 
							
						 
					 
					
						
						
							
							float_dsp: add x86-optimized functions for vector_fmac_scalar()  
						
						
						
						
					 
					
						2012-06-18 18:01:14 -04:00 
						 
				 
			
				
					
						
							
							
								Justin Ruggles 
							
						 
					 
					
						
						
						
						
							
						
						
							d5a7229ba4 
							
						 
					 
					
						
						
							
							Add a float DSP framework to libavutil  
						
						... 
						
						
						
						Move vector_fmul() from DSPContext to AVFloatDSPContext. 
						
						
					 
					
						2012-06-08 13:14:38 -04:00