Janne Grunau
28a8b5413b
h264/aarch64: add intra loop filter neon asm
Add my NEON asm from x264, relicensed under the LGPL 2.1 or later. Ported
(x264 uses NV12 chroma) and optimized.
Cycle counts from checkasm --bench on a Snapdragon 820e:
h264_h_loop_filter_luma_intra_8bpp_c: 60.0
h264_h_loop_filter_luma_intra_8bpp_neon: 54.2
h264_v_loop_filter_luma_intra_8bpp_c: 148.3
h264_v_loop_filter_luma_intra_8bpp_neon: 73.8
h264_h_loop_filter_chroma_intra_8bpp_c: 27.8
h264_h_loop_filter_chroma_intra_8bpp_neon: 21.4
h264_h_loop_filter_chroma_mbaff_intra_8bpp_c: 15.8
h264_h_loop_filter_chroma_mbaff_intra_8bpp_neon: 15.7
h264_v_loop_filter_chroma_intra_8bpp_c: 45.8
h264_v_loop_filter_chroma_intra_8bpp_neon: 17.3
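
To make the relative gains easier to read, the cycle counts above can be turned into C-to-NEON speedup ratios. This is only an illustrative sketch: the `cycles` table and the `speedup` helper are hypothetical names introduced here, and the numbers are copied from the checkasm output quoted in this commit message.

```python
# Cycle counts copied from the checkasm --bench output above: (C, NEON).
cycles = {
    "h264_h_loop_filter_luma_intra_8bpp": (60.0, 54.2),
    "h264_v_loop_filter_luma_intra_8bpp": (148.3, 73.8),
    "h264_h_loop_filter_chroma_intra_8bpp": (27.8, 21.4),
    "h264_h_loop_filter_chroma_mbaff_intra_8bpp": (15.8, 15.7),
    "h264_v_loop_filter_chroma_intra_8bpp": (45.8, 17.3),
}


def speedup(c_cycles, neon_cycles):
    """Return how many times faster the NEON version is than the C version."""
    return c_cycles / neon_cycles


# Hypothetical summary table: function name -> speedup ratio.
speedups = {name: speedup(c, neon) for name, (c, neon) in cycles.items()}

for name, ratio in speedups.items():
    print(f"{name}: {ratio:.2f}x")
```

On these numbers the vertical filters gain the most (about 2.0x for luma and about 2.6x for chroma), while the horizontal filters improve by roughly 1.1x to 1.3x and the mbaff chroma variant is essentially unchanged.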
2019-01-26 12:05:10 +01:00