author | Chuanbo Weng <chuanbo.weng@intel.com> | 2014-12-03 15:32:43 +0800 |
---|---|---|
committer | Zhigang Gong <zhigang.gong@intel.com> | 2014-12-03 15:55:51 +0800 |
commit | b9d1080b709cd0178edb93796e5c3dcdc212f97a (patch) | |
tree | 87deed9806faa0148548d451f52e237d729b920f | |
parent | da07d7a43d1fecd57a7ed2d35aa4670b7da7c8c8 (diff) |
Change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from 8 to 16.
Accessing global memory with uchar16/char16 fully utilizes memory
bandwidth, so change CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR from
8 to 16. Three OpenCV cases speed up with this patch:
OCL_ThreshFixture_Threshold, 25% improvement
OCL_MaxFixture_Max, 105% improvement
OCL_MinFixture_Min, 105% improvement.
Signed-off-by: Chuanbo Weng <chuanbo.weng@intel.com>
Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>
-rw-r--r-- | src/cl_gt_device.h | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/src/cl_gt_device.h b/src/cl_gt_device.h
index 37abfd28..ed19f109 100644
--- a/src/cl_gt_device.h
+++ b/src/cl_gt_device.h
@@ -24,7 +24,7 @@
 .max_1d_global_work_sizes = {1024 * 1024 * 256, 1, 1},
 .max_2d_global_work_sizes = {8192, 8192, 1},
 .max_3d_global_work_sizes = {8192, 8192, 2048},
-.preferred_vector_width_char = 8,
+.preferred_vector_width_char = 16,
 .preferred_vector_width_short = 8,
 .preferred_vector_width_int = 4,
 .preferred_vector_width_long = 2,
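
To make the arithmetic behind the commit message concrete, here is a rough sketch in plain C of how the preferred char vector width changes the bytes moved per access and the number of work-items an application would launch. The helpers `bytes_per_access` and `work_items_needed` are hypothetical names used only for illustration; they are not part of this patch or of any OpenCL API, and the sketch does not model how the hardware actually schedules memory traffic.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not from the patch): bytes moved by one
 * vector load or store of `width` char lanes, e.g. 16 for uchar16. */
static size_t bytes_per_access(unsigned width)
{
    return (size_t)width * sizeof(char);
}

/* Hypothetical helper (not from the patch): number of work-items
 * needed to cover `n_bytes` of 8-bit data when each work-item handles
 * one vector of `width` chars. Rounds up so a partial tail is covered. */
static size_t work_items_needed(size_t n_bytes, unsigned width)
{
    assert(width > 0);
    size_t per_item = bytes_per_access(width);
    return (n_bytes + per_item - 1) / per_item;
}
```

Under these assumptions, an application that sizes its kernels from the reported width would cover a 1920x1080 8-bit image with 259200 work-items at width 8 and 129600 at width 16, each work-item issuing a 16-byte access instead of an 8-byte one.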