path: root/src/performance.h
Age | Commit message | Author | Files | Lines
2014-06-12 | Fix the same kernel name issue of OCL_OUTPUT_KERNEL_PERF | Yongjia Zhang | 1 | -1/+1
Kernels that share a kernel name but were built with different build options are now treated as separate kernels.

When OCL_OUTPUT_KERNEL_PERF==1, the time summary is output as before. When OCL_OUTPUT_KERNEL_PERF==2, the time details are output, including each kernel's build options, and kernels with the same name but different build options are listed separately.

v2: use strncmp and strncpy instead of strcmp and strcpy.

Signed-off-by: Yongjia Zhang <yongjia.zhang@intel.com>
Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>
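The idea of keeping separate timing records for kernels that share a name but differ in build options can be sketched as below. This is a minimal illustration, not beignet's actual code: the types `kernel_record` and `kernel_table`, the helper names, and the size limits are all hypothetical. It uses the bounded `strncmp` and `strncpy` calls mentioned in the v2 note.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical limits, not taken from beignet. */
#define MAX_NAME_LEN 64
#define MAX_OPTS_LEN 256
#define MAX_RECORDS  32

/* One timing record per (kernel name, build options) pair. */
typedef struct {
  char   name[MAX_NAME_LEN];
  char   build_opts[MAX_OPTS_LEN];
  double total_ms;
  int    runs;
} kernel_record;

typedef struct {
  kernel_record records[MAX_RECORDS];
  int           count;
} kernel_table;

/* Bounded copy that always NUL-terminates (strncpy alone does not). */
void copy_bounded(char *dst, const char *src, size_t dst_size)
{
  assert(dst_size > 0);
  strncpy(dst, src, dst_size - 1);
  dst[dst_size - 1] = '\0';
}

/* Find the record matching BOTH name and build options, or create one.
 * Returns NULL when the table is full. */
kernel_record *find_or_add(kernel_table *t, const char *name, const char *opts)
{
  if (opts == NULL)
    opts = "";
  for (int i = 0; i < t->count; ++i) {
    kernel_record *r = &t->records[i];
    if (strncmp(r->name, name, MAX_NAME_LEN - 1) == 0 &&
        strncmp(r->build_opts, opts, MAX_OPTS_LEN - 1) == 0)
      return r;
  }
  if (t->count == MAX_RECORDS)
    return NULL;
  kernel_record *r = &t->records[t->count++];
  copy_bounded(r->name, name, sizeof r->name);
  copy_bounded(r->build_opts, opts, sizeof r->build_opts);
  r->total_ms = 0.0;
  r->runs = 0;
  return r;
}

/* Add one execution time to the matching record; dropped if the table is full. */
void record_time(kernel_table *t, const char *name, const char *opts, double ms)
{
  kernel_record *r = find_or_add(t, name, opts);
  if (r != NULL) {
    r->total_ms += ms;
    r->runs += 1;
  }
}
```

With a zero-initialized `kernel_table`, recording kernel `add` once with `-DN=4` and once with `-DN=8` yields two records rather than one, which is the behaviour the commit message describes.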
2014-04-02 | Add kernels performance output | Yongjia Zhang | 1 | -0/+12
If the environment variable OCL_OUTPUT_KERNEL_PERF is set to a non-zero value, beignet outputs the time information of each executed kernel after the program exits.

v2: fix the patch's trailing whitespace problem.
v3: if OCL_OUTPUT_KERNEL_PERF is 1, the output contains only the time summary; if it is 2, the output contains the time summary and the details. Add 'Ave' and 'Dev' to the output: 'Ave' is the average time per kernel per execution round, and 'Dev' is the result of dividing 'Ave' by the standard deviation of all of a kernel's executions.

Signed-off-by: Yongjia Zhang <yongjia.zhang@intel.com>
Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>
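The output levels and the two statistics described above can be sketched roughly as follows. This is an illustration under stated assumptions, not beignet's implementation: the function and type names are hypothetical, clamping values above 2 to the detail level is an assumption, and the variance is computed over the whole population (dividing by n), which the commit message does not specify. The sketch returns the average and the variance only; the standard deviation is the square root of the variance (`sqrt` from `<math.h>`), and how it is combined with 'Ave' into 'Dev' follows the description in the commit message.

```c
#include <stddef.h>
#include <stdlib.h>

/* Map the value of OCL_OUTPUT_KERNEL_PERF to an output level:
 * 0 = no output, 1 = time summary, 2 = time summary and details.
 * Unset or non-numeric values map to 0. Clamping larger values to 2
 * is an assumption of this sketch. */
int parse_perf_level(const char *value)
{
  if (value == NULL)
    return 0;
  long v = strtol(value, NULL, 10);
  if (v <= 0)
    return 0;
  return v >= 2 ? 2 : 1;
}

/* Read the level from the environment. */
int perf_level_from_env(void)
{
  return parse_perf_level(getenv("OCL_OUTPUT_KERNEL_PERF"));
}

/* Hypothetical result type: average and population variance of run times. */
typedef struct {
  double ave;
  double variance;
} time_stats;

/* Compute the average and the population variance of n execution times.
 * Returns zeros when n is 0. */
time_stats compute_stats(const double *times_ms, size_t n)
{
  time_stats s = {0.0, 0.0};
  if (n == 0)
    return s;

  double sum = 0.0;
  for (size_t i = 0; i < n; ++i)
    sum += times_ms[i];
  s.ave = sum / (double)n;

  double sq = 0.0;
  for (size_t i = 0; i < n; ++i) {
    double d = times_ms[i] - s.ave;
    sq += d * d;
  }
  s.variance = sq / (double)n;
  return s;
}
```

Taking the environment value as a string parameter in `parse_perf_level` keeps the parsing logic testable without modifying the process environment.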