Platform: Intel(R) OpenCL Graphics Device: Intel(R) Arc(TM) Graphics Driver version : 24.52.032224 (Linux x64) Compute units : 128 Clock frequency : 2350 MHz Global memory bandwidth (GBPS) float : 65.34 float2 : 71.02 float4 : 69.16 float8 : 72.32 float16 : 55.74 Single-precision compute (GFLOPS) float : 4779.29 float2 : 4751.73 float4 : 4764.57 float8 : 4734.84 float16 : 4490.16 Half-precision compute (GFLOPS) half : 9471.28 half2 : 9392.70 half4 : 9457.13 half8 : 9385.27 half16 : 9286.76 Double-precision compute (GFLOPS) double : 149.58 double2 : 147.26 double4 : 148.91 double8 : 147.75 double16 : 145.23 Integer compute (GIOPS) int : 1255.70 int2 : 1197.28 int4 : 1194.39 int8 : 1199.35 int16 : 1183.43 Integer compute Fast 24bit (GIOPS) int : 1202.85 int2 : 1200.39 int4 : 1187.74 int8 : 1199.03 int16 : 1198.82 Integer char (8bit) compute (GIOPS) char : 2869.81 char2 : 2882.62 char4 : 2890.91 char8 : 2817.97 char16 : 2710.32 Integer short (16bit) compute (GIOPS) short : 7476.13 short2 : 7123.38 short4 : 7195.44 short8 : 7158.95 short16 : 6993.68 Transfer bandwidth (GBPS) enqueueWriteBuffer : 21.23 enqueueReadBuffer : 21.35 enqueueWriteBuffer non-blocking : 31.34 enqueueReadBuffer non-blocking : 31.21 enqueueMapBuffer(for read) : 25.35 memcpy from mapped ptr : 21.24 enqueueUnmap(after write) : 32.07 memcpy to mapped ptr : 20.95 Kernel launch latency : 39.49 us