I am working with kernels that use no floating-point arithmetic; the computational part is mostly logical operations such as AND and exclusive OR (XOR).

Is there an equivalent of FLOPs?

Is there a convenient way to estimate non-FLOP performance?

Many thanks

Bill

Count the number of useful operations your kernel performs in each thread, multiply by the number of threads, and divide by the number of seconds the kernel took to complete. The theoretical "operations/second" rate is 1/2 of the FLOP rate, because the FLOP rate counts each FMA, which performs two floating-point operations in one clock, as two operations.
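
As a rough sketch of that arithmetic (not a profiling tool), the calculation could look like the following. The function names and all the numbers are hypothetical and only for illustration; you would substitute your own operation count, launch size, measured kernel time, and the peak FLOP rate quoted for your device.

```python
def logical_ops_per_second(ops_per_thread, num_threads, elapsed_seconds):
    """Achieved rate: useful operations per thread * threads / kernel time."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return ops_per_thread * num_threads / elapsed_seconds


def peak_ops_per_second(peak_flops):
    """Theoretical rate: half the quoted FLOP rate, which counts each FMA as two."""
    return peak_flops / 2.0


# Hypothetical values (assumptions, not measurements from any real device).
achieved = logical_ops_per_second(
    ops_per_thread=64,        # AND/XOR operations counted by hand in the kernel
    num_threads=1 << 20,      # total threads launched
    elapsed_seconds=0.5e-3,   # measured kernel time in seconds
)
peak = peak_ops_per_second(peak_flops=1.0e12)  # quoted single-precision FLOP rate

print(f"achieved: {achieved:.3e} ops/s")
print(f"fraction of theoretical peak: {achieved / peak:.1%}")
```

The only measured input is the kernel time; the operation count per thread still has to be worked out by hand from the kernel source.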

OK. I was hoping there might be a shortcut or a tool to do this.

Thanks

Bill

