CUDA Thread limitation

Hi,
I am running into a rather weird issue. I have about 10 threads running, of which only 6 (and always 6) return the right answer after doing a complex math operation; the others return a large negative number. I was wondering if someone could clue me in as to what might be going on?

Could you post your code?