Parallel Architecture Research in Eindhoven:
.ca Cache at all levels, likely to be accessed again.
.cg Cache at global level (cache in L2 and below, not L1).
.cs Cache streaming, likely to be accessed once.
.cv Cache as volatile (consider cached system memory lines stale, fetch again).
The ld.cs load cached streaming operation allocates global lines with evict-first policy in L1
and L2 to limit cache pollution by temporary streaming data that may be accessed once or
twice. When ld.cs is applied to a Local window address, it performs the ld.lu operation.
Working on a Fermi assembler.. for the fun of it! :)
You must Log In to add a comment.
New Private Message
Follow Us On
Copyright © 2015 NVIDIA Corporation