DeviceMemory

Description: Measures the maximum device memory bandwidth for different levels of the memory hierarchy and different access patterns. Results are reported in GB/s.

Problem Sizes: Problem size automatically scales up with device performance (designed to saturate bandwidth).

Precision: N/A

Includes PCIe Transfer Time: No

Specific Tests:

  • ImageCacheAccess - Reports read bandwidth from texture/image memory
  • ImageLinearAccess - Reports read bandwidth from texture/image memory using a linear access pattern
  • ImageRandAccess - Reports read bandwidth from texture/image memory using a random access pattern
  • readConstantMemoryCoalesced - Reports read bandwidth from constant memory using coalesced (per thread-block contiguous) accesses
  • readGlobalMemoryCoalesced - Reports read bandwidth from global memory using coalesced (per thread-block contiguous) accesses
  • readGlobalMemoryUnit - Reports read bandwidth from global memory using uncoalesced (per thread contiguous) accesses
  • readLocalMemory - Reports read bandwidth from local memory
  • writeGlobalMemoryCoalesced - Reports write bandwidth to global memory using coalesced (per thread-block contiguous) accesses
  • writeGlobalMemoryUnit - Reports write bandwidth to global memory using uncoalesced (per thread contiguous) accesses
  • writeLocalMemory - Reports write bandwidth to local memory
 
shoc/devmem.txt · Last modified: 2011/11/11 13:13 by kspafford
Recent changes RSS feed Driven by DokuWiki