Non-Uniform Memory Access (NUMA) is a multi-processing architecture where the memory access time is dependent on its location relative to the processor. A processor accessing memory on its same NUMA node will have faster access than if it were accessing memory on a different node. The busTRACE analyzer allows you to capture and analyze NUMA Node information for each I/O.
The NUMA Node Performance Tester is a small applet that is designed to run on a multiple NUMA node platform. The program start by performing a memory benchmark on its local NUMA node and then progressively getting further in distance.
As you can see from the screenshot, when the memory buffers and processor are located on the same NUMA node (i.e. 0-0-0 and 1-1-1), maximum memory throughput is achieved. When the source and destination buffer are both in a different node than the processor (i.e. 1-1-0 and 0-0-1), increased latency and decreased performance occurs.