The University of Massachusetts Amherst

Search Results - hanmei+yang

2 Results Sort By:
CachePerf: A New Tool for Classifying Different Types of Cache Misses Correctly
The cache plays a key role in determining the performance of applications, no matter for sequential or concurrent programs on homogeneous and heterogeneous architecture. Therefore, it is important to locate and differentiate cache misses accurately. This invention provides a first unified profiling tool–CachePerf–that can correctly and efficiently identify different types of cache misses while imposing reasonable overhead, differentiate issues of allocators from those of applications, and exclude minor issues without much performance impact.
Published: 8/3/2023   |   Inventor(s): Tongping Liu, Jin Zhou, Jiaxun Tang, Hanmei Yang
Category(s): Computers, Engineering, Software & information technology
A new method of managing heap memory for NUMA architecture
Today’s computers have multiple processing cores to improve performance; however, a bottleneck arises in multi-core processors where there is only one memory controller, as each processor would need to be able to access the same memory. The Non-Uniform Memory Access computing architecture, or NUMA, addresses this bottleneck by giving a separate memory controller to each processing core. Those computing nodes are connected, and memory allocator software dictates how the system allocates resources to computing tasks. Current memory allocators have substantial drawbacks for tasks that require a large amount of remote accesses, have load imbalance among memory controllers, or have interconnect congestion.

 

The inventors have created a patent pending NUMA memory allocator called NUMAlloc that solves the above memory management problems through four innovations: 1) Binding-based memory management; 2) Interleaved heap; 3) Huge page support; 4) Reduced overhead of migrating objects among freelists. These innovations result in a drop-in, scalable memory allocator that compared to the state-of-the-art allocator speeds computing performance by an average of 13% and speeds up to about 5x for memory intensive applications.

Published: 6/26/2023   |   Inventor(s): Tongping Liu, Xin Zhao, Hanmei Yang
Category(s): Software & information technology, Computers, Engineering