A perf_events-based memory profiler for GNU/Linux. Works on AMD64 and ARM (among others) and is claimed to be very fast (hopefully much less overhead than Valgrind).