summaryrefslogtreecommitdiff
path: root/kernel/bpf/cpumap.c
diff options
context:
space:
mode:
authorNamhyung Kim <namhyung@google.com>2021-03-11 20:54:12 +0900
committerPeter Zijlstra <peterz@infradead.org>2021-03-16 21:44:42 +0100
commitbdacfaf26da166dd56c62f23f27a4b3e71f2d89e (patch)
treeec997eec7ea48b7875e1de7902a72ad80e8d5196 /kernel/bpf/cpumap.c
parent9483409ab5067941860754e78a4a44a60311d276 (diff)
perf core: Add a kmem_cache for struct perf_event
The kernel can allocate a lot of struct perf_event when profiling. For example, 256 cpu x 8 events x 20 cgroups = 40K instances of the struct would be allocated on a large system. The size of struct perf_event in my setup is 1152 byte. As it's allocated by kmalloc, the actual allocation size would be rounded up to 2K. Then there's 896 byte (~43%) of waste per instance resulting in total ~35MB with 40K instances. We can create a dedicated kmem_cache to avoid such a big unnecessary memory consumption. With this change, I can see below (note this machine has 112 cpus). # grep perf_event /proc/slabinfo perf_event 224 784 1152 7 2 : tunables 24 12 8 : slabdata 112 112 0 The sixth column is pages-per-slab which is 2, and the fifth column is obj-per-slab which is 7. Thus actually it can use 1152 x 7 = 8064 byte in the 8K, and wasted memory is (8192 - 8064) / 7 = ~18 byte per instance. Signed-off-by: Namhyung Kim <namhyung@kernel.org> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/20210311115413.444407-1-namhyung@kernel.org
Diffstat (limited to 'kernel/bpf/cpumap.c')
0 files changed, 0 insertions, 0 deletions