diff options
author | Kan Liang <kan.liang@linux.intel.com> | 2021-02-02 12:09:10 -0800 |
---|---|---|
committer | Arnaldo Carvalho de Melo <acme@redhat.com> | 2021-02-08 16:25:00 -0300 |
commit | 590db42de068a1d11e51bd0796a9044621aeed2e (patch) | |
tree | 8a08575b6be5ba31cdbbaae1c95eda72f623cf82 /tools/perf/util/event.h | |
parent | ea8d0ed6eae37b01953a29bca98112d9e2507a84 (diff) | |
download | lwn-590db42de068a1d11e51bd0796a9044621aeed2e.tar.gz lwn-590db42de068a1d11e51bd0796a9044621aeed2e.zip |
perf report: Support instruction latency
The instruction latency information can be recorded on some platforms,
e.g., the Intel Sapphire Rapids server. With both memory latency
(weight) and the new instruction latency information, users can easily
locate the expensive load instructions, and also understand the time
spent in different stages. The users can optimize their applications in
different pipeline stages.
The 'weight' field is shared among different architectures. Reusing the
'weight' field may impacts other architectures. Add a new field to store
the instruction latency.
Like the 'weight' support, introduce a 'ins_lat' for the global
instruction latency, and a 'local_ins_lat' for the local instruction
latency version.
Add new sort functions, INSTR Latency and Local INSTR Latency,
accordingly.
Add local_ins_lat to the default_mem_sort_order[].
Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Jin Yao <yao.jin@linux.intel.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Link: http://lore.kernel.org/lkml/1612296553-21962-7-git-send-email-kan.liang@linux.intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Diffstat (limited to 'tools/perf/util/event.h')
-rw-r--r-- | tools/perf/util/event.h | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/tools/perf/util/event.h b/tools/perf/util/event.h index 2afea7247dd3..60752e4c9727 100644 --- a/tools/perf/util/event.h +++ b/tools/perf/util/event.h @@ -142,6 +142,7 @@ struct perf_sample { u16 insn_len; u8 cpumode; u16 misc; + u16 ins_lat; bool no_hw_idx; /* No hw_idx collected in branch_stack */ char insn[MAX_INSN]; void *raw_data; |