#systemtap on 2018-11-10 — irc logs at freenode.irclog.whitequark.org

2015-11-12 23:18 fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged

00:00 <agentzh> is this a known issue?

00:00 <agentzh> gdb does not have this issue when setting breakpoints on parent functions' entries.

00:00 <agentzh> is there a way to compensate this side effects of uprobes?

00:01 <agentzh> *effect

00:06 <agentzh> sorry, i mean putting a return probe on main().

00:12 <agentzh> the backtrace always stopped at the frame 0x7fffffffe000.

00:49 <agentzh> created a PR with a minimal example: https://sourceware.org/bugzilla/show_bug.cgi?id=23876

00:49 <agentzh> fche: it would be great if you can have a quick look. thanks!

01:08 <fche> function return probes are known to muck with backtraces :(

01:47 <agentzh> any workaround to that?

01:49 <fche> I don't know of one. the main problem is that when the kernel mucks up the stack (to hook into the retprobes machinery), it doesn't emit info that enables a general unwinder to follow that trampoline back up

01:49 <fche> (I wouldn't be surprised if you had a *retprobe on a program, and ran gdb, you'd see the bad effect too.)

05:34 _whitelogger has joined #systemtap

06:24 <agentzh> fche: will it help if i probe onto the ret instructions myself to emulate the return probes using ordinary uprobes?

06:25 <agentzh> thanks for your explanation on the problems with uretprobes + unwinding

06:26 <agentzh> for a related question, is there a probe syntax to add a probe onto an arbitrary instruction address? the absolute statement syntax does not work with inode-based uprobes, it seems.

06:26 <agentzh> sorry if it's already documented somewhere. just never seen such info.

09:37 _whitelogger has joined #systemtap

10:31 sscox has quit [Ping timeout: 276 seconds]

15:28 sscox has joined #systemtap

15:39 sscox has quit [Ping timeout: 240 seconds]

15:52 sscox has joined #systemtap

16:30 <fche> yes, if you can put a probe onto the ret, you should see workable tracebacks

16:30 <fche> t

16:31 <fche> the difficulty is in locating them. (dtrace at one point used to disassemble the target binaries to locate ret's.)

16:31 <fche> see PR10056, PR18714

16:31 <fche> @agentzh ^

17:03 <agentzh> fche: great! will check them out.

17:03 <agentzh> thanks!

17:03 <fche> they're long-unresolved, so yeah just a placeholder for known missing features

17:04 <agentzh> got it.

17:48 <agentzh> fche: is it possible to specify a probe on an instruction address?

17:49 <agentzh> right now in the stap language?

17:49 <fche> for userspace? I believe no.

17:49 <agentzh> yes, userspace.

17:49 <agentzh> the absolute statement syntax does not work with inode-based uprobes, it seems.

17:49 <fche> yes, that's the subject of the second PR

17:50 <agentzh> oh, sorry, just read the first one, not the 2nd one.

17:51 <agentzh> PR18714 is not for fortran? https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18714

17:51 <agentzh> *is for

17:51 <agentzh> wrong PR number?

17:51 <fche> yea

17:51 <fche> no

17:51 <fche> https://sourceware.org/bugzilla/show_bug.cgi?id=18714

17:52 <agentzh> oh sorry, was distracted by the gcc bugzilla link in the first PR

17:53 <fche> they're both sourceware.org/bugzilla numbers

17:53 <agentzh> *nod*

17:53 <agentzh> okay, inode+offset probe is still a TODO.

17:53 <agentzh> do you have any syntax in mind?

17:54 <fche> process.statement(0xaddr) - the question is whether 0xaddr can be unambiguously interpreted by the user

17:54 <agentzh> and also process("file").statement(0xaddr) ?

17:54 <fche> or .statement(0xaddr).absolute, which is what the syntax used to be (and referred to run-time vm addresses)

17:55 <fche> yes

17:56 <agentzh> i think the statement(0xaddr).absolute is still a relative address? vm addresses are volatile for PIE/PIC.

17:56 <fche> it -wasn't- relative, that was the point of the .absolute suffix

17:56 <agentzh> okay, then i had the wrong memory.

17:57 <fche> statement(0xaddr) alone would refer to a location similarly as .statement("*@file.c:444") would

17:57 <agentzh> okay

17:58 <agentzh> we'll take a stab at implementing it.

17:58 <agentzh> for inode-based uprobes.

17:59 <agentzh> it would also complete the gap for true dwarf-less probling :)

17:59 <agentzh> with @vma() under the belt already.

18:00 <fche> sure

18:09 <fche> would suggest thinking of the first PR (non retprobes for return probes) as separate from the second PR (inode-address)

18:11 <agentzh> sure, they're different.

18:11 <agentzh> is there any dwarf marker for ret instructions yet?

18:11 <agentzh> there's no pointers in PR49167

18:12 <agentzh> beside the one for tailcalls.

18:12 <fche> not really good ones, as at that time. an 'epilogue' tag exists

18:12 <fche> and could be that alex oliva's recent sfn-related debuginfo improvements in gcc give enough info

18:12 <fche> but someone would have to dig in and see

18:13 <agentzh> okay, i'm taking notes.

18:13 <agentzh> thanks

18:13 <fche> righto

18:13 <agentzh> it would be much easier if we don't have to analyze the full function instructions.

18:13 <fche> of course

18:13 <fche> that's why stap isn't doing that

18:13 <fche> plus we're pretty cross-platform

18:14 <agentzh> aye

18:54 gila has joined #systemtap

18:55 <agentzh> fche: hmm, seems like process.statement(0xaddr) is already working with inode-based uprobes?

18:55 <agentzh> just tried this: stap -e 'probe process.statement(0x4004b6) { println(register("eax")) }' -c ./a.out

18:55 <agentzh> and it seems to work alright.

18:56 <agentzh> the only annoying bit is this error: "semantic error: address 0x4004b7 does not match the beginning of a statement (try 0x4004b6)"

18:56 <agentzh> maybe we can relax this a bit by something like a command-line option?

18:58 <fche> that'd be dwarf based, searching the line / pc-range mapping tables

18:59 <agentzh> oh, i see.

22:39 orivej_ has quit [Ping timeout: 246 seconds]

23:58 <ggherdov> Hello, I'm getting this error from a tapset at Pass 5: "ERROR: module release mismatch (4.18.0-MY-KERNEL-FOO vs 4.18.0-MY-KERNEL-BAR)"

23:58 <ggherdov> Indeed I just compiled and installed these two kernel version, but I'm not sure which "module" is systemtap complaining about. Full error at https://bpaste.net/show/741a3868e4b5

23:58 <ggherdov> Any hint on how to fix that?

23:58 <fche> So the version of the kernel build tree stap's using must be for the wrong one - not the kernel that you're actually running

23:59 <ggherdov> oh, I see.

23:59 <fche> if you bump up the verbosity two more levels (another -vv), stap will rat out where the kernel build tree it's using: