#systemtap on 2018-08-30 — irc logs at freenode.irclog.whitequark.org

2015-11-12 23:18 fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged

04:40 _whitelogger has joined #systemtap

05:11 slowfranklin has joined #systemtap

07:01 orivej has quit [Ping timeout: 240 seconds]

07:38 orivej has joined #systemtap

08:00 orivej has quit [Ping timeout: 252 seconds]

08:44 pwithnall has joined #systemtap

08:49 mjw has joined #systemtap

11:23 orivej has joined #systemtap

12:11 naveen2 has joined #systemtap

12:14 naveen1 has quit [Ping timeout: 252 seconds]

12:18 naveen4 has joined #systemtap

12:19 naveen2 has quit [Ping timeout: 246 seconds]

12:28 naveen1 has joined #systemtap

12:30 naveen4 has quit [Ping timeout: 246 seconds]

12:32 naveen2 has joined #systemtap

12:35 naveen1 has quit [Ping timeout: 245 seconds]

12:47 naveen3 has joined #systemtap

12:50 naveen2 has quit [Ping timeout: 240 seconds]

12:52 naveen4 has joined #systemtap

12:55 naveen3 has quit [Ping timeout: 245 seconds]

12:56 orivej has quit [Ping timeout: 240 seconds]

13:23 wcohen has joined #systemtap

13:51 tromey has joined #systemtap

14:18 brolley has joined #systemtap

14:38 orivej has joined #systemtap

14:59 introom has joined #systemtap

15:51 pwithnall has quit [Ping timeout: 252 seconds]

16:37 pwithnall has joined #systemtap

17:50 slowfranklin has quit [Quit: slowfranklin]

18:08 mjw has quit [Quit: Leaving]

18:11 orivej has quit [Ping timeout: 250 seconds]

19:08 orivej has joined #systemtap

20:01 orivej has quit [Ping timeout: 252 seconds]

20:05 slowfranklin has joined #systemtap

20:15 slowfranklin has quit [Quit: slowfranklin]

20:23 pwithnall has quit [Ping timeout: 240 seconds]

20:25 slowfranklin has joined #systemtap

21:24 <agentzh> fche: the duplicate func removal optimization pass tends to lead to confusing line numbers in runtime error messages (like read faults). i just wasted 2 hours on such misinfo.

21:25 <agentzh> it's very common to have duplicate synthetic functions for primitives like @cast().

21:25 <agentzh> and the read fault error message may point to completely unrelated parts of the user stap script file.

21:26 <agentzh> should we ignore synthetic functions in that optimizer?

21:28 tromey has quit [Quit: ERC (IRC client for Emacs 26.1.50)]

21:33 wcohen has quit [Ping timeout: 252 seconds]

21:45 <fche> could run code with stap -u ... but a stap -vvv kind of thing will show how many functions are clones of each other

21:46 <agentzh> fche: yeah, that was how i nailed this down.

21:47 <agentzh> but for production uses, the read fault might no longer be reproducible easily.

21:47 <agentzh> so -u is not very helpful for debugging the online errors.

21:47 <agentzh> especially when stap currently gives no backtrace, just a single file name and line number pair.

21:49 <agentzh> unless we always run stap -u online...

21:50 <agentzh> which would be sad.

21:51 <agentzh> maybe the right fix is for the caller to provide source file location info for those synthetic functions.

21:51 <agentzh> so dedup'ing those functions still leads to correct loc info in error messages.

21:58 <fche> not so easy, as the caller may itself be duplicated, and thus may not have correct original-source coordinates

22:02 <agentzh> right.

22:05 brolley has left #systemtap [#systemtap]

22:08 slowfranklin has quit [Quit: slowfranklin]

22:12 <agentzh> fche: i'd turn off dedup for synthetic functions in our branch for now. i guess it's not something that will be accepted upstream.

22:12 <agentzh> just a trade-off

22:12 slowfranklin has joined #systemtap

22:13 <fche> not sure what's special about synthetic functions for this purpose

22:13 <fche> hm

22:13 <agentzh> because it's very common to have duplicate synthetic functions derived from identical @cast() expressions.

22:13 <fche> perhaps the body-sharing optimization could be conditional on the same token object

22:13 <agentzh> like @cast(p, "foo")

22:14 <agentzh> it can appear very frequently.

22:14 <agentzh> in the same .stp file.

22:14 <fche> yes but we can have identical probe handlers too from wildcards, same thing

22:15 <agentzh> that's much rarer for our use cases... :)

22:24 <agentzh> *at least
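
The location problem discussed above can be illustrated with a toy model. This is only a sketch, not stap's actual optimizer: the `Func` record, the `dedup` helper, and the `script.stp:N` locations are all hypothetical. It shows how merging functions by body alone makes an error in the second copy report the first copy's location, and how also keying on the originating location (roughly fche's "same token object" idea) keeps them apart.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Func:
    name: str
    body: str  # normalized function body (hypothetical representation)
    loc: str   # "file:line" of the token the function was synthesized from


def dedup(funcs, key):
    """Map every function name to the surviving representative for its key."""
    survivors = {}
    mapping = {}
    for f in funcs:
        # The first function seen for a key survives; later ones alias to it.
        rep = survivors.setdefault(key(f), f)
        mapping[f.name] = rep
    return mapping


# Two synthetic functions generated from identical @cast() expressions
# at different places in the same (made-up) script.
funcs = [
    Func("cast_1", '@cast(p, "foo")->x', "script.stp:10"),
    Func("cast_2", '@cast(p, "foo")->x', "script.stp:250"),
]

# Keyed on body only: cast_2 is replaced by cast_1, so a fault in cast_2
# would be reported at cast_1's location.
by_body = dedup(funcs, key=lambda f: f.body)

# Keyed on body plus originating location: no sharing, locations stay right.
by_body_and_loc = dedup(funcs, key=lambda f: (f.body, f.loc))

merged_loc = by_body["cast_2"].loc
kept_loc = by_body_and_loc["cast_2"].loc
print(merged_loc, kept_loc)
```

The trade-off is the one raised in the conversation: the stricter key gives up most of the sharing for repeated @cast() expressions in exchange for accurate locations.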

22:26 slowfranklin has quit [Quit: slowfranklin]

22:30 <fche> for testing, consider extending elaborate.cxx line 4958ish to print the ->tok objects of the original & substitute functiondecl objects

22:39 <agentzh> fche: yeah, that was how i tracked the problem down.

22:39 <agentzh> i'll create a PR to record this issue for now.

22:39 <agentzh> with a minimal test case.

22:49 <agentzh> done: https://sourceware.org/bugzilla/show_bug.cgi?id=23598

22:50 <fche> agentzh, about the abort v4, lgtm

22:51 <fche> (there's a typo 'flat' vs 'flag' in there, but whatever)

22:51 <agentzh> v4 for which patch?

22:51 <agentzh> abort?

22:52 <fche> yup, said "abort v4" :-)

22:52 <fche> was a bit too curt with my words though. thanks, go and commit

22:52 <agentzh> oh, sorry, i read it as "about" :P

22:52 <agentzh> great! i'll fix the typo and commit :D

22:53 <agentzh> fche: it seems like -u is less exercised by the user community and might expose bugs...just found one.

22:53 <fche> could be

22:53 <agentzh> will create a PR.

22:54 <agentzh> i have to say stap has accumulated so many features over the years! it's amazing :)

22:56 <fche> re. return-nonvalue-v3 ... here's a glitch

22:56 <fche> stap -p1 -e 'function f() { if (a) return a = 2 }'

22:57 <fche> the keyword-enumerated detection of void-return fails in the sense that a reader can't tell what's what

22:57 <fche> having that list of statement keywords is not that easy to explain, and doesn't stand out visually well

23:01 <agentzh> fche: so i should remove that part?

23:02 <fche> yeah, I think so, both to simplify the code and the explanation. "non-value return only accepted with a ; or } following it"

23:02 <fche> one of the consequences of our loosey grammar, oh well.

23:02 <agentzh> okay, no problems. i'll rework the patch.

23:02 <fche> thanks

23:02 <agentzh> sure
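
The simplified rule fche states ("non-value return only accepted with a ; or } following it") can be sketched as a one-token lookahead. This is a toy illustration over a pre-split token list, not the real stap parser; `classify_returns` is a hypothetical helper name.

```python
def classify_returns(tokens):
    """Classify each `return` as 'void' or 'value' using one token of lookahead."""
    kinds = []
    for i, tok in enumerate(tokens):
        if tok != "return":
            continue
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        # Only ';' or '}' directly after `return` marks a non-value return;
        # anything else is taken as the start of the returned expression.
        kinds.append("void" if nxt in (";", "}") else "value")
    return kinds


# fche's example: without a ';', `a = 2` is read as the returned expression.
glitch = "function f ( ) { if ( a ) return a = 2 }".split()
# With an explicit ';', the return carries no value and `a = 2` is separate.
bare = "function f ( ) { if ( a ) return ; a = 2 }".split()

glitch_kinds = classify_returns(glitch)
bare_kinds = classify_returns(bare)
print(glitch_kinds, bare_kinds)
```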

23:03 <fche> for test cases that you need to check only for parseability or non-parseability, you don't need all that .exp material, just add barenaked .stp files under testsuite/parseok vs parseko

23:04 <agentzh> fche: yeah, but those tests are auto-generated so it's actually easier for me :P

23:04 <agentzh> i hope that i can keep it that way if you don't mind.

23:04 <fche> ok

23:04 <agentzh> thanks

23:04 <fche> wouldn't be surprised if at some point we extend the .exp machinery to have gcc-like directives embedded in .stp files

23:05 <fche> to reduce the need for .exp files for a few more classes of cases (like expected outputs or diagnostics)

23:05 <agentzh> *nod*

23:05 <agentzh> i'm just not fluent in tcl yet to do that big thing.

23:06 <agentzh> oh you mean embedding a testing facility directly in the stap language?

23:06 <fche> I mean in tcl

23:07 <agentzh> okay

23:07 <fche> (stap scripts can to some extent self-test, e.g. with printf("pass") vs error("fail"))

23:07 <fche> some tests do that

23:07 <agentzh> yeah, inlined assertions.

23:07 <agentzh> luajit does that in most of its official test suite, for example.

23:08 <agentzh> though personally i like comparing output from the outside more :)

23:08 <agentzh> for most of the cases.

23:08 <agentzh> so that it's easier to know the actual output by simply reading the test suite report.

23:09 <agentzh> but i agree inlined assertions could be more advanced to do that automatically.

23:09 <agentzh> luajit's assert() is just like C's, true or false only.

23:09 <agentzh> there is no way other than patching the test script itself to know how different things are in an assert().

23:11 <fche> the stap tapset does in fact have an assert() function

23:23 <agentzh> yeah, indeed. found it in logging.stp.

23:24 <agentzh> fche: created a PR for -u: https://sourceware.org/bugzilla/show_bug.cgi?id=23599

23:24 <agentzh> iirc, you are working on a similar issue to avoid bringing in unused tapset functions?

23:24 <agentzh> not sure if it'll also be applicable to -u.

23:25 <fche> separate issue

23:25 <agentzh> okay, is there a quick fix for the PR? it looks strange.

23:25 <fche> which PR?

23:25 <agentzh> https://sourceware.org/bugzilla/show_bug.cgi?id=23599

23:25 <fche> the usymname one?

23:25 <agentzh> yep

23:26 <agentzh> i could enable need_unwind when need_symbols is true, but it looks too hacky.

23:27 <fche> ah interesting

23:27 <fche> print_ubacktrace() is the failing function

23:28 <agentzh> yep

23:28 <fche> that should be satisfied from runtime/stack.c's implementation

23:29 <agentzh> fche: yeah, but for some reason, s.need_unwind is set to false.

23:29 <agentzh> so that stack.c file is not included in the kernel module C file.

23:29 <agentzh> hence the compilation error.

23:29 <fche> hm and yet tapset/linux/ucontext-unwind.stp has the pragma in it

23:30 <fche> interesting

23:30 <fche> so some translation traversal does seem sensitive to -u in an inconsistent manner with pragma detection

23:38 <agentzh> yes, it seems.

23:38 <agentzh> or maybe they just skip unused tapset functions for pragma search.

23:43 <agentzh> fche: okay, i found the bug.

23:44 <agentzh> it's indeed the case.

23:44 <agentzh> embeddedcode_info_pass() only searches among probes.

23:44 <agentzh> not functions.

23:44 <agentzh> so unused functions won't get searched.

23:45 <agentzh> fche: does this change look good? https://pastebin.com/K1hyYXjq

23:45 <agentzh> if yes, i'll submit a formal patch to the mailing list for review.

23:45 <agentzh> this patch makes my test case pass.
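
The diagnosis above (pragma detection only walks what the probes reach, while -u keeps unused functions in the generated module) can be modeled roughly as follows. This is a toy sketch and not the linked patch or stap's real embeddedcode_info_pass(); the dictionaries, the `unwind_pragma` flag, and the function names are all hypothetical.

```python
def reachable(probes, functions):
    """Names of functions reachable from any probe via the call graph."""
    seen = set()
    stack = [callee for p in probes for callee in p["calls"]]
    while stack:
        name = stack.pop()
        if name in seen:
            continue
        seen.add(name)
        stack.extend(functions[name]["calls"])
    return seen


def need_unwind(probes, functions, scan_all):
    """True if any scanned function carries the (assumed) unwind pragma flag."""
    names = functions.keys() if scan_all else reachable(probes, functions)
    return any(functions[n]["unwind_pragma"] for n in names)


# One function the probe actually calls, and one unused function that
# needs the unwinder. With -u the unused one is still compiled.
functions = {
    "used_fn": {"calls": [], "unwind_pragma": False},
    "unused_backtrace_fn": {"calls": [], "unwind_pragma": True},
}
probes = [{"calls": ["used_fn"]}]

# Scanning only what probes reach misses the pragma in the unused function,
# so the unwinder support would be left out and compilation would fail.
probes_only = need_unwind(probes, functions, scan_all=False)

# Scanning every retained function finds it.
all_functions = need_unwind(probes, functions, scan_all=True)
print(probes_only, all_functions)
```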