#systemtap on 2020-11-06 — irc logs at freenode.irclog.whitequark.org

2015-11-12 23:18 fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged

00:25 khaled has quit [Remote host closed the connection]

00:29 khaled has joined #systemtap

01:12 sscox has quit [Ping timeout: 264 seconds]

01:43 orivej has quit [Ping timeout: 272 seconds]

02:00 khaled has quit [Quit: Konversation terminated!]

02:26 <kerneltoast> fche, here are the results: https://gist.github.com/kerneltoast/e38a041d174d869bf2357adb75b5004c#file-systemtap-sum-diff

02:34 zamba has quit [Ping timeout: 256 seconds]

02:35 fLiPr3VeRsE has quit [Ping timeout: 256 seconds]

02:35 fLiPr3VeRsE has joined #systemtap

02:41 zamba has joined #systemtap

02:54 hpt has joined #systemtap

02:55 <fche> kerneltoast, ok, I see nothing scary in the patch or the diffs

02:55 <fche> we're hoping to cut the release in the next day ish so let's hope this is the last bit that's kind of low level

02:56 <kerneltoast> i was hoping to get the lockup fixed in time for the release too but alas

02:59 <fche> yeah that one's more troubling but I don't think your fix was on the right track

03:00 <kerneltoast> needs moar grok

03:17 derek0883 has quit [Remote host closed the connection]

03:22 sscox has joined #systemtap

03:25 derek0883 has joined #systemtap

04:59 <kerneltoast> fche, we should also add a patch to fix the memory leak when the task work fails to get added

06:28 lijunlong has quit [Ping timeout: 246 seconds]

06:29 derek0883 has quit [Remote host closed the connection]

06:29 lijunlong has joined #systemtap

06:30 derek0883 has joined #systemtap

06:44 irker835 has joined #systemtap

06:44 <irker835> systemtap: sultan systemtap.git:master * release-4.3-128-g498aa23b6 / runtime/linux/task_finder2.c: PR26144: task_finder2: execute task workers in order

07:40 lijunlong has quit [Ping timeout: 256 seconds]

07:42 lijunlong has joined #systemtap

07:52 khaled has joined #systemtap

09:00 hpt has quit [Ping timeout: 246 seconds]

09:01 hpt has joined #systemtap

09:13 hpt has quit [Ping timeout: 240 seconds]

09:44 irker835 has quit [Quit: transmission timeout]

09:47 orivej has joined #systemtap

10:01 pviktori has joined #systemtap

10:25 pviktori has quit [Ping timeout: 256 seconds]

11:26 pviktori has joined #systemtap

11:48 mjw has joined #systemtap

13:06 khaled has quit [Quit: Konversation terminated!]

13:09 khaled has joined #systemtap

13:10 orivej has quit [Ping timeout: 256 seconds]

14:02 wcohen has quit [Remote host closed the connection]

14:31 wcohen has joined #systemtap

15:27 amerey has joined #systemtap

16:06 tromey has joined #systemtap

18:07 derek0883 has quit [Remote host closed the connection]

18:11 derek0883 has joined #systemtap

18:58 <kerneltoast> fche, hiya

18:58 <fche> uh oh

18:59 <kerneltoast> would it be bad to wrap a small code style nitpick into an unrelated patch?

19:00 <fche> depends

19:00 <fche> what're you thinking

19:00 <kerneltoast> https://gist.github.com/kerneltoast/32a19af77cb401140da5ac881ad4f6d9

19:00 orivej has joined #systemtap

19:02 <fche> well, not just a style thing, it should shrink the critical section somewhat in task_worker_f

19:02 <fche> fn

19:02 <fche> looks okay to me, testing supports?

19:02 <kerneltoast> lemme test real quick

19:02 <kerneltoast> want it to be a separate patch?

19:02 irker042 has joined #systemtap

19:02 <irker042> systemtap: amerey systemtap.git:master * release-4.3-129-g4ff87eef4 / po/cs.gmo po/cs.po po/en.po po/fr.po po/pl.po po/systemtap.pot: prerelease: update-po

19:02 <irker042> systemtap: amerey systemtap.git:master * release-4.3-130-g3345cd0df / AUTHORS: prerelease: AUTHORS bump

19:02 <irker042> systemtap: amerey systemtap.git:master * release-4.3-131-g8dc7997f7 / tapset/uconversions.stp: tapset/uconversions.stp: Fix user_string_n_nofault description.

19:03 <fche> kerneltoast, what as a separate patch? the diff you posted looks okay

19:03 <kerneltoast> the task_worker_fn hunk but if not, coolio

19:03 <kerneltoast> coolbeans

19:14 <kerneltoast> fche, our lean tester is happy. here's the polished commit: https://gist.github.com/kerneltoast/32a19af77cb401140da5ac881ad4f6d9

19:16 <fche> thanks, nice

19:23 <kerneltoast> fche, i think i see a problem with _stp_runtime_entryfn_get_context

19:24 <kerneltoast> but my brain is still processing

19:32 <irker042> systemtap: amerey systemtap.git:master * release-4.3-132-gf75ec911a / tapset/uconversions.stp: tapset/uconversions.stp: Fix format of user_string_n_nofault

19:32 <irker042> systemtap: amerey systemtap.git:master * release-4.3-133-g3df5453fe / : prerelease: update-docs

19:35 <kerneltoast> fche, i guess this is moreso an optimization: https://gist.github.com/kerneltoast/45815f8345acad1f840af3b20b0b1adb

20:14 <fche> hm, so no sign of the bug?

20:14 <fche> hm ya know

20:20 <fche> could use a warning in the _put_context() function, as that c == _stp_runtime_get_context is really an assertion

20:25 <fche> hmmmmmmmm

20:25 <fche> ok have a hypothesis

20:26 <fche> what if we do encounter a reentrancy event, but at that moment for whatever reason, we're in rcu-idle state

20:26 <fche> so _stp_runtime_get_context() returns 0 in the if (c == ...) test

20:26 <fche> then the test would be false and the context would stay busy ..... hm never mind, that's a more friendly outcome

20:27 <fche> but I'm thinking that particular assertion could introduce heisenbugs

20:28 <fche> kerneltoast, if you can induce that crash easily enough with make -j *check, could you try it with commenting out that if... and leaving in the atomic_dec unconditionally?

20:28 <fche> agentzh, kerneltoast, making any sense?

20:29 <kerneltoast> fche, commenting out which if? also, my cmpxchg patch doesn't fix the bug, it's just an optimization

20:29 <kerneltoast> i did test something just now similar to what you're thinking of

20:29 <fche> the if (c == _stp_runtime_get_context()) in _put_context

20:30 <kerneltoast> fche, this is what i tried: https://gist.github.com/kerneltoast/2c1f1dff6d6aa4dd14eefdd0ede0bc5d

20:31 <kerneltoast> the lockup still occurred without that printk getting hit

20:31 <fche> ok

20:32 <fche> could you try my hypothetical too easily?

20:32 <kerneltoast> yeah

20:32 <kerneltoast> just comment out that if? nothing else?

20:32 <fche> yes, do the atomic_dec(&c->busy); every time

20:33 <kerneltoast> it takes 10-20 min to reproduce usually btw

20:33 <kerneltoast> pretty fast

20:33 <kerneltoast> - if (c == _stp_runtime_get_context())

20:33 <kerneltoast> - atomic_dec(&c->busy);

20:33 <kerneltoast> + atomic_dec(&c->busy);

20:33 <fche> yup
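
The experiment agreed on above can be illustrated with a minimal user-space sketch. It is not the SystemTap runtime code: `struct context`, `lookup_context()`, `lookup_fails` and both `put_context_*` helpers are hypothetical stand-ins, and C11 atomics replace the kernel's `atomic_t`. It only shows fche's hypothesis that, if the lookup returns NULL at release time, a conditional release leaves the context marked busy, while an unconditional `atomic_dec` always frees it.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical stand-in for the per-CPU probe context from the discussion. */
struct context {
    atomic_int busy;            /* nonzero while a probe handler owns the context */
};

static struct context cpu_context;  /* a single "CPU", for illustration only */
static int lookup_fails;            /* set to 1 to simulate the lookup returning NULL */

/* Stand-in for the context lookup; the log speculates it may return 0/NULL. */
static struct context *lookup_context(void)
{
    return lookup_fails ? NULL : &cpu_context;
}

/* Acquire the context, or return NULL if it is already busy (reentrancy). */
static struct context *get_context(void)
{
    struct context *c = lookup_context();
    if (c == NULL)
        return NULL;
    if (atomic_fetch_add(&c->busy, 1) != 0) {
        atomic_fetch_sub(&c->busy, 1);  /* undo our increment and back off */
        return NULL;
    }
    return c;
}

/* Release only when the lookup still agrees with c (the "if" being removed). */
static void put_context_conditional(struct context *c)
{
    if (c == lookup_context())
        atomic_fetch_sub(&c->busy, 1);
}

/* Release every time, as in the experiment. */
static void put_context_unconditional(struct context *c)
{
    assert(atomic_load(&c->busy) > 0);  /* releasing an idle context is a bug */
    atomic_fetch_sub(&c->busy, 1);
}
```

In this sketch, a failed lookup during the conditional release leaves `busy` at 1, so every later `get_context()` on that CPU is refused. That is the "more friendly outcome" fche mentions: probes get skipped rather than the machine hanging.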

20:34 <kerneltoast> it's chuggin away

20:34 <fche> thanks

20:35 <kerneltoast> when's the release btw?

20:35 <fche> any time now,

20:35 <fche> I'm personally holding it up with work :*

20:35 <fche> but am having a tough time concentrating :)

20:36 <kerneltoast> hehe

20:36 <kerneltoast> i should get agentzh to push the memleak fix soon then

20:36 <fche> dammit

20:36 <fche> :)

20:36 <kerneltoast> agentzh gogogogog

20:54 derek0883 has quit [Remote host closed the connection]

21:03 derek0883 has joined #systemtap

21:12 <kerneltoast> fche, it ran to completion. testing it again now

21:12 <fche> really, no sign of the reentrancy this time?

21:12 <kerneltoast> yes, but from previous testing it doesn't happen 100% of the time, just *most* of the time

21:12 <kerneltoast> we may have gotten lucky

21:13 <kerneltoast> i could also add a print

21:13 <fche> cueing daft punk

21:13 <kerneltoast> a print to indicate when the atomic_dec would've been ignored

21:14 <kerneltoast> running the testsuite again

21:14 <kerneltoast> chug a lug lug

21:15 <fche> not sure whether that's necessarily a bug or problem so let's not print there for now

21:15 <fche> chug a lug lug <<< okay

21:15 <kerneltoast> better to have a print to see if it's even doing something

21:21 <kerneltoast> fche, it died

21:23 <kerneltoast> my print didn't get printed either

21:24 <kerneltoast> turn the daft punk off

21:24 <kerneltoast> do not pass GO

21:24 <kerneltoast> do not collect pension

21:28 <kerneltoast> fche, here's the full dmesg https://gist.github.com/kerneltoast/a317b64277fe454c3307005332b2d265

21:31 <fche> stap_4f0a4a843ca425508f184f4a28f3b46b__13439 (<input>):

21:31 <fche> after a reboot, can you find the stap_4f* corresponding files under $build/testsuite/.systemtap-root/cache/* ?

21:32 <fche> with the .c file, it should be possible to figure out which <input> this comes from

21:33 <fche> then I'd run that test (only) on a super busyfied machine

21:35 <kerneltoast> fche, here's the file in its enormity: https://gist.github.com/kerneltoast/f73190f91e74bca314d53d8c9d176c76

21:36 tromey has quit [Quit: ERC (IRC client for Emacs 27.1.50)]

21:36 <fche> uprobes_onthefly.x ..... lots of tracepoints .....

21:39 <kerneltoast> there are 15 calls to stp_lock_probe and 12 stp_unlock_probe

21:39 <kerneltoast> let's see where there's a discrepancy...

21:41 <fche> that's not necessarily a problem if they're in different nested blocks

21:41 <fche> at execution time there should be a matching set run

21:41 <kerneltoast> i know but it's just a belt-and-suspenders check to do

21:41 <fche> and an unlock at probe exit should be unavoidable

21:41 <kerneltoast> and i know how much you like belts and suspenders

21:42 <kerneltoast> oh hey

21:42 <fche> hm and interesting

21:42 <kerneltoast> i think i see the problem

21:42 <kerneltoast> if (!stp_lock_probe(locks, ARRAY_SIZE(locks)))

21:42 <kerneltoast> goto out;

21:42 <kerneltoast> else

21:42 <kerneltoast> c->locked = 1;

21:42 <kerneltoast> interrupt after the lock but before c->locked = 1

21:43 <fche> the reentrancy mechanism should prevent this

21:43 <kerneltoast> with the get_context?

21:43 <fche> yes

21:43 <fche> the lock_probe() mechanism should operate between concurrent probe handlers
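
The two positions above can be contrasted with a single-threaded user-space sketch. Everything in it is an assumption made for illustration (`handler()`, `try_lock()`, the `irq` callback and the counters are hypothetical, not SystemTap code): a nested handler is invoked by hand in the window between taking the lock and setting `c->locked = 1`, once with a busy-flag guard in front of the lock and once without.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

struct context {
    atomic_int busy;    /* reentrancy guard for this CPU's context */
    int locked;         /* 1 once this context holds the global-variable lock */
};

static struct context ctx;
static atomic_flag global_lock = ATOMIC_FLAG_INIT;
static int nested_rejected;  /* nested handlers turned away by the guard */
static int nested_blocked;   /* nested handlers that found the lock already held */

typedef void (*irq_fn)(void);

static bool try_lock(void) { return !atomic_flag_test_and_set(&global_lock); }
static void unlock(void)   { atomic_flag_clear(&global_lock); }

/* One probe handler; irq (if non-NULL) simulates an interrupt in the window. */
static void handler(bool use_guard, irq_fn irq)
{
    if (use_guard) {
        int expected = 0;
        if (!atomic_compare_exchange_strong(&ctx.busy, &expected, 1)) {
            nested_rejected++;      /* context busy: never reaches the lock */
            return;
        }
    }
    if (!try_lock()) {
        nested_blocked++;           /* a real spinning lock would wait here */
        goto out;
    }
    if (irq)
        irq();                      /* lock is held, but locked is still 0 */
    ctx.locked = 1;
    /* ... probe body would run here ... */
    unlock();
    ctx.locked = 0;
out:
    if (use_guard)
        atomic_store(&ctx.busy, 0);
}

static void irq_guarded(void)   { handler(true, NULL); }
static void irq_unguarded(void) { handler(false, NULL); }
```

With the guard, the nested handler is rejected before it touches the lock, which is the behaviour fche expects from the reentrancy mechanism. Without it, the nested handler finds the lock held by the code it interrupted, which is the scenario kerneltoast describes.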

21:46 <kerneltoast> maybe something's janky with the get_context rcu stuff

21:47 <kerneltoast> i'm not exactly sure why rcu is used there

21:49 derek088_ has joined #systemtap

21:53 derek088_ has quit [Remote host closed the connection]

21:54 derek0883 has quit [Ping timeout: 272 seconds]

21:54 <fche> commit 2699450d

21:55 <kerneltoast> no i mean why is rcu used at all

21:55 <kerneltoast> see: https://gist.github.com/kerneltoast/2c00e7be1e5f9250efea463f79ff8efb

21:57 <fche> yeah, that's plausible generally

21:57 <fche> but I believe the BZ1788662 related tests are still necessary - to prevent a certain type of reentrancy

21:58 <kerneltoast> the atomic_cmpxchg should protect from that

21:58 <irker042> systemtap: amerey systemtap.git:master * release-4.3-134-g91087d81d / man/stapprobes.3stap: man/stapprobes.3stap: Mention nd_syscall argument writing.

21:58 <fche> see also commit b5f8a8a64b6354e5

21:58 <fche> kerneltoast, not sure that's enough. You probably don't have access to the bz in question, but there was in fact some related problem

21:59 <agentzh> kerneltoast: looking into that patch.

21:59 <fche> basically if -any- code run by a stap probe handler invoked rcu-related functions, then WARNING: suspicious RCU usage kernel messages could result

21:59 derek0883 has joined #systemtap

22:00 <kerneltoast> fche, b5f8a8a64b6354e5 is also irrelevant, because the code was still using RCU at that point

22:00 <kerneltoast> rcu_assign_pointer(contexts[cpu], NULL);

22:01 <kerneltoast> if you use RCU inconsistently it'll scream of course

22:01 <kerneltoast> i'll brb in an hour

22:02 <kerneltoast> but anyway, it still died with this gist: https://gist.github.com/kerneltoast/2c00e7be1e5f9250efea463f79ff8efb

22:02 <kerneltoast> so this isn't the problem

22:02 <fche> kerneltoast's last gist seems to restore context allocation to just before PR20192's commit b5f8a8a64

22:03 <kerneltoast> my gist doesn't make use of RCU though

22:03 <fche> I understand

22:03 <fche> that's why I said --before-- PR20192's commit

22:03 <kerneltoast> the code just before that b5f commit had some RCU usage

22:03 <fche> that commit introduced rcu into this path

22:03 <kerneltoast> no it didn't, see: [02:00 PM] <kerneltoast> rcu_assign_pointer(contexts[cpu], NULL);

22:04 derek0883 has quit [Remote host closed the connection]

22:04 <kerneltoast> it was using RCU inconsistently

22:04 <fche> ah, there are some other older uses

22:04 <fche> commit 2d9786c1d9

22:05 <fche> six years ago, well there we go

22:05 <fche> if we can get rid of the rcu code in this path a la the latest gist, I think I'm all for it, but it'd definitely need lockdep etc. fuller testing overnight etc etc etc

22:06 derek0883 has joined #systemtap

22:09 <irker042> systemtap: sultan systemtap.git:master * release-4.3-135-g7db54199f / runtime/linux/task_finder2.c: task_finder2: fix memory leak when task workers fail to get added

22:12 * agentzh finds fche's release date is a great way to increase kerneltoast's productivity *grin*

22:12 <fche> and decrease mine :) thereby extending the release date :)

22:12 <agentzh> lol

22:12 <fche> but hey it's probably a good tradeoff

22:12 <agentzh> we should release often.

22:37 <fche> kerneltoast, you said it still died with your last patch variant 2c00e7be etc.

22:38 <fche> one extra diagnostic you could put in there is in the context_put function, check if c->locked is 1

22:38 <fche> there's no way it should be 1 (things still locked) by the time the context-put is run

22:38 <fche> could emit a BUG at that time

22:58 <fche> agentzh, kerneltoast, am trying to reproduce the bug here, but until I manage

22:58 <fche> mind trying https://paste.centos.org/view/bf62ff41 ?

22:59 <agentzh> fche: yeah, should be easy to reproduce with -j17 on a 8c16t box.

23:00 <fche> unfortunately by that time of the code, the probe_point/name are already nuked, but it'd be handy to know which one it was

23:00 * fche doesn't have a vm that big locally, but trying a biggish -j job

23:01 <kerneltoast> okay i'm back

23:01 <kerneltoast> seems like you want to fix this before the release eh

23:01 <kerneltoast> yes it still died with 2c00e7be

23:01 <kerneltoast> trying your patch now

23:01 <fche> well kernel hangs are bad m'kay

23:02 <kerneltoast> yeeeeep

23:02 <kerneltoast> they reel bad

23:02 <kerneltoast> no recovery

23:02 <kerneltoast> just deadbeef

23:05 <kerneltoast> fche, okay running with your centos paste

23:07 <fche> not sure it'd catch this case, but maybe

23:42 <kerneltoast> fche, your BUG didn't get hit

23:42 <fche> ok

23:42 <fche> but that hang still there?

23:43 <kerneltoast> yep still hung

23:47 <fche> hm the exact same test could be fired into the _get fn

23:48 <fche> it should not be 1 upon entry to a probe handler either (from a prior context that finished running, or naturally from a currently running one)

23:49 <kerneltoast> whaddya mean

23:50 <kerneltoast> shouldn't be locked and then acquire the context?

23:50 <fche> this is a different lock

23:50 <fche> c->locked should be 0 or 2 incoming and outgoing from a context_get and context_put fn

23:52 <kerneltoast> fche, the contexts are per-cpu, but the locks are not

23:52 <fche> https://paste.centos.org/view/70d7a786

23:53 <fche> I know, that's the point

23:53 <kerneltoast> yea so the context won't protect from another cpu

23:53 <fche> it need not

23:53 <fche> we're dealing with apparent reentrancy situation here

23:54 <fche> so the get_context() would've been somehow hit twice on the same cpu, or somehow returning the same context that was somehow ?!?! not marked busy properly

23:54 <fche> c->locked is not a lock

23:54 <fche> it's an indication whether the current context has acquired the stap global variable locks

23:54 <kerneltoast> running that paste now

23:55 <fche> so the theory goes, in the reentry case, this _get_context test for c->locked==1 should be another way for detecting suspected reentrancy
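
The diagnostic fche describes can be sketched in user-space C as below. This is not the content of either paste: `check_context()` and its message are hypothetical, and the only fact taken from the log is that `c->locked` is expected to be 0 or 2 at the get/put boundaries, so a value of 1 there is treated as a sign of suspected reentrancy.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical stand-in; only the locked field matters for this check. */
struct context {
    int locked;   /* per the log: 0 or 2 at get/put time, 1 while locks are held */
};

/*
 * Returns true if the context looks sane at a get/put boundary.
 * where names the call site ("get" or "put") for the diagnostic message.
 */
static bool check_context(const struct context *c, const char *where)
{
    assert(c != NULL);
    if (c->locked == 1) {
        /* A kernel build might emit a BUG or warning here; this sketch only logs. */
        fprintf(stderr, "suspected reentrancy in %s: c->locked == 1\n", where);
        return false;
    }
    return true;
}
```

A call such as `check_context(c, "get")` at the top of the get path and `check_context(c, "put")` in the put path would cover both boundaries discussed above.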