#systemtap on 2020-12-01 — irc logs at freenode.irclog.whitequark.org

2015-11-12 23:18 fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged

00:00 derek0883 has quit [Remote host closed the connection]

00:00 derek0883 has joined #systemtap

00:02 khaled has quit [Quit: Konversation terminated!]

00:50 derek0883 has quit [Remote host closed the connection]

00:51 derek0883 has joined #systemtap

00:55 derek0883 has quit [Remote host closed the connection]

00:55 derek0883 has joined #systemtap

00:58 derek0883 has quit [Remote host closed the connection]

00:58 derek0883 has joined #systemtap

01:14 derek0883 has quit [Remote host closed the connection]

01:14 derek0883 has joined #systemtap

01:17 hpt has joined #systemtap

01:18 derek0883 has quit [Remote host closed the connection]

01:19 derek0883 has joined #systemtap

01:19 derek0883 has quit [Remote host closed the connection]

01:20 derek0883 has joined #systemtap

01:24 derek0883 has quit [Remote host closed the connection]

01:24 derek0883 has joined #systemtap

01:33 derek088_ has joined #systemtap

01:33 derek0883 has quit [Remote host closed the connection]

01:38 derek0883 has joined #systemtap

01:38 derek088_ has quit [Remote host closed the connection]

01:46 <kerneltoast> fche, so i'm trying to pass the context to every print statement

01:47 <kerneltoast> it's all fine and dandy until we get to _stp_vlog inside runtime/linux/io.c

01:47 <kerneltoast> _stp_vlog is used by _stp_error, _stp_warn, _stp_softerror, _stp_dbug

01:48 <kerneltoast> and _stp_vlog needs the context pointer

01:48 <kerneltoast> which makes everything suddenly very messy

01:48 <kerneltoast> because we'd have to somehow get a context pointer to every [_stp_error, _stp_warn, _stp_softerror, _stp_dbug] call

01:49 <kerneltoast> ideas on what to do about that?

01:51 <fche> hmmmmmmmmmmmm

01:55 <kerneltoast> agreed.

01:56 <fche> so even if we were to extend all those apis to pass the context around, we'd have to be willing to GRAB one at every entrypoint into the runtime, for all probe point APIs (perf, kprobes, tracepoints, everything)

01:57 <fche> and our problem recently was that we couldn't safely (in light of reentrancy or whatever?) do the entryfn_get_context() at any context whatsoever?

01:59 <kerneltoast> entryfn_get_context() is safe to do in any context, but it's not reentrant. our problem is that there is a lot of reentrant print usage, and instead of introducing a new lock to handle the reentrancy protection, you suggested leaning on the context

02:00 <fche> yeah, that just means having to -get- a suitable context

02:00 <fche> "but it's not reentrant"

02:01 <fche> ............................. ..... but it should be it should be it should be

02:01 <fche> in the sense of safely blocking reentrancy

02:04 <kerneltoast> blocking reentrancy with the context is hard though

02:05 <kerneltoast> the next possible solution would be to get rid of the reentrancy dependencies within the print driver

02:05 <kerneltoast> and adding back our own reentrancy lock

02:05 <fche> blocking reentrancy is hard .... I'm sure we've treaded on this before, but ... why?

02:05 <kerneltoast> because of _stp_vlog

02:05 <fche> are we just using the wrong level of locking gadgetry to detect/prevent reentrancy safely?

02:06 <kerneltoast> it's hard because of that evil function

02:06 <kerneltoast> _stp_vlog

02:06 <fche> I mean within the entryfn_get_context function

02:06 <kerneltoast> ah well

02:06 <kerneltoast> prints are allowed to be used outside of probes where the context is held

02:06 <kerneltoast> such as in module_exit

02:06 <kerneltoast> we would have to track down all this errant usage

02:07 <kerneltoast> so there are prints that occur while the context isn't held

02:08 <kerneltoast> the compiler could track all of those down for us if we pass the context pointer around

02:08 <kerneltoast> you said you were semi certain that module_exit was the only place using prints without a context being held

02:08 <fche> yeah other than dbug & pals

02:10 <kerneltoast> my most recent non-invasive solution was to check if the context is held when printing and, if it isn't, acquire a reentrancy lock

02:11 <kerneltoast> but that's redundant because our friend mr context can cover the reentrancy protection for us

02:11 <kerneltoast> for the low price of our sanity

02:12 <kerneltoast> i'm tempted to roll with that

02:12 <kerneltoast> i.e., falling back to a reentrancy lock inside the print driver if the context isn't held

02:13 <kerneltoast> thoughts?

02:13 <fche> terminology nit:

02:13 <fche> if context is not held, we'd take the context the normal way

02:14 <fche> if the context is held, we'd take it "reentrantly" - i.e., just use it, assuming that the probe handler that grabbed it will keep the ref alive just fine (sort of kind of rcu like)

02:15 <kerneltoast> yes but if the context isn't held i don't wanna grab it

02:15 <fche> for the duration of an stp_vlog? why not?

02:15 <kerneltoast> instead i'd use a spin lock local to the print driver

02:15 <kerneltoast> because it makes semantics more annoying

02:16 <kerneltoast> we'd have to track if the print driver grabbed a context or not

02:16 <fche> hm yeah

02:16 <fche> ok

02:16 <fche> hmmmmmmmmmm

02:16 <fche> ok I'll wander off and see if another Cunning Plan appears

02:17 <fche> now as a separate matter

02:17 <kerneltoast> hah

02:17 <fche> how much would that switch to STP_BULKMODE on the runtime side help with all this?

02:17 <kerneltoast> it has nothing to do with this

02:17 <kerneltoast> :)

02:18 <kerneltoast> the goal right now is to fix panics and data corruption caused by unchecked print reentrancy

02:18 <kerneltoast> oh actually this does help with the bulkmode stuff

02:19 <kerneltoast> it ensures that per-cpu log buffers are only ever modified while irqs are disabled

02:19 <kerneltoast> which is a requirement for the userspace side of things

02:19 <kerneltoast> so that userspace can't read out the log buffer while a print is writing to it

02:19 <fche> userspace won't be running while an irq handler is running on that cpu

02:21 <kerneltoast> hmm yeah and disabling irqs isn't going to provide our reentrancy protection

02:21 <kerneltoast> guess i can change that to a preempt_disable

02:21 <kerneltoast> oh nvm i should keep irqs disabled while printing

02:22 <kerneltoast> so that only NMIs have the possibility of their prints getting dropped

02:22 <kerneltoast> and then it helps with the bulkmode stuff because disabling irqs implies disabling preempt

02:22 <kerneltoast> voila

02:24 <fche> well anyway I'll wander off now so I don't get confused and pass on that virus

02:24 <fche> see ya in oh 9 or so hours :-)

02:25 <kerneltoast> sweet dreams

02:25 <kerneltoast> maybe you'll find the elegant solution to all this in your subconscious

02:25 <fche> will be dreaming of locks and semaphores and that annoying neighbour who keeps on interrupting

02:25 <fche> NO PROMISES :-)

02:25 modem has quit [Read error: Connection reset by peer]

02:26 <kerneltoast> Nathan Maxwell Irwin

02:26 <kerneltoast> N.M.I.

02:26 modem has joined #systemtap

02:26 <kerneltoast> the worst neighbor

02:27 <fche> with an infinity of clones

02:28 derek0883 has quit [Remote host closed the connection]

02:28 derek0883 has joined #systemtap

02:48 derek0883 has quit [Remote host closed the connection]

02:48 derek0883 has joined #systemtap

03:09 derek0883 has quit [Ping timeout: 260 seconds]

03:18 derek0883 has joined #systemtap

03:23 derek0883 has quit [Remote host closed the connection]

03:23 derek0883 has joined #systemtap

03:28 derek0883 has quit [Ping timeout: 272 seconds]

04:21 orivej has quit [Ping timeout: 264 seconds]

04:21 orivej has joined #systemtap

04:36 derek0883 has joined #systemtap

04:37 derek0883 has quit [Remote host closed the connection]

04:50 derek0883 has joined #systemtap

05:41 derek0883 has quit [Remote host closed the connection]

06:15 derek0883 has joined #systemtap

06:16 derek0883 has quit [Remote host closed the connection]

06:22 derek0883 has joined #systemtap

06:32 derek0883 has quit [Ping timeout: 265 seconds]

07:38 orivej has quit [Ping timeout: 272 seconds]

07:49 khaled has joined #systemtap

11:03 orivej has joined #systemtap

11:12 hpt has quit [Ping timeout: 260 seconds]

12:00 mjw has joined #systemtap

14:57 tromey has joined #systemtap

15:09 modem has quit [Ping timeout: 240 seconds]

15:17 modem has joined #systemtap

15:46 modem has quit [Ping timeout: 260 seconds]

15:53 modem has joined #systemtap

15:57 khaled has quit [Remote host closed the connection]

15:58 khaled has joined #systemtap

17:01 derek0883 has joined #systemtap

17:13 _whitelogger has joined #systemtap

17:48 derek0883 has quit [Remote host closed the connection]

17:48 derek0883 has joined #systemtap

18:02 derek0883 has quit [Remote host closed the connection]

18:05 derek0883 has joined #systemtap

18:18 <irker576> systemtap: sultan systemtap.git:master * release-4.4-13-gbb25d64f7 / runtime/linux/runtime_context.h: runtime_context: synchronize _stp_context_stop more strictly

18:18 irker576 has joined #systemtap

18:19 <kerneltoast> fche, i pushed a small change to fix the only nitpicks i saw with my un-RCU patch

18:19 <kerneltoast> if we're lucky, it'll make the buildbots happy

18:19 <fche> will report

18:19 <fche> thanks

18:22 derek0883 has quit [Remote host closed the connection]

18:25 derek0883 has joined #systemtap

18:25 derek0883 has quit [Remote host closed the connection]

18:37 wcohen has quit [Read error: Connection reset by peer]

18:37 wcohen has joined #systemtap

18:44 derek0883 has joined #systemtap

18:51 derek0883 has quit [Remote host closed the connection]

18:51 derek0883 has joined #systemtap

18:55 derek0883 has quit [Remote host closed the connection]

18:56 derek0883 has joined #systemtap

19:12 derek0883 has quit [Remote host closed the connection]

19:12 derek0883 has joined #systemtap

19:24 derek0883 has quit [Remote host closed the connection]

19:24 derek0883 has joined #systemtap

19:31 derek0883 has quit [Read error: Connection reset by peer]

19:31 derek0883 has joined #systemtap

19:38 derek0883 has quit [Ping timeout: 240 seconds]

19:43 derek0883 has joined #systemtap

19:52 derek0883 has quit [Ping timeout: 240 seconds]

19:53 derek0883 has joined #systemtap

19:58 derek0883 has quit [Remote host closed the connection]

19:59 derek0883 has joined #systemtap

20:18 derek0883 has quit [Remote host closed the connection]

20:18 derek0883 has joined #systemtap

20:19 mjw has quit [Quit: Leaving]

20:21 <fche> kerneltoast, no final results yet

20:21 <fche> but one failure (pr14546.exp on a rhel8 buildbot) that may or may not be related

20:39 derek0883 has quit [Ping timeout: 260 seconds]

20:52 <kerneltoast> related to the change i pushed today or the un-RCU in general?

20:59 <fche> not sure

20:59 <fche> hm the rhel8 buildbot survived the un-rcu patch

20:59 <fche> from last week

20:59 <fche> rerunning the testsuite against the current git/master now

21:00 <kerneltoast> hum hum

21:00 <kerneltoast> fche, btw this is a promising print patch: https://gist.github.com/kerneltoast/26a1e07ceda8bc72b0ff60543d29bcb1

21:00 <fche> mm dumm dee dummm hummm

21:00 <kerneltoast> it's running through the testsuite atm

21:01 <kerneltoast> hoping this is all good so i can proceed to the bulkmode stuff

21:02 <fche> can you elaborate on why _stp_print_lock can't be STP_DEF*'d ? I believe the intent there is to use raw spinlocks even on -rt kernels. Why is that a problem?

21:04 <kerneltoast> because prints need to be reentrant

21:05 <fche> sorry why?

21:06 <fche> "to be reentrant" ?

21:07 <kerneltoast> because the print api is heavily bootstrapped and some print functions are implemented using other print functions

21:10 <kerneltoast> a prominent example is _stp_vsnprintf

21:10 <kerneltoast> when the buffer given to _stp_vsnprintf is NULL, _stp_vsnprintf will use the print log buffer

21:11 <fche> ok go on, still not seeing the reentrance

21:14 <kerneltoast> _stp_printf uses _stp_vsnprintf, and _stp_vsnprintf needs to acquire _stp_print_lock when the buffer given to it is NULL, which _stp_printf does

21:14 <kerneltoast> then _stp_printf is nested in a bunch of places where the log buffer is manually accessed with _stp_reserve_bytes

21:15 <kerneltoast> and then there are nested _stp_error usages as well, such as in _stp_vsnprintf itself

21:17 <fche> eww, doesn't seem as though stp_vsnprintf should make any error calls that could conceivably result in infinite recursion

21:20 <kerneltoast> there's no recursion though

21:25 <fche> what is the lowest level function that talks straight to the data structures that require concurrency control

21:28 <kerneltoast> btw, the STP_DEF* for rw locks doesn't affect the trylock api

21:28 <kerneltoast> and we're only ever trylocking _stp_print_lock

21:35 derek0883 has joined #systemtap

21:36 <kerneltoast> fche, i hit a deadlock while the testsuite was running on that patch. but the deadlock is unrelated to the patch

21:37 <kerneltoast> check it out: https://gist.github.com/kerneltoast/26a1e07ceda8bc72b0ff60543d29bcb1#file-unrelated-deadlock-in-testsuite-txt

21:38 tromey has quit [Quit: ERC (IRC client for Emacs 27.1)]

21:39 orivej has quit [Ping timeout: 260 seconds]

21:40 <kerneltoast> lock_acquire() has tracepoints

21:40 <kerneltoast> i think that's related to the bug from the un-RCU patch

21:40 <kerneltoast> that's just evil

21:40 <fche> yes

21:40 <fche> yes, I think I pointed that out last week or ish --- that was one source of our reentrancy problems

21:41 <kerneltoast> yeah you did but the backtrace was kaput

21:41 <fche> trust the frank

21:41 <fche> (sometimes)

21:41 <fche> ok

21:41 <fche> I'm starting to think that getting rid of that mutex locking goo from a few months ago is rather urgent

21:41 <fche> sorry agentzh I hate you now

21:41 <fche> just a little

21:41 <fche> and stp_bulkmode should fix that, AIUI

21:42 <fche> FIGURES agentzh is not here now

21:42 <kerneltoast> haha

21:42 derek0883 has quit [Remote host closed the connection]

21:42 <fche> slacker

21:42 derek0883 has joined #systemtap

21:42 lindi- has quit [Ping timeout: 260 seconds]

21:43 agentzh has joined #systemtap

21:43 <kerneltoast> agentzh, fche wants to chew you out :P

21:44 <agentzh> kerneltoast: did i miss anything here?

21:44 <fche> nothing important just me hating you a little

21:44 <kerneltoast> [1:41:34 pm] <fche> I'm starting to think that getting rid of that mutex locking goo from a few months ago is rather urgent

21:44 <kerneltoast> [1:41:43 pm] <fche> sorry agentzh I hate you now

21:44 <kerneltoast> [1:41:47 pm] <fche> just a little

21:44 <agentzh> oh yeah

21:44 <kerneltoast> agentzh, the revelation of the day is this:

21:44 <kerneltoast> [1:40:00 pm] <kerneltoast> lock_acquire() has tracepoints

21:45 <kerneltoast> kernel locks have tracepoints

21:45 <agentzh> yeah yeah yeah

21:45 <kerneltoast> fche foresaw this last week but it's definitely confirmed now

21:45 <fche> forgot: "trust the frank (sometimes)"

21:46 <fche> important for the documentary record

21:46 <kerneltoast> agentzh, this is what happened: https://gist.github.com/kerneltoast/26a1e07ceda8bc72b0ff60543d29bcb1#file-unrelated-deadlock-in-testsuite-txt

21:48 <kerneltoast> "This changes everything."

21:50 derek0883 has quit [Ping timeout: 240 seconds]

21:54 derek0883 has joined #systemtap

21:58 derek0883 has quit [Remote host closed the connection]

21:58 derek0883 has joined #systemtap

22:13 <kerneltoast> fche, shall i replace my rwlocks with atomics?

22:17 <kerneltoast> fche, also, the lock tracepoints only happen when CONFIG_DEBUG_LOCK_ALLOC=y

22:21 <kerneltoast> for spin type locks at least

22:28 lindi- has joined #systemtap

22:29 <agentzh> kerneltoast: yeah we were already aware of that thing. mutex_trylock() is bad.

22:29 <agentzh> so what's new?

22:30 <agentzh> do we have a proper bugfix patch for it?

22:30 <kerneltoast> mutex_trylock() is bad but i didn't know mutex_trylock() had a tracepoint inside it

22:30 <kerneltoast> and the spin locks have tracepoints inside of them too

22:30 <agentzh> oh, i didn't know either.

22:30 <agentzh> that's crazy.

22:31 <kerneltoast> that backtrace i linked you is an example of the mutex_trylock() tracepoint deadlocking

22:31 <agentzh> ah, okay, i see.

22:31 demon000_ has joined #systemtap

22:31 <agentzh> fche: please don't hate me. i got your approval for that patch ;)

22:32 demon000_ is now known as tanislav000

22:32 <tanislav000> Heyo

22:32 <kerneltoast> yoyo

22:32 <fche> agentzh, I hate myself too :)

22:33 <tanislav000> Who doesn't

22:33 <agentzh> :D

22:33 <fche> no body loves me / everybody hates me / I'm gonna eat some worms

22:33 <tanislav000> Enjoy your meal

22:33 <agentzh> please don't.

22:33 <fche> agentzh, we were talking about getting rid of that inode mutex lock goo

22:33 <fche> by going to STP_BULKMODE on the kernel side

22:33 <agentzh> sure, i want that thing gone for good too.

22:34 <fche> so there's no contention

22:34 <agentzh> yep, i saw that conversation.

22:34 <agentzh> it's a very good direction.

22:34 <agentzh> it solves the cpu 0 contention too.

22:34 <agentzh> in nonbulk mode currently.

22:35 <agentzh> i can't wait to see it happens :)

22:35 <agentzh> kerneltoast has been working on these items for some while already.

22:35 <kerneltoast> we also need to fix the panics caused by printing inside an IRQ

22:35 <agentzh> yep

22:35 <kerneltoast> which has been a tougher problem to solve cleanly

22:35 <agentzh> sadly my irssi died and i lost a lot of conversations.

22:35 <fche> methinks the fixes are interrelated

22:35 <agentzh> probably.

22:35 <kerneltoast> and now i need to rework the patch again because i can't use locks lol

22:36 <agentzh> just noted that today.

22:36 <agentzh> pity.

22:36 <agentzh> do we have some public irc logs somewhere?

22:36 <kerneltoast> and i need to rework the un-RCU patch i pushed to fix fche's deadlocks

22:36 <agentzh> kerneltoast: ah. that's life :)

22:36 * fche logs EVERYTHING but am chemically restrained from sharin git

22:37 <agentzh> kerneltoast: that's quite a long work queue :)

22:37 <kerneltoast> yeah...

22:37 <agentzh> fche: we've just got another kernel developer onboard to join the stap party.

22:37 <kerneltoast> for every bug fixed, a new one is created

22:37 <agentzh> he's kerneltoast's friend.

22:37 <agentzh> tomorrow will be his first day.

22:38 <agentzh> an important part of his onboard meeting is how to bug fche on IRC.

22:38 <agentzh> :D

22:38 <tanislav000> That might be true

22:38 <tanislav000> I mean agentzh didn't directly tell me who to bug

22:39 <tanislav000> But it makes sense that you're the one, fche

22:39 <kerneltoast> tanislav000, direct all your rage to fche as you see fit

22:39 <agentzh> tanislav000: are you demon00000?

22:39 <tanislav000> yeah

22:39 <agentzh> oh, great.

22:39 <tanislav000> demon000 was taken

22:39 <agentzh> i was not aware.

22:39 <agentzh> i never count zeros though ;)

22:39 <kerneltoast> you can reserve nicknames on freenode btw

22:39 <tanislav000> i'm new to irc

22:40 <tanislav000> I don't think I was even born when this thing was created

22:40 <agentzh> guys, let's make sure fche never sleeps! :D

22:40 <fche> that's not too hard

22:40 <kerneltoast> we can bug fche in multiple timezones now

22:40 <fche> wait ALL y'all on one torture team?

22:41 <kerneltoast> you bet

22:41 <tanislav000> I guess

22:41 <kerneltoast> we're actually 3 monkeys in a trench coat

22:41 <fche> if this is some scheme to drive me to early retirement, I must warn you - IT MIGHT WORK

22:42 <tanislav000> I think the opposite is what we want

22:42 <tanislav000> You should never retire

22:42 * kerneltoast bbl

22:43 orivej has joined #systemtap

22:43 * tanislav000 wants agentzh to check his slack so I can continue getting started

23:04 tonyj has quit [Ping timeout: 260 seconds]

23:11 orivej has quit [Ping timeout: 240 seconds]

23:34 fLiPr3VeRsE has quit [Ping timeout: 240 seconds]

23:34 fLiPr3VeRsE has joined #systemtap

23:52 amerey has quit [Quit: Leaving]

23:56 tonyj has joined #systemtap