fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
lijunlong has quit [Read error: Connection reset by peer]
lijunlong has joined #systemtap
<kerneltoast>
fche, i have results for the rcu cleanup patch
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
hpt has joined #systemtap
<fche>
kerneltoast, concur, thanks!
khaled has quit [Quit: Konversation terminated!]
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
lijunlong has quit [Ping timeout: 256 seconds]
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
lijunlong has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
<irker573>
systemtap: sultan systemtap.git:master * release-4.4-11-g3c4f82ca0 / runtime/linux/runtime_context.h: runtime_context: factor out RCU usage using a rw lock
irker573 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
khaled has joined #systemtap
lijunlong has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
lijunlong has joined #systemtap
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
irker573 has quit [Quit: transmission timeout]
derek0883 has quit [Ping timeout: 272 seconds]
lijunlong has quit [Remote host closed the connection]
orivej has quit [Ping timeout: 246 seconds]
zamba has quit [Quit: WeeChat 2.4]
hpt has quit [Ping timeout: 256 seconds]
mjw has joined #systemtap
orivej has joined #systemtap
derek0883 has joined #systemtap
derek0883 has quit [Ping timeout: 260 seconds]
amerey has joined #systemtap
<fche>
kerneltoast, some of our buildbots are not happy after this patch, showing apparent hangs/crashes during testing
<fche>
hm, one of the on-the-fly tests is common to the crashes (whether on f33, f32, or f34 rawhide x86-64)
<kerneltoast>
fche, could i get a log?
<fche>
haven't seen one; I think my kernels are hanging and a watchdog is rebooting them within 30ish seconds
<fche>
make installcheck RUNTESTFLAGS=uprobes_onthefly.exp
<kerneltoast>
fche, can _stp_runtime_context_wait() be called from an NMI?
<kerneltoast>
I'm guessing no
<kerneltoast>
Because it has an msleep
<fche>
no
<fche>
that's a shutdown-time process from a clean user context
<kerneltoast>
that was the only hazard i saw in my patch when i made it
<kerneltoast>
i wonder what's exploding
<fche>
that uprobes_onthefly test has been a good stresser of a bunch of runtime subsystems
<kerneltoast>
what's the oldest kernel you're seeing the issue on?
<fche>
5.9 something
<kerneltoast>
oldest is 5.9??
<kerneltoast>
o_O
<fche>
something like that. the buildbots passed on stuff older than fedora32's kernels
<kerneltoast>
so only 5.9 explodes? ouch
<fche>
can't say 'only', only reporting what I've seen so far
<kerneltoast>
i have a theory
<kerneltoast>
watchdog could fire if the write_lock_irqsave(&_stp_context_lock, flags); takes too long
<kerneltoast>
fche, could you test a patch for me?
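[editor's note] kerneltoast's theory above is that a writer calling write_lock_irqsave() can stall for as long as readers hold the lock. A minimal user-space sketch of that effect follows, assuming a POSIX system. It is not SystemTap or kernel code: a pthread_rwlock_t stands in for the kernel rwlock, and the names demo_lock, reader_thread and writer_wait_ms are hypothetical. The kernel writer would be spinning with local interrupts disabled during the same wait, which is what could upset a watchdog if the theory holds; here the writer simply blocks and the wait is measured.

```c
#define _POSIX_C_SOURCE 200809L
#include <pthread.h>
#include <sched.h>
#include <stdatomic.h>
#include <time.h>

/* Hypothetical stand-in for the kernel rwlock; not the real _stp_context_lock. */
static pthread_rwlock_t demo_lock = PTHREAD_RWLOCK_INITIALIZER;
static atomic_int reader_has_lock;

struct reader_args { unsigned hold_ms; };

/* Reader: take the lock for reading, announce it, hold it for hold_ms. */
static void *reader_thread(void *p)
{
    const struct reader_args *args = p;
    struct timespec hold = {
        args->hold_ms / 1000,
        (long)(args->hold_ms % 1000) * 1000000L
    };

    pthread_rwlock_rdlock(&demo_lock);
    atomic_store(&reader_has_lock, 1);
    nanosleep(&hold, NULL);
    pthread_rwlock_unlock(&demo_lock);
    return NULL;
}

static long elapsed_ms(const struct timespec *a, const struct timespec *b)
{
    return (long)(b->tv_sec - a->tv_sec) * 1000L
         + (b->tv_nsec - a->tv_nsec) / 1000000L;
}

/* Returns how many milliseconds the writer was stuck waiting for the
 * reader to drop the lock, or -1 if the reader thread could not start. */
long writer_wait_ms(unsigned reader_hold_ms)
{
    struct reader_args args = { reader_hold_ms };
    struct timespec t0, t1;
    pthread_t tid;

    atomic_store(&reader_has_lock, 0);
    if (pthread_create(&tid, NULL, reader_thread, &args) != 0)
        return -1;

    /* Make sure the reader really owns the lock before timing the writer. */
    while (!atomic_load(&reader_has_lock))
        sched_yield();

    clock_gettime(CLOCK_MONOTONIC, &t0);
    pthread_rwlock_wrlock(&demo_lock);   /* waits until the reader unlocks */
    clock_gettime(CLOCK_MONOTONIC, &t1);
    pthread_rwlock_unlock(&demo_lock);

    pthread_join(tid, NULL);
    return elapsed_ms(&t0, &t1);
}
```

Calling writer_wait_ms(200) should report a wait of roughly 200 ms, because the writer cannot proceed until the only reader lets go. The sketch shows only the dependency of the writer's wait on reader hold time; it says nothing about how long readers actually hold the lock in the real runtime.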