fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Ping timeout: 246 seconds]
derek0883 has joined #systemtap
thibaultcha has joined #systemtap
thibaultcha has quit [Changing host]
thibaultcha has joined #systemtap
derek0883 has quit [Remote host closed the connection]
<agentzh>
kerneltoast: any time difference in your test suite runs this time?
hpt has joined #systemtap
derek0883 has joined #systemtap
khaled has quit [Quit: Konversation terminated!]
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Ping timeout: 272 seconds]
derek0883 has joined #systemtap
sscox has quit [Ping timeout: 258 seconds]
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Ping timeout: 256 seconds]
orivej has quit [Ping timeout: 260 seconds]
derek0883 has joined #systemtap
derek0883 has quit [Ping timeout: 260 seconds]
orivej has joined #systemtap
lijunlong has quit [Ping timeout: 265 seconds]
lijunlong has joined #systemtap
khaled has joined #systemtap
mjw has joined #systemtap
hpt has quit [Ping timeout: 265 seconds]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
_whitelogger has joined #systemtap
derek0883 has joined #systemtap
orivej has quit [Ping timeout: 260 seconds]
derek0883 has quit [Ping timeout: 260 seconds]
mjw has quit [Ping timeout: 264 seconds]
mjw has joined #systemtap
amerey has joined #systemtap
sscox has joined #systemtap
sscox has quit [Ping timeout: 260 seconds]
tromey has joined #systemtap
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
xar- has quit [*.net *.split]
xar- has joined #systemtap
mjw has quit [Ping timeout: 264 seconds]
mjw has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
<kerneltoast>
fche, happy friday
<kerneltoast>
i ran the testsuite with the -debug kernel on fedora
<kerneltoast>
my laptop freezes after 5 minutes, and this occurs *without* all of my recent rcu changes
sscox has joined #systemtap
<agentzh>
fche: does your buildbot or testbot farm have such debug kernels?
<fche>
hi kerneltoast agentzh
<fche>
I believe at least our rawhide buildbot runs a -debug kernel
<fche>
kerneltoast, what's the last test that succeeded?
<kerneltoast>
fche, nd_sys.stp was the most recent module loaded according to dmesg, but my dmesg is truncated due to the freeze
<kerneltoast>
i'm going to try and get a kdump now. it's been 100% reproducible thus far
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
orivej has joined #systemtap
<fche>
nd_sys
<fche>
interesting......
<fche>
that should be pretty vanilla tracepoint-based capture
<fche>
yeah a kdump / backtrace would be helpful
<fche>
with any luck it's a minor thing
<kerneltoast>
i've been trying to get a kdump but the capture kernel is not getting booted for some reason
<kerneltoast>
it works fine with a sysrq panic
<kerneltoast>
maybe this is some kind of deadlock that isn't causing a panic or an oops
<kerneltoast>
nd_sys is always the last entry in my truncated dmesg
tromey` has joined #systemtap
tromey has quit [Ping timeout: 260 seconds]
<fche>
a sysrq-t or sysrq-w & netconsole should give a usable backtrace
<kerneltoast>
sysrq doesn't respond
<kerneltoast>
i switched to the fb console in the hopes that i'd see a message when it froze, but no such luck
<fche>
sysrq enabled tho?
<fche>
netconsole is really good in case you're not using it already
<kerneltoast>
yep, sysrq works fine while the laptop is still responding
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
tromey` has quit [Quit: ERC (IRC client for Emacs 27.1.50)]
derek0883 has quit [Ping timeout: 258 seconds]
orivej has quit [Ping timeout: 260 seconds]
mjw has quit [Quit: Leaving]
derek0883 has joined #systemtap
modem has quit [*.net *.split]
lkthomas_ has quit [*.net *.split]
lkthomas_ has joined #systemtap
modem has joined #systemtap
amerey has quit [Remote host closed the connection]
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
<kerneltoast>
fche, 5.6.6-debug works
<fche>
no hang/etc.?
<kerneltoast>
so i guess the 5.8.15-debug kernel is fubar
<kerneltoast>
yeah no hang
<kerneltoast>
maybe it's just a case of some amd driver regression
<fche>
anything just slightly older or newer than that kernel?
<kerneltoast>
there's a 5.8.16 kernel in testing
<kerneltoast>
i can't find anything between 5.6.6 and 5.8.15 (5.6.6 is the f32 release kernel)
<kerneltoast>
maybe all the kernels between 5.6.6 and 5.8.15 are archived somewhere, but i haven't found anything trawling google
<kerneltoast>
(I'm not too familiar with fedora)
<fche>
koji.fedoraproject.org has every build in history :)
<fche>
even ones not quite released yet
<kerneltoast>
the 5.8.16 changelog is tiny, that probably won't be any different
<kerneltoast>
fche, I'm not seeing kernel-debug on koji
khaled has quit [Remote host closed the connection]
<kerneltoast>
fche, also the only complaint from lockdep is that MAX_LOCKDEP_CHAIN_HLOCKS is too low
<fche>
it's a subpackage of kernel
<kerneltoast>
fche, oh never mind, 5.6.6 froze
<kerneltoast>
it just took a bit longer
<kerneltoast>
i guess my laptop sucks?
<fche>
yes, please send it to me immediately :)
<fche>
any sysrq or netconsole leftovers?
<kerneltoast>
i didn't have sysrq enabled right now
<kerneltoast>
but i bet it wouldn't have worked, like last time
<fche>
the humanity, the humanity!
<fche>
netconsole is also your friend
<kerneltoast>
funny thing, i actually want to get rid of this laptop...
<kerneltoast>
what can be done with netconsole?
<fche>
it tells the kernel to send kmsg content live to a nearby syslog server via udp
<kerneltoast>
oh it sends printks over udp
<fche>
it operates at a very low level, even during panics
<kerneltoast>
lower than fbcon?
<fche>
maybe, not sure.
<kerneltoast>
fbcon is usually the go-to for displaying panics
<kerneltoast>
and that did not work here
<fche>
worth a shot.
<kerneltoast>
does it take a while to set up?
<fche>
one syslogd server on the network
<fche>
then /etc/sysconfig/netconsole with some ip addresses
<fche>
then systemctl enable --now netconsole
<fche>
and BOB IS YOUR UNCLE
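[editor's note] For readers without a syslogd handy, the receiving side of the setup fche describes can be approximated with a few lines of Python, since netconsole just sends kernel messages as UDP datagrams. This is a minimal sketch, not what anyone in the channel actually ran. The function name `receive_netconsole` and its parameters are made up for illustration. Port 6666 is the default target port given in the kernel's netconsole documentation, but a distribution's service wrapper may be configured for a different one (a syslog port, for instance), so treat the port as an assumption and match it to whatever the sending machine uses.

```python
import socket


def receive_netconsole(bind_addr="0.0.0.0", port=6666, max_messages=None,
                       timeout=None, ready=None):
    """Print and collect UDP datagrams sent by a netconsole-style sender.

    bind_addr / port : where to listen (assumed values, must match the sender)
    max_messages     : stop after this many datagrams (None = run forever)
    timeout          : seconds to wait for a datagram before giving up
    ready            : optional threading.Event, set once the socket is bound
    """
    messages = []
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.bind((bind_addr, port))
        sock.settimeout(timeout)
        if ready is not None:
            ready.set()
        while max_messages is None or len(messages) < max_messages:
            try:
                data, (src_ip, _src_port) = sock.recvfrom(65535)
            except socket.timeout:
                break  # nothing arrived within the timeout window
            # Kernel messages are normally ASCII; replace anything undecodable.
            text = data.decode("utf-8", errors="replace").rstrip("\n")
            # Flush immediately so the last lines before a freeze are not lost.
            print(f"[{src_ip}] {text}", flush=True)
            messages.append((src_ip, text))
    return messages
```

Calling `receive_netconsole()` with no arguments on another machine on the same network listens until interrupted. Redirecting its output to a file keeps whatever the frozen machine managed to send before it stopped responding.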
<kerneltoast>
and i'm guessing wifi won't work
<fche>
not sure.
<kerneltoast>
this crummy laptop does have an rj45 slot