fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
mjw has quit [Quit: Leaving]
derek0883 has quit [Ping timeout: 240 seconds]
derek0883 has joined #systemtap
_whitelogger has joined #systemtap
orivej has quit [Quit: orivej]
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
orivej has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
khaled has quit [Quit: Konversation terminated!]
hpt has joined #systemtap
_whitelogger has joined #systemtap
derek088_ has joined #systemtap
derek0883 has quit [Ping timeout: 264 seconds]
derek088_ has quit [Ping timeout: 240 seconds]
derek0883 has joined #systemtap
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
orivej has quit [Ping timeout: 264 seconds]
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
derek0883 has quit [Ping timeout: 260 seconds]
hpt has quit [Ping timeout: 260 seconds]
orivej has joined #systemtap
mjw has joined #systemtap
khaled has joined #systemtap
orivej has quit [Ping timeout: 240 seconds]
orivej has joined #systemtap
amerey has joined #systemtap
amerey has quit [Quit: Leaving]
amerey has joined #systemtap
derek0883 has joined #systemtap
<kerneltoast>
fche, so it wasn't just a fluke: the bulkmode patch makes the testsuite take 2.5x longer to run
<fche>
interesting
<fche>
maybe a startup or shutdown weird-delay thing?
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
<kerneltoast>
there are a lot of timeouts like this: FAIL: tcptest startup (timeout)
<kerneltoast>
guess i'll see why that happens
<fche>
ok, that's a good thing
<fche>
things are not just slower to run but something is temporarily stuck
<kerneltoast>
so for something like this: FAIL: add startup (timeout)
<kerneltoast>
how does the testsuite separate the startup from the test? what part of add.exp is "startup"?
<fche>
look at add.exp
<fche>
it calls some other dejagnu (tcl) procedure
<fche>
and that bad boy (probably in testsuite/lib/*) prints that "startup (timeout)" message
<fche>
looking at relay_wakeup_readers and related state
<fche>
also can strace -f staprun/stapio that is running add.stp
<kerneltoast>
stapio is just polling
<fche>
yup, but not taking the data from that probe begin
<fche>
./stap -v -e 'probe begin {log("hi") }' <<< visible there too
<fche>
so some wakeup is missing
<kerneltoast>
hmmmmmmmmmmm
<kerneltoast>
seems like this code really wants __stp_relay_switch_subbuf() to be called
<fche>
every now and then
<fche>
we have that wakeup timer thing, wonder why it's not enough / doing its job
<kerneltoast>
the timer does go off
<kerneltoast>
_stp_relay_data.wakeup isn't getting set
<fche>
__stp_relay_switch_subbuf is the only place that can do that
<kerneltoast>
right but i have no idea what "switching the subbuffer" means
<kerneltoast>
or when we should do it
<kerneltoast>
any ideas about that?
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
<fche>
just comparing master vs your branch
derek0883 has quit [Ping timeout: 264 seconds]
<fche>
stap -v -e 'probe timer.ms(1) {log("hi") }' <<< interesting: no output for a while, then (once whatever buffer fills), staprun does finally get the hint and prints it out
<kerneltoast>
yeah it looks like the _stp_get_rchan_subbuf() logic is configured to flush when the buffer is full
derek0883 has joined #systemtap
<kerneltoast>
it's gotta be this subbuf switch stuff