fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
<kerneltoast>
oh turns out the thing i did earlier fixed it
<kerneltoast>
but
<kerneltoast>
i was running add.stp on its own
<kerneltoast>
so i didn't notice it was fixed
<kerneltoast>
hah
<kerneltoast>
i'm dumb
<kerneltoast>
{
<kerneltoast>
static int _stp_data_write_commit(void *entry)
<irker962>
systemtap: sultan systemtap.git:sultan/bulkmode2 * release-4.4-26-g5c1b84a8d / runtime/print_flush.c runtime/transport/relay_v2.c runtime/transport/transport.c runtime/transport/transport.h staprun/relay.c: always use per-cpu bulkmode relayfs files to communicate with userspace
<irker962>
systemtap: sultan systemtap.git:sultan/bulkmode2 * release-4.4-27-g5836a314d / tapset-timers.cxx: Revert "REVERTME: tapset-timers: work around on-the-fly deadlocks caused by mutex_trylock"
<kerneltoast>
fche, check out sultan/bulkmode2
<kerneltoast>
i'm gonna run it through the testsuite now since it works with add.exp
<kerneltoast>
i'll let you know how it goes in about 12000 seconds
<kerneltoast>
(that's how long the testsuite takes to run, i just check dmesg for the last stap log)
derek0883 has quit [Remote host closed the connection]
derek0883 has joined #systemtap
orivej has quit [Ping timeout: 240 seconds]
<agentzh>
hopefully we can run the stap test suite in parallel soon.
<agentzh>
right now it's really a struggle.
<fche>
hey the new code might work well enough in parallel mode too
<fche>
patience is not a struggle
<fche>
you just need to hold the sleeping pigeon pose for 666 minutes
<fche>
breathe in through the nose, out through the mouth
<fche>
contract and relax all the muscles around the vagus nerve
<fche>
and by the end of it all
<fche>
you'll have testsuite results
<fche>
hope this helps
<fche>
kerneltoast, I see what you did and can see why it should work
<fche>
but not sure why it wasn't there before
<agentzh>
kerneltoast: maybe we could split the test suite and run each subset of files in separate kvm guests...
<agentzh>
that way we can utilize all the phy cpu cores...
<kerneltoast>
you mean to speed up serial mode?
<agentzh>
yes
<kerneltoast>
not sure if previous tests in the testsuite influence the rest of the results though
<agentzh>
but seems like bulkmode is near the corner, probably we can do parallel runs directly anyway.
<kerneltoast>
seems like we're going to be from serial mode soon anyway
<kerneltoast>
*free from serial mode
<agentzh>
aye
<agentzh>
kerneltoast: you got fun things to do while waiting for the test results? ;)
<kerneltoast>
yeah i've got a big fat merge to work on
<agentzh>
oh yeah that one.
<agentzh>
cool
<agentzh>
fche: the thing is that our customers is very impatient. so we really need to get these bugs behind us ASAP ;)
<agentzh>
*are
<kerneltoast>
lucky me that i've never had to face our customers :P
<agentzh>
oh boy, you are protected :)
<kerneltoast>
agentzh, we still might have the probe_lock lockup stopping us from parallel runs, and fche said his buildbots were still dying even with the bulkmode patch
<kerneltoast>
so there's no shortage of bugs :)))
<kerneltoast>
fche, yeah i have no idea why that code was not there before, nor how it was working before...
<agentzh>
still dying with your latest bulkmode2 branch?
<agentzh>
maybe bulk mode was never exercised much.
<agentzh>
since the default was never bulk.
<agentzh>
it's a relatively new territory.
<kerneltoast>
agentzh, bulkmode2 just fixes a problem where prints weren't being flushed. bulkmode1 got rid of the mutex_trylock, so the buildbots should not be suffering from the mutex_trylock bug anymore
<agentzh>
that's good news.
<agentzh>
hopefully they can get merged soon.
<agentzh>
once the test suite is green enough.
<kerneltoast>
yeah bulkmode2 should be good to go once the testsuite is all green
<kerneltoast>
4000 seconds left
<agentzh>
yeah, probe lock is still out there.
orivej has joined #systemtap
<fche>
I'll rerun my local tests against the new branch
irker962 has quit [Quit: transmission timeout]
<kerneltoast>
oh boy, the testsuite is taking longer than 12000 seconds
<kerneltoast>
i hope it's not more timeouts
<fche>
things looking good here so far
<agentzh>
it's rare that fche is still around. haha
derek0883 has quit [Remote host closed the connection]
<kerneltoast>
fche, hmm i'm seeing this, i think it's an environment issue: