#systemtap on 2020-10-19 — irc logs at freenode.irclog.whitequark.org

2015-11-12 23:18 fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged

00:19 khaled has quit [Quit: Konversation terminated!]

01:22 hpt has joined #systemtap

02:06 derek0883 has joined #systemtap

02:22 derek0883 has quit [Remote host closed the connection]

02:22 derek0883 has joined #systemtap

02:40 sscox has quit [Ping timeout: 265 seconds]

04:03 derek0883 has quit [Remote host closed the connection]

04:09 derek0883 has joined #systemtap

04:32 derek0883 has quit [Remote host closed the connection]

04:38 derek0883 has joined #systemtap

05:57 derek0883 has quit [Remote host closed the connection]

06:35 derek0883 has joined #systemtap

06:35 derek0883 has quit [Remote host closed the connection]

06:43 hpt has quit [Quit: Lost terminal]

06:43 hpt has joined #systemtap

07:02 khaled has joined #systemtap

07:16 orivej has quit [Ping timeout: 265 seconds]

07:18 mjw has joined #systemtap

07:19 orivej has joined #systemtap

07:35 orivej has quit [Ping timeout: 265 seconds]

08:21 lijunlong has quit [Ping timeout: 246 seconds]

08:23 lijunlong has joined #systemtap

09:39 hpt has quit [Ping timeout: 272 seconds]

10:05 khaled has quit [Quit: Konversation terminated!]

10:06 khaled has joined #systemtap

10:22 orivej has joined #systemtap

10:55 khaled_ has joined #systemtap

10:56 khaled has quit [Ping timeout: 258 seconds]

11:11 orivej has quit [Ping timeout: 260 seconds]

12:03 khaled_ has quit [Quit: Konversation terminated!]

12:05 khaled has joined #systemtap

13:54 amerey has joined #systemtap

14:34 orivej has joined #systemtap

16:10 derek0883 has joined #systemtap

16:26 derek0883 has quit [Remote host closed the connection]

16:26 derek0883 has joined #systemtap

16:27 derek0883 has quit [Remote host closed the connection]

16:28 derek0883 has joined #systemtap

16:46 tromey has joined #systemtap

17:09 derek0883 has quit [Remote host closed the connection]

17:15 derek0883 has joined #systemtap

18:30 zamba has quit [Ping timeout: 256 seconds]

18:35 zamba has joined #systemtap

19:55 irker694 has joined #systemtap

19:55 <irker694> systemtap: smakarov systemtap.git:master * release-4.3-85-g2f7e3794a / testsuite/systemtap.onthefly/kprobes_onthefly.exp: PR26755 kprobes_onthefly.exp: skip lock_* tracepoints pending investigation

20:15 derek0883 has quit [Remote host closed the connection]

20:16 derek0883 has joined #systemtap

20:21 derek0883 has quit [Ping timeout: 244 seconds]

20:37 derek0883 has joined #systemtap

20:37 tromey has quit [Quit: ERC (IRC client for Emacs 27.1.50)]

21:09 derek0883 has quit [Ping timeout: 264 seconds]

21:11 derek0883 has joined #systemtap

21:30 <kerneltoast> fche, hi!

21:31 <kerneltoast> i've got a fun patch for you to review

21:31 <fche> uh oh

21:31 <fche> I must go

21:31 <fche> no

21:31 <fche> run

21:31 <fche> RUN

21:33 <kerneltoast> nooooo come back

21:33 <kerneltoast> https://gist.github.com/kerneltoast/8a9518f7f17b8fc1613be69a5d0719ff

21:33 <fche> nope, too afraid to look

21:33 <kerneltoast> comeeee backkkkkkk

21:33 <kerneltoast> i'll give you a cookie

21:33 <fche> SOLD

21:33 <kerneltoast> it might be a tracking cookie

21:33 <kerneltoast> or a peanut butter cookie

21:34 <kerneltoast> the commit message is intentionally sparse at the moment (read: nonexistent)

21:35 <kerneltoast> the patch passes the test suite as it is but i'm going to run the test suite with a lockdep kernel too to be extra certain

21:35 derek0883 has quit [Remote host closed the connection]

21:35 derek0883 has joined #systemtap

21:37 <fche> surprised that all the _rcu work is doable from all the contexts this code is invoked from, but maybe it's only a few

21:38 <kerneltoast> what context were you expecting this to not work in?

21:38 <fche> from within e.g. funky tracepoints or interrupts

21:39 <kerneltoast> just because we can't sleep?

21:39 <fche> yeah, probably beyond that

21:39 <fche> I think many of the kernel tracepoints are "rated" for a fairly minimal type of workload being done from within their callbacks

21:40 <fche> certainly not the full generality of stap probe handling & accessories

21:40 <fche> (serhei is chasing down one such interaction with the on-the-fly arming/disarming logic just now)

21:41 <kerneltoast> hmm, well the intention of RCU was to make the work as minimal as possible in these paths

21:41 <fche> yeah

21:41 <kerneltoast> it's quite lean. no RCU barriers or synchronizations needed

21:42 <fche> btw, would not worry about one aspect mentioned in the patch: an rcu_barrier in the case of unloading the stap module. at least we've been always assuming this is a rare / possibly-heavy event

21:43 <kerneltoast> i suspected it would be exceptionally rare, but i sleep better at night knowing it's covered

21:43 <fche> sleeping is important

21:43 <fche> at night, even more important

21:43 <kerneltoast> but not in interrupt context

21:43 <kerneltoast> that's why i only use doctor-recommended RCU to help me sleep

21:44 <fche> take two RCUs and good night

21:44 <fche> three is perfect

21:44 <fche> FOUR IS RIGHT OUT

21:44 <kerneltoast> 9 out of 10 dentists concur that three RCUs a day is the perfect balance

21:44 <kerneltoast> the tenth dentist has insomnia

21:47 <fche> and the eleventh is crazy

21:47 <fche> okay, please do run the testsuite

21:47 <fche> and see if you can do it on a machine with some good heavy background process churn

21:47 <kerneltoast> testsuite with lockdep, you mean?

21:47 <kerneltoast> i've already run the testsuite normally a few times and it's been knock on wood

21:47 <kerneltoast> but that wood may be balsa

22:04 amerey has quit [Remote host closed the connection]

22:06 <agentzh> kerneltoast: i think fche means the full systemtap test suite, not just our lean test suite.

22:06 <kerneltoast> ahh

22:06 <agentzh> the former is horrible.

22:06 <agentzh> can take many hours to run...

22:06 <fche> in a good way

22:06 <kerneltoast> oh boy

22:06 <fche> in a very good way

22:06 <fche> hey if you guys force me to think about this old code

22:06 <kerneltoast> i guess if it passes the full test suite it's golden?

22:06 <agentzh> sometimes leading to softlockup according to my experience.

22:06 <fche> I'll force you back to run the dejagnu test suite :)

22:07 <agentzh> and also many "expected" test failures.

22:07 <fche> if the test results are "reasonable". there is no "pass the full testsuite" in the sense of 0 FAILs.

22:07 <agentzh> but not marked as expected...depending on the kernel you are using.

22:07 <fche> and on compiler and on .... too many parameters :(

22:07 <agentzh> yep, i used to do diffs of the failures.

22:07 <agentzh> before and after my own changes.

22:08 <kerneltoast> hey this test suite sounds like what we used to test kernels at canonical

22:08 <kerneltoast> we had an entire system of categorizing the known failures

22:08 <fche> before very very long, we hope to have an online system to report testsuite results to, and let it report regressions etc. back to you

22:08 <kerneltoast> and it was all done manually

22:08 <kerneltoast> good times..

22:08 <fche> I'm sure serhei would be interested to here about that categorization gadget

22:09 <kerneltoast> it's called a human

22:09 <agentzh> i believe the hash table is definitely read from interrupt contexts like uprobes and timer probes.

22:09 <agentzh> is that okay for RCU locks?

22:10 <agentzh> maybe it's easier than running the full stap test suite.

22:10 <kerneltoast> yes that should be fine for RCU locks

22:10 <kerneltoast> they're not really even locks

22:10 <kerneltoast> they do not block at all

22:11 <agentzh> do we need to worry about premptions here?

22:11 <agentzh> even though they don't sleep.

22:11 <kerneltoast> nope

22:11 <kerneltoast> rcu_read_lock disables preemption

22:11 <agentzh> okay

22:12 <kerneltoast> you can't sleep inside the lock

22:12 <kerneltoast> that's all

22:12 <agentzh> makes sense.

22:12 <agentzh> fche: any particular concerns here?

22:13 <agentzh> we do have our own lean version of a test suite that can be run in parallel.

22:13 <agentzh> which indeed caught a bug in kerneltoast's first version of the rcu patch.

22:13 <agentzh> will that be enough for pushing to master? ;)

22:13 <fche> please try the whole suite

22:13 <agentzh> or maybe we can push it to a branch so that we can reuse your build bot?

22:13 <fche> this is low level enough that tricky workloads could be required to trip it up

22:15 <agentzh> kerneltoast: maybe you can run it overnight.

22:15 <agentzh> see testsuite/README for details.

22:15 <agentzh> it can run in parallel to save some time.

22:15 <agentzh> a lot of time on a SMP system.

22:15 <fche> sudo make installcheck and walk away for the night :)

22:16 <agentzh> but i guess we still need to run twice to compare the diffs.

22:16 <agentzh> the diffs of the test report.

22:16 <kerneltoast> yeah that'll probably vary by the host machine...

22:16 <agentzh> of course.

22:16 <agentzh> also, you need to use the open source repo's master to test.

22:17 <agentzh> not our private branch.

22:17 <kerneltoast> yeah

22:17 <agentzh> assuming the tests would complete.

22:17 <agentzh> sometimes they don't...

22:17 <kerneltoast> oof

22:17 <agentzh> like hitting some bugs.

22:17 <fche> your private branch doesn't carry the testsuite?

22:17 <fche> it should work fine

22:18 <agentzh> like this one: https://sourceware.org/bugzilla/show_bug.cgi?id=23493

22:18 <agentzh> or maybe i was just unlucky ;)

22:18 <kerneltoast> fche, is there a preferred kernel version for testing?

22:18 <agentzh> i was using the public branch.

22:18 <agentzh> for that PR.

22:20 <fche> kerneltoast, use whatever is convenient

22:21 * kerneltoast uses linux-next

22:26 derek0883 has quit [Ping timeout: 258 seconds]

22:36 <irker694> systemtap: alizhang systemtap.git:azhang/pr13838 * release-4.3-92-gb5918b7be / runtime/softfloat.c runtime/softfloat.h tapset/floatingpoint.stp testsuite/buildok/floatingpoint.stp: basically finished string to floating point conversion

22:47 derek0883 has joined #systemtap

22:54 derek0883 has quit [Remote host closed the connection]

22:54 derek0883 has joined #systemtap

23:36 khaled has quit [Quit: Konversation terminated!]