#systemtap on 2021-01-12 — irc logs at freenode.irclog.whitequark.org

2015-11-12 23:18 fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged

01:19 hpt has joined #systemtap

01:25 derek0883 has quit [Remote host closed the connection]

01:57 khaled_ has quit [Quit: Konversation terminated!]

01:58 derek0883 has joined #systemtap

02:54 derek0883 has quit [Remote host closed the connection]

02:55 derek0883 has joined #systemtap

04:39 derek0883 has quit [Remote host closed the connection]

05:02 derek0883 has joined #systemtap

05:22 derek0883 has quit [Ping timeout: 246 seconds]

05:29 derek0883 has joined #systemtap

05:57 beauty2 has quit [Ping timeout: 246 seconds]

06:14 fdalleau_away has quit [Quit: Coyote finally caught me]

06:26 orivej has joined #systemtap

06:40 derek0883 has quit [Remote host closed the connection]

06:49 derek0883 has joined #systemtap

07:37 derek0883 has quit [Remote host closed the connection]

07:43 derek0883 has joined #systemtap

08:01 khaled has joined #systemtap

08:06 derek0883 has quit [Remote host closed the connection]

08:06 orivej has quit [Ping timeout: 246 seconds]

08:32 orivej has joined #systemtap

08:46 mjw has joined #systemtap

09:18 derek0883 has joined #systemtap

09:24 derek0883 has quit [Ping timeout: 264 seconds]

09:38 hpt has quit [Ping timeout: 264 seconds]

10:09 beauty2 has joined #systemtap

11:06 ggherdov has quit [Ping timeout: 246 seconds]

11:06 ggherdov has joined #systemtap

11:18 orivej has quit [Ping timeout: 260 seconds]

13:25 orivej has joined #systemtap

13:44 derek0883 has joined #systemtap

13:49 derek0883 has quit [Ping timeout: 272 seconds]

15:01 amerey has joined #systemtap

16:43 derek0883 has joined #systemtap

17:13 fdalleau has joined #systemtap

17:24 fdalleau has quit [Quit: Coyote finally caught me]

18:01 derek0883 has quit [Remote host closed the connection]

18:07 derek0883 has joined #systemtap

18:16 tromey has joined #systemtap

18:54 orivej has quit [Ping timeout: 240 seconds]

19:06 mjw has quit [Quit: Leaving]

19:55 <kerneltoast> fche, https://i.imgur.com/YWt4qi1.png

19:57 <fche> https://bugzilla.redhat.com/show_bug.cgi?id=1914948 << hey dude, any suspicious why this would happen with the new transport but not the release-4.4 level one?

19:58 <kerneltoast> let's trade bugs

19:58 <kerneltoast> -DSTAP_TRANS_PROCFS makes the debugfs crash go away

19:58 <fche> noice

20:10 <kerneltoast> fche, it's probably a combination of _stp_subbuf_size being reduced in 8819e2a04596deb2fe427d261bebcaf3d2620dfb and _stp_data_write_reserve() needlessly burning through subbuffers

20:10 <kerneltoast> if you make a big request to _stp_data_write_reserve(), it'll pick out a subbuffer and then burn it if the requested alloc size won't fit in that subbuffer

20:13 <fche> that ruby test.sh case is minuscule tho,it prints like 7 lines

20:13 <kerneltoast> o

20:14 <kerneltoast> which commit does systemtap-4.4-6.el8 correspond to

20:15 <fche> it's not visible externally; it's a build of stap 4.4 plus all the runtime/transport-related patches over the last few months

20:15 <fche> (plus one or two other things)

20:16 <fche> (that should be irrelevant)

20:16 * fche hasn't reproduced the problem on git stap , just wanted to bring it to your attention

20:16 <kerneltoast> ah, but you can repro it just fine on that internal build?

20:17 <fche> our qa friend can apparently

20:18 <fche> so this was just reported to me a few hours back

20:18 <fche> Fresh News

20:18 <kerneltoast> did qa friend try git stap?

20:21 <fche> don't think so, mentioned it to him just now

20:22 <fche> hmmm

20:22 <fche> never ind about "small output"

20:22 <fche> it's 50000 lines in a few seconds

20:22 <fche> so yeah subbuf exhaustion etc. a possibility

20:23 <kerneltoast> try this: https://paste.centos.org/view/8da5796d

20:23 <kerneltoast> actually i can make that conserve subbuffers even better

20:24 <kerneltoast> oh nvm i can't

20:24 <kerneltoast> or can it

20:24 <kerneltoast> *can i

20:25 <kerneltoast> fche, check this out: https://paste.centos.org/view/7e738827

20:25 <kerneltoast> might need tuning to make sure print buffers are flushed out when the module is unloaded

20:26 <fche> not sure we can just decrease the size_request like that - wouldnt' that cause cross-subbuf spans?

20:26 <kerneltoast> we already decrease the size request like that

20:26 <kerneltoast> size_request = __stp_relay_switch_subbuf(buf, size_request);

20:27 derek0883 has quit [Remote host closed the connection]

20:27 <kerneltoast> crossing subbufs is unavoidable

20:34 derek0883 has joined #systemtap

20:51 <kerneltoast> fche, so should we just roll with the procfs flag?

20:52 <fche> yeah I'm thinking we should just switch over

20:52 <fche> as the default

20:53 <kerneltoast> why are there two options anyway

20:54 <kerneltoast> someone can have debugfs disabled but good luck booting with procfs disabled

20:54 <fche> later

20:56 <fche> btw

20:56 <fche> stap -DDEBUG_TRANS for this test case shows a stream of len=78 / _stp_print_flush:30 prints

20:57 <fche> I wonder if we're wasting 8K (STP_BUFFER_SIZE) of content per each write

21:00 <fche> ok other data

21:01 <fche> with -DSTP_RELAY_TIMER_INTERVAL=1 we lose less data; with =0 we lose less yet, but still lose som

21:03 derek0883 has quit [Remote host closed the connection]

21:05 <kerneltoast> did you try my patch?

21:06 <fche> not yet, just gathering data

21:06 <kerneltoast> if you have thousands of small messages then i can see this happening

21:06 <kerneltoast> (without my patch)

21:06 <fche> yeah, that's what we're seeing.

21:18 derek0883 has joined #systemtap

21:30 orivej has joined #systemtap

22:13 derek0883 has quit [Remote host closed the connection]

22:16 derek0883 has joined #systemtap

22:25 tromey has quit [Quit: ERC (IRC client for Emacs 27.1)]