#systemtap on 2018-08-07 — irc logs at freenode.irclog.whitequark.org

2015-11-12 23:18 fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged

00:00 <agentzh> it does not have to be in mainstream kernel.

00:00 <agentzh> it could just be an out-of-tree kernel module.

00:00 <agentzh> right now stap always inserts a new kernel module and uploads it, which is a bit wasteful.

00:01 <agentzh> the kernel module could be a vm which keeps running in the kernel space.

00:02 <agentzh> though an ad-hoc kernel module exposes more optimization opportunities for specific stap scripts.

00:03 <agentzh> we are also thinking about rolling out one since we're also VERY unhappy about the ebpf's kernel side limitations, which is inhuman after years of stap usage.

00:03 <agentzh> stap is so powerful and innovative in many ways, even as compared to dtrace.

00:04 <agentzh> bcc is still like stone age things.

00:13 <fche> we'd really appreciate an outsider like yourself writing up such observations

00:13 <fche> it'd help prioritize our work and maybe be leverage w.r.t. the kernel folks

00:17 <agentzh> fche: sure, will do.

00:17 <agentzh> i already have a talk in nginx conf 2018 which will talk about stap, bcc, gdb and etc.

00:17 <agentzh> it will be in Oct.

00:18 <agentzh> *have proposed a talk

00:18 <agentzh> will definitely write more blog posts too.

00:18 <fche> looking forward to them

00:19 <agentzh> mozilla rr is also on our radar.

00:19 <agentzh> thanks

01:44 pokk11 has joined #systemtap

01:44 <pokk11> This blog is essentially an ad for the Handshake ICO scam with a one-line "denial" of involvement mixed in there. It's obviously very unethical of Christel to not mention her own involvement in the scam which the blog post promotes.

01:44 <pokk11> Christel just posted this "denial" on the freenode blog https://freenode.net/news/spam-shake

01:44 <pokk11> Consider Andrew Lee's involvement, Andrew Lee is Christel's boss at London Trust Media and he also controls the majority of freenode voting rights. Andrew Lee also heads the handshake ICO scam. Coincidence?

01:44 <pokk11> Oh, and about those donations she speaks of: https://twitter.com/ISCdotORG/status/1025461692132519936

01:44 <pokk11> Don't support freenode and their ICO scam, switch to a network that hasn't been co-opted by corporate interests. OFTC or efnet might be a good choice. Perhaps even https://matrix.org/

01:50 GorillaWarfare28 has joined #systemtap

01:50 <GorillaWarfare28> Christel just posted this "denial" on the freenode blog https://freenode.net/news/spam-shake

01:50 pokk11 has quit [Remote host closed the connection]

01:50 <GorillaWarfare28> This blog is essentially an ad for the Handshake ICO scam with a one-line "denial" of involvement mixed in there. It's obviously very unethical of Christel to not mention her own involvement in the scam which the blog post promotes.

01:50 <GorillaWarfare28> Consider Andrew Lee's involvement, Andrew Lee is Christel's boss at London Trust Media and he also controls the majority of freenode voting rights. Andrew Lee also heads the handshake ICO scam. Coincidence?

01:50 <GorillaWarfare28> Oh, and about those donations she speaks of: https://twitter.com/ISCdotORG/status/1025461692132519936

01:50 <GorillaWarfare28> Don't support freenode and their ICO scam, switch to a network that hasn't been co-opted by corporate interests. OFTC or efnet might be a good choice. Perhaps even https://matrix.org/

01:51 GorillaWarfare28 has quit [Killed (Sigyn (Spam is off topic on freenode.))]

01:52 <agentzh> fche: seems like my stap test suite run never ends on my federa 26 (kernel 4.16.11). it locks up the CPU infinitely: "kernel:watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [stapio:37300]"

01:52 <agentzh> now i cannot ssh to that vm. and all the CPU core is 100%.

01:53 <agentzh> any hints on debugging this would be very appreciated.

01:53 <fche> dmesg should at least identify the last script

01:53 <agentzh> so i should kill that vm now?

01:53 <fche> or check its console out

01:54 <fche> suggest netconsole or console-to-serial-tty-to-host-file for development vm's

01:56 <agentzh> oh, yeah, i should have configured that.

01:56 <agentzh> thanks for the tip.

01:57 <fche> 4.6.11 is old ... it might be afflicted by a series of kernel bugs introduced post-spectre

01:57 <fche> that nuke kprobes and some related facilities

01:58 <fche> 4.16.13 is better

02:12 <agentzh> fche: these are all the /var/log/messages output during the test run: https://pastebin.com/cUEfpTgc

02:37 <agentzh> fche: oh, good to know. sadly fed 26 is already O\

02:37 <agentzh> *EOF

02:38 <agentzh> i'll try finding kernel debug package for older federa 27 kernels then.

02:38 <agentzh> *debuginfo

02:54 wmealing has joined #systemtap

02:54 * wmealing waves

02:59 <agentzh> okay, i've finally found and installed the kernel 4.16.6 packages for fedora 27 on Koji.

03:00 <wmealing> noice.

03:01 <wmealing> say i wanted to bail out of a function early, in a probe.. ie, entirely out of the function.. be gone with the evil.. is guru mode the only way ?

03:01 <agentzh> wmealing: because stap does not support the latest kernel 4.17 packages on fedora 27.

03:02 <wmealing> agentzh: doesn't support how ?

03:02 <agentzh> the new kernel 4.17 breaks stap's syscall layer and the stap team is still working on it.

03:02 <wmealing> oh damn, i didnt know.. thanks for that.

03:02 <agentzh> sure

03:02 <wmealing> is this some kind of kprobes breakage ?

03:03 <agentzh> oops, sorry, i mean installed kernel 4.16.16 package, not 4.16.6. the latter is too old.

03:03 * wmealing looks at his kernel version.

03:04 <wmealing> 3.10.0-862.11.3.el7.x86_64

03:04 <agentzh> hah, still RHEL/CentOS 7 :)

03:04 <agentzh> pretty safe then.

03:04 <wmealing> aint never gunna give me up

03:05 <agentzh> wmealing: regarding to your question, i'm about to proposing a patch for that.

03:06 <agentzh> my patch originally changed the exit() tapset function, but fche doesn't like changing the current behavior. so maybe abort() or quit() then.

03:06 <wmealing> agentzh: is it -in- systemtap or something that i can abuse easily ?

03:06 <agentzh> it will be a stap core patch.

03:06 * wmealing nods

03:06 <agentzh> i can paste my current patch now if you'd like to give it a try.

03:07 <wmealing> sadly, i'm problem solving as we speak under a deadline.

03:07 <wmealing> i dont want to go to kprobes if i can help it.

03:08 <wmealing> i see a lot of the example scripts are about "monitoring" where as i need to change behavior.

03:10 <wmealing> https://i.imgflip.com/2fezw2.jpg

03:15 orivej has quit [Ping timeout: 260 seconds]

03:19 <wmealing> agentzh: can you execute guru code inside a probe ?

03:20 <agentzh> wmealing: you can use error("") + try/catch to do that.

03:23 * wmealing will try, ive not used either of those

03:37 <wmealing> well, that didnt work how i thought it would.

03:39 <wmealing> non guru mode crashed.

03:40 <wmealing> wait, it abused $return, its guru mode.

03:41 <wmealing> maybe nto crashed, but its not responding. thats cute

04:02 <wmealing> ok,i know what it was

04:03 <wmealing> returning the positive error value

04:03 <wmealing> not the correct negative error value

04:03 <wmealing> ie, not -EACCESS

04:47 <wmealing> this is a bit frustrating.. i think stap is eliding an expression which it thinks is side affect free, but it needs it later on.

04:47 <wmealing> and complaisn it can't find local

04:47 * wmealing thinks

05:05 <wmealing> forgot $ is source local

06:16 orivej has joined #systemtap

06:26 orivej has quit [Ping timeout: 256 seconds]

07:19 <invano> fche: sorry but after leaving I didn't have time to join again yesterday

07:20 <invano> oh wow I was sure to have CONFIG_DEBUG_INFO but I didn't see the _REDUCED. Let me recompile but I assume the problem is gone

07:21 <invano> I need time, a lot, to remember all those compilation flags..still far to be a gcc/dwarf expert :)

07:46 wmealing has quit [Remote host closed the connection]

08:32 <invano> yes, thanks fche

09:31 orivej has joined #systemtap

09:37 mjw has joined #systemtap

10:29 <invano> it's always me with my problems. I was using non-dwarf kprobes before. 1y ago and more I was without debuginfo in kernel. I do have dbginfo now, so I tried to switch to dw_syscalls.*. Also, translation of the stap script to C is much faster w/ dwarf.

10:30 <invano> With dw_syscall.* I'm encountering this warning repeated quite a lot: "WARNING: instance of overloaded function will never be reached: identifier '_stp_syscall_nr' at /usr/local/share/systemtap/tapset/linux/aux_syscalls.stp:119:10 source: function _stp_syscall_nr:long ()"

10:43 <invano> never mind..it was me mixing up tapsets from systemtap3.3 and systemtap git -.-

10:54 <fche> hey no problem, just am glad we're figuring this out

11:33 <invano> yes yes many thanks

11:35 <invano> I'm seeing the same @cast issue on MIPS now, even if I have CONFIG_DEBUG_INFO_REDUCED not set. Just some time to finish some stuff with arm and then I switch to the mips and I investigate that

11:39 <invano> btw it's more than one year now that I have some patches to fix(&add) MIPS o32 support for uprobes. I should share them

11:40 orivej has quit [Ping timeout: 240 seconds]

11:48 <fche> sure

11:48 <fche> with the arm trace, the eu-readelf -w of the typequery*ko file was what provided the hint I needed

11:48 <fche> the compiler flags at the top

11:51 <invano> Yes took note, including the explanation you wrote yesterday

11:53 <invano> Anyway, are there any known big issues at the moment with systemtap git version?

11:53 <fche> syscall probe aliases on kernel 4.17+ generally broken

11:54 <invano> ah ok 4.17

11:54 <invano> I did a git pull && install for fun and it breaks on compilation

11:54 <fche> shouldn't do that

11:55 <invano> warnings on various @define in syscalls + compilation error on a type cast with CONTEXT->sregs

11:55 <invano> compilation of the stap script sorry

11:57 <fche> would be curious about those errors, but that part is a work-in-progress

11:58 <invano> I can send you a log, or we just wait for the wip to finish and then check again

11:59 <fche> fpaste-ing it would be fine

11:59 <invano> sure

12:11 <invano> ok https://pastebin.com/rapwUNTe - I used double -v combo otherwise it gets too big for pastebin. I can put more v tho if you need it. same command I pasted yesterday for cross-compiling with arm. You trigger it with -e "probe syscall.* {..}"

12:11 <invano> warnings start at line 2061. int-to-pointer-cast error on line 2560

12:50 pviktori has quit [Quit: No Ping reply in 180 seconds.]

12:50 <fche> looks like a possible mixture of work-in-progress-ness and maybe you didn't run a make install lately?

12:51 tonyj has quit [Ping timeout: 244 seconds]

12:55 <invano> I ran make uninstall on 3.3rel and make install on git

13:05 <fche> so looking at that last pastebin, I wouldn't worry too much about the warnings

13:05 <fche> but the cast error is more interesting

13:08 <invano> yup it's what blocks everything

13:11 jhg_ has quit [Read error: Connection reset by peer]

13:16 jhg_ has joined #systemtap

13:22 <fche> yeah, ok, so it's probably a simple enough explanation

13:22 <fche> we're getting 64-bit ints via the context / function parameters; we're casting down through a void* into a struct pt_regs*

13:22 <fche> and on arm that's 32 measly bits die

13:22 <fche> wide

13:23 <fche> for some reason we're not seeing that diagnostic on other platforms ... not sure why

13:23 <fche> but will add a more paranoid cast sequence

13:25 <fche> ok try git master now

13:25 <invano> let me see

13:29 <invano> great

13:29 <fche> SHIP IT

13:29 <invano> warnings still there but compilation works fine

13:29 <fche> ok

13:35 <invano> oh wow you got a patch for mips o32 in January this year cool! Exactly same stuff I did with my patches in the past. nice!

13:36 <fche> I believe many of those patches came from the community years ago; we just finally pushed them in

13:37 <invano> I only see a mismatch in tapset/mips/registers.stp

13:41 <fche> pls send your patch if you see a problem

13:41 <invano> _reg_offsets["zero"] for CONFIG_32 is 24. If I'm not wrong, that should be because struct pt_regs had a unsigned long pad0[6] before. Starting from kernel 3.16.3 pad0[6] became pad0[8] so all _reg_offsets for CONFIG_32BIT should be realigned accordingly with a 8B shift

13:41 <invano> sure!

13:41 <fche> I don't think we have any mips hardware here to test on.

14:14 orivej has joined #systemtap

14:17 brolley has joined #systemtap

15:06 aryehw has joined #systemtap

15:14 tromey has joined #systemtap

15:42 <invano> yes I can test it

15:43 <invano> In the meanwhile, on mips, after a probe syscall.*: semantic error: No cfa_ops supplied, but needed by DW_OP_call_frame_cfa: identifier '$ringid' at /usr/local/share/systemtap/tapset/linux/sysc_add_key.stp:35:19

15:54 <fche> not sure whether that represents a problem in systemtap or gcc or the kernel; I remember x86 assembler in the kernel needing some custom dwarf-y assembly to make it complete

16:00 <invano> not sure either..I guess more the combo toolchain/gcc/kernel

16:01 <invano> I hit a similar problem on glibc e.g. extracting vars from process("libc").function("x") with DW_OP_GNU_entry_value

16:26 <fche> that one's a systemtap shortcoming (not enough dwarf5 knowledge iirc)

16:49 <invano> yeah I was reading it on a couple of bug reports

16:52 <invano> but is there something to probe <alias>.* but a specific one? like probe syscall.* except syscallX or probe process(Y).function("*") except function Y. For syscall I found a set declaration for doing syscall.{x,y,z}. However, I'm not finding any kind of not operator

17:02 orivej has quit [Ping timeout: 240 seconds]

17:27 <agentzh> fche: i'm still getting CPU stuck when running the 3.3 release's test suite on fedora 27's kernel 4.16.16: https://sourceware.org/bugzilla/show_bug.cgi?id=23493

17:27 <agentzh> it would be great if you can have a look.

17:28 <agentzh> the release 3.3's test suite has far fewer failures than the master branch on my side, though it also runs into CPU stuck after 1 hour's run.

17:29 <fche> agentzh, thanks for filing

17:29 <fche> stap-report would be useful

17:29 <fche> as would the dmesg|tail

17:31 <agentzh> fche: the box has rebooted and i'm afraid dmesg is not very helpful? i pasted the /var/log/messages output to the PR ticket already.

17:32 <fche> ah, the errrors just before the stuckage would also be good

17:33 <fche> but yow, memory allocation failed, well that's abnormal; we -might- not handle it quite right, though we have tests for it

17:34 <fche> a bunch of sigsegvs afterwards, yow

17:40 <agentzh> fche: already included all the lines a few minutes right before the stuck in the ticket :)

17:41 <agentzh> earlier messages are not errors and are not relevant.

17:41 <fche> ok, so abnormality started at the mempool_init failure? ok

17:42 <agentzh> fche: just added an attachment to the PR for the stap-report output.

17:43 <fche> how much ram on the box, btw? I don't think stap-report collects that but should

17:43 <agentzh> fche: Mem: 8143832 176968 7371364 12212 595500 7641028

17:43 <agentzh> 8G RAM

17:44 <fche> ok, no excuse there. I mean stap in some contexts insists on using completely free memory for allocation, and that could fail, but still obviously that should not be likely, nor bring the machine down.

17:44 <agentzh> that's all ram i have in the physical box (which is a mid-2015 MBP)

17:45 <fche> ah, was the crash in a smaller vm?

17:45 <agentzh> all the results are from the VM itself.

17:46 <fche> hm, so the vm memory limit is the same as the host physical memory? interesting

17:47 <agentzh> fche: oops, sorry, my bad, the host machine has 16G of ram.

17:47 <agentzh> the VM has 8G.

17:47 <agentzh> i was running make -j8.

17:47 <agentzh> so it might be even more hungery about memory.

17:48 <fche> ok

17:48 <agentzh> so to work around it, i should try enlarging the vm's memory assignment and using smaller job count?

17:48 <agentzh> but ideally we could get it fixed :)

17:49 <agentzh> quick and normal test failures would be much better than CPU stuck.

17:49 <fche> of course

17:50 <agentzh> sometimes i was hoping that we could use mozilla rr to record such failures and replay back and forth as we wish :)

17:51 <agentzh> fche: please let me know if you need any more info. appreciate your help!

17:52 <fche> yeah, not sure how fast progress is likely there; just would start reading alloc code in the runtime

17:52 <agentzh> gotcha.

17:53 <agentzh> fortunately it seems to be quite easy to reproduce on my side, on 2 different physical boxes.

17:54 <agentzh> one was using kernel 4.16.11 against stap master on a federa 26 vm atop Intel NUC, the other was using kernel 4.16.16 against stap 3.3 release tag on a fedora 27 vm atop MBP.

17:54 <agentzh> just let the test suite run like 1 hour or so and boom, all CPU cores are stuck.

18:01 <fche> something's odd there

18:01 <fche> we have a bunch of bots routinely building master & running the full testsuite

18:01 tromey has quit [Read error: Connection reset by peer]

18:02 <agentzh> the master branch has WAY more test failures than the 3.3 release tag on my side.

18:03 <agentzh> so i have to give up fighting the master.

19:49 tromey has joined #systemtap

20:25 orivej has joined #systemtap

20:41 <agentzh> fche: do you think this patch is okay to push? https://sourceware.org/ml/systemtap/2018-q3/msg00053.html

20:42 <agentzh> you used to give me the commit bit though i haven't exercised for several years already :P

20:42 <agentzh> this patch just reorders the CFLAGS and CXXFLAGS order.

21:13 tromey has quit [Quit: ERC (IRC client for Emacs 26.1.50)]

21:30 <fche> agentzh, go for it

22:13 brolley has left #systemtap [#systemtap]

22:34 fche has quit [Ping timeout: 248 seconds]

22:41 mjw has quit [Quit: Leaving]