fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
gila has quit [Quit: My Mac Pro has gone to sleep. ZZZzzz…]
zodbot has quit [Quit: nirik]
zodbot has joined #systemtap
irker885 has quit [Quit: transmission timeout]
ericlee has joined #systemtap
vbernat has quit [Quit: The future belongs to those who believe in the beauty of their dreams.]
vbernat has joined #systemtap
slowfranklin has joined #systemtap
<ericlee> Hi, why is the size sometimes -11 in socket.receive?
<ericlee> what does it mean?
<ericlee> that's an error code
khaled has joined #systemtap
slowfranklin has quit [Quit: slowfranklin]
sadmac has joined #systemtap
sadmac has left #systemtap [#systemtap]
<khaled> Hi all, would someone please help me with this error? https://paste.fedoraproject.org/paste/1rHhJSY7suAUBm8TGjQfFg
<khaled> I have the same setup on another computer and it works, same kernel and stap version.
gila has joined #systemtap
zodbot has quit [Read error: Connection reset by peer]
zodbot has joined #systemtap
sscox has quit [Ping timeout: 250 seconds]
orivej has joined #systemtap
_whitelogger has joined #systemtap
slowfranklin has joined #systemtap
slowfranklin has quit [Quit: slowfranklin]
slowfranklin has joined #systemtap
<fche> ericlee, EAGAIN 11 /* Try again */
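[editor's note] The answer above reads the negative size as a negated errno value. A minimal Python sketch of that decoding follows; the helper name describe_size is hypothetical, and the mapping of 11 to EAGAIN is assumed to hold on Linux (other platforms number errno values differently).

```python
import errno
import os


def describe_size(size):
    """Return (errno_number, message) for a negative size, else None.

    Hypothetical helper: it treats a negative size as a negated errno,
    which is how the -11 in the question was interpreted.
    """
    if size >= 0:
        return None  # a real byte count, not an error
    code = -size
    return code, os.strerror(code)


# On Linux errno.EAGAIN is 11, so this describes the -11 from the question.
print(describe_size(-errno.EAGAIN))
```

In a probe handler the same check amounts to testing whether size is negative before treating it as a byte count.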
<fche> khaled, suspect one of the machines has the kernel debuginfo installed, the other doesn't
<fche> if you run on the working machine: stap --remote=$other_machine_hostname ... then it should work
<fche> (or copy over a stap -p4 .ko file and staprun on the other-machine)
<khaled> @fche, thanks. I think I found the root cause: the printf format was printf("%s %s %s\n", execname(), pid(), pathname), although it should be printf("%s %d %s\n", execname(), pid(), pathname);
<khaled> The error message is misleading, though
<fche> that's quite wrong
<fche> maybe there were two separate errors?
<khaled> Which one? the first right?
<fche> yeah
<fche> will look into that
<fche> can reproduce the bug
<fche> if you had the time to file a sourceware.org/bugzilla bug on that, it'd be great, otherwise we will at some point
<khaled> I will create a bug for it, Thanks for your review
slowfranklin has left #systemtap [#systemtap]
orivej has joined #systemtap
khaled has quit [Quit: Konversation terminated!]
khaled has joined #systemtap
khaled has quit [Client Quit]
khaled has joined #systemtap
<fche> thanks dude
<khaled> Hi Frank, Please check if this is another bug for kernel 5 https://paste.fedoraproject.org/paste/UAnhuhZ5s6Nx~OZ4IN0Xng
slowfranklin has joined #systemtap
<fche> stap -L 'vfs.{write,read,open}' <--- worth a try to see what variables are in scope
<khaled> vfs.open name:string pathname:string argstr:string
<khaled> vfs.read pathname:string dev:long devname:string ino:long name:string argstr:string
<khaled> vfs.write pathname:string dev:long devname:string ino:long name:string argstr:string
<khaled> pathname is there
<khaled> I think the vfs code diff between kernel 4 and 5 is worth checking
<khaled> I'll do that when I get time
<fche> hm, with git stap this works on kernel 5.0-rc3:
<fche> stap -p4 -e 'probe vfs.{write,read,open} { println(pathname) }'
<khaled> Same
<khaled> semantic error: while processing probe kernel.function("vfs_write@fs/read_write.c:533") from: vfs.write from: vfs.write
<khaled> semantic error: failed to retrieve location attribute for 'file' [man error::dwarf]: identifier '$file' at /usr/local/share/systemtap/tapset/linux/vfs.stp:1051:57
<khaled> dieoffset: 0x344f957 from /lib/modules/5.0.0-rc5+/build/vmlinux
<khaled> function: vfs_write at fs/read_write.c:546:3 inlined by vfs_write at fs/read_write.c:533:9
<khaled> source: pathname = fullpath_struct_file(task_current(), $file)
<khaled> ^
<khaled> Pass 2: analysis failed. [man error::pass2]
<fche> ok, different behaviour here
<khaled> obviously, yes
<fche> kernel debuginfo / inlining is always a tricky area
<fche> are you using a fedora kernel build?
<khaled> could it be that my .config lacks something ?
<fche> mainline kbuild kills some debuginfo unnecessarily; see the tail end of any stap release notes re. -fvar-tracking-assignments
<khaled> i didn't get what you mean.
<fche> see the line 80% down about kernel commit #2062afb4f804a
slowfranklin has quit [Quit: slowfranklin]
<khaled> Got your point
<khaled> I will try re-compiling my kernel with KBUILD_CFLAGS += $(call cc-option, -fno-var-tracking-assignments)
<fche> er - without that line
<khaled> Sorry, without
<fche> we do want var-tracking-assignments
<fche> yeah. no guarantee that this is enough to fix it, but it should help
<fche> and not hurt anything
<fche> geez linus :)
<khaled> :D
<khaled> I'll keep you updated :)
<fche> righto
slowfranklin has joined #systemtap
orivej_ has joined #systemtap
orivej has quit [Ping timeout: 250 seconds]
slowfranklin has quit [Quit: slowfranklin]
<khaled> It worked after removing -fno-var-tracking-assignments, Thanks Frank for demystifying this behavior :-)
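[editor's note] The fix above came down to whether the kernel build passes -fno-var-tracking-assignments. A rough Python sketch for spotting that flag in Makefile text follows; the function name find_flag_lines is hypothetical, and the chat does not say which Makefile holds the line, so the caller is assumed to supply the text.

```python
FLAG = "-fno-var-tracking-assignments"


def find_flag_lines(makefile_text, flag=FLAG):
    """Return (line_number, line) pairs for uncommented lines mentioning flag."""
    hits = []
    for number, line in enumerate(makefile_text.splitlines(), start=1):
        code = line.split("#", 1)[0]  # ignore anything after a Makefile comment marker
        if flag in code:
            hits.append((number, line.strip()))
    return hits


# Sample input built from the line quoted in the conversation.
sample = (
    "KBUILD_CFLAGS += -O2\n"
    "KBUILD_CFLAGS += $(call cc-option, -fno-var-tracking-assignments)\n"
    "# -fno-var-tracking-assignments\n"
)
for number, line in find_flag_lines(sample):
    print(number, line)
```

An empty result for the real Makefile would suggest the flag is no longer being added there, though it does not rule out the flag arriving from elsewhere in the build.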
khaled has quit [Quit: Konversation terminated!]
khaled has joined #systemtap
<fche> ah very good
<fche> what a bummer that this problem still exists
khaled has quit [Quit: Konversation terminated!]