fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
scox has joined #systemtap
hpt has joined #systemtap
tromey has quit [Quit: ERC (IRC client for Emacs 26.0.50)]
_whitelogger has joined #systemtap
_whitelogger has joined #systemtap
hkshaw has joined #systemtap
Humble has joined #systemtap
sanoj has joined #systemtap
nkambo has joined #systemtap
groleo has joined #systemtap
scox has quit [Ping timeout: 268 seconds]
orivej has joined #systemtap
scox has joined #systemtap
scox has quit [Ping timeout: 240 seconds]
scox has joined #systemtap
dtatulea has joined #systemtap
dtatulea_ has quit [Ping timeout: 264 seconds]
sfix has quit [Ping timeout: 264 seconds]
sfix has joined #systemtap
invano has joined #systemtap
<invano>
morning everyone. I got a staprun related question for you guys. I'm using a stap script to monitor specific processes. I run staprun on a VM through ssh and then, again by using ssh, I run the process I want to monitor. The problem is that sometimes I cannot log anything at all. I found out (thanks -vv) that the stap module is ready after printing "stapio:stp_main_loop:... systemtap_module_init() returned 0". However this is not enough. 20% of the times
<invano>
staprun still misses the execution of the new process. I'm wondering if there is something I'm missing...and hope I've been clear enough
mjw has joined #systemtap
hpt has quit [Quit: Lost terminal]
pwithnall__ has joined #systemtap
prefork has quit [Remote host closed the connection]
prefork has joined #systemtap
pwithnall__ has quit [Quit: pwithnall__]
scox has quit [Ping timeout: 260 seconds]
pwithnall____ has joined #systemtap
__positron has quit [Ping timeout: 255 seconds]
<fche>
invano, hi
<fche>
doesn't sound familiar to stap versions in the last year or two; before then we had an occasional bug where not all target processes would be found
<fche>
did stap report anything about missed probes in the 20% cases, at shutdown?
<fche>
(by the way, stap --remote=HOST can help cross-compile, send, and execute a script via ssh
__positron has joined #systemtap
wcohen has quit [Ping timeout: 260 seconds]
scox has joined #systemtap
mbenitez has joined #systemtap
mbenitez has joined #systemtap
mbenitez has quit [Changing host]
<invano>
fche: hi. no, no missed probes in that case. It's like systemtap doesn't see at all the process.
<fche>
sounds odd. it shouldn't matter, but how soon after the stap script startup do you start the target process?
<invano>
As I told you, if I run the process after "stapio:stp_main_loop:... systemtap_module_init() returned 0" it gets better and better
<invano>
I tried to wait 1, 5, or even 30 secs. What's really important is to wait for that string logged by staprun and then start the process
<invano>
For reference, I'm using a modified version of strace.stp in which I still have the forked set concept and I filter also per uid of the process.
<fche>
ok. how old is this version of stap? (stap -V )
<invano>
ah, of course when I can't get a trace, if I restart the stap over and over I'll finally get what I need
<invano>
I'm using stap 2.9
<invano>
If I remember correctly I had the same issue even with latest stap 3.1 but I can give it a new try
nkambo has quit [Ping timeout: 240 seconds]
<fche>
ok, a few more things one can try to gather info for a bug report
<fche>
stap -DDEBUG_PROBES
<fche>
-DDEBUG_TASK_FINDER_VMA
<fche>
-DDEBUG_UPROBES
<fche>
-DDEBUG_TASK_FINDER
<fche>
stap-report's output also helpful
drsmith_away is now known as drsmith
tromey has joined #systemtap
sanoj has quit [Quit: Leaving]
sona has joined #systemtap
<sona>
hi
<sona>
i want to probe a userspace function taken at run time from user
<sona>
how to do this
brolley has joined #systemtap
<fche>
no easy way - resolution of probe locations requires symbol/debuginfo processing, which is not really available at stap run time
<fche>
you can pass a function name as a command line argument and sub it into a stap script with $@ etc.
<fche>
or you could insert probes everywhere, disabled with an on-the-fly global variable, and then selectively activate it in response to a procfs write
<fche>
that might be a reasonable compromise method
<sona>
fche, oky
wcohen has joined #systemtap
<fche>
I can try talking you through that last one, it's an advanced (but fun) technique
<sona>
sona, which one is more efficient among the two
<sona>
fhce,
<sona>
fche,
<fche>
the former case requires one to rerun stap with every new function
<fche>
the latter case requires one to probably create the stap script with another script, and may involve runtime overheads for the inactivated probes
<fche>
what -action- (probe handler) did you have in mind for all these?
<sona>
fche, so probe handler just takes traces of functions who triggerd it
<sona>
fche, trying to go by approach 1 as it will put probes on the required function, thus avoiding resouce overhead as in later case
<fche>
so approach 1 is the trivial % stap -e 'probe process.function(@1) { ... } ' foo_bar
wcohen has quit [Remote host closed the connection]
<sona>
fche, foo_bar need to be applied after running the command?
<sona>
i meant if running stap -e 'probe process.function(@1)' {printf("data")}' give parse error
<fche>
yes.
<fche>
so approach 1 does not work at script run time.
<fche>
approach 2 cheats by computing all possible probe points at script processing time, but disabling them on-the-fly at run-time, until commanded
<sona>
fche, oky
Humble has quit [Read error: Connection reset by peer]
efiop has quit [Ping timeout: 260 seconds]
eichiro has quit [Ping timeout: 255 seconds]
fche has quit [Ping timeout: 264 seconds]
jlebon has quit [Ping timeout: 255 seconds]
eichiro has joined #systemtap
wcohen has joined #systemtap
efiop has joined #systemtap
fche has joined #systemtap
Humble has joined #systemtap
jlebon has joined #systemtap
groleo has quit [Ping timeout: 255 seconds]
sona has quit [Ping timeout: 245 seconds]
scox has quit [Quit: scox]
scox has joined #systemtap
hkshaw has quit [Ping timeout: 260 seconds]
irker394 has joined #systemtap
<irker394>
systemtap: dsmith systemtap.git:refs/heads/master * release-3.1-61-gbccff21 / configure configure.ac httpd/Makefile.am httpd/Makefile.in httpd/main.cxx httpd/server.cxx httpd/server.h: Make changes for RHEL6 in the http web service server code. http://tinyurl.com/lumn4ed
mbenitez has quit [Quit: To office]
__positron has quit [Ping timeout: 240 seconds]
mbenitez has joined #systemtap
mbenitez has joined #systemtap
mbenitez has quit [Changing host]
zodbot has quit [Ping timeout: 240 seconds]
__positron has joined #systemtap
zodbot has joined #systemtap
sona has joined #systemtap
sona has quit [Ping timeout: 255 seconds]
drsmith is now known as drsmith_away
drsmith_away is now known as drsmith
pwithnall__ has joined #systemtap
pwithnall__ has quit [Ping timeout: 268 seconds]
mjw has quit [Quit: Leaving]
pfallenop has quit [Ping timeout: 258 seconds]
pfallenop has joined #systemtap
csanting has quit [Remote host closed the connection]
wcohen has quit [Ping timeout: 259 seconds]
tromey has quit [Quit: ERC (IRC client for Emacs 26.0.50)]