fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
gila has joined #systemtap
_whitelogger has joined #systemtap
_whitelogger has joined #systemtap
slowfranklin has joined #systemtap
slowfranklin has quit [Client Quit]
introom has quit [Ping timeout: 260 seconds]
introom has joined #systemtap
naveen2 has joined #systemtap
naveen has quit [Ping timeout: 240 seconds]
orivej has joined #systemtap
naveen2 has quit [Quit: WeeChat 1.9.1]
naveen has joined #systemtap
naveen2 has joined #systemtap
naveen has quit [Ping timeout: 240 seconds]
slowfranklin has joined #systemtap
naveen2 has quit [Quit: WeeChat 1.9.1]
pwithnall has joined #systemtap
orivej has quit [Ping timeout: 246 seconds]
orivej has joined #systemtap
mjw has joined #systemtap
pwithnall has quit [Quit: pwithnall]
pwithnall has joined #systemtap
naveen has joined #systemtap
orivej has quit [Ping timeout: 260 seconds]
orivej has joined #systemtap
orivej has quit [Ping timeout: 245 seconds]
orivej has joined #systemtap
brolley has joined #systemtap
tromey has joined #systemtap
gregwork has joined #systemtap
<gregwork>
are there any useful tap's for troubleshooting hadoop performance ?
<fche>
hi
<fche>
I am not aware of any scripts targeted to hadoop per se, but stap has helped with various system- and sometimes jvm-level problems
<gregwork>
can you run multiple stap scripts at the same time ?
<fche>
certainly
<fche>
so what kinds of problems are you seeing, and what brings you to stap ?
<gregwork>
oh im involved with a task force here to investigate hadoop perf issues in prod. We have got a static workload we can run over and over on the data science admins looking at the app, I was going to look at the view from the OS perspective. Observe where the kernel thinks the app is spending its time, what syscalls/io etc
<gregwork>
im hoping to compare notes as we run through the workload
slowfranklin has left #systemtap [#systemtap]
<fche>
aha. yeah, syscalls analysis is a pretty basic use of the tool. also good if one needs to go deeper - device drivers, file systems, etc. internals
<gregwork>
i know that strace can be run with -c
<gregwork>
to count the syscalls used by a process
<gregwork>
is there a stap/deeper equiv
<fche>
sure, stap works easily systemwide rather than per-process, and uses a much smaller interference interface to gather the syscall stuff
<fche>
t
<fche>
there is a strace workalike script in there that you could try just to get the feel
<gregwork>
which one is that
<fche>
(but note, your kernel version needs to be <= 4.16 for now)