fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
simon__ has joined #systemtap
<simon__>
fche I'm back! :-)
<simon__>
I tried out the callgraph on the big executable and it said:
<simon__>
Pass 2: analyzed script: 93757 probes, 79948 functions, 1 embed, 3 globals using 1307148virt/1277392res/7012shr/1272896data kb, in 21310usr/270sys/21594real ms.
<simon__>
Also, my laptop seemed to completely lock up too while those 36 minutes were going on... :-(
<simon__>
Does this mean that neither the macro method nor the callgraph method is going to help me instrument and trace this C/C++ project with ~ 80k functions? Because even if it did not run out of memory, who'd want to wait 30+ minutes to start the executable?
simon__ has quit [Ping timeout: 246 seconds]
simon__ has joined #systemtap
<simon__>
hi fche!
<fche>
yo
<simon__>
hello again!
<simon__>
can you see my messages and question from 16:33? Or about 1.5 hours ago?
<fche>
hm, 90000 probes is a huge number, yea
<fche>
just got back
<simon__>
kk nice :-)
<fche>
(9pm here)
<fche>
shouldn't be 30min, unless perhaps the machine was really short of ram and was paging to death
<simon__>
kk :-)
<fche>
on a larger ram box, that should be manageable, I suspect, but it's larger than usual
<fche>
thousands, ok, hundred thousand ... wow!
<simon__>
laptop has 32GB of RAM...
<fche>
this could be one of those times where agentzh's separate-compilation-of-stap-modules could be necessary
<fche>
for a probe job of that magnitude
<fche>
could you try smaller, like say one tenth the size?
<simon__>
yep, that's what I was thinking too... so I'm just making a script which manufactures a source file with n functions inside it :-)
<simon__>
does the linux kernel do things differently, because surely it also has as many probe points, or more?
<fche>
(stap can probe numerically many more places than function calls, in principle)
<fche>
(in practice, memory etc. constraints kick in)
<fche>
my poor gcc has been running for five minutes, chugging a stap script for 'probe kernel.function("*") {}' ... shouldn't be that long
<fche>
but memory consumption is okay for now
<simon__>
I created a script which generates n functions which all call each other and finally the last one returns
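A generator along those lines could be sketched in shell. This is a hypothetical reconstruction, not the actual script from the conversation; the `bar*` naming follows the function names mentioned later, and the chained-call shape (each function calls the next, the last one returns) follows the description above:

```shell
#!/bin/sh
# Generate zoo.c with N functions bar1..barN, where bar1 calls bar2,
# bar2 calls bar3, and so on; barN finally returns.
# N and the output filename are illustrative defaults.
N=${1:-100}
OUT=${2:-zoo.c}
{
  echo '#include <stdio.h>'
  # Emit the deepest function first so every callee is defined before use.
  echo "int bar$N(int i) { return i + 1; }"
  i=$((N - 1))
  while [ "$i" -ge 1 ]; do
    next=$((i + 1))
    echo "int bar$i(int i) { return bar$next(i + 1); }"
    i=$((i - 1))
  done
  echo 'int main(void) { printf("%d\n", bar1(0)); return 0; }'
} > "$OUT"
```

Running it with no arguments writes a 100-function `zoo.c` that can then be compiled with `gcc -g zoo.c -o zoo` and probed.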
<fche>
try using not para-callgraph.stp but a simpler one - that doesn't print $$vars type things (which are so inherently context-specific and result in a lot of extra code being generated, to pull out and pretty-print all those local variables)
<simon__>
however, para-callgraph.stp appears to fail if more than 63 functions deep are called... :-(
<simon__>
this is where function bar100() gets called: 1403 zoo(24173): ->bar100 i=0x63
<simon__>
but the very next line is: 1413 zoo(24173): <-bar63 return=0x64
<simon__>
return lines for bar64() through bar100() are mysteriously missing...
<simon__>
Do you know a way to increase the call depth? Or do I need to write a simpler auto-generated C program... :-)
<fche>
hm, I can't think of a reason that shouldn't work
<simon__>
I came up with a workaround script which generates n functions which are called one at a time instead...
<simon__>
possibly a bug in stap? is there a test to test recursion depth bigger than 63?
<fche>
stap doesn't experience recursion
<fche>
it would experience a sequence of calls at deeper and deeper nesting levels, but stap itself doesn't know or care
<fche>
again try a stap script that does not process the $$parms; it should drastically reduce resource requirements
<simon__>
do you have such a script handy?
<simon__>
so with 10k functions then this source file was generated: -rw-r--r-- 1 root root 69547889 Nov 21 18:32 stap_d59bb5f6406a3529e5d18985a6051fe1_10337070_src.c
<simon__>
it's taking a long time to compile...
<simon__>
looks like cc1 is using 2.5GB RAM so far...
<simon__>
so 1,000 functions took real 0m18.938s, but 10,000 functions took real 5m22.487s
<simon__>
so I'm wondering what's in that file because 69,547,889 / 10,000 is 6,954 bytes per function? Seems a bit on the steep side, or?
<fche>
if it involves grabbing a bunch of different context variables and pretty-printing, that could be about right
<fche>
if you grab the para-callgraph.stp script file, and replace $$parms with "" and $$return with "" then it won't mess with that
<simon__>
I found out where it caches the .c file... :-)
<fche>
sssh, tell no one, it's a secret
<simon__>
So I guess it's not / 10,000 but really / 20,000 because it's the enter and leave points...
<fche>
or you could run stap -k ...
<fche>
yup, things add up
<simon__>
so that's approx. 3,500 bytes per probe thingy...
<simon__>
and there's lots AND LOTS of duplicated lines, e.g.: 20,000 * #define STAP_RETURN(v) do { STAP_RETVALUE = (int64_t) (v); goto out; } while(0)
<simon__>
and that's just one of many examples...
<fche>
those are harmless tho
<simon__>
yy but the compile time starts to go through the roof...
<fche>
not because of those
<fche>
macros
<simon__>
how can I get the actual compile command line for the monster C file?
<fche>
stap --vp 02 ish .. stap -k will keep the tmp directory so you can run it for yourself later
<simon__>
zoo.c with 10k functions takes 3.1 seconds for gcc to compile... but it looks like the much bigger .c file takes over 5 minutes... admittedly it is much bigger...
<fche>
sorry stap --vp 0002 (pass 4 verbosity 2)
<simon__>
thanks!
* fche
must sign off shortly
<fche>
a comfy pillow beckons
<fche>
good luck dude and we can talk again tomorrow
<simon__>
thanks for your help! and greetings from Vancouver, Canada! :-)
<fche>
ah it's just late dinner time for you then
<fche>
say hi to the whales in the fraser
<simon__>
wow... you know your geography :-)
simon__ has quit [Ping timeout: 240 seconds]
simon__ has joined #systemtap
<simon__>
hi again fche!
tromey has joined #systemtap
<fche>
hey simon__ -morning
<simon__>
hello! yes, morning 9:17 AM for me... but 12:17 PM for you?
<simon__>
:-)
<fche>
so what's new today
<simon__>
fche, may I ask you some more questions about systemtap?
<simon__>
yesterday I discovered that for the 10,000 function example, systemtap spends over 5 minutes creating and compiling its .c file which gets compiled to a .ko kernel object file... where can I find more info about the architecture of systemtap and how the .ko file fits into the big picture?
<fche>
the docs include an introduction/architecture paper
<simon__>
"Original architecture paper (July 2005)." is this the best one?
<fche>
it's a good one to start
<fche>
the concepts are the same
<simon__>
thanks!
<simon__>
another question: Yesterday we talked briefly about including and excluding functions in a run-time call-tree trace. What about if I managed to instrument a large executable with ~ 80k functions and wanted to do something more complicated like: Have more control over the verbosity? Give some functions a higher or lower verbosity so that they are included or excluded in the trace depending upon the verbosity level? And also, let's say
<simon__>
I would like to run the executable with verbosity switched off, but switch it on when a particular function is first executed? What are ways that you would approach these types of challenges with systemtap?
<fche>
'verbosity' within a script is entirely under your control
<fche>
it's a programming language, eh?
<fche>
so you print when you want to - you can track nesting levels, function names, time of day, whatever you want
<fche>
and you decide when something should be printed
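One way to gate tracing on a trigger function, in the spirit of fche's answer, is a global flag that the probes consult. The script body below is a sketch: `./zoo`, `trigger_fn`, and the on/off scheme are assumptions for illustration, not from the conversation:

```shell
# Write a stap script that prints nothing until trigger_fn() first fires,
# then traces every function call. All names here are hypothetical.
cat > gated-trace.stp <<'EOF'
global tracing = 0

# Flip the switch the first time the trigger function executes.
probe process("./zoo").function("trigger_fn").call { tracing = 1 }

# Ordinary probes print only while the switch is on.
probe process("./zoo").function("*").call {
  if (tracing) printf("-> %s\n", probefunc())
}
EOF
echo "wrote gated-trace.stp; run with: stap gated-trace.stp -c ./zoo"
```

The same pattern extends to per-function verbosity: keep a level per function (e.g. in a global array keyed by probefunc()) and compare it against a threshold before printing.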