fche changed the topic of #systemtap to: http://sourceware.org/systemtap; email systemtap@sourceware.org if answers here not timely, conversations may be logged
<irker074>
systemtap: smakarov systemtap.git:refs/heads/stapbpf/pr22330 * release-4.0-140-gf1e1d05 / stapbpf/stapbpf.cxx: stapbpf PR22330 fix :: support for non-contiguous active cpus http://tinyurl.com/y5nud8ao
wcohen has quit [Ping timeout: 252 seconds]
agentzh has joined #systemtap
<agentzh>
hi folks, i noted that varuse_collecting_visitor::visit_embeddedcode() is painfully slow when translating large .stp scripts (taking >90% of the total CPU time in my samples). i wonder if we could just cache the result for the same "current_function"?
<agentzh>
the interned_string::find() calls in that visit_embeddedcode() method is so expensive.
sscox has quit [Ping timeout: 250 seconds]
wcohen has joined #systemtap
tromey has quit [Quit: ERC (IRC client for Emacs 26.1)]
hpt has joined #systemtap
mjw has joined #systemtap
<agentzh>
I tried adding a varuse_collected boolean field to the embeddedcode class and our most complex stap script's compilation time (phase 2) reduces from 28s to 6s on my machine. yay
<agentzh>
interestingly this bottleneck was spotted by stap itself, via timer.profile + reversed cpu flame graphs.