<headius[m]>
chrisseaton: finally got my apple device, where did you get the openjdk zero build you're using?
<headius[m]>
enebo: I'm looking at IR for def foo(*a); super(*a); end and seeing it splat twice
<headius[m]>
I'd like to get super calls inlining but this is a bit confusing
<headius[m]>
I think for now I will skip trying to inline super sites that have splats... the signature juggling is a distraction right now
<headius[m]>
we may want to look at specializing IR a bit more for the different super argument forms
nirvdrum has quit [Ping timeout: 256 seconds]
<enebo[m]>
yeah that is interesting
<headius[m]>
class Y; def foo(*a); super(*a); end; end
<enebo[m]>
but if a is changed then a splat should happen twice shouldn't it?
<headius[m]>
I see the rest arg receive and then two splats before the call
<headius[m]>
well, whether it changes or not why are there two?
<enebo[m]>
probably because we do not track how a is used
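The double splat makes more sense with a concrete case: `a` can be rebound between the rest-arg receive and the super call, so `super(*a)` has to splat whatever `a` holds at call time. A minimal illustration (class names made up for the example):

```ruby
class X
  def foo(*args)
    args  # return whatever super actually passed up
  end
end

class Y < X
  def foo(*a)
    a = [:changed]  # rebinding a between the receive and the super call
    super(*a)       # must splat the *current* value of a
  end
end

p Y.new.foo(1, 2, 3)  # => [:changed]
```

If the IR doesn't track whether `a` is rebound or mutated, it has to conservatively re-splat at the super call, which is one plausible source of the second splat.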
<enebo[m]>
but I wonder how MRI does this
<headius[m]>
could be
<headius[m]>
getting super to inline will be easy but for this argument logic
<enebo[m]>
well it could just be conservatively ignored in this form
<enebo[m]>
I would like to understand what should happen but this is only so common and other forms would still benefit
<headius[m]>
yeah I am taking a step back and just looking at non-splatted now, because those are passing args through pretty much like regular calls
<headius[m]>
so the remaining logic change is just in looking up and guarding the site
<enebo[m]>
and IR did eliminate a class of calls where it does just decompose to a regular call already
<enebo[m]>
but as far as improvements go there are plenty of unresolved super
<headius[m]>
yeah I'm also only looking at instance super right now
<headius[m]>
we have all the logic in all the right places but figuring out the right way to turn that inside out and cache/inline is tricksy
<headius[m]>
perhaps I should do this as a prototype in the old style non-indy CallSite first
<headius[m]>
then non-indy and interp will cache
<enebo[m]>
it is helpful to have non-indy version regardless
<enebo[m]>
Or at least I am a fan of the longer term idea of only indy'ing hot stuff once we can crack the profiling nut
<headius[m]>
yeah well the simple profiling option is still pretty easy... add a counter at the site and until it reaches N we just use a non-inlining mono cache
<headius[m]>
so very quick to bind to a very simple dispatcher, but if it's continually hit we flip to indy
<headius[m]>
actually this is almost how it works right now... the end of the PIC is a simple monomorphic cache
<headius[m]>
so basically we bind that first and then if we start to see heavily-hit monomorphic targets we start wrapping that monomorphic cache with some PIC layers
<headius[m]>
mostly it just flips the current logic on its head... current is "build pic until we see N targets and then failover to mono cache"... new logic would be "use mono cache until we see that it's monomorphic and hot, and then build PIC"
<headius[m]>
but we'd always use indy, just with a trivial target at first
<enebo[m]>
I was thinking a bit more coarse but sure
<enebo[m]>
For me the largest problem, determining hotness of, let's say, an entire method, has eluded me so I just mention it
<enebo[m]>
I did make that delta time stuff but it lacks any reasonable calibration
<enebo[m]>
we cannot just pick a value and have it work everywhere but I do not know how to determine that value at runtime
<headius[m]>
yeah I am just eager to find a way to have always-on indy without impacting warmup and startup and memory profile
<enebo[m]>
heh yeah me too
<headius[m]>
the benefit of using indy all the time is that by changing call site target the JVM will deopt/reopt for us rather than us trying to do it
<headius[m]>
otherwise we need to re-emit code
<enebo[m]>
yep just emitting the perfect thing once would obviously be best
<headius[m]>
I mean a lot of what we've discussed could be shoved into indy sites, like "here's my frame data, make it lazily if you need to"
<headius[m]>
rather than trying to profile for frame, we just create it on demand as part of the calls that need it
<headius[m]>
if no call needs it it stays on JVM frame
<headius[m]>
but that's beyond this discussion
<enebo[m]>
yeah
<headius[m]>
we need to get supers inlining and kwargs optimized first I think :-)
<enebo[m]>
I remember java dude (name is escaping me but perhaps I shouldn't name him anyways) at JVMLS getting a little nasty at a Scala talk?
<enebo[m]>
He was like what are the cost metrics or something like that on how much time a feature would take
<enebo[m]>
Like somehow he wanted an accounting like "this feature takes 10 clocks"
<headius[m]>
probably Josh Bloch and his "semantic gap" talk
<enebo[m]>
but this is somewhat an issue with JVM itself
<enebo[m]>
well it was Josh Bloch but not his talk
<enebo[m]>
he was attacking someone who gave a talk
<headius[m]>
ah sure
<headius[m]>
talk might have been David Pollack's talk on "wow look at what scala does under the covers"
<enebo[m]>
but the semantic gap or sorts is the JVM itself
<headius[m]>
which had the opposite effect on most of those watching
<enebo[m]>
That actually was it I think
<enebo[m]>
We do not really have much ability to examine anything but small chunks of code and say, "ah it turns out like that"
<enebo[m]>
but once you put it into something large it might not turn out like that
<enebo[m]>
this is not really special to any runtime or compiler for that matter but it is what makes it difficult to reason with
<enebo[m]>
I think qualitatively early invokedynamic suffered from code explosion
<enebo[m]>
I am not fully sure how true it is but we do still see warmup issues (I think anyways) on large apps
<enebo[m]>
Actually if someone who worked on indy would write a blog on changes that would be very enjoyable
<headius[m]>
well I think this has improved somewhat since they fixed how the metrics for inlining depth and size are calculated across indy/MH calls
<headius[m]>
it's also unclear how much this failed inlining is affecting our stuff... it's certainly slow, but it may actually be producing a lot more code since none of the indy stuff inlines either
<headius[m]>
so if we figure out how to fix that and more indy sites start inlining the warmup and memory effects may reduce
<enebo[m]>
ah yeah I do not know which case that is but I saw your email and is that engineer back from vacation? :)
<headius[m]>
looks like vladimir might have a trick to fix this
<enebo[m]>
as I have said in the past I feel like we need something we can just run occasionally to measure impact of changes on warmup
<enebo[m]>
I think it ultimately should be something close to a Rails app but I have been thinking about that and I am concerned that Rails is too much of a moving target to use it as a long term bench
<enebo[m]>
So something with a lot of disparate callsites which are called in a mix like a traditional website, with a few hot paths and a lot of occasional ones
<enebo[m]>
we could vendor lock some version of Rails and it will work for a couple of years or more if we are lucky
<headius[m]>
mmm yeah
<headius[m]>
rails is just a mess to try to use for investigating this stuff
<headius[m]>
perhaps some benchmark that jeremyevans uses on roda or sequel? We know it will be designed to be as fast as Rubily possible
<headius[m]>
there's a kernel of typical Ruby patterns that we always want fast
<headius[m]>
Rails has so much wacky stuff outside that kernel it's hard to see through
<enebo[m]>
yeah fast as Rubily possible may not be the code we should examine though
<enebo[m]>
It does not hurt that we can execute that well, obviously
<headius[m]>
maybe
<enebo[m]>
and perhaps all code will go that way? I don't know
<enebo[m]>
but I do agree with the pattern notion
subbu is now known as subbu|lunch
<enebo[m]>
I am not sure how much of the Rails codebase actually cares about this or not
<enebo[m]>
We are missing some big wins with kwargs no doubt already in rails
<headius[m]>
at this point I feel like we have enough known unknowns to keep us busy, like super inlining, zero-alloc kwargs, and so on
<enebo[m]>
yeah
<headius[m]>
but I think those need us both because there's IR work to do
<headius[m]>
I am proceeding with a simple call site for super stuff to get a feel for it now
subbu|lunch is now known as subbu
nirvdrum has joined #jruby
mistergibson has joined #jruby
byteit101[m] has joined #jruby
<headius[m]>
enebo: I'm calling it a day... super caching is a little bit trickier because we need to know nothing changed below the superclass, or the lookup might change
<headius[m]>
I think it should be possible to just verify bottom class, like we do for normal sites, but the lookup returns a cache entry from the superclass
<headius[m]>
so might need a new way to cache super methods
<enebo[m]>
I think with stabilization of types being generally common it should be fine
<headius[m]>
yeah
<enebo[m]>
singletons/eigenonsense being the exception perhaps
<headius[m]>
I pushed a PR with some other small optimizations
<headius[m]>
hmmm something small regressed though
<enebo[m]>
ok
ur5us has joined #jruby
mistergibson has quit [Quit: Leaving]
<lopex>
numbers
nirvdrum has quit [Ping timeout: 260 seconds]
_whitelogger has joined #jruby
ur5us has quit [Remote host closed the connection]