#jruby on 2020-01-28 — irc logs at freenode.irclog.whitequark.org

2019-08-12 18:53 ChanServ changed the topic of #jruby to: Get 9.2.8.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

00:14 nirvdrum has quit [Ping timeout: 265 seconds]

01:01 ur5us has joined #jruby

01:12 nirvdrum has joined #jruby

01:34 ur5us has quit [Ping timeout: 260 seconds]

02:02 nirvdrum has quit [Ping timeout: 268 seconds]

03:08 travis-ci has joined #jruby

03:08 travis-ci has left #jruby [#jruby]

03:08 <travis-ci> jruby/jruby (jit_irscope_removal:cc715b2 by Charles Oliver Nutter): The build is still failing. https://travis-ci.org/jruby/jruby/builds/642699791 [168 min 28 sec]

06:48 nirvdrum has joined #jruby

07:35 nirvdrum has quit [Ping timeout: 265 seconds]

08:04 rusk has joined #jruby

08:07 rusk has quit [Read error: Connection reset by peer]

08:09 rusk has joined #jruby

09:00 nirvdrum has joined #jruby

09:12 shellac has joined #jruby

09:35 nirvdrum has quit [Ping timeout: 268 seconds]

10:00 nirvdrum has joined #jruby

10:36 Liothen has quit []

10:37 Liothen has joined #jruby

12:08 nirvdrum has quit [Ping timeout: 260 seconds]

12:47 shellac has quit [Quit: Computer has gone to sleep.]

13:04 shellac has joined #jruby

13:39 nirvdrum has joined #jruby

13:43 shellac has quit [Quit: Computer has gone to sleep.]

13:44 lucasb has joined #jruby

14:09 <headius[m]> good morning!

15:06 <rwilliams[m]> related: https://www.youtube.com/watch?v=a3mrxWSmd_s

15:07 <rwilliams[m]> headius: what time zone are you in currently?

15:07 <headius[m]> currently, US central

15:07 <rwilliams[m]> you're based in MN right?

15:07 <headius[m]> yup me and enebo both

15:08 <rwilliams[m]> I could do a suberb of Minneapolis

15:08 <headius[m]> it's a nice area

15:08 <rwilliams[m]> though the wheather would take some adjusting

15:08 <headius[m]> Winters notwithstanding

15:08 <rwilliams[m]> hah yeah

15:09 <rwilliams[m]> When the company you're consulting for says they're gonna take you ice fishing.....

15:09 <headius[m]> it's not so bad if you get used to mostly being inside from December through maybe March

15:09 <headius[m]> I've never lived anywhere you could be outside year round

15:09 <rwilliams[m]> I live in sacramento so anything below 40 is like death

15:09 <rwilliams[m]> I don't really own a jacket

15:10 <rwilliams[m]> well winter coat

15:10 <enebo[m]> Some people adjust pretty quickly and others never feel warm again.

15:10 <headius[m]> yeah bit of an adjustment but at least you're not in a really warm place

15:10 <rwilliams[m]> Yeah i did a couple of trips to enden prairie in nov and early spring

15:10 <headius[m]> weirdly we have large immigrant populations from Indochina and Somalia

15:11 <enebo[m]> MN keeps getting warmer except for a 2 weeks in the winter where it gets much colder because the jet stream has been changing

15:11 <headius[m]> yeah we always get a week below 0ºF

15:11 <headius[m]> sometimes way below... was it down to -30 last year?

15:12 <enebo[m]> warming climate has been pushing jet stream down so I think we get a little longer and a little colder...but only a little more than it used to be

15:12 <enebo[m]> It also I think has been responsible for how much rain we have been getting in warmer months

15:13 <enebo[m]> last year had most precipitation in our history

15:13 <rwilliams[m]> wow

15:13 <rwilliams[m]> We had great rain last year after probably 5 of so years of drought

15:13 <enebo[m]> yeah get used to it

15:14 <enebo[m]> CA will probably continue to dry out and you will be fighting water wars from your primary water source

15:14 <rwilliams[m]> Yeah :(

15:14 <headius[m]> we get a lot of work done during the winter

15:14 <enebo[m]> hahaha

15:15 <headius[m]> yeah Mad Max California Dreamin

15:15 <enebo[m]> I used to be a more outdoors guy in winter...not any more

15:15 <rwilliams[m]> The company i was working for used geothermal for their data centers during the winter, it was pretty cool

15:15 <rwilliams[m]> for cooling

15:15 <headius[m]> at least you won't be fighting over guzzoline once everyone's driving EVs

15:15 <rwilliams[m]> Yeah

15:16 <rwilliams[m]> I'm thining a model 3 for my next car

15:16 <enebo[m]> I have owned a nissan leaf since 2015

15:16 <rwilliams[m]> Nice

15:16 <headius[m]> I'd get an EV for my next car but I'd need to talk to my apartment about charging

15:17 <enebo[m]> 84 mile range sometimes hampers me but any new EV has way more than enough

15:17 <headius[m]> someone's going to breach that question soon enough though

15:17 <enebo[m]> I prefer a proper hatchback so I think even the model Y is off the table

15:17 <enebo[m]> plus I dislike the ipad only interface of 3+

15:17 <rwilliams[m]> enebo: I was looking at the egolf a few years back but i regularly drive 90 ish mile round trips so it would be a pain

15:18 <enebo[m]> headius: yeah that is something places like NL has figured out but it will have to happen soon

15:18 <enebo[m]> rwilliams: yeah Tesla is a great choice with supercharger. ID.3 by VW when it comes out will have the range but charging in US "depends". CA probably is fine for non-Teslas too

15:19 <enebo[m]> ID.3 will kill it in europe

15:19 <enebo[m]> It should do well in US too but EU loves their hatchbacks

15:19 <rwilliams[m]> oh that looks nice

15:19 shellac has joined #jruby

15:20 <rwilliams[m]> i dig smallish cars

15:20 <rwilliams[m]> I had a gti and golf r previously

15:20 <headius[m]> EV does worry me because I take 2-3 long driving trips in summer

15:20 <headius[m]> I could rent but meh

15:20 <enebo[m]> Tesla model Y I think is compromising range via aero to have proper hatch size

15:21 <enebo[m]> yeah renting is a good choice though for long road trips since you do not put that wear on your car

15:21 <enebo[m]> priced out it may be similar costwise but it drags out when you need to make that next big purchase

15:21 <headius[m]> hard to know whether the wear is worth it for renting a car for a week just to drive to and from a place though

15:21 <headius[m]> seven days of rental to save 20h of driving wear on my car

15:22 <enebo[m]> 20h of driving is how much normal driving? 3-4 weeks?

15:22 <headius[m]> I'd say more like 2 with kid stuff

15:22 <headius[m]> maybe less

15:22 <headius[m]> I drive to Fridley for pool every week

15:23 <headius[m]> yeah I'd say I easily do 10h a week

15:23 <rwilliams[m]> pool like swimming or pool like billiards?

15:23 <headius[m]> which is weird since I don't commute

15:23 <headius[m]> pool like billiards, I'm in an APA league for 8 and 9-ball

15:23 <rwilliams[m]> oh cool

15:23 <enebo[m]> yeah at highways speeds that would be 600+ miles

15:29 <headius[m]> enebo: almost have IRBytecodeAdapters busted up into smaller interfaces

15:29 <headius[m]> makes it pretty clear what stuff is indy-always now

15:29 <headius[m]> I'll restore the non-indy versions from before recent bytecode optimization

15:29 <headius[m]> should be possible by end of day to have AOT emit code with no invokedynamic

15:29 <headius[m]> well hopefully way before end of day

15:30 <headius[m]> I dunno if this is the "best" structure but it will let us compile things separately... how we compile literal values vs invocations vs yields etc

15:31 <enebo[m]> ok well we can play with this. I also have needs of wanting to call different things for inliner

15:31 <headius[m]> yeah that can be another form of the InvocationCompiler

15:31 <enebo[m]> like being able to produce a different callsite and use indy for post-inlined versions

15:31 <headius[m]> should be able to just implement invokes and reconfigure

15:32 <enebo[m]> but it is likely a dimensionality aspect as well...AOT with inlining vs JIT with inlining

15:32 <enebo[m]> anyways I will see how you changed things today :)

15:34 <rwilliams[m]> So will we be able to have ruby/rails/and our gems AOT'd for development so startup should be super fast? when this all gets going.

15:36 <enebo[m]> well it our hope that it will be faster

15:37 <enebo[m]> headius: this is still one huge outstanding question...why if I run more rails commands does rails console get slower?

15:37 <enebo[m]> at most it should just mean more files are in ~/.ir but the same number of them should get consumed running rails console whether they are more files or not

15:38 <enebo[m]> The more I think about this the more it bothers me because the speed difference is large

15:46 <enebo[m]> CLASSPATH= the issue? is Java scanning all that whether it is used or not?

15:47 <headius[m]> could you try jarring them up?

15:47 <enebo[m]> sure I can give that a go

15:48 <headius[m]> the way I'm loading the script classes, I check first if they are actually there as a .class file using getResource

15:48 den_d has quit []

15:48 <enebo[m]> So after running runner I have 1704 .class files

15:48 <headius[m]> to avoid hitting ClassNotFound for everything when nothing's compiled

15:48 den_d has joined #jruby

15:49 <enebo[m]> runner will rails new; scaffold migrate, server

15:49 <headius[m]> I dunno if that or classloading are sensitive to the number of files

15:49 Iambchop has quit []

15:49 <enebo[m]> I am seeing nothing with -d but would I?

15:49 Iambchop has joined #jruby

15:49 <enebo[m]> Are we eating that exception somewhere?

15:49 <headius[m]> JVM6/7, ClassData6/7, and IRBytecodeAdapter6/7 are gone!

15:49 <headius[m]> huzzah

15:49 <enebo[m]> ok well I think I am excited but I am not sure yet :)

15:50 <headius[m]> you can see tryScriptFromClass in Ruby

15:50 <headius[m]> only exception it catches is CNFE and logging will output something

15:51 <headius[m]> but really now you shouldn't see it since I check if the resource is there

15:51 <headius[m]> there's no other fast way to say "try to load this class" unfortunately

15:51 <enebo[m]> time echo exit | CLASSPATH=/home/enebo/a.jar JRUBY_OPTS="-Xcompile.cache.classes=true --dev -J-XX:+UseParallelGC" jruby -S rails c

15:51 <headius[m]> if you have more classes but don't reference them I don't know why it would be slower

15:51 <enebo[m]> If I do this it complains: LoadError: load error: rails/cli -- java.lang.NullPointerException: null

15:52 <enebo[m]> but I went into ~/.ir and did a jar cf ../a.jar .

15:52 <headius[m]> pushed changes

15:52 <enebo[m]> should I have did that from home?

15:52 <enebo[m]> ignore me

15:52 <enebo[m]> I changed dir :)

15:52 <headius[m]> I want to see NPE

15:53 <enebo[m]> ok doing that second after I go into an actual ruby dif

15:53 <enebo[m]> err rails

15:53 <headius[m]> making jar in .ir should work

15:53 <enebo[m]> ok working

15:54 <enebo[m]> same speed as .jar

15:55 <enebo[m]> This might be because I do not have your synch. commit

15:56 <enebo[m]> I cannot repro now

15:56 <headius[m]> I did not dig deeper on that so I don't know what was causing it exactly

15:57 <headius[m]> same speed as in you see slower than just generating rails c classes

15:58 <enebo[m]> same speed using cache classes in ~/.ir or in a jar

15:58 <enebo[m]> how do I do appcds?

15:59 <headius[m]> it's in the gist I made yesterday

15:59 <enebo[m]> not really related unless it massively changes this

15:59 <headius[m]> https://gist.github.com/headius/5de632bd4a95492b5f16582f8b297040

15:59 <enebo[m]> ah cool I missed this

15:59 <headius[m]> get a diff when generating just for rails c

16:00 <headius[m]> like diff class count + which ones extra are there

16:00 <enebo[m]> so can I put both appcds flags on command line or do I need to do one then the other to run

16:01 <enebo[m]> yeah I will look at relative difference next

16:01 <headius[m]> enebo: you should try the ReadyNow feature from zing if you get a chance

16:01 <headius[m]> it's linux only so I'd have to spin up a VM and that would not be accurate

16:01 <enebo[m]> and possibly run something completely unrelated to rails to get that nunber higher

16:01 <headius[m]> https://www.azul.com/zing-trial-download/

16:01 <headius[m]> ReadyNow is supposed to do CDS + JIT caching I believe

16:02 <headius[m]> one and then the other re: CDS flags

16:02 <headius[m]> I think they are working on making it possible to just update the existing archive but that doesn't work in 13

16:02 <enebo[m]> ok

16:02 <headius[m]> a lot of these features are not optimized well for MacOS too so your results may not be totally anomalous

16:03 <headius[m]> MacOS OpenJDK still polls for file watcher I believe 🙄

16:03 xardion has quit [Remote host closed the connection]

16:03 <headius[m]> I'm going to continue de-indyfying

16:07 <enebo[m]> appcds slows this down

16:08 <enebo[m]> err let me double check since I had to switch to 13

16:08 xardion has joined #jruby

16:09 <enebo[m]> p

16:09 travis-ci has joined #jruby

16:09 <travis-ci> jruby/jruby (jit_irscope_removal:98d9cb5 by Charles Oliver Nutter): The build is still failing. https://travis-ci.org/jruby/jruby/builds/642954618 [170 min 21 sec]

16:09 travis-ci has left #jruby [#jruby]

16:10 <enebo[m]> weirdly it feels much noisier

16:11 <enebo[m]> unfortunately running rails is long enough where anything else on my system can indluence

16:20 <enebo[m]> well this is annoying...I see no difference today at all with doing just a single rails c

16:21 <enebo[m]> which maybe means the synchronization or something like that killed something showing that behavior?

16:22 <enebo[m]> likely meaning something was not fully loading

16:22 <enebo[m]> but if that is true then I would expect to sometimes see this issue with all the extra files

16:24 <headius[m]> issue?

16:24 <headius[m]> not sure what rev you were testing against yesterday

16:24 <headius[m]> maybe you were missing some fixes that were silently failing some code

16:26 <enebo[m]> the issue that is was a lot faster

16:26 <enebo[m]> yeah I could go back a few commits and see if it happens

16:26 <headius[m]> yeah that's a good point

16:26 <headius[m]> if it fails it should always fail

16:26 <headius[m]> not like doing a larger command will jit some file differently

16:35 <headius[m]> enebo: you said you did confirm the console comes up yeah?

16:35 <headius[m]> like without > /dev/null

16:39 shellac has quit [Ping timeout: 248 seconds]

16:43 <enebo[m]> yeah it works

16:43 <enebo[m]> https://gist.github.com/enebo/a9ec1111b3228f39c1211d54cb5f44e6

16:43 <enebo[m]> ok different scenario but I documented it fully this time

16:43 <enebo[m]> I am positive I did not do this originally but it shows similar behavior

16:44 <headius[m]> can you run cached rails c with -d passed to JRuby?

16:44 <headius[m]> should show if there's different errors

16:44 <headius[m]> that is a conundrum though

16:44 <headius[m]> not a conundrum

16:45 <headius[m]> a puzzle

16:45 <enebo[m]> I have

16:45 <enebo[m]> I will add a gist...I think we get two circular and some missing files rails looks for

16:46 <headius[m]> you could also compare classloading for cached case, -Xverbose:class

16:46 <enebo[m]> https://gist.github.com/enebo/5308f3d4cf0e7c89bfb46ee96a98fb67

16:46 <headius[m]> if both modes load the same classes I'm stumped

16:47 <enebo[m]> well of course with what I showed you they will not

16:47 <headius[m]> seems fine

16:47 <headius[m]> what do you mean?

16:47 <enebo[m]> one only does the first ruby process of running that command and the second one does both

16:47 <enebo[m]> so it is a massive difference in number of classes loaded

16:47 <headius[m]> for emitting AOT

16:47 <headius[m]> but for rails c I want to see what classes actually load

16:48 <headius[m]> there shouldn't be any difference regardless of how much you AOT

16:48 <enebo[m]> I missed in second section but in second run it generates 1421 classes

16:48 <enebo[m]> first is only 134

16:48 <headius[m]> ugh I have to write constant lookup logic without indy...that has never existed in 9k

16:48 <headius[m]> not generate

16:48 <headius[m]> load

16:49 <headius[m]> only 134 files AOT if you just dump for rails c? That seems really low

16:49 <enebo[m]> well I fully expect the difference to be 134 vs 1421 but I can see if those both end up being the same amount of loads or not as were generated

16:49 <enebo[m]> I must have not written that gist clearly

16:50 <headius[m]> the rails c case that only touches 134 scripts should definitely NOT be loading 1421 classes

16:50 <headius[m]> your runner causes a lot more files to load and AOT

16:50 <headius[m]> that should not change how many cached AOT classes are loaded by rails c

16:50 <enebo[m]> hey let me try this again

16:50 <enebo[m]> one I put -X+C into main command and in the other I put it in JRUBY_OPTS

16:51 <headius[m]> yup

16:51 <enebo[m]> I did this so second spawn of Rails in rails console would not pre-cache those

16:51 <enebo[m]> that second spawn loads a lot more code than the first

16:51 <headius[m]> oh

16:51 <enebo[m]> which is why we only see 134 class files vs 1421

16:51 <headius[m]> well that's the reason right there

16:51 <headius[m]> most of the rails c scripts aren't actually loading from classes then

16:52 <enebo[m]> but only doing 134 made it start up noticeably faste than --dev without it

16:52 <headius[m]> that's why it's faster

16:52 <enebo[m]> but it is still faster than --dev

16:52 <headius[m]> ok

16:52 <headius[m]> ok I think I follow

16:52 <enebo[m]> that was my wonderment...it is clear that cloassloading can take longer than --dev

16:53 <enebo[m]> but some classloading did pay off

16:53 <headius[m]> so caching AOT for just the first-process Rails scripts makes rails c start up faster

16:53 <headius[m]> caching AOT for both first and second does not help

16:53 <enebo[m]> yeah considerably faster

16:54 <headius[m]> so something compiled for that first set is actually helping and something in the second set is hurting

16:54 <enebo[m]> So your work right now may pay larger dividends if the cost of the slowdown is indy on classes/modules/scripts

16:55 <enebo[m]> it should help code interp so I expect the full case to improve on timings

16:55 <enebo[m]> and currently it is approx even with --dev

16:56 <headius[m]> that's interesting

16:57 <headius[m]> I wish there were a way to force rails to not spawn

16:57 <headius[m]> even -v spawns now I think

16:57 <headius[m]> ok I believe you

16:57 <headius[m]> I can try to repro that now that I understand

16:58 <headius[m]> I thought you were always using env but the runner case was also emitting AOT for stuff like generating app

16:58 <enebo[m]> I was yesterday...I swear...but with these results I do have doubts now

16:58 <headius[m]> so rails c full AOT is that 1400-some count and something in that extra 1200+ seems to be negating the gains from the initial 134

16:59 <enebo[m]> yeah

16:59 <enebo[m]> our best hope is modules/scripts/classes are slower and not using indy will be a lion share

17:00 <enebo[m]> we need guided dumps

17:03 rusk has quit [Remote host closed the connection]

17:04 <headius[m]> yeah so interesting facts..

17:05 <headius[m]> parent rails process only loads 134ish scripts, which is remarkable

17:05 <headius[m]> once loaded it will futz around with bundler and dependency stuff and then respawn

17:05 <headius[m]> so that logic might not have jitted before and remained interpreted and slow every time

17:06 <headius[m]> now it's already bytecode so that plus Tier=1 improves the speed of the parent launch

17:06 <headius[m]> child process loads ALL of rails and app and then just starts up console

17:06 <headius[m]> so most stuff doesn't execute more than running through the script most likely

17:09 <enebo[m]> yeah I was not sure how much of railties loaded but I assumed most of rails was not in first process

17:09 <headius[m]> right

17:09 <enebo[m]> 1421 scripts seems massive to me

17:09 <headius[m]> so fewer classes, fewer methods

17:09 <enebo[m]> but helpful

17:09 <enebo[m]> we know they will get loaded in second run

17:09 <enebo[m]> so both runs do them

17:09 <enebo[m]> which might be part of it

17:10 <headius[m]> I will have almost everything non-indy shortly

17:10 <headius[m]> constants will take longer

17:10 <enebo[m]> but it underscores loading from AOT is slower than parsing

17:10 <headius[m]> because I have to make that code up

17:11 <headius[m]> right, so lower to load (ignoring CDS etc) but potentially we're seeing warmup kick in faster

17:11 <headius[m]> you could get jit logs for the two cases, just for top process

17:11 <enebo[m]> yeah my take as well

17:11 <headius[m]> two cases = --dev with and without cache

17:12 <headius[m]> if we see more ruby methods JIT by JVM we may have an answer

17:12 <headius[m]> a good answer for FOSDEM Too

17:14 <headius[m]> I realize my splitting up of JIT stuff is not quite what we want

17:14 <headius[m]> we want things that run for the script body to be non-indy in probably all cases

17:15 <headius[m]> but anything in repeatable bodies should usually be indy except if we're AOTing for non-indy environment

17:15 <headius[m]> the split is clearly things that run once versus things that run many times

17:16 <headius[m]> this is also something that might justify only emitting AOT for repeatable bodies and leaving script's once-through as interpreter

17:16 <headius[m]> justification for AOT emitting per-method but your 1400 classes will be more like 14000 then

17:17 nirvdrum has quit [Ping timeout: 268 seconds]

17:17 <headius[m]> dear god I need that new MBP just for the blasted arrow keys

17:23 <enebo[m]> so I was thinking the same thing with saving interp'd version of scripts/module/bodies which do not contain anything other than defs

17:23 <enebo[m]> we could actually do ruby rewriting and emit smoething without comments

17:30 <enebo[m]> heh just thinking about making a mega simple interpreter which just knows how to call constructors which represent compiled methods or interpreted class bodies also in the same interpreted language

17:30 <enebo[m]> the language would be tiny

17:31 <headius[m]> yeah

17:31 <headius[m]> these are definitely very different targets

17:33 <enebo[m]> oh wow...I just realized there is some very interesting potential

17:34 <enebo[m]> script -> module -> class -> method1, .... method n

17:34 <enebo[m]> the script/module/class could all literalls just be in the same "script" and not nested at all

17:34 <enebo[m]> so that structure could just be serialized

17:35 shellac has joined #jruby

17:35 <enebo[m]> if there is other logic...then well it is ruby but most ruby classes are literally just mild nesting with methods defined in them

17:36 <headius[m]> well this kinda plays into the codeDB idea Brian was working on

17:36 <headius[m]> or at least I think it does...basically if we can separate the structure of what a script defines from the little bits of code it runs we can do a better job of loading up that structure quickly

17:36 <enebo[m]> well it would be a different form...but I think this is more akin to just having a better serialized format than Ruby or IR

17:36 <headius[m]> I mean a large portion of the code in script+module+class bodies is nothing but defining things

17:37 <enebo[m]> main problem with Ruby parser is it is really big

17:37 <enebo[m]> ah yeah that is true

17:37 <enebo[m]> I would say nearly all of it is

17:37 <headius[m]> but at the very least anything we should have a lighter-weight execution mode for any scope that's not a method, block, or metaclass inside method or block

17:37 <enebo[m]> There are some libraries which have lots of top-level logic but once within a class it just goes normal again

17:38 <enebo[m]> The other major legs this has is the notion that we are not polluting all thes one-time types

17:38 <enebo[m]> Java types I mean in this case

17:39 <enebo[m]> So main premise is Ruby parser warms up slowly and JIT is just a really expensive way of doing this

17:39 <enebo[m]> appcds is a hope for making JIT less expensive (I should say AOT here)

17:39 shellac has quit [Ping timeout: 248 seconds]

17:40 <enebo[m]> but even if it was free we would use way less memory if we had a simple interp which threw that away immediately

17:40 <headius[m]> yeah

17:41 <headius[m]> so with the AOT the way it is now we're paying the high cost of loading and running script/module/class bodies exactly once

17:41 <enebo[m]> Well we may have a new "futures" section

17:41 <headius[m]> but also gaining a leap into already-compiled methods and blocks

17:41 <headius[m]> we want the latter but not the former

17:41 <enebo[m]> yeah exactly

17:41 <enebo[m]> so one-time should construct those others in something which can warm very quickly

17:41 <enebo[m]> but not add a java type

17:43 <enebo[m]> I do realize one additional snag (not really a snag but important addition)

17:43 <enebo[m]> require is a call at the top of most calls so this simpler interp still needs to address require as a call

17:44 <enebo[m]> so class/module/def/require as valid elements and class/module can just emit into the same string its contents

17:49 <headius[m]> I don't think calls in a script body need any special handling

17:49 <headius[m]> oh but you are thinking of a Ruby subset for script body I guess?

17:49 <enebo[m]> yeah completely different interp

17:50 <enebo[m]> that is a loop which literally only knows how to do like 6 things

17:50 <headius[m]> in theory that can work but I feel like most scripts will have at least one thing to break that

17:50 <headius[m]> not only require but metaprogramming methods

17:50 <headius[m]> attrs, aliases

17:50 <headius[m]> define_method

17:51 <enebo[m]> most? I don't think so but subscopes in those files won't and can use it

17:51 <enebo[m]> it is not all or nothing

17:51 <enebo[m]> I guess it would make me wonder how many would have to be that way to be worth it

17:51 <enebo[m]> and also perhaps there are a handful of ruby builtin methods where they could be added

17:52 <enebo[m]> attr_reader* and alias/alias_method

17:52 <headius[m]> yeah I have nothing to back up my feeling

17:52 <headius[m]> if I pick a file at random...

17:52 <enebo[m]> I agree that there are many files which do but I bet there are many scopes which don't

17:52 <headius[m]> find.rb is ok, needs module_function

17:53 <enebo[m]> if it was 40% it likely would still be a big win

17:53 <headius[m]> so those are there too, module_function, public, private, protected

17:53 <headius[m]> and in 2.7 ruby2_keywords or whatever nonsense

17:53 <headius[m]> forwardable assigns a class instance var

17:54 <jeremyevans> headius[m]: ruby2_keywords isn't scoped like module_function, public, private, protected

17:54 <enebo[m]> just looked in random activejob file

17:54 <headius[m]> OpenURI builds a hash for a constant...hash could contain a lot of other stuff but it's all symbols and constant values here

17:54 <enebo[m]> one require with a bunch of module/class nesting...one module has extend

17:55 <headius[m]> so most of these have manageable needs

17:55 <jeremyevans> headius[m]: assuming by scoped you mean when called without arguments, affects future methods

17:55 <enebo[m]> but fcalls in any of these scopes would be trivial to add

17:56 <enebo[m]> I think fcall so long as it is constant or immediate and stuff talked about above for scopes could be a very small interp

17:56 <headius[m]> you could kind of figure this out right now by just compiling something with lazy method bodies then scan instrs

17:56 <headius[m]> or just walk all scopes until you reach method or blocks

17:56 <headius[m]> ignore those

17:57 <enebo[m]> well I was thinking about what source elements must exist and just check in AST building

17:57 <headius[m]> yeah I dunno how much smaller it is looking at things

17:57 <headius[m]> like what wouldn't be in a script body?

17:57 <headius[m]> all the literals are possible for constants

17:57 <headius[m]> all forms of calls

17:57 <enebo[m]> You know I am saying we only do this in the presence of limited set of elements

17:58 <enebo[m]> if there are more than that then punt

17:58 <headius[m]> sure, there may be a subset that's a win

17:58 <headius[m]> clearly if it's only class, module, def, visibility methods, attrs, and requires that's one possible subset

17:58 <enebo[m]> I think it will be pretty large but I guess I will have to analyze and probably try something

17:58 <headius[m]> constants pull in all the literal types

17:59 <enebo[m]> but the constant is its own interp element

17:59 <headius[m]> everything I've looked at so far would need class, module, def, visibility, attr, and at least constant assignment for the class/module

18:00 <enebo[m]> extend Foo is a interp of Foo and fcall on the result

18:00 <headius[m]> we could also go back to AST interp just for script/module/class and only use IR for method/block bodies

18:00 <headius[m]> how's that for nutso

18:00 <headius[m]> 99% of what runs at app boot would go through AST interp but methods and blocks would be IR and optimizable

18:00 <headius[m]> NUTSO

18:03 <enebo[m]> well I said that a while up above

18:03 <enebo[m]> but I realized that if we are going to interp then we could choose how to interp

18:04 <enebo[m]> a limited form for limited format files and ordinary for things not limited

18:06 <enebo[m]> lunch be back in a bit

18:06 <headius[m]> I didn't see you mention AST interp but yeah this is all interesting

18:06 <headius[m]> I mean really CDS is basically codeDB because it caches the structure of class files

18:06 <enebo[m]> oh haha I did not read that properly

18:06 <enebo[m]> yeah that is nutso

18:07 <headius[m]> shareclasses in OpenJ9 is the same thing...it pre-processes the metadata for classes into the internal VM format and saves that

18:07 <enebo[m]> I mean if we make an AST interp we may as well just kill the IR interp

18:07 <enebo[m]> which is always a possibility

18:07 <headius[m]> so the general idea is being able to smartly build/understand the structure of a Ruby script without a full interp

18:07 <headius[m]> rather than blindly interpreting it like any other piece of code

18:08 <enebo[m]> ok my food is hot on the table...back in a few

18:08 <headius[m]> yeah it's a possibility

18:08 nirvdrum has joined #jruby

18:08 <headius[m]> HOT FOOD

18:08 <headius[m]> ok

18:08 <headius[m]> I should probably eat something other than coffee

18:29 <enebo[m]> eat some coffee

18:32 <enebo[m]> another detail is in simple module A { module B { class C { def ... } } } it would save it as a single linear stream and each nesting would push to some pre-calcd primitive and the interp would just be a single linear pass potentially not even saving to the array if the back side of each scope does nothing

18:32 <enebo[m]> s/primitive/primitive array/

18:33 <enebo[m]> which is most common scenario

18:33 <headius[m]> sure

18:33 <enebo[m]> but talking through this a bit I can see supporting limited Constant and Constant Set + fcalls would greatly expand what it could do

18:34 <enebo[m]> the other great thing about this would be not standing up an AST

18:34 <enebo[m]> It could even be protobuf

18:35 <headius[m]> yeah

18:35 <enebo[m]> At some point the balancing point would be more about how many scopes fit the limited description and how big the resulting interp is

18:35 <enebo[m]> because tiny pre-inlined all temps would be ideal

18:36 <enebo[m]> nesting of scopes likely means knowing whether we care or allocing a small array or pre-allocing one as some field

18:36 <enebo[m]> anyways this is not for today :)

18:37 <enebo[m]> I like the idea in as much as I believe I can reduce the whole thing to something which is faster to parse than the original Ruby

18:37 <enebo[m]> and it still blows my mind that the Ruby parser is competitive after all these years

18:37 <enebo[m]> If you look at stdlib about 50% is comments now...processing those are not free

18:38 <enebo[m]> not expensive but even us having to revalidate every string eevn though it is the same ident as the previous 50 in a file

18:38 <enebo[m]> just having a simple constant pool should make it a win

18:41 <headius[m]> yeah that structure can be boiled down a lot more

18:41 <headius[m]> try zing now

18:41 <headius[m]> or get jit log from top-level rails process with and without cache

18:41 <headius[m]> I almost have non-indy AOT done except for constants

18:42 <enebo[m]> ok I got key and downloaded

18:42 <headius[m]> non-indy constants shouldn't be hard if I just call what interp does

18:46 <headius[m]> the other thing I realized about openj9 is that I don't know if it's bothering to compile and cache those once-through bodies

18:46 <headius[m]> I'll ask J9 folks actually

18:57 <headius[m]> enebo: I've pushed non-indy stuff I have so far

18:57 <enebo[m]> cool

18:57 <headius[m]> will look into constants now but you should try with latest

18:57 <headius[m]> that's the last piece in JIT that still uses indy unconditionally, I believe

18:57 <enebo[m]> ok I have zing installed now but I do not know what to do from there

18:58 <enebo[m]> I will peruse their user guide

18:58 <headius[m]> other than e.g. MethodHandle instances passed out to compiled block/method

18:58 <enebo[m]> I will try your changes first thoguh

18:58 <headius[m]> yeah you want the ReadyNow feature

18:58 <headius[m]> I have never used it

18:58 <headius[m]> hopefully it's as easy as J9

18:58 <enebo[m]> I will just trying running this as is quick to see if it is faster/slower than hotspot

18:59 <headius[m]> oy constant logic is bigger than I expected in interp

18:59 <headius[m]> because it's caching

19:00 <enebo[m]> heh --dev is slower than no flags

19:00 <enebo[m]> Don't know how Zing deals with that

19:00 <headius[m]> interesting

19:01 <enebo[m]> doh I should probably not specify parallel either

19:01 <enebo[m]> wonder if it just ignores stuff

19:02 <headius[m]> J9 does

19:02 <headius[m]> there's probably a separate set of flags that would be appropriate for --dev on zing

19:02 <headius[m]> we can add to the uberlauncher

19:02 <headius[m]> I need a break before I attempt and give up on non-indy constants

19:02 <enebo[m]> so however --dev is getting interpreted it is slower than not specifying it

19:02 <enebo[m]> removing parallel changed nothing at all

19:03 <enebo[m]> cache on and off are the same speed

19:03 <headius[m]> the biggest gain from --dev is tier 1 but it also turns off jit...maybe jit is more important on zing

19:13 travis-ci has joined #jruby

19:13 travis-ci has left #jruby [#jruby]

19:13 <travis-ci> jruby/jruby (jit_irscope_removal:7876f07 by Charles Oliver Nutter): The build was fixed. https://travis-ci.org/jruby/jruby/builds/643038568 [166 min 7 sec]

19:21 <enebo[m]> zing will make us never exit if I am doing this right

19:21 <enebo[m]> ProfileLogOut will write info out

19:21 <enebo[m]> ProfileLogIn will use that

19:21 <enebo[m]> doing a ProfileLogOut will finish and make a file

19:21 <enebo[m]> ProfileLogIn seemingly just does not exit

19:22 <headius[m]> Make us never exit?

19:22 <headius[m]> That's weird

19:22 <enebo[m]> yeah very weird

19:22 <enebo[m]> The docs are far from clear

19:23 <enebo[m]> It does make an epic log file on report generation

19:24 <enebo[m]> I think we have an issue with native gems and switching JVM above 9

19:24 <enebo[m]> or 9+

19:24 <enebo[m]> each time I switch it claims it cannot find bindex gem

19:27 <enebo[m]> LoadError: Could not open library '/home/enebo/work/jruby/lib/ruby/gems/shared/gems/sassc-2.2.1/ext/libsass.so' : /home/enebo/Applications/jdks/zing-jdk11.0.0-19.12.101.0-2/lib/server/../../etc/libc++/libstdc++.so.6: version `GLIBCXX_3.4.26' not found (required by /home/enebo/work/jruby/lib/ruby/gems/shared/gems/sassc-2.2.1/ext/libsass.so)

19:27 <enebo[m]> yay

19:28 <enebo[m]> what in the actual hell is up with that

19:31 <enebo[m]> ok experiment is done

19:32 <enebo[m]> I believe I did not originally see this because bindex was complaining but I did not see that in the copious amount of output I happened to be generating with zing flags

19:33 <enebo[m]> If I get this error it is that zing compiled their JVM with a different glibcxx than is on my machine and dynloading that is a bad scene

19:33 <enebo[m]> that == sassc

19:37 <headius[m]> you got the Java 11 one?

19:37 <enebo[m]> yeah

19:37 <headius[m]> ah

19:38 <headius[m]> did you test anything but rails?

19:38 <headius[m]> that looks like it's having trouble with its own glibc

19:38 <headius[m]> oh no I see

19:39 <headius[m]> sassc is compiling something that uses libstdc++ and so when it tries to load it fails because zing has its own preferred libstdc++

19:39 <enebo[m]> no. I mean I could try something simpler but I am testing HEAD atm

19:39 <enebo[m]> I will circle back and try -e 1 or something

19:39 <enebo[m]> well I think it is that was compiled with a different version than I have and it expect .so loads to use same

19:39 <enebo[m]> yeah

19:39 <headius[m]> yeah

19:39 <headius[m]> set your LD_LIBRARY_PATH to include zing's libstdc++

19:39 <headius[m]> probably have to rebuild sassc

19:40 <headius[m]> or just test something that doesn't use sassc :-)

19:40 <enebo[m]> I can try -e1 perhaps first

19:40 <enebo[m]> just to see if it hangs or not

19:41 <headius[m]> so you never got past the hanging on exit?

19:42 <headius[m]> I mentioned it to Philip Reames from Azul on twitter

19:42 <headius[m]> he's the one who poke me about trying Zing

19:44 <enebo[m]> ok was rerunning with current HEAD on non-zing

19:44 <headius[m]> I should start a bytecode dump of new AOT

19:44 <enebo[m]> I do not likely see any difference...it looks very much within the realm of noise of a difference

19:45 <headius[m]> bytecode trace I mean

19:45 <enebo[m]> I was thinking we should put line number on staticscope

19:45 <enebo[m]> it would get rid of a bunch of ldcs

19:46 <enebo[m]> which I am sure are not a massive amount of code but if this is all running bytecode interp then perhaps it adds up

19:47 <headius[m]> yes

19:47 <headius[m]> oh well versus ldc, it's probably more expensive

19:47 <headius[m]> but it's more bytecode

19:48 <headius[m]> it wouldn't be more bytecode than a load scope + call method though

19:48 <headius[m]> ldc of a constant is just about as cheap as you can get except for the specialized iload_1 and such

19:49 <enebo[m]> It is just that it keeps pushing it down the stack into compiled method too

19:49 <enebo[m]> it probably never leaves the same register but still

19:49 <headius[m]> ah right I did just encounter that, had to add ldc to get the file and line into something

19:49 <enebo[m]> I guess CompiledIRMethod and AbstractIRMethod constructors do warm quickly to C1

19:49 <headius[m]> yeah file and line LDC could be eliminated if they're carried by a StaticScope

19:49 <headius[m]> +1

19:51 <headius[m]> ok finally going to do my lunch routine while this thing chews on bytecode trace

19:51 <headius[m]> we'll see if the new AOT files are cheaper than old

19:52 <headius[m]> did you try CDS with new AOT?

19:52 <headius[m]> if not that would be worth a quick check

19:53 <enebo[m]> I did not but I will try

20:00 <headius[m]> FYI I chatted with @cla4es from Oracle and he said CDS can't do much to help method handles, even constant ones like for lambdas, but jlink can

20:01 <headius[m]> so jlink is definitely another "future" thing we need to try

20:07 <enebo[m]> so CDS is 13.8s with --dev vs 14.[45] ish

20:07 <enebo[m]> that is with the 1400 classes

20:08 <enebo[m]> so --dev and classcast with all those classes is like 14.4s and with CDS of those it is 13.8 (also using --dev)

20:14 <enebo[m]> With 143 class version from my behavior gist time drops from 12.1s to 11.[56]s

20:14 <enebo[m]> so CDS is definitely helping but not massively so

20:15 <headius[m]> so with cds we're starting to see a gain

20:15 <enebo[m]> but considering a plain --dev is 14.4s or so and this is pretty good

20:16 <headius[m]> ok so clarify

20:16 <enebo[m]> interesting that with how things are we can probably sweet spot this a little bit by not running very much

20:16 <headius[m]> --dev no class cache + cds is what

20:16 <enebo[m]> 14.4s

20:16 <headius[m]> what's --dev no cache without cds?

20:17 <enebo[m]> sorry I never ran your firsto ne

20:17 <headius[m]> oh

20:17 <headius[m]> --dev + cds is the fastest scenario i have seen for most commands

20:17 <headius[m]> so I'm curious about that

20:17 <enebo[m]> ok I will give it awhirl

20:20 <enebo[m]> I am going to write a script to test all of these

20:20 <enebo[m]> and kill my browser when I run it

20:20 <headius[m]> probably sleep a bit between to try to reduce chance of throttling

20:20 <enebo[m]> yeah not a bad idea

20:20 <headius[m]> I'm going to try to figure out jlink since I'm sure that will come up

20:21 <headius[m]> we are a better module citizen now so it may be possible

20:21 <enebo[m]> yeah I think jlink has a second good possibility of eliminating some modules

20:21 <enebo[m]> not really sure how much is actually loaded or not already but ??

20:22 <headius[m]> heh ok

20:22 <headius[m]> $ jlink --module-path lib/jruby.jar --add-modules org.jruby.dist --output /tmp

20:22 <headius[m]> Error: automatic module cannot be used with jlink: org.jruby.dist from file:///Users/headius/projects/jruby/lib/jruby.jar

20:22 <headius[m]> maybenot

20:26 <headius[m]> hmm yeah ok so this may not be be feasible until everything is better modularized

20:26 <headius[m]> meaning all our upstream dependencies as well

21:23 <headius[m]> ok I have new -Xint bytecode count

21:23 <headius[m]> without cached classes it's 408M, before AOT changes with class cache it was 511M, and after changes it's 490

22:07 <headius[m]> enebo: ok getting late in the day so let's pow-wow about what else we need to do before leaving tomorrow

22:08 <headius[m]> I've given up on jlink for now...might be able to hack it but I'm not making it a priority

22:08 <headius[m]> I will look at your slides to see what we have so far

22:09 <headius[m]> I'm gathering some last bytecode counts and then going to pull in some startup stats for non-cached CDS, OpenJ9 etc

22:20 <headius[m]> enebo: good news on gem list with caching

22:22 <headius[m]> the cold bytecode count (JVM JIT on, so this will vary) yesterday was 52M bytecodes executed

22:23 <headius[m]> today it's 43.3M

22:23 <headius[m]> that's versus non-cached gem list at 47.5M

22:23 <headius[m]> so at least for this run it executed over 4M fewer cold bytecodes

22:31 <enebo[m]> ok