#jruby on 2018-06-20 — irc logs at freenode.irclog.whitequark.org

2018-05-24 16:34 ChanServ changed the topic of #jruby to: Get 9.2.0.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

00:09 Eiam has joined #jruby

00:10 <Eiam> Hi good folks in #jruby, a superset of the other good folks in #ruby. I am currently migrating a project from ruby to jruby and just trying to map my gems over and get things installed

00:10 <Eiam> currently, my understanding is ruby 2.5.1 is not yet supported, I'm locked to 2.5.0 with the 9.2.0.0 engine

00:11 <Eiam> is that accurate?

00:12 <Eiam> second, I'm trying to bring over my favorite debugger, pry. Locally I can "gem install pry-debugger-jruby" and it installs without issue, but when I include the same thing in my gemfile it fails ,"could not find pry-debugger-jruby" in rubygems repository or installed locally. which is a bit weird, since i just installed it. "gem list" clearly shows "pry-debugger-jruby (1.2.1 java)"

01:25 jrafanie has joined #jruby

03:41 machty has quit [Ping timeout: 256 seconds]

03:43 lopex has quit [Ping timeout: 245 seconds]

03:44 electrical_ has quit [Ping timeout: 256 seconds]

03:45 machty has joined #jruby

03:46 jrafanie has quit [Quit: Textual IRC Client: www.textualapp.com]

03:46 electrical_ has joined #jruby

03:46 lopex has joined #jruby

03:47 knowtheory_ has quit [Ping timeout: 256 seconds]

03:48 deathy has quit [Ping timeout: 276 seconds]

03:49 flavorjones has quit [Ping timeout: 260 seconds]

03:49 duncanm has quit [Ping timeout: 256 seconds]

03:49 Iambchop has quit [Ping timeout: 256 seconds]

03:49 Scorchin has quit [Ping timeout: 256 seconds]

03:49 snowp has quit [Ping timeout: 255 seconds]

03:49 fidothe_ has quit [Ping timeout: 276 seconds]

03:50 asarih has quit [Ping timeout: 260 seconds]

03:50 codefinger has quit [Ping timeout: 256 seconds]

03:51 atambo_ has quit [Ping timeout: 256 seconds]

03:51 electrical_ has quit [Ping timeout: 245 seconds]

03:51 qmx has quit [Ping timeout: 276 seconds]

03:51 lopex has quit [Ping timeout: 260 seconds]

03:51 machty has quit [Ping timeout: 260 seconds]

03:51 zph has quit [Ping timeout: 255 seconds]

03:51 phlebas has quit [Ping timeout: 255 seconds]

03:51 amitchellbullard has quit [Ping timeout: 255 seconds]

04:43 Eiam has quit [Ping timeout: 245 seconds]

05:22 deathy has joined #jruby

05:23 asarih has joined #jruby

05:24 fidothe_ has joined #jruby

05:24 qmx has joined #jruby

05:32 Iambchop_ has joined #jruby

05:32 zph has joined #jruby

05:53 codefinger has joined #jruby

05:53 atambo_ has joined #jruby

05:55 qmx has quit [Ping timeout: 276 seconds]

05:57 deathy has quit [Ping timeout: 256 seconds]

05:57 asarih has quit [Ping timeout: 276 seconds]

05:57 zph has quit [Ping timeout: 265 seconds]

05:57 atambo_ has quit [Ping timeout: 240 seconds]

05:57 fidothe_ has quit [Ping timeout: 276 seconds]

05:58 codefinger has quit [Ping timeout: 260 seconds]

05:58 Iambchop_ has quit [Ping timeout: 276 seconds]

05:59 <kares> Eiam: sounds like a Bundler issue (assuming you're to read the backlog)

06:11 machty has joined #jruby

06:12 <GitHub25> [jruby] kares closed issue #4759: unstable ambiguous overload resolution https://git.io/v5GEj

06:17 qmx has joined #jruby

06:20 fidothe_ has joined #jruby

06:20 amitchellbullard has joined #jruby

06:21 snowp has joined #jruby

06:21 knowtheory_ has joined #jruby

06:21 lopex has joined #jruby

06:21 flavorjones has joined #jruby

06:21 asarih has joined #jruby

06:21 electrical_ has joined #jruby

06:21 phlebas has joined #jruby

06:21 zph has joined #jruby

06:21 fidothe_ has joined #jruby

06:21 fidothe_ has quit [Changing host]

06:22 amitchellbullard has joined #jruby

06:22 amitchellbullard has quit [Changing host]

06:22 snowp has joined #jruby

06:22 snowp has quit [Changing host]

06:22 knowtheory_ has joined #jruby

06:22 knowtheory_ has quit [Changing host]

06:22 deathy has joined #jruby

06:25 atambo_ has joined #jruby

06:25 codefinger has joined #jruby

06:27 Iambchop_ has joined #jruby

06:54 <GitHub123> [jruby] kares pushed 5 new commits to master: https://git.io/fsweh

06:54 <GitHub123> jruby/master 0303b01 kares: [test] make sure we can run file test on MRI...

06:54 <GitHub123> jruby/master e65a3cd kares: [refactor] and split out a few expand-path internals

06:54 <GitHub123> jruby/master 536bff4 kares: [test] re-arrange JRuby's file/pathname tests

06:55 duncanm has joined #jruby

07:17 claudiuinberlin has joined #jruby

07:17 Scorchin has joined #jruby

07:28 olle has joined #jruby

07:28 olle has quit [Client Quit]

07:45 Puffball has quit [Quit: Puffball]

08:07 travis-ci has joined #jruby

08:07 <travis-ci> kares/warbler (master:97da1d0 by kares): The build has errored. (https://travis-ci.org/kares/warbler/builds/394441221)

08:07 travis-ci has left #jruby [#jruby]

08:21 <GitHub198> [jruby] kares closed issue #4334: improve Ruby's Set performance https://git.io/v1JtJ

08:24 <GitHub199> [jruby] kares closed issue #4245: Bad to_s output for extremely small floats https://git.io/vPxMR

08:26 rawra has quit [Remote host closed the connection]

08:29 <GitHub174> [jruby] kares closed issue #4668: Multi-byte method name is not compatible with CRuby https://git.io/vH5oC

08:33 <GitHub157> [jruby] kares closed issue #4580: Define method with GBK name by using 'def' has wrong symbol name https://git.io/v9CFh

08:34 travis-ci has joined #jruby

08:34 <travis-ci> kares/warbler (master:0ee9978 by kares): The build failed. (https://travis-ci.org/kares/warbler/builds/394448224)

08:34 travis-ci has left #jruby [#jruby]

08:35 rawra has joined #jruby

08:40 rawra has quit [Ping timeout: 256 seconds]

09:46 claudiuinberlin has quit [Ping timeout: 265 seconds]

10:11 Puffball has joined #jruby

11:20 drbobbeaty has joined #jruby

11:25 jrafanie has joined #jruby

11:26 drbobbeaty has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

11:38 drbobbeaty has joined #jruby

11:42 jrafanie has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

12:00 <GitHub196> [jruby] kares created test-readline-1.3.0 (+1 new commit): https://git.io/fG2iv

12:00 <GitHub196> jruby/test-readline-1.3.0 f860fb2 kares: let's update jruby-readling to 1.3.0...

12:21 <GitHub161> [jruby] elsabio opened issue #5223: Illegal reflective access by org.jruby.util.ShellLauncher https://git.io/fG2X4

12:42 drbobbeaty has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

12:48 rawra has joined #jruby

12:49 drbobbeaty has joined #jruby

12:49 drbobbeaty has quit [Client Quit]

12:55 rawra has quit [Ping timeout: 248 seconds]

13:25 <kares> headius: you got access to the json repo? it seems not to be fixing issues

13:26 <kares> or enebo ^^

14:02 <enebo> kares: I don't think I do the florian one?

14:05 <enebo> kares: https://github.com/flori/json/issues/284

14:05 <enebo> not sure what to think...I see the guy has been committing to other stuff but he seems pretty awol on this project

14:05 <enebo> no commit in 8 months

14:19 jrafanie has joined #jruby

14:43 <kares> enebo: yep that one

14:43 <kares> adding myself to the comments

14:43 <kares> thought Charlie might have rights since he has rubygems push right

14:44 <enebo> kares: it is probable that he does have commit rights. he is on vacation this week

14:45 <kares> okay we'll see if I do not forget to ask him next week or so :)

15:10 rawra has joined #jruby

15:15 rawra has quit [Ping timeout: 276 seconds]

15:15 rawra has joined #jruby

15:20 rawra has quit [Ping timeout: 256 seconds]

16:01 xardion has quit [Remote host closed the connection]

16:07 xardion has joined #jruby

16:38 claudiuinberlin has joined #jruby

17:31 claudiuinberlin has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

17:35 havenwood has quit [Quit: ZNC 1.7.0 - https://znc.in]

17:36 havenwood has joined #jruby

17:36 havenwood has quit [Changing host]

17:36 havenwood has joined #jruby

17:48 init has joined #jruby

17:49 init has left #jruby [#jruby]

17:52 Eiam has joined #jruby

18:24 claudiuinberlin has joined #jruby

18:38 subbu is now known as subbu|lunch

19:17 rawra has joined #jruby

19:21 rawra has quit [Ping timeout: 256 seconds]

19:37 claudiuinberlin has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

19:48 <lopex> enebo: https://code.facebook.com/posts/605721433136474/accelerate-large-scale-applications-with-bolt/

19:50 <enebo> lopex: neat

19:51 <lopex> enebo: to what extent do they have to go for having php codebase

19:54 <enebo> lopex: I guess

19:55 <enebo> lopex: run it on ruby executable!

19:55 <enebo> lopex: but ruby loads the universe so this instr starvation no doubt is not a big part of startup

19:55 <lopex> enebo: funcall's would kill it

19:56 <enebo> lopex: it will be fun to use a tool like that and see how various things do...does rust via LLVM generation get anything?

19:56 <lopex> enebo: this and memory read stalls where primary causes intel introduced HT right ?

19:56 subbu|lunch is now known as subbu

19:57 <enebo> lopex: I don't know

19:57 <lopex> enebo: so if one thread stalled (or flushed) and waits for memory

19:58 <lopex> there might be a chance another one would resume and do anything :P

19:58 <enebo> lopex: well if you are reading pages between thread context switches no doubt packing that code into less pages can only be good

19:58 <enebo> oh you are talking about intel bugs

19:58 <lopex> enebo: also how it plays with caches

19:58 <enebo> I don't know

19:58 <lopex> no

19:58 <lopex> just HT

19:59 <enebo> I never read about how hyperthreading actually works

19:59 <enebo> fwiw I have never been very interested as I almost never see it work out for me

19:59 <lopex> enebo: I recall from what Click said

19:59 <enebo> ok

20:00 <lopex> or it was one of it's uses

20:00 <lopex> stalls

20:00 <enebo> so this release is given to LLVM

20:00 <enebo> that is pretty nice

20:00 <lopex> enebo: but for the most part he complained about lack of good profiling tools

20:00 <lopex> hardware counters etc

20:00 <lopex> you just dont know how much your code is stalling

20:01 <enebo> I remember his page fault talk and modern hardware talk but it was a long time ago that I saw it

20:01 <enebo> cache misses is the death of continuing perf improvements

20:02 <lopex> yeah, same on his last talk

20:02 <lopex> and no good way to measure it

20:02 <enebo> lopex: I guess he have revised and modernized the talk a bit in the last year or two

20:02 <lopex> yeah, I'm talking about this one

20:02 <lopex> but stalls/misses still dominant

20:03 <enebo> I saw it like 7-8 years ago so I just remember very broad strokes :)

20:03 <lopex> ram is new hard disk

20:03 <lopex> of not for ram thins would go 10x faster

20:03 <lopex> up to 10x

20:04 <lopex> enebo: https://www.youtube.com/watch?v=OFgxAFdxYAQ

20:04 <lopex> and then "jvm does that?" talk

20:05 <lopex> enebo: so I'm wondering about this open addressing hash

20:05 <lopex> enebo: if you allocate a bunch of buckets in a row

20:06 <lopex> they should still be in the same cache lines right ?

20:06 <lopex> the bucket introduce indirections sure

20:07 <enebo> well memory locality of n objects linked by references is almost always going to be worse than just relying on a primitive array

20:07 <lopex> yeah

20:07 <enebo> I fully expect to see improved performance in those benchmarks with the last round of changes

20:08 <lopex> sure

20:08 <lopex> enebo: but you should see better diff when making puts interleaved by other allocations

20:08 <enebo> GC handbook talks about some collectors relocating on locality of object references but I don't think any JVM collectors do it.

20:08 <lopex> so the buckets are more scattered

20:08 <enebo> The handbook made it sounds very difficult

20:09 <lopex> and not that indirections are dominant

20:09 <enebo> lopex: yeah in real life we are not hashing the same thing over and over in a loop so it should only improve

20:09 <enebo> I guess I don't know how much it matters in Java though

20:09 <lopex> enebo: oh, how many locality metrics are there ?

20:09 <enebo> After all we have objects all over the place all the time

20:10 <lopex> I guess more than one

20:10 <enebo> so having good locality is good but it may not be visible in the sea of cache misses

20:10 <enebo> I have no idea though

20:10 <lopex> so we go back to profiling tools

20:10 <enebo> yep

20:11 <enebo> the notion we are able to control this very much at the level we are at probably is wishful thinking but I do expect that primitive array version to microbench faster

20:11 <lopex> enebo: and funny thing is that all is not for von neuman fault

20:12 <lopex> just ram

20:12 <lopex> and that fast flip flop takes up to 6 transistors

20:12 <lopex> and ram bit is just one transistor and a capacitor

20:13 <lopex> and that cap is responsible for all that

20:15 <lopex> enebo: ah, I posted this for headius once: https://www.youtube.com/watch?v=8mFl8fywIP4&t=15m45s

20:16 <enebo> lopex: thanks...afk a little bit

20:39 claudiuinberlin has joined #jruby

21:15 claudiuinberlin has quit [Quit: Textual IRC Client: www.textualapp.com]

21:22 <ChrisBr> enebo: I added some numbers to the PR

21:23 <ChrisBr> no idea how to calculate the memory you said

21:24 <ChrisBr> surprisingly without the HashEntry objects it seems to be slower for hashes > 1000 elements :(

21:26 <lopex> ChrisBr: hiya

21:27 <lopex> ChrisBr: might be collisions ?

21:27 <lopex> in open addressing you usually do more work but gain on cache locality

21:28 <enebo> ChrisBr: ok I will check them out

21:28 <lopex> ultimately you still have the load factor too

21:30 <lopex> ChrisBr: also take a look at https://en.wikipedia.org/wiki/Cuckoo_hashing

21:30 <ChrisBr> yeah right! But we have a max load factor of 50%

21:30 <enebo> lopex: how much is object record sizewise: I remember 23 bytes?

21:30 <enebo> or words

21:30 <lopex> enebo: hmm,

21:31 <lopex> class is a pointer

21:31 <lopex> hash is always 32 bit

21:31 <lopex> and bit mask integer

21:32 <lopex> enebo: depends how you count

21:33 <enebo> there is also compressed storage now too for fields

21:33 <enebo> in this case though that would not apply

21:34 <lopex> so the ref itself, class, 32 bit hashcode, and an int for metadata

21:34 <enebo> ChrisBr: Ignoring growing array you can just try different linear sizes and see when it is better

21:34 <lopex> enebo: not sure how packed they are wrt alignement

21:35 <lopex> enebo: and a size for arrays

21:35 <enebo> lopex: but for this bucket we basically are referencing object and using a long

21:35 <enebo> so those will just align

21:35 <lopex> enebo: and next, prev

21:35 <lopex> and key

21:36 <lopex> and value

21:36 <enebo> lopex: yeah but all aligned

21:36 <lopex> if you're talking about a bucket

21:36 <enebo> yeah linked list is significant

21:36 <ChrisBr> enebo: so you mean trying with smaller inital size?

21:36 <lopex> yeah, words are always aligned

21:36 <lopex> in java

21:37 <enebo> ChrisBr: just saying 15 seems like it is clearly slower at linear

21:37 <enebo> ChrisBr: half makes me wonder if MRI even picked a reasonable size

21:37 <enebo> 15 seems like a human just picked it

21:37 <enebo> I guess it is 1 less than 16

21:37 <lopex> enebo: afaik mri is insane and it once had 2x

21:38 <lopex> enebo: so guaranteed collistions

21:38 <lopex> not sure what it does now

21:38 <lopex> ChrisBr: java is 0.75

21:39 <enebo> you know I did not really ever look at this PR

21:39 <ChrisBr> enebo: size it 16 :) did I write 15 ? :/

21:40 <enebo> so the RubyHashEntry is still there so my comment about removing it makes no sense since it still exists

21:40 <lopex> ChrisBr: do you also use those MRI magic exp numbers etc ?

21:40 <enebo> ChrisBr: oh well I naively assumed testing 15 and 16 was showing boundary difference unless 16 is 0 elements to 15?

21:41 <enebo> I guess maybe I should not talk too much right after a run though

21:41 <enebo> you probably do mean 0-15 elements is linear where 0 is pretty much no search but still linear

21:42 <ChrisBr> right, so I test 16 elements (0-15 - liner search) and then I test 16 elements (hashing)

21:42 <enebo> I guess I would try 8

21:42 <ChrisBr> yeah, right

21:42 <enebo> and maybe even 4 but just to see

21:42 <ChrisBr> lopex: magic exp numbers?

21:42 <ChrisBr> like the 16 for linear search?

21:44 <lopex> ChrisBr: oh I mean the "feature" arrays

21:45 <enebo> this version allocates a rubyhashentry every time you search?

21:47 <ChrisBr> for put not, for get I create a temporary RubyHashEntry to return it

21:47 <ChrisBr> otherwise I would need to change the "interface" everywhere

21:48 <ChrisBr> but I only store the RubyObjects and not the RubyHashEntry's

21:48 <enebo> ChrisBr: yeah I see. I bet that is showing up in the get bench. This version is trading search locality with an allocation

21:48 <ChrisBr> so it is "just" a temporary object, but sure we would also need to get rid of it at some point

21:49 <enebo> ChrisBr: a benchmark play thing would be to reuse a single instance to see how much it changes get performance

21:49 <enebo> ChrisBr: that might make us consider whether we want to change the API or not

21:49 <ChrisBr> single instance you mean retrieving always the same key?

21:50 <lopex> just using the same entry as a singleton

21:50 <enebo> no I mean allocation a since RubyHashEntry and basically just set the values into that single instance

21:50 <lopex> for the benchmark

21:50 <ChrisBr> ah right, ok

21:50 <ChrisBr> I can try that

21:50 <enebo> that may have been a garbled sentence :P

21:50 <enebo> but so long as you could parse it

21:51 <lopex> enebo: actually what interface sohuld be changed ?

21:51 <ChrisBr> lopex: interface was maybe the wrong word

21:51 <ChrisBr> method signature

21:51 <lopex> ChrisBr: then do it :P

21:51 <ChrisBr> this is still "quick & dirty" ;)

21:51 <enebo> lopex: I don't know but if that allocation is big part of get perf then maybe we consider changing API

21:52 <enebo> ChrisBr: this is just an experiment :)

21:52 <enebo> an evil one

21:52 <lopex> ChrisBr: the entry thing should not be transparent for internal jruby api

21:52 <lopex> ChrisBr: we could construct it for JI though

21:53 <lopex> enebo: seems fair enough ?

21:53 <enebo> lopex: not sure I got that. I thought you meant since it is internal we can change it

21:53 <lopex> enebo: yes, but java allow to iterate over entries afaik ?

21:54 <enebo> lopex: oh I see

21:54 <enebo> lopex: ah yeah not sure how iteration would work but I am not sure how much we care if it is fast or slow in JI case

21:54 <lopex> enebo: yeah it's not a concern for us wrt jruby api

21:54 <enebo> lopex: I guess internally we can iterate over it in a less boxed way

21:55 <enebo> lopex: but for implementing Map I guess I don't know

21:55 <lopex> enebo: java's Map.Entry and entrySet

21:55 <enebo> lopex: but don't we already box into some Entry object Java wants in that case?

21:56 <enebo> so RubyHashEntry is not really important in that case is it?

21:56 <lopex> enebo: yeah, at very least we reconstruct the thing

21:56 <lopex> enebo: yes

21:56 <enebo> lopex: but don't we already or does RubyHashEntry implement Entry?

21:56 <lopex> enebo: I mean agreed

21:56 <enebo> lopex: ok

21:56 <lopex> enebo: yeah, good catch, and probably I made it so

21:57 <enebo> lopex: so that is a concern

21:57 <lopex> enebo: we're old

21:57 <lopex> enebo: why ?

21:57 <lopex> enebo: I mean only for JI

21:58 <enebo> well maybe not too bad a concern but now enumeration from Java needs to allocate no instances

21:58 <enebo> with this change it will get slower

21:58 <enebo> internally so long as we don't use that then it won't affect "ruby"

21:58 <lopex> enebo: depends on the usage yes

21:58 jrafanie has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

21:58 <enebo> but anyone in JI land who passes it into java will see it slow down. So by concern i mean we should see how bad that is

21:58 <enebo> It might not be bad enough to care

21:59 <lopex> enebo: unless it's mitigated by the gains

21:59 <enebo> true

21:59 <enebo> I can see fetch is quite a bit faster with the new version

21:59 <lopex> enebo: hmm, good call, it might be the reason java havent introduced open addressing

21:59 <lopex> right ?

21:59 <enebo> no hmm

22:00 <enebo> it is slower on fetch

22:00 <enebo> I read it backwards

22:00 <enebo> so I bet allocation is the difference

22:00 <lopex> definitely

22:01 <lopex> enebo: lolz, actually java api's usage might be the reason java will not introduce open addressing

22:01 <enebo> lopex: yeah possibly

22:01 <lopex> and it will be the exposure of the entry type

22:01 <enebo> Java is a big ship with a lot of cargo

22:02 <lopex> enebo: just like the hashcode

22:02 <lopex> enebo: and tonn of momentum

22:02 <lopex> a ton

22:02 <enebo> I wonder how well graal deals with this microbench

22:02 <lopex> enebo: you mean excape analysis ?

22:02 <enebo> yes

22:03 <lopex> enebo: I sort of though hotspot would make it work

22:03 <lopex> thought

22:03 <enebo> In case of temp alloc instance

22:03 <enebo> yeah it seems very limited in use

22:03 <enebo> shit is ripped out from method which makes it and it is read only

22:03 <enebo> Even if graal kills it with perf I am not sure we can just roll with it or not

22:03 <ChrisBr> enebo: lopex: for get I also do not use the hash anymore because I don't store it anymore! Do you think it makes a difference to still calculate the hash and compare instead of the values?

22:03 <lopex> enebo: I dont know, print inlining might give some insight

22:04 <enebo> ChrisBr: maybe. It is a difference

22:04 <lopex> ChrisBr: I think you cannot rely on hash perf being cheap

22:05 <lopex> er

22:05 <enebo> some objects do deep hash calcs right?

22:05 <lopex> yes

22:05 <enebo> like struct?

22:05 <lopex> but wait

22:05 <lopex> I'm confused

22:05 <lopex> you get hash, find a bucket and when do you use it the second time ?

22:06 <lopex> same for open addressing right ?

22:06 <lopex> enebo: what you do is equals then

22:07 <ChrisBr> so for the key I want to insert / fetch I need to calc hash and bucket, yes

22:08 <ChrisBr> but for everything what is already there, I don't have the hash anymore (because I only store the key & value object)

22:08 <ChrisBr> so I would need to calc the hash for every element (for linear search for every iteration)

22:10 <lopex> doh, it's for rehash

22:10 <lopex> enebo: ^^

22:10 <lopex> and just rehash ?

22:11 <enebo> caching hash?

22:11 <lopex> yes

22:11 <lopex> and copy

22:11 <lopex> seems right

22:11 <enebo> just to know whether it needs to move?

22:11 <enebo> it compares against fresh hashCode?

22:11 <lopex> for rehash ?

22:11 <enebo> yeah

22:11 <enebo> I am asking you

22:12 <enebo> Seems like RubyString could really cache hashcode and invalidate if contents change

22:12 <enebo> it recalcs every hashCode call

22:12 <ChrisBr> before we stored the hash inside the RubyHashEntry

22:12 <enebo> unrelated

22:13 <ChrisBr> then for fetching, putting, deleting etc we compared the hash and then equals

22:13 <lopex> enebo: no, just to copy the hashes

22:13 <enebo> ChrisBr: lopex I thought that was just used for rehash

22:13 <ChrisBr> because we already had the hash for the new elements and existing ones so it is cheaper to compare than the equals

22:13 <lopex> since rehash is a form of copy :P

22:14 <ChrisBr> now we don't store them anymore so we would need to calculate the hash on the fly

22:14 <enebo> ChrisBr: so you might want to use a String as part of bench in addition to Fixnum since that recalcs hashCode all the time

22:14 <lopex> enebo: aaaah

22:14 <enebo> the fixnum one is pretty much free

22:14 <lopex> enebo: it's used for fast skip on bucket search

22:14 <ChrisBr> right

22:14 <lopex> er, it doesnt make sense

22:14 <lopex> wait

22:15 <ChrisBr> that is the question: is it still fast skip if we need to calc the hash on the fly or do we just do an equals

22:18 <lopex> yeah, actually when bucket(hash1) != bucket(hash2) and hash1 != hash2

22:18 <lopex> bucket loosed information yes

22:18 <lopex> jeeze it was so long ago

22:19 <lopex> well, it's a fast skip after all

22:20 <lopex> ChrisBr: yeah, since hash for ruby potentially calls dozens of ruby objects by method call

22:21 <lopex> ChrisBr: or

22:21 <enebo> I have to go to dinner now but hopefully you two will figure this out I guess removing the hash comparison from internalKeyExist makes me wonder what happens if key contents change

22:22 <lopex> ChrisBr: if we minimize the collisions by increasing load factor we can get rid of it

22:22 <lopex> enebo: ^^

22:22 <lopex> enebo: and thar what mri seems to be doing

22:22 <enebo> although that question makes me think finding the same object if object state exists is bizarre in first place

22:22 <lopex> enebo: ^^

22:22 <lopex> enebo: makes sens ?

22:22 <lopex> *sense

22:23 <enebo> lopex: higher factor means more buckets so less items likely to be in same one

22:23 <enebo> lopex: more buckets compared to number of entries in hash

22:24 <enebo> that was awkward way of saying that

22:24 <enebo> I mean relationship as more entries are added more buckets would be added compared to using a lower load factor

22:24 <enebo> so less collision chance

22:24 <ChrisBr> enebo: this is what you meant, right? https://github.com/jruby/jruby/pull/5215/commits/a0afbdc372543b2cec081ea96b856dbf3bb0424d ?

22:24 <enebo> but any collision will require some eql

22:25 <ChrisBr> my benchmark is kinda unreliable, sometimes it is faster but sometimes not :(

22:25 <enebo> heh

22:25 <enebo> ChrisBr: I did mean that yeah

22:25 <lopex> enebo: yes, but that's what mri does doesnt it ?

22:25 <enebo> lopex: I don't know

22:25 <enebo> lopex: could be

22:25 <lopex> enebo: I bet

22:26 <enebo> It makes sense with less collisions hashing would be faster

22:26 <lopex> enebo: caching hash greatly simplifies that for us

22:26 <lopex> enebo: *not caching

22:26 <enebo> eql can be very expensive

22:26 <lopex> doh

22:26 <lopex> enebo: yes, but it seems to be mitigated by thata from mris perspective

22:26 <enebo> sure

22:27 <enebo> makes sense as a theory but I cannot tell you why they did it :)

22:27 <ChrisBr> so the load factor should be at max 50% as bin array is always two times bigger than the entries array ...

22:27 <enebo> seems like a good theory though

22:27 <lopex> enebo: also in c they could be using a scheme like (key, hash, index) as entries in an arrays itself

22:27 <lopex> enebo: we cant

22:28 <lopex> enebo: right ?

22:28 <ChrisBr> anyway, sleeping time in Germany! cu tomorrow & thanks for the help

22:28 <lopex> enebo: we cant use Unsafe here

22:28 <enebo> no we could have int[] hash, index and Object for key, value

22:28 <enebo> but two arrays

22:28 <enebo> ChrisBr: thanks talk to you later

22:28 <lopex> ChrisBr: seeya, we'll be waiting for you

22:28 <enebo> lopex: gotta go too now

22:29 <lopex> enebo: ok, so, next time

22:29 <ChrisBr> we already have now two arrays! One for the buckets and the other one with the key & values

22:29 <ChrisBr> or do you mean to also store the hash in one of the array?

22:29 <lopex> yes, canonical open addressing

22:30 <ChrisBr> anyway, cu tomorrow ;)

22:30 <lopex> ChrisBr: for now the theory is not I guess

22:30 <lopex> ChrisBr: since it depends on hashing density (high density == lots of eql calls) and vice versa