#jruby on 2021-01-14 — irc logs at freenode.irclog.whitequark.org

2020-12-10 18:57 ChanServ changed the topic of #jruby to: Get 9.2.14.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

00:12 ur5us has quit [Ping timeout: 264 seconds]

00:37 ur5us has joined #jruby

01:52 ur5us has quit [Ping timeout: 264 seconds]

02:08 ur5us has joined #jruby

04:18 ur5us has quit [Ping timeout: 272 seconds]

07:48 Antiarc has quit [Ping timeout: 240 seconds]

08:01 ur5us has joined #jruby

09:00 hopewise[m] has joined #jruby

09:00 <hopewise[m]> Hello every body, can some one help me here:

09:00 <hopewise[m]> https://stackoverflow.com/questions/65715964/no-implicit-conversion-of-javajavapackage-into-string

09:02 Antiarc has joined #jruby

09:04 <hopewise[m]> never mind, I just had to add a quotation

09:04 <hopewise[m]> * never mind, I just had to add a quotation around java

09:54 ur5us has quit [Ping timeout: 264 seconds]

15:06 <headius[m]> hopewise: ah yep that would do it

16:00 jbeyer_wgt[m] has quit [Quit: Idle for 30+ days]

17:21 <headius[m]> enebo: https://github.com/ruby/ruby/pull/2576

17:32 <headius[m]> so ko1 did the same thing I suggested yesterday and moved the core of monitor.rb to native to avoid the interrupt handling

17:32 <headius[m]> @dav

17:32 <headius[m]> daveg_lookout: funny thing

17:39 <daveg_lookout[m]> Doesn't that still have a race in #mon_synchronize since @mon_data.enter is outside the begin/rescue block?

17:39 <headius[m]> that is ok... you want the monitor/lock entry to be outside the begu

17:39 <headius[m]> oops

17:40 <headius[m]> you want it outside because if there is an error locking it you don't want to try to unlock it

17:40 <daveg_lookout[m]> ah, so if enter fails, you don't exit

17:40 <headius[m]> right

17:40 <headius[m]> new keyboard who dis

17:41 <headius[m]> but CRuby, like JRuby, will not check thread interrupts at all in native code unless done explicitly, so this avoids the interrupt issue altogether

17:41 <headius[m]> to me it is an admission that the handle_interrupt pattern was a bad way to fix that problem, but yolo

17:41 <headius[m]> in any case...

17:41 <headius[m]> https://github.com/jruby/jruby/pull/6534

17:51 <daveg_lookout[m]> Out of curiosity, how is the race I mentioned avoided? Is there no way to be interrupted before the begin block is invoked? I would've assumed you needed a entered flag, only set after mon_data.enter succeeded.

18:00 <headius[m]> https://twitter.com/jruby/status/1349778054353190912

18:00 <headius[m]> with the Ruby code (fixed) the interrupts have been disabled before entering the monitor

18:00 <headius[m]> in the native version there is no interrupt check happening

18:02 <headius[m]> FWIW the handle_interrupt feature was added specifically to address this: http://blog.headius.com/2008/02/ruby-threadraise-threadkill-timeoutrb.html

18:02 <headius[m]> perhaps a touch dramatic

18:03 <daveg_lookout[m]> One question on your change -- previously, SizedQueue.setup called defineClassUnder with runtime.getClass("Queue") and now it's passing in Object.class -- is that important? Not familiar with defineClassUnder...

18:04 <daveg_lookout[m]> ok, thanks for the explanation, trying to make sure i understand

18:04 <headius[m]> oops that is not right

18:04 <headius[m]> I hope that fails CI!

18:06 <headius[m]> yeah it should extend Queue not Object... copypasta bug

18:07 <daveg_lookout[m]> other than that, lgtm, not that I'm a fully qualified reviewer

18:08 <headius[m]> yeah thanks for that!

18:10 <headius[m]> oh good, it failed spectacularly

18:11 <headius[m]> fixed and repushed

18:44 subbu is now known as subbu|afk

18:51 subbu|afk is now known as subbu

19:16 subbu is now known as subbu|lunch

19:25 <headius[m]> enebo: so thoughts on that native monitor PR?

19:27 <enebo[m]> seems fine to me

19:27 <enebo[m]> hahah what is the last line you see?

19:27 <headius[m]> "seems fine to me"

19:28 <enebo[m]> ok...I keep seeing my last line as "so it may just be there is no magic combo"

19:28 <headius[m]> that's weird

19:28 <enebo[m]> It keeps moving to the bottom so it must think it is in the future

19:28 <enebo[m]> I wish it could have said "in bed" at least it would give mileage

19:29 <enebo[m]> I think I will reload this page :)

19:29 <headius[m]> there is the minor compat issue of Monitor no longer mixing in MonitorMixin but that relationship was weird before and this has already shipped in CRuby 2.7

19:29 <headius[m]> ko1 basically moved the guts of MonitorMixin to Monitor as native code and flipped the relationship between the two

19:29 <enebo[m]> is it gone?...hmm

19:29 <enebo[m]> yep

19:30 <enebo[m]> but that also means MRI will hit issues from that if there is an issue

19:30 <headius[m]> right

19:30 <enebo[m]> we will just see it in pre-2.7 release

19:30 <headius[m]> I based this on the 3.0 codebase so clearly they shipped it again

19:31 <enebo[m]> ok that gives some confidence that even if someone does notice the writing has been on the wall for more than a year

19:31 <headius[m]> right

19:31 <headius[m]> and nobody will notice anyway

19:31 <headius[m]> would have to be doing something like monitor.kind_of? MonitorMixin anyway because the new Monitor still exports the same API

19:32 <headius[m]> so I will merge it for 9.3 then

19:34 <enebo[m]> cool

19:53 travis-ci has joined #jruby

19:53 <travis-ci> jruby/jruby (master:778c5ae by Charles Oliver Nutter): The build was broken. https://travis-ci.com/jruby/jruby/builds/213212522 [210 min 30 sec]

19:53 travis-ci has left #jruby [#jruby]

20:02 ur5us has joined #jruby

20:02 <headius[m]> there's that weird failure again

20:03 <headius[m]> I wonder if there is a race in this spec

20:04 <headius[m]> aha I think there is

20:05 <headius[m]> enebo: look at this spec, io/close_spec.rb around line 57

20:05 <headius[m]> basically it is trying to wait until the thread is blocking on IO before closing the related stream, so that the thread gets interrupted

20:05 <headius[m]> but the race is determining that... we mark the thread as "stop" immediately before making the blocking read call

20:06 <headius[m]> but if this logic closes it after we mark "stop" but before we try to read the thread itself will raise the normal error

20:06 <headius[m]> not the "another thread" error

20:07 <headius[m]> we have had to tag other tests like this because it is not possible to know that the thread is actually blocking on the IO yet

20:08 <enebo[m]> so is the underlying real problem is stop means two things?

20:08 <enebo[m]> should the thread have a state like init

20:09 <enebo[m]> I realize I am not asking something we can just change

20:09 <headius[m]> so when I have looked into this in the past I considered using the JDK thread state, but the race is still basically the same

20:09 <headius[m]> essentially you can't mark the thread as stopped AND start the blocking call as a single atomic operation

20:10 <headius[m]> so inspecting the thread state to know if it is blocking on IO will always be racy

20:10 <enebo[m]> but what about my question

20:11 <enebo[m]> it is stopped before the read and after it is blocking?

20:11 <headius[m]> I guess the underlying problem is that stop means "I am going to do something that stops me"

20:11 <enebo[m]> Oh wait

20:11 <headius[m]> but you don't know whether it has started doing that thing

20:11 <enebo[m]> it stops the thread to start the read but that is the race internally and we can hit between

20:11 <headius[m]> right

20:12 <enebo[m]> without starting another thread we cannot stop after we start the read :)

20:12 <headius[m]> so depending on when this spec sees that the thread is stopped, it might close the IO early and not trigger the cross-thread close error

20:13 <headius[m]> this might even be a race on CRuby but they can get closer to making stop + read look atomic to other threads

20:13 <enebo[m]> yeah so we have two problems though. The first is the test but the second is how to do you even write this?

20:13 <headius[m]> but still they have to release the gvl immediately before the read, so there is a potential for it to happen

20:13 <headius[m]> my assessmen

20:14 <headius[m]> my assessment years ago when I looked into this is that it is not possible to write this

20:14 <enebo[m]> you could pass n times and make a very loose assumption ruby is slow enough that the read will actually start

20:14 <headius[m]> not reliably

20:14 <headius[m]> yeah that is the best you can do, maybe throw a delay in there

20:14 <enebo[m]> So count != 3 && conditions

20:14 <headius[m]> right

20:14 <headius[m]> I think I will file an issue for this

20:15 <headius[m]> TR has it tagged too fwiw but I believe they should have the same issue

20:15 <enebo[m]> yeah it makes sense

20:15 <enebo[m]> MRI might just end up working out but the local set and the read are not a transaction

20:16 <enebo[m]> so I could see that theoretically an internal change could end up causing MRI to show this behavior in the future

20:16 <headius[m]> yeah exactly

20:16 <enebo[m]> A change they would quickly undo :)

20:16 <headius[m]> even if release gvl was immediately before calling read there is a race

20:17 <headius[m]> because in that moment another thread could start running and see "stop" state

20:17 <enebo[m]> too bad there is not some read(start_marker: read_started, ...)

20:19 <enebo[m]> Ultimately though atomically writing a value will have some gap for the read call itself if we consider the status of the system call to be what we are trying to see

20:19 <headius[m]> regardless of what we do in the implementation there is still the fundamental problem that you eventually have to call read at the kernel level

20:19 <enebo[m]> yeah

20:19 <headius[m]> so any gap between when you set up state or call callbacks and that read will be a race

20:20 <enebo[m]> that was what I meant by system call

20:20 <headius[m]> this is why I figured there is actually no way to do this

20:20 <headius[m]> right

20:20 <headius[m]> ok

20:20 <enebo[m]> we can make the race a lot smaller though

20:20 <headius[m]> maybe this can be replaced with some spec that tries it in 100 threads and only requires that one of them have the message

20:20 <enebo[m]> heh

20:20 <enebo[m]> I mean it may work

20:21 <enebo[m]> I know people hate delays and those are not perfect either

20:21 <enebo[m]> another obvious fix to the test is to have the read run twice and work the first time

20:22 <enebo[m]> ignore that

20:23 <headius[m]> I would love to hear about a better way but I gave up looking for a solution last time I did this

20:24 <headius[m]> I hate to say never but there may never be a way to test this simply

20:24 <enebo[m]> I was going to say you record the first read and use that as part of the test which would also know going_to_read would have to have been set

20:24 <headius[m]> it just makes that first read another race condition unfortunately

20:24 <enebo[m]> but it is the same race

20:24 <headius[m]> yeah

20:24 <enebo[m]> just using an extra test :)

20:24 <headius[m]> 🤷‍♂️

20:24 <enebo[m]> yep

20:25 subbu|lunch is now known as subbu

20:33 <headius[m]> https://github.com/ruby/spec/issues/828

20:33 <headius[m]> I will tag it

21:04 ur5us has quit [Ping timeout: 264 seconds]

21:06 ur5us has joined #jruby

21:30 truths33ker[m] has quit [Ping timeout: 260 seconds]

21:30 kares[m] has quit [Ping timeout: 260 seconds]

21:30 liamwhiteGitter[ has quit [Ping timeout: 260 seconds]

21:30 enebo[m] has quit [Ping timeout: 260 seconds]

21:30 truths33ker[m] has joined #jruby

21:31 kares[m] has joined #jruby

21:31 enebo[m] has joined #jruby

21:31 liamwhiteGitter[ has joined #jruby