#jruby on 2019-10-31 — irc logs at freenode.irclog.whitequark.org

2019-08-12 18:53 ChanServ changed the topic of #jruby to: Get 9.2.8.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

00:07 <headius[m]> Nice! Which one?

00:08 <lopex> no idea what was wrong about it

00:08 <lopex> but it had to do something with cookies and random

00:18 KarolBucekGitter has quit [*.net *.split]

00:18 JulesIvanicGitte has quit [*.net *.split]

00:18 MattPattersonGit has quit [*.net *.split]

00:20 MattPattersonGit has joined #jruby

00:20 KarolBucekGitter has joined #jruby

00:20 JulesIvanicGitte has joined #jruby

00:22 nirvdrum has joined #jruby

01:25 nirvdrum has quit [Ping timeout: 240 seconds]

04:37 _whitelogger has joined #jruby

05:20 den_d has quit [Excess Flood]

05:20 den_d has joined #jruby

05:46 _whitelogger has joined #jruby

06:00 drbobbeaty has quit [Ping timeout: 264 seconds]

06:11 drbobbeaty has joined #jruby

08:46 _whitelogger has joined #jruby

10:51 <fidothe> Working on https://github.com/jruby/jruby/issues/5905. Figured out that RubyMatchData already copes with the put-a-string-pattern-in-it-and-lazily-create-a-regexp case. Hurrah. When I put a multibyte-char string in it, I get the bytes dumped out in the result from `#inspect`, instead of seeing the UTF-8 char I expect. This works fine with the regexp-version, using the same input string. Is there something about

10:51 <fidothe> `RubyString.strDup(runtime)` that needs me to do something extra about encodings?

10:52 <fidothe> If I call `#string` on the `MatchData` instance then I get the UTF-8 encoded string I expect

10:54 <fidothe> so that seems like it's not the culprit

11:03 <fidothe> Okay, so you need to pass the whole string in as the `str` param when creating a MatchData. Makes sense, should have twigged from the way the Regexp-using version invokes it...

11:13 rusk has quit [Ping timeout: 250 seconds]

11:19 rusk has joined #jruby

11:49 nirvdrum has joined #jruby

12:14 <headius[m]> Yeah that sounds right

12:31 drbobbeaty has quit [Ping timeout: 245 seconds]

12:38 shellac has joined #jruby

12:46 drbobbeaty has joined #jruby

13:27 sagax has joined #jruby

13:35 <fidothe> Now I'm down to failing tests

13:36 <fidothe> What's the origin of the tests in `test/mri/ruby`? `test_string.rb` is failing with some stuff that looks (at first glance) unrelated, and some is definitely related.

13:37 <fidothe> https://travis-ci.com/fidothe/jruby/jobs/251444032#L4676 for example

13:40 <fidothe> That looks like we need to use `RubyRegexp.regsub` or equivalent

13:40 <fidothe> i.e. we should be expanding `\0`

13:40 <fidothe> hrm

13:41 <fidothe> if that's done there's basically no way out of creating the Regex

13:43 metafr[m] has joined #jruby

13:44 <fidothe> Working on the assumption that the Rubydocs are comprehensive, looks like you could get away with special casing on the presence of \0 - \9

13:45 <fidothe> and subbing in the match for `\0` and empty string for `\1 `- `\9`

13:45 <fidothe> I guess I need to dive into the MRI source on this

13:46 <fidothe> more coffee

13:46 <metafr[m]> Hi Guys, sorry, newbie question : in the JRuby 9.1.17.0 release notes it is said that this JRuby version is compatible with ruby 2.x. But what version of ruby JRuby 9.1.17.0 is using by default ? Is there a way to configure it in a RoR application?

13:51 <fidothe> @metafr[m] JRuby 9.1.17.0 is compatible with Ruby 2.3

13:53 <fidothe> You can't pick versions like you could in JRuby 1.7 (where the choice was MRI 1.8 or 1.9)

13:54 <enebo[m]> fidothe: those test are internal test suite of C Ruby itself

13:55 <fidothe> @enebo[m] thanks. I can resolve the references to bugs now :-)

13:56 <enebo[m]> looks like largely 2 issues to work through glancing at that output

13:56 <fidothe> @enebo[m] Yeah. The encoding one looks nasty

13:56 <enebo[m]> you mean the \0 issue?

13:56 <fidothe> the `\0` expansion is straightforward but annoying

13:57 <fidothe> @enebo[m] no, this one: https://travis-ci.com/fidothe/jruby/jobs/251444034#L4682

13:57 <fidothe> <"あああ!foo!"> expected but was

13:57 <fidothe> <"あああ!foo/">.

13:57 <metafr[m]> fidothe: Thanks for your answer.

13:58 <enebo[m]> yeah I can see this...so last byte is not found due to probably walking wrong encoding

13:58 <enebo[m]> or something with how it is walked

13:59 <fidothe> @metafr[m] hope it was useful. If you need features from Ruby 2.4 or later, JRuby 9.2.8.0 supports 2.5

13:59 <fidothe> @enebo[m] i assumed it was something about multi-byte length calculations

14:00 <fidothe> This has a been a fun introduction to Java programming :-)

14:00 <enebo[m]> fidothe: oh yeah it likely is either wrong encoding assumed or perhaps wrong string helper method like preciseMBCLength vs something else

14:01 <enebo[m]> fidothe: oh cool. Java itself I hope is not a big barrier. We are not really doing too many complicated things in Java

14:01 <enebo[m]> fidothe: you using an IDE?

14:01 <enebo[m]> fidothe: if you are working on this now I can pull your fork and see if I notice anything

14:02 <enebo[m]> fidothe: assuming you want any help. Sometimes it is fun to work through it on your own :)

14:02 <enebo[m]> fidothe: also anything involving encodings or joni/jcodings and lopex is around too

14:02 <fidothe> @enebo[m] IntelliJ IDEA. I've been meaning to learn how to write Java, and not just read it, for ages, and this seemed like a good way in

14:04 <fidothe> @enebo[m] I'll ping you or @lopex if I have questions. I'm still getting used to holding the different string models - RubyString, byte[], java string, and how they interact in my head

14:04 <enebo[m]> fidothe: yeah I always say we are usually pretty easy since you get to isolate to making a few methods and you can usually look at similar ones in the same core type to figure which methods to use

14:04 <enebo[m]> fidothe: and in fact your first stab is a little more complicated because m17n is its own domain

14:05 <enebo[m]> yeah in this case you should be extracting bytelist to get a byte[] mostly

14:06 <fidothe> Fortunately I understand that bit pretty well, at least as far as Unicode and how it works in encodings and how that stuff works in normal Ruby

14:06 <enebo[m]> biggest problem tends to be forgetting begin index (which usually gets missed for about a month since most strings will have begin = 0)

14:06 <enebo[m]> fidothe: cool

14:13 <metafr[m]> > @metafr[m] JRuby 9.1.17.0 is compatible with Ruby 2.3

14:13 <metafr[m]> @fidothe does it mean that 9.1.17.0 features the same vulnerabilities as the ones we can find in ruby 2.3 versions ( eg https://nvd.nist.gov/vuln/detail/CVE-2018-16395 )

14:15 <enebo[m]> metafr: It might or it might not. It is no longer supported so there will be no fixes but in this case we do not use openssl but our own implementation trying to emulate openssl so it probably does not apply to us (although you would need to try an exploit to see)

14:17 <enebo[m]> metafr: in most cases we tend to not have as many CVEs because Java does not have the memory safety issues that C does. In cases where we do can be issues like that one where possibly our impl does the same incorrect logic

14:17 <enebo[m]> (and I am not saying we are vulnerable to that ... I don't know if 9.1.x is)

14:18 <enebo[m]> metafr: I am assuming there is a reason you cannot contemplate 9.2.9.0 but 2.3 -> 2.5 is not too massive for incompatibilities...Largely the integer unificiation (Bignum/Fixnum => Integer)

14:22 <metafr[m]> @enebo thanks a lot for these explanations

14:23 <enebo[m]> metafr: yeah sorry it was not clearer that 9.1.x is 2.3.x

14:24 nirvdrum_ has joined #jruby

15:15 rusk has quit [Remote host closed the connection]

15:52 rusk has joined #jruby

16:03 xardion has quit [Remote host closed the connection]

16:03 xardion has joined #jruby

16:34 sagax has quit [Quit: Konversation terminated!]

16:34 sagax has joined #jruby

16:48 shellac has quit [Ping timeout: 240 seconds]

17:02 <enebo[m]> fidothe: I just did a quick run of that failing test. StringSupport.index offset parameter appears to be the character offset and not the byte offset.

17:02 <fidothe> Aha

17:03 <enebo[m]> As a secondary comment this method is somewhat innefficient in the sense it needs to re-walk from front of string every time up to the right place each call into index

17:03 <fidothe> So previous Multibyte Tests I did worked by accident

17:03 <enebo[m]> since in a mbc scenario it cannot just jump forward

17:03 <enebo[m]> yeah they worked because all chars were byte size of 1

17:03 <enebo[m]> anyways I am going to eat some lunch but I thought I would share that

17:04 <fidothe> No, I did some multibyte tests but the last replacement was never at the end of the string

17:05 <fidothe> I hype I’d use string support index because String#index uses it. Good starting point for further improvements

17:05 <fidothe> I thought I’d use

17:05 <fidothe> Blooming iOS

17:41 nirvdrum_ has quit [Ping timeout: 240 seconds]

17:42 nirvdrum has quit [Ping timeout: 268 seconds]

18:07 rusk has quit [Remote host closed the connection]

18:20 lucasb has joined #jruby

18:41 nirvdrum has joined #jruby

18:41 nirvdrum_ has joined #jruby

18:56 <enebo[m]> fidothe: maybe we can make an index which passes in a byte[] with begin index

19:00 <fidothe> First step, make it work. Benchmark suggests that even the crude version will be faster in a bunch of situations. I suspect it won’t be for gsub on a long string

19:10 <enebo[m]> fidothe: yeah an improved version of index will be pretty easy to plug in

19:10 <enebo[m]> fidothe: in fact at that point keep track of characters will be removed at that point too

19:11 <fidothe> The hard work there will be encoding stuff I guess.

19:11 <enebo[m]> as you have this written the new version of index may even fit better

19:11 <enebo[m]> since you are just passing in appropriate offset into byte[] for where next char starts

19:12 <enebo[m]> really the new version of index will just be like the old one but will remove the offset() call towards the front

19:13 <enebo[m]> I looked briefly and saw no one else use index in a repeated fashion so no opportunity to fix this in other things

19:29 <lopex> fidothe: do you also follow the usage of mri's need_backref ?

19:31 <lopex> doh it's more invasive than you thought

19:32 <lopex> enebo[m]: now it all revolves around rb_pat_search(VALUE pat, VALUE str, long pos, int set_backref_str)

19:33 <lopex> and they set set_backref_str as they wish from the callers

19:33 <lopex> and it's a boolean

19:33 <lopex> er, I meant, more invasive than I thought

19:34 <lopex> so scan is affected as well

19:36 <lopex> headius[m]: updated jcodings to unicode 12.1.0 shall we change the deps now ?

19:37 <headius[m]> Yeah might as well

19:37 <lopex> oh, will look at sonatype

19:37 <lopex> if it;s ready

19:42 Rumo[m] has left #jruby ["User left"]

19:46 <lopex> is there a faster way of checking the release than refreshing on https://repo1.maven.org/maven2/org/jruby/jcodings/jcodings/ ?

19:57 <headius[m]> Not that I know of 😕

19:58 <headius[m]> I have been pushing immediately as a PR so I don't have to sit there waiting. It fails at first but I can return to it later and merge

19:58 <lopex> oh, it's there now

19:58 <lopex> like 20 minutes

19:59 <lopex> ah, good hack

20:01 <lopex> ok, joni tests pass, but I dont think we need a release for that

20:04 <lopex> btw I'm using self hosted gitlab and they use 2.6.5p114

20:05 <lopex> so like, new mri adoption is quite quick

20:06 <lopex> headius[m]: btw have you seen https://gitlab.com/honeyryderchuck/httpx

20:06 <lopex> "JRuby's openssl is based on Bouncy Castle, which is massively outdated and still doesn't implement ALPN. So HTTP/2 over TLS/ALPN"

20:22 <headius[m]> Hmm

20:22 <headius[m]> I had not

20:26 <lopex> the readme is 1 year old so ..

20:28 <headius[m]> Yeah I'm really not looking forward to a new impl of openssl

20:31 <lopex> but https://www.bouncycastle.org/releasenotes.html

20:31 <lopex> alpn is ther

20:31 <lopex> e

20:32 <lopex> so they might have assumed some jruby dep version

20:32 <lopex> at that time

20:32 <headius[m]> Ahh well that would be good if we can just do another BC update

20:33 <headius[m]> Gonna have to clean up jossl codebase one of these days and try to use more JSSE everywhere we can

21:18 <lopex> but why use "massively outdated"

21:18 <lopex> I'd rather use ancient

21:19 <lopex> fidothe: you still there ?

21:26 <fidothe> lopex: not really...

21:28 <lopex> fidothe: oh, btw I made a mistake about problem apprehension, it was towards myself

21:28 nirvdrum has quit [Quit: Leaving]

21:33 <fidothe> lopex: no problem 👍

21:45 <lopex> headius[m]: btw I need that jruby docker image

21:46 <lopex> headius[m]: I'm a victim of my own laziness

21:50 lucasb has quit [Quit: Connection closed for inactivity]

21:54 nirvdrum_ has quit [Ping timeout: 265 seconds]

21:58 travis-ci has joined #jruby

21:58 <travis-ci> jruby/jruby (master:1ad6c29 by Marcin Mielzynski): The build is still failing. https://travis-ci.org/jruby/jruby/builds/605739364 [205 min 47 sec]

21:58 travis-ci has left #jruby [#jruby]