#pypy on 2017-11-24 — irc logs at freenode.irclog.whitequark.org

2017-10-17 08:34 antocuni changed the topic of #pypy to: PyPy, the flexible snake (IRC logs: https://botbot.me/freenode/pypy/ ) | use cffi for calling C | "PyPy: the Gradual Reduction of Magic (tm)"

00:00 jamesaxl has quit [Quit: WeeChat 1.9.1]

00:02 ArneBab has joined #pypy

00:23 <bbot2> Success: http://buildbot.pypy.org/builders/rpython-linux-x86-64/builds/21 [default]

00:25 slackyy has joined #pypy

00:30 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3529 [default]

00:33 <bbot2> Success: http://buildbot.pypy.org/builders/rpython-linux-x86-32/builds/14 [default]

00:36 antocuni has joined #pypy

00:50 <bbot2> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/5095 [default]

00:54 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3440 [default]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/739 [default]

01:00 tbodt has joined #pypy

01:05 slacky__ has joined #pypy

01:05 tbodt has quit [Read error: Connection reset by peer]

01:06 tbodt has joined #pypy

01:06 <bbot2> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/4343 [default]

01:08 slackyy has quit [Ping timeout: 240 seconds]

01:13 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raring/builds/1730

01:19 Garen has joined #pypy

01:24 yuyichao has quit [Ping timeout: 248 seconds]

01:30 antocuni has quit [Ping timeout: 248 seconds]

01:36 yuyichao has joined #pypy

01:40 ceridwen has quit [Ping timeout: 258 seconds]

01:41 marr has quit [Ping timeout: 260 seconds]

01:45 pilne has quit [Quit: Quitting!]

01:46 tbodt has quit [Read error: Connection reset by peer]

01:46 tbodt has joined #pypy

01:47 tbodt has quit [Read error: Connection reset by peer]

01:48 tbodt has joined #pypy

01:53 ceridwen has joined #pypy

02:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3441 [py3.5]

02:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/5096 [py3.5]

02:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6389 [py3.5]

02:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3530 [py3.5]

02:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5582 [py3.5]

02:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/4344 [py3.5]

02:18 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-linux-armhf-raspbian/builds/1606

02:18 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-v7/builds/1302

02:18 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-raspbian/builds/1527

02:33 jcea has quit [Quit: jcea]

02:35 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6389 [py3.5]

02:38 Rhy0lite has quit [Quit: Leaving]

02:40 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

02:46 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armel/builds/1979

02:57 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3530 [py3.5]

02:58 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5582 [py3.5]

03:07 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/5096 [py3.5]

03:23 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-linux-armel/builds/1866

03:24 <bbot2> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/739 [default]

03:26 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/4344 [py3.5]

03:37 ArneBab_ has joined #pypy

03:41 ArneBab has quit [Ping timeout: 248 seconds]

04:00 <bbot2> Started: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/3190

04:29 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3441 [py3.5]

04:30 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/1735

04:30 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1475

04:30 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-v7/builds/1312

04:36 _whitelogger has joined #pypy

04:54 <bbot2> Failure: http://buildbot.pypy.org/builders/rpython-win-x86-32/builds/16 [default]

06:30 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-v7/builds/1302

06:38 <bbot2> Failure: http://buildbot.pypy.org/builders/own-win-x86-32/builds/1613 [default]

07:02 jamesaxl has joined #pypy

07:18 <kenaan_> mattip default d7c94a4970dd /pypy/module/_continuation/test/conftest.py: generate conf.h for tests

07:18 <kenaan_> mattip py3.5 d2807ddb8178 /pypy/module/_continuation/test/conftest.py: merge default into py3.5

07:27 marr has joined #pypy

07:44 Nizumzen has quit [Ping timeout: 240 seconds]

08:16 Nizumzen has joined #pypy

08:32 antocuni has joined #pypy

08:41 <antocuni> arigato: I remember that once you had a script to take a pypy usession directory and create one with only the files needed to run "make"

08:41 <antocuni> is it still somewhere?

08:47 mattip has joined #pypy

08:48 <mattip> antocuni: played with eventlet a bit, http://paste.openstack.org/show/627291

08:49 <mattip> runtest == 4 shows about half the samples, so it is something like

08:49 <mattip> running an eventlet does not preserve, but does restore, the vmprof.enable

08:50 mattip has left #pypy ["bye"]

08:51 <antocuni> mattip: interesting

08:51 <antocuni> I'm looking at it right now

08:57 <fijal> antocuni: I would expect if you do loads('{"foo":"bar"}') (very short string), then rffi2charp will hurt you significantly in _pypyjson

08:57 <antocuni> fijal: yes, maybe

08:58 <antocuni> fijal: as you said, nowadays it's probably much better to pin the llstr to call rdtoa

08:59 <fijal> pinning is not "free", but it's not very expensive either

08:59 <fijal> it's surely cheaper than copying a buffer

08:59 <antocuni> good, then

09:00 <antocuni> the question is whether to pin the whole buffer for the duration of the parsing, or pinning it briefly only when you need to call rdtoa

09:04 <kenaan_> fijal unicode-utf8 109fd5f5d4eb /pypy/: start working on pypyjson

09:04 <kenaan_> fijal unicode-utf8 8fac293591e9 /pypy/module/_io/interp_textio.py: merge

09:05 <fijal> the latter I would think

09:05 <fijal> we kinda promised not to do the former

09:05 <fijal> I mean it depends - is one loads under one GIL?

09:05 marr has quit [Ping timeout: 260 seconds]

09:05 <antocuni> I think so, why wouldn't it?

09:07 <fijal> I don't know - can it execute python code in between?

09:07 <fijal> I mean, I'm sure it can somehow, but in a meaningful way

09:10 <antocuni> I has a space.call_function

09:10 <antocuni> although the function it's always space.w_int

09:10 <fijal> yeah that's what I meant with "meaningful way"

09:10 <fijal> I'm sure it can call __eq__ or something

09:11 <antocuni> well, probably not since it only handles very primitive types: list, dict, int, float, str

09:11 <antocuni> but it's hard to prove

09:12 <fijal> I don't think i need a proof

09:12 <fijal> just the "how likely it is to call something massive"

09:12 <fijal> and I think "not likely at all"

09:12 <njs> do you support all the random silliness that json.load does, like object_pairs_hook?

09:13 <fijal> we do, but we probably don't go through _pypyjson then

09:13 <bbot2> Success: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/3190

09:14 <antocuni> njs: if you pass "strange" options, you go through the slow pure-python version

09:14 <antocuni> if you just to json.loads, you use the fast _pypyjson

09:17 <antocuni> wow, I'm trying to compile a pypy on my machine; OOM kiiled the final "ld" because it was using 4GB of ram O_o

09:21 <fijal> are you using -flto or something?

09:21 <antocuni> I'm just using "make debug"

09:22 <antocuni> I don't think it uses flto nowadays

09:22 <fijal> it went back and forth, I don't know any more

09:22 <antocuni> looking at the command line, it doesn't seem so

09:23 <fijal> then it looks like a bug?

09:24 <antocuni> or maybe it really uses so much, but nobody never noticed?

09:30 yuyichao_ has joined #pypy

09:30 yuyichao has quit [Ping timeout: 268 seconds]

09:33 <fijal> antocuni: there is another reason, \0 at the end

09:33 <fijal> I *think* we started adding them everywhere?

09:33 <fijal> or at least thought about it

09:34 <antocuni> I don't really know

09:46 yuyichao has joined #pypy

09:49 yuyichao_ has quit [Ping timeout: 240 seconds]

09:50 <fijal> arigato: ping

09:52 oberstet2 has joined #pypy

09:53 <kenaan_> fijal unicode-utf8 8a24f68050df /pypy/module/_ssl/interp_ssl.py: fix _ssl module

10:01 mattip has joined #pypy

10:05 <antocuni> ok, I *think* the vmprof+eventlet problem is caused by the fact that at each switch, we call vmprof_stop_sampling twice, and vmprof_start_sampling only once

10:05 <mattip> about translation using too much memory, if you rerun make alone (outside translation) it uses less memory

10:06 <mattip> something like the translation process memory is leaking into the forked subprocess used to run make

10:06 <antocuni> mattip: I'm already running make alone

10:06 <mattip> ahh, never mind then

10:17 <kenaan_> fijal unicode-utf8 467a32f09dd6 /: start fixing _rawffi

10:17 <fijal> I wonder if we can kill _rawffi

10:19 <antocuni> isn't it still used by ctypes?

10:22 <fijal> yeah

10:22 <fijal> who uses ctypes

10:22 <fijal> ok, so here is a question

10:22 <fijal> we allow so far creation of invalid unicode out of C functions (we don't check at the boundary)

10:22 <fijal> is it ok?

10:23 <antocuni> I think ctypes is still used by some modules written years ago

10:24 <fijal> like that's a change

10:24 <fijal> (not be able to create invalid unicode using ctypes)

10:24 <fijal> but I'm not sure if it's not a bug

10:26 <antocuni> not sure what you mean

10:27 <fijal> antocuni: if you have a function returning wchar_t*

10:27 <fijal> we don't check the ranges of output, we just blindly copy them into the unicode string

10:28 <fijal> so if you have wide build and return invalid unicode and you call function via ctypes we get invalid unicode strings

10:29 <njs> does cpython validate in this case?

10:30 <fijal> I doubt it

10:30 <fijal> maybe on python3, they did improve

10:30 * fijal wonders if he feels like checking

10:30 <antocuni> yes, I think we should basically try to mimic cpython

10:31 <antocuni> if cpython allows it, I am SURE that in the wild there is some obscure module which returns invalid unicode from ctypes :)

10:32 <fijal> antocuni: well, that's not possible

10:32 <fijal> we decided we are not going to allow invalid unicode

10:32 <fijal> but I wonder if it's a bug :)

10:33 <antocuni> so if we have already decided, why do you ask? :)

10:33 <fijal> was this always a bug?

10:34 <fijal> I just found out that it does not check (cffi does for example)

10:49 antocuni has quit [Ping timeout: 240 seconds]

10:49 <arigato> fijal: that's known. you can also use the array module to make invalid unicodes (on python 2)

10:50 <arigato> we pondered it and decided not to care for now

10:57 <fijal> right

10:57 <fijal> arigato: do you feel like fixing _sre?

10:57 <fijal> I'll add checking to wcharp2utf8 and stuff and then catch ValueError in callers, I would think

10:57 <arigato> ok

10:58 <arigato> wcharp2utf8 already checks, by calling the _append() function which checks

10:58 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-v7/builds/1312

10:58 <fijal> right, but the caller should catch ValueError right?

10:58 <arigato> yes

10:58 <arigato> I wrote a comment in wcharp2utf8()

10:59 <arigato> and I made sure the only user (at that point in time two days ago) was catching ValueError

11:04 <fijal> right, I need to make sure too

11:04 <fijal> arigato: we have _rawffi, _sre and _io and we're done with modules

11:04 <fijal> then we can do some proper tests

11:04 <fijal> and _pypyjson

11:04 <fijal> and try to merge to py3k

11:05 <fijal> if we can merge it by next week, we have a month to finalize benchmarking, do a release and write a proper blog post

11:05 <fijal> which should be enough?

11:27 Nizumzen has quit [Quit: KVIrc 4.2.0 Equilibrium http://www.kvirc.net/]

11:47 <mattip> not sure what is going on with s390x and virtualenv

11:47 <mattip> http://buildbot.pypy.org/builders/own-linux-s390x/builds/774/steps/shell_6/logs/stdio

11:47 <mattip> I thought creating a virtualenv installs updated pip and setuptools?

11:48 <mattip> maybe I should try forcing an update?

11:50 <kenaan_> mattip buildbot 0a18cb374a4e /bot2/pypybuildbot/builds.py: not needed, virtualenv is deleted by "hg purge"

11:50 <kenaan_> mattip buildbot 0548ff25f980 /bot2/pypybuildbot/builds.py: update pip, setuptools

11:53 <bbot2> Exception: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-raspbian/builds/1527

11:54 <bbot2> Exception: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1475

11:55 bbot2 has quit [Quit: buildmaster reconfigured: bot disconnecting]

11:56 bbot2 has joined #pypy

11:56 <mattip> let's see what happens tonite

12:01 jcea has joined #pypy

12:38 antocuni has joined #pypy

12:58 jcea has quit [Quit: jcea]

13:00 jcea has joined #pypy

13:00 <kenaan_> cfbolz unicode-utf8 82223a975b6b /pypy/module/_pypyjson/interp_decoder.py: fix unicode \-encoding in _pypyjson

13:00 <kenaan_> cfbolz unicode-utf8 a9bb96fbf9d4 /pypy/: fix more tests BUT: a slight pessimization, because object decoding becomes a little bit slower

13:02 jcea has quit [Client Quit]

13:04 jcea has joined #pypy

13:21 <arigato> cfbolz: why can't decode_key() return a utf8 byte string instead of a unicode string on default?

13:21 <cfbolz> arigato: it can, but it doesn't help anyway

13:21 <cfbolz> because on the branch there is no general UnicodeDictStrategy

13:22 <cfbolz> (on the branch it only works for ascii strings :-( )

13:25 <arigato> I'm fearing that you're chanching pypyjson in this way because it makes sense for now, but then we'll need UnicodeDictStrategy anyway, and we'll forget to revert pypyjson

13:25 <cfbolz> yes, I see that fear. I should at least put a todo

13:27 <kenaan_> cfbolz unicode-utf8 8dac9e38c3d5 /TODO: add todo

14:15 <kenaan_> cfbolz unicode-utf8 6a13aba253bd /rpython/rlib/: use an actual iterator, to make the code nicer (they work well in rpython nowadays)

14:15 <kenaan_> cfbolz unicode-utf8 5b81f483c459 /pypy/module/_pypyjson/interp_encoder.py: fix encoding to operate on utf-8 encoded strings

14:21 <cfbolz> arigato: before I continue a lot, could you take a look at this diff?:

14:21 <cfbolz> https://www.irccloud.com/pastebin/1aGawBTa/

14:34 <arigato> looks good to me

14:36 <cfbolz> pfff, confusion

14:39 jamesaxl has quit [Read error: Connection reset by peer]

14:39 jamesaxl has joined #pypy

14:56 mattip has left #pypy ["bye"]

15:02 rubdos has quit [Ping timeout: 250 seconds]

15:13 <kenaan_> cfbolz unicode-utf8 f5be33826726 /rpython/rlib/: support for append_utf8

15:13 <kenaan_> cfbolz unicode-utf8 48da1a44d860 /pypy/objspace/std/unicodeobject.py: replace a lot of uses of StringBuilder by Utf8StringBuilder

15:24 <kenaan_> cfbolz unicode-utf8 f5a5189e5314 /pypy/objspace/std/unicodeobject.py: small cleanup of copy-pasted join code

15:26 <cfbolz> arigato: it's all completely annoying. architecture-wise we should have a type in rutf8 that contains most of the logic in unicodeobject.py. then, unwrapping a w_unicode would give that type. but then we would get yet another indirection.

15:43 <arigato> yes

15:43 <arigato> the alternative would be to add a field to the low-level rstr

15:43 <arigato> but it's also annoying

15:44 <arigato> of course, all these tuple-returning functions we have in the branch now are also relatively costly

15:46 <antocuni> uh, apparently we don't have a way to check whether we already installed a `pypyjit.set_compile_hook` :(

15:47 <arigato> cfbolz: maybe at some point we should do something about that

15:48 <cfbolz> arigato: or not, to discourage designs where you return a lot of tuples :-P

15:48 <cfbolz> But yes, I see your point

16:09 marr has joined #pypy

17:17 <fijal> cfbolz: one of my thinking was "let's not have yet another layer of rpython magic"

17:18 <fijal> we can make tuple returning function do what they would do in C right?

17:18 <fijal> specifically x, y = foo() kinda call

17:18 <fijal> it seems even easy-ish

17:19 <arigato> it's all but easy-ish

17:19 <arigato> it's a mess that have implications everywhere including throughout the JIT

17:19 <fijal> and the gc?

17:20 <arigato> dunno, I can see a way that makes it have no implications in the GC

17:20 <arigato> but everywhere else

17:21 <fijal> right

17:21 <fijal> well, any good ideas how to do it otherwise?

17:22 <kenaan_> rlamy default 2477eb379774 /pypy/module/_io/interp_textio.py: Keep chipping away at readline_w()

17:22 <arigato> no magic idea that will solve all your use cases, no

17:23 <fijal> well, one option would be to return the builder

17:23 <fijal> which is again a bit of a mess for JIT

17:25 <fijal> arigato: is there a good way to measure if returning a tuple is indeed a problem?

17:29 <fijal> arigato: should I attack _rse?

17:33 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6390 [fijal: force build, unicode-utf8]

17:36 <arigato> feel free to start but please tell me if you stop, so that I can work on it

17:37 <fijal> arigato: ah ok, I won't do anything today I think

17:37 <fijal> and maybe I should actually not do anything tomorrow either :-)

17:37 <fijal> so feel free to do anything you want, I still have _rawffi to whack if I want to do something

17:49 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6390 [fijal: force build, unicode-utf8]

17:51 oberstet2 has quit [Ping timeout: 250 seconds]

17:51 <fijal> http://buildbot.pypy.org/summary/longrepr?testname=AppTestStacklet.%28%29.test_new_empty&builder=own-linux-x86-64&build=6390&mod=pypy.module._continuation.test.test_stacklet

17:51 <fijal> ok that looks like someone broke default

17:51 <fijal> is it fixed?

17:53 <fijal> arigato: so cfbolz has a good point that we already use utf8 on pypy3

17:53 <fijal> so maybe having an rpython-level utf8 string would solve both the tuple issue and pypy3 issue?

18:06 <arigato> right, it would solve a few deeper issues than the tuple one, like recomputing things currently stored on W_UnicodeObject in some situations

18:06 <arigato> on the other hand, it's a major mess

18:07 <arigato> pypy3 doesn't "have" a utf8 string, it just uses a regular string that happens to contain utf8

18:10 <fijal> why is it a major mess?

18:11 <fijal> I mean, we would use a subclass of str() on emulated level, with rpython-level being slightly different with extra fields

18:11 <fijal> I think the main problem is that emulated level will be even slower, but maybe that's ok?

18:19 <arigato> so you're thinking about a rstr.UTF8STR that would look like a rstr.STR with a few extra fields?

18:19 <fijal> yeah

18:19 <fijal> and the emulated layer would be *cough* a subclass of str

18:19 <arigato> would it annotate as a different and incompatible SomeUtf8String ?

18:20 <fijal> yeah

18:20 <arigato> I'm sure you'll need sometimes to convert between that and a regular str, is making a copy ok?

18:21 <fijal> we can have an operation that does that

18:21 <fijal> makes a copy while emulated and a cast when not emulated

18:21 <fijal> "cast"

18:21 <fijal> one way you need to scan a string anyway

18:21 <arigato> well, how do you "cast"?

18:21 <fijal> right

18:21 <arigato> the rstr.UTF8STR cannot be compatible enough, not easily

18:22 <arigato> you'd need a copy, which defeats the point of .encode('utf8') not making a copy

18:22 <fijal> indeed

18:22 <fijal> there are messier options of course

18:24 <fijal> like, have a bit saying which one is it and storing extra data at the end of the string

18:24 <fijal> (which is, super messy)

18:24 <arigato> as I said earlier we could have an extra pointer inside all rstr.STR

18:24 <arigato> so that we don't need a different rstr.UTF8STR

18:25 <fijal> yes, that's an option too

18:25 <fijal> it kinda shifts the balance in RPython a bit

18:25 * fijal should really make food

18:25 <fijal> arigato: the problem is as follows - what do we do with py3k?

18:25 <fijal> where text_w returns utf8 string (but no flags)

18:26 <fijal> do we rerun check_utf8 when rewrapping it?

18:26 <fijal> maybe?

18:26 <fijal> and we write a super fast check_utf8

18:26 <fijal> or do we do something else?

18:26 <fijal> that sounds like the easiest option for now (and one that's also an improvement on the current setup anyway)

18:27 <arigato> I guess you're talking in the continuation of the current work

18:27 <arigato> not in the RPython string hack world

18:27 <fijal> I mean - how do we merge utf8 to py3k

18:27 <arigato> because in the RPython string hack world, it's easier

18:27 <fijal> after the merge to default

18:27 <arigato> yes, I understand

18:27 jamesaxl has quit [Read error: Connection reset by peer]

18:28 <arigato> I'm saying, we came up with a different idea, so let's explore it a little bit

18:28 <fijal> yes, sure

18:28 <arigato> in this different world, it's easier for py3k

18:28 <fijal> so the question is - do we explore it now or do we first try to merge the current approach to py3k?

18:28 <arigato> who knows

18:28 jamesaxl has joined #pypy

18:29 <fijal> note that even if we call check_utf8 at the rewrapping, it's STILL a massive improvement over the current situation

18:29 <fijal> and gives us clear path how to finish the branch (and mozilla contract)

18:29 <fijal> maybe we should make it a Leysin sprint topic "improve even further" :-)

18:30 <arigato> I should ask I guess: are you sure that the work that CPython/PyPy5.9 does in .encode('utf8') and .decode('utf8') is really enough to offset the extra overhead in the unicode-utf8 branch of mostly every other operation?

18:31 <arigato> well it's also less memory, so it's not clearly "every other operation"

18:31 <fijal> what is "every other operation"?

18:32 <fijal> getitem, sure

18:32 <arigato> but every operation actually looking inside the string, like most unicode methods, are probably a bit slower

18:32 <fijal> (and yes, I believe so)

18:32 <fijal> I doubt it

18:32 <fijal> eg find scans a lot less of memory

18:33 <arigato> ok

18:33 <arigato> I guess we'll see in benchmark results

18:33 <fijal> startswith for example should be faster

18:33 <fijal> arigato: well, give me an example :)

18:33 <arigato> things like UnicodeDictStrategy missing is probably costing something too

18:34 <fijal> again, no

18:34 <fijal> because I added the one for ascii

18:34 <fijal> and we don't run a single benchmark with an actual non-ascii unicode payload I think

18:35 <fijal> isupper is probably slower

18:36 <fijal> no, it's exact same speed on constant string

18:36 <arigato> ok, then maybe. I'll trust the benchmarks

18:36 <fijal> I think we SHOULD benchmark unicode non-ascii payloads :)

18:37 <fijal> but then we never did, so complaining that the branch might do something there is a bit problematic

18:37 <arigato> it seems to me that there is more complexity, which will translate into slower interpreted code and more bridges in the JIT

18:37 <arigato> but that's only a guess

18:39 <arigato> "more bridges" is mostly about: you do a small operation on a unicode string, and you get a bridge for ascii/non-ascii-unicode-string

18:39 <fijal> right

18:40 <fijal> let's translate and have a look

18:40 <fijal> we should also carefully look at some logs

18:40 <fijal> (eg check_utf8 forcing virtual strings etc)

18:41 <fijal> but that's a bit why I wanted to finish the modules, so I can have benchmarks

18:42 <cfbolz> FWIW, I also think that we should reduce the number of special cases for ASCII

18:43 <fijal> ok, so the logs are quite bad for example

18:43 <fijal> jitlogs

18:45 <cfbolz> For what kind of operation?

18:45 <fijal> addition, here

18:45 <fijal> for i in range(10000): unicode(i) + some_constant_unicode

18:46 <fijal> maybe bytes object should keep in mind where it's a valid utf8

18:46 <fijal> arigato: anyway, yes, lots of tweaking required

18:46 <fijal> so no, I'm not sure

18:47 <fijal> but we need to check

18:59 yuyichao has quit [Ping timeout: 240 seconds]

19:12 yuyichao has joined #pypy

19:16 slacky__ has quit [Ping timeout: 248 seconds]

19:27 rubdos has joined #pypy

19:45 <kenaan_> rlamy default 189c2cce360e /: More refactoring: deal with the remnant more explicitly and handle size limit inside _find_line_ending()

19:59 antocuni has quit [Ping timeout: 260 seconds]

20:13 jamesaxl has quit [Read error: Connection reset by peer]

20:15 jamesaxl has joined #pypy

20:20 <kenaan_> rlamy default 9c9233da7cc4 /pypy/module/_io/: Replace (pos-if-found, pos-if-not-found) tuple with (position, found)

20:26 <kenaan_> rlamy unicode-utf8 f9a1926628b2 /: hg merge default

20:37 cjwelborn has quit [Ping timeout: 252 seconds]

20:55 jcea has quit [Remote host closed the connection]

20:55 jcea has joined #pypy

21:34 traverseda has quit [Read error: Connection reset by peer]

21:42 traverseda has joined #pypy

22:17 jamesaxl has quit [Quit: WeeChat 1.9.1]

22:29 pilne has joined #pypy

22:34 traverseda has quit [Remote host closed the connection]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-s390x/builds/775 [default]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf/builds/1667

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armel/builds/1980

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raring/builds/1731

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-linux-armhf-raspbian/builds/1607

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf-v7/builds/1435

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-linux-armel/builds/1867

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/1736

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3442 [default]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/5097 [default]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6391 [default]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3531 [default]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5583 [default]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-win-x86-32/builds/1614 [default]

23:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/4345 [default]

23:01 <bbot2> Success: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf-v7/builds/1435

23:03 <bbot2> Success: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf/builds/1667

23:27 <bbot2> Success: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6391 [default]

23:28 cjwelborn has joined #pypy

23:30 <bbot2> Success: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5583 [default]

23:31 <bbot2> Failure: http://buildbot.pypy.org/builders/own-win-x86-32/builds/1614 [default]

23:39 kolko has quit [Quit: ZNC - http://znc.in]