#pypy on 2017-06-07 — irc logs at freenode.irclog.whitequark.org

00:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/566 [default]

00:12 jamescampbell has joined #pypy

00:17 jamescampbell has quit [Ping timeout: 246 seconds]

00:28 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raring/builds/1578

00:40 marr has quit [Ping timeout: 240 seconds]

00:42 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3073 [default]

00:57 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-linux-armhf-raspbian/builds/1456

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3074 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4672 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5991 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5234 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3996 [py3.5]

01:09 yuyichao has quit [Ping timeout: 258 seconds]

01:16 buhman has joined #pypy

01:17 glyph has quit [Excess Flood]

01:17 glyph has joined #pypy

01:22 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-linux-armel/builds/1716

01:26 <bbot2> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/566 [default]

01:33 yuyichao has joined #pypy

02:01 ArneBab has joined #pypy

02:02 pilne has quit [Quit: Quitting!]

02:05 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-s390x/builds/599 [default]

02:05 ArneBab_ has quit [Ping timeout: 255 seconds]

02:08 jcea has quit [Quit: jcea]

02:09 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4672 [py3.5]

02:28 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3996 [py3.5]

02:31 jacob22_ has quit [Ping timeout: 255 seconds]

02:32 jacob22_ has joined #pypy

02:35 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/1583

02:39 Taggnostr has joined #pypy

02:40 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5991 [py3.5]

02:51 Garen has quit [Read error: Connection reset by peer]

02:51 Garen has joined #pypy

02:58 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1333

02:58 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-raspbian/builds/1387

02:59 ArneBab_ has joined #pypy

03:00 <bbot2> Started: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-64/builds/2231

03:00 <bbot2> Started: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/3023

03:00 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-raspbian/builds/1387

03:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1334

03:03 ArneBab has quit [Ping timeout: 246 seconds]

03:10 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5234 [py3.5]

03:29 ArchDebian has joined #pypy

03:40 dw_ is now known as dw

04:14 ArchDebian has quit [Ping timeout: 240 seconds]

04:28 ArchDebian has joined #pypy

05:00 inhahe_ has quit [Read error: Connection reset by peer]

05:01 inhahe_ has joined #pypy

05:09 chelz has joined #pypy

05:09 jamadden has quit [Quit: Leaving.]

05:30 jacob22_ has quit [Quit: Konversation terminated!]

05:36 jamescampbell has joined #pypy

05:47 arigato has joined #pypy

05:53 tilgovi has joined #pypy

06:00 tilgovi has quit [Read error: Connection reset by peer]

06:03 marvin has joined #pypy

06:03 marvin is now known as Guest3081

06:05 <kenaan> Alecsandru Patrascu ctypes_char_indexing dd4db811a46e /lib_pypy/_ctypes/array.py: no string if we're array of char

06:06 <kenaan> Alecsandru Patrascu ctypes_char_indexing c5adea5c7b06 /lib_pypy/_ctypes/array.py: fix char_p and unichar_p interpretation

06:06 <kenaan> Alecsandru Patrascu ctypes_char_indexing 76bf604679cd /pypy/module/test_lib_pypy/ctypes_tests/test_array.py: added tests for char_p and unichar_p interpretation

06:06 <kenaan> Alecsandru Patrascu ctypes_char_indexing 2ea696d9721d /lib_pypy/_ctypes/array.py: indentation fix

06:06 <kenaan> arigo default 60d070027d70 /: Merged in palecsandru/pypy_ctypes_char_indexing/ctypes_char_indexing (pull request #552) Indexing into char* behav...

06:15 ArchDebian has quit [Ping timeout: 240 seconds]

06:16 <kenaan> arigo default e2b62fc32b15 /pypy/module/test_lib_pypy/ctypes_tests/test_array.py: Simplify and complete the tests

06:26 realitix has joined #pypy

06:30 Ubuntu-BR has joined #pypy

06:35 amaury has joined #pypy

06:41 <buhman> https://gist.github.com/buhman/79fd0f9683e561de6ff129952a718bbc I find this result quite surprising

06:41 <buhman> pypy isn't even using more CPU either.

06:42 <arigato> §what is dd_rescue?

06:43 <arigato> the two outputs seem very similar to me, what is the problem?

06:44 <amaury> The kB/s, I guess

06:44 <njs> buhman: if you're just using dd_rescue as a way to get stats on output, then you might like 'pv' (= 'pipe viewer')

06:44 <buhman> I thought people might question 'pv' more than dd_rescue

06:44 <buhman> I guess not

06:45 <buhman> arigato: pypy is 5x slower

06:47 <njs> hmm, yeah, I can reproduce - I'm getting ~1 GiB/s out of cpython, and ~140 MiB/s out of pypy (after leaving it running for a full minute)

06:51 realitix has quit [Ping timeout: 255 seconds]

06:51 <njs> pypy: http://vmprof.com/#/01a59d38-f374-4fe6-80c2-d2f1f1934128 / cpython: http://vmprof.com/#/6ebdd95c-5790-4770-8541-cc4abe262856

06:53 <arigato> fwiw, I would have asked the same question with 'pv' instead of 'dd_rescue'

06:53 <arigato> I use 'time' to measure wall clock time...

06:54 <buhman> arigato: yeah, but I wanted to express the result as number of bytes written over time

06:54 <njs> the most suspicious thing I see in feed() is a call to self._buffer.extend(data), where self._buffer is a bytearray

06:55 <buhman> hmm

06:58 <buhman> njs: maybe we wouldn't have this problem if trio had subprocess support ;p

07:00 <buhman> I didn't know about vmprof; this is really cool

07:16 <bbot2> Failure: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf/builds/1492

07:23 realitix has joined #pypy

07:23 <cfbolz> buhman, njs: well, pypy3 is not that speed-tuned yet. please file a bug

07:26 amaury has quit [Ping timeout: 255 seconds]

07:26 <njs> buhman: well, it's true that trio's subprocess support will hopefully end up with simpler code paths than asyncio's, but if bytearray is sluggish than that's important and needs fixing :-)

07:26 <njs> bytearray.extend is on the critical path of pretty much any reasonable py3 network server

07:28 amaury has joined #pypy

07:30 <arigato> http://paste.pound-python.org/show/rpwbTWWZsZqIOkuvzaDI/ is much faster on pypy3 than python3.5

07:31 <arigato> just saying

07:31 <cfbolz> ;-)

07:32 <njs> yeah, it's a bit mysterious. the bytearray thing is just a guess - can vmprof give line-by-line profiles, or does that even mean anything once the JIT gets involved?

07:35 amaury has quit [Ping timeout: 268 seconds]

07:43 <LarstiQ> njs: iirc it can on cpython

07:49 cstratak has joined #pypy

07:50 Remi_M has joined #pypy

08:01 vkirilichev has joined #pypy

08:02 v3d has joined #pypy

08:02 kenaan has quit [Ping timeout: 260 seconds]

08:05 hephaestus has quit [Ping timeout: 240 seconds]

08:07 <bbot2> Failure: http://buildbot.pypy.org/builders/own-win-x86-32/builds/1425 [default]

08:12 cstratak has quit [Ping timeout: 246 seconds]

08:16 antocuni has joined #pypy

08:16 Ubuntu-BR has quit [Ping timeout: 260 seconds]

08:26 <bbot2> Success: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/3023

08:26 cstratak has joined #pypy

08:28 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3074 [py3.5]

08:30 Ubuntu-BR has joined #pypy

08:52 jamescampbell has quit [Quit: Leaving...]

08:55 gabrielm has joined #pypy

08:58 jacob22_ has joined #pypy

08:58 jamescampbell has joined #pypy

08:59 <gabrielm> Hi! I studied a bit the json library and I tried to reduce the memory consumption using memory_pressure. From my test it result that the speed is the same but the memory consumption can drop by 2x . This is related to this issue https://bitbucket.org/pypy/pypy/issues/1124

09:00 <gabrielm> It is OK to create a pull request with this modifications?

09:02 <arigato> yes

09:02 <arigato> where did you add memory_pressure?

09:08 <gabrielm> in file _pypjson/interp_decoder.py in loads() method after decoder = JSONDecoder(space, s)

09:08 <gabrielm> for loads

09:09 <arigato> uh, and by how much?

09:09 <gabrielm> down to 50%

09:09 <arigato> no, I mean

09:09 <gabrielm> aaa

09:09 <arigato> if you call add_memory_pressure() you need to give it an argument

09:10 <gabrielm> rgc.add_memory_pressure(len(s))

09:10 <arigato> that doesn't seem to make sense, because there is a try: finally: that will release the string deterministically

09:12 marr has joined #pypy

09:13 <gabrielm> in finally there is a call to close method that does free some thing but perhaps there are more intermediate objects that could be freed

09:14 <gabrielm> things*

09:14 <arigato> no, in RPython normally objects are managed by the GC, which knows when to free things

09:15 <arigato> it's possible that there is another place elsewhere where memory pressure would be useful, and then adding add_memory_pressure() here helps with that other place in this case

09:16 <arigato> or, it's also possible that RPython's GC is not correctly handling this case

09:16 <gabrielm> in my worload I do only json operations

09:16 <gabrielm> workload*

09:17 jamescampbell has quit [Remote host closed the connection]

09:19 <gabrielm> I'm talking about this workload:

09:19 <gabrielm> or l in open('data.txt'):

09:19 <gabrielm> j = json.loads(l)

09:19 <gabrielm> for*

09:19 <arigato> it probably doesn't change much, but try first to close the file

09:19 <LarstiQ> gabrielm: there is no dispute that your change doesn't lower the memory

09:20 hephaestus has joined #pypy

09:23 <arigato> what size has the data.txt file?

09:23 v3d has quit [Ping timeout: 246 seconds]

09:23 <gabrielm> https://pastebin.com/d3LZsMVP

09:23 <arigato> ah, it's read one line at a time, anyway

09:23 <gabrielm> 3.9 GB

09:24 <gabrielm> yes

09:26 cwillu_at_work has quit [Ping timeout: 260 seconds]

09:27 <arigato> note that if you replace "j = json.loads(l)" with "pass"

09:27 <arigato> you get very similar results

09:28 <antocuni> so, basically, the only effect of add_memory_pressure is to cause a major collection more often?

09:28 <arigato> yes, but in this precise case, it's unrelated to json

09:28 <gabrielm> hmm, so there is a problem with iterating a file line by line?

09:29 squeaky_pl has joined #pypy

09:29 <arigato> at least for me, if I comment out the whole "for" loop, then I get 70MB instead of 80MB

09:29 <arigato> unsure where the extra 10MB go, but I wouldn't be surprized at all if:

09:30 <arigato> you run the very short-running program, it won't even fill the heap enough to trigger a major collection

09:30 <arigato> you run the longer-running program, garbage accumulates until major collections occur

09:30 <antocuni> gabrielm: no, it's one artifact of how a GC works; from the point of view of the OS the memory continuously grows until you do a major collection, which shrinks it again. If you run the major collections more often, you see lower peaks

09:31 <antocuni> gabrielm: but the total amount of memory actually used by your program does not change

09:32 <gabrielm> I see

09:32 <arigato> it's unlikely that reading a file line-by-line creates garbage in the form that add_memory_pressure() helps with

09:33 <arigato> can you give us an example where there is really a 2x drop-down?

09:33 <arigato> not just 10%

09:33 <antocuni> arigato: I think it's a 2x in the sense that the increment is 12.3 MiB vs 28.7 MiB

09:34 <arigato> uh, ok

09:34 <arigato> then no, this way to measure doesn't make sense on pypy

09:34 <arigato> if your program happens to consume 1GB of memory before, then you might see memory grow by 500MB before any major collection occurs

09:34 <arigato> which is normal

09:36 <arigato> and indeed, the add_memory_pressure() that you added has the effect of triggering the major collection after 250MB of extra memory instead of 500MB

09:37 <arigato> because it says to the GC, "there are N extra bytes of memory that you don't see", in addition to the N-bytes of the string that is seen (the largest objects in this case)

09:37 * antocuni --> lunch

09:38 <arigato> an equivalent way to achieve the same result is to tweak the environment variable PYPY_GC_MAJOR_COLLECT

09:38 <gabrielm> I understand

09:38 <arigato> see rpython/memory/gc/incminimark.py

09:39 <gabrielm> so that issue can be solved?

09:39 <arigato> the default of 1.82 was chosen because it's a good balance between memory and speed

09:39 cwillu_at_work has joined #pypy

09:40 <arigato> well, my point is that I wonder: the issue says 300MB, but the small example you gave today is lower than 100MB

09:41 antocuni has quit [Ping timeout: 246 seconds]

09:42 <arigato> can you reproduce the 300MB of catalinif?

09:46 <gabrielm> nope, my results are this: https://pastebin.com/EjuYSv3V using pypy 5.7.1

09:48 <arigato> trying on my own, gendata.py takes a while...

09:48 <gabrielm> you need to install also module memory_profiler

09:49 <arigato> no, I'm using a different tool that only reports the maximum RSS of the program

09:49 <gabrielm> what tool? :D

09:50 <cfbolz> time?

09:51 <arigato> ...right, /bin/time also outputs that number

09:51 <arigato> the one I'm used to is

09:51 <arigato> https://bitbucket.org/arigo/arigo/raw/default/hack/misc/memusage.py

09:51 <arigato> but indeed, /bin/time is probably giving more precise results

09:53 <gabrielm> thank you!

09:53 <arigato> yes, "maxresident" seems reasonable

10:05 <gabrielm> should I try to continue with this?

10:07 nimaje1 has joined #pypy

10:07 nimaje1 is now known as nimaje

10:07 nimaje has quit [Killed (rajaniemi.freenode.net (Nickname regained by services))]

10:10 <arigato> yes, I think an interesting remaining question is:

10:10 <arigato> open('data.txt').readlines()

10:11 <arigato> not on the whole 3.7 GB file

10:11 <arigato> but on the smaller, 34 MB example from the original report of issue #1124

10:11 <arigato> this line seems to consume 10x more memory in PyPy than in CPython

10:13 <arigato> (not to mention, it's very much slower on pypy)

10:14 <arigato> (but that may be only a side-effect of it consuming so much more memory)

10:14 <gabrielm> ok, so this means that this issue is not related to json

10:14 <arigato> exactly, or at least not any more

10:15 <arigato> (maybe it was related to json too in 2012)

10:16 <gabrielm> OK, I will have a look at this and I will post an update on the issue

10:16 <arigato> thanks!

10:17 <arigato> it doesn't really make sense that memory usage is so much higher, given that it should only build a list of strings

10:17 Ubuntu-BR has quit [Ping timeout: 255 seconds]

10:17 Guest1244 has joined #pypy

10:18 <arigato> any extra memory that is uses should be garbage that goes away while the list is being built, not garbage that stays alive until the list is fully built...

10:19 <gabrielm> yes, I agree

10:19 <arigato> http://paste.pound-python.org/show/TwwUnTRWZ3EHdXIJKGLg/

10:20 <arigato> if I run this with various values in the range(), I see that doing more iterations consumes a lot more memory

10:21 jcea has joined #pypy

10:23 <arigato> right, but up to a maximum, and the maximum is lower than 2x the memory after one iteration, so it's all explainable with the GC

10:24 <arigato> except that we don't know why running it once already consumes so much

10:28 <gabrielm> I will take a look inside, maybe I will find some explanation

10:41 Tiberium has joined #pypy

10:43 oberstet has joined #pypy

10:43 gabrielm has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

10:56 <arigato> ok, something is really wrong

10:57 <arigato> the 34MB data.txt loads as a list of 10'000 strings only

10:57 <arigato> but it seems to take 266MB on PyPy, just the list of strings object

10:58 jamadden has joined #pypy

10:58 <arigato> according to gc.dump_rpy_heap("dumpfile")

10:59 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4673 [plan_rich: force build, vmprof-0.4.8]

10:59 kenaan has joined #pypy

10:59 <kenaan> plan_rich vmprof-0.4.8 876d67108576 /pypy/module/_vmprof/interp_vmprof.py: missing vmprof_ prefix

11:02 <bbot2> Exception: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4673 [plan_rich: force build, vmprof-0.4.8]

11:02 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4674 [plan_rich: force build, vmprof-0.4.8]

11:03 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4674 [plan_rich: force build, vmprof-0.4.8]

11:06 <kenaan> arigo default 7f9ffd5232c6 /pypy/tool/gcdump.py: Tweak to also report the single largest objects

11:06 <kenaan> plan_rich vmprof-0.4.8 ba7e4af9836a /: added prefix vmprof_ to wrong callsite

11:06 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4675 [plan_rich: force build, vmprof-0.4.8]

11:06 squeaky_pl has quit [Remote host closed the connection]

11:06 <arigato> ah, in fact, after readlines(), there is a list alive *somewhere* that contains one string *per character*

11:07 <arigato> so after readlines() on a 34MB file, you get 266.51M in one list somewhere

11:07 <cfbolz> Argh

11:08 <arigato> it's a list of rpython strings, but for some reason, they are shared, not 34 million strings

11:08 <arigato> (or, for all I know, it's a list of 34 million Nones)

11:10 <arigato> I bet it's a list of 34 million single characters, actually, but stored as a list of full strings

11:15 <arigato> aaaAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAHHHHHHHH

11:16 <arigato> ok I got it

11:16 <cfbolz> yes?

11:16 <arigato> "aaaaaaaaaaaaah" indeed

11:17 <arigato> it's the list of 10'000 strings

11:17 <arigato> it's a normal list, except it is pre-allocated to 34 million

11:17 <cfbolz> because?

11:17 <arigato> why? because it was built like that:

11:17 <arigato> http://paste.pound-python.org/show/azmGprf9P0iUupDVRx2Z/

11:18 <cfbolz> pfffffff

11:18 <arigato> RPython is too clever and thinks, oh, a loop which adds one item to the list

11:18 <cfbolz> yes

11:18 <cfbolz> even though the if is super rare

11:21 <cfbolz> arigato: so what do we do?

11:21 <cfbolz> kill this opt if there is an if?

11:21 <arigato> I don't know :-/

11:21 <arigato> maybe

11:24 <arigato> yes, if there is a path through the loop which doesn't contain an append() then we shouldn't do it

11:25 <arigato> we need to be careful about exceptions though

11:25 <arigato> but there are usually paths that exit the loop

11:26 <cfbolz> :-(

11:27 <cfbolz> arigato: maybe worth it to print a few more of the graphs where the optimization applies and check them by hand

11:28 antocuni has joined #pypy

11:30 <arigato> yes

11:32 marr has quit [Ping timeout: 240 seconds]

11:44 gabrielm has joined #pypy

11:50 cstratak has quit [Quit: Leaving]

11:52 cstratak has joined #pypy

12:00 Tiberium has quit [Remote host closed the connection]

12:00 <arigato> annoying stuff

12:00 <arigato> new_shape = [s for s in cur_shape if s != 1]

12:00 <arigato> in micronumpy

12:00 <arigato> now crashes translation

12:01 <arigato> and I fear in many other places it will be less efficient, unless we manage somehow to return a non-resizable list

12:06 <gabrielm> @arigato: I saw what you discovered earlier about file.readlines

12:06 <gabrielm> how that could be fixed?

12:06 <arigato> yes, I meant to tell you

12:06 <arigato> we need to somehow tweak the optimization

12:06 <arigato> it's not clear how, yet

12:07 Rhy0lite has joined #pypy

12:07 <arigato> it turned out to be a real issue with a detail of the RPython optimizer

12:08 <gabrielm> one of my silly ideas was to rewrite the module for realines to fix this specific issue here but would be nice to fix this on a global scale

12:08 <arigato> right, but even that is not clear

12:09 <bbot2> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4675 [plan_rich: force build, vmprof-0.4.8]

12:09 <arigato> maybe using a "while" loop instead of the "for" loop would be enough to side-step the optimization

12:09 jacob22_ has quit [Quit: Konversation terminated!]

12:09 <arigato> but yes, of course it would be nice to fix this more generally

12:10 <gabrielm> I was thinking to count the number of '\n' and write a for loop for that number and inside to write annother for loop to iterate over the chars

12:10 <LarstiQ> gabrielm: that sounds slow?

12:10 <arigato> that's RPython, so it's not slow

12:10 <gabrielm> the append would be after the inner for

12:11 <arigato> it requires two passes over the data instead of one, but well

12:11 <arigato> so yes, that's also a solution. again, a general fix is much better :-)

12:11 <gabrielm> len is not passes over the all data?

12:12 <arigato> len(string)? no, that's constant-time

12:12 <gabrielm> oh, I see

12:12 <arigato> (rpython strings are not C "char *")

12:13 <LarstiQ> collect the indexes of \n in one loop, then append those slices?

12:13 <LarstiQ> or is that what gabrielm meant

12:13 <arigato> no, that would be the same problem

12:13 <arigato> the list of indexes that you collect would be pre-allocated

12:13 <LarstiQ> meh

12:14 <arigato> that's why we should find a general fix :-)

12:14 * LarstiQ nods

12:14 <gabrielm> I didn't meant that

12:14 <LarstiQ> gabrielm: I see now how the count gets around the preallocation

12:15 <gabrielm> where is the place to start for this problem?

12:16 <arigato> I won't say "don't touch", but it's in rpython/translator/simplify.py, a large class ListComprehensionDetector that does a lot of careful checks and that works in conjunction with the flowgraph (producing unoptimized input) and the rtyper (consuming optimized output)

12:17 <arigato> so it's not really a place I would recommend to start looking inside rpython internals

12:17 Guest1244 has quit [Ping timeout: 245 seconds]

12:18 <gabrielm> I will take a look for my curiosity :D

12:18 <LarstiQ> arigato: how general is the problem, preallocating lists when doing element comparisons but storing ranges?

12:18 <arigato> :-)

12:18 <LarstiQ> or handling them

12:18 <LarstiQ> right, now the micronumpy crash makes sense

12:18 * LarstiQ talks more to himself and gets back to dayjob

12:19 <arigato> the numpy crash makes sense: it crashes because the list new_shape = [s for s in cur_shape if s != 1] is supposed to be non-resizable

12:19 <arigato> that's a property that is used a bit later, and also, a property that we'd like to keep

12:20 <arigato> it crashes if I just disable completely the optimization

12:22 marky1991 has joined #pypy

12:32 Guest1244 has joined #pypy

12:35 <bbot2> Success: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-64/builds/2231

12:43 mattip has joined #pypy

12:59 jamescampbell has joined #pypy

13:01 amaury has joined #pypy

13:03 cstratak has quit [Ping timeout: 246 seconds]

13:04 gabrielm has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

13:07 jacob22_ has joined #pypy

13:09 <mattip> hi

13:10 <mattip> there have been many changes since the last merge-from-default, but I want to release 5.8 anyway

13:11 <mattip> as is

13:11 <mattip> any objections?

13:11 <antocuni> +1 for me

13:14 jamescam_ has joined #pypy

13:18 jamescampbell has quit [Ping timeout: 255 seconds]

13:19 <mattip> I will upload and then send out one last mail

13:23 amaury has quit [Ping timeout: 246 seconds]

13:24 jiffe has quit [Quit: WeeChat 1.8]

13:27 amaury has joined #pypy

13:33 <arigato> +1

13:35 jiffe has joined #pypy

13:41 amaury has quit [Ping timeout: 260 seconds]

14:09 gabrielm has joined #pypy

14:15 <kenaan> mattip default 8e0f40a44beb /.hgtags: Added tag release-pypy2.7-v5.8.0 for changeset c925e7381036

14:15 <kenaan> mattip default d17da6177617 /.hgtags: Added tag release-pypy3.5-v5.8.0 for changeset a37ecfe5f142

14:15 <kenaan> mattip default a0bfbbf2ecb2 /pypy/tool/release/repackage.sh: repackage script - mroe robust and quieter, check hg hash, refactor for 2.7/3.5

14:16 <danchr> mattip: is this the final release ready for packaging?

14:17 <mattip> danchr: the tag should be the release version. I am repackaging the builds from nightlies for uploading to bitbucket,

14:18 <mattip> then I will send a mail and wait for failure reports

14:18 Guest1244 has quit [Ping timeout: 260 seconds]

14:19 <mattip> arigato: could you build the ppc64, ppc64le packages?

14:19 <arigato> ok

14:19 <mattip> I wonder what the state of py3.5 is on those platforms

14:19 <arigato> likely needs a couple of tweaks

14:20 <danchr> I'll see if I can find the time to write a script for packaging PyPy+dependancies on a recent OS X, but I'll probably update MacPorts first

14:21 <Alex_Gaynor> arigato: FYI the pthreads bug you found on macOS a few months ago is slowly proceeding through apple :-)

14:21 <mattip> danchr: that's for 3.5 correct? AFAIK the OS X package we have works for 2.7

14:21 <danchr> mattip: it does, but the script might be useful for bundling a newer OpenSSL

14:22 <danchr> (OS X includes a deprecated copy of OpenSSL 0.9.8)

14:22 <danchr> sorry, gotta go

14:23 <mattip> bye

14:24 <mattip> haven't seen squeaky for a while, I wonder if the portable script could be a basis for OS X packaging

14:28 marr has joined #pypy

14:29 jamescam_ has quit [Quit: Leaving...]

14:32 Guest1244 has joined #pypy

14:33 <mattip> our www.pypy.org/download.html page has build instructions that are similar but not exactly doc.pypy.org/build.html

14:36 <kenaan> mattip default 197b78d9963b /pypy/doc/build.rst: copy packaging explaination from www.pypy.org to documentation

14:36 yuyichao has quit [Ping timeout: 260 seconds]

14:37 <kenaan> mattip pypy.org[extradoc] 344a346894fb /source/download.txt: update for 5.8.0, leave 5.7 hashes for now

14:37 <mattip> not regenerating the web site till we are ready for the release

14:38 <kenaan> plan_rich default 1c08349b2ad3 /: merge branch vmprof-0.4.8

14:39 <kenaan> plan_rich vmprof-0.4.8 eaaa49735879 /: close branch

14:41 <kenaan> plan_rich vmprof-0.4.8 9f8a7fe9894a /pypy/doc/whatsnew-head.rst: document branch vmprof-0.4.8

14:51 gabrielm has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

14:59 <antocuni> plan_rich: I think that vmprof-0.4.8 will NOT go into the 5.8 release, thus probably the "if pypy_version_info > (5, 8, 0)" in vmprof needs to be updated accordingly

14:59 <plan_rich> antocuni, ok

15:00 <antocuni> thanks

15:01 <plan_rich> pypy_version_info > (5, 8, 0) should work because the release is 5.8.0 right?

15:02 * plan_rich checks the release branch of pypy

15:03 <antocuni> ah indeed

15:05 yuyichao has joined #pypy

15:13 cstratak has joined #pypy

15:16 amaury has joined #pypy

15:25 amaury has quit [Ping timeout: 240 seconds]

15:32 cstratak has quit [Remote host closed the connection]

15:32 cstratak has joined #pypy

15:34 <kenaan> mattip pypy.org[extradoc] 9220a439ce17 /source/download.txt: add 3.5 hashes

15:38 <mattip> release candidates are up on https://bitbucket.org/pypy/pypy/downloads, please test them out

15:48 vkirilichev has quit [Remote host closed the connection]

15:53 <mattip> blog post preview

15:53 <mattip> https://morepypy.blogspot.com/b/post-preview?token=xZz5hFwBAAA.ozQykQq5kg45X6dmRTir15drhlhyK38Qij9Jl86USIVGVgCkjDr3OlzCWyDtrXTqcnAR2gTASLIWe0aJfgCw2Q.bS9ppENxEBn-Luz30dsAHw&postId=739876359584854017&type=POST

15:54 <mattip> it doesn't have any "stand out" features, just lots of bug fixes and minor performance enhancements

16:00 cstratak has quit [Read error: Connection reset by peer]

16:07 <antocuni> mattip: I think we should still find a couple of feature to highlight in the blog post, else it looks like there is no difference with 5.7

16:07 <antocuni> I see in the whatsnew that we had a lot of new cpyext features: did they allow us to run some new C package?

16:08 <antocuni> this feature is maybe also worth to mention in the blog post, IMHO: "Add native PyPy support to profile frames in vmprof"

16:09 <antocuni> and, if we want something else to highlight, the improvements of faster-rstruct-2 could be stressed more (although I'm biased :)). Currently they are listed as "Speed up struck.pack, struck.pack_into"

16:10 <mattip> antocuni: ok, thanks. not sure what the status is with pybind11, or other modules that previously failed to run

16:10 <mattip> antocuni: can we put a number on that speedup somehow?

16:10 amaury has joined #pypy

16:10 <antocuni> but actually, it did more: now struct.unpack{,_from} is super-fast also on raw buffers and bytearray (before it was only for strings)

16:11 <antocuni> mattip: sure, let me write some quick microbench

16:14 realitix has quit [Quit: Leaving]

16:19 Guest1244 has quit [Ping timeout: 260 seconds]

16:21 <kenaan> mattip default 68056ea67737 /pypy/doc/release-v5.8.0.rst: add more highlights to the top-level of the release notice (hopefully accurate?)

16:22 <mattip> is the statement "We can now profile native frames in vmprof, even in JITted code" accurate?

16:22 <antocuni> mattip: https://paste.pound-python.org/show/gGchOBYUneQft8nBWjgF/

16:23 amaury has quit [Ping timeout: 260 seconds]

16:23 <antocuni> so, struct.unpack on numpypy arrays is 10x faster

16:23 <mattip> wow

16:23 <antocuni> on array.array and bytearray is ~2x faster

16:24 <antocuni> and pack_into on bytearray is 6x faster (but I measured also 8x-10x if you increase the N)

16:24 <arigato> plan_rich: no, sys.pypy_version_info > (5,8,0) is true on 5.8.0 because it's actually a longer tuple

16:24 <arigato> that's why you should never compare versions like that, but using only < or =>

16:26 <antocuni> mattip: I think that "Add native PyPy support to profile frames in vmprof" is more accurate, because native frames work only outside the JIT

16:27 <mattip> antocuni: thanks, editing

16:27 <antocuni> also, something I just thought of: the work I did for faster-rstruct-2 has been paid by Gambit (the company I do consultancy for). Would be acceptable to mention them as sponsor in the blog post?

16:28 <fijal> I believe so

16:28 <mattip> +1. Do you want to link to them?

16:29 <antocuni> yeah, the website is this: http://gambitresearch.com/

16:29 <antocuni> let me ask them if they are fine with that though (but I think they will be)

16:30 <mattip> antocuni: the project is worthy of a blog post - how you saw the slowdown, tackled the problem, microbenchmarks,

16:31 <antocuni> indeed, that's probably a good idea

16:31 * mattip waiting for approval to push the link

16:31 jacob22 has joined #pypy

16:32 jacob22_ has quit [Ping timeout: 240 seconds]

16:32 <antocuni> I asked my boss but he seems to be AFK at the moment, I'll ping you as soon as he answers

16:32 sophiya has quit [Max SendQ exceeded]

16:33 Guest1244 has joined #pypy

16:34 <arigato> did someone verify that vmprof works on the proposed 5.8.0? I wouldn't be confident after the messages here about versions

16:34 sophiya has joined #pypy

16:34 * antocuni tries

16:38 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1334

16:39 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3162 [py3.5]

16:39 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3163 [default]

16:40 <antocuni> arigato: it seems to work for me

16:40 <antocuni> although I see a lot of native frames which probably should not be there, but this is probably something which should be fixed in vmprof: http://vmprof.com/#/4951b214-d568-41fb-ade2-60c6cb02e83f

16:43 <antocuni> mattip: they said they are ok with putting a link, please proceed

16:46 amaury has joined #pypy

16:47 <kenaan> arigo default 6cbfa075de61 /rpython/: Don't preallocate RPython lists if we only know an upper bound on the final size. See comments.

16:53 pilne has joined #pypy

16:54 amaury has quit [Ping timeout: 255 seconds]

16:55 vkirilichev has joined #pypy

17:05 <kenaan> mattip default ad3681ed2701 /pypy/doc/release-v5.8.0.rst: mention Gambit and actual struct speed improvement

17:05 <mattip> antocuni: better?

17:09 <cfbolz> arigato: fix looks good. We maybe should still check what is affected

17:09 <kenaan> antocuni default 3873f54fed26 /pypy/doc/release-v5.8.0.rst: fix the name of Gambit Research

17:10 <antocuni> mattip: yes, thanks; I fixed the name of Gambit into Gambit Research

17:10 <mattip> antocuni: +1

17:13 <kenaan> mattip default f3d63bdc598d /pypy/doc/release-v5.8.0.rst: fix link

17:14 marky1991_2 has joined #pypy

17:14 <arigato> I don't know if the release document mentions it, but one of the main reasons for pushing this release was that we fixed several instabilities

17:15 marky1991 has quit [Ping timeout: 240 seconds]

17:20 <arigato> cfbolz: right, I'm producing now a list of places where it changes something...

17:22 <cfbolz> arigato: I assume we will laugh and laugh

17:22 <arigato> yes, I started already

17:22 <cfbolz> :-)

17:22 <arigato> look at (rpython.rlib.rbigint:291)frombytes

17:24 <arigato> it makes a copy of 'digits' after the loop, but if it didn't, then this would build 'long' objects with a wildly over-allocated list in them

17:25 DragonSA has joined #pypy

17:26 DragonSA has quit [Changing host]

17:26 DragonSA has joined #pypy

17:27 <arigato> (pypy.module._locale.interp_locale:22)_fixup_ulcase creates 3 small lists that are also over-allocated to 256

17:27 <cfbolz> :-(

17:31 <kenaan> mattip default 325ce0ce6e7d /pypy/doc/release-v5.8.0.rst: mention the shadowstack issue that motivated the early release, other tweaks

17:31 <mattip> arigato: thanks

17:36 kipras`away is now known as kipras

17:40 <arigato> cfbolz: http://paste.pound-python.org/show/AF0e8vE8udMlG8ahkItH/

17:43 <antocuni> uhm, consider this example: https://paste.pound-python.org/show/PrH7YNNwlegjqeEHBFjS/

17:43 <antocuni> I expected the JIT to be able to detect that "x*100" is unused and remove it altogether

17:43 <kenaan> mattip default ff44fbcd839d /pypy/doc/release-v5.8.0.rst: formatting, fix links, tweak wording

17:44 <antocuni> but looking at the trace, it is still there (even in the assembler): http://vmprof.com/#/a6949747-193a-4f05-86a3-cf0a11fcf5f9/traces

17:50 <arigato> antocuni: clearly, it needs to know if it must raise an RPython-level OverflowError

17:51 <arigato> in which case it must build the result in a much more complicated way, and *then* ignore it

17:52 <antocuni> ouch, indeed

17:52 <arigato> a possible strength reduction would be to detect int_mul_ovf(_, CONST) where the result is not used, and replace it with a range check...

17:52 <antocuni> I didn't think of the OverflowError

17:53 <antocuni> so in theory, if I use "i*100" instead of x*100 it should be able to detect that it cannot overflow

17:53 <antocuni> but apparently we don't have this optimization because I still see the int_mul_ovs

17:54 <antocuni> uh? Even if I do "i+42", I get an int_add_ovf

17:54 <arigato> uh, I'd guess it's missing a detail, like it doesn't have a lower bound for i (so it could be very negative)

17:54 <arigato> ah

17:54 <antocuni> so it means that "for i in range(2000)" doesn't produce the intbounds? :(

17:54 <arigato> that's strange

17:55 <antocuni> indeed, if I add assert i<100, i*100 produces an int_mul

17:58 <arigato> ah, no, it cannot easily do that

17:58 <cfbolz> antocuni: the range size cannot propagate into the loop, because it's allocated before

17:58 <arigato> yes, the loop only knows it's a range(x) for an unknown x

17:58 <antocuni> I see

17:59 <arigato> (but not a three-argument range, which builds a different class, or something like that; the details changed in PyPy3 too)

18:00 <antocuni> so maybe should should add an "assert return_value < self.length()" inside W_RangeObject.next?

18:00 <arigato> no, that's already information that the jit has got

18:00 <arigato> but self.length() is not a constant

18:01 <antocuni> I see

18:01 <arigato> try to replace "i+42" with "i+1"

18:01 <arigato> it should get rid of the int_add_ovf

18:02 <arigato> because the jit knows that i is smaller than x, which is also a regular integer

18:02 <antocuni> indeed, I don't see an easy solution to that, although it's a bit disappointing that "for i in range" makes worse code than "while i < N"

18:02 <arigato> "while i < 2000" precisely, yes

18:03 <antocuni> arigato: nope, even i+1 produces and int_add_ovf

18:03 <arigato> uh

18:03 <arigato> note also that "while i < 2000" won't get rid of int_mul_ovf, because i can be a large negative number

18:04 <antocuni> arigato: then we have a bug

18:04 <antocuni> because if I insert an assert i < 2000 I get an int_mul

18:04 <antocuni> ah, maybe because it already knows it is >0?

18:05 <arigato> ah no, subtle

18:05 <arigato> do you get also int_mul in the preamble?

18:05 <antocuni> yes

18:05 <arigato> ok, no clue then

18:10 <antocuni> anyway, I'm off for today

18:10 <antocuni> thanks :)

18:14 vkirilichev has quit [Remote host closed the connection]

18:15 antocuni has quit [Ping timeout: 268 seconds]

18:19 Guest1244 has quit [Ping timeout: 255 seconds]

18:19 vkirilichev has joined #pypy

18:24 amerel_h has joined #pypy

18:24 amerel_h has left #pypy ["Leaving"]

18:33 Guest1244 has joined #pypy

18:34 nimaje has joined #pypy

18:38 jamesaxl has quit [Read error: Connection reset by peer]

18:42 jamesaxl has joined #pypy

18:50 vkirilichev has quit [Remote host closed the connection]

18:51 vkirilichev has joined #pypy

18:52 arigato has quit [Quit: Leaving]

18:54 amaury has joined #pypy

18:55 oberstet has quit [Ping timeout: 255 seconds]

18:56 jamesaxl has quit [Read error: Connection reset by peer]

18:58 ssbr has quit [Ping timeout: 260 seconds]

18:58 jamesaxl has joined #pypy

19:01 amaury has quit [Ping timeout: 255 seconds]

19:09 oberstet has joined #pypy

19:11 DragonSA has quit [Remote host closed the connection]

19:15 Rhy0lite has quit [Quit: Leaving]

19:28 vkirilichev has quit [Remote host closed the connection]

19:36 vkirilichev has joined #pypy

19:58 jamesaxl has quit [Quit: WeeChat 1.7.1]

20:13 ESphynx has joined #pypy

20:14 <ESphynx> hi guys, so if we're doing this: from build_module1 import FFI, ffi_module1

20:14 <ESphynx> does that automatically imply linking with that other module ?

20:14 <ESphynx> i.e. with ffi_module2.include(ffi_module1)

20:20 Guest1244 has quit [Ping timeout: 268 seconds]

20:27 tbodt has joined #pypy

20:34 Guest1244 has joined #pypy

20:49 tbodt has quit [Max SendQ exceeded]

20:52 <ESphynx> Hmm, now this: NotImplementedError: 'struct Size' is opaque in the ffi.include(), but no longer in the ffi doing the include (workaround: don't use ffi.include() but duplicate the declarations of everything using struct Size)

20:52 tbodt has joined #pypy

20:53 tbodt has quit [Client Quit]

20:55 vkirilichev has quit [Remote host closed the connection]

20:56 tbodt has joined #pypy

20:56 vkirilichev has joined #pypy

20:57 marky1991_2 is now known as marky1991

20:57 marky1991 has quit [Changing host]

20:57 marky1991 has joined #pypy

21:04 vkirilichev has quit [Remote host closed the connection]

21:05 mattip has left #pypy ["bye"]

21:23 benbange1 has joined #pypy

21:24 sbauman_ has joined #pypy

21:24 MarkMangoba_ has joined #pypy

21:24 norox_ has joined #pypy

21:25 jacob22_ has joined #pypy

21:27 tumbleweed_ has joined #pypy

21:29 MarkMangoba has quit [Ping timeout: 240 seconds]

21:29 sbauman has quit [Ping timeout: 240 seconds]

21:29 ionelmc has quit [Ping timeout: 240 seconds]

21:29 benbangert has quit [Ping timeout: 240 seconds]

21:29 untitaker has quit [Ping timeout: 240 seconds]

21:29 jacob22 has quit [Ping timeout: 240 seconds]

21:29 norox has quit [Ping timeout: 240 seconds]

21:29 tumbleweed has quit [Ping timeout: 240 seconds]

21:29 sbauman_ is now known as sbauman

21:29 MarkMangoba_ is now known as MarkMangoba

21:29 tumbleweed_ is now known as tumbleweed

21:29 tumbleweed has quit [Changing host]

21:29 tumbleweed has joined #pypy

21:33 kipras is now known as kipras`away

21:40 marr123 has joined #pypy

21:40 marr has quit [Ping timeout: 260 seconds]

21:45 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3163 [default]

21:45 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3164 [py3.5]

21:46 <ESphynx> figured that one out :P thanks

21:46 ESphynx has left #pypy [#pypy]

21:47 vkirilichev has joined #pypy

21:52 vkirilichev has quit [Ping timeout: 258 seconds]

21:52 marr123 is now known as marr

21:59 vkirilichev has joined #pypy

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf/builds/1493

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armel/builds/1828

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-linux-armhf-raspbian/builds/1457

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-linux-armel/builds/1717

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raring/builds/1579

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/1584

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3075 [default]

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4676 [default]

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5992 [default]

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5235 [default]

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-win-x86-32/builds/1426 [default]

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3997 [default]

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-s390x/builds/600 [default]

22:05 benbange1 is now known as benbangert

22:07 oberstet has quit [Quit: Leaving]

22:16 tilgovi has joined #pypy

22:20 Guest1244 has quit [Ping timeout: 246 seconds]

22:35 Guest1244 has joined #pypy

22:39 tbodt has quit [Ping timeout: 240 seconds]

22:40 vkirilichev has quit [Remote host closed the connection]

22:58 tilgovi has quit [Quit: No Ping reply in 180 seconds.]

23:17 <bbot2> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4676 [default]

23:21 ionelmc has joined #pypy

23:22 untitaker has joined #pypy

23:30 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5992 [default]

23:42 <bbot2> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3997 [default]

23:45 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5235 [default]

23:55 tbodt has joined #pypy