#pypy on 2017-04-14 — irc logs at freenode.irclog.whitequark.org

00:00 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/509 [default]

00:00 <simpson> TheMontyChrist: People ask for one every so often. You have a need for a massive heap on Win64?

00:01 <TheMontyChrist> given python is slow probably not

00:01 <TheMontyChrist> was thinking of loading 8+gb of data

00:01 <TheMontyChrist> but when I think about it

00:01 <TheMontyChrist> I don't think it's up for the job

00:01 <TheMontyChrist> nvm

00:03 yuyichao has quit [Ping timeout: 240 seconds]

00:04 <simpson> TheMontyChrist: Sounds like you don't want to try new things.

00:05 lritter has quit [Ping timeout: 260 seconds]

00:07 ramonvg has quit [Ping timeout: 255 seconds]

00:08 <simpson> TheMontyChrist: Have you read http://doc.pypy.org/en/latest/windows.html#what-is-missing-for-a-full-64-bit-translation yet?

00:10 johncc3_ has joined #pypy

00:11 <bbot2_> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raring/builds/1522

00:17 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

00:22 yuyichao has joined #pypy

00:27 tbodt has joined #pypy

00:27 vkirilichev has joined #pypy

00:50 <bbot2_> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armel/builds/1771

00:58 jcea has quit [Quit: jcea]

01:00 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4534 [py3.5]

01:00 <bbot2_> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5859 [py3.5]

01:00 <bbot2_> Started: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5120 [py3.5]

01:00 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3882 [py3.5]

01:02 TheMontyChrist has quit [Quit: Page closed]

01:04 asmeurer__ has joined #pypy

01:16 asmeurer__ has quit [Quit: asmeurer__]

01:19 <bbot2_> Success: http://buildbot.pypy.org/builders/build-pypy-c-linux-armhf-raspbian/builds/1402

01:19 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-raspbian/builds/1335

01:20 johncc3_ has quit [Ping timeout: 260 seconds]

01:34 johncc3_ has joined #pypy

01:35 <bbot2_> Success: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/509 [default]

01:38 kipras is now known as kipras`away

01:45 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

01:47 <bbot2_> Success: http://buildbot.pypy.org/builders/build-pypy-c-linux-armel/builds/1661

01:47 tbodt has joined #pypy

01:56 asmeurer__ has joined #pypy

01:57 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/2960 [default]

01:57 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/2961 [py3.5]

01:58 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/2961 [py3.5]

02:06 ArneBab has joined #pypy

02:11 ArneBab_ has quit [Ping timeout: 260 seconds]

02:11 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4534 [py3.5]

02:15 <bbot2_> Success: http://buildbot.pypy.org/builders/own-linux-s390x/builds/540 [default]

02:27 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

02:31 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3882 [py3.5]

02:41 <bbot2_> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5859 [py3.5]

02:55 <bbot2_> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/1525

03:00 <bbot2_> Started: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-64/builds/2177

03:00 <bbot2_> Started: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/2969

03:00 tbodt has joined #pypy

03:05 asmeurer__ has quit [Quit: asmeurer__]

03:13 <bbot2_> Failure: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5120 [py3.5]

03:24 derwolfe has left #pypy [#pypy]

03:25 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

03:34 tbodt has joined #pypy

03:35 tbodt has quit [Client Quit]

03:37 tbodt has joined #pypy

03:42 asmeurer__ has joined #pypy

04:00 oberstet2 has joined #pypy

04:03 oberstet3 has quit [Ping timeout: 245 seconds]

04:09 asmeurer__ has quit [Quit: asmeurer__]

04:09 mattip has joined #pypy

04:10 <mattip> hi

04:10 <mattip> buildbots are showing a regression in own testing, http://buildbot.pypy.org/summary?branch=%3Ctrunk%3E

04:12 <mattip> maybe after merge of "branch-prediction"?

04:41 inhahe_ has quit [Read error: Connection reset by peer]

04:42 inhahe_ has joined #pypy

04:44 jacob22 has quit [Quit: Konversation terminated!]

04:53 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

04:54 vkirilichev has quit [Ping timeout: 260 seconds]

05:10 jamadden has quit [Quit: Leaving.]

05:30 bogner has quit [Ping timeout: 240 seconds]

05:37 bogner has joined #pypy

05:38 mattip has left #pypy ["bye"]

05:51 vkirilichev has joined #pypy

06:17 realitix has joined #pypy

06:32 forgottenone has joined #pypy

06:47 <bbot2_> Failure: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf/builds/1436

06:47 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1278

06:58 brechtm has quit [Remote host closed the connection]

06:58 vkirilichev has quit [Ping timeout: 252 seconds]

06:58 brechtm has joined #pypy

07:01 arigato has joined #pypy

07:15 kenaan_ has joined #pypy

07:15 <kenaan_> arigo pypy.org[extradoc] 3ab96624f0d2 /don1.html: update the values

07:16 arigato has quit [Quit: Leaving]

07:16 arigato has joined #pypy

07:39 DragonSA has joined #pypy

07:54 vkirilichev has joined #pypy

07:58 vkirilichev has quit [Ping timeout: 240 seconds]

08:24 <bbot2_> Success: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/2969

08:38 oberstet2 has quit [Ping timeout: 258 seconds]

08:42 antocuni has joined #pypy

08:48 marr has joined #pypy

08:58 vkirilichev has joined #pypy

09:00 ramon has joined #pypy

09:00 ramon is now known as Guest77387

09:05 oberstet2 has joined #pypy

09:06 <haypo> ronan: morning! i ran performance benchmarks on speed-python: 250 values per run x 10 runs (run=process) -- http://www.haypocalc.com/tmp/pypy_p10_w0_n250.json.gz

09:06 <haypo> ronan: i will use these data to choose the number of warmups

09:08 <haypo> ronan: you can use my new plot.py script to plots these values

09:08 <haypo> ronan: example "python3 doc/examples/plot.py ~/pypy_p10_w0_n250.json.gz -b telco --split-runs --skip=1" gives http://www.haypocalc.com/tmp/telco.png

09:10 Guest77387 is now known as ramonvg

09:13 <arigato> nice regular spikes

09:19 <haypo> arigato: lovely, isn't it? :-)

09:20 <haypo> arigato: for this one, i think that i need to compute enough values per process to "smooth" the spikes

09:22 <haypo> it's a cycle of 7 values

09:22 <haypo> it seems like using first 32 values for warmup should be enough to reach the "steady state"

09:22 <haypo> i think that we should compute at least 5 cycles, so 35 values per process

09:23 <haypo> but i don't have time to analyze everything right now, i would prefer to do that next week ;)

09:23 oberstet2 has quit [Ping timeout: 260 seconds]

09:24 johncc3_ has quit [Ping timeout: 240 seconds]

09:27 <antocuni> haypo: where do I find your plot.py?

09:28 vkirilichev has quit [Ping timeout: 258 seconds]

09:28 <haypo> antocuni: download https://github.com/haypo/perf/raw/master/doc/examples/plot.py

09:29 <haypo> currently, it's just an example in the doc: http://perf.readthedocs.io/en/latest/examples.html#plot

09:30 <antocuni> thanks

09:31 commandoline has quit [Quit: Bye!]

09:32 <haypo> i started to play with the changepoint package of R

09:32 <haypo> but i don't get what I want, it seems like i don't pass the right parameters

09:34 <antocuni> haypo: sorry, I don't understand how I am supposed to generate ~/pypy_p10_w0_n250.json.gz :)

09:34 <haypo> antocuni: ah, you want to regenerate it? it took a whole night to compute it

09:35 <antocuni> ouch

09:35 <antocuni> well, maybe just a subset

09:35 <haypo> antocuni: ~/pypy2-v5.7.1-linux64/bin/pypy -m performance run -o pypy_p10_w0_n250.json -v

09:35 <haypo> antocuni: i used this command

09:35 <haypo> antocuni: oh

09:35 <haypo> antocuni: wait, i had to modify the performance module :)

09:35 <haypo> antocuni: http://paste.alacon.org/43707

09:36 <antocuni> basically, I wanted to see what happens if you do "pypyjit.set_param('off')" after some iterations

09:36 <haypo> antocuni: by default, performance tries to be nice and chooses parameters for you, but in my case, i wanted to always use: 10 processes, 0 warmup, 250 values (per process)

09:37 johncc3_ has joined #pypy

09:37 <haypo> antocuni: more generally, to use performance, there is a doc! http://pyperformance.readthedocs.io/usage.html

09:37 <antocuni> so, in the plot you pasted above, there are 10 different lines, one per process?

09:37 <haypo> antocuni: yes, one per process

09:37 <haypo> antocuni: i expected to see different results per process, and it's the case, even if i didn't reboot between each run

09:37 <antocuni> right

09:40 <haypo> antocuni: using --skip, you can ignore the first N values per run. on the go benchmark, one run was faster. it seems that we reached the steady state after 58 values: http://www.haypocalc.com/tmp/go.png

09:40 <haypo> haypo@selma$ python3 doc/examples/plot.py ~/pypy_p10_w0_n250.json.gz -b go --split-runs --skip=58

09:42 <haypo> at least, it confirms that it's good idea in perf to use multiple processes :-D

09:43 <antocuni> yeah, indeed. Do you see such a high variation also for CPython, or only pypy?

09:45 <haypo> antocuni: various between two processes? some performance microbenchmarks have medium variation between runs on CPython, but I removed microbenchmarks yesterday :-D

09:46 <haypo> antocuni: all CPython data (compressed JOSN files) are available at https://github.com/haypo/performance_results

09:46 <haypo> JSON*

09:46 <antocuni> wow, you are doing a very nice job, congrats

09:47 <haypo> to be honest, i didn't looked at CPython individual runs in depth

09:47 <haypo> antocuni: well, PyPy benchmark already produced a JSON file, but only stored the result

09:47 <haypo> antocuni: for me, it's important to store *all* data to allow deep analysis later

09:48 <haypo> antocuni: for example, i modified the "perf stats" command to count the number of outliers. it's now possible to compute that on *old* JSON files, without having to recompute these data

09:48 <haypo> which is nice since it takes 1 hour to compute a JSON file on CPython :)

09:49 <haypo> performance_results/2017-03-31-cpython/ contains 44 files, so it took something like 44 hours to compute all data :-p

09:49 <haypo> antocuni: ronan noticed some variation between values in the same process, like http://pyperformance.readthedocs.io/benchmarks.html#html5lib & http://pyperformance.readthedocs.io/benchmarks.html#sympy

09:50 <haypo> antocuni: ^^ on CPython

09:51 <antocuni> ok, managed to start my benchmark run, finally :)

09:52 <haypo> antocuni: if you only view data through an histogram, telco on CPython seems "perfect": nice gaussian curve, http://www.haypocalc.com/tmp/cpython_telco_hist.png

09:52 <haypo> antocuni: but now i hate you because i looked at invidial runs, and it's a mess: http://www.haypocalc.com/tmp/cpython_telco_runs.png

09:53 <haypo> oops, doc/examples/hist_scipy.py is outdated, it used median instead of mean

09:54 oberstet2 has joined #pypy

09:55 <haypo> antocuni: "wow, you are doing a very nice job, congrats" thank you :) i'm now trying to convince "you" (PyPy devs) to use my tools ;)

10:06 <antocuni> haypo: I'm puzzled; I modified performance/run.py by adding " cmd.extend(('-w0', '-p1', '-n100'))"

10:06 <antocuni> and I ran "pyperformance run -b telco -p `which pypy` -o pypy-telco.json"

10:06 <antocuni> but when I plot it, it says it has only 60 values

10:07 <antocuni> I expected 100

10:07 <haypo> antocuni: do you see these parameters passed on printed commands: "Running ..." ?

10:07 <antocuni> INFO:root:Running `/home/antocuni/pypy/perf/venv/pypy2.7-54f5305eeb69/bin/python -u /home/antocuni/pypy/perf/venv/pypy2.7-54f5305eeb69/site-packages/performance/benchmarks/bm_telco.py --output /tmp/tmpd1LxtL`

10:07 <haypo> antocuni: ah wait, there is a trick :)

10:07 <antocuni> uhm, seems now

10:07 <antocuni> *not

10:07 <haypo> antocuni: performance starts by installing itself :-)

10:07 <antocuni> ah :(

10:07 <haypo> antocuni: depending how you first installed performance on your system, it installs its local copy or download a fresh copy from PyPI

10:08 <antocuni> I applied the patch directly on the copy in my site-packages

10:08 <haypo> antocuni: if you want to hack performance, i suggest you to use 3 steps

10:08 <haypo> antocuni: 1) performance ... venv create

10:08 <haypo> antocuni: 2) hack performacen in the created virtual environment

10:08 <haypo> antocuni: 3) run the benchmark

10:08 jcea has joined #pypy

10:08 <antocuni> ok, I'll do

10:08 <antocuni> I have to run now

10:08 <antocuni> thanks!

10:09 <haypo> antocuni: performance creates a virtualenv to get a known list of modules. otherwise, some magic .pth files can have an impact on benchmarks

10:09 vkirilichev has joined #pypy

10:09 <haypo> antocuni: for example, slowdown python startup, which is measured by python_startup benchmark

10:12 brechtm_ has joined #pypy

10:12 brechtm has quit [Read error: Connection reset by peer]

10:13 antocuni has quit [Ping timeout: 255 seconds]

10:25 forgottenone has quit [Quit: Konversation terminated!]

10:26 forgottenone has joined #pypy

10:35 mattip has joined #pypy

10:36 brechtm has joined #pypy

10:39 brechtm_ has quit [Ping timeout: 252 seconds]

10:40 <mattip> finally, I got a segfault to happen only with threading and numpy, no cython needed

10:43 * mattip updating issue 2530

10:58 ramonvg has quit [Ping timeout: 260 seconds]

10:59 tormoz has joined #pypy

11:01 jamadden has joined #pypy

11:09 forgottenone has quit [Quit: Konversation terminated!]

11:11 forgottenone has joined #pypy

11:21 <LarstiQ> haypo: reading backlog, seems you're getting in a good spot

11:23 forgottenone has quit [Quit: Konversation terminated!]

11:24 forgottenone has joined #pypy

11:36 demonimin has quit [Remote host closed the connection]

11:37 commandoline has joined #pypy

11:40 demonimin has joined #pypy

11:41 jamesaxl has quit [Ping timeout: 260 seconds]

11:42 johncc3_ has quit [Ping timeout: 258 seconds]

11:44 jamesaxl has joined #pypy

11:44 brechtm has quit [Remote host closed the connection]

11:45 brechtm has joined #pypy

11:56 johncc3_ has joined #pypy

12:04 antocuni has joined #pypy

12:19 vkirilichev has quit [Read error: Connection reset by peer]

12:19 vkirilic_ has joined #pypy

12:21 adamholmberg has joined #pypy

12:23 vkirilic_ has quit [Read error: Connection reset by peer]

12:25 vkirilichev has joined #pypy

12:25 jacob22_ has joined #pypy

12:35 forgottenone has quit [Quit: Konversation terminated!]

12:35 <bbot2_> Success: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-64/builds/2177

12:36 forgottenone has joined #pypy

12:40 vkirilichev has quit [Remote host closed the connection]

12:58 girish946 has joined #pypy

12:58 forgottenone has quit [Quit: Konversation terminated!]

13:00 demonimin_ has joined #pypy

13:00 forgottenone has joined #pypy

13:03 demonimin has quit [Ping timeout: 260 seconds]

13:07 <antocuni> arigato, haypo: I think this is interesting: https://gist.github.com/antocuni/e3d2172c5e9273c0ec0a313b73e74415#file-pypy-telco-png

13:08 <antocuni> I run the telco benchmarks three times: 1) normally; 2) I disabled the jit after 100 iterations; 3) I called gc.collect *before* each iteration

13:08 <antocuni> (I wrote my own hackish runner because I could not find a way to modify perf and/or pyperformance to do what I wanted

13:09 <antocuni> by looking at the graph I think we can see that:

13:09 <antocuni> 1) the spikes are caused by gc collections, NOT jit compilations

13:11 <antocuni> 2) if we disable the JIT, the performance drop. This is a bit unexpected: maybe it means that there is some guard which constantly fail and thus cause the JIT to compile again and again the same code paths?

13:12 * antocuni tries an additional run with both gc.collect and jit-off-after-100

13:13 <haypo> antocuni: ah, i forgot to explain you that you don't need performance to run bm_telco.py. it's a standalone script. the script directly accepts -w0 -p10 -n250 options

13:13 <haypo> antocuni: see http://perf.readthedocs.io/en/latest/runner.html for all options, they are many of them :)

13:13 commandoline has quit [Quit: Bye!]

13:13 <haypo> antocuni: i'm not sure that calling gc.collect() is "correct"

13:14 <antocuni> haypo: sure, it is not correct at all

13:14 <haypo> i should describe somewhere what i want from performance

13:15 commandoline has joined #pypy

13:15 <antocuni> but it's a very different thing than JIT warmup: in case of the JIT, we can assume that after a (maybe arbitrarily long) warmup phase the performance stabilizes

13:15 <haypo> in short, benchmarks should be representative of "real" applications and be run as users run real code

13:15 <antocuni> if it's the GC, it means that the GC cost should be spread all over the iterations

13:15 <haypo> antocuni: the question is more why a GC collection is needed. GC is only supposed to be required to break cycles, no?

13:15 <antocuni> haypo: in pypy not at all

13:15 <antocuni> the gc runs constantly

13:15 <haypo> so, it looks closer to a bug in the decimal module

13:16 <haypo> antocuni: i mean, if you don't have cycles, the GC is supposed to do nothing, no?

13:16 <antocuni> no

13:16 <antocuni> pypy doesn't have refcount

13:16 <haypo> hum, ok

13:16 <antocuni> the only way to claim memory is by running the GC

13:17 <antocuni> and we have two phases: minor collections (which run often, probably multiple times during the execution of a benchmark)

13:17 <antocuni> and major collections, which are slower and runs less often

13:17 <antocuni> I bet that the spikes we see are because a GC major collection happen to run every N iterations

13:18 <haypo> antocuni: yeah, i wouldn't be surprised to see a correlation between GC major collection and spikes

13:18 <antocuni> yeah

13:19 <antocuni> basically, what I wanted to say is that in this particular case, the benchmark probably DO warm up, and so it is fine to take the average after N iterations

13:20 <arigato> ("the only way to reclaim memory is by running the GC" => note that 80% or 90% of the objects are dead already at a minor collection, and the minor collection algorithm needs some steps per *alive* object, making reclaiming free objects exactly zero cost)

13:21 <antocuni> arigato: sure, I was talking about the GC in general, not only major collections

13:21 <arigato> (i.e. it's more efficient than calling malloc() and free() even without counting the overhead of the reference counter in CPython)

13:21 <arigato> just want to make sure haypo doesn't get the wrong impression :-)

13:21 <antocuni> ok

13:21 necaris has joined #pypy

13:21 necaris is now known as necaris[away]

13:22 <arigato> "the GC" inside CPython is a very particular beast from the general GCs elsewhere

13:23 necaris[away] is now known as necaris

13:23 <antocuni> anyway, all of this is another hint that we cannot use a single number to represent "pypy speed for a given benchmark"

13:25 * arigato didn't fully read the conversation

13:25 <haypo> antocuni: "it is fine to take the average after N iterations" hum, it's more a requirement than just being fine

13:25 <haypo> antocuni: i want to include spikes in the result

13:25 <haypo> antocuni: we had long and painful discussions about mean vs median for exemple, and at the end, i decided to choose mean

13:26 <antocuni> I know, I'm not saying that it's wrong

13:27 Rhy0lite has joined #pypy

13:27 <antocuni> but for example, it means that you have to distinguish "JIT spikes" vs "GC spikes" to compute the warmup

13:31 <haypo> antocuni: ah, it should be easy to distinguish them: just disable the GC as you did, no?

13:32 <antocuni> yes, although you cannot "disable" the GC (else you run out of memory pretty quickly). What I did was to force a major collection before the run, to make sure that it didn't happen by chance inside the benchmark

13:33 <haypo> antocuni: ah yes, that's different and more reliable :)

13:34 <haypo> antocuni: gc.disable() can also behave differently :-)

13:34 <antocuni> this works because this particular benchmark does not allocate much memory. If a benchmark allocates a lot of memory, it might cause a full major collection on its own. But running gc.collect() before ensures that it always start from a "clean state"

13:35 <antocuni> another way to see this is: in PyPy, in general, allocating memory costs a bit of time, but the cost is delayed and you see it only when the major collection occurs

13:35 <antocuni> (the allocation itself is very quick; the "cost" is given by the fact that the more you allocate, the more often the GC runs)

13:36 <antocuni> although of all this is very imprecise of course. In particular, if the object dies quickly enough, it is collected in the minor collection and thus does not affect the speed of the next major collection

13:37 <antocuni> but in general, it is correct enough to say that "the more you allocate, the more you spend later in the GC"

13:40 girish946 has quit [Ping timeout: 255 seconds]

13:41 vkirilichev has joined #pypy

13:44 <arigato> I think it's wrong to call gc.collect() and not put it in any time

13:45 <antocuni> arigato: I agree. This was just to show that the spikes were caused by the GC, not by the JIT

13:46 <haypo> arigato: sorry, what do you mean by "not put it in any time"

13:52 girish946 has joined #pypy

13:52 brechtm has quit [Read error: No route to host]

13:53 brechtm has joined #pypy

13:57 johncc3_ has quit [Ping timeout: 255 seconds]

13:59 <haypo> d

14:06 <kenaan_> arigo default b13b7c8e4fe7 /rpython/translator/c/src/debug_print.c: Call fflush() after writing an end-of-section to the log file. Hopefully, this should remove the constant problem t...

14:06 <arigato> haypo: "not account for it"

14:07 <haypo> antocuni, arigato : i just wrote http://pyperformance.readthedocs.io/usage.html#what-is-the-goal-of-performance to write down what i want :)

14:07 lritter has joined #pypy

14:07 <haypo> so: gc enabled, no gc.collect(), ASLR enabled, ignore warmups, etc.

14:08 <haypo> i also write it for ronan who wants to include warmups :)

14:09 johncc3_ has joined #pypy

14:14 vkirilichev has quit [Ping timeout: 240 seconds]

14:23 brechtm_ has joined #pypy

14:26 brechtm has quit [Ping timeout: 252 seconds]

14:26 yuyichao has quit [Ping timeout: 258 seconds]

14:27 <mattip> arigato: ping (buildbot own test failures)There are some own test failures on default, linux 32/64 - stress tests

14:35 <arigato> mattip: ouch

14:36 <arigato> looks likely to be branch-prediction

14:36 <mattip> likely, but painful to find

14:37 DragonSA has quit [Quit: Konversation terminated!]

14:37 <mattip> I just wanted to point it out before the last good version goes off buildbot reports

14:37 <mattip> s/reports/summary/

14:37 <arigato> ah, no

14:37 <arigato> found it

14:38 <arigato> yes, thanks

14:40 yuyichao has joined #pypy

14:48 inhahe_ has quit [Read error: Connection reset by peer]

14:48 inhahe_ has joined #pypy

14:52 <mattip> maybe progress - in gdb I found a live PyObject with refcount== REFCNT_FROM_PYPY

14:53 <mattip> which AFAICT should not happen

14:56 <kenaan_> arigo default f0ba81de1e4f /rpython/jit/backend/x86/codebuf.py: Fix for untranslated tests

14:56 * mattip bye

14:56 mattip has left #pypy ["bye"]

14:56 <arigato> mattip (logs): it happens if there is no ref from CPython, only from PyPy

14:59 antocuni has quit [Ping timeout: 252 seconds]

14:59 marky1991 has joined #pypy

15:00 realitix has quit [Quit: Leaving]

15:12 vkirilichev has joined #pypy

15:12 John has joined #pypy

15:12 <John> hi all

15:12 <arigato> hi

15:12 <John> it seems that when reading from the sys.stdin, even if i use os.read(sys.stdin.fileno(),4) i cannot guarentee buffering is turned off

15:12 <John> and that only 4 bytes will actually be read

15:13 <danchr> odd; I get an exception saying "can't use a named cursor outside of transactions" when running my django app with psycopg2cffi

15:13 <danchr> if I just delete the check that raises the exception, everything appears to work

15:13 <arigato> John: os.read(_, 4) should never return more than 4 bytes, AFAIK

15:14 <John> arigato: it only reads 4 bytes, but it seems to read more from _ than 4, and presumably stores them in a buffer somewhere

15:14 <John> using "python -u" when running the program doesn't seem to turn buffering off either

15:15 <arigato> os.read() is directly calling the OS read() function. if that fails, then that's strange

15:15 <John> It's not failing, it's just buffering

15:15 <John> i will make a demo :)

15:16 <arigato> please do, I don't believe you :-) i.e. I think there is a different issue

15:21 <John> hahah, most likely :)

15:21 <John> So far, my demo has been unable to reproduce :P

15:21 <nimaje> John: is stdin the terminal or something else?

15:22 <John> The script is being run via "cat myfile | python myscript.py"

15:23 <arigato> how do you know there is buffering or not, in this situation?

15:25 <arigato> the pipe alone contains an OS-internal buffer, and cat also reads and writes in chunks

15:25 <John> right, right - and that's probably OK

15:25 <John> myscript will read the first 4 bytes of whatever it's getting via stdin, and then pipe the rest to a subprocess

15:26 <arigato> ah

15:26 <John> and what "the rest" is sees to be somewhat random. usually it's the 5th byte and on, but occasionally it's something else

15:28 <arigato> do you know it's something else from later in the pipe, or could it also be from earlier---i.e. os.read(_, 4) in the parent returned less than 4 bytes?

15:30 jamesaxl has quit [Read error: Connection reset by peer]

15:31 vkirilichev has quit [Ping timeout: 240 seconds]

15:31 <nimaje> John: why don't you use 'tail -c +4'?

15:32 <nimaje> s/+4/+5/

15:33 jamesaxl has joined #pypy

15:33 <John> arigato: Good idea, but I don't think that's it as i'm checking that the four bytes are what they are supposed to be

15:34 <arigato> ok

15:34 <John> nimaje: i'm reading the first 4 bytes from stdin, and if it's a GZIP file, subprocess gzip, if it's a XYZ file, subprocess xyz, etc

15:34 <John> So it's not about avoiding those four bytes, it's about reading them, then deciding what to subprocess :)

15:36 <nimaje> ok, does the subprocess don't need those bytes?

15:38 <John> it does! :D That was the hardest bit about this code

15:39 <John> the subprocess is: " { printf "abcd"; cat; } | gzip "

15:39 <John> Which reliably gives gzip the four bytes 'abcd' before the stdin

15:39 <John> But the stdin is not reliably the 5th + btyes

15:40 <John> It's really really ugly, but without a way to read from the stdin without changing it, or even see whats in python's buffers, it's really tricky to find a better way

15:43 girish946 has quit [Quit: Leaving]

15:50 <John> I found the cause! And it's super weird xD

15:50 <John> Somehow, making a call to subprocess is the problem

15:50 <John> I will make the demo

15:56 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-armhf-raspbian/builds/1335

16:00 <John> OK here's the code: https://paste.ofcode.org/yzgXbw65A7rHK8y248HzAa

16:00 <John> Switching bug from False to True turns the bug on

16:00 <arigato> ah, "stdin=sys.stdin" I guess fails

16:01 <arigato> you should try without that

16:01 <John> You mean rename it?

16:01 <arigato> no, leave out the argument completely

16:01 <John> oh, but i need it :(

16:01 <arigato> the stdin of the subprocess is by default the same as your own stdin

16:01 <John> I'm sending the subprocess the stdin of the python program

16:01 <John> ahhh

16:02 <John> hehe, cool - your right it's not needed

16:02 <John> but unfortunately it still gives the bug :/

16:02 <arigato> ok

16:02 <John> To demo the code yourself, i am running this like "cat some_file.gz | pypy python_buffer.py"

16:03 <John> But you have to run it 10+ times until you see the output randomly change

16:03 <John> but only when bug = True

16:03 <John> with bug = False the output is always the same

16:03 <John> well, i tried 100 times :P

16:04 <John> Maybe a flying spagetti monster will break it on the 101st time ;)

16:04 <arigato> does it work if you don't use subprocess at all? like, os.execvp("bash", ["bash", "-c", "the-same-command-line"])

16:06 <arigato> ah

16:06 <arigato> wait a sec, what are you doing in the "if bug"?

16:06 <arigato> you call gzip and immediately kill it??

16:06 <arigato> of course if you're unlucky it can start reading some bytes

16:07 <arigato> from the same stdin

16:07 <nimaje> he doesn't start the process or do I understand something wrong?

16:08 <John> It appears to work with os.execvp

16:08 <John> arigato: ahhhhhh

16:08 <John> you're a genius :D

16:09 <John> That's probably it

16:09 <John> the if part is to see if gzip is installed

16:10 <John> (because the other process uses shell=True)

16:10 <John> (because it uses { } and | )

16:11 <John> Woop! That fixes it!

16:11 <John> stdin=DEVNULL fixes it

16:11 <arigato> :-)

16:12 <John> So this only happens on PyPy, on CPython it doesn't seem to happen

16:12 <John> But i don't care - it now works on both <3

16:12 <John> Thanks arigato

16:12 <arigato> I'm sure you're lucky on CPython

16:12 DragonSA has joined #pypy

16:12 DragonSA has quit [Changing host]

16:13 <arigato> e.g. on PyPy maybe it happens only if there is a pause of a fraction of a millisecond at the wrong point

16:13 <John> Yeah, my first thought was that CPython is probably too slow

16:13 <John> lol

16:13 <arigato> the opposite in this case---CPython probably reaches p.kill() consistently quickly

16:14 <John> I seeee~ interesting

16:14 DragonSA has quit [Client Quit]

16:24 John has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

16:27 vkirilichev has joined #pypy

16:28 ramonvg has joined #pypy

16:28 forgottenone has quit [Quit: Konversation terminated!]

16:38 untitaker has quit []

16:39 untitaker has joined #pypy

16:45 arigato has quit [Read error: Connection reset by peer]

16:46 arigato has joined #pypy

16:55 vkirilichev has quit [Ping timeout: 240 seconds]

16:58 brechtm_ has quit [Remote host closed the connection]

17:00 asmeurer__ has joined #pypy

17:01 asmeurer__ has quit [Client Quit]

17:02 forgottenone has joined #pypy

17:02 yuyichao has quit [Ping timeout: 258 seconds]

17:04 brechtm has joined #pypy

17:05 tbodt has joined #pypy

17:08 brechtm has quit [Ping timeout: 240 seconds]

17:18 yuyichao has joined #pypy

17:19 johncc3_ has quit [Ping timeout: 260 seconds]

17:20 <nanonyme> arigato, I'm personally a bit annoyed subprocess interface didn't have stdin=None, stdout=None, stderr=None to mean that they are all redirected to dev null

17:21 <nanonyme> (but instead None == same as default)

17:21 necaris is now known as necaris[away]

17:22 necaris[away] has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

17:22 <nanonyme> But yeah, no one said subprocess module was exactly good

17:28 jamesaxl has quit [Read error: Connection reset by peer]

17:29 jamesaxl has joined #pypy

17:31 johncc3_ has joined #pypy

17:32 antocuni has joined #pypy

17:32 necaris has joined #pypy

17:51 vkirilichev has joined #pypy

17:52 <kenaan_> antocuni extradoc c812f32e4682 /talk/ep2017/the-joy-of-pypy-jit.txt: my ep2017 proposal

17:53 <antocuni> arigato: ^^^ I submitted this proposal before I forget the deadline. If you have feedback or suggestions, please tell me :) (maybe tomorrow or by email because now I'm off)

17:58 antocuni has quit [Ping timeout: 268 seconds]

18:01 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

18:05 vkirilichev has quit [Ping timeout: 240 seconds]

18:06 yuyichao has quit [Ping timeout: 260 seconds]

18:12 yuyichao has joined #pypy

18:27 asmeurer__ has joined #pypy

18:34 tbodt has joined #pypy

18:39 tbodt has quit [Read error: Connection reset by peer]

18:40 tbodt has joined #pypy

18:44 johncc3_ has quit [Quit: Leaving]

18:45 tbodt has quit [Ping timeout: 260 seconds]

18:50 asmeurer__ has quit [Quit: asmeurer__]

18:55 tbodt has joined #pypy

18:57 kipras`away is now known as kipras

19:02 vkirilichev has joined #pypy

19:05 Rhy0lite has quit [Quit: Leaving]

19:15 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1278

19:20 jamesaxl has quit [Read error: Connection reset by peer]

19:21 jamesaxl has joined #pypy

19:22 vkirilichev has quit [Remote host closed the connection]

19:22 vkirilichev has joined #pypy

20:06 black_ant has joined #pypy

20:23 black_ant has quit [Ping timeout: 260 seconds]

20:25 marky1991 has quit [Read error: Connection reset by peer]

20:36 black_ant has joined #pypy

20:41 black_ant has quit [Ping timeout: 252 seconds]

20:46 ramonvg has quit [Ping timeout: 240 seconds]

20:54 black_ant has joined #pypy

21:09 lritter has quit [Quit: Leaving]

21:10 black_ant has quit [Ping timeout: 258 seconds]

21:13 necaris is now known as necaris[away]

21:14 arigato has quit [Ping timeout: 252 seconds]

21:14 necaris[away] has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

21:21 forgottenone has quit [Quit: Konversation terminated!]

21:22 necaris has joined #pypy

21:22 necaris is now known as necaris[away]

21:23 forgottenone has joined #pypy

21:24 vkirilichev has quit [Remote host closed the connection]

21:37 necaris[away] has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

21:38 nimaje1 has joined #pypy

21:38 nimaje1 is now known as nimaje

21:38 nimaje has quit [Killed (verne.freenode.net (Nickname regained by services))]

21:54 forgottenone has quit [Quit: Konversation terminated!]

21:55 vkirilichev has joined #pypy

21:55 ramonvg has joined #pypy

21:56 adamholmberg has quit [Remote host closed the connection]

21:57 adamholmberg has joined #pypy

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/own-linux-s390x/builds/541 [default]

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/2962 [default]

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4535 [default]

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5860 [default]

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5121 [default]

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3883 [default]

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/jitbackendonly-own-linux-armhf/builds/1437

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armel/builds/1772

22:00 vkirilichev has quit [Ping timeout: 268 seconds]

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raring/builds/1523

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/build-pypy-c-linux-armhf-raspbian/builds/1403

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/build-pypy-c-linux-armel/builds/1662

22:00 <bbot2_> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/1526

22:01 adamholmberg has quit [Ping timeout: 252 seconds]

22:07 forgottenone has joined #pypy

22:21 asmeurer__ has joined #pypy

22:22 forgottenone has quit [Ping timeout: 260 seconds]

22:24 vkirilichev has joined #pypy

22:28 jamesaxl has quit [Quit: WeeChat 1.7]

22:41 Marqin has quit [Ping timeout: 264 seconds]

22:41 ramonvg has quit [Quit: Lost terminal]

22:42 asmeurer__ has quit [Quit: asmeurer__]

22:53 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/4535 [default]

22:58 vkirilichev has quit [Remote host closed the connection]

23:04 <bbot2_> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/3883 [default]

23:16 forgottenone has joined #pypy

23:17 asmeurer__ has joined #pypy

23:21 adamholmberg has joined #pypy

23:23 <bbot2_> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/5860 [default]

23:26 adamholmberg has quit [Ping timeout: 252 seconds]

23:27 asmeurer__ has quit [Quit: asmeurer__]

23:28 asmeurer_ has joined #pypy

23:37 <bbot2_> Failure: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5121 [default]

23:38 asmeurer_ has quit [Quit: asmeurer_]

23:40 forgottenone has quit [Quit: Konversation terminated!]

23:58 vkirilichev has joined #pypy