#pypy on 2018-05-10 — irc logs at freenode.irclog.whitequark.org

2018-02-26 15:52 cfbolz changed the topic of #pypy to: PyPy, the flexible snake (IRC logs: https://botbot.me/freenode/pypy/ ) | use cffi for calling C | the secret reason for us trying to get PyPy users: to test the JIT well enough that we're somewhat confident about it

00:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/856 [default]

00:08 nimaje has joined #pypy

00:10 jcea has joined #pypy

00:10 marr has quit [Ping timeout: 260 seconds]

00:12 antocuni has quit [Ping timeout: 255 seconds]

00:17 <bbot2> Failure: http://buildbot.pypy.org/builders/own-win-x86-32/builds/1777 [default]

00:25 [Arfrever] has joined #pypy

00:33 julius has left #pypy ["Leaving"]

00:41 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

00:44 tbodt has joined #pypy

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3764 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/5469 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6746 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3900 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5857 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armel/builds/2269 [py3.5]

01:00 <bbot2> Started: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/2029 [py3.5]

01:21 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-s390x/builds/918 [default]

01:26 <kenaan> rlamy py3tests 64d59e349b56 /pypy/module/_demo/test/test_import.py: fix test

01:53 jcea has quit [Quit: jcea]

01:56 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-64/builds/5469 [py3.5]

02:03 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armel/builds/2269 [py3.5]

02:05 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

02:06 tbodt has joined #pypy

02:09 <bbot2> Success: http://buildbot.pypy.org/builders/build-pypy-c-jit-linux-armhf-raspbian/builds/2029 [py3.5]

02:11 tbodt has quit [Ping timeout: 260 seconds]

02:23 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6746 [py3.5]

02:26 tbodt has joined #pypy

02:28 <bbot2> Retry: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3900 [py3.5]

02:30 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3901 [py3.5]

02:34 RemoteFox has quit [Ping timeout: 268 seconds]

02:55 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/4622 [default]

02:55 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/4623 [py3.5]

03:00 <bbot2> Started: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-64/builds/2532

03:00 <bbot2> Started: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/3326

03:02 jerith has quit [Remote host closed the connection]

03:02 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-macosx-x86-64/builds/3764 [py3.5]

03:04 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

03:06 dfee has joined #pypy

03:08 <dfee> what kind of madness is this pypy interpreter? https://paste.ofcode.org/VCKuQuigVXuuGRrb9b2Hb8

03:08 Graypup_ has quit [Quit: ZNC 1.6.1 - http://znc.in]

03:15 <dfee> filed a bug report: https://bitbucket.org/pypy/pypy/issues/2827/trailing-commas-are-occassionaly

03:16 asmeurer__ has joined #pypy

03:21 <njs> dfee: currently pypy is compatible with python 3.5

03:21 <njs> dfee: those trailing commas only started to be legal in 3.6

03:22 <njs> dfee: bizarrely enough

03:28 asmeurer__ has quit [Quit: asmeurer__]

03:34 asmeurer__ has joined #pypy

03:55 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-x86-32/builds/4623 [py3.5]

03:57 jacob22 has quit [Quit: Konversation terminated!]

04:00 dddddd has quit [Remote host closed the connection]

04:08 asmeurer__ has quit [Quit: asmeurer__]

04:12 asmeurer__ has joined #pypy

04:19 <alcarithemad> dfee: that does in fact work in the 3.6 branch as of at least a couple months ago

04:41 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-32/builds/5857 [py3.5]

05:05 njs has quit [Quit: Coyote finally caught me]

05:20 njs has joined #pypy

05:28 jamesaxl has quit [Quit: WeeChat 2.1]

05:34 illume has joined #pypy

05:35 dfee has quit [Ping timeout: 256 seconds]

05:42 dmalcolm has quit [Ping timeout: 248 seconds]

05:58 Hasimir has quit [*.net *.split]

06:02 asmeurer__ has quit [Quit: asmeurer__]

06:03 asmeurer_ has joined #pypy

06:07 asmeurer_ has quit [Client Quit]

06:08 asmeurer__ has joined #pypy

06:12 lritter has joined #pypy

06:13 asmeurer__ has quit [Ping timeout: 255 seconds]

06:28 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-win-x86-32/builds/3901 [py3.5]

06:35 dmalcolm has joined #pypy

06:38 dfee has joined #pypy

06:38 <dfee> njs: alcarithemad i had no idea those were illegal in 3.5. i think i used them in cpython 3.5?

06:39 asmeurer has joined #pypy

06:44 asmeurer has quit [Ping timeout: 264 seconds]

06:48 energizer has joined #pypy

07:14 jerith has joined #pypy

07:15 tayfun26 has joined #pypy

07:18 <arigato> dfee: no, cpython 3.5 behaves in the same way as pypy 3.5

07:19 <arigato> at least cpython 3.5.3. I guess it didn't change in minor versions but you never know

07:19 <arigato> ...in micro version, I think they are called

07:21 rubdos has quit [Quit: WeeChat 2.0.1]

07:23 rubdos has joined #pypy

07:26 dfee has quit [Ping timeout: 264 seconds]

07:32 wleslie has quit [Quit: ~~~ Crash in JIT!]

07:32 rubdos has quit [Quit: WeeChat 2.0.1]

07:34 rubdos has joined #pypy

07:38 <bbot2> Success: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-64/builds/2532

07:40 asmeurer__ has joined #pypy

07:40 asmeurer__ has quit [Client Quit]

07:41 RemoteFox has joined #pypy

07:41 marself has joined #pypy

07:47 dfee has joined #pypy

07:52 dfee1 has joined #pypy

07:54 dfee has quit [Ping timeout: 246 seconds]

07:55 wleslie has joined #pypy

07:59 marr has joined #pypy

08:02 dfee1 has quit [Ping timeout: 255 seconds]

08:02 inad924 has joined #pypy

08:04 dfee1 has joined #pypy

08:04 illume has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

08:10 asmeurer has joined #pypy

08:10 asmeurer has quit [Client Quit]

08:12 asmeurer has joined #pypy

08:14 asmeurer has quit [Client Quit]

08:16 asmeurer__ has joined #pypy

08:16 asmeurer__ has quit [Client Quit]

08:18 asmeurer__ has joined #pypy

08:20 asmeurer__ has quit [Client Quit]

08:22 asmeurer__ has joined #pypy

08:22 asmeurer__ has quit [Client Quit]

08:23 asmeurer has joined #pypy

08:25 asmeurer has quit [Client Quit]

08:27 dfee1 has quit [Ping timeout: 240 seconds]

08:27 asmeurer_ has joined #pypy

08:29 Hasimir has joined #pypy

08:29 asmeurer_ has quit [Client Quit]

08:29 wleslie has quit [Ping timeout: 264 seconds]

08:29 dfee1 has joined #pypy

08:32 asmeurer_ has joined #pypy

08:33 asmeurer_ has quit [Client Quit]

08:34 asmeurer_ has joined #pypy

08:34 asmeurer_ has quit [Client Quit]

08:35 jamesaxl has joined #pypy

08:36 agates has quit [Ping timeout: 256 seconds]

08:36 yuvipanda has quit [Ping timeout: 260 seconds]

08:36 dash has quit [Ping timeout: 255 seconds]

08:36 bendlas has quit [Ping timeout: 269 seconds]

08:36 asmeurer_ has joined #pypy

08:38 asmeurer_ has quit [Client Quit]

08:40 asmeurer__ has joined #pypy

08:40 asmeurer__ has quit [Client Quit]

08:42 asmeurer__ has joined #pypy

08:45 ceridwen has quit [Ping timeout: 265 seconds]

08:46 antocuni has joined #pypy

08:47 Hasimir has quit [*.net *.split]

08:48 asmeurer__ has quit [Ping timeout: 248 seconds]

08:51 <antocuni> arigato: ping

08:51 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1729 [default]

08:51 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-jit-linux-armhf-raspbian/builds/1730 [default]

08:52 wleslie has joined #pypy

08:55 mcyprian has joined #pypy

09:06 Hasimir has joined #pypy

09:08 wleslie has quit [Quit: ~~~ Crash in JIT!]

09:29 energizer has quit [Disconnected by services]

09:30 dash has joined #pypy

09:36 energizer has joined #pypy

09:47 inad923 has joined #pypy

09:50 inad924 has quit [Ping timeout: 240 seconds]

09:59 marr has quit [Ping timeout: 255 seconds]

10:07 marself has quit [Ping timeout: 268 seconds]

10:08 marself has joined #pypy

10:12 Eran has joined #pypy

10:12 bendlas has joined #pypy

10:12 yuvipanda has joined #pypy

10:12 agates has joined #pypy

10:13 <Eran> Hi, We have a problem on production. We are running pypy code the fork processes and it seems like one if the forked processes is hang.

10:13 energizer has quit [Ping timeout: 260 seconds]

10:14 <Eran> How can we debug this issue when it happens? any Ideas?

10:14 jcea has joined #pypy

10:14 ceridwen has joined #pypy

10:14 antocuni has quit [Ping timeout: 250 seconds]

10:21 ceridwen has quit [Ping timeout: 256 seconds]

10:35 ceridwen has joined #pypy

10:46 <bbot2> Success: http://buildbot.pypy.org/builders/jit-benchmark-linux-x86-32/builds/3326

10:49 marr has joined #pypy

10:57 jcea has quit [Quit: jcea]

11:01 jcea has joined #pypy

11:03 marky1991 has joined #pypy

11:12 jcea has quit [Quit: jcea]

11:13 inad924 has joined #pypy

11:16 inad923 has quit [Ping timeout: 255 seconds]

11:20 jcea has joined #pypy

11:23 dfee1 has quit [Ping timeout: 268 seconds]

11:23 lritter has quit [Quit: Leaving]

11:24 marky1991 has quit [Remote host closed the connection]

11:26 dddddd has joined #pypy

11:31 antocuni has joined #pypy

11:42 user24 has joined #pypy

11:48 marky1991 has joined #pypy

12:02 mcyprian has quit [Ping timeout: 240 seconds]

12:13 raynold has quit [Quit: Connection closed for inactivity]

12:16 <arigato> antocuni: pong

12:17 <arigato> Eran: if the subprocess hangs for unknown reasons, try to attach a gdb to it (or a MSVC debugger on Windows)?

12:17 mcyprian has joined #pypy

12:18 dfee1 has joined #pypy

12:20 <antocuni> arigato: so, I have some questions about the GC

12:20 <antocuni> the first is about PYPY_GC_INCREMENT_STEP: looking at the code, it seems it does something different than what is documented in the docstring

12:22 <antocuni> in particular, "The minimum is set to size that survives minor collection * 1.5 so we reclaim anything all the time."

12:22 <antocuni> if I read the logic at incminimark.py:2316 correctly, I think that what happens is that "estimate" is always at least nursery_size*2; i.e., it always depends on the size of the whole nursery, not on the size of the surviving objects

12:22 dfee1 has quit [Ping timeout: 260 seconds]

12:24 <arigato> "estimate_from_nursery" is based on self.nursery_surviving_size

12:25 <arigato> not self.nursery_size

12:26 <antocuni> oh, right

12:27 <arigato> I think the numbers in the docstring are wrong

12:27 <antocuni> but then at least the magic numbers in the docstring look wrong?

12:27 tayfun26 has quit [Read error: Connection reset by peer]

12:27 <arigato> default size appears to be 4 * nursery size, and minimum is 2 * surviving_minor_collection

12:28 <antocuni> ok, but then it means that by default, it will always be based on the nursery_size, instead of nursery_surviving_size?

12:29 <antocuni> because estimate == 4*nursery_size, which is always > nursery_surviving_size*2

12:29 <arigato> no, I think that "nursery_surviving_size" includes the large young objects

12:29 <antocuni> ah

12:30 <arigato> ...no?

12:31 <antocuni> I see this line, "self.nursery_surviving_size += raw_malloc_usage(totalsize)"

12:31 <antocuni> which probably means you are correct

12:31 <arigato> where? it appears several times but AFAICT always on nursery objects

12:33 <antocuni> right

12:33 <arigato> no, it's self.size_objects_made_old I had in mind:

12:33 <arigato> see the long comment before line 360

12:34 <antocuni> ok. But then my original remark about "estimate" is correct

12:34 <arigato> yes

12:35 <arigato> it's just that self.size_objects_made_old is used elsewhere, to sometimes force more than one step of the major gc to occur

12:35 bremner has quit [Quit: Coyote finally caught me]

12:35 <antocuni> a bit of context: I started to dig this because I saw that on a real-world application, collect-step seemed to make long pauses

12:36 <antocuni> up to ~100 ms

12:37 <antocuni> so I investigated a bit, and found that this machine has a very large L2 cache (25MB), so the nursery is big, and "estimate" as well

12:38 <arigato> ah, argh

12:38 <arigato> then just setting PYPY_GC_NURSERY=4MB should help?

12:39 <antocuni> if I understand things correctly (which is unlikely :)), I think that we could improve the incrementality of our GC by making sure that "estimate" is always based on the surviving size, instead of the full nursery; is it ever useful to base it on nursery_size, after all?

12:39 <antocuni> arigato: yes, PYPY_GC_NURSERY=6M helped, as well as PYPY_GC_INCREMENT_STEP=2M

12:39 <antocuni> (although I'm not sure why the latter helps, looking at the code)

12:40 <arigato> probably because the default is 4 * PYPY_GC_NURSERY, which is still a bit too much

12:40 <antocuni> no, I'm saying that both settings improve the situation, even when used separately

12:40 marky1991 has quit [Remote host closed the connection]

12:41 <antocuni> ah right, I see

12:41 <arigato> h

12:41 <arigato> a

12:41 <antocuni> when I set PYPY_GC_INCREMENT_STEP, I'm saying: "collect either 2MB, or surviving_size*2, the largest"

12:42 marky1991 has joined #pypy

12:42 <antocuni> so if I have a situation in which many objects die young, surviving_size*2 is smallish

12:42 <antocuni> and since the nursery is so big, it is likely that they die young

12:43 bremner has joined #pypy

12:43 <arigato> right

12:43 <arigato> it's a bit unclear why we use nursery_size in that estimate

12:44 <antocuni> maybe because when you wrote it, you didn't have nursery_surviving_size yet?

12:44 <arigato> no, I think it's some fear I have but never made very concrete:

12:44 <arigato> what occurs if the memory usage from the program grows fast

12:45 <arigato> if the incremental gc is running too slowly, it means that memory usage grows faster than that

12:45 <arigato> (where "memory usage" means the really used, reachable memory)

12:45 <antocuni> but it cannot grow more than nursery_surviving_size, can it?

12:46 <antocuni> ah, maybe it can if you count young-but-large objects?

12:46 <arigato> yes, something like that

12:46 <arigato> self.size_objects_made_old is a later fix

12:46 <antocuni> but then it can grow indefinitely even if you use nursery_size*2

12:46 <arigato> I think the problem is fixed with self.size_objects_made_old

12:47 <antocuni> so, something like: estimate=size_objects_made_old * k, where k>1 ?

12:47 <arigato> but I'm not 100% sure, and I wasn't at the time either, so I preferred to keep a largish estimate

12:48 <antocuni> one way to handle it is that if we are wrong, eventually someone will report a memory leak :)

12:48 <arigato> meh :-)

12:48 * antocuni looks at how size_objects_made_old is computed

12:48 <arigato> I *think* that we could check what occurs even if we have estimate=smaller-than-nursery_surviving_size

12:49 <arigato> if I'm reading the loop at line 783 correctly

12:50 <antocuni> I think that size_objects_made_old doesn't help for computing estimate: the comment says that it's the size since the last major collection, but we want the size since the last MINOR one

12:50 <antocuni> and I'm not sure to get what you are saying about like 783

12:52 <arigato> I'm saying that the global invariant that we try to maintain is that the GC major steps do sufficient work so that after a full cycle, we have only 50% more memory used than at the start of the full cycle, or something

12:52 <antocuni> "full cycle" == "a complete major collection"?

12:53 <arigato> yes

12:54 <antocuni> uhm, I think I start to grasp the logic. Let's forget about large objects for now

12:55 <antocuni> at each minor collection, we increment the used memory by at most nursery_size (if everything survives)

12:55 <antocuni> but to collect nursery_size of memory, we need at least two steps: one for marking, and one for sweeping

12:56 <antocuni> so we mark an amount which is twice the nursery

12:56 <antocuni> or something along these lines?

12:58 <arigato> well, sweeping uses different thresholds anyway

12:58 <arigato> but yes

12:58 <arigato> the idea is to have marking progress "fast enough"

13:00 <antocuni> right, I see that sweeping is somehow based on 3*nursery_size

13:01 <antocuni> so, basically: I think that we should compute estimate from "size_objects_made_old_since_the_last_minor_collection"

13:02 <arigato> I also think the loop at :783 is printing, confusingly, that two major gc steps occurred, but the real program couldn't run between them

13:02 <antocuni> and possibly let the user to change the factor using an env variable, so that it can tweaks the incrementality

13:03 <antocuni> arigato: speaking of that, I have some interesting real-world chart to show, based on PYPYLOG

13:03 <arigato> antocuni: not quite, the current logic is probably better

13:03 <antocuni> why?

13:04 <arigato> because :783 does the same result, but is more general: it works for example even if the major gc is in another phase than marking

13:04 mcyprian has quit [Ping timeout: 248 seconds]

13:05 <antocuni> ah, you mean that it works even if we make estimate too small?

13:05 <arigato> yes, but also, it works also to speed up sweeping, for example

13:06 <antocuni> ok, so I guess we can safely say "estimate = self.nursery_surviving_size * k"

13:07 <arigato> yes, I think the conclusion is that we can use here an estimate that is generally good, and not worry about rare cases

13:08 <antocuni> coll, I'll try to implement this in a branch. What about killing PYPY_GC_INCREMENT_STEP and introduce PYPY_GC_INCREMENT_FACTOR (default=2)?

13:09 <arigato> yes, or maybe default = more than 2 because that's a big change from the current situation of 4 times nursery_size

13:10 <antocuni> true, but I think that 4*nursery_size is really "wrong", especially on high-end machines such as the one I found

13:11 <arigato> yes

13:12 <arigato> I still think we should keep a minimum value

13:12 <arigato> otherwise estimate might be very small

13:12 <arigato> and a call to major_collection_step() doesn't do anything at all

13:13 <antocuni> like, min_estimate = self.nursery_size / 8 or so?

13:13 <antocuni> do we have any statistics about the average ration of surviving_size/nursery_size?

13:13 <antocuni> s/ration/ratio

13:14 <arigato> it's low, is all I know

13:14 <arigato> maybe 20%?

13:15 <antocuni> it's probably useful to print it in the pypylog

13:15 <arigato> right

13:15 <antocuni> speaking of which: https://botbot.me/freenode/pypy/2018-05-09/?msg=99875082&page=1

13:15 <antocuni> here are the instructions to see my pypylog, if you are interested

13:15 <arigato> so yes, to avoid to change "too much" at once, I would go with a minimum of nursery_size/2

13:16 <antocuni> (the PYPYLOG viewer which I wrote is generally useful, I think)

13:16 <arigato> the reason is that we use nursery_size/2 in "self.threshold_objects_made_old += r_uint(self.nursery_size // 2)

13:16 <arigato> "

13:17 <antocuni> ok, I suppose it makes sense; and if we add enough env variables, we can still do experiments to see whether we find better defaults

13:18 <antocuni> arigato: in particular, the pypylog I linked to shows another collect-step problem which I think it's unrelated to what we are discussing now: a huge spike near the end of every single major collection

13:18 <antocuni> up to ~55ms

13:18 <arigato> right, but we're never careful enough in the GC: ideally we must never allow a corner case to do bad things

13:19 <arigato> that's why I'm not sure what occurs if we set estimate < nursery_size

13:19 <arigato> and in particular, <= nursery_size / 2

13:20 <antocuni> ok, it's likely that I am too optimist because I have not been bitten by these problems yet :)

13:21 <arigato> antocuni: likely, the spike is caused by the non-incremental steps of finalize and cpyext

13:21 mcyprian has joined #pypy

13:21 <antocuni> ah

13:22 <antocuni> finalize is "calling the __del__" ?

13:22 <arigato> yes

13:23 <arigato> well, not exactly doing the call to the __del__

13:23 <antocuni> I assume it's deal_with_objects_with_finalizers

13:23 <arigato> because that is not inside the pypylog reports for the gc

13:23 <arigato> yes

13:24 <antocuni> is there any fundamental reason why it's not incremental, or it's just that it has never been done?

13:25 <arigato> looks messy, but not fundamentally so

13:25 <antocuni> (in case it's not clear: I'm trying very hard to reduce the maximum observable pause from the user point of view)

13:26 <antocuni> calling the __del__s is also probably bad from this point of view, I didn't think about them

13:27 lazka has joined #pypy

13:34 Rhy0lite has joined #pypy

13:35 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-jit-linux-s390x/builds/856 [default]

13:38 <antocuni> arigato: actually, looking inside the log shows that these spikes are in the SWEEPING phase

13:42 <antocuni> (in case you are not using my pypylog viewer, here is a screenshot: http://antocuni.eu/misc/img/tD0DZyrV.png)

13:43 <antocuni> and here a zoomed-out screenshot of the whole log: http://antocuni.eu/misc/img/5rpDzaUV.png

13:55 inad923 has joined #pypy

13:58 inad924 has quit [Ping timeout: 260 seconds]

13:59 <arigato> how do you know it's sweeping?

14:00 <antocuni> I digged in the log and searched for the associated timestamp

14:00 <antocuni> it says: "starting gc state: SWEEPING

14:00 <antocuni> "

14:00 <antocuni> stopping, now in gc state: SWEEPING

14:01 <arigato> just one?

14:01 <arigato> or many that are bunched together?

14:01 <antocuni> in the chart, each pypylog section is one point

14:02 <antocuni> this specific one lasts 0.56 ms

14:02 <antocuni> but as you can see from the zoomed-out log, every collection follows a similar pattern

14:02 <arigato> unsure what you're saying

14:03 <arigato> did you find a single SWEEPING log section that is 55ms?

14:03 <antocuni> yes

14:03 <arigato> ok

14:03 <antocuni> moreover, each full cycle seems to have a single sweeping log section which is much slower than the others

14:04 <arigato> near the middle or always the last one?

14:04 <antocuni> look at the screenshots, it's easier :)

14:04 <arigato> I don't see that info in the screenshots

14:05 <antocuni> uh? http://antocuni.eu/misc/img/tD0DZyrV.png

14:05 <antocuni> the blue line represent the gc-collect-step

14:05 <antocuni> there is a big spike near the end

14:05 <arigato> but how I am supposed to know which of these steps is SWEEPING or something else?

14:06 dfee1 has joined #pypy

14:06 <antocuni> yes, you can't know from the screenshot this particular piece of info

14:06 <antocuni> but I assure you it's sweeping, I just checked :)

14:06 <arigato> *what* is sweeping? sorry, I don't understand you

14:07 <arigato> my question is: when does sweeping start, and when does it end?

14:08 <antocuni> each point of the graph is a single gc-section: on the X axis there is the "start", on the Y axis the "delta"

14:08 <arigato> ah, every point is a full cycle?

14:08 <antocuni> there is a section which starts at 716.51 seconds and ends at 716.45

14:08 <antocuni> every point is a gc-collect-step

14:08 <arigato> ok

14:09 <antocuni> the whole blue line line of the zoomed-in screenshot is a full cycle

14:09 <arigato> ok

14:09 <antocuni> if you look at the larger screenshot, you can see that at every full cycle (in blue), the memory drops (in green)

14:09 inad924 has joined #pypy

14:09 <arigato> then, some of these gc-collect-steps are for MARKING and some are for SWEEPING and a few for other things

14:09 <arigato> my question is: which ones?

14:09 <antocuni> yes

14:10 dfee1 has quit [Ping timeout: 265 seconds]

14:10 <antocuni> I suppose the first ones are marking, the last ones are sweeping; I don't know precisely where the border is, I can try to dig it in the log

14:11 <arigato> yes, I'd to know if the very-slow sweeping is the first, the last, or in the middle of the sweeping steps

14:11 <antocuni> ah ok, now I get your question

14:11 <antocuni> let me try to hack something

14:12 <antocuni> arigato: the phase of a gc-collect-step section is the "ending" phase, right?

14:12 inad923 has quit [Ping timeout: 240 seconds]

14:13 <antocuni> i.e., if a step starts in marking and ends in sweeping, should I consider it marking or sweeping?

14:15 <arigato> yes, it's the ending phase

14:15 <arigato> ah

14:15 <arigato> it prints both

14:15 <arigato> so no, you should read the first one

14:16 <arigato> if it starts in marking, then it is marking

14:16 <antocuni> ok

14:16 inad924 has quit [Quit: Leaving]

14:18 Taggnostr has quit [Ping timeout: 255 seconds]

14:21 <antocuni> arigato: http://antocuni.eu/misc/img/Jd7mTAfa.png

14:22 <antocuni> the points marked in yellow are SWEEPING

14:22 <antocuni> so it seems to be the very first sweeping phase

14:22 <antocuni> and indeed, digging in the log confirms

14:23 exarkun has joined #pypy

14:23 <arigato> uh

14:23 <arigato> ok what

14:23 exarkun has left #pypy [#pypy]

14:24 <arigato> the first sweeping step, it walks and frees a number of rawmalloced objects which is likely to be far too large

14:24 <antocuni> what is small_request_threshold?

14:24 <arigato> a number like 134816 in your case

14:25 <arigato> small_Request_threshold = 35*8

14:25 <antocuni> I'm not sure to follow the logic

14:25 <antocuni> limit is a number of objects or bytes?

14:26 <arigato> number of objects

14:26 <arigato> it happens to be the number such that, if all the objects are exactly 35*8 bytes long, then it'll sweep 3 nursery_sizes in bytes

14:26 <arigato> but the objects can be much larger

14:27 <antocuni> and a larger object takes a longer time to sweep

14:27 <arigato> in theory, no

14:28 <arigato> in practice, yes, because the headers of objects are on completely different pages

14:29 <antocuni> I don't understand whether 3*nursery is arbitrary or necessary to ensure termination

14:29 <arigato> it's a number that is sure to be large enough to ensure termination

14:30 <arigato> probably any number greater than 1 would do

14:30 <antocuni> is it? I can have arbitrarily as many rawmalloced objects as I want, without touching the nursery

14:32 <arigato> the major GC is started because the memory pressure has grown too much since the last GC

14:32 <arigato> so that means you can't think only about the nursery

14:33 <arigato> in this case, I think the nursery is used because one major gc step occurs because the nursery was full

14:33 <arigato> so, like during MARKING, we could instead base the number on nursery_survived_size instead

14:34 <antocuni> ok, so in the worst case I am sweeping some objects but I am allocating nursery_size more objects

14:34 <antocuni> right, it's exactly what I was about to suggest

14:35 <arigato> to be honest, I am not completely sure about any of these reasonings

14:36 <antocuni> do we have any test which checks whether the algo terminates/doesn't leak/

14:36 <antocuni> ?

14:36 <arigato> maybe not

14:37 <antocuni> "good"

14:39 <antocuni> I am about to go afk soon; I'll try to implement these ideas in a branch and see what happens. But we surely need to think more before merging

14:41 <arigato> the basic idea is probably still this loop at :783, which guarantees that there is at least one major gc step for every (nursery_size/2) allocated bytes outside the nursery

14:41 Taggnostr has joined #pypy

14:42 <arigato> so if every MARKING and every SWEEPING does its job on more than nursery_size bytes, then it should guarantee that we mark and sweep faster than we allocate

14:43 <antocuni> nursery_size or nursery_surviving_size?

14:43 <arigato> these two numbers of (nursery_size/2) and nursery_size are arbitrary, and they are related to the nursery size only because it looks like a good idea to make them so

14:44 <arigato> no, always nursery_size

14:44 <arigato> so maybe indeed it would be an idea to make them related to nursery_surviving_size instead, but then all of them, not just half

14:45 <antocuni> M-x replace string nursery_size -> nursery_surviving_size and we are done :)

14:46 <antocuni> arigato: leaving now, thanks for the help

14:47 <arigato> well, that's obscure. maybe we should instead ask major_collection_step() to do some progress, but it would return how much progress it really did, and we use that

14:47 Eran has quit [Quit: Page closed]

14:47 <arigato> antocuni: bye

15:22 jcea has quit [Read error: Connection reset by peer]

15:22 jcea has joined #pypy

15:24 user24 has quit [Remote host closed the connection]

15:24 lazka has quit [Quit: Leaving]

15:46 mcyprian has quit [Ping timeout: 260 seconds]

15:54 kanaka has quit [Ping timeout: 240 seconds]

15:59 kanaka has joined #pypy

16:02 kanaka has joined #pypy

16:02 kanaka has quit [Changing host]

16:03 exarkun has joined #pypy

16:04 xorAxAx has quit [Remote host closed the connection]

16:06 xorAxAx has joined #pypy

16:18 xorAxAx has quit [Remote host closed the connection]

16:19 dfee1 has joined #pypy

16:22 illume has joined #pypy

16:30 tbodt has joined #pypy

16:31 tbodt has quit [Client Quit]

16:32 tbodt has joined #pypy

16:40 marky1991 has quit [Ping timeout: 240 seconds]

16:41 <antocuni> arigato: "we use that" to do what? To compute how much to do at the next step? Or to continue calling major_collection_step in a loop until we reach a certain threshold?

16:44 illume has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

16:47 user24 has joined #pypy

17:02 raynold has joined #pypy

17:12 antocuni has quit [Ping timeout: 240 seconds]

17:13 dfee1 has quit [Ping timeout: 265 seconds]

17:35 xorAxAx has joined #pypy

17:40 tbodt has quit [Quit: Textual IRC Client: www.textualapp.com]

17:41 tbodt has joined #pypy

17:43 exarkun has left #pypy [#pypy]

17:57 marky1991 has joined #pypy

18:07 mcyprian has joined #pypy

18:07 mcyprian has left #pypy [#pypy]

18:09 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

18:18 necaris has joined #pypy

18:18 marky1991 has quit [Remote host closed the connection]

18:18 <necaris> hey folks

18:18 <necaris> quick question

18:18 <necaris> what is the state of the `py3.6` branch?

18:20 <necaris> would love some tips on the best thing i could do to help on it

18:28 necaris has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

18:37 tbodt has joined #pypy

18:40 <cfbolz> necaris: hey Rami!

18:40 <cfbolz> How are things?

18:43 void__ has joined #pypy

18:46 user24 has quit [Ping timeout: 240 seconds]

18:48 demonimin has quit [Remote host closed the connection]

18:49 _aegis__ has quit [Ping timeout: 255 seconds]

18:51 mcyprian has joined #pypy

18:51 mcyprian has quit [Client Quit]

18:52 mcyprian has joined #pypy

18:54 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

18:56 tbodt has joined #pypy

18:56 mcyprian has quit [Ping timeout: 265 seconds]

18:57 mcyprian has joined #pypy

18:58 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6747 [Carl Friedrich Bolz-Tereick: force build, py3.6]

18:58 <bbot2> Started: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-x86-64/builds/3956 [Carl Friedrich Bolz-Tereick: force build, py3.6]

18:59 <cfbolz> necaris: I don't actually know the answer to this question, but I kicked of a test run on the branch, which should give us a clue what is still missing

19:07 _aegis_ has joined #pypy

19:15 dfee1 has joined #pypy

19:17 void__ has quit [Ping timeout: 256 seconds]

19:18 lauren has joined #pypy

19:20 <lauren> is jitpy dead

19:20 <cfbolz> lauren: what is jitpy?

19:20 <lauren> https://github.com/fijal/jitpy

19:21 <cfbolz> Ah

19:21 <cfbolz> No clue, fijal?

19:21 <cfbolz> It looks more like an experiment to me

19:21 <lauren> dang

19:21 <lauren> I had it in some really old notes

19:21 <lauren> ...I'm actually not even sure which really old notes. oh maybe my old twitter posts

19:35 <cfbolz> lauren: do you have a use case in mind?

19:46 dfee1 has quit [Ping timeout: 256 seconds]

19:54 <lauren> yeah, web thing. I'm depending on this package list: {pyramid,sqlalchemy,psycopg2-binary,pyramid_jinja2,bcrypt,twisted,alchimia,SQLAlchemy-Utc,pytz,blessed,lxml, stripe} -

19:55 <lauren> psycopg and lxml are pain ones, right?

19:57 Rhy0lite has quit [Quit: Leaving]

19:58 <bbot2> Failure: http://buildbot.pypy.org/builders/pypy-c-app-level-linux-x86-64/builds/3956 [Carl Friedrich Bolz-Tereick: force build, py3.6]

20:08 <ronan> lauren: everything should work, I think

20:08 <lauren> damn really? including psycopg2-binary?

20:08 <ronan> yes

20:08 <lauren> wow

20:08 <lauren> what a world

20:08 <lauren> how is it that that would work

20:09 <simpson> cpyext keeps getting buffs.

20:09 <lauren> ah

20:09 <lauren> will it be fast enough to be usable?

20:10 <ronan> it's probably a bit slower than CPython but not horribly so

20:11 <ronan> as usual, perfs depend a lot on what your app actually does

20:11 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

20:14 tbodt has joined #pypy

20:14 dfee1 has joined #pypy

20:15 <ronan> lauren: well, you need psycopg2, actually, as there are no wheels for pypy

20:16 <lauren> ah

20:16 <lauren> reasonable re perf

20:16 <lauren> is psycopg2-cffi production ready

20:16 <lauren> or even a real thing rather than me misremembering

20:17 <ronan> I know that some people use it, but I can't tell how good it is

20:23 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-x86-64/builds/6747 [Carl Friedrich Bolz-Tereick: force build, py3.6]

20:24 devwatchdog has joined #pypy

20:28 tbodt has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

20:29 tbodt has joined #pypy

20:31 asmeurer__ has joined #pypy

20:41 illume has joined #pypy

20:41 asmeurer__ has quit [Quit: asmeurer__]

20:43 mcyprian has quit [Quit: Leaving.]

20:45 devwatchdog has quit [Quit: Leaving]

20:49 dfee1 has quit [Ping timeout: 265 seconds]

21:30 dfee1 has joined #pypy

21:34 asmeurer has joined #pypy

21:46 zmt00 has quit [Quit: Leaving]

21:46 zmt00 has joined #pypy

21:56 wleslie has joined #pypy

22:00 <bbot2> Started: http://buildbot.pypy.org/builders/own-linux-s390x/builds/919 [default]

22:01 jamesaxl has quit [Quit: WeeChat 2.1]

22:03 dfee1 has quit [Ping timeout: 264 seconds]

22:14 dfee1 has joined #pypy

22:23 illume has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]

22:30 asmeurer has quit [Quit: asmeurer]

22:36 asmeurer__ has joined #pypy

22:44 tbodt has quit [Ping timeout: 240 seconds]

22:46 dfee1 has quit [Ping timeout: 256 seconds]

22:48 asmeurer___ has joined #pypy

22:48 dfee1 has joined #pypy

22:49 froztbyt1 has joined #pypy

22:50 froztbyte has quit [Ping timeout: 240 seconds]

22:50 [Arfrever] has quit [Ping timeout: 240 seconds]

22:50 dmalcolm has quit [Ping timeout: 240 seconds]

22:50 dmalcolm has joined #pypy

22:50 asmeurer__ has quit [Ping timeout: 260 seconds]

22:51 [Arfrever] has joined #pypy

22:53 wleslie has quit [Quit: ~~~ Crash in JIT!]

23:11 marself has quit [Ping timeout: 260 seconds]

23:25 <bbot2> Failure: http://buildbot.pypy.org/builders/own-linux-s390x/builds/919 [default]

23:33 asmeurer___ has quit [Quit: asmeurer___]

23:35 asmeurer_ has joined #pypy

23:47 asmeurer_ has quit [Quit: asmeurer_]