jacob22__ has quit [Remote host closed the connection]
Rhy0lite has joined #pypy
<arigato>
snow, snow
adamholmberg has joined #pypy
* antocuni
wonders what's the result of "grep 'snow now' | wc" on the full log of #pypy
<antocuni>
*snow snow
xcm has quit [Killed (livingstone.freenode.net (Nickname regained by services))]
<antocuni>
arigato: I'd like to merge the gc-disable branch. The tests are passing and it has been used in production for quite a while now, it seems to work very well
xcm has joined #pypy
<antocuni>
are you interested in reviewing it, or should I just proceed?
<arigato>
antocuni: I can review it, at least check if there is no problem from my point of view
<antocuni>
yes please
<antocuni>
I am writing the documentation right now, but the TL;DR version is: gc.disable() disables major collections, then you call gc.collect_step() to run a single step manually
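A minimal sketch of how the branch's API is meant to be used, going only by the description above (the request loop and handle_request are hypothetical, and the exact return value of collect_step() isn't shown in this log):

    import gc

    gc.disable()               # on the gc-disable branch: stop major collections
    while still_serving():     # hypothetical latency-sensitive loop
        handle_request()       # no major-GC pause can happen in here
        gc.collect_step()      # run one step of the major GC at a convenient point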
<vstinner>
antocuni: i proposed to modify the multiprocessing module to no longer rely on the GC to clean up stuff. Antoine Pitrou replied that it goes against the language philosophy :-(
<Alex_Gaynor>
Which philosophy "Implicit is better than explicit"?
<vstinner>
i also proposed to avoid creating a reference cycle in xml.etree rather than fixing the GC https://bugs.python.org/issue35502#msg332051 but Serhiy only fixed treebuilder_gc_traverse(), he ignored my remark
rjarry has left #pypy ["Leaving"]
<vstinner>
my small contribution: i proposed a change to start emitting ResourceWarning when multiprocessing.Pool isn't closed/terminated explicitly: https://github.com/python/cpython/pull/10974
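The warning would target code like this, which today leans on the GC to reclaim the pool; closing and joining it explicitly avoids that (the worker function and pool size are just for illustration):

    import multiprocessing

    def square(x):
        return x * x

    if __name__ == '__main__':
        pool = multiprocessing.Pool(4)
        print(pool.map(square, range(10)))
        pool.close()   # release the worker processes explicitly
        pool.join()    # instead of relying on the GC / __del__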
<vstinner>
no idea if antoine pitrou is going to accept that one :)
<antocuni>
vstinner: well, from my POV it's obvious that Antoine Pitrou is wrong, because gvr was very explicit in saying that Python-the-language doesn't guarantee that destructors are called immediately
<antocuni>
(I can't find the reference to that though)
<vstinner>
antocuni: i was surprised how deeply multiprocessing relies on the GC
<antocuni>
(and/or that py3k contains bad design choices)
<vstinner>
Alex_Gaynor: currently, "Py_TYPE(obj) = type" has no way to return an error. i would like to fix Py_TYPE(), but if I add a new function, i would prefer to be able to report a failure
<vstinner>
no idea what happens if you do Py_TYPE(None) = &PyUnicode_Type ?
<Alex_Gaynor>
I assume at random later points, things crash
<vstinner>
antocuni: my socket.create_connection() fix is *one* example. it's easy to find way more of them. like the xml.etree bug
<vstinner>
antocuni: it's super complex to detect ref cycles during development, and worse in production
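One of the few tools that helps during development is gc's debug flags; a small illustration (the Node class is made up):

    import gc

    class Node(object):
        pass

    a, b = Node(), Node()
    a.other, b.other = b, a         # create a reference cycle
    del a, b

    gc.set_debug(gc.DEBUG_SAVEALL)  # keep everything the collector would have freed
    gc.collect()
    print(gc.garbage)               # the cycle's objects show up here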
<antocuni>
yes, sure. But a language which forces you to do something like that is, at least, obscure
<vstinner>
Alex_Gaynor: maybe it's ok if the crash is deterministic :-D
<Alex_Gaynor>
vstinner: it's probably exploitable
<vstinner>
antocuni: come on, it's the Python philosophy!
<vstinner>
antocuni: one obvious way to do something
<antocuni>
:)
lritter has quit [Ping timeout: 244 seconds]
jcea has joined #pypy
xcm has quit [Killed (barjavel.freenode.net (Nickname regained by services))]
<arigato>
antocuni: I'm not sure it's a good idea to use gc.disable() like you do
<arigato>
there are a few people on CPython that write their apps using gc.disable(), after making sure somehow that they don't have cycles
<arigato>
though admittedly, these apps would not run any __del__ on pypy right now, but that sounds less bad than "leaking at a fast rhythm"
<antocuni>
well, from some point of view you should not be surprised if you get a leak after you disable the GC
<antocuni>
but yes, I see your point
<arigato>
I agree, but "gc.disable" doesn't mean "disable the GC", for CPython it means something much more specific
<arigato>
I'm maybe worried about cases where some code does "gc.disable(); run some stuff; gc.enable()"
<arigato>
you'd hope that there is not too much stuff in the middle, but it might be large in some cases, then we get unexpectedly a "leak"
<antocuni>
well, this is a good example of why it's a good idea to use my semantics: you want to make sure that the GC doesn't run in a specific section of the code, and you are fine with some objects not being collected in the meantime
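The bracket being discussed looks roughly like this (the body of the critical section is hypothetical):

    import gc

    gc.disable()                  # CPython: suspend cyclic collection (and surprise __del__s)
    try:
        run_critical_section()    # keep this short; cyclic garbage accumulates meanwhile
    finally:
        gc.enable()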
<arigato>
right now, gc.disable() disables calling __del__ for a very practical reason
<antocuni>
although in pypy "some objects" is a much larger set of course
<antocuni>
which is?
<arigato>
on cpython, the point of gc.disable() and gc.enable() is that you're guaranteed to not have __del__ being called "randomly" in the middle by newly broken reference cycles
<arigato>
well, at least that's (I think) the main reason for calling gc.disable()/enable() around a piece of code
<antocuni>
ah. I thought it was to avoid unexpected pauses (or at least, that's what I always found reading code around)
<arigato>
ah, I see where you're coming from
<antocuni>
also, if the section between the disable() and the enable() is short (as it should be), the memory doesn't grow too much
<antocuni>
but yes, thinking about that it's likely that there is code around which will be hit by this change
<antocuni>
basically, we should decide whether we want to consider this case like we do for e.g. leaking file descriptors (you need to fix your code), or if we want to be super-conservative and try not to break existing code, at the expense of having a worse API
<antocuni>
(I don't like too much the idea of introducing yet another obscure API like gc.disable_majors() or so)
<arigato>
I suppose I'm fine with gc.disable() actually
<arigato>
because if you really leave the gc disabled for a long while, no destructors are called, and you get strange effects too
<antocuni>
I'll write a note in whatsnew to make sure to write something appropriate when we do the next release
<arigato>
I'm trying to think about multithreaded examples where one thread does gc.disable()/long-blocking-call/gc.enable(),
<arigato>
but that's a dummy thing to do because gc.disable/enable are not reentrant anyway
<antocuni>
why would you want to do that anyway? If you have a long-blocking call, the GC doesn't run anyway
<arigato>
no, it does in multithreaded examples
<arigato>
random other python code would run during the long blocking call
<antocuni>
ok, I thought you were proposing a pattern in which you disable the GC around the call *on purpose*; but I realize that you are probably talking about a case in which a thread disables the GC and then by chance it blocks
<arigato>
but that's probably not a good thing to do because you have no sane way to do gc.disable/enable that really works in multithreaded cases
<arigato>
yes
<arigato>
just saying, if the original goal was to disable __del__s then in multithreaded programs you can't be sure another thread won't re-enable the GC
<arigato>
which makes the 'subprocess' module basically buggy
<arigato>
(there is gc.disable() in the subprocess module, for that reason)
<antocuni>
ah, does subprocess use this pattern?
* antocuni
looks
<arigato>
meh, nothing we can really do
<arigato>
ah, maybe the goal is to prevent __del__s from running in the child process, after os.fork(), and in the child there is only thread..?
<arigato>
...only one thread
<antocuni>
arigato: I'm reading subprocess.py:943
<antocuni>
to me, it looks like they solved the bug using a workaround instead of a proper fix, which would have been to keep the file descriptor alive explicitly
<vstinner>
arigato: /* We need to call gc.disable() when we'll be calling preexec_fn */ says _posixsubprocess
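From memory, the Python-level pattern being referred to is roughly this shape (simplified, not a verbatim quote of subprocess.py):

    import gc, os

    gc_was_enabled = gc.isenabled()
    gc.disable()                  # keep __del__s from closing fds in the child after fork()
    try:
        pid = os.fork()
        # child goes on to mangle fds and exec; parent falls through
    finally:
        if gc_was_enabled:
            gc.enable()           # parent restores the previous collector state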
<arigato>
it's hard to fix the bug properly, because the goal here is to mangle the file descriptors, precisely
<arigato>
yes, on 3.x they moved that code to C but that doesn't fix all problems because preexec_fn() is still a python callable
<antocuni>
anyway, this should still work in my branch, since the GC is disabled for a short period of time
<antocuni>
arigato: I also remember that we often had to explain to people "no, gc.disable() doesn't actually disable the GC" in the past. That's one of the main reasons I used it: I thought it was really a missing feature
marky1991 has joined #pypy
<arigato>
yes, it's probably a good idea
<antocuni>
good :)
<arigato>
I must say I wouldn't mind answering "gc.disable() disables the gc, so you can't complain about running out of memory"
<antocuni>
yes, exactly
<antocuni>
arigato: do you want to review it further or I can merge?
<arigato>
rpython/memory/gc/incminimark.py:767
<arigato>
you changed "if gen < 0" to "if gen <= 0"
<arigato>
now the logic doesn't make sense, because there is "if gen == 1 and ...", 10 lines later
<arigato>
I think the idea was that gc.collect(0) would run self.minor_collection_with_major_progress() too,
<arigato>
and gc.collect(1) would run that plus start a major collection if none was running now
<arigato>
and the undocumented gc.collect(-1) would run only self._minor_collection() which is dangerous
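Restating that reading as a sketch (this paraphrases the discussion, not the actual incminimark source; the attribute and helper dealing with the major collection are assumed names):

    def collect(self, gen):
        if gen < 0:
            self._minor_collection()                        # undocumented, dangerous
            return
        self.minor_collection_with_major_progress()         # what gc.collect(0) does
        if gen == 1 and not self.major_collection_running:  # attribute name assumed
            self.start_major_collection()                   # helper name assumed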
<antocuni>
arigato: note that this is only for the rpython API, not the app-level one
<arigato>
ah right, didn't notice that the "generation" argument was not passed down
<antocuni>
I don't remember exactly what I thought when doing that commit, but I think I assumed that <0 was a typo because nobody ever calls rgc.collect(-1)
<arigato>
right, but no
<arigato>
the logic for the "gen" doesn't make sense now
<antocuni>
why?
<arigato>
I mean, I don't know if that was useful, but also, no point in changing this overcomplicated API to a different overcomplicated one
<arigato>
because the "if gen == 1", 10 lines down
<arigato>
now it's always true
<arigato>
before, it wasn't always true
<antocuni>
right
<arigato>
you may remove the case "gen < 0" completely and argue that it is not tested and not used
<antocuni>
I am also not very happy to abuse gc.collect to control the rgc, I just followed the existing style. But indeed, maybe it makes more sense to kill support for gc.collect and use more explicit rgc.* APIs?
<antocuni>
I am not even sure that this logic is called by anyone
<arigato>
in rpython? doesn't seem like such a big deal
<arigato>
precisely
<arigato>
ah, I see that collect_step() would be some "collect(1.5)" or something
<antocuni>
so, a quick grep seems to reveal that in rpython gc.collect(something) is called only in tests
<arigato>
ok
<antocuni>
in particular, test_newgc.py:1486 says "nursery-only collection, if possible"
<arigato>
ok I think go ahead and merge, maybe with the "if gen <= 0" reverted to "if gen < 0"
<arigato>
I'm sure test_newgc passes either way, because it's hard to write very specific gc tests that will catch future unexpected tweaks
<arigato>
that's why I tend to be wary of changes in the gc
<antocuni>
I am proposing to just kill the argument to rgc.collect()
<arigato>
well, then you're making test_newgc.py:1486 no longer test what it is supposed to, I guess
<arigato>
maybe just add the comment "don't use an argument here, it's only for tests"?
<antocuni>
now I am no longer sure what test_newgc wants to do; does it want to do a minor collection, or to do a minor_collection_with_major_progress?
<arigato>
I think the test was written because we have *incremental* minimark
<arigato>
I think the test was written before we had *incremental* minimark
<arigato>
I think the difference is not important: the test is expecting that most of the calls will be nursery-only
<antocuni>
ok, so it's basically the same; but then I don't see why you propose to revert to the old logic "if gen < 0"; it seems this is the only place where it actually makes a difference
<antocuni>
and the old logic didn't make much sense to me (was it written with gc.collect(-1) in mind?)
<arigato>
I'm suggesting to their revert or kill that logic; just don't keep it as it is in the "gc-disable" branch because it makes even less sense
<arigato>
gah
<arigato>
I'm suggesting to either revert or kill that logic; just don't keep it as it is in the "gc-disable" branch because it makes even less sense
<antocuni>
because of the "if gen == 1"?
<arigato>
yes
<antocuni>
what about killing just that?
<arigato>
ok wait, I can rewrite that function to make it explicit what the cases it's trying to implement are
<antocuni>
maybe it's just better/simpler to use == everywhere instead of <=
<arigato>
yes, and comments
<antocuni>
not sure what the comments should say that isn't already in the docstring
Zaab1t has joined #pypy
* arigato
writes
<antocuni>
thanks :)
<arigato>
pushed
<antocuni>
our hg hooks no longer work it seems :(
<antocuni>
arigato: indeed, it is much clearer now
<antocuni>
thanks
<antocuni>
arigato: are you fine with merging?
<arigato>
yes
<antocuni>
cool, thanks
xcm has quit [Remote host closed the connection]
<arigato>
I think the real motivation to keep these cases around is to explain what really occurs if you try to call this or that function
<arigato>
in that order
<cfbolz>
arigato: I have a random GC related question: can you think of a hack to tell the GC to allocate something (eg a string or so) in the old generation immediately?
xcm has joined #pypy
<arigato>
it's not enough to allocate it non-movable?
<cfbolz>
no
<cfbolz>
the reason I want it: json parsing of huge files breaks the generational hypothesis, because the whole loaded json data stays alive till after the parser is done
<antocuni>
merged & pushed
* antocuni
off, see you
<arigato>
antocuni: cool
<arigato>
see you
<cfbolz>
arigato: that means everything is allocated in the nursery and then moved again
<cfbolz>
antocuni: nice!
<arigato>
cfbolz: so, no, there's nothing like that
<cfbolz>
arigato: it would be a mess to get it, too, I suppose?
<arigato>
not necessarily
<arigato>
see "def malloc_fixed_or_varsize_nonmovable" in incminimark.py
<cfbolz>
I see
<arigato>
we need the same thing, but with "alloc_young=False"
<cfbolz>
right
<cfbolz>
but a bit of a mess to get there from high level code
antocuni has quit [Ping timeout: 250 seconds]
<arigato>
yes, you'd need to follow malloc_fixed_or_varsize_nonmovable via gctransform etc.
<cfbolz>
right
<cfbolz>
arigato: and what the high level API would look like is very unclear
<arigato>
probably then adding something in lltypesystem/rstr.py
<arigato>
yes
<cfbolz>
yes, ideally I would like it for all kinds of objects. but strings would be a start
<arigato>
I guess you'd have some obscure rlib function that allocates a rstr.STR using lltype.malloc(..., nonmovable=True, old=True)
<arigato>
and then uses hlstr() to return it as a regular rpython string
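Spelling out that guess (purely hypothetical: the old= flag does not exist today, and the malloc keywords are assumed):

    from rpython.rtyper.lltypesystem import lltype, rstr
    from rpython.rtyper.annlowlevel import hlstr

    def alloc_old_str(length):
        # hypothetical: ask the GC for a non-moving string allocated outside the nursery
        lls = lltype.malloc(rstr.STR, length, nonmovable=True, old=True)
        return hlstr(lls)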
<cfbolz>
Right
<cfbolz>
Actually we do such obscurity in the json parser already
<arigato>
of course if that string comes from, say, a slice of a larger string, it's even more of a mess
<arigato>
or you make the small string, call the function, and forget the small string, to be collected at the next minor collection
<arigato>
that sounds like it's removing half the benefits though
<cfbolz>
arigato: we already do the slicing in interp_decoder.py manually at the low level, because we have a fast path for ASCII to utf-8