sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs
sb0 has quit [Ping timeout: 240 seconds]
<rjo>
whitequark: imagine you have a kernel that is complicated, consists of quite a bit of code (both source and compiled), takes a long time to compile, but little time to execute (many pulses), and is called frequently by other kernels.
<whitequark>
rjo: does it have to be a separate kernel?
<rjo>
whitequark: naively i would think that the objects' lifetime is just the lifetime of the kernel; there are no implicit host objects (all handled explicitly by rpcs)
<rjo>
whitequark: yes. it is called often and too expensive to be compiled and uploaded with every kernel that calls it.
<whitequark>
rjo: kernels currently assume a LIFO discipline
<whitequark>
er
<whitequark>
scratch that
<whitequark>
rjo: "it is called often and too expensive to be compiled" this is a wrong line of reasoning
<whitequark>
i'm not asking you to tell me how to implement it, i'm asking what you want
<rjo>
at least at the ARTIQlang level, kernels can call each other. where is the LIFO there?
<whitequark>
call stack is LIFO
<rjo>
i want "dynamic additions to the runtime"
<rjo>
ah yes. there.
<whitequark>
ok, that is much better, that allows me to design it to actual constraints
<whitequark>
how dynamic? let's say I would like to introduce some restrictions, as a thought experiment
<whitequark>
obviously adding new code is a must
<whitequark>
what about adding new host objects?
<rjo>
more specifically, an example: we have persistent data now. let's take a big list of lists that parametrizes a huge pulse sequence. the code for that is short (iteration over the data). but in many cases you cannot write the huge sequence as a simple/static list; you actually want to do some kind of metaprogramming, with the intermediate result being ARTIQlang.
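A minimal sketch of the kind of kernel rjo describes here, assuming the usual ARTIQ @kernel/delay shape; self.sequence and self.ttl are hypothetical names for the persistent list of lists and an output channel:

    @kernel
    def play_sequence(self):
        # short code, big data: iterate the persistent list of lists
        for step in self.sequence:
            self.ttl.pulse(step[0])    # pulse duration for this step
            delay(step[1])             # gap before the next step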
<whitequark>
you mean like scans?
<whitequark>
I can add direct support for scans in the new compiler, it's fairly straightforward
<rjo>
the handling of these kernels would be similar to the persistent arrays: you would ask for them to be compiled and then set them like the persistent data.
<rjo>
not like scans.
<whitequark>
ok.
<rjo>
scans are the most basic parametrization (one-variable) of an experiment.
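For contrast, a one-variable scan in the sense rjo means is just a loop over a single parameter (a hedged sketch; self.scan_points, self.ttl and self.settle_time are illustrative names):

    @kernel
    def scan(self):
        for t in self.scan_points:     # the single scanned variable
            self.ttl.pulse(t)
            delay(self.settle_time)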
<whitequark>
it's not realistic to expect kernels to be handled like persistent arrays, because correctness and safety of running kernels depend on too much state elsewhere
<whitequark>
concrete issue number 1.
<rjo>
doesn't the call stack guarantee that correctness?
<whitequark>
we are in general unable to unload any newly added code or data, because it might have mutated global state and put references to itself into it
<whitequark>
e.g. if new code refers to a list, that list will be compiled into .data in the new code
<whitequark>
the list is global, so new code can ask old code to give it a list of lists and it can put its new list there.
<whitequark>
unloading new code now violates memory safety. there are no practical means of preventing that
<whitequark>
(also works with closures)
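An illustration of the memory-safety problem whitequark describes, with hypothetical names: the newly loaded kernel quotes self.extra (so the list ends up in the new code's .data) and then stores a reference to it in a list owned by long-lived code.

    @kernel
    def new_kernel(self):
        # globally reachable list of lists, provided by old, long-lived code
        all_lists = self.get_list_of_lists()
        # self.extra is quoted into the *new* code's .data section; after
        # this append, old code holds a reference into the new code's data.
        # Unloading the new code now would leave a dangling reference.
        all_lists.append(self.extra)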
<rjo>
assume/enforce that the "dynamic/new" code is idempotent?
<rjo>
or is that the same question as: "how does a dynamic linker work"?
<rjo>
assume no global state is modified/maintained?
<whitequark>
that's unsafe. not only does this violate the Python contracts, but we also provide no tools to help debug unsafe code
<whitequark>
so if you ever, for any reason, violate that? you're screwed. you'll never figure out why
<whitequark>
now, mind you, this is not completely broken yet; "never unload any code" is something that surprisingly many systems do in production
<rjo>
isn't it sufficient to restrict the arguments and return values (to non-mutable things or the like)?
<whitequark>
nope, since new code can also use quoted values, and quoted values ~ globals
<rjo>
quoted values, as in strings?
<rjo>
rodata?
<whitequark>
no, quoting in the sense of quasiquoting
<whitequark>
anytime you refer to a host object from kernel code, it (the host object) is "quoted"
<whitequark>
that is, put into global storage, and a reference to that storage is used when the object is encountered at runtime
<whitequark>
immutable values are simple; lists are copied and left like that; proper objects get attribute resynchronization
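A hedged illustration of quoting as described above (the class and attribute names are made up; EnvExperiment/@kernel follow the usual ARTIQ shape):

    class Demo(EnvExperiment):
        def build(self):
            self.gain = 1.5              # immutable: quoted as a plain constant
            self.durations = [10, 20]    # list: copied to the core device once
            self.detector = Detector()   # object: attributes resynchronized

        @kernel
        def run_once(self):
            g = self.gain                # reads the quoted constant
            for d in self.durations:     # iterates the copied list
                self.detector.count(d)   # goes through the quoted object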
<whitequark>
rjo: anyway, so the API I propose for what you want is as follows.
<whitequark>
let's say you have several functions, f, g, h, ... that implicitly make use of some other (large, heavy) functions in kernel mode
<whitequark>
to avoid recompilation, you do:
<whitequark>
with NamingIsHard(f, g, h):
<whitequark>
f(); g(); h()
<whitequark>
("NamingIsHard" is some context manager that I don't know how to name.)
<whitequark>
compilation happens once, during entering the with statement.
<whitequark>
the heavy kernel functions get dragged in implicitly, by virtue of being a dependency of f, g, h...
<rjo>
and that "with ..." is in host-mode?
<whitequark>
yes
<whitequark>
in fact, this requires no changes to the compiler at all, just a bit of host Python code
<rjo>
with bundled(f,g,h): ...
<whitequark>
sure.
<rjo>
and within that bundle the set of available kernels is frozen?
<whitequark>
yes.
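Since whitequark says this needs only host-side Python, here is a hedged sketch of what bundled() could look like; compile_kernel() and upload_to_core() are hypothetical stand-ins for whatever the compiler and runtime actually expose:

    from contextlib import contextmanager

    @contextmanager
    def bundled(*kernel_fns):
        # compile f, g, h (and the heavy kernels they pull in as
        # dependencies) exactly once, on entering the with statement
        artifacts = [compile_kernel(fn) for fn in kernel_fns]
        for art in artifacts:
            upload_to_core(art)
        # inside the with block the set of available kernels is frozen;
        # f(), g(), h() dispatch to the already-uploaded code
        yield

The body then reads exactly as proposed above: with bundled(f, g, h): f(); g(); h().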
<rjo>
afaict now this is very nice but might be just orthogonal enough to the actual use cases... will discuss this.
<whitequark>
so otherwise, dynamic loading of code is trivial on runtime side, almost trivial on compiler backend side
<whitequark>
however, it is very hard on correctness/safety side
<whitequark>
if it is doable, I think it is likely that a form of `with bundled` would still be present, along with severe restrictions on what the kernels not in the bundle can do
<rjo>
is the unloading the only really hard problem?
<whitequark>
hrm
<rjo>
these kernels would be somewhat special "@freeze" and be compiled with a reduced ARTIQlang support, right?
<whitequark>
no, the restrictions aren't on the preloaded kernels, they are on the things you want later
<whitequark>
so another problem is attribute writeback.
<rjo>
well. i suspect we would have to at least sketch these restrictions to decide whether the remaining stuff would be useful or not.
<rjo>
yeah. kernels would not be methods, right?
<whitequark>
no, that's mostly irrelevant
<whitequark>
let me check some compiler guts...
<rjo>
otoh, i think "with bundle_kernels(f,g,h):" or similar and that pageflipping/backbuffering/staging of kernel we should be able to do a lot.
<whitequark>
ok, so for sure, you will not be able to use any attributes that you did not previously use in your longrunning code
<whitequark>
for any objects that you *did* use in your longrunning code.
<rjo>
i presume "with bundle_kernels()" would be recursive in its arguments.
<whitequark>
i.e. since longrunning code refers to self.core, you will not be able to use self.core.ttl1.
<whitequark>
(if you didn't use that one in longrunning code)
<whitequark>
the one which compiles for a long time. longcompiling is a better word.
<whitequark>
the reason for that restriction is that once you emit code, memory layout is fixed once and for all. no way around that.
<whitequark>
ok, so shortcompiling code should be able to quote arrays. that seems easy.
<rjo>
so if "def heavy(self): self.core.ttl.pulse()" and "def outer(self): self.heavy()" what can't I do?
<rjo>
heavy being the one that is hard to compile
<rjo>
and should be "persistent"
<whitequark>
you can't do "def outer(self): self.core.pmt.pulse()"
<rjo>
unless i already used core.pmt in outer() before calling heavy()?
<whitequark>
unless it is used in heavy()
<rjo>
ah. dependency inversion?
<whitequark>
you could put it like that, I guess.
<rjo>
ok. i have no idea where that restriction comes from but i believe you that it would have to be enforced/assumed.
<rjo>
and that restriction makes the entire exercise rather pointless.
<whitequark>
"self.core" has to be compiled while compiling "heavy"; "heavy" has to have hardcoded knowledge of "self.core"'s memory layout
<whitequark>
once "heavy" is on the core device, no changes to its layout are possible. so, no new fields.
<whitequark>
there are quite similar but very slightly less severe issues with getting *any* host objects, that were not already used before, into shortcompiling code
<whitequark>
the restrictions are less severe but severe enough that it probably makes no sense to allow introducing new host objects in shortcompiling code at all either
<rjo>
can't we have two independent and unsynchronized self.cores?
<whitequark>
then if you use self.ttl1 in longcompiling code and same self.ttl1 in shortcompiling code, they will have unsynchronized state
<whitequark>
so your inputs will break, your ddses will break, etc
<whitequark>
and there's probably some way to violate memory safety with it but I don't yet see how exactly
<whitequark>
and of course attribute writeback gets broken completely
<whitequark>
btw what does bundle_kernels being recursive in its arguments mean?
<rjo>
they don't have state. there is only state that is accessed through the runtime IIRC.
<whitequark>
they do: self.i_previous_timestamp self.o_previous_timestamp
<rjo>
recursive meaning that it will (recursively) gobble up all kernels that are used in that set and compile them together.
<rjo>
ah.
<rjo>
maybe that should go into the runtime then.
<whitequark>
actually, your solution completely defeats the point of sharing kernels
<rjo>
with the unsynchronized self?
<rjo>
selfs?
<whitequark>
yeah
<whitequark>
you can't call any of the old methods because object layout might have changed
<whitequark>
so, you cannot actually share anything at all.
<rjo>
no. you would not do "self.heavy()" but something like "self.core.frozen['heavy']()"
<whitequark>
that makes no difference.
<rjo>
you can share zero state. true.
<rjo>
state in self.
<rjo>
but you can share all state in the runtime
<whitequark>
then why do we bother with precompiling? fast kernel swap should be enough.
<rjo>
and you can share all the stuff that "happens".
<whitequark>
well, not precompiling; with persistent kernels, I mean.
<whitequark>
there's nothing to persist then!
<rjo>
the code is.
<rjo>
self.core.frozen[] persists across kernel swaps.
<whitequark>
no, you cannot reuse old code.
<rjo>
that would be precisely the idea, assuming we have the same notion of "old".
<whitequark>
I mean, you cannot ever pass any data to functions in "self.core.frozen[]" (it makes no sense to make those a hash, we don't have hashes, or put it into self.core...)
<whitequark>
I guess you could pass tuples of numbers at most
<sb0__>
also, regarding host object attributes and interleaving
<whitequark>
yeah?
<sb0__>
mh, no that won't work
<whitequark>
there has to be a type system addition and the variable alias (const_x / with const:) trick
<whitequark>
the former is relatively easy though a large amount of work. the latter needs consensus
<rjo>
please don't encode metadata in the variable names.
<whitequark>
oh, yes, it's an absolutely horrible solution and I hate it
<whitequark>
but it does serve to illustrate my point
<whitequark>
the problem is that we have condemned ourselves to a subset of Python, and Python is a really bad fit as a DSL for the problem it should solve here
<sb0__>
whitequark: re. #228. seems the rtio channel number is borked.