#zig on 2019-12-23 — irc logs at freenode.irclog.whitequark.org

2019-02-22 01:34 ChanServ changed the topic of #zig to: zig programming language | https://ziglang.org | be excellent to each other | channel logs: https://irclog.whitequark.org/zig/

00:02 cshenton_ has joined #zig

00:11 <daurnimator> leeward: `try a()` *is* `a catch |e| return e`. the latter also adds the extra entry in the traceback.

00:12 <daurnimator> * `a() catch |e| return e`

00:15 <daurnimator> https://lwn.net/Articles/806776/ <-- article on OpenBSD locking down what can make syscalls.

00:27 <fengb> Are all BSDs supposed to use libc as the syscall layer?

00:28 <daurnimator> fengb: yes. except for when they don't :)

00:28 protty has quit [Remote host closed the connection]

00:28 protty has joined #zig

00:28 <daurnimator> if zig truely has the aim of "replace C" then at some point openbsd's loader/libc would need to be written in zig :P

00:30 <cshenton_> What's the go to way to debug print from a test block?

00:30 adamkowalski has joined #zig

00:30 <adamkowalski> how do I get the scalar type of a multi dimensional array? and it's rank?

00:31 <adamkowalski> so if I have a [_]f64{1, 2, 3} I would want f64 and 0

00:31 <adamkowalski> [_][3]f64{

00:31 <adamkowalski> .{ 1, 2, 3 },

00:31 <adamkowalski> };

00:31 <adamkowalski> should be f64 and 2

00:32 <adamkowalski> the first example should have been f64 and 1 sorry

00:32 <cshenton_> You could do it with @typeInfo(@typeOf(ar)) then recursively check the container type, terminating when it's no longer an array.

00:33 <mikdusan> sounds like a good candidate for std.meta enhancement

00:34 <cshenton_> Lemme write up something real quick

00:36 adamkowalski has quit [Ping timeout: 258 seconds]

00:40 cshenton_ has quit [Remote host closed the connection]

00:50 <leeward> daurnimator: I know, I was talking about fengb's proposal, which would change that.

00:51 cshenton has joined #zig

00:52 <cshenton> adamkowalski: here you go mate https://gist.github.com/cshenton/08db802c58ebb3777a80e5797a1b6238

00:54 <cshenton> Are you guys open to contributions to std.meta? Or are things a bit too unstable at the moment?

00:54 cshenton has quit [Remote host closed the connection]

01:04 cshenton has joined #zig

01:07 Aransentin has quit [Remote host closed the connection]

01:11 adamkowalski has joined #zig

01:16 adamkowalski has quit [Ping timeout: 268 seconds]

01:27 cshenton has quit [Remote host closed the connection]

02:00 adamkowalski has joined #zig

02:01 <adamkowalski> cshenton: awesome thanks!

02:02 <adamkowalski> do you know how to handle the case where you get a slice instead? I noticed the type info says it's a pointer and the child was a void

02:03 cshenton has joined #zig

02:04 <daurnimator> cshenton: we're happy for contributions anywhere :P

02:05 <cshenton> Ah so that's where slices are. So the Pointer field of the TypeInfo enum has a field size that records the pointer type, so I can ad that case in.

02:05 <pixelherodev> z

02:05 <pixelherodev> Whoops - sorry about that!

02:06 <adamkowalski> cshenton: yeah I think we went down the same path. I started out with something that looked like what you had since I took inspiration from how expectEqual was implemented, but I coudln't figure out how to match on slices

02:06 <daurnimator> pixelherodev: z is for zig?

02:06 <cshenton> Matching on slices isn't to bad, you just add .Pointer to the switch statement and check the contents in the block. Let me slap that together and if it looks good I'll PR it.

02:07 <adamkowalski> Well before you do that, we should talk about eltype

02:07 <adamkowalski> i'm a little concerned that for array the child is called type

02:07 <adamkowalski> for pointer it's called child

02:08 <adamkowalski> and now we are introducing eltype

02:08 <adamkowalski> I think we should keep consistency if possible

02:09 <cshenton> I think that's reasonable.

02:09 <cshenton> eltype is just what julia uses in its stdlib, so I defaulted to it, consistency is more important

02:09 <adamkowalski> ah you are a julia user too?

02:09 <adamkowalski> awesome, I could use your help on a project i'm working on

02:10 <cshenton> Also would you consider [*]T a valid container type? Or just arrays and slices?

02:10 <adamkowalski> i'm building out a reverse mode automatic differntiation package

02:10 <adamkowalski> have you used TensorFlow, PyTorch, or Flux

02:10 <cshenton> Yup, used all of them

02:11 <adamkowalski> Awesome, I'm currently implementing a library called compute graph, so it's similar to tensorflow v1 and it's a static graph approach

02:11 protty has quit [Remote host closed the connection]

02:11 <adamkowalski> I figured that way we can get distribution and parallization by analyzing the graph

02:11 <cshenton> Ah nice, you should also check out cpp-taskflow for inspiration.

02:12 <adamkowalski> i'm still working out exactly how to make everything work, but can you take a look at this test case and see what you think of the API so far?

02:12 <cshenton> The internals are much more readable than tensorflow

02:12 <cshenton> Yeah happy to take a look

02:12 <adamkowalski> https://github.com/adam-r-kowalski/compute_graph/blob/master/src/session.zig#L134

02:12 <adamkowalski> I may have blatently stole the names from tensorflow, but hopefully it makes reading the code easier

02:13 <adamkowalski> I'm currently working on expanding from just working on scalars to working on tensors, hence the question I had earlier haha

02:14 <cshenton> This is a good start but I can think of ways of improving it. If each op returned both an op-id and a reference to the graph, you could chain operators:

02:15 <cshenton> like graph.mul(a, b).add(c); or something

02:18 <cshenton> Or you could have that classic split and offer both like `c = Constant(&graph, 2.0)` or `c = graph.add_constant(2.0)`

02:20 <cshenton> A lot of this is a matter of taste though.

02:20 <adamkowalski> the reason why I dind't want to make the methods be on the graph itself is because then there will be some number of "blessed functions"

02:20 <adamkowalski> you can't add new methods of your own that would be callable like that

02:21 <cshenton> Yep, that's a good point.

02:21 <adamkowalski> so I figured the consistency would be nice

02:21 <adamkowalski> unless we have ufcs

02:21 <cshenton> I was under the impression that wasn't likely to happen.

02:21 <adamkowalski> for constant I was going to use the ArrayInfo based on the var thats passed in and I will automatically construct the appropriate tensor

02:22 <cshenton> Nice

02:22 <adamkowalski> Since I don't have something like numpy or julias native array type, I also need to build a eager tensor which is the thing that will actually get passed into your operations

02:23 <adamkowalski> I'm not sure if there is anything I can do about that virtual call I have for the operation while maintaining the ability for users of the library to add custom operaitons

02:23 <adamkowalski> if you have any ideas i'm all ears

02:24 <cshenton> So tensorflow handles user-defined c++ ops with some macro magic, and python ops (which are more subgraphs) with inheritance.

02:24 <cshenton> https://www.tensorflow.org/guide/create_op

02:25 <adamkowalski> It's still using inheritence

02:25 <adamkowalski> class ZeroOutOp : public OpKernel

02:25 <cshenton> Ah yeah, good point.

02:26 <adamkowalski> But all the actual eager tensor ops will be done using static compile time polymorphism and only the graph needs to be runtime polymorphic

02:26 <adamkowalski> at first I tried giving the graph itself an element type to get away from that

02:26 <adamkowalski> but you need to be able to injest different kind of data

02:26 <adamkowalski> so now i'm going to use a union(enum) over all the different primitive types that the cpu offers

02:27 <adamkowalski> that way you can have strings and one hot encode them to turn them into vector space representations and use word embeddings

02:27 <adamkowalski> but the thing I could really use help with is implementing the backprop

02:27 <cshenton> Yeah managing the allocations for that will be a bit awkward.

02:28 <adamkowalski> I was going to have a backward method on each operation which will be responsible for managing that

02:28 <adamkowalski> and you can chain ops that have backward defiend

02:28 <adamkowalski> the allocations are all in the arena that the graph uses

02:28 <adamkowalski> then when you're done with the graph you deallocate the whole arena at once

02:29 <adamkowalski> the session uses it's own arena allocator as well, so the actuall values needed to compute the node will be ephemeral

02:29 <cshenton> Tensorflow's GPU backend will probably have something like that, I know tensorflow dominates your GPU memory, so they probably use an arena.

02:29 <adamkowalski> the actual backprop computations are pretty tricky haha. I might study how ChainRules.jl works to find out the derivative of everything is

02:30 <cshenton> Even if you're using a static graph, I think you'll make your life easy to use dual numbers.

02:30 <adamkowalski> dual numbers are great for forward mode auto diff

02:30 <adamkowalski> reverse mode auto diff scales better if the number of outputs is small but the number of inputs is large

02:30 <adamkowalski> I think we should have both though

02:30 <adamkowalski> what are your thoughts on graph mode anyway?

02:31 <cshenton> So in terms of cutting edge stuff there's a push towards autodiff being a language level thing.

02:31 <cshenton> i.e. autodiff happens on IR inside the compiler

02:32 <adamkowalski> right like Zygote.jl

02:32 <cshenton> pytorch are doing this with torchscript, and julia with zygote

02:32 <cshenton> yup

02:32 <adamkowalski> torchscript and tf.function both seem to be just jit compiles though

02:32 <adamkowalski> it seems like the trend moved away from graphs, but now we're trying to add them back, just under the hood

02:32 <adamkowalski> julia is a HUGE pain to deploy

02:33 <adamkowalski> and deploying tensorflow/pytorch is all pretty much getting a trace of your program -> creating a graph -> exporting and reading in C++

02:33 <adamkowalski> so I figured let's just use the graph in the first place haha

02:34 <cshenton> That's not unreasonable, this split between what's convenient for research vs. deployment is pretty fundamental.

02:34 <adamkowalski> right, unless we can get andrewrk to help us build something like this into the compiler itself

02:34 <adamkowalski> but I think the explicit nature of this approach has advantages

02:34 <adamkowalski> each time you create an operation it allocates, which may fail, so you need to "try" it

02:35 <adamkowalski> and executing the graph allocates so same goes

02:35 <adamkowalski> also since shapes are known at runtime (for now) they may mismatch, which may fila

02:35 <adamkowalski> fail*

02:35 <cshenton> So I've seen teams attempt to roll their own graph autodiff framework before.

02:35 <cshenton> In java

02:36 <cshenton> And they really struggled because they sort of went halfway on ergonomics and performance and got the worst of both worlds.

02:36 <adamkowalski> I care a lot about reliability, and scalability. I think that matters much more then prototyping. Plus I think it's not that bad to write models even with this explicit API

02:36 <adamkowalski> I just forces you to think about allocation, failure, and types

02:36 <cshenton> For sure

02:37 <adamkowalski> We have all these models currently written in Python and it's a huge pain to refactor

02:37 <cshenton> So the place where current frameworks fail is definitely things like compilation / startup time.

02:37 <adamkowalski> there is no compiler so when you add a parameter to a function or remove one, how do you find all the places in your code that it is used

02:37 <adamkowalski> we just hope our unit test catches it

02:37 <cshenton> The push towards dynamic graphs is really about improving developer iteration times.

02:38 <adamkowalski> right, but I disaggree with the premise

02:38 <adamkowalski> I think that then you end up with a bunch of code which maybe you got out the door slightly faster

02:38 <adamkowalski> but just like dynamic typing vs static typing, the tracer baced approaches are harder to maintain in my opinion

02:39 <adamkowalski> development slows down as you increase the number of lines of code, and intent is lost because you don't have the documentaiton that types / shapes can provide

02:39 <cshenton> I agree, my point was that a graph computation framework in zig would have that as a key value-add.

02:39 <cshenton> Good iteration times without things being hard to debug at scale.

02:39 <adamkowalski> I hope so! hopefully as I work on this more and more you and others can keep reviewing the code and making sure you like the direction

02:40 <cshenton> I'd be keen to keep an eye on it, I'm using zig for games engine stuff primarily, but I used to do a lot of probabilistic programming stuff so I'm quite familiar with these kinds of weird scientific computing use cases.

02:41 <adamkowalski> probabilistic programming is definitily a huge use case. being able to backprop through a distribution is really important

02:41 <adamkowalski> in tensorflow that was a huge pain, but you could solve it with the reparametization trick

02:41 <adamkowalski> i don't think you can do that in Julia yet either

02:42 <cshenton> Yeah tensorflow-probability fixed a lot of that.

02:42 <adamkowalski> true, honestly I like a lot of things about tensorflow, I just don't like that it's in Python haha

02:42 <cshenton> I personally think the pytorch c++ apis are really nice. They factored them out really well.

02:42 <adamkowalski> I think we can do better though

02:43 <cshenton> Well the difficulty with backprop through samplers / likelihoods is that writing them in autodiff friendly form is awkward.

02:43 <adamkowalski> they use shared pointer everywhere

02:43 <cshenton> oof

02:43 <adamkowalski> it's reference semantics which doesn't make sense in C++ which is so value oriented

02:43 ur5us has joined #zig

02:43 <adamkowalski> They claim it's because it helps the Python data scientists transition

02:44 <adamkowalski> another use case we want to support is differential equations, just like Julia. It works so well with Flux

02:45 <adamkowalski> but first thing is first. I just want a simple "dense/linear/affine" layer that I can backprop through and use that to solve MNIST

02:46 <adamkowalski> then write a simple article about why Zig for machine learning, and get people excited and contributing

02:46 <adamkowalski> then we can do convolutions, lstms, and attention mechanisms

02:46 <cshenton> One thing to keep in mind is that the existing frameworks are the way they are not because the engineers were bad.

02:46 <cshenton> But because the users want a particular interface.

02:47 <adamkowalski> oh no I definitely agree with that, I think everyone working on those projects is doing an amazing job

02:47 <cshenton> So I guess the question is, who are the users who want this kind of API for graph computation? It might be they're in a particular field, so focussing on features in that field would be the way to go.

02:47 <adamkowalski> I think we have the benefit of hindsight and not needing to be backwards compatible though

02:48 <adamkowalski> I think the users of this API want something that is meant for production rather then research

02:48 <adamkowalski> however, I am hoping the API will be pleasant enough that you can still use it for research

02:49 <adamkowalski> i'm not sure there must be this clear distinction, I guess you could just argue it's slightly less ergonomic

02:49 <adamkowalski> but it will make sure that you handle all possible error conditions, and when you need to extend it, you don't need to drop into a lower level language

02:50 <adamkowalski> finally you can deploy it without needing swift, javascript, java, etc as you can just use web assembly or if you target the C ABI you can have much greater reach

02:50 cshenton has quit [Remote host closed the connection]

03:23 adamkowalski has quit [Quit: Lost terminal]

03:50 dddddd has quit [Remote host closed the connection]

03:59 ur5us has quit [Ping timeout: 260 seconds]

04:04 dingenskirchen has quit [Ping timeout: 248 seconds]

04:13 return0e has quit [Read error: Connection reset by peer]

04:14 return0e has joined #zig

04:45 darithorn has joined #zig

05:12 cshenton has joined #zig

05:12 <cshenton> adamkowalski: One thing you could do is to target your zig library as a backend for https://github.com/tensorflow/mlir or https://onnx.ai/ or even the protobuf serialisation of tensorflow's internal graph format.

05:12 <cshenton> Because then you could have a user story like: transition your existing models using this path, and start writing new models with the front end inside the library itself.

05:12 <cshenton> Either way, I think it's a compelling use case. Even a replacement for the tensorflow c++ api that isn't a nightmare to build would be a huge value add for a lot of use cases.

05:12 <cshenton> Also here's the updated arrayInfo that works with slices as well as arrays. You should also decide if you consider Vectors to be a container type or an integral type. https://gist.github.com/cshenton/08db802c58ebb3777a80e5797a1b6238

05:12 cshenton has quit []

05:21 return0e has quit [Read error: Connection reset by peer]

05:21 return0e has joined #zig

05:33 leeward has quit [Quit: *Poof*]

05:45 mahmudov has quit [Ping timeout: 265 seconds]

06:32 THFKA4 has quit [Ping timeout: 246 seconds]

06:44 lunamn has quit [Ping timeout: 258 seconds]

06:50 lunamn has joined #zig

08:45 darithorn has quit [Remote host closed the connection]

09:04 return0e_ has joined #zig

09:17 dingenskirchen has joined #zig

09:21 dddddd has joined #zig

09:36 WilhelmVonWeiner has joined #zig

09:49 WilhelmVonWeiner has left #zig [#zig]

09:53 dingenskirchen has quit [Remote host closed the connection]

09:54 dingenskirchen has joined #zig

09:58 metnel has quit [Quit: leaving]

10:02 return0__ has joined #zig

10:04 return0e has quit [Ping timeout: 260 seconds]

10:31 daex has quit [Ping timeout: 268 seconds]

10:32 daex has joined #zig

10:47 _whitelogger has joined #zig

11:44 mahmudov has joined #zig

12:19 daex has quit [Ping timeout: 268 seconds]

12:21 daex has joined #zig

12:37 frmdstryr has joined #zig

13:35 return0__ has quit [Ping timeout: 268 seconds]

13:48 return0e has joined #zig

14:46 leeward has joined #zig

16:16 return0e_ has quit [Remote host closed the connection]

16:17 return0e_ has joined #zig

16:23 return0e_ has quit [Ping timeout: 240 seconds]

16:24 return0e_ has joined #zig

16:55 <Snektron> Ah, an uni assignment with which you can pick the language

16:55 <Snektron> You know what that means

16:58 <leeward> whitespace?

16:59 <mq32> Malbolge?

16:59 <leeward> Y

16:59 <Snektron> I did hand in assembly at some point

17:00 <leeward> What architecture?

17:00 <Snektron> x86

17:00 * leeward goes back to sleep.

17:00 <Snektron> The assignment was to make a simple shell in C

17:08 <mq32> const uint8_t code[] = "\x11…"; int main() { return ((int(*)())code)(); }

17:08 <mq32> :D

17:10 <leeward> C is good for defeating its own purpose.

17:12 <mq32> oh yeah

17:13 <leeward> Though that's machine code, not assembler language.

17:19 wilsonk has quit [Ping timeout: 268 seconds]

17:21 <leeward> /pedant

17:21 return0e_ has quit []

17:25 <fengb> mq32: are you casting a string literal to a function pointer?

17:26 <mq32> fengb: yeah, so? :D

17:26 <mq32> that's called "inline machine code" :D

17:26 wilsonk has joined #zig

17:31 <Snektron> That shouldnt work

17:31 <Snektron> Unless you have some system without virtual memory that is

17:46 BitPuffin has quit [Quit: killed]

17:46 D3zmodos has quit [Quit: killed]

17:46 aperezdc has quit [Quit: killed]

17:46 Snektron has quit [Quit: killed]

17:46 fengb has quit [Quit: killed]

17:46 Demos[m] has quit [Quit: killed]

17:46 vegai has quit [Quit: killed]

17:47 AlexMax has quit [Quit: killed]

18:02 Demos[m] has joined #zig

18:20 return0e has quit [Read error: Connection reset by peer]

18:21 return0e has joined #zig

18:22 Snektron has joined #zig

18:22 <Snektron> How long are static builds hosted on ziglang.org/builds?

18:22 <Snektron> apart from major versions

18:30 BitPuffin has joined #zig

18:30 dtz has joined #zig

18:30 mattmurr has joined #zig

18:30 D3zmodos has joined #zig

18:30 AlexMax has joined #zig

18:30 fengb has joined #zig

18:30 vegai has joined #zig

18:35 mahmudov has quit [Ping timeout: 260 seconds]

18:45 hoppetosse has joined #zig

18:47 mahmudov has joined #zig

18:48 <leeward> Snektron: Why wouldn't it work with virtual memory?

18:48 <hoppetosse> would it be feasible to have the compiler output the actual active field on "access of inactive union field" safety check?

18:48 <hoppetosse> *not compiler, stacktrace

18:51 darithorn has joined #zig

18:59 <fengb> leeward: most OSs mark data separately from executable code so that’d most likely segfault when trying to execute data

19:03 <leeward> fengb: Oh, so not virtual memory.

19:04 <leeward> Yeah, I know about NX bits.

19:06 <leeward> It's more of a pain, but NX can be cleared on .bss.

19:08 <leeward> Or whatever page table entry holds .bss, anyway.

19:11 lygaret has joined #zig

19:27 adamkowalski has joined #zig

19:50 <leeward> Whoo, LoggingAllocator really doesn't work, does it?

20:00 clktmr has left #zig [#zig]

20:20 leeward has quit [Remote host closed the connection]

20:22 leeward has joined #zig

20:24 <lygaret> hey, if I have a slice of bytes, of a known length, is there a way to cast that to an array of the same length? I'm trying to use `std.mem.bytesAsValue(Header, &data[8..16])`, but I'm getting a compile error of: `error: expected *[N]u8 , passed [*]const u8`

20:25 <lygaret> er: `expected *[N]u8 , passed *[]const u8` (prev was from trying `data[8..16].ptr`)

20:25 <lygaret> it seems I'm missing something, but I'm having trouble figuring out why these wouldn't be the same type?

20:35 <fengb> Slices don’t cast back to arrays yet: https://github.com/ziglang/zig/issues/863

20:41 <lygaret> @fengb, that's perfect thank you - didn't think to look at the bug tracker, and there's a good workaround in the comments there

20:41 <lygaret> appriciated

20:42 daex has quit [Ping timeout: 260 seconds]

20:46 daex has joined #zig

20:51 <fengb> np

20:58 Pistahh has quit [Remote host closed the connection]

21:22 daex has quit [Ping timeout: 265 seconds]

21:24 casaca has quit [Ping timeout: 265 seconds]

21:27 daex_ has joined #zig

21:27 <Snektron> <leeward "Snektron: Why wouldn't it work w"> Because i expect static data to be in read/write memory and not in executable memory

21:50 lygaret has quit [Quit: Leaving.]

21:53 riba has joined #zig

22:39 WendigoJaeger has quit [Quit: Connection closed for inactivity]

22:52 adamkowalski has quit [Ping timeout: 240 seconds]

22:54 tbodt has joined #zig

23:39 tobbez has quit [Ping timeout: 250 seconds]

23:40 tobbez has joined #zig

23:52 riba has quit [Ping timeout: 265 seconds]