#zig on 2019-07-30 — irc logs at freenode.irclog.whitequark.org

2019-02-22 01:34 ChanServ changed the topic of #zig to: zig programming language | https://ziglang.org | be excellent to each other | channel logs: https://irclog.whitequark.org/zig/

00:37 bwb_ is now known as bbrittain

00:40 ltr- has quit [Quit: leaving]

00:43 kristoff_it has joined #zig

00:48 kristoff_it has quit [Ping timeout: 268 seconds]

00:59 bbrittain is now known as bwb_

02:02 laaron has joined #zig

02:10 kristoff_it has joined #zig

02:13 Ichorio has quit [Ping timeout: 264 seconds]

02:14 kristoff_it has quit [Ping timeout: 248 seconds]

02:16 curtisf has joined #zig

02:17 ltriant has quit [Ping timeout: 272 seconds]

02:19 dimenus has joined #zig

02:20 <dimenus> anyone having issues building zig (master) on mingw64?

02:22 <dimenus> getting a bunch of undef references to z3_* libclangStaticAnalyzerCore

02:22 <dimenus> even with Z3 installed

02:26 <daurnimator> dimenus: 2965 and 2958 were recently merged.... does it work from before they were merged?

02:26 <scientes> we don't need maskedLoad or maskedStore

02:27 <scientes> you can just use | and & and let the optimizer figure it out

02:31 <daurnimator> scientes: I don't think that fits zig's philosophy

02:31 <scientes> daurnimator, we could just put it in std lib

02:31 <scientes> we only need @gather and @scatter

02:31 <daurnimator> scientes: e.g. if you have half of a vector next to something with PROT_NONE: i'd want to be using a masked load.

02:32 <scientes> daurnimator, ahh unaligned loads?

02:32 <daurnimator> scientes: not unaligned. but when you *do not* want to read from memory

02:32 <scientes> the optimization should just be guaranteed

02:32 <scientes> daurnimator, yeah, but it will always be 16-byte aligned, so PROT_NONE is impossible

02:33 <daurnimator> howso?

02:33 <scientes> it has to be page aligned

02:33 <daurnimator> 16-byte aligned can still go over a page long if your vector is > 16 bytes.....

02:33 <scientes> and thus is simd aligned

02:34 <scientes> again, you just have to guarantee that the optimization is wrong

02:34 <scientes> optimization is right

02:34 <scientes> LLVM has already annonced that these instructions will be deprecated too

02:35 <scientes> I feel its a bug in LLVM

02:35 <daurnimator> whats a bug?

02:35 <scientes> if it isn't getting optimized to a maskedstore/maskedload

02:35 <scientes> if you use | and &

02:35 <daurnimator> why should it be?

02:35 <daurnimator> | and & can be much slower than a masked store/load

02:36 <scientes> its equilivent

02:36 <daurnimator> no its not

02:36 <scientes> the optimizer would figure it out

02:36 <daurnimator> from http://llvm.org/docs/LangRef.html#llvm-masked-load-intrinsics > Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar load operations. The result of this operation is equivalent to a regular vector load instruction followed by a ‘select’ between the loaded and the passthru values, predicated on the same mask.

02:36 <daurnimator> a sequence of branches guarding loads is for sure slower than plain `|` and `&`

02:37 <scientes> oh I see

02:37 <scientes> uggh, its still code smell to me

02:37 <scientes> they could add a optimization attribute

02:38 <daurnimator> scientes: from the zig side: if I didn't have @maskedLoad: how would I indicate that I want to load half a vector without reading the other half?

02:38 <scientes> I only see that it could cause problems if the pointer was unaligned

02:38 <daurnimator> it's not about alignment

02:38 <scientes> daurnimator, how do you specifiy that you only want to read one bit?

02:39 <daurnimator> you don't. a byte is called a byte because its the smallest addressable quantity

02:39 <daurnimator> however a vector is more than a byte.

02:39 <daurnimator> (at least, most of the time... I guess we can have vectors of u1 )

02:39 <scientes> and those get bit-packed

02:40 <daurnimator> right

02:40 <daurnimator> but imagine a 64 byte vector

02:40 <daurnimator> split across two pages

02:40 <scientes> it would be two vector loads

02:40 <scientes> oh, x86 is weird in that way...

02:40 <scientes> and also supports unaligned

02:41 <daurnimator> a 64 byte vector only needs to be aligned to 16 bytes.

02:41 <scientes> daurnimator, https://lwn.net/Articles/793455/#Invented%20Loads

02:42 <scientes> the compiler can load whatever the hell it wants

02:42 <scientes> it just can't modify it

02:42 <daurnimator> ??

02:42 <daurnimator> the whole point of a maskedload is to tell it it can't load whatever it wants

02:44 <scientes> well two things 1) I still think an instruction is not necessarily, but could be an attribute flag, 2) it seems like using a hammer to solve a edge condition

02:44 <daurnimator> > Edge cases matter.

02:44 <scientes> like it would be faster to test if you are at a boundary

02:44 <scientes> yes of course

02:45 <scientes> but the compiler doesn't know where the boundries

02:45 <daurnimator> exactly!

02:45 <scientes> so it generates ship code

02:45 <daurnimator> which is why it needs to be a builtin

02:45 <scientes> *shit

02:45 <scientes> it would be easier to have a check before doing the load/store

02:45 <scientes> and only then use this expensive one

02:45 <scientes> which wouldn't have to then be implemented this way

02:46 <daurnimator> `x | mask` -> may or may not access masked off bits: don't care, do whatever is fast. `@maskedLoad(x, mask)` -> do *not* access masked off bits

02:46 <scientes> well, I guess this would be OK, like on power 8 there is a load/store that takes a length

02:46 <scientes> but then you would have to do a ctz + clz + popcount

02:47 <scientes> instead of just one clz or ctz depending on where the border was that you were avoiding

02:48 <scientes> i doubt they would have the optimization to avoid that

02:48 <scientes> and see how the mask was generated

02:49 <scientes> and you could instead just use differnt length vector types

02:49 <scientes> like use a 61-byte vector

02:49 <scientes> LLVM is deprecating these instructions too

02:50 <scientes> daurnimator, how is your use-case not solved by generating a llvm type that is the length you want?

02:50 <scientes> zig can't do that well right now, but LLVM is working on it

02:51 <scientes> basically you would just page align your vector loads/stores

02:51 <scientes> by splitting them

02:52 <scientes> <daurnimator> but imagine a 64 byte vector

02:52 <scientes> <daurnimator> split across two pages

02:52 <scientes> <daurnimator> it's not about alignment

02:52 <scientes> yes it is

02:52 laaron has quit [Quit: ZNC 1.7.1 - https://znc.in]

02:53 <scientes> you can't split pages if you are naturally aligned

02:53 <scientes> and page alignment is not relevent here

02:54 <scientes> *stack alignement is not relevent

02:55 ltriant has joined #zig

02:57 laaron has joined #zig

02:58 dimenus has quit [Ping timeout: 272 seconds]

02:58 <daurnimator> scientes: yes you can

02:59 <scientes> a 4k stack means the bottom 12 bits are zeros

02:59 <daurnimator> scientes: e.g. an avx-512 loads can be 64 bytes. yet it only needs to be 16-byte alignerd

02:59 <scientes> daurnimator, a 64-byte vector might have alignOf of 16 bytes

03:00 <scientes> yes, we were thinking the same thing

03:00 ltriant has quit [Ping timeout: 245 seconds]

03:00 <scientes> but that's not natural alignment

03:00 <daurnimator> ??

03:00 kristoff_it has joined #zig

03:00 <scientes> https://github.com/torvalds/linux/blob/master/Documentation/unaligned-memory-access.txt#L36

03:00 <scientes> natural alignment has a strict definition

03:01 <scientes> that there is only a single bit in the size

03:01 <daurnimator> oh okay; I didn't know that definition

03:02 <scientes> maskedLoad just seems very x86 specific to me, and kindof a hack, and doing the optimizations to make it possible to use right on ppc for example, I doubt those have been done

03:03 <scientes> but yeah with the way avx-512 has 16-bit alignOf, maybe it is necessary

03:03 <scientes> *128-bit

03:04 kristoff_it has quit [Ping timeout: 245 seconds]

03:04 <daurnimator> scientes: it probably defaults to the series of branches lowering I quoted above

03:05 <scientes> even though on vsx it doesn't need to, but the optimizations would be difficult

03:05 <scientes> and the programmer would probably not generate the mask right for that either

03:05 <scientes> even though C11 would allow such a zombie read

03:06 <daurnimator> scientes: though also: doesn't llvm allow arbitrary vector sizes?

03:06 <scientes> inside of the page

03:06 <scientes> daurnimator, yes it does, but not variable length

03:06 <scientes> the lowering of those is also quite bad right now

03:06 <scientes> but it does work

03:06 <daurnimator> scientes: e.g. I could have a @Vector(8, u32768) for a vector of pages.

03:08 <scientes> lol, I don't have enough RAM to compile that

03:09 <daurnimator> huh?

03:09 <daurnimator> Why would that take lots of ram to compile?

03:09 <daurnimator> it's just 8*4KB.

03:10 <scientes> well I did 1MB

03:10 <curtisf> while it's kinda neat that zig (theoretically) supports integers that large, I question how reasonable it is to support numbers that require compiling loops to do things like addition

03:10 <scientes> also clang has a more sane max

03:11 <scientes> I was using gcc

03:11 <scientes> d.c:2:36: error: vector size too large

03:11 <scientes> typedef int badass __attribute__ ((vector_size (1024*1024)));

03:11 <scientes> daurnimator, yeah that is too large for clang

03:12 <daurnimator> curtisf: not different to targetting a 8bit microcontroller and adding u32s...

03:13 <scientes> yeah clang's max is 1024 vector_length

03:13 <curtisf> there's a difference of scale, having 4 instructions unrolled is totally different from having hundreds

03:13 NI33_ has quit [Ping timeout: 245 seconds]

03:13 <curtisf> you'd probably not want to have addition of u32768 be unrolled, which is odd

03:14 <daurnimator> curtisf: my immediate thought is of https://en.wikipedia.org/wiki/Zero_one_infinity_rule

03:14 <scientes> yeah the gcc 1MB simd multiplication is still compiling

03:14 <daurnimator> scientes: you had to pick multiplication :P

03:14 <curtisf> I agree, there's no good place to put the limit. This is more like a "please don't do this" feature

03:15 <daurnimator> scientes: try e.g. a XOR instead. which isn't completely unreasonable for some cryptography algorithms.

03:16 <scientes> daurnimator, oh wow, it just turned it into a loop!

03:16 <scientes> without and simd at all

03:17 <scientes> oh, that is because its xor with itsself

03:17 <daurnimator> xor with itself should just give you a zero vector :P

03:17 <scientes> yeah with optimizations it used memset()

03:18 <scientes> d.c:3:8: note: The ABI for passing parameters with 1048576-byte alignment has changed in GCC 4.6

03:18 <scientes> lawl

03:19 <scientes> yeah, not enough RAM

03:19 <daurnimator> scientes: can you share your sample. I want to try for myselfd

03:20 <scientes> http://paste.debian.net/1093523/

03:20 <scientes> it would be easy to do in zig too

03:20 <scientes> i'm just lazy

03:20 <scientes> and i wanted to try gcc

03:20 ltriant has joined #zig

03:22 <daurnimator> scientes: https://godbolt.org/z/I2ulBM works

03:22 <scientes> but it doesn't generate a loop

03:22 <scientes> because we are abusing it

03:24 <scientes> daurnimator, aww yeah https://godbolt.org/z/rGLdw0

03:32 _whitelogger has joined #zig

03:49 kristoff_it has joined #zig

03:54 kristoff_it has quit [Ping timeout: 272 seconds]

04:21 darithorn has quit [Quit: Leaving]

04:34 Sahnvour has quit [Ping timeout: 245 seconds]

04:35 Sahnvour has joined #zig

05:09 curtisf has quit [Remote host closed the connection]

05:16 kristoff_it has joined #zig

05:21 kristoff_it has quit [Ping timeout: 248 seconds]

05:39 laaron has quit [Quit: ZNC 1.7.1 - https://znc.in]

05:43 laaron has joined #zig

06:14 jjido has joined #zig

06:29 jjido has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

06:48 hio has joined #zig

06:57 ltriant has quit [Quit: leaving]

07:08 kristoff_it has joined #zig

07:13 kristoff_it has quit [Ping timeout: 246 seconds]

07:23 laaron has quit [Quit: ZNC 1.7.1 - https://znc.in]

07:24 laaron has joined #zig

07:34 talin has joined #zig

07:38 marijnfs has joined #zig

07:45 kristoff_it has joined #zig

07:50 kristoff_it has quit [Ping timeout: 258 seconds]

08:11 laaron has quit [Remote host closed the connection]

08:13 laaron has joined #zig

08:51 avoidr has quit [Quit: leaving]

08:55 kristoff_it has joined #zig

09:12 commande1 has joined #zig

09:12 commander has quit [Ping timeout: 244 seconds]

09:29 laaron has quit [Remote host closed the connection]

09:34 laaron has joined #zig

09:54 shachaf has quit [Ping timeout: 245 seconds]

10:02 shachaf has joined #zig

10:05 marijnfs has quit [Ping timeout: 245 seconds]

10:38 omglasers2 has joined #zig

10:43 kristoff_it has quit [Remote host closed the connection]

10:43 NI33_ has joined #zig

10:55 kristoff_it has joined #zig

11:00 kristoff_it has quit [Ping timeout: 245 seconds]

11:20 knebulae has quit [Ping timeout: 248 seconds]

11:22 rappet has quit [Quit: -]

11:22 rappet has joined #zig

11:31 marijnfs has joined #zig

11:33 kristoff_it has joined #zig

11:34 kristoff_it has quit [Remote host closed the connection]

11:35 mattmurr has joined #zig

11:44 kristoff_it has joined #zig

11:58 GoorMoon has joined #zig

12:02 SimonNa has quit [Ping timeout: 246 seconds]

12:04 SimonNa has joined #zig

12:05 GoorMoon has quit [Quit: rcirc on GNU Emacs 25.2.1]

12:09 GoorMoon has joined #zig

12:09 SimonN has joined #zig

12:09 SimonNa has quit [Ping timeout: 268 seconds]

12:13 GoorMoon has quit [Quit: ERC (IRC client for Emacs 25.2.1)]

12:36 dimenus has joined #zig

12:41 <gonz_> Hmm, is subsystem detection broken?

12:41 dimenus has quit [Remote host closed the connection]

12:42 <gonz_> Having `WinMain` and not `main` seems to still give `windows.subsystem == .Console`

13:20 FireFox317 has joined #zig

13:22 FireFox317 has quit [Remote host closed the connection]

13:22 NI33_ has quit [Ping timeout: 272 seconds]

13:43 omglasers2 has quit [Quit: leaving]

13:44 omglasers2 has joined #zig

13:58 NI33_ has joined #zig

14:17 halosghost has joined #zig

14:53 andersfr has joined #zig

15:00 <Tetralux> What's a good example of how to use .format on your type so that you can print it with warn?

15:08 darithorn has joined #zig

15:11 <Tetralux> Nvm - figured it out :)

15:13 andersfr has quit []

15:13 sammich has quit [Quit: No Ping reply in 180 seconds.]

15:14 sammich has joined #zig

15:25 knebulae has joined #zig

15:32 omglasers2 has quit [Quit: leaving]

15:41 Akuli has joined #zig

16:02 laaron has quit [Remote host closed the connection]

16:22 tracernz has quit [Ping timeout: 252 seconds]

16:26 tracernz has joined #zig

16:35 tracernz has quit [Ping timeout: 245 seconds]

16:38 tracernz has joined #zig

16:42 samtebbs has quit [Quit: leaving]

16:52 marijnfs has quit [Ping timeout: 272 seconds]

17:09 omglasers2 has joined #zig

17:25 kristoff_it has quit [Ping timeout: 244 seconds]

17:34 kristoff_it has joined #zig

17:39 kristoff_it has quit [Ping timeout: 272 seconds]

17:41 emekankurumeh[m] has joined #zig

17:43 <emekankurumeh[m]> dimenus: it seems like I introduced a regression

17:59 kristoff_it has joined #zig

18:01 omglasers2 has quit [Quit: Leaving]

18:03 kristoff_it has quit [Ping timeout: 246 seconds]

18:03 omglasers2 has joined #zig

18:04 omglasers2 has quit [Client Quit]

18:08 omglasers2 has joined #zig

18:24 kristoff_it has joined #zig

18:27 darithorn has quit [Quit: Leaving]

18:29 kristoff_it has quit [Ping timeout: 272 seconds]

18:49 kristoff_it has joined #zig

18:53 kristoff_it has quit [Ping timeout: 258 seconds]

18:56 wootehfoot has joined #zig

19:14 kristoff_it has joined #zig

19:18 kristoff_it has quit [Ping timeout: 244 seconds]

19:19 Akuli has quit [Quit: Leaving]

19:24 Hourglass has joined #zig

19:26 kristoff_it has joined #zig

19:29 <Hourglass> Hello, I'm having trouble building clashos (https://github.com/andrewrk/clashos) with zig

19:30 kristoff_it has quit [Ping timeout: 245 seconds]

19:31 <Hourglass> I'm using zig 4.0 and the host machine is a 64bit debian

19:32 <scientes> Hourglass, you should use master for now

19:32 <Hourglass> the actual compilation seems to work fine but then the build script stalls when using objcopy

19:33 <Hourglass> so i'm not sure swithing to master will do any good

19:34 <Hourglass> specifically objcopy complains that the input file format is not recognizable

19:34 <Hourglass> I've checked with hexdump and it looks like a regular ELF file (and readelf agrees also)

19:36 <nrdmn> Hourglass: what does readelf -h say about it?

19:36 <emekankurumeh[m]> @dimenus: i'm not getting any build errors...

19:37 <emekankurumeh[m]> are you static linking or dynamic linking?

19:38 <nrdmn> Hourglass: I mean, what's the exact output

19:41 <Hourglass> En-tête ELF:

19:41 <Hourglass> Classe: ELF64

19:41 <Hourglass> Données: complément à 2, système à octets de poids faible d'abord (little endian)

19:41 <Hourglass> Version: 1 (current)

19:41 <Hourglass> Magique: 7f 45 4c 46 02 01 01 00 00 00 00 00 00 00 00 00

19:41 <Hourglass> OS/ABI: UNIX - System V

19:41 <Hourglass> Version ABI: 0

19:41 <Hourglass> Type: EXEC (fichier exécutable)

19:41 <Hourglass> Machine: AArch64

19:41 <Hourglass> Version: 0x1

19:41 <Hourglass> Adresse du point d'entrée: 0x0

19:41 <Hourglass> Début des en-têtes de programme : 64 (octets dans le fichier)

19:41 <halosghost> LC_ALL=C might be useful for some folk

19:43 <nrdmn> Hourglass: does your objcopy support AArch64?

19:45 <Hourglass> nrdmn: good question, I don't know. How can I check that ?

19:47 <nrdmn> objcopy --info

19:50 <Hourglass> nrdmn: good catch, it doesn't. I suppose its logical since my host computer is x86_64 and I'm trying to target arm64. I'm a bi new to this cross-compilation stuff

19:50 <Hourglass> *bit

20:00 <nrdmn> hmm, there seems to be no way to create a File with an outStream() method that returns an OutStream with a custom OutStream.stream.writeFn

20:03 kristoff_it has joined #zig

20:08 kristoff_it has quit [Ping timeout: 272 seconds]

20:13 Ichorio has joined #zig

20:17 <nrdmn> which makes it difficult to support platforms where stdin, stdout, stderr aren't files

20:21 jjido has joined #zig

20:26 <emekankurumeh[m]> daurnimator: can you point me to some implementations or header files that define `sockaddr_any`?

20:28 jjido has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

20:36 kristoff_it has joined #zig

20:37 jjido has joined #zig

20:38 omglasers2 has quit [Quit: Leaving]

20:45 FireFox317 has joined #zig

20:47 <FireFox317> gonz_: I found the issue for the subsystem detection. You have to declare the WinMain function as a pub function. In the example that andrewrk gave this wasn't specified and that caused the issue.

20:50 <FireFox317> Maybe we should add the same restriction that the 'normal' main function has. Namely the fact that the main function needs to be pub.

20:53 Hourglass has left #zig [#zig]

20:54 <FireFox317> Not sure how to properly solve this

20:54 <gonz_> Indeed, that was the reason

20:54 <gonz_> `pub export` seems to do the trick

21:02 <FireFox317> I will make a issue to track this.

21:04 wootehfoot has quit [Read error: Connection reset by peer]

21:04 <gonz_> In retrospect it's somehow obvious

21:07 knebulae has quit [Ping timeout: 268 seconds]

21:08 <FireFox317> Yeah it is, but the detection should be polished a bit

21:12 <gonz_> The annoying thing is it half-working without the `pub`, yes

21:12 <gonz_> If it didn't, you might just try adding it and end up where you needed to be

21:12 halosghost has quit [Quit: WeeChat 2.5]

21:25 FireFox317 has quit [Remote host closed the connection]

21:28 knebulae has joined #zig

21:30 kristoff_it has quit [Remote host closed the connection]

21:35 darithorn has joined #zig

21:42 kristoff_it has joined #zig

21:43 jmiven has quit [Quit: reboot]

21:44 jmiven has joined #zig

21:45 reductum has joined #zig

21:47 kristoff_it has quit [Ping timeout: 272 seconds]

22:03 jjido has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

22:05 <andrewrk> how does one order a pinebook pro?

22:06 <andrewrk> gonz_, it's a bug, pub should not be required if the function is exported

22:07 <THFKA4> i think you need a forum account that's older than a month

22:08 <THFKA4> which you then use to get a coupon code during checkout

22:14 <andrewrk> I found a support email address. I sent them a description of my use case, hopefully they let me buy one

22:43 <bwb_> woah, they are way cheaper than I expected

22:45 ltriant has joined #zig

22:46 <daurnimator> emekankurumeh[m]: https://github.com/wahern/cqueues/blob/master/src/lib/socket.h#L233

22:46 Ichorio has quit [Ping timeout: 264 seconds]

22:57 <fengb> Wow that's cheap

23:06 <daurnimator> andrewrk: FWIW I have a box from works-on-arm. it's a beast of a machine

23:06 <daurnimator> we'd be allowed to use it for zig if we wanted

23:06 <daurnimator> just give me a daemon to run on it to run CI jobs

23:06 <daurnimator> which I guess is where gitlab CI might come in

23:06 <andrewrk> daurnimator, I believe that's what https://github.com/ziglang/zig/pull/2978 is

23:06 <daurnimator> because you can run your own runners.

23:07 <daurnimator> andrewrk: ah yes. "Andrew could also set up an official gitlab mirror, and then just authenticate the arm64 CI runner"

23:08 <daurnimator> andrewrk: I think that would be a good way forward for a lot of platforms where existing CI providers don't exist

23:08 <andrewrk> agreed

23:09 kristoff_it has joined #zig

23:12 <daurnimator> andrewrk: if you set up the gitlab account I'm happy to start the runner

23:13 <daurnimator> the box is a https://www.packet.com/cloud/servers/c2-large-arm/ --> 32 cores; 128GB RAM; 20GBit net connection...

23:13 kristoff_it has quit [Ping timeout: 244 seconds]

23:26 <bwb_> I might have a spare cavium around here...

23:26 <bwb_> lemme check

23:31 hio has quit [Quit: Connection closed for inactivity]

23:55 <andrewrk> daurnimator, nice, let me try setting that up now