<whitequark>
Drup: yes, that's fine. but I wish it would be possible to use syntax(foo).
<Drup>
whitequark: soon™
<whitequark>
\o/
<companion_cube>
oh, you can put attributes on fun% ?
<Drup>
yes
<whitequark>
yeah, pretty much everywhere
<Drup>
on all syntactic constructs
<Drup>
whitequark: I don't know if you saw it, but you should be able to put them in top level bindings too, now
<whitequark>
[@@@foo] ?
<Drup>
no, "let%foo ..."
<Drup>
(not let in)
<whitequark>
ah
<companion_cube>
so I guess while%lwt and such will remain
<companion_cube>
sooo nice
<Drup>
yes
<Drup>
but not the "finally" :/
<Drup>
still not sure about this one
<companion_cube>
try with e -> [@@finally "yolo"] ?
<whitequark>
^
<Drup>
not very nice, to say the least.
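[For readers following along, the syntax under discussion is standard OCaml ≥ 4.02: floating attributes, attributes on expressions, and extension nodes like let%foo (which need a ppx rewriter to compile). A minimal sketch; the attribute name `illustrative` below is made up for the example:

```ocaml
(* Floating attribute: applies to the enclosing structure.
   [@@@warning "-32"] disables the unused-value warning below it. *)
[@@@warning "-32"]

let unused = 0  (* no warning, thanks to the floating attribute *)

(* Attributes attach to almost any construct, expressions included;
   unknown attribute names are simply ignored by the compiler. *)
let answer = (42 [@illustrative "attaches to the literal"])

let () = assert (answer = 42)

(* let%foo e is also valid syntax, but is rejected unless a ppx
   rewriter (e.g. lwt's let%lwt) expands the extension node, so it
   is only mentioned here in a comment. *)
```
]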
<Drup>
"That camlp4 can handle OCaml syntax (two OCaml syntaxes, in fact, the original one and a revised one introduced specifically for camlp4) is just a special case."
<whitequark>
oh
<Drup>
there is some grammatical weirdness in this sentence.
<whitequark>
I accidentally a word
<companion_cube>
ah, GETENV
<companion_cube>
I'd like something like this to embed the current commit hash in the code
<whitequark>
companion_cube: yep. I see no point in spending bytes on explaining how to extract attributes from perhaps the most boring AST I've seen
<whitequark>
so I just cut it down to the simplest possible example
<companion_cube>
indeed
<whitequark>
I'll probably implement your commit hash thing, to experiment with packaging
<whitequark>
and publish
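[The GETENV example being referenced substitutes an environment variable's value at preprocessing time. A toy model of that transform over a hypothetical mini-AST (not the real Parsetree API), just to show the shape of such a rewriter:

```ocaml
(* Stand-in AST: Ext models an extension node like [%getenv "VAR"]. *)
type expr =
  | Const of string            (* string literal *)
  | Ext of string * string     (* extension node: name, payload *)
  | App of expr * expr         (* application *)

(* The rewriter replaces the extension node with a constant read from
   the environment when the rewriter runs, i.e. at compile time. *)
let rec rewrite = function
  | Ext ("getenv", var) -> Const (try Sys.getenv var with Not_found -> "")
  | App (f, x) -> App (rewrite f, rewrite x)
  | e -> e

(* An unset variable rewrites to the empty string: *)
let () = assert (rewrite (Ext ("getenv", "SURELY_UNSET_VAR_123")) = Const "")
```

The commit-hash variant would shell out to `git rev-parse HEAD` instead of reading the environment.]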
<whitequark>
by the way, any idea on naming conventions for ppx extensions? ppx_thing perhaps? similar to pa_thing.
<Drup>
I think ppx_thingy is good
<companion_cube>
my_little_ppx_foobar
<whitequark>
my_little_ppx_friendship_is_magic
<whitequark>
(alluding to the way ppx extensions easily interoperate)
<companion_cube>
whitequark: this is good. You should post it on reddit :)
<Drup>
whitequark: you forgot to mention the important bit
<malvarez>
I had completely overlooked the pos_bol field. of course bol stands for beginning of line...
<Drup>
self explanatory, isn't it ? :D
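[Concretely, pos_bol is the character offset of the beginning of the current line, so a position's column falls out by subtraction; a minimal check:

```ocaml
(* Lexing.position fields: pos_cnum is the absolute character offset,
   pos_bol the offset of the beginning of the current line, so the
   0-based column is pos_cnum - pos_bol. *)
let column (p : Lexing.position) = p.Lexing.pos_cnum - p.Lexing.pos_bol

let () =
  let p = { Lexing.pos_fname = "test.ml";
            pos_lnum = 2; pos_bol = 10; pos_cnum = 14 } in
  assert (column p = 4)   (* fifth character on line 2 *)
```
]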
<Drup>
(tbh, I don't think I actually wrote this code, I probably stole it from someone)
<Drup>
(if I did write it, I didn't remember)
<Drup>
I don't*
* whitequark
headdesks
<whitequark>
the reason camlp4 build failed was that I had a directory called camlp4 in cdpath, and camlp4's build script does 'cd camlp4'
<whitequark>
it still explodes for some entirely unrelated reason though
<whitequark>
can't find debug.ml, apparently
<whitequark>
how does one get a camlp4boot.native ?
<ygrek>
cdpath works in scripts??
<ygrek>
pure crazy
<whitequark>
ygrek: it also prints the path to stdout, exacerbating the failure
<whitequark>
hmm, apparently ocaml tree used to include camlp4/boot/camlp4boot.ml, but camlp4 tree does not
<whitequark>
oddly, transplanting it into camlp4 tree produces a circular build dependency
* whitequark
starts to wonder whether anyone has ever tried to build it at all
<whitequark>
jpdeplaix: were you ever able to build camlp4 successfully with your overlay?
<whitequark>
ohhh. I should have specified --prefix.
<whitequark>
... no, it actually ignores --prefix. I should have performed `make all' on a really clean tree. the cdpath thing screwed something up. nevermind all of the above.
<whitequark>
is it normal that compiling camlp4boot takes over 7.6G of RAM and more than 55 minutes already?
<whitequark>
poking it with gdb reveals that it has unbounded recursion with a cycle somewhere inside:
<whitequark>
#12696 0x00000000004f1150 in camlBtype__iter_type_expr_1466 ()
<whitequark>
#12697 0x00000000004f189f in camlBtype__it_type_expr_1518 ()
<whitequark>
PR6371.
<tautologico>
is that a compiler bug?
<whitequark>
seems so
<whitequark>
currently bisecting it
<whitequark>
so... fb74ef5e51a is responsible
<whitequark>
fixed. garrigue is quick!
<adrien>
oh, HEAD~ is also fairly interesting (requirement for C99)
<adrien>
s/for/of/
<whitequark>
adrien: hm? which commit?
<whitequark>
oh, nevermind, I see.
<adrien>
the one before the fix
<jpdeplaix>
01:24:45 whitequark | I mean, it doesn't even start to build stuff. it just dies somewhere inside ocamlbuild // that's why I said that it's better with pr20 (but you can trick this by: install ocamlfind; install camlp4; reinstall ocamlfind)
<jpdeplaix>
Yes, it's a little bit boring :D
<whitequark>
jpdeplaix: nah, my problem was unrelated to pr20
<whitequark>
it was $CDPATH and dirty tree
<whitequark>
I mean, I'd eventually have bumped into lack of camlp4 META, but not yet
<jpdeplaix>
what's your error message ?
<jpdeplaix>
mmmh I just saw your mantis ticket
<jpdeplaix>
well, ok. I didn't try to compile camlp4 with trunk recently
<whitequark>
jpdeplaix: that's actually a third, unrelated problem :D
<jpdeplaix>
:DD
<whitequark>
and now I have a fourth: camlp4 isn't quite updated enough
<Drup>
whitequark: remind me, why do you want to compile camlp4 against trunk ?
<Drup>
I mean, you don't have to inflict that on yourself
<whitequark>
Drup: ppx works only in trunk. everything else uses camlp4
<whitequark>
e.g. I would depend on lwt and oasis and probably other things
<whitequark>
I want to write my fancy protobuf library over ppx already.
<Drup>
oh, right, you want to compile stuff that uses camlp4
<Drup>
fair enough
<whitequark>
I can't even use utop without that
<gasche>
I think Jérémie plans to upgrade camlp4 to be correct wrt. trunk only after the feature freeze for 4.02
<gasche>
of course, that makes sense for new syntactic constructs to support
<gasche>
but one should still check that camlp4, without support for new constructs, at least compiles and works as expected, because that allows one to spot regressions in the compiler
<gasche>
(as your typing issue)
<gasche>
only it should be OCaml and/or Camlp4's maintainers doing the checking work, not an almost-innocent end-user
<whitequark>
gasche: I've just talked with Anil, he plans to do it sooner
<whitequark>
or maybe me, if I become free earlier
* whitequark
shrugs
<mrvn>
whitequark: CDPATH is evil
<whitequark>
gasche: I'm not used to technology telling me what I can't do. so, often I have to fix it myself. :)
<whitequark>
gasche: actually, camlp4 is mostly upgraded wrt/ trunk. it only misses annotations on one or two nodes, I believe.
<whitequark>
it's not a lot of work at this point
<gasche>
feel free to do the work
<gasche>
but it does have the good property that if you don't, someone else will do it
<whitequark>
yeah.
<gasche>
(which cannot be said of other things in the OCaml ecosystem)
<gasche>
(eg. reviewing Benoît's format+gadt work, which is my focus right now)
<gasche>
amusing format bug: (Printf.printf "%.+f" 3.5)
<whitequark>
weird
<whitequark>
gasche: (other things) yeah, there's a lot of very interesting in-flight patches. ppx and gadt-format included.
<whitequark>
also, record constructors are quite awesome.
<companion_cube>
can't wait
<companion_cube>
:)
<gasche>
I don't think record constructors will be merged in 4.02
<gasche>
(but that's only a personal guess)
<whitequark>
I'm actually quite happy with the evolution of ocaml, compared to what I've seen in other languages
<whitequark>
initially I thought it would be far too conservative, but now I see that it is not the case at all
<whitequark>
as a side note, I really should write a proper LLVM backend sometime, with all the talk about ocamlopt not inlining things where it should
<whitequark>
cmm is less than 100 lines. should be trivial to translate.
<companion_cube>
I think an LLVM backend has been discussed many times
<whitequark>
I think it's been tried at least twice
<companion_cube>
problems being the GC or the calling convention
<ggole>
Does LLVM support precise GC now?
<whitequark>
ggole: there's been some promising work in that direction
<ggole>
(There was somebody beginning work on that.)
<ggole>
Right.
<whitequark>
companion_cube: well... calling convention is simple. GC is more problematic, yes
<whitequark>
how does ocaml handle roots in registers?
<ousado>
does it have any GC?
<mrvn>
whitequark: it doesn't.
<ggole>
Everything is done with frametables afaik.
<NoNNaN>
whitequark: do you think it is possible to create something like this for ocaml? https://scala-lms.github.io/
<whitequark>
ousado: LLVM requires you to spill all roots on stack. from that, you can generate stackmaps with custom C++ code
<whitequark>
mrvn: ggole: then it sounds like LLVM and OCaml are a perfect match. I'd need to think further about it. I think I read an elaborate description of the problems somewhere.
<whitequark>
NoNNaN: BER-MetaOCaml ?
<mrvn>
When ocaml does a function call does it even keep any values in registers?
<mrvn>
ocamlopt that is
<whitequark>
mrvn: I think it spills everything, but mostly to make setjmp/longjmp very fast
<whitequark>
well, setjmp to be specific. with cdecl, it has to spill stuff. with ocaml's calling convention, it's a mov.
<companion_cube>
hmm, isn't one supposed to put everything on the stack in llvm?
<companion_cube>
leaving llvm itself decide of what goes in registers?
<companion_cube>
(and handle GC roots, hopefully)
<NoNNaN>
whitequark: I have checked it; it does not seem to be possible without costly abstractions
<whitequark>
companion_cube: it's the other way around. LLVM is a register machine. but there's a catch.
<whitequark>
companion_cube: the LLVM gc intrinsics don't accept a *register*. they accept an *address*. so you have to explicitly alloca a stack slot.
<companion_cube>
oh
<NoNNaN>
whitequark: you can codegen your own gc from llvm, something like hlvm does
<whitequark>
NoNNaN: it's pointless. I mean, it's not an improvement over the scheme that LLVM already has.
<whitequark>
HLVM doesn't support roots in registers either, and frankly I don't quite see the point of the project as it exists
<whitequark>
I mean, it's not really high-level at this point. *shrug*
<mrvn>
I have to implement my own Thread module. So I'm wondering if I need to dump registers into memory and register them as root in some way. Is everything already spilled and registered when the GC calls one of the hooks?
<whitequark>
mrvn: since GC can only be called from an allocation routine, and with OCaml's calling convention ocamlopt would spill everything, yes
<ousado>
'since GC can only be called from an allocation routine' -why?
<mrvn>
ousado: noalloc functions aren't allowed to call the GC.
<ousado>
in llvm?
<whitequark>
ousado: um... I think that's just how it works? I mean, you can only call GC from safepoints
<whitequark>
how it works in OCaml
<ousado>
ok
<mrvn>
ousado: the noalloc keyword says that the function doesn't call the GC so registers don't need to be spilled. makes them faster.
<NoNNaN>
probably a dumb question, but is it possible to create a subset of ocaml that has linear typing (something like linearml)? then memory usage is known, gc is not required, and an llvm backend could target architectures where gc is not yet possible (ptx, r600, hsail, fpga, etc)
<mrvn>
ousado: best not to use it
<mrvn>
NoNNaN: how do you implement List.map then?
* whitequark
sighs
<Drup>
NoNNaN: afaik, mezzo is sort of going this way
<Drup>
except it's not really a subset of ocaml, more like slightly different :)
<whitequark>
what is up with this odd fetishization of FPGAs as compiler backends?!
<whitequark>
it's not a magical sekret sauce that makes your program fast. if it's written in von neumann style, and chances are that it is, then a von neumann CPU is the best thing for running it
<Drup>
mrvn: it's already there, it's called spoc
<whitequark>
really, if something compiles down to GPUs or FPGAs, it just means that you arbitrarily select a few language constructs and make them optimal, and everything else either doesn't compile, or is so horribly inefficient you wish it didn't
<whitequark>
I mean, even look at verilog (or vhdl), the native language for FPGAs. it's essentially an abstract logic description language from which the pattern matcher in the FPGA toolchain selects the parts it likes
<mrvn>
NoNNaN: How doesn't that require memory allocations?
<whitequark>
as a result, you spend your days twiddling your code until it has just the right syntactic form to generate just the right RTL for your target FPGA
<NoNNaN>
mrvn: in linear typing you use your variables exactly once, so the memory allocation is known at compile time
<whitequark>
there's a few great developments in that area (e.g. migen), but they step even further from traditional languages. </rant>
<mrvn>
NoNNaN: but the amount of memory depends on the length of the list.
<whitequark>
mrvn: I would think that the issue here is not allocation, but destruction
<whitequark>
since you use every cell exactly once, you allocate it at creation and deallocate it at usage.
<ggole>
There are region systems in which you can get rid of GC
<ggole>
But they can use huge amounts of memory
<whitequark>
ggole: rust's region system doesn't, but I believe it substantially differs from ml-with-regions (whatever it was called)
<mrvn>
whitequark: Ok. And when you need it twice you have to call some special function that gives you 2 copies and destroys the input?
<ggole>
MLKit tried to make it work
<whitequark>
mrvn: I guess so
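[The "special function" mrvn is describing is usually called dup, with a matching drop for discarding. OCaml cannot enforce the use-exactly-once rule, so this is only an illustration of the interface a linear system would expose:

```ocaml
(* In a linear discipline every value is consumed exactly once; using
   a value twice requires an explicit dup, and discarding one requires
   an explicit drop. Nothing here is checked by OCaml's type system. *)
let dup x = (x, x)   (* consumes x, yields two copies *)
let drop _ = ()      (* consumes its argument, yields nothing *)

let () =
  let a, b = dup 21 in   (* 21 consumed once, by dup *)
  drop b;                (* b consumed once, by drop *)
  assert (a = 21)        (* a consumed once, here *)
```
]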
<mrvn>
I think I was thinking of region systems
<whitequark>
ggole: I believe the problem with MLKit is the odd way they interpret regions--dynamic arenas associated with stack frames, where values are allocated. in Rust, a linear type is just a wrapper for malloc() and free()
<mrvn>
I've been trying to design a system that would not need dynamic allocation.
<ggole>
The benefit is that you can free the region in constant time
<whitequark>
well, or you can allocate it on stack, but you have to know statically how much stack you would need.
<whitequark>
ggole: the drawback is that you cannot transfer ownership.
<mrvn>
whitequark: and that's where I got stuck
<ggole>
If you have allocation patterns that are stack-shaped, or close, that can work
<ggole>
But when you have uncertain escapey lifetimes it seems problematic
<whitequark>
ggole: and if Rust shows something, it's that passing ownership is an extremely powerful mechanism that makes abstractions work in a region-based lang
<mrvn>
You end up with the halting problem. Deciding how much stack to reserve for arbitrary code is equivalent to the halting problem.
<ggole>
I dunno how that works in a functional lang that depends on persistent structures with sharing, though
<ggole>
You don't have to put regions on the stack
<whitequark>
ggole: Rust has strong/weak reference counting for that.
<mrvn>
whitequark: ref counts breaks with mutables
<ggole>
You can put each one in its own, potentially large, memory area
<whitequark>
mrvn: hm?
<whitequark>
ggole: I know, I know. but still, expandable regions which are the only place to allocate are just asking for bloat trouble
<mrvn>
whitequark: mutables allow cyclic structures and then the refcount never reaches 0
<whitequark>
mrvn: hence strong/weak. that scheme disallows cycles.
<ggole>
It might also be possible to back region storage with GC, so that once a fixed-size region is filled, the rest is allocated normally
<whitequark>
mrvn: essentially, you have a tree of strong pointers, and some backedges via weak ones.
<ggole>
Then you can free the stuff in the fixed-size part in constant time, and have to rely on a tracing GC for the rest as usual
<mrvn>
ggole: or allocate things whose size the compiler can prove in a region, and unknown stuff in the heap.
<Hannibal_Smith>
ggole, this is something not needed in a generational GC, or am I wrong?
<ggole>
Well, you can always fit everything in a region
<whitequark>
ggole: that's a really odd way to solve it. maybe the right thing is not to try to couple lifetimes and allocation arenas?
<ggole>
Since your program has entry and exit points
<ggole>
The problem is bounding the region size
<ggole>
whitequark: yeah, I don't think it works except in certain situations
<ggole>
And I'm not sure that the compiler can tell when those occur reliably
<ggole>
Hannibal_Smith: we're discussing alternatives to GC
<Hannibal_Smith>
NoNNaN, generally when some high level language is faster than C, it's because the GC didn't start compacting
<NoNNaN>
for some problems I would like to use a subset of the language (not solving every problem here) that can run as fast as possible, without gc, so it can run on a gpu or other architectures
<Hannibal_Smith>
ggole, ok sorry
<whitequark>
NoNNaN: the problem is that "it can run on gpu, or other architectures" is so poorly defined, it doesn't define any specific subset of a language at all.
<whitequark>
and often the requirements are so complex it is simply not viable to express them formally in a language specification.
<ggole>
I can imagine having an annotation for "make this thing not escape upwards"
<NoNNaN>
whitequark: I already mentioned, a custom dsl, where I can control the abstraction cost, eg.: https://scala-lms.github.io/
<whitequark>
e.g. take a look at the mess in C and fused multiply-add
<Drup>
NoNNaN: did you had a look at spoc ?
<whitequark>
NoNNaN: that looks really similar to metaocaml, if I understand it correctly
<NoNNaN>
Drup: yes, I did; unfortunately it's not the same
<whitequark>
why do you say it is not?
<whitequark>
oh, well, no heterogenous targets. I would think this is not a fundamental limitation of metaocaml, though.
<NoNNaN>
because I would like to control the cost of abstractions
<ggole>
It'd be nice to have nice packed representations for data types, too
<ggole>
Stuffing an 'a option into a word if the 'a fits, etc
<ggole>
And there are some interesting list compaction tricks
<mrvn>
ggole: that doesn't work with polymorphism
<ggole>
Indeed, you need to specialise for that.
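[The packing ggole describes is easy to observe from the other side: today Some x is always a heap block, even for an immediate int. Obj exposes the current representation (an implementation detail, not a stable API):

```ocaml
(* Obj.is_block tells immediates apart from heap-allocated blocks.
   An int is immediate; wrapping it in Some allocates a block, which
   is exactly the indirection a packed representation would remove. *)
let () =
  assert (not (Obj.is_block (Obj.repr 1)));    (* int: immediate *)
  assert (Obj.is_block (Obj.repr (Some 1)))    (* Some int: boxed *)
```
]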
<ggole>
It would require a very different implementation.
<mrvn>
or polymonomorphism
<NoNNaN>
there are some work in this area: "abstraction without regret"
<ggole>
MLton goes some distance down that road
<mrvn>
It would be nice to have a type foo = packed { ... }
<ggole>
They fully monomorphise and defunctionalise too
<ggole>
But, whole program.
<NoNNaN>
database systems also have extreme specializations to improve instructions per clock cycle, e.g. monetdb will generate specialized code for every primitive operation for every type
<whitequark>
iptables, too
<whitequark>
and I think tcpdump?
<Hannibal_Smith>
(this is very similar to what C++ do with templates?)
<ggole>
Templates are similar in that source is made available in headers
<mrvn>
Hannibal_Smith: which causes totally useless code explosion.
<mrvn>
You don't want to specialize every type.
<NoNNaN>
if you combine extreme specialization with small batching (where the data size is smaller than your cpu cache), your performance will skyrocket
<whitequark>
don't forget instruction cache
<whitequark>
it's a major problem with C++
<ggole>
And compilation time
<ggole>
It does seem as though a careful design could do better
<mrvn>
and if you specialize Pervasives.compare for 1000 types then it will be orders of magnitude slower than the polymorphic one.
<NoNNaN>
mrvn: this is why I would like to control the cost of the abstraction: when I perform operations on lots of data I want specialization; where I have symbolic code I don't
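[The trade-off can be seen in miniature with compare: the polymorphic version inspects runtime representations, while annotating the type lets the compiler emit a direct integer comparison. Same answer either way; the cost is code size once you multiply by every specialized type:

```ocaml
(* A type annotation is enough for the compiler to specialize the
   comparison primitive to a plain machine comparison on ints. *)
let compare_int (a : int) (b : int) = compare a b

let () =
  assert (compare_int 1 2 = compare 1 2);  (* same result as polymorphic *)
  assert (compare_int 1 2 < 0)
```
]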
<ggole>
You might be able to reduce code explosion by mapping equivalent types together
<Hannibal_Smith>
ggole, a sort of "type fusion"?
<mrvn>
NoNNaN: I want to specify the set of types the compiler specializes for. Both in the module implementing and the module using a function.
<whitequark>
aka what LLVM attempts with mergefunc and its structural typing
<ggole>
eg, all types that have two words and two pointers can be fused for the purposes of specialisation
<mrvn>
ggole: or when the code does the same thing for any 'a
<whitequark>
(but mergefunc has a bit of flawed implementation right now. well, it tends to assert, to be specific.)
<mrvn>
ggole: E.g. List.length does not have to be specialized for every type.
<ggole>
You could reorder fields, too
<ggole>
mrvn: you'd have to specialise it for each size of entry
<ggole>
Which would probably be quite affordable
<NoNNaN>
mrvn: it is possible to extend it, so I can control the "abstraction", e.g. how the types will be mapped to primitive types that the cpu understands
<Hannibal_Smith>
ggole, is this what MLTon can do?
<mrvn>
ggole: no. it only cares about the next pointer
<ggole>
mrvn: the offset of the next pointer depends on the size of the entry
<ggole>
...unless the pointer is first, I suppose
<mrvn>
ggole: put it first. cons of 'a list * 'a
<ggole>
Hannibal_Smith: MLton can do something like this, yeah
<mrvn>
ggole: a cons is a pair in ocaml anyway. so the offset is always the same.
<ggole>
If you have a (int * float) list, the MLton cons will look like [header] [int] [float] [pointer to next]
<ggole>
Instead of having pointers everywhere
<ggole>
mrvn: that's because ocaml doesn't specialise
<mrvn>
ggole: you sure the next isn't first?
<ggole>
It could be
<whitequark>
there's also invasive containers. they have their place as well
<mrvn>
ggole: if next isn't first then you can't have any polymorphism at all.
<ggole>
Yes you can?
<ggole>
You just need to specialise
<mrvn>
ggole: no. then every call must be specialized, at least by the offset for next
<mrvn>
ggole: That's what I said
<NoNNaN>
whitequark: I would like to have (extreme) specialization when I do operations on lots of data (gigabytes of it), so instruction cache is not a problem
<ggole>
And you only need to specialise by the size of the list entry, like I said before.
<mrvn>
But keeping a polymorphic flavour is critical.
<Hannibal_Smith>
One moment, even polymorphic is bad for icache no?
<ggole>
No, polymorphic code is considerably smaller
<ggole>
There's one definition for everything.
<mrvn>
Hannibal_Smith: with polymorphic code you have one function that works for any type. Not millions of duplicates.
<ggole>
And there's no explosion of ancillary information (object maps), because everything looks the same
<whitequark>
NoNNaN: have you read ulrich drepper's manuscript on memory hierarchy?
<ggole>
That's the advantage: the disadvantage is that making everything look the same involves introducing lots of pointers
<mrvn>
I like that ocaml doesn't specialize the memory representation (except for a few special cases).
<ggole>
Which on modern hardware is at least half crazy
<mrvn>
ggole: not that much. The amount of sharing of data it allows balances it.
<NoNNaN>
whitequark: yes, I have, and I have also read tons of other publications on in-memory databases too
<ggole>
Sharing a float by pointing at it is not an advantage.
<ggole>
And the same is probably true for most pairs, maybe triples
<whitequark>
NoNNaN: ok, you know more about this topic than me, then :)
<mrvn>
ggole: if it is shared once you break even on 32bit.
<mrvn>
ggole: if it shared a million times ...
<ggole>
No, you lose by having to follow the pointer.
<mrvn>
ggole: and you win when the float is then cached instead of having to find it in memory.
<ggole>
You would have to avoid copying recursive types though
<ggole>
The float would be right there, where the pointer would otherwise be
<mrvn>
I bet there are as many cases where sharing is faster than there are where sharing is slower.
<ggole>
If that storage is not cached then you die just as badly because the pointer will have to be fetched
<mrvn>
ggole: in 32bit the pointer is smaller than a float.
<mrvn>
ggole: and a single float is an extreme case anyway.
<Hannibal_Smith>
Uhm... specialization is only one part of the problem; for example SIMD requires specific alignment too
<ggole>
Even two or three, maybe more elements would be beneficial
<ggole>
Cache lines are fairly large.
<mrvn>
ggole: don't forget the overhead for the GC and the code duplications to deal with the different specializations.
<ggole>
The overhead is *less*, since you don't have to inspect as many pointers
<mrvn>
You are buying the benefit in the memory representation at the cost of the instruction cache.
<ggole>
And you don't have to trace arrays of pointerless elements.
<NoNNaN>
you could do operations on unboxed values a lot faster than boxed
<flux>
mrvn, how many times do you really have such a big function that paying some for the cache costs you..
<ousado>
mrvn: finding the right balance there is what this discussion is about, no?
<flux>
mrvn, given you can now fit more data into the same cache
<Hannibal_Smith>
(even Haskell let you says that embed in a type with packed)
<NoNNaN>
your operations could be directly mapped to cpu operations, no type verification, no pointer magic, just direct operation on data
<mrvn>
flux: It's not the size of the function. It's the number of duplications.
<ggole>
It would also be nice to get rid of int tags, yeah
<mrvn>
NoNNaN: you want to unbox at the start of a function and rebox at the end and keep everything in registers inbetween. Then the boxing hardly matters.
<NoNNaN>
mrvn: if you want an extreme example of polymorphic code, take a look at the K language (www.kx.com); the whole binary is about ~50k (the whole code is ~200 lines), so the whole binary could fit in the cpu instruction cache
<mrvn>
NoNNaN: isn't that an argument for my case?
<Hannibal_Smith>
mrvn, registers are few...no?
<ggole>
Of course you have to rebox on every function call or allocation point.
<mrvn>
ggole: not necessarily.
<ggole>
(Except for floats? I guess those can sit in xmm regs just fine.)
<NoNNaN>
mrvn: and take a look at ocaml current binary sizes
<ggole>
Or non-integral regs on whatever arch
<ggole>
mrvn: hmm... I guess an unboxed int could be marked as "don't touch" in the frame table
<mrvn>
ggole: elf supports annotating which registers hold pointers for every instruction in a binary. the GC could use that so you wouldn't have to box or tag registers for function calls.
<NoNNaN>
mrvn: if you make it fit in the cpu instruction cache, then fine, but currently it's far from that; however, the gcless linearml-generated code is small
<mrvn>
NoNNaN: because it is polymorphic.
<mrvn>
ggole: a while back there was a discussion of changing the GC header format to include info on which fields contain pointers and such. That would be useful for the stack frame but works for any record.
<NoNNaN>
mrvn: no, it's because of the controlled abstractions: the operations can be directly mapped to cpu operations
<ousado>
mrvn: that does only work up to certain sizes, though, right?
<mrvn>
NoNNaN: if you specialize every type you end up with 1000 copies of the function. Even if you get the function to half the size that is still 500 times more than one polymorphic flavour.
<ousado>
*works
<ggole>
mrvn: that could be handy, yeah
<ousado>
one doesn't have to specialize every type
<ggole>
You can make it work with arbitrary sizes by being clever
<mrvn>
ousado: depends. you could make the header arbitrary large.
<ousado>
well, ok
<ggole>
Ie, have a bit pattern that means "next word includes more info"
<ousado>
I'm thinking about that for haxe
<mrvn>
ousado: like the highest bit says there is another header word before this.
<ggole>
And you can also compress the header by bringing pointer fields together
<mrvn>
ousado: How many structures do you have that have more than 8, 16, 32, 64 items?
<ousado>
not many, probably, but it's possible
<ggole>
So instead of a mask, you just need "there are this many pointers/non-pointers at the beginning of this record"
<mrvn>
ggole: reordering would make row types impossible or costly.
<ggole>
You could use another word for those
<ggole>
But yes, there are tradeoffs everywhere in runtime system design
<ousado>
I also thought about that
<ousado>
yes
<ousado>
reordering makes lots of things simpler
<ousado>
also structural subtyping
<mrvn>
I don't think I ever had a record with more than 16 fields that wasn't an array.
<ousado>
but if the language allows it, what can I do?
<ggole>
Sure, but the compiler still has to compile such code
<ggole>
And sometimes people make huge stupid records with hundreds of fields in other langs, they might do the same in OCaml
<mrvn>
allow 16 fields in the first header word with one bit saying there are more header words. Most records won't need more.
<ggole>
There are also techniques for entirely tagless GC
<ggole>
With no header at all
<mrvn>
ggole: impossible
<mrvn>
you have to have the info somewhere
<ggole>
Not at all, go read Appel's paper
<ousado>
is there a copy of that freely available?
<mrvn>
A lot of the time you can use static infos. E.g. each function has the description of its stackframe statically.
<mrvn>
But you need the info somewhere.
<ggole>
In a strongly typed language, the shape is implied by the path taken through the heap to reach it
<ggole>
You need to have type info for the root set.
<ggole>
Ie, stack maps and a map for global values.
<mrvn>
ggole: that's static tags.
<ggole>
So? It isn't a header word.
<mrvn>
ggole: it isn't tagless
<ggole>
There are no tags on values in the heap. That's what it means.
<ggole>
I suspect that it makes GC more expensive, which is why it hasn't been adopted
<mrvn>
I think you can't always statically predict what is going to be a pointer and what not.
<ggole>
It might also make the write barrier more expensive
<ggole>
(Since you need to know the type of the edges recorded in the remembered set, and you won't be able to recover them by walking the entire heap during a minor gc.)
<mrvn>
e.g. type 'a foo = Foo of int | Bar of 'a. The GC would have to know what is a foo and what the Foo and Bar tags mean for the type and what type
<mrvn>
'a is there.
<ggole>
Read the paper. Appel covers all of that.
<mrvn>
The beauty of OCaml is that the memory representation is very simple.
<ggole>
Search for "runtime tags aren't necessary"
<ousado>
thanks
<ggole>
There's also a few followup papers by somebody else if you find that interesting
<ousado>
I don't think we'll try to go for a tagless GC, but it's always good to look at things from different perspectives
<ggole>
Also Tag-Free Garbage Collection for Strongly Typed Programming Languages - Goldberg
<ggole>
Yeah, I'm not sure that it is a good approach
<ggole>
Breaks Obj.magic
<flux>
sounds like a good approach then ;)
<ggole>
It's not clear to me how to type the queue elements in a copying collector
<ggole>
Or how to deal with polymorphic functions which have been tail-called (ie, their caller is no longer available for inspection)
<ggole>
You could have a parallel queue of typeinfo and then discard it at the end of a minor GC, I guess
<ggole>
But the papers don't go into it.
<mrvn>
ggole: ouch. that Appel paper requires the GC to (worst case) do a full backtrace through all stack frames to figure out the type.
<ousado>
ggole: did they implement it?
<mrvn>
ggole: I imagine you can only tail call when the return type allows it.
<mrvn>
ggole: which I think would be always.
<ggole>
ousado: not sure
<ggole>
mrvn: right
<ggole>
mrvn: note that specialisation would solve that problem ;)
<mrvn>
ggole: I don't think there is a problem there in the first place.
<ggole>
I think headers are a much simpler and probably superior approach.
<mrvn>
ggole: or did you mean the backtracing?
<ggole>
But it is a seductive and interesting idea.
<ggole>
mrvn: the backtracing
<mrvn>
ggole: OCaml's simple memory representation is certainly much simpler.
<mrvn>
ggole: The idea is indeed nice. Would be great for debuggers too. The pretty printers could print every value perfectly.
<ggole>
Yeah. The toplevel could certainly use such a feature.
<mrvn>
ggole: In chapter 7 (generational GC) they say to store the type when you copy a record. So only the minor heap is tagless.
<ggole>
In fact if you created a lookaside type graph that echoes the heap structure, you might be able to add such a thing without changing representation.
<ggole>
mrvn: my understanding is that you need the type if you are going to mark+sweep that region
nikki93 has joined #ocaml
<ggole>
But you don't if you are going to trace or copy.
<ggole>
(It's been a little while since I looked at the paper though.)
<ggole>
mrvn: so in a generational gc with a nursery + from and to space, like some JVMs, both spaces could be tagless
lordkryss has quit [Disconnected by services]
<mrvn>
ggole: you construct the type as you mark&sweep. But if you have a ref/mutable then modifying a value outside the minor heap makes that a root for the minor heap and you need the type for every root. The paper says to store the type on copying so it is cached when modification happens.
<mrvn>
ggole: you could limit that to ref/mutable. They are quite rare.
<ggole>
Hmm.
<ggole>
I should read it again.
<ousado>
the new lua GC has an approach to a generational GC without copying
<mrvn>
ggole: I don't see a way around that. Given a record you can't work back to the root to find its type and you don't want to scan the major heap every time to find the type.
<ggole>
mrvn: usually no, but there are some restrictions if you want generational gc
rgrinberg has joined #ocaml
<ggole>
mrvn: you do need to be able to relocate the object, so there needs to be some way to indicate whether something has been forwarded
<mrvn>
ggole: and you do.
<ggole>
You can just use the first pointer in the object, if it has one
<mrvn>
ggole: temporary memory during compaction.
<ggole>
But if it doesn't, then I think you need a header
<ggole>
Or a table, dunno
<ggole>
Right
<mrvn>
ggole: with stop-the-world GC you simply move them all at once.
<ggole>
That might suck for locality though - not sure
tobiasBora has joined #ocaml
maattdd has quit [Ping timeout: 245 seconds]
<ggole>
How do you do that without forwarding?
_andre has joined #ocaml
<mrvn>
ggole: you need a bitmap, pointer array or hashtable to record what has been copied already.
<ggole>
Don't you also need to know *where* they have been copied?
<ggole>
(A hashtable does allow that.)
maattdd has joined #ocaml
<mrvn>
ggole: with a bitmap you store the address as first word of the old record.
<mrvn>
ggole: with array or hashtable you store the new address in the table.
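mrvn's bitmap variant can be sketched in C: a side bitmap records which objects have moved, and the first word of an already-copied object is reused to hold its new location. All names here are illustrative; this is a stop-the-world toy, not OCaml's actual collector.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define HEAP_WORDS 64

static uint64_t from_space[HEAP_WORDS], to_space[HEAP_WORDS];
static uint8_t  copied[HEAP_WORDS];   /* side bitmap: 1 = already moved */
static size_t   to_top = 0;

/* Copy the object of `len` words at from-space index `idx` to to-space,
   returning its new index.  If the bitmap says it already moved, its old
   first word holds the forwarding address instead of real data. */
static size_t forward(size_t idx, size_t len) {
    if (copied[idx])
        return (size_t)from_space[idx];      /* first word = new location */
    size_t dst = to_top;
    memcpy(&to_space[dst], &from_space[idx], len * sizeof(uint64_t));
    to_top += len;
    copied[idx] = 1;
    from_space[idx] = dst;                   /* install forwarding address */
    return dst;
}
```

Calling `forward` twice on the same object returns the same destination, which is exactly the property the bitmap buys without a per-object header bit.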
<ggole>
Ah, I guess that would work
<ggole>
I'm still thinking in terms of the classic in-place algo
<mrvn>
ggole: yeah. you would lose that.
<mrvn>
and you need memory to build the type infos.
<mrvn>
Which is kind of a bad thing. You are out of memory or the GC wouldn't be running. Not a good thing to allocate more.
rgrinberg has quit [Ping timeout: 276 seconds]
<mrvn>
Does OCaml's GC allocate memory or only use the stack?
<ggole>
Mmm... and if you reserve space, you are potentially causing your application to OOM
<ggole>
OCaml uses the classic in-place Baker algo afaik
<mrvn>
ggole: if you reserve the space when allocating the heap then you didn't save memory from being tagless.
<ggole>
With a few bits reserved in each header word
<ggole>
Relocation markers aren't really tags, though
<ggole>
But point taken
avsm has joined #ocaml
<mrvn>
ggole: You know how ocaml uses a 0 tag for pointers and 1 for integers? I always wonder if anyone had tried it the other way around and compared what's faster.
thomasga has joined #ocaml
<ggole>
Some lisp impls have zero tags for fixnums
<ggole>
Although they tend to use more tag bits.
<mrvn>
A 0 tag on ints makes arithmetic simpler. e.g. a+b just works without touching the tags. And pointer access can be done with an offset on most cpus.
<mrvn>
should give less extra instructions.
thomasga has quit [Client Quit]
<ggole>
I think you have to be careful about overflow, but yeah
<mrvn>
The drawback being that you can't use scaled pointer access, e.g. R0[R1*8].
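The two tagging schemes being compared can be written down concretely. In OCaml's scheme an integer n is represented as 2n+1, so addition needs a correcting -1; in the flipped scheme mrvn wonders about, n is represented as 2n and addition needs no correction at all. A small C sketch (illustrative, using `intptr_t` as the machine word):

```c
#include <assert.h>
#include <stdint.h>

/* OCaml-style: integers carry a low tag bit of 1 (n is stored as 2n+1).
   (2a+1) + (2b+1) = 2(a+b)+2, so one subtraction restores the tag. */
static intptr_t ocaml_tag(intptr_t n)             { return (n << 1) | 1; }
static intptr_t ocaml_add(intptr_t a, intptr_t b) { return a + b - 1; }

/* Flipped scheme: integers tagged 0 (n stored as 2n), pointers carry
   the 1 bit.  Tagged addition is just machine addition. */
static intptr_t flip_tag(intptr_t n)              { return n << 1; }
static intptr_t flip_add(intptr_t a, intptr_t b)  { return a + b; }
```

The flip side, as noted just above, is that pointer access then needs a -1 offset (cheap on most CPUs) and you give up scaled addressing like `R0[R1*8]`.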
<ggole>
Lisp impls also have clever ways to avoid tagging conses
<ggole>
eg they are placed in a special place in the heap so that pointers to them are recognisable
<ggole>
...which is still tagging in some sense
<ggole>
But they are only two words
<mrvn>
ggole: apropos of special places in the heap: I've been playing with the idea of having lots and lots of heaps. One per type. So all float arrays go to the float array heap, all int*int tuples to the int*int tuple heap and so on.
<ggole>
Yeah, this is known as BIBOP
<ggole>
BIg Bag Of Pages
<mrvn>
For polymorphic functions I would pass an allocator that would point to the right heap for allocations. But that's where it gets tricky.
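The BIBOP idea ggole names can be sketched in a few lines of C: carve the heap into fixed-size pages, keep each page homogeneous, and recover a value's type from its address via a side table, with no per-value header. Page size, table layout, and type ids here are all made up for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* BIBOP sketch: page index -> type id.  A real allocator would also
   track a free list per page class; omitted here. */
#define PAGE_SHIFT 12            /* 4 KiB pages */
#define NPAGES     16

enum type_id { T_FLOAT_ARRAY = 1, T_INT_PAIR = 2 };

static uint8_t page_type[NPAGES];

/* Recover the type of a heap value purely from its address. */
static unsigned type_of(uintptr_t addr, uintptr_t heap_base) {
    return page_type[(addr - heap_base) >> PAGE_SHIFT];
}
```

The tricky part mrvn mentions — threading the right per-type allocator through polymorphic functions — is exactly what this sketch leaves out.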
<ggole>
I think it's been tried a few times, although not for an ML family language
<ggole>
mrvn: nice chat, I have to go and get some dinner
<mrvn>
lunchtime.
avsm has quit [Quit: Leaving.]
<orbitz>
I wonder how bad it would be to add CSP to Ocaml, something like Goroutines. Message sends are implicit context switches. Could do multicore
<companion_cube>
you need some stack-capturing operator to do the blocking stuff, I think
<companion_cube>
if you don't use the preemptive threads
<companion_cube>
maybe that's doable with delimcc ? :)
<companion_cube>
so, maybe delimcc would actually work (although it would probably be slow)
<orbitz>
it would be nice if lwt or async offered a reasonable message passing framework; then the runtime could add across-core message passing, at which point you could go multi-core just by choosing where your Deferred runs
<companion_cube>
orbitz: the alternative is to use a monadic-ish approach, where the user writes continuations with >>=
<orbitz>
yeah that's a second reasonable option I think
<orbitz>
the problem is a lot of monad code, i think, depends on running in the same memory space
<orbitz>
Maybe you could add a new concept to deferreds for more heavyweight long running things
<companion_cube>
well you would have several processes, anyway, wouldn't you?
<orbitz>
What do you mean?
<Drup>
gasche: I have a gadt-variance question that needs your expertise
<orbitz>
I'm looking for something like Erlang or Go where you are agnostic to whether you have 1 process or multiple, since the message passing takes care of it for you
Hannibal_Smith has quit [Quit: Sto andando via]
<gasche>
(ocamlopt does pass roots in registers)
<gasche>
Drup: ?
<Drup>
gasche: I'm building an AST for Z3 expressions
<Drup>
beh, I will just paste you the code, and you will tell me how terrible it is to use GADTs for this :D
<companion_cube>
orbitz: in a static language like OCaml, you need good serialization support for multi-process things
<companion_cube>
does go really handle multiple processes transparently??
<companion_cube>
I know erlang does, but it's designed specifically to this end
<Drup>
an unsafe and terrible one, yes
<orbitz>
companion_cube: I'm talking about multiplecores, whatever the method of getting there is i'm agnostic
<Drup>
so I'm writing an AST for the formulas in order to be able to manipulate it
<orbitz>
companion_cube: running multiple interpreters inside the same process would be acceptable
ocp has joined #ocaml
<orbitz>
companion_cube: go handles utilizing multiple cores, a does erlang
<gasche>
Drup: you would make your life *much* simpler by separating zint and zreal and adding an explicit cast from int to real
<companion_cube>
ah, yes, but in a single process
<companion_cube>
otoh go has a quite bad GC so far
<orbitz>
yes
<Drup>
gasche: that's what I was thinking
<orbitz>
companion_cube: I just want to seamlessly utilize multiple cores
<Drup>
gasche: but It means I have to duplicate all operators
<Drup>
or add casts everywhere
<companion_cube>
orbitz: heh.
<orbitz>
there is already plenty of work going on to add multi core support to the runtime but it sounds like you're going to have to be aware of the fact that you're doing it
<companion_cube>
I'm pessimistic about this :/
<gasche>
I'd add casts instead of duplication, yeah
<Drup>
huum
<orbitz>
companion_cube: How so? Running multiple interpreters (1 per thread) with message passing between them sounds rather reasonable
<nicoo>
gasche: Also, even when only keeping the S constructor (in +_ t), it failed to typecheck :(
<nicoo>
Drup: ^
<gasche>
but maybe the polymorphic variant thing can work
<companion_cube>
orbitz: it does if you have good serialization
<companion_cube>
why not
<gasche>
it is not covariant, though
iorivur has joined #ocaml
<Drup>
gasche: yes, that's precisely the problem now
<Drup>
the function to_expr works nicely
<Drup>
but of_*_expr doesn't, because of variance issues
<orbitz>
companion_cube: why do you need good serialization?
<mrvn>
orbitz: multiple interpreter requires changing the code to message passing.
<orbitz>
mrvn: I know
<mrvn>
people are working on a multi-core GC so you don't have to.
<gasche>
Drup: I can't try the code myself
<Drup>
gasche: I know, you need the z3 binding :/
<gasche>
if you made it a functor against Z3's signature I could give it a look
<Drup>
urk
dapz has joined #ocaml
<gasche>
only the parts you use
<nicoo>
mrvn: Wasn't the proposal about having multiple independent threads, each with its own (stack, heap, GC)?
<orbitz>
mrvn: my understanding was the semantics of mutation were one of the bigger roadblocks to wanting threads with a shared heap
<mrvn>
nicoo: not sure. didn't read the proposal, was in the kitchen.
<mrvn>
orbitz: one core changing a value while another does a collection is a problem.
<companion_cube>
orbitz: sorry, I was still thinking separate processes
<companion_cube>
the simplest solution imho
sgnb has joined #ocaml
<mrvn>
orbitz: in a trivial implementation every modify would have to tell all cores about it.
<ousado>
or just the original thread
tnguyen_ has quit [Ping timeout: 245 seconds]
<mrvn>
Which brings us back to specialization. The compiler could generate a local-core and a multi-core flavour and call the local-core one for input that is not shared between cores.
<flux>
companion_cube, so I have this program that 1) retrieves jpeg 30 fps from a camera and 2) decodes them and 3) draws them to a bitmap with Cairo 4) shows them with lablgtk. suggestions how I would easily split this into multiple processes? decoding jpeg is the costly process.
<mrvn>
flux: decoding is done in C, right?
<flux>
mrvn, yes. so yes, that can be threadized. but I wasn't asking that ;)
<mrvn>
flux: well, easy would be to just have one thread per core doing decoding without the ocaml runtime lock.
<companion_cube>
flux: you'd need shared memory, I guess
<flux>
also I overlay a set of vector data over the bitmap I've drawn
zpe has joined #ocaml
<companion_cube>
netmulticore does something like this, in ocamlnet, I believe
<flux>
companion_cube, how do I put a Cairo canvas in shared memory? it really works?
<mrvn>
flux: use Bigarray to mmap shared memory between processes and use message passing to pass offsets into that array for the image data.
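The mechanism behind mrvn's Bigarray suggestion is a shared anonymous mapping; a minimal C sketch of the idea (shared pixel buffer, processes exchanging only an offset — here the "message channel" is just fork/waitpid ordering, which a real program would replace with a pipe or socket):

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stdint.h>
#include <sys/mman.h>
#include <sys/wait.h>
#include <unistd.h>

/* Parent and child share one mapping; the child "decodes" a frame into
   it and the parent reads the result back.  Returns the byte the parent
   observed at the agreed offset. */
static int share_frame(void) {
    size_t len = 1 << 16;
    uint8_t *buf = mmap(NULL, len, PROT_READ | PROT_WRITE,
                        MAP_SHARED | MAP_ANONYMOUS, -1, 0);
    assert(buf != MAP_FAILED);
    size_t frame_off = 4096;          /* the "message": offset of the frame */
    pid_t pid = fork();
    if (pid == 0) {                   /* decoder process */
        buf[frame_off] = 0xAB;        /* pretend-decoded pixel */
        _exit(0);
    }
    waitpid(pid, NULL, 0);            /* stand-in for a real message channel */
    int seen = buf[frame_off];
    munmap(buf, len);
    return seen;
}
```

In OCaml the same mapping would be exposed as a `Bigarray`, which is exactly why mrvn says bindings should move from strings/arrays to Bigarray (or other unmovable types) first.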
<hcarty>
flux: One somewhat simple approach - Use zeromq to send bits back and forth. Have the decoding run in a separate process. It receives the filename/encoded bytes and sends back the decoded bytes as the decoding completes.
<flux>
mrvn, yeah, I should use threads some day, and fix the libjpeg bindings to release the GC lock
<companion_cube>
flux: I don't know!
<flux>
the point I was making, though, was that splitting into processes in the presence of existing code isn't quite as easy as it is with real threads
<mrvn>
hcarty: sending the full image might be costly.
<flux>
and look at Apple's Grand Central Dispatch
<flux>
how great would that be in OCaml?
<hcarty>
mrvn: It could be. Not terribly expensive if both processes are running on the same machine.
<Drup>
a bit quick and dirty, but you should be able to work with it
<flux>
LWT could be churning with the power of 8 threads, even computational tasks
<mrvn>
flux: splitting is easy. splitting efficiently is harder.
zpe has quit [Read error: Connection reset by peer]
<nicoo>
flux: What is Apple's Grand Central Dispatch?
zpe has joined #ocaml
<hcarty>
mrvn: For some definition of "not terribly expensive"
<flux>
nicoo, basically it's a queue where you send lambda functions to be evaluated
<flux>
nicoo, the jobs have dependencies
eizo_ has joined #ocaml
<flux>
so they can be dispatched to run on many cores
<flux>
I haven't used it, only read of it
<mrvn>
flux: do you know if LWT uses threads internally?
<hcarty>
mrvn: It can.
<flux>
mrvn, I don't know, but I would think it would not use threads except for some curious case of working with existing functionality
<hcarty>
mrvn: Lwt_preemptive is the module IIRC
maattdd has quit [Ping timeout: 252 seconds]
tnguyen_ has joined #ocaml
<mrvn>
hcarty: but that then uses the ocamls Thread module, right?
<hcarty>
mrvn: Yes
<flux>
Lwt was just an example, it is certainly built with single-thread processing in mind
<mrvn>
hcarty: Ok. No wonder I couldn't find any Thread implementation in LWT.
contempt has joined #ocaml
<flux>
but a similar 'm:n' thread mapping system could be built with 'real' threading
<flux>
I suppose a library facilitating just as easy use of multiple processes could be built, but for example on my case I would wonder if a Cairo canvas is marshalable or not; I would gess it's not
<adrien_oww>
isn't m:n thought to be way too complex in practice?
<adrien_oww>
at the kernel level that is
<mrvn>
flux: The first thing one has to do is switch the bindings over from using strings / arrays to Bigarray.
<adrien_oww>
well, I'm going back to work instead of commenting on IRC without reading the backlog and without thinking o/
<mrvn>
flux: or other unmovable types.
<flux>
I would guess many folks who write C bindings for stuff don't write - or test - the marshalling functions
<companion_cube>
m:n is hard because of the GC
<hcarty>
flux: If you do a memory-backed canvas then you can marshal the underlying storage
<flux>
adrien_oww, well, GCD is sort of like m:n, where the number of 'user threads' is infinite :)
<flux>
hcarty, I think I can almost guarantee it's going to be more complicated than it is now..
<hcarty>
flux: s/canvas/surface/
<mrvn>
companion_cube: you could run m:n and every now and then stop-the-world and run the GC.
tane has joined #ocaml
<hcarty>
flux: I expect so. It's possible to do. Certainly more complex than a naive sequential approach.
<companion_cube>
mrvn: that's fine the major heap, I guess, but the minor heap has to be extremely fast
thomasga has joined #ocaml
<mrvn>
companion_cube: what I meant was to coordinate the GC across all cores. Do a minor collection on all cores at the same time.
marcux has joined #ocaml
<mrvn>
companion_cube: that avoids the problem of one core modifying data while another runs the GC.
<ousado>
but that synchronization is expensive itself
maattdd has joined #ocaml
<mrvn>
ousado: extremely, if one thread doesn't do allocations.
<ousado>
for certain workloads it might be no problem, though
<mrvn>
You would have to wait for all threads to reach a safe point before the GC can start.
<mrvn>
note: ocaml only does cooperative multitasking.
<adrien_oww>
I wouldn't call it that way
<mrvn>
you can't preempt an ocaml task. It has to reach a safe point first.
<ousado>
in the paper NoNNaN linked they do concurrent mark and sweep without any synchronization
<mrvn>
ousado: with one bit per core?
<ousado>
.. but the execution times of the tests are not competitive
<ousado>
they use color transitions that are specific to dedicated mark and sweep threads
<mrvn>
To me it feels like the compiler has to do some analysing and figure out what data is shared and what not and generate different code then. Keep temporary allocs in thread private heaps and side step the whole problem.
<companion_cube>
minor heap must be very very very fast
<companion_cube>
and different threads might allocate at different rates
<mrvn>
companion_cube: as long as they allocate fast the loss is negligible.
<companion_cube>
well, at each allocation a thread need check whether the minor heap is full
<companion_cube>
if this requires a synchronisation it's a deal breaker
<mrvn>
companion_cube: one minor heap per core.
<companion_cube>
indeed.
<companion_cube>
but then, what about sharing somethihg that's still in the minor heap?
<companion_cube>
you'd get a reference from a foreign core, to the local minor heap
<mrvn>
companion_cube: then when it runs full you stop all threads and do a multi-core GC run across all minor heaps.
<mrvn>
companion_cube: each core has a root set too.
zpe has joined #ocaml
<mrvn>
companion_cube: It's just a simple idea and the obvious problem is having to stop all threads. Unless they do a lot of allocations (which they usually do) stopping can take forever.
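The per-core minor heap idea above can be sketched in C: each core bump-allocates from its own nursery with no synchronisation on the hot path, and a NULL return is the signal to stop the world for a joint minor collection (the collection itself is not shown). Sizes and names are illustrative.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define NCORES        4
#define NURSERY_WORDS 1024

struct nursery { uint64_t words[NURSERY_WORDS]; size_t top; };
static struct nursery nurseries[NCORES];   /* one minor heap per core */

/* Bump-allocate `n` words from this core's nursery.  Returns NULL when
   the nursery is full, i.e. "all cores must rendezvous for a minor GC". */
static uint64_t *nursery_alloc(int core, size_t n) {
    struct nursery *h = &nurseries[core];
    if (h->top + n > NURSERY_WORDS)
        return NULL;
    uint64_t *p = &h->words[h->top];
    h->top += n;
    return p;
}
```

The synchronisation cost the channel worries about lives entirely in the NULL branch: the allocation fast path touches only core-local state.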
maufred has joined #ocaml
zpe_ has joined #ocaml
<NoNNaN>
mrvn: take a look at the openjdk new Shenandoah GC, it's a regional collector
<mrvn>
NoNNaN: meaning the compiler has to analyze the code to statically define regions?
zpe has quit [Ping timeout: 240 seconds]
lostcuaz has joined #ocaml
lostcuaz has quit [Client Quit]
lostcuaz has joined #ocaml
lostcuaz has quit [Read error: Connection reset by peer]
maattdd has joined #ocaml
lostcuaz has joined #ocaml
<mrvn>
Another idea for better multi-core support would be to have 2 kinds of values: private and sharable. Let the user decide whether to use private/fast or shared/slow.
<orbitz>
mrvn: IMO, multiple interpreters with some message passing between them is probably easiest, and then people can build libraries on top of things like async to wrap the built-in message passing operators
Hannibal_Smith has joined #ocaml
<ousado>
NoNNaN: that also looks interesting, but involves copying again, which is an issue for us
<NoNNaN>
ousado: well, you can compact in place, so yes, possible without copy, could you give some pointers to noncopy collectors?
<NoNNaN>
ousado: i mean, noncopy compacting
<ousado>
the luajit one I linked above
<ousado>
I'm not an expert in the field, but the issues of avoiding fragmentation and scanning free lists are orthogonal to compaction/copying
<ousado>
so I don't buy either argument given for compaction there
<mrvn>
NoNNaN: With that every read needs an indirection, all the time. And modify becomes complex. It can fail and then has to roll back and try again.
avsm has joined #ocaml
<mrvn>
NoNNaN: The read bothers me. The write is probably irrelevant.
<ggole>
Read barriers seem like a pretty heavyweight thing
<mrvn>
and you need to annotate values as volatile or you can't do any unboxing/untagging.
pyon is now known as pyon-away
<mrvn>
It all comes back to: ref/mutable is bad
<ousado>
indeed
tnguyen_ has quit [Ping timeout: 265 seconds]
divyanshu has quit [Quit: Computer has gone to sleep.]
<Drup>
gasche, nicoo : removing the subtyping and using explicit cast is a bit of an issue, because I can't do "(3 + x) mod 5" anymore :/
<Drup>
I can't do Q to Int casts, obviously
tnguyen_ has joined #ocaml
<ggole>
Azul seems to have hardware support for their read barrier
<adrien_oww>
yup
<ggole>
Shades of Lisp Machines there
* ggole
wonders what an ML machine would look like
* adrien_oww
throws a lisp machines at ggole
<orbitz>
tall
<NoNNaN>
ggole: take a look at reduceron
<ggole>
There are some cool tricks with list compaction that you could make very cheap with hardware support
<NoNNaN>
ggole: azul rewrote the linux mm, it can allocate/remap memory at TB/sec rate
<ggole>
Remapping = TLB nuke though
<mrvn>
ggole: invlpg
<ggole>
I think that was one of the things they tried to make cheap on their custom hardware
<NoNNaN>
ggole: you could combine it with user level memory manager eg.: by using libdune: http://dune.scs.stanford.edu/
<ggole>
Pretty cool.
<mrvn>
ggole: building custom hardware to make your language fast is cheating.
<orbitz>
Azul does some sweet things
<ggole>
mrvn: I'm not above cheating :)
<ggole>
mrvn: I do think it is risky though
rand000 has quit [Ping timeout: 240 seconds]
<ggole>
There were "Java chips" and "Lisp Machines" and some other custom stuff
<NoNNaN>
there is no (big) reward without risks
<ggole>
All dead
<adrien_oww>
and now we're all going to run with the mill
<ggole>
Actually I think list compaction can be done in software affordably
<NoNNaN>
ggole: well, not yet, http://www.jopdesign.com -> you could do hard real-time stuff, it has a real-time gc too
<ggole>
It just complicates the runtime and GC
<ggole>
NoNNaN: well, real-time and fast aren't the same thing
<mrvn>
my system is real-time. It will at most delay a job for 10 years.
<ggole>
I'm not sure that real-time techniques are a good idea for general purpose systems
<Hannibal_Smith>
<NoNNaN> ggole: well, not yet, http://www.jopdesign.com -> you could do hard real-time stuff, it has a real-time gc too <-IBM Monotone
<mrvn>
ggole: most general purpose problems don't have a deadline.
<Hannibal_Smith>
Uhm...
<adrien_oww>
when you do network, you have deadlines
<ggole>
Sure: that's why real-time techniques have evolved, to specialise systems for this unusual requirement
<adrien_oww>
but tbh, most of the time, a large buffer and a good throughput will do the job well
<NoNNaN>
Hannibal_Smith: far from it, it provides cpu instruction wcet too
<orbitz>
Hannibal_Smith: do you mean metronome?
<Hannibal_Smith>
Yes, I have pretty bad memory
<ggole>
NoNNaN: by the way, the "Java chip" stuff I had in mind was the ARM instruction set, uh, Jazelle
dapz has quit [Quit: My MacBook Pro has gone to sleep. ZZZzzz…]
<nicoo>
mrvn: Azul also ships a JVM that runs on commodity hardware and uses (AFAIK) a wait-free concurrent GC
<orbitz>
Zing
<NoNNaN>
"At 100 MHz we measured 40 μs maximum blocking time introduced by the GC thread."
marcux has quit [Quit: marcux]
<mrvn>
nicoo: and how much does the read barrier and indirection cost you there?
arjunguha has joined #ocaml
<mrvn>
NoNNaN: One thing I noticed in the shenandoahtake4.pdf. The concurrent makr phase is shown as occuring at the same time. So all threads are stoped. This is a problem in ocaml code since it isn't preemptible like that.
<NoNNaN>
mrvn: "LVB differs from a Brooks-style [6] indirection barrier in that, like a Baker-style [4] read barrier, it imposes invariants on references as they are loaded, rather than applying them as they are used. By applying to all loaded references, LVB guarantees no uncorrected references can be propagated by the mutator, facilitating certain single-pass guarantees."
<mrvn>
But that may be just an artefact of how they drew it.
<nicoo>
mrvn: I never had my hands on it, but from the paper they published, they don't use a read barrier/indirection scheme, but “cheat” using mprotect to intercept writes to pages that are being compacted (all the magic left is “how to get an atomic stack snapshot”, IIRC)
<NoNNaN>
it's from the c4 paper
<mrvn>
nicoo: huh? No. They have indirections all nover the paper.
dapz has joined #ocaml
<mrvn>
NoNNaN: lvb?
<nicoo>
mrvn: I must be confusing with another paper, then