#nanoc on 2013-09-10 — irc logs at freenode.irclog.whitequark.org

01:24 jugglinmike has quit [Quit: Leaving.]

04:18 jadd_ has joined #nanoc

04:27 alerante_ has quit [Remote host closed the connection]

05:32 <bobthecow> musicmatze: nanoc checksums are stored in a PStore: https://github.com/nanoc/nanoc-core/blob/master/lib/nanoc/core/store.rb#L129-L131

05:32 <bobthecow> which is this: http://www.ruby-doc.org/stdlib-1.9.3/libdoc/pstore/rdoc/PStore.html

05:32 <bobthecow> it's basically transactional hash persistence to a file.

05:32 <bobthecow> which is kinda awesome.

05:32 <bobthecow> the format you're looking at is "marshalled" ruby objects.

05:32 <bobthecow> they're one of the fastest ways to store things in Ruby.

05:33 <bobthecow> http://ruby-doc.org/core-2.0.0/Marshal.html

05:33 <bobthecow> basically it's a bytestream representation that allows serializing and unserializing objects.

05:33 <bobthecow> when they come back to life, they have the exact same methods and everything as before.

05:34 <bobthecow> so it's not just a list to look things up in, it's a map from object identifiers to checksums.

05:35 <bobthecow> instead of saying "does this checksum exist" you can ask "what was the last checksum for that"

05:35 <bobthecow> and because it's a hash, it's actually faster than searching a string.

05:39 <bobthecow> i didn't write this particular bit, but i can imagine the reasons are (1) PStore + Marshal are really performant in Ruby, (2) Hash lookups are also really performant in Ruby, (3) PStore is transactional, so it's threadsafe, (4) by using a generic "store" object type, nanoc can store the dependency graph, checksums, rule caches, and a bunch of other things in the same way, reducing bugs and increasing code reuse. See: https://github.com/nanoc/nan

05:44 <bobthecow> musicmatze: sorry for the wall-o-text :)

06:57 <musicmatze> bobthecow: Thanks for your explanation! One thing: (3) threadsafe... doesnt matter, ruby is not real multithreading... but yeah, makes sense to me!

06:57 skroon_ has quit [Ping timeout: 245 seconds]

06:57 skroon has quit [Ping timeout: 245 seconds]

07:04 <musicmatze> Sadly, I cannot implement the caching like this in my ssg, I guess. Things are much different :-P

07:09 <musicmatze> ... but a sorted list of checksums would have one problem: Some time, the file with the checksums would get really really large, as we cannot figure out which checksum is old :-P ... I have to think about this... :-D

07:09 <bobthecow> what language are you building yours in?

07:09 <musicmatze> C

07:09 <bobthecow> you could make a second file every time you compile.

07:09 <bobthecow> then move it over the first after compiling.

07:10 <musicmatze> that does not work with inc. generation, as not everything gets compiled when compiling. Some checksums would be lost...

07:10 <bobthecow> ahh, right.

07:10 <bobthecow> well, no.

07:11 <bobthecow> for incremental generation, you'd still have to check the source files.

07:11 <bobthecow> to know whether to recompile something.

07:11 <bobthecow> and that's where the checksum comes from.

07:11 <bobthecow> so keep track of 'em during that phase.

07:11 <musicmatze> aaah, you're right!

07:12 <bobthecow> for really optimal stuff, you could generate the new list of checksums, sort 'em, then walk through the old list and new list side by side and decide which of the new needs to be recompiled.

07:13 <musicmatze> and that's easy to implement, as I don't need the stuff around the checksums. After the loading is done, I can store the "old" checksums in a simple list...

07:13 <bobthecow> that would save you searching, because you'd only have to look at most one or two forward.

07:14 <musicmatze> that's really great! I think I will talk to you again later because of this, but I have to go to work now!

07:14 <bobthecow> what is the current state of your ssg?

07:14 <musicmatze> The kernel is kinda ready, I have to write at least five modules to have a complete ssg which can be compared to jekyll or nanoc!

07:14 jadd_ has quit [Quit: Leaving...]

07:14 <bobthecow> how's it compare performance-wise so far?

07:15 <bobthecow> you might even be able to skip the incremental compiles and DAG stuff.

07:15 <musicmatze> But there will be changes in the kernel if I realize I need a functionality it does not have yet.

07:15 <bobthecow> http://hugo.spf13.com

07:15 <bobthecow> a friend of mine wrote that.

07:15 <bobthecow> and it's *crazy* fast.

07:15 <musicmatze> I cannot compare it by now, as it cannot compile by now!

07:15 <bobthecow> like, milliseconds to compile entire sites fast.

07:15 <bobthecow> so he doesn't even bother with checksums or incremental compiles or dependencies or anything.

07:16 <musicmatze> yeah, I read about that...

07:16 <bobthecow> just reads in the whole site, compiles everything, and prunes the output.

07:18 <musicmatze> off to work now...

07:18 skroon_ has joined #nanoc

07:18 skroon has joined #nanoc

07:18 <bobthecow> have fun!

07:27 yogsototh has joined #nanoc

07:43 jadd_ has joined #nanoc

08:18 jadd_ has quit [Ping timeout: 276 seconds]

08:38 <guardian> o/

08:41 <bobthecow> hallo guardian.

08:54 jadd_ has joined #nanoc

09:11 bobthecow has quit [*.net *.split]

09:12 skroon has quit [*.net *.split]

09:12 skroon_ has quit [*.net *.split]

09:12 koan has quit [*.net *.split]

09:12 _br_ has quit [*.net *.split]

09:12 pepijndevos has quit [*.net *.split]

09:17 pepijndevos has joined #nanoc

09:20 skroon has joined #nanoc

09:20 _br_ has joined #nanoc

09:20 bobthecow has joined #nanoc

09:20 skroon_ has joined #nanoc

09:20 koan has joined #nanoc

09:28 jadd_ has quit [Ping timeout: 245 seconds]

09:45 <ddfreyne> musicmatze: look into cap'n proto if you want to serialise stuff

09:54 <guardian> bobthecow: how customizable is Hugo compared to Nanoc?

09:55 <bobthecow> not nearly as customizable.

09:55 <bobthecow> it's more like jekyll in that regard.

09:55 <bobthecow> in that you can't really customize it with code.

09:55 <bobthecow> but you *can* do a lot more than with jekyll.

09:56 <bobthecow> for example, it's not limited to one "blog"-ish content type

09:56 <guardian> ok

09:56 <bobthecow> you can have as many types and representations of them as you want.

09:56 <bobthecow> and can compile them into index pages and such.

09:56 <bobthecow> but it's all done with templating and metadata, not with code.

09:57 <bobthecow> https://github.com/spf13/hugo/tree/master/docs

09:57 <bobthecow> that's a fairly standard hugo site.

09:58 <guardian> I'll stick to nanoc I guess

09:58 <bobthecow> :)

09:58 <bobthecow> i'm still using nanoc.

09:58 <bobthecow> as tempting as the millisecond compile times are.

09:58 <bobthecow> :P

09:59 <guardian> yeah my site compiles really slowly :(

09:59 <guardian> but I know why

09:59 <bobthecow> oh yeah?

09:59 <guardian> my images go through nanoc's pipeline

09:59 <bobthecow> mine's not horrible anymore.

09:59 <bobthecow> oh. yeah. don't do that.

09:59 <guardian> an image has 3 reps

09:59 <guardian> thumbnail, default, original

09:59 <guardian> original is pass through

10:00 <bobthecow> honestly, i'd define a second nanoc site in a subdirectory, and have it output images into ../static/

10:00 <guardian> thumbnail uses smart cropping —> it detects the region in the original image that has interest

10:00 <bobthecow> then you only compile that when it changes.

10:00 <guardian> and default is just keep aspect ratio and resize to max width

10:24 jadd_ has joined #nanoc

10:38 <ddfreyne> guardian: You could probably also use Make to pre-render the images

10:39 <ddfreyne> maybe compile then to an output_assets/ dir and don't let those go through nanoc at all

10:39 <ddfreyne> Assets are a bit of a pain point in nanoc, but I don't have a proper way around that

10:40 <ddfreyne> nanoc doesn't provide a "clean" command to rm -rf tmp output, and I want to keep it that way

10:43 <bobthecow> ddfreyne: is it a dependency graph issue that makes nanoc poorly suited to an image workflow?

10:43 <bobthecow> it seems to work out fine for me, but i've got a decidedly not image-heavy site.

10:44 <bobthecow> it seems like checksums and the like could almost always make images not recompile.

10:44 <bobthecow> since it's not like they depend on anything.

10:58 jadd_ has quit [Ping timeout: 245 seconds]

11:01 tbm_ has quit [Quit: .]

11:01 tbm_ has joined #nanoc

11:02 tbm_ is now known as tbm

11:34 jachymko has joined #nanoc

11:54 jadd_ has joined #nanoc

11:55 alerante has joined #nanoc

11:56 jugglinmike has joined #nanoc

12:01 jachymko has quit [Ping timeout: 245 seconds]

12:11 jachymko has joined #nanoc

12:21 <ddfreyne> bobthecow: Yeah. Any changes to config.yaml/nanoc.yaml, lib and some other things will cause everything to be recompiled

12:21 <ddfreyne> That is clearly subobtimal, but required if you don't want a "nanoc clean" command

12:22 <guardian> the point imho is to get everything through nanoc's pipeline

12:22 <guardian> as soon as you start hacking around nanoc, with Rake or Make, then it shows a deficiency :)

12:23 <ddfreyne> guardian: It is not super easy to fix :(

12:23 <guardian> I know

12:28 jadd_ has quit [Ping timeout: 268 seconds]

12:36 alerante has quit [Remote host closed the connection]

12:46 <ddfreyne> Dependency tracking on config.yaml is possible (that would be the fine-grained dependencies that have been in the queue for a long time)

12:47 <ddfreyne> For cahnges in lib, the situation is harder. I'm thinking of *requiring* a lib/ structure like lib/filters/, lib/helpers/ and then explicitly declaring which helpers are used.

13:09 yogsototh has quit [Remote host closed the connection]

13:10 yogsototh has joined #nanoc

13:18 <guardian> I won't be against structure inside lib/

13:24 jadd_ has joined #nanoc

13:48 pavelkunc has joined #nanoc

13:58 jadd_ has quit [Ping timeout: 264 seconds]

14:36 skroon_ has quit [Ping timeout: 268 seconds]

14:36 skroon has quit [Ping timeout: 268 seconds]

14:50 jadd_ has joined #nanoc

14:52 jeremyjarvis has joined #nanoc

15:00 skroon has joined #nanoc

15:01 skroon_ has joined #nanoc

15:01 jadd_ has quit [Quit: Leaving...]

15:02 jadd_ has joined #nanoc

15:03 jeremyjarvis has left #nanoc [#nanoc]

15:03 jachymko has quit [Ping timeout: 240 seconds]

15:29 jachymko has joined #nanoc

15:49 jachymko has quit [Ping timeout: 264 seconds]

16:19 pavelkunc has quit [Quit: Leaving.]

17:23 jadd_ has quit [Quit: Leaving...]

17:30 jadd_ has joined #nanoc

17:31 jadd_ has quit [Client Quit]

17:38 cDlm has joined #nanoc

17:38 cDlm has quit [Client Quit]

20:18 bghost has joined #nanoc

20:36 jachymko has joined #nanoc

20:47 jachymko has quit [Quit: ZNC - http://znc.in]

20:52 jachymko has joined #nanoc

20:52 jachymko has quit [Client Quit]

20:52 jachymko has joined #nanoc

20:52 jachymko has quit [Client Quit]

20:56 jachymko has joined #nanoc

22:02 yogsototh has quit [Remote host closed the connection]

22:05 bghost has quit [Ping timeout: 276 seconds]

22:06 bghost has joined #nanoc

22:38 stbuehler has quit [Quit: leaving]

22:43 stbuehler has joined #nanoc

22:44 stbuehler has quit [Client Quit]

22:45 stbuehler has joined #nanoc

22:59 cDlm has joined #nanoc

23:01 stbuehler has quit [Quit: leaving]

23:01 stbuehler has joined #nanoc

23:33 bghost has quit [Ping timeout: 246 seconds]

23:39 dbast has quit [Quit: '']

23:40 dbast has joined #nanoc