#rom-rb on 2013-12-30 — irc logs at freenode.irclog.whitequark.org

2013-06-04 15:26 solnic changed the topic of #rom-rb to: Ruby Object Mapper | Mailing List: https://groups.google.com/forum/?fromgroups#!forum/rom-rb | Logs: http://irclog.whitequark.org/rom-rb

00:24 coop-cooper has joined #rom-rb

00:31 coop-cooper has quit [Ping timeout: 272 seconds]

00:31 mbj has quit [Quit: leaving]

00:46 mbj has joined #rom-rb

00:58 snusnu has joined #rom-rb

01:31 mbj has quit [Ping timeout: 264 seconds]

05:32 snusnu has quit [Quit: Leaving.]

06:49 juhakaja has joined #rom-rb

07:02 juhakaja has quit [Remote host closed the connection]

07:02 juhakaja has joined #rom-rb

07:07 juhakaja has quit [Ping timeout: 272 seconds]

08:13 juhakaja has joined #rom-rb

08:17 juhakaja has quit [Ping timeout: 245 seconds]

09:15 mbj has joined #rom-rb

09:24 <mbj> dkubb: morning

09:25 <dkubb> mbj: good morning

09:25 <dkubb> just about to log off now though ;)

09:25 <dkubb> was up late doing some sql stuff

09:25 <dkubb> just getting it polished to the point where I can either go onto parsing or hook it into axiom-sql-generator

09:28 <mbj> nice

09:28 <mbj> I'll rip out the last known unparser transitivity issues than focus in morpher again.

09:28 <mbj> Once I finished this I'll introduce the kill expressions to mutant and move to the ROM operator :D

09:29 <mbj> The ROM operator is the pattern I discovered with snusnu and is IMHO a very good abstraction to capture operational transforms of the DB state.

09:32 <dkubb> if you have time tomorrow i'd be curious to know what that is

09:33 <mbj> I hope to have time.

09:33 <mbj> Currently I'm fighting to deal with %r(/) in unparser.

09:34 <dkubb> escaping slashes?

09:34 <mbj> (regexp (str "/") (regopt))

09:34 <mbj> vs

09:34 <mbj> (regexp (str "\\/") (regopt))

09:34 <mbj> first is %r(/)

09:35 <mbj> second is /\//

09:35 <mbj> IMHO parser should give the SAME AST for both.

09:35 <dkubb> yeah it should

09:35 <dkubb> I bet that's a bug

09:36 <mbj> mbj@mbj ~/devel/mutant (master) % bundle exec ruby-parse -e '"\""'

09:36 <mbj> (str "\"")

09:36 <mbj> mbj@mbj ~/devel/mutant (master) % bundle exec ruby-parse -e '%q(")'

09:36 <mbj> (str "\"")

09:36 <mbj> Here it works perfectly.

09:36 <dkubb> the string in both should be passed through Regexp.escape

09:36 <dkubb> and the resultin value stored in the str node

09:36 <mbj> For // style regexps this would lead to double escape.

09:37 <mbj> I think only %r() style regexp should be "transquoted".

09:37 <mbj> for %r() style regexp you must quote )

09:37 <mbj> For // style regexp you must quote /

09:37 <dkubb> parser needs to normalize the string *or* use two separate node types

09:37 <mbj> Exactly.

09:37 <dkubb> how are you supposed to know what the original source was

09:37 <mbj> I could use the source maps

09:38 <mbj> But unparser should work WITHOUT source maps.

09:38 <dkubb> all you've got is the nodes, and they need to provide enough information to be unparsed

09:38 <dkubb> exactly

09:38 <mbj> I think parser should emit str nodes without quotes.

09:38 <dkubb> what's a source map in the context of parser?

09:38 <dkubb> yeah, I agree

09:39 <mbj> Source map allows to recover the original string where the node originates from.

09:39 <dkubb> the string you get should be suitable in this instance: Regexp.new(string)

09:39 <mbj> https://github.com/whitequark/parser/blob/master/doc/AST_FORMAT.md

09:40 <mbj> The ^ and ~~~~ informations are in the source maps.

09:40 <dkubb> yeah, but then *you* become responsible for parsing regexp nodes

09:40 <mbj> I'll open a ticket.

09:40 <mbj> But I'm thinking about a workaround.

09:40 <mbj> I could use the source maps to determine if the string is quoted or not.

09:40 <dkubb> I think you're right about unparsed not needing source maps

09:41 <mbj> Mutant will NOT generated source maps.

09:41 <dkubb> yeah, I can understand it as a work-around

09:41 <mbj> s/generated/generate/

09:41 <mbj> Source maps are used for diagnostics.

09:41 <dkubb> anyone else who uses parser to parse source with regexps is going to run into the same issue

09:41 <dkubb> right

09:42 <mbj> IMHO there is only a two valid use for source maps in unparser

09:42 <mbj> detecting __FILE__ and __LINE__

09:42 <dkubb> debugging?

09:43 <mbj> And to associate comments with nodes for comment unparsing.

09:43 <mbj> ruby-parse -e '__LILE__' # => (int 1)

09:43 <mbj> But source maps will tell me it was a __LINE__

09:44 <mbj> I expected to have a special node for __LINE__ and __FILE__

09:44 <mbj> And there is a option for parser to generate these, but this is not the default.

09:50 travis-ci has joined #rom-rb

09:50 <travis-ci> [travis-ci] Build details : http://travis-ci.org/dkubb/sql/builds/16132737

09:50 <travis-ci> [travis-ci] dkubb/sql#181 (collapse-select - acf585c : Dan Kubb): The build has errored.

09:50 travis-ci has left #rom-rb [#rom-rb]

09:50 <travis-ci> [travis-ci] Change view : https://github.com/dkubb/sql/compare/1a9fda6b75fc...acf585c56a0d

09:54 <mbj> dkubb: https://github.com/whitequark/parser/issues/125

09:56 <dkubb> yeah that makes a good point about how %q works

10:06 mbj has quit [Read error: Operation timed out]

10:28 travis-ci has joined #rom-rb

10:28 <travis-ci> [travis-ci] Build details : http://travis-ci.org/dkubb/sql/builds/16133557

10:28 <travis-ci> [travis-ci] Change view : https://github.com/dkubb/sql/compare/f4a4f8dedde0...309bb615b092

10:28 <travis-ci> [travis-ci] dkubb/sql#187 (master - 309bb61 : Dan Kubb): The build has errored.

10:28 travis-ci has left #rom-rb [#rom-rb]

10:51 mbj has joined #rom-rb

11:12 postmodern has quit [Quit: Leaving]

11:33 travis-ci has joined #rom-rb

11:33 <travis-ci> [travis-ci] Build details : http://travis-ci.org/dkubb/sql/builds/16136040

11:33 <travis-ci> [travis-ci] dkubb/sql#201 (add-parent-node - d70ac48 : Dan Kubb): The build has errored.

11:33 <travis-ci> [travis-ci] Change view : https://github.com/dkubb/sql/compare/c8a575e9f45e...d70ac480de69

11:33 travis-ci has left #rom-rb [#rom-rb]

11:35 mbj has quit [Read error: Connection reset by peer]

11:39 travis-ci has joined #rom-rb

11:39 <travis-ci> [travis-ci] dkubb/sql#203 (add-parent-node - 9703ae6 : Dan Kubb): The build has errored.

11:39 travis-ci has left #rom-rb [#rom-rb]

11:39 <travis-ci> [travis-ci] Build details : http://travis-ci.org/dkubb/sql/builds/16136121

11:39 <travis-ci> [travis-ci] Change view : https://github.com/dkubb/sql/compare/d70ac480de69...9703ae696d61

11:46 travis-ci has joined #rom-rb

11:46 <travis-ci> [travis-ci] dkubb/sql#205 (add-parent-node - b16906b : Dan Kubb): The build has errored.

11:46 <travis-ci> [travis-ci] Build details : http://travis-ci.org/dkubb/sql/builds/16136162

11:46 travis-ci has left #rom-rb [#rom-rb]

11:46 <travis-ci> [travis-ci] Change view : https://github.com/dkubb/sql/compare/9703ae696d61...b16906be9b2e

12:02 jfredett-w1 has joined #rom-rb

12:04 snusnu has joined #rom-rb

12:06 jfredett-w has quit [Ping timeout: 272 seconds]

12:34 snusnu has quit [Quit: Leaving.]

12:41 snusnu1 has joined #rom-rb

12:52 snusnu1 has quit [Quit: Leaving.]

13:11 mbj has joined #rom-rb

13:20 breakingthings has joined #rom-rb

13:42 snusnu has joined #rom-rb

14:02 snusnu has quit [Quit: Leaving.]

14:15 snusnu1 has joined #rom-rb

14:29 CraigBuchek has joined #rom-rb

14:36 snusnu1 has quit [Quit: Leaving.]

14:38 snusnu has joined #rom-rb

14:44 snusnu has quit [*.net *.split]

14:47 cored has joined #rom-rb

14:50 snusnu has joined #rom-rb

14:51 namelessjon_ has joined #rom-rb

14:52 namelessjon has quit [Ping timeout: 246 seconds]

14:53 mbj has quit [Ping timeout: 260 seconds]

14:56 namelessjon has joined #rom-rb

14:58 namelessjon_ has quit [Ping timeout: 245 seconds]

15:01 namelessjon_ has joined #rom-rb

15:03 namelessjon has quit [Ping timeout: 272 seconds]

15:05 namelessjon_ is now known as namlessjon

15:05 namlessjon is now known as namelessjon

15:30 lgierth has joined #rom-rb

15:49 _br_ has quit [Ping timeout: 252 seconds]

15:52 _br_ has joined #rom-rb

16:08 mbj has joined #rom-rb

16:13 solnic has joined #rom-rb

16:34 <mbj> solnic: hola

16:49 snusnu has quit [Quit: Leaving.]

16:52 snusnu1 has joined #rom-rb

17:01 lgierth has quit [Quit: Ex-Chat]

17:09 snusnu1 has quit [Quit: Leaving.]

17:35 solnic has quit [Quit: Linkinus - http://linkinus.com]

17:36 <mbj> dkubb: hola

17:37 <mbj> dkubb: That wierd %r(/) vs /\/ behavior is reflected in Regexp#source its RUBY.

17:37 <mbj> dkubb: So I'll have to workaround it.

17:38 <mbj> dkubb: Gonna try to quote unquoted / in regexps. I think all regexp engines in ruby will contain workarounds for this stuff.

17:38 <mbj> dkubb: Actually I'm not correct about Regexp#source

17:43 mbj_ has joined #rom-rb

17:44 mbj has quit [Read error: Connection reset by peer]

18:18 vovchanskiy has joined #rom-rb

18:19 vovchanskiy has quit [Remote host closed the connection]

18:25 pussen has joined #rom-rb

18:27 pussen has quit [Remote host closed the connection]

19:36 <dkubb> mbj_: good morning

19:37 <dkubb> mbj_: so Regexp#source works properly by normalizing things?

19:38 <dkubb> mbj_: I wonder, does rbx have two different nodes to represent %r(/) vs /\// ?

19:39 <dkubb> dbussink: ^^^

19:41 <dbussink> dkubb: they end up with different sources yes

19:41 <dkubb> dbussink: we were discussing if ruby impls represent %r(/) and /\// differently

19:42 <dkubb> like maybe they have different ast nodes

19:42 <dbussink> dkubb: if you have rbx installed, you can do rbx compile -A -e '/

19:42 <dbussink> dkubb: if you have rbx installed, you can do rbx compile -A -e '/\//'

19:42 <dbussink> and compare that to the other

19:42 <dbussink> -A prints an ast

19:42 <dbussink> -B bytecode

19:43 <dkubb> interesting

19:43 <dkubb> dbussink: so when the regexp is parsed, does there need to be conditional logic or normalization somewhere so that they are treated the same.. since I assume they compile down to the same thing under the hood?

19:44 <dbussink> well, looks like they are actually different under the hood

19:44 <dbussink> but i don't really know the exact details

19:45 <dkubb> the original assumption mbj and I had was that when they are parsed by whitequark/parser, they would be represented the same in the ast .. we thought looking at how ruby impls do it that we might understand why/if they are different

19:54 <dkubb> mbj_: Regexp.new(Regexp.new('/').to_s).source == Regexp.new(Regexp.new('\\/').to_s).source # => true

19:55 <dkubb> mbj_: the ruby docs for Regexp#to_s say "This string can be fed back in to Regexp::new to a regular expression with the same semantics as the original."

19:55 <dkubb> mbj_: so maybe you can just normalize it yourself if you wanted

20:01 <dkubb> mbj_: i was thinking about doing a bit of the parsing logic for sql.rb. I was wondering if you had any projects to point me towards or if you're interested in helping me get started? I think for me the biggest problem is that there's a lack of docs and example projects using ragel (aside from whitequark/parser) and I'm not yet sure if there's a better approach to start with or if I have to find it via trial and error

20:22 mbj_ is now known as mbj

20:24 postmodern has joined #rom-rb

20:26 <mbj> dkubb: From what I now, ruby is a language you have to lex and parse at the same time.

20:26 <mbj> dkubb: Because of the lvar / method call ambiguity at lexer level.

20:26 <mbj> dkubb: And ragel is not able to generate LR(0) grammars. For that reason wquitequark used racc

20:27 <mbj> dkubb: I think you should find a .y file from a well known implementation (postgres) and implement the C parts in ruby.

20:27 <mbj> dkubb: I'd pick parts of the files.

20:31 <postmodern> are you building your own Ruby implementation now?

20:47 <mbj> postmodern: I'd not implement my own ruby.

20:47 <mbj> postmodern: I'd implement a subset ;)

20:48 <mbj> postmodern: Reason behind this discussion, unparsing %r(/) vs /\//

20:48 <mbj> postmodern: for unparser (mutant)

20:48 <postmodern> ah ha

20:49 <postmodern> supposedly you can get access to the internal regexp tree

20:49 <postmodern> i was looking into it to write a regexp fuzzer

20:49 <mbj> postmodern: I'll do one for mutant

20:50 <mbj> postmodern: I'm on phone bbl

20:50 <postmodern> of course things like /a*/ are hard to fuzz :)

21:15 <mbj> postmodern: The thing is, unparser tries to archieve the following invariant:

21:15 <mbj> Parser.parse(Unparser.unparse(Parser.parse(source))) == Parser.parse(source)

21:16 <mbj> So it is totally okay to emit a regexp literal in original source like %r() as //

21:16 <mbj> BUT the regexp contents should be the same.

21:17 <postmodern> ah ha

21:17 <mbj> %r(/) gets parsed as (regexp, (str "/))

21:17 <postmodern> yeah by regexp fuzzer, i was referring to taking a regexp and generating all possible inputs

21:17 <mbj> /\// gets parsed as (regexp, (str "\\/"))

21:17 <postmodern> where as mutant wants to mutant the regexp itself

21:17 <mbj> postmodern: Ahh I thought the other way round.

21:17 <mbj> postmodern: got it.

21:18 <mbj> postmodern: Yeah I think mutant has the easier problem ;)

21:18 <mbj> postmodern: So a generic unparser that does NOT know the original delimiter has a problem.

21:19 <mbj> postmodern: Because literals like /\// already contain the quoted delimiter in str body, and literals like %r(/) do not.

21:19 <mbj> postmodern: I think I need to make unparser source map aware for this node, wich I dislike.

21:24 lgierth has joined #rom-rb

21:36 CraigBuchek has quit [Quit: Leaving.]

21:46 lfox has joined #rom-rb

22:07 CraigBuchek has joined #rom-rb

22:13 breakingthings has quit []

22:51 <dkubb> postmodern: I thought it might be possible to use https://github.com/ammar/regexp_parser for parsing regexps, and from there the same kind of structure as mutant could be used to mutate each kind of node

22:54 <dkubb> postmodern: I'd guess when mutant begins to mutate regexps most code will have dozens or uncovered mutations for each regexp.. people are pretty bad at testing regexps against possible inputs

22:54 <dkubb> *dozens of

22:55 <postmodern> ^ $ vs. \A \z

22:56 <dkubb> oh yeah

22:57 <dkubb> what I would typically do is'

22:57 <postmodern> but doesn't that mean developers will have to write tests for every malformed input?

22:57 <dkubb> mutate from the weaker to strong nodes

22:57 <postmodern> just to ensure the regexp rejects it?

22:57 <mbj> postmodern: You need to have a counter example for each mutation.

22:57 <dkubb> I dunno, I think they might have to write a test for each class of valid input

22:58 <mbj> postmodern: I think the set of counter examples is finite and probably 2 times the amount of nodes in the regexp AST.

22:58 <postmodern> ah like [a-z] -> [^a-z]

22:58 <mbj> exactly

22:58 <mbj> Or a|b => b

22:58 <postmodern> mbj, hmm what about really complex regexps, like email validation?

22:58 <postmodern> mbj, that might generate a ton of counter examples

22:58 <mbj> postmodern: tbh dunno.

22:59 <mbj> postmodern: Next problem mutant currently only "sees" nodes inside def and defs nodes.

22:59 <mbj> my regexps these days are within class / module bodies.

23:00 <mbj> postmodern: I expect you'll not try to mutation cover email regexps via testing against a corpus of valid email addresses.

23:00 <mbj> And also I think the libraries shipping this regexps would be mutation covered and a typicall user would not reinvent such "complex" regexps.

23:01 <mbj> And if he does, he might thing: Uneasy to mutation cover, maybe I should refer to a lib before failing myself.

23:01 <mbj> Wich is a commond side effect of mutation covering your code :D

23:01 <postmodern> mbj, good points

23:02 <mbj> postmodern: With the next version of mutant I expect to have a very fine grained configuration, I dont expect most users will go for "full rom style coverage".

23:02 <mbj> postmodern: But its very helpfull to explore the quality of your software via just using the mutations as a metric.

23:03 <mbj> postmodern: I'll need to add tons of documentation and invent some wording, for example we have a problem with explicit and implicit mutation coverage, wich is IMHO not in the current mutation coverage literature.

23:05 <mbj> postmodern: The configuration for not removing explicit returns for implicit ones is already in a branch. You'll like it. And I hope ronin could be mutation covered :D

23:06 <mbj> dkubb: In unparser I have the concept of "terminated" nodes.

23:06 <mbj> dkubb: terminated nodes are guaranteed to get emitted as composable expressions you could use "everywhere".

23:07 <mbj> dkubb: For example a fixnum literal is "terminated".

23:07 <dkubb> mbj: what's a counter-example?

23:07 <mbj> dkubb: range

23:07 <mbj> dkubb: 1..2

23:07 <mbj> dkubb: if you have a range as receiver you need parenthesis

23:08 <mbj> dkubb: an ast like (send (irange (int 1, int 2)), :foo) must be emitted as (1..2).foo

23:08 <dkubb> mbj: in sql I have it so the emitter can parenthesize based on what the parent node is

23:08 <mbj> dkubb: The emitters all support #terminated?

23:08 <dkubb> mbj: so effectively all my sql statements are terminated now

23:09 <mbj> dkubb: Yeah, I used the same strategy for unparser a while

23:09 <mbj> dkubb: But it could manifest in "douple parenthesis"

23:09 <dkubb> how do you think?

23:09 <mbj> dkubb: Because sometimes termination does not depend only on node type

23:09 <dkubb> I have it so the node is responsible for parenthesizing itself

23:09 <dkubb> or rather the node's emitter

23:09 <mbj> I think this will work for SQL

23:10 <dkubb> yeah, it's much simpler than ruby

23:10 <mbj> For ruby it lead to unneded terminals in the output

23:10 <mbj> dkubb: Base implementation relying on node type here: https://github.com/mbj/unparser/blob/master/lib/unparser/emitter.rb#L147

23:11 <mbj> dkubb: Instance specific here: https://github.com/mbj/unparser/blob/master/lib/unparser/emitter/send.rb#L29-L34

23:12 <mbj> dkubb: Just mentioning this now because you asked for input abaut how to handle parenthesis generation.

23:12 <mbj> dkubb: And I lost a verbose github comment in edit ;)

23:12 <dkubb> that's cool

23:12 <dkubb> I ack'd the comment

23:13 <mbj> dkubb: Here is the helper https://github.com/mbj/unparser/blob/master/lib/unparser/emitter.rb#L201-L214

23:13 <mbj> dkubb: The helper I use for cases where a non terminated node must get parenthesis.

23:14 <mbj> Maybe this will be the default behavior.

23:14 <mbj> See the difference of #visit(node) and #visit_termianted(node)

23:14 <dkubb> mbj: I thought you might like this: https://github.com/dkubb/sql/blob/master/lib/sql/generator/emitter.rb#L95-L102

23:14 <dkubb> mbj: so much of SQL fits this

23:15 <mbj> dkubb: yeah

23:15 <mbj> dkubb: I could use this instead of my max < index thing!

23:16 <dkubb> sweet

23:16 <dkubb> I refactored the original code to use this

23:16 <dkubb> I like it better, no conditional tests, etc

23:16 <mbj> Yeah, full ack.

23:17 <mbj> thats the normal evolution when multiple eye pairs do a pass over the code.

23:17 <mbj> I'm gonna write an JS unparser soon.

23:17 <mbj> I think we might end up in sharing code between sql / unparser / js / aql (if I get fundet to rewrite it)

23:17 <mbj> *funded

23:31 lfox has quit [Quit: ZZZzzz…]

23:36 lfox has joined #rom-rb