#nmigen on 2020-04-20 — irc logs at freenode.irclog.whitequark.org

2020-01-27 18:31 ChanServ changed the topic of #nmigen to: nMigen hardware description language · code at https://github.com/nmigen · logs at https://freenode.irclog.whitequark.org/nmigen

00:14 Degi_ has joined #nmigen

00:17 Degi has quit [Ping timeout: 256 seconds]

00:17 Degi_ is now known as Degi

00:57 <ianloic> shouldn't it just be subtraction & checking the underflow?

01:03 <awygle> Yeah

01:30 <zignig> ianloic: https://zipcpu.com/blog/2019/06/28/genclk.html , it's verilog, but detailed and explains well (as with all ZipCPU's stuff)

01:54 <awygle> lol that's kinda doofy

01:56 <zignig> doofy ? , kind of silly , but still works?

01:57 <awygle> yeah

01:57 <awygle> like i would not go into a design with that as the plan

01:57 <awygle> but it's a fun little game

01:57 <awygle> and it might be a good back-pocket trick to save you if you mess up something greiviously and are trying to hack around it

02:00 <zignig> indeed, did you manage to sort out your SLIP .vcd weirdness ?

02:00 <awygle> i did by rewriting the module >_> and it magically went away

02:00 <awygle> but i was doing some iffy stuff, so i think i maybe had a combinational loop

02:01 <zignig> BORK! , glad you fixed it.

02:02 <awygle> thanks :)

02:02 <awygle> how's your bonelessing going?

02:06 <zignig> getting there, just updating it to the new instruction set. Looking to put a streaming uart so it can store unprocessed char without exploding.

02:06 <awygle> cool cool

02:07 <zignig> and then CSR integration, I wrote my own , but nmigen-soc is a better plan.

02:08 <zignig> jnfg as been working on some peripheral designs in https://github.com/lambdaconcept/lambdasoc , looks kind of cool.

02:08 <zignig> *jfng

02:09 <awygle> yep

02:18 <awygle> this is maybe a poorly formed question but is there a way in nmigen to indicate that a set of conditions should be mutually exclusive? for guard in guards: if guard: action, and make sure all the "guard"s are orthogonal?

02:18 <awygle> that might be undecidable or something i guess

02:25 <zignig> in terms of an FSM or general logic ?

02:25 <awygle> my specific case is FSM transitions

02:25 <awygle> wanting to make sure they're not conflicting

02:26 <awygle> but i could see it being useful in other contexts

02:28 <zignig> perhaps a one hot encoder (lib/coding.py) in your FSM logic could force mutual exclusivity ?

02:30 <awygle> hm

02:30 <awygle> maybe

02:30 <awygle> i'll think about that approach, thanks

02:31 <zignig> no problem

05:11 ____ has joined #nmigen

06:41 thinknok has joined #nmigen

06:44 chipmuenk has joined #nmigen

07:24 thinknok has quit [Ping timeout: 246 seconds]

07:31 Asu has joined #nmigen

08:02 futarisIRCcloud has quit [Quit: Connection closed for inactivity]

08:10 thinknok has joined #nmigen

08:31 FFY00 has quit [Remote host closed the connection]

08:31 FFY00 has joined #nmigen

08:34 FFY00 has quit [Max SendQ exceeded]

08:35 FFY00 has joined #nmigen

08:44 Ultrasauce has quit [Ping timeout: 240 seconds]

09:12 <MadHacker> awygle: Can't you just use a normal assert and do signal1 & (signal2 | signal3 | signal4) == 0 for example?

09:44 Ultrasauce has joined #nmigen

09:53 Asuu has joined #nmigen

09:56 Asu has quit [Ping timeout: 264 seconds]

10:00 thinknok has quit [Ping timeout: 272 seconds]

10:05 proteus-guy has quit [Ping timeout: 256 seconds]

10:08 FFY00 has quit [Remote host closed the connection]

10:11 FFY00 has joined #nmigen

10:18 proteus-guy has joined #nmigen

10:28 thinknok has joined #nmigen

10:58 <Sarayan> gtkwave has the most atrocious coding style ever

10:58 <Sarayan> at least for a non-joke one

11:00 <daveshah> As far as indentation goes, arachne-pnr was quite interesting...

11:10 <whitequark> doen't arachne-pnr use the gnu style?

11:11 <daveshah> Oh, maybe it does

11:11 <daveshah> I don't think I've ever really looked at the source of any gnu stuff

11:11 <whitequark> i love how the gnu code all uses \t as a shortcut for 8 spaces

11:12 <whitequark> but uses 2 columns for indentation

11:12 <whitequark> so it's quite literally combining the worst things about tabs *and* space.

11:13 <daveshah> Yeah, my experiences of editing arachne while preserving formating were never fun

11:13 <whitequark> apparently emacs does it automatically, because conserving space characters was important in 1980 was important or something idiotic like that

11:13 <whitequark> this is one of the reasons to never contribute to gnu code

11:13 <daveshah> If nextpnr didn't exist and maintenance of arachne continued, then I had plans to run the whole code base through clang format

11:16 <Sarayan> arachne was a better name :-P

11:18 <Sarayan> I need to convert the output of a LA to something gtkwave handles. Any recommendations? The gtkwave guys seem to like the fst format the most, but that may not be for the right reasons

11:20 <Sarayan> (like it's our format ! it's cool ! and it's compressed and stuff !)

11:24 <whitequark> never really looked close at FST

11:24 <whitequark> what kind of LA? how much data?

11:25 <Sarayan> Agilent LA, ala/alb formats

11:25 <Sarayan> never really big

11:26 <Sarayan> https://twitter.com/PhilBennett3D/status/1252179341523382272

11:26 <Sarayan> Phil is doing probes for me :-)

11:27 <whitequark> pyvcd should be ok

11:28 <Sarayan> hmmm, and do the converter in python? That makes some sense

11:32 <whitequark> ye

11:42 Vinalon has quit [Ping timeout: 264 seconds]

12:09 <MadHacker> Sarayan: If it's an *old* agilent LA I may have code for doing that already.

12:10 <MadHacker> I have an HP1661CS that you can pull the data off via FTP, and I've code for that -> vcd in python somewhere.

12:19 <Sarayan> dunno, the alb format is quite generic

12:50 <MadHacker> Looking at the code that's here (was written by a friend, not me) I don't think it's suitable. Seems to read a setup binary config from the scope and then a raw buffer of the data.

12:52 <Sarayan> that's must be the ala format, seems way more model-specific

12:52 <Sarayan> alb is a text description of the contents followed by the uncompressed contents in binary

12:52 <Sarayan> easy, tbh

13:49 thinknok has quit [Ping timeout: 272 seconds]

13:58 <awygle> Fst is actually quite reasonable afaict

13:58 <awygle> I read the paper and some of the source code

14:03 Asuu has quit [Read error: Connection reset by peer]

14:06 Asuu has joined #nmigen

14:56 Vinalon has joined #nmigen

15:08 <tpw_rules> is there a guide or tutorial or notes anywhere on testing and formal verification with nmigen?

15:09 <whitequark> tpw_rules: it is unfortunately somewhat incomplete, in the sense that there is no good integration with sby

15:09 <whitequark> Robert Baruch has been using it extensively though so I would look at his tutorial

15:19 thinknok has joined #nmigen

15:49 Asuu has quit [Ping timeout: 256 seconds]

15:49 Asuu has joined #nmigen

16:11 ____2 has joined #nmigen

16:13 ____ has quit [Ping timeout: 264 seconds]

16:57 Asuu has quit [Read error: Connection reset by peer]

17:00 Asu has joined #nmigen

17:04 <awygle> g'morning

17:04 <whitequark> hi!

17:08 <awygle> how's everybody doing today?

17:09 <whitequark> i made cxxrtl 30% faster

17:09 <whitequark> and opened the way towards making it 3 times faster on top of that

17:10 <whitequark> i... might actually achieve parity with single-threaded verilator? not just "order of magnitude" but "basically the same speed"

17:10 <awygle> i saw that

17:10 <whitequark> not sure yet

17:10 <awygle> dorbs little diff

17:10 <awygle> must have been a _very_ hot path

17:10 <whitequark> nope

17:10 <whitequark> it's in a code generator

17:10 <whitequark> not in the generated code

17:11 <whitequark> it changed the way some things are scheduled, eliminating a lot of delta cycles

17:11 <awygle> ah, ok

17:13 <awygle> well excellent work :)

17:13 <Sarayan> I don't think verilator does anything special, does it?

17:14 <awygle> it cheats in a few ways but i suspect cxxrtl cheats in those same ways (e.g. two-valued logic)

17:14 <Sarayan> oh, you need to -I share/yosys/install

17:15 <Sarayan> oh, you need to -I share/yosys/iclude that is

17:15 <Sarayan> oh, you need to -I share/yosys/include that is

17:15 <whitequark> awygle: cxxrtl doesn't cheat by assuming purely sync logic

17:15 <whitequark> unlike verilator

17:15 <awygle> ah ok

17:15 <awygle> i had forgotten verilator does that

17:15 <whitequark> literally the only thing that cxxrtl does if you make a latch from NAND cells is it becomes a lil bit slower

17:15 <awygle> that's a _big_ cheat

17:15 <whitequark> yes.

17:15 <whitequark> and it is much more insidious than you think

17:15 <awygle> speaking of latches, does nmigen warn on inferred latches? my guess is "no"

17:16 <awygle> because that's a backend thing

17:16 <whitequark> do you ever think why verilator always flattens your designs?

17:16 <Sarayan> urgh, everything is broken, life is terrible

17:16 <awygle> i don't actually use verilator

17:16 <awygle> Sarayan: i think you've migrated your terribleness from #yosys to #nmigen without realizing :)

17:16 <whitequark> because if you have e.g. comb feedback in wishbone, if you do not flatten your design, you will get apparent feedback arcs

17:16 <Sarayan> awygle : indeed

17:16 <whitequark> (lots of other things too)

17:16 <Sarayan> -ETOOMANYCHANNELS

17:17 <whitequark> so verilator, which cannot cope with any feedback loops *at all*, even apparent ones, has to require you to flatten

17:17 <awygle> sure, absolutely

17:17 <whitequark> cxxrtl on the other hand, you can even compile every verilog module as a separate c++ file

17:17 <Sarayan> do you topo-sort if flattened?

17:17 <whitequark> this... is unlikely to lead to an overall decrease in compile+run time unless your designs are stupidly large, but you can do it

17:18 <whitequark> Sarayan: better

17:18 <Sarayan> Oh?

17:18 <whitequark> i do not just topo-sort

17:18 <whitequark> topo sort doesn't work on graphs with loops, right?

17:18 <whitequark> so if i have a graph with a loop, i actually *minimize the amount of edges that loop back*

17:18 <Sarayan> well, you can always BFS it, but otherwise no

17:19 <whitequark> meaning that cxxrtl not just simulates your designs with feedback loops, but it does so in a near-optimal way

17:19 <whitequark> (with a heuristic, the optimal solution is NP-complete)

17:19 <whitequark> awygle: also, i plan to add X-propagation to cxxrtl

17:20 <whitequark> not sure about full 4-valued logic because i kind of hate the way Z works inside a design

17:20 <awygle> that np-complete problem is at generation time though, right? if it has a well-known solution (e.g. graph coloring) it might be nice to have a switch to use it if sim time is more important than compile/generation time

17:20 <whitequark> yes, at generation time

17:20 <whitequark> the problem is called "feedback arc set"

17:21 <whitequark> I'm using this: https://pdfs.semanticscholar.org/c7ed/d9acce96ca357876540e19664eb9d976637f.pdf

17:21 <sorear> you like X more than Z?

17:22 <awygle> so the "Well known solution" would be the Berger/Shor algorithm then i guess

17:22 <whitequark> they claim O(edges) runtime, which is actually not true, because they suggest I "obviously" use a data structure which does not appear to exist

17:22 <whitequark> >. Unfortunately, the Berger/Shor algorithm

17:22 <whitequark> is complicated and requires running time O(mn). In this paper we present a simple

17:22 <whitequark> FAS algorithm which guarantees a good (though not optimal) performance bound

17:22 <whitequark> and executes in time O(m). Further, for the sparse graphs which arise frequently in

17:22 <whitequark> graph drawing and other applications, our algorithm achieves the same asymptotic

17:22 <whitequark> performance bound that Berger/Shor does

17:22 <whitequark> the algo in this paper is stupidly simple

17:22 <Sarayan> X and Z tend to not exist in fpgas, right?

17:22 <Sarayan> not much in ICs either

17:22 <awygle> yeah doesn't sound worth it then

17:23 <awygle> whitequark: in defense of Z, we have talked about wanting to simulate e.g. Pins for whole-system simulation before, which would benefit from having Zs

17:23 <whitequark> awygle: https://github.com/YosysHQ/yosys/blob/master/backends/cxxrtl/cxxrtl.cc#L30-L172

17:23 <awygle> if it can be had without costing too much perf in the non-z case

17:23 <whitequark> 140 lines including all the C++ junk

17:23 <sorear> real Z was common when verilog was new, not so much now

17:24 <awygle> lol "it is clear"... no it isn't

17:24 <whitequark> now the funny thing about this paper is they suggest a data structure that gives you O(1) unlink *and* O(1) find-max-key

17:24 <Sarayan> awygle: Lately I tend to just set the output to ff for Z, and and every output

17:24 <whitequark> i have no earthly clue how such a structure would even look like

17:25 <Sarayan> depends on the O() of insert?

17:25 <Sarayan> I mean, sorted linked lis?

17:25 <whitequark> unlink and relink

17:26 <awygle> use an O(1) unlink structure and store index(max-key) on insert?

17:26 <mwk> ... wait, RTLIL doesn't allow logic loops?

17:26 <awygle> oh, relink also

17:26 <sorear> van der warden tree and play dumb if anyone asks you about the word size dependence?

17:27 <Sarayan> Errr, O(1) link, unlink and find-max makes sort O(n), so dream on paper author

17:27 <whitequark> mwk: RTLIL requires logic loops to be broken up with `sync always` rule

17:27 <awygle> that would be easy for me, because idk what that is :)

17:27 * awygle googles

17:27 <mwk> the what

17:27 <mwk> the process thing?

17:28 <Sarayan> wq: You can have O(log(n)) link unlink and O(1) find max with a heap, that's oflen good enough before log is low

17:28 <sorear> sorry, van emde boas

17:28 <whitequark> Sarayan: yes, there are tons of ways to get it "fast enough"

17:28 <whitequark> Sarayan: and in fact right now i literally sort the entire bin mappign each time i need a max key

17:28 <Sarayan> ok, not a real issue then

17:29 <sorear> but the "play dumb about whether the word size counts as an O(1) or O(log)" is the important part

17:29 <whitequark> which is totally braindead and yet never shows up in profile

17:29 <whitequark> it's just uh

17:29 key2 has quit [Ping timeout: 245 seconds]

17:29 <whitequark> i spent literal days trying to figure out how the fuck their O(m) bound can possibly be real

17:29 <whitequark> and i still have no idea

17:29 <Sarayan> nlog(n) find-max is bad you know :-)

17:30 <whitequark> oh wait

17:30 <whitequark> Travis Downs figured it out

17:30 <whitequark> https://twitter.com/trav_downs/status/1204423657872928770

17:30 key2 has joined #nmigen

17:31 <mwk> so what happens when you have a logic loop (say you implemented a latch thru gates) and are far enough in synthesis flow that there are no more processes? is the netlist invalid?

17:31 <whitequark> https://twitter.com/trav_downs/status/1204424812539629569

17:31 <whitequark> this is *extremely* not obvious but i guess it's fair

17:31 <Sarayan> heh

17:31 <Sarayan> "obvious structure, geee"

17:32 <whitequark> mwk: i think logic loops are only invalid for assigns within the non-sync part of a process

17:32 <awygle> lol

17:32 <whitequark> i lost some context in that explanation

17:32 <mwk> ... ah

17:32 <sorear> by the time you start caring about log factors you often have to care about the whole "random-access machines don't exist" thing as well :/

17:32 <whitequark> awygle: re nmigen and latches: you literally cannot write any nmigen code that *infers* a latch

17:33 <whitequark> awygle: you *can* currently write some nmigen code that makes a latch out of gates using a logic loop, but this is considered a bug (at least the fact that it won't warn)

17:33 <Sarayan> .comb loop?

17:33 <whitequark> Sarayan: think of the textbook latch from NAND gates

17:34 <Sarayan> Sure, but you'd need a .comb loop, right?

17:34 <whitequark> yes

17:34 <whitequark> and that is invalid RTLIL

17:34 <Sarayan> which is something I tend to consider verboten in nmigen

17:34 <Sarayan> yeah

17:34 <whitequark> yes

17:34 <whitequark> it's certainly intended to be banned

17:35 <whitequark> i just never got around to properly forbidding it yet

17:35 <Sarayan> technically, verilator has it right... to simulate nmigen, not verilog :-)

17:35 <whitequark> yes

17:35 <whitequark> verilator is perfectly suited to simulating the kinds of designs nmigen is perfectly suited for

17:35 <whitequark> this is not a coincidence

17:35 <awygle> whitequark: what does `s = Signal(); with m.Switch(..): with m.Case(0): s.eq(1): with m.Case(1): pass` do?

17:36 <Sarayan> s.eq(1) is a bug

17:36 <whitequark> nope

17:36 <Sarayan> m.comb += s.eq(1) or m.sync += s.eq(1) ?

17:36 <awygle> ok pretend i have the required boilerplate lol

17:36 <whitequark> er

17:36 <whitequark> yes

17:36 <Sarayan> it's not boilerplate, it changes the behaviour massively

17:36 <whitequark> i will pretend you wrote `m.comb += s.eq(1)`

17:36 <awygle> m.d.comb += s.eq(1)

17:37 <whitequark> in that case, suppose `..` is some expression `expr`. your code is equivalent to `m.d.comb += s.eq(~expr)`

17:37 <Sarayan> then it's equivalent to m.d.comb += s.wq(~(..))

17:37 <whitequark> undriven comb signals just return to their reset value

17:37 <whitequark> which is what you will find as a recommendation in any verilog style guide

17:38 <Sarayan> (yay I'm really starting to understand base nmigen, weeee)

17:38 <awygle> ahhh ok

17:38 <awygle> i literally have `# don't infer a latch` as a comment on like six lines of code in this document

17:38 <awygle> so i should go investigate whether that's needed

17:38 <Sarayan> .sync otoh makes s a latched value

17:38 <whitequark> a... flopped value?

17:38 <awygle> well sure, it's flip-flopped because it's in a clock domain

17:39 <awygle> in verilog the comb version infers a latch, which is why i asked

17:39 <whitequark> awygle: bottom line: in nmigen i consider footguns reportable bugs

17:39 <whitequark> you know, like reportable incidents in some industries

17:39 <Sarayan> wq: call them UB and call it a day ;-)

17:39 <awygle> sure

17:39 <whitequark> Sarayan: hey, you invented synthesizable verilog!

17:40 <Sarayan> was it inspired by C?

17:40 <Sarayan> (probably was, too)

17:40 <awygle> i kinda want to argue that assigning a signal in one branch of a switch but not the other should be an error... but i won't

17:40 <whitequark> awygle: so, if you really want to have belt *and* suspenders, how about writing an nmigen linter?

17:41 <whitequark> the AST is close to fully stable, the IR is not quite there but will be soon

17:41 <whitequark> I can even show you some examples

17:42 <awygle> uhhhhhhhhhhhhhhhhhhh 1) i am definitely interested in that 2) i have already signed up for too much stuff

17:42 <whitequark> you really can go as pedantic about it as you want. prohibit x.eq(y) where len(x) != len(y) for example, easily

17:43 <whitequark> well, feel free to open an issue on the tracker, and i'll post a simple example linter there, maybe for the two cases we discussed here

17:44 <awygle> sure

17:44 <whitequark> there's a reason every damn AST node has precise source locations, and it is to enable downstream tooling

17:45 <whitequark> sorear: i like X more than Z, yes

17:45 <whitequark> well, it's a bit "i like eggs more than public transport"

17:45 <_whitenotifier-3> [nmigen] awygle opened issue #360: nMigen Linter - https://git.io/JfTnx

17:45 <awygle> i am not sure whether i like eggs or public transport more. in both cases i am pro in principle but lukewarm in practice.

17:46 <_whitenotifier-3> [nmigen] whitequark commented on issue #360: nMigen Linter - https://git.io/JfTnh

17:46 <whitequark> Z gives you... well, tristates

17:46 <awygle> also you have (sunny side up, seattle busses) on one side and (deviled, shinkansen) on the other lol

17:46 <Sarayan> I read that as "downstream trolling", make of that what you want

17:46 <whitequark> and the thing is that you only really ever have tristates in a very small area of the toplevel module

17:47 <whitequark> (nmigen, of course, always instantiates IOBs in toplevel module, since some toolchains break if you don't do that)

17:47 <whitequark> (and don't flatten)

17:47 <awygle> i could see Z being lifted to some other level, like an IBIS-type system sim

17:47 <whitequark> mhm

17:48 <awygle> on the other hand if you want nmigen to be usable for asics it feels like you'd need to handle z someday

17:48 <whitequark> IMO, for Z, it is sufficient to have some yosys pass that maps `inout x` to `input i_x, output o_x, output oe_x`

17:48 <whitequark> (assuming you want to simulate with the vendor libs at all; otherwise you can just stick the Pin in your ports=[] array)

17:49 <whitequark> regarding ASICs: this came up for negative level resets

17:49 <mwk> ... that pass is on my TODO list, by the way

17:49 <whitequark> the problem here is that we cannot *simply* add an option for active low resets because there's a bunch of code that's like `with m.If(ResetSignal()):` which will get broken immediately

17:49 <Sarayan> awygle: do asics do Z internally?

17:50 <awygle> :shrug: they certainly uh... could.

17:50 <mwk> Sarayan: they can, and back in the olden days they actually did

17:50 <whitequark> I asked an ASIC guy on the nmigen issue tracker what he thinks we should do, and the answer is basically "add an inverter, then expect the inverter to be folded away in synthesis"

17:50 <mwk> as in, I'm reasonably sure ~2000-era nvidia GPUs did

17:50 <sorear> that recently?

17:51 <whitequark> which is fair? the only thing nmigen can do here is to add this inverter itself, twice, but it's really hard and probably isn't any better

17:51 <mwk> think so, yes

17:51 <whitequark> you are probably being pretty careful about the way your ASIC is reset, anyways, not something you can fuck up easily

17:51 <Sarayan> Of course in nmos we're usually working with pullup and Z/0, but nmos is beyond obsolete at this point :-)

17:51 <mwk> also note that in the old days, *FPGAs* could do internal Z

17:51 <whitequark> there is actually a similar issue with ClockSignal

17:52 <mwk> as in, every xilinx FPGA from xc3000 up to Virtex 2 supported internal tristate buses

17:52 * sorear thought the old days for this purpose ended at roughly 1µm and 1990

17:52 <whitequark> in that the code you write does not necessarily know whether the flops it requests are posedge or negedge triggered

17:52 <whitequark> because of DomainRenamer

17:52 <Sarayan> I'm pretty sure I don't understand reset in nmigen yet. One day maybe :-)

17:52 <mwk> and Virtex 2 was 2001

17:52 <whitequark> Sarayan: it's just late binding. you know how environment in UNIX works?

17:53 <whitequark> Sarayan: imagine that each elaboratable is a single UNIX executable, and it does getenv("RESET_SIGNAL") each time you write ResetSignal(), and the toplevel (or some module in between) can set that variable for its children

17:53 <whitequark> does this analogy help?

17:53 <Sarayan> oh-kay

17:53 <Sarayan> well

17:54 <Sarayan> you know I'm playing "reimplement the schematics in nmigen"

17:54 <whitequark> sure

17:54 <Sarayan> whre the reset line is explicit, and resets some things and not others

17:54 <whitequark> how many reset lines do you have, compared to clock lines?

17:54 <Sarayan> one of each

17:54 <Sarayan> even in the real world

17:55 <whitequark> oh

17:55 <Sarayan> at the pin level of couse

17:55 <Sarayan> I have phi2 and... rs? mr? not sure, depends on the chip

17:55 <whitequark> so, if a register has reset, you write Signal(...), if it has not, Signal(..., reset_less=True)

17:55 <whitequark> that's it

17:55 <Sarayan> oh

17:56 <whitequark> reset_less=True literally disconnects it from this "implied" reset line

17:56 <Sarayan> but reset is synced on a specific phase

17:56 <whitequark> ooh!

17:56 <whitequark> can you explain more about that?

17:56 <Sarayan> sure

17:57 <tpw_rules> phi2? what are you reimplementing?

17:57 <Sarayan> reset is a signal like any other for the chip, and iirc it's latched on one of the three phases

17:57 <Sarayan> need to recheck though

17:57 <Sarayan> tpw: via6522

17:57 <tpw_rules> figures. i know that signal name

17:58 <Sarayan> One of the two otput clocks from the 6502 :-)

17:58 <Sarayan> output

17:58 <MadHacker> For a 6502 in nmigen or similar, why not just double the clock rate?

17:58 <Sarayan> MH: I triple the clock rate, but yes

17:58 <MadHacker> Or have a phase signal for different bits of the signal.

17:58 <MadHacker> Ew. Triple why??

17:59 <MadHacker> It's a two-phase clock.

17:59 <MadHacker> + explicitly has no overlap so it's not like you need to cover all four hypothetical combinations of the two bits.

17:59 <Sarayan> because the via, internally, generates three phases

17:59 <Sarayan> up edge, down edge, just after down edge

17:59 <Sarayan> analog ftw

17:59 <MadHacker> For sampling on the shift register bits?

17:59 <Sarayan> for everything

18:00 <MadHacker> You *sure*? I mean, I've got two of them within arms reach and I've been programming them for the last 35 years and never noticed. :D

18:00 <MadHacker> I'll trust you if you've checked. :)

18:00 <whitequark> Sarayan: so, reset is latched in one of the phases, I get that

18:00 <whitequark> when does it actually reset the registers?

18:00 <Sarayan> https://og.kervella.org/via6522/via6522/

18:01 <whitequark> instantly? when the corresponing clock occurs?

18:01 <Sarayan> instantly, I think

18:01 <whitequark> okay, so it's an async reset latched in one of the phases

18:01 <whitequark> fortunately, nmigen can represent that!

18:01 <whitequark> (migen couldn't, and i remember getting into an argument with its author about whether such a thing is desirable...)

18:02 <Sarayan> an no, misremembered, it's not even latched

18:02 <Sarayan> it's just full async

18:02 <whitequark> oh! that's super easy then

18:02 <whitequark> right now you have three clock domain right?

18:02 <Sarayan> no, one

18:02 <Sarayan> three phase signals

18:02 <whitequark> ah, and three phase signals

18:02 <Sarayan> yeah, I've learned my lesson :-)

18:03 <whitequark> create the domain with `ClockDomain(async_reset=True)`

18:03 <MadHacker> I haven't, I'm doing a 6502 in nmigen for fun and because my logic design's rusty af.

18:03 <whitequark> drive `cd.rst` as active high

18:03 <whitequark> that's it

18:05 <whitequark> awygle: sorear: back to X-prop

18:05 <whitequark> in Verilog, X does three totally unrelated things

18:06 <Sarayan> MH: The small mess of mosfets under the phi2 pad (top left mostly) generates a small pulse on neg edge (bottom) and the top part does ~phi2 & ~pulse, so it's active in ~phi2 but only *after* the pulse is done

18:06 <Sarayan> hence the third pahse

18:06 <whitequark> awygle: sorear: first, it means "uninitialized but determinate value". second, it means "indeterminate value", e.g. as a result of setup/hold violation. third, it means "don't care for synthesis", aka LLVM undef

18:06 <MadHacker> OK, but are you reimplementing from schematic, or reimplementing just the datasheet behaviour?

18:06 <Sarayan> From schematic

18:07 <MadHacker> OK, then fair enough indeed.

18:07 <Sarayan> shematic I've built from extracting the mosfets from a vectorization of die shots

18:07 <MadHacker> I would have done a 6522 from datasheet, but I understand where you're coming from now.

18:07 <Sarayan> there are four different and somewhat contradictory datasheets

18:08 <MadHacker> As I said, I have two 6522s within arms' reach. I'm painfully aware. :D

18:08 <whitequark> awygle: sorear: i like (1) because i don't have a choice. real hardware does not always let you initialize things. even if it does, resetting a domain with reset_less=True registers effectively sets them to X in general (though in certain specific circumstances they may be assumed to be set to their reset value, e.g. on FPGA after bitstream load)

18:08 <MadHacker> I learned to code on a KIM-1, and I've a BBC master sitting next to me here.

18:08 <MadHacker> Somewhat used to the 65xx chaos. :)

18:08 <Sarayan> I *think* my closest 6522 is in the cave, unless it's still at my mother's

18:09 <awygle> i accept 1 and like 3, and don't like 2

18:09 <whitequark> awygle: sorear: (2) is not applicable to either nmigen (which attempts, although for now not very insistently, to prevent setup/hold violations in first place) or cxxrtl (which uses zero delay model, though more on that later)

18:09 <MadHacker> The beeb has the joys of clock-stretching for the 6522s because the 6502s existed in faster speed-grades than the 6522s did at the time.

18:09 <MadHacker> So the clocking there I somewhat understand too.

18:09 <Sarayan> wq: there's reset and there's reset, you reset a 68000 it just won't touch the registers for instance

18:09 <awygle> although i suspect the tools are not really set up to take advantage of 3

18:10 <Sarayan> note that they're random at poweron

18:10 <whitequark> awygle: unfortunately they are, and this is a massive problem no one in HDL properly acknowledges IMO

18:10 <Sarayan> so well... :-)

18:11 <whitequark> awygle: sorear: so what happens there is that if you put 'x somewhere, it will behave like LLVM undef, meaning that constant propagating one 'x makes it into two 'x, which the optimizer can then assume have different values later in the pipeline

18:11 <Sarayan> but that means you don't want to clear them on simulated reset, but you don't want them indeterminate at startup on a fpga because indeterminate is bad

18:11 <whitequark> this means that you can make a design where a module enforces certain invariant, and this specific module can even be formally verified to do it

18:12 <whitequark> but then, in a bigger design, you feed it 'x... and if the synthesizer can statically prove that it has 'x as an input, it can, and often will, "optimize" your module into something that violates the invariant

18:12 <whitequark> e.g. if you had `s & ~s` in that module, and the synthesizer can prove that s=='x, then it can optimize `s & ~s` to `1`

18:12 <awygle> sure. that's a bit different than what i was talking about, which is "if the synthesizer can statically prove it has an 'x as an input it should throw an error"

18:13 <awygle> or... not quite that but closer to that than the other

18:13 <whitequark> as an input where?

18:13 <awygle> i don't think i actually want 'x anyway

18:13 <whitequark> the whole point of synthesis don't-cares is that they *do* propagate

18:14 <whitequark> the problem with them is that there is no way to tell the synthesizer "ok buddy. i know you love optimizing stuff so that it breaks. but. i know that this wire is ACTUALLY either 0 or 1 not both. so please give me something that will never be both"

18:14 <whitequark> in LLVM parlance this operation is called "freeze" and it stops const-propagation of 'x

18:14 <whitequark> i have proposed it for yosys and Claire has tentatively accepted it

18:14 <Sarayan> don't-care-but-defined

18:15 <whitequark> so you could freeze all of your inputs for example, and then your internal invariants will be maintained

18:15 <sorear> did freeze actually happen? I remember the extremely long thread proposing it

18:15 <whitequark> the problem is there is absolutely no way to (a) express this in synthesizable Verilog that Xilinx, Altera, etc understand

18:16 <whitequark> or (b) stop *other kinds of 'x* from becoming *this* kind of x because they're all conflated

18:16 <whitequark> for example, suppose the synthesizer statically proves you never write to BRAM and the BRAM is not initialized

18:16 <whitequark> then it will drag 'x out of that BRAM and fuck up your perfectly well defined design

18:17 <whitequark> in the sense that not only will the design not work, but it will also have violations of safety invariants

18:17 <Sarayan> otoh, the design not owrkign tends to be a hint

18:17 <Sarayan> working

18:18 <daveshah> Interestingly, I have had a few people report things like this as a supposed Yosys bug. Usually disconnected submodule inputs, which the vendor tools tend to assume as 1'b0 but Yosys treats as undefined, sometimes resulting in very different behaviour.

18:18 <whitequark> Sarayan: the real issue is when this happens in generic reusable code that you convince yourself works fine

18:18 <whitequark> or when two such pieces of code interact in unexpected ways

18:19 <whitequark> there are *lots* of subtle ways to statically get a 'x in verilog where you really don't intend to

18:20 <whitequark> awygle: X-propagation in case (1) is super handy because if you have a bench that's all lit up with red it means you fucked up some reset

18:20 <whitequark> and, moreover, it is required to be supported in nmigen for ASICs

18:20 <whitequark> since ASICs don't really have initializable BRAMs

18:21 <whitequark> so you *have* to have some way to say "uninitialized memory" and then you have to support it in sim, too, or you'll get sim/synth mismatch

18:21 <whitequark> the good thing about case (1) is that it requires no language support

18:22 <whitequark> only simulator support, and toolchain support to a degree

18:23 <whitequark> of course, without the $freeze cell, it is not actually safe, but it is impossible to fix it on nmigen side, even though it is a flaw literally all of the industry suffers from

18:23 <whitequark> it should be safe enough for real designs though

18:25 <whitequark> sorear: i think freeze is still not in llvm

18:25 <whitequark> Sarayan: nmigen will never initialize signals to indeterminate values on FPGA

18:25 <whitequark> it... can't even do that if it tried, I think

18:26 <whitequark> since most FPGAs have a global post-configuration reset for all flops

18:26 <whitequark> so if you model 68k registers as reset_less, then it will work as you expect

18:27 <MadHacker> Need a chaos monkey on reset option. :D

18:27 <daveshah> Oh, I think I know an obscure edge case where FPGA flops have an indeterminate state

18:27 <whitequark> MadHacker: i have had this request for nmigen pysim, actually

18:28 <daveshah> If you use partial reconfiguration on an ECP5, the PUR isn't activated (which is good for many things) but as a result any flops have the value of the flop previously placed at that location, if any

18:28 <whitequark> and i imagine you could do it for a real FPGA by permuting post-place flop init values with a tcl script or something

18:28 <whitequark> daveshah: I was actually expecting this

18:28 <sorear> so you transform the design to add a "soft" scan chain, then connect it to a random bit generator, ,

18:28 <whitequark> but the key part is that nmigen can't really decide to do that on its own

18:29 <daveshah> Yeah

18:31 ____2 has quit [Quit: Nettalk6 - www.ntalk.de]

18:40 <MadHacker> If we can pass it through to nextpnr then it can generate random values based off the seed. :)

19:01 <Sarayan> wq: Very nice

19:01 <whitequark> Sarayan: which part?

19:01 <Sarayan> So it's pretty much reset_less by default, unless there's an explicit reset signal in there

19:01 <Sarayan> (when translating shcematics that is)

19:01 <whitequark> yes

20:08 FFY00 has quit [Remote host closed the connection]

20:09 FFY00 has joined #nmigen

20:17 _whitenotifier-3 has quit [Ping timeout: 260 seconds]

21:10 thinknok has quit [Ping timeout: 272 seconds]

21:16 Asu has quit [Quit: Konversation terminated!]

21:20 FFY00 has quit [Read error: Connection reset by peer]

21:21 FFY00 has joined #nmigen

21:27 chipmuenk has quit [Quit: chipmuenk]

22:34 <ktemkin> whitequark: are we okay to start using the nextpnr --12k option that was added a month or so ago, in master?

22:35 <ktemkin> (for ECP5, sorry)

22:35 <ktemkin> I'd love to make a change like this: https://github.com/ktemkin/nmigen/commit/902a56ff498223aabd7e44eadc30689cefaa31a1 , but I don't know if breaking compat with earlier nextpnr versions is an issue

22:36 <ktemkin> [--12k was added here: https://github.com/YosysHQ/nextpnr/commit/3b49c20f4345f05bb92e6fc0a8dfa4c87c9cfa46]

22:40 <awygle> ktemkin: the current "treat it like a 25F" doesn't actually work (can't program) so I can't imagine there's a compat hazard. I'm not wq tho

22:40 <ktemkin> it does if you add an IDCODE overrride

22:41 <ktemkin> so there /are/ valid platform files that could contain 12F and --idcode{}; those'd be broken on older nextpnrs

22:42 <cr1901_modern> 25F and idcode override doesn't continue to work for back compat?

22:43 <ktemkin> cr1901_modern: it does; but every platform needs to specify the override itself

22:43 <ktemkin> nmigen doesn't automatically add it on getting device = "<...>-12F"

22:44 <ktemkin> I'd expect user expectations going forward to be that they can set "LFE5U-12F" and it'd just work, since nextpnr now does

22:45 <ktemkin> (an option that'd both compat friendly and user-friendly might be to set the override ourselves -and- set 25F; but that feels a bit like carrying around legacy cruft)

22:45 <cr1901_modern> I don't see the issue w/ keeping 25F and override for existing platforms, but for new platforms use 12F

22:45 <cr1901_modern> but admittedly I'm not following this closely :P

22:46 <whitequark> ktemkin: sometimes you have to break the code to fix it

22:46 <whitequark> this case is unambiguously "make the change"

22:50 <awygle> woo

22:51 <awygle> score one for the future

22:58 <daveshah> If you are making the change you can also remove the um and um5g 12k parts from the list as they don't exist

23:53 <ktemkin> I though those looked off

23:54 _whitenotifier-9 has joined #nmigen

23:54 <_whitenotifier-9> [nmigen] ktemkin opened pull request #361: vendor: use nextpnr -12k for -12F devices; remove theoretical devices - https://git.io/JfTav

23:54 <_whitenotifier-9> [nmigen] ktemkin edited pull request #361: vendor: use nextpnr -12k for -12F devices; remove theoretical devices - https://git.io/JfTav

23:55 <ktemkin> daveshah: added to the (now open) PR :)