#nmigen on 2020-03-23 — irc logs at freenode.irclog.whitequark.org

2020-01-27 18:31 ChanServ changed the topic of #nmigen to: nMigen hardware description language · code at https://github.com/nmigen · logs at https://freenode.irclog.whitequark.org/nmigen

00:51 proteus-guy has quit [Ping timeout: 250 seconds]

01:08 <_whitenotifier-3> [nmigen-boards] WRansohoff synchronize pull request #57: Add a board file for Gnarly Grey's iCE40UP5K 'Upduino' board - https://git.io/JvMIc

01:13 <_whitenotifier-3> [nmigen-boards] WRansohoff commented on pull request #57: Add a board file for Gnarly Grey's iCE40UP5K 'Upduino' board - https://git.io/JvDQb

01:14 <_whitenotifier-3> [nmigen-boards] whitequark commented on pull request #57: Add a board file for Gnarly Grey's iCE40UP5K 'Upduino' board - https://git.io/JvDQh

01:14 <_whitenotifier-3> [nmigen-boards] whitequark closed pull request #57: Add a board file for Gnarly Grey's iCE40UP5K 'Upduino' board - https://git.io/JvMIc

01:14 <_whitenotifier-3> [nmigen/nmigen-boards] whitequark pushed 1 commit to master [+2/-0/±0] https://git.io/JvDQj

01:14 <_whitenotifier-3> [nmigen/nmigen-boards] WRansohoff 18315d8 - Add Upduino v1/v2.

02:12 Degi has quit [Ping timeout: 246 seconds]

02:14 Degi has joined #nmigen

02:58 Vinalon has quit [Quit: Leaving]

04:30 _whitelogger has joined #nmigen

07:24 q3k has quit [Ping timeout: 240 seconds]

07:27 q3k has joined #nmigen

08:24 Asu has joined #nmigen

13:46 <_whitenotifier-3> [nmigen] ZirconiumX opened issue #339: Add rotate left/right - https://git.io/Jvym8

13:50 <_whitenotifier-3> [nmigen] whitequark commented on issue #339: Add rotate left/right - https://git.io/Jvymz

13:52 <_whitenotifier-3> [nmigen] ZirconiumX commented on issue #339: Add rotate left/right - https://git.io/JvymV

13:53 <_whitenotifier-3> [nmigen] whitequark commented on issue #339: Add rotate left/right - https://git.io/Jvym6

13:58 <_whitenotifier-3> [nmigen] ZirconiumX commented on issue #339: Add rotate left/right - https://git.io/JvymH

14:05 <_whitenotifier-3> [nmigen] whitequark commented on issue #339: Add rotate left/right - https://git.io/JvyY2

14:05 <_whitenotifier-3> [nmigen] whitequark edited a comment on issue #339: Add rotate left/right - https://git.io/JvyY2

14:05 <_whitenotifier-3> [nmigen] whitequark edited issue #339: Add rotate left/right by constant amount - https://git.io/Jvym8

14:28 <_whitenotifier-3> [nmigen-soc] jfng opened pull request #11: csr.periph: add Peripheral base class. - https://git.io/JvyOn

14:29 <_whitenotifier-3> [nmigen-soc] Failure. 98.17% (+-1.47%) compared to 967a65f - https://codecov.io/gh/nmigen/nmigen-soc/compare/967a65f7e0a4648a4b2ffcf59f8f1a215cb84078...ecc943b85c112ae663424aea7256fded8e5626f7

14:29 <_whitenotifier-3> [nmigen-soc] Failure. 90.38% of diff hit (target 99.63%) - https://codecov.io/gh/nmigen/nmigen-soc/compare/967a65f7e0a4648a4b2ffcf59f8f1a215cb84078...ecc943b85c112ae663424aea7256fded8e5626f7

14:29 <_whitenotifier-3> [nmigen-soc] codecov[bot] commented on pull request #11: csr.periph: add Peripheral base class. - https://git.io/JvyOW

15:07 <_whitenotifier-3> [nmigen-soc] whitequark commented on pull request #11: csr.periph: add Peripheral base class. - https://git.io/Jvysm

15:16 <_whitenotifier-3> [nmigen-soc] jfng synchronize pull request #11: csr.periph: add Peripheral base class. - https://git.io/JvyOn

15:16 <_whitenotifier-3> [nmigen-soc] codecov[bot] edited a comment on pull request #11: csr.periph: add Peripheral base class. - https://git.io/JvyOW

15:16 <_whitenotifier-3> [nmigen-soc] Failure. 98.17% (-1.47%) compared to 967a65f - https://codecov.io/gh/nmigen/nmigen-soc/compare/967a65f7e0a4648a4b2ffcf59f8f1a215cb84078...ecc943b85c112ae663424aea7256fded8e5626f7

15:17 <_whitenotifier-3> [nmigen-soc] Success. 99.69% (+0.05%) compared to 967a65f - https://codecov.io/gh/nmigen/nmigen-soc/compare/967a65f7e0a4648a4b2ffcf59f8f1a215cb84078...b9ffd36dfcfca11ebd0d983570796c705ca700b3

15:17 <_whitenotifier-3> [nmigen-soc] Success. 100.00% of diff hit (target 99.63%) - https://codecov.io/gh/nmigen/nmigen-soc/compare/967a65f7e0a4648a4b2ffcf59f8f1a215cb84078...b9ffd36dfcfca11ebd0d983570796c705ca700b3

15:17 <_whitenotifier-3> [nmigen-soc] codecov[bot] edited a comment on pull request #11: csr.periph: add Peripheral base class. - https://git.io/JvyOW

15:56 <_whitenotifier-3> [nmigen-soc] jfng commented on pull request #11: csr.periph: add Peripheral base class. - https://git.io/JvyGo

16:37 SingularitySurf has joined #nmigen

16:38 proteus-guy has joined #nmigen

16:40 <SingularitySurf> Hi, sorry I'm new to nMigen and Python as well actually.. '=D What's the best way of saying a= b? c : d in nMigen?

16:40 <whitequark> a.eq(Mux(b, c, d))

16:41 <SingularitySurf> ah tanks! didn't know about the Mux

16:41 <SingularitySurf> :)

17:07 <_whitenotifier-3> [nmigen-soc] awygle commented on pull request #11: csr.periph: add Peripheral base class. - https://git.io/Jvycv

17:16 <_whitenotifier-3> [nmigen-soc] whitequark commented on pull request #11: csr.periph: add Peripheral base class. - https://git.io/Jvycc

17:17 <ZirconiumX> whitequark: Been thinking about how best to give temporary expressions clearer names

17:17 <ZirconiumX> Because they crop up a lot in my codebase

17:18 <ZirconiumX> And the `(* src = "..." *)` attributes aren't helpful in that situation

17:18 <whitequark> in verilog?

17:18 <ZirconiumX> Yeah

17:18 <whitequark> you could get write_verilog -inline to work

17:19 <whitequark> and I believe that's the way forward

17:19 <whitequark> unfortunately, verilog is awful garbage, and I largely gave up on that Yosys PR

17:19 <ZirconiumX> I was actually wondering about _Namer in back.rtlil, but that works too

17:20 <ZirconiumX> Do you think SEDA would accept something in a more gradual approach rather than a giant PR like this?

17:20 <whitequark> it's not that big

17:20 <whitequark> the problem is that you have to thread the implicit width through the entire expression printer anyways

17:20 <whitequark> so you could shrink the PR but probably not that much

17:21 <ZirconiumX> I'm also a bit scared of not knowing the rules of Verilog nearly as well as you :P

17:21 <whitequark> the other problem is that Claire (rightfully) insists that the PR be tested for several days under a fuzzer

17:21 <whitequark> so if it's many small PRs, that cost compounds

17:21 <whitequark> I can procure something with 200-odd cores for that though

17:23 <ZirconiumX> Actually, this reminds me, what happened to proc_match?

17:31 <whitequark> I never got it to generate muxes

17:31 <whitequark> there was a conceptual stumbling block and then I got too sick to be productive

17:35 <ZirconiumX> Ah, I see. I've at least rebased the branch to latest Yosys master

17:43 <ZirconiumX> And after building it and regenerating the Verilog...nothing changes aside from the Yosys sha1.

17:43 <ZirconiumX> Am I missing something here?

17:53 <whitequark> hmm

17:53 <whitequark> it should be on by default

17:53 <ZirconiumX> https://github.com/ZirconiumX/yosys/commit/2d51984de5589d5eef59321e9bdd16458ddf15c6

17:53 <whitequark> are you sure the yosys binary nmigen runs is the right one?

17:54 <ZirconiumX> Yes, the sha1 of the nmigen generated header matches up with that commit

17:54 <ZirconiumX> /* Generated by Yosys 0.9+1706 (git sha1 2d51984d, clang 9.0.0-2 -fPIC -Os) */

17:58 <ZirconiumX> My input code here is the combinational chess move generator

17:59 <ZirconiumX> It's at the point where nextpnr-ecp5 can't route it anymore

17:59 <ZirconiumX> Despite using ~20% of an UM-45F

18:01 Vinalon has joined #nmigen

18:17 <Degi> You can try changing the router

18:18 <ZirconiumX> I did, and ended up getting a router2 bugfix ported over to mainline nextpnr

18:18 <Degi> In nMigen you can add to the build() nextpnr_opts="--placer sa -r" --router router1/2 to randomize the seed and use placer "sa" and

18:18 <Degi> Oh

18:19 <ZirconiumX> Also I recommend not using `--placer sa`

18:20 <ZirconiumX> HeAP is the default because it's as good as SA while being much faster

18:34 <ZirconiumX> Okay, so I dropped some log_asserts in can_inline_cell_expr and it seems to always be returning false

18:55 <ZirconiumX> D'oh, I think I know the problem here

18:56 <ZirconiumX> nMigen is decorating everything with `(* src *)` attributes which inhibit inlining

19:24 <ZirconiumX> ...Oddly it's now inlining the *right hand side* of an expression...

19:24 <ZirconiumX> ...

19:24 <ZirconiumX> Left hand side

19:24 <ZirconiumX> assign (((((((((((~ i_pbq) & i_nbk)) & (~ i_rqk))) >>> 4'ha)) & 62'h3f3f3f3f3f3f3f3f)) & target_mask)) = (((((((((((~ i_pbq) & i_nbk)) & (~ i_rqk))) >>> 4'ha)) & 62'h3f3f3f3f3f3f3f3f)) & target_mask));

19:29 <cr1901_modern> that's a legal verilog expr ._.?

19:32 <ZirconiumX> No, Verilog isn't quite mad enough to let you do this shit on the left hand side of an assignment

19:32 <ZirconiumX> However, isn't this a tautology?

20:54 SingularitySurf has quit [Remote host closed the connection]

21:37 Sarayan has joined #nmigen

21:46 <Vinalon> Does anyone have a feel for what the overhead is like with AsyncFIFO objects? Like, if I want to use one to nest CPU contexts, would it be better to have a separate FIFO for each register or one very wide FIFO to store all of the values?

21:47 <Sarayan> Is there a way to say "tick until that signal is 1" in a python sim?

21:49 <ZirconiumX> Vinalon: The FIFOs are built out of Memory cells internally: small Memory cells can be turned into flops, but giant Memory cells will be built out of block RAMs

21:50 <Vinalon> You can use 'yield <signal>' to get a value in a simulation; maybe something like: https://bpaste.net/H7VA

21:51 <Vinalon> so, smaller widths are probably easier for the tools to optimize? Okay, thanks

21:51 <Sarayan> Vinalon: Can you have that in a sub-function? I remember the sim not liking to yield in sub-functions, but I may be wrong

21:52 <Vinalon> I'm far from an expert, but I think it should work if you call the function with 'yield from funct(...)'.

21:53 <Sarayan> errr ok

21:53 <ZirconiumX> Yeah, you need to `yield from` subfunctions

21:53 <Sarayan> thanks. Bedtime, I guess I'll try tomorrow

21:54 <Vinalon> Good luck!

21:58 <awygle> i'd guess "lots of small FIFOs" ends up bigger than "one very wide FIFO", depending on a number of factors including whether the synthesis tool can merge flop RAM into dist RAM into block RAM intelligently

22:06 <Vinalon> huh - so would a 'smarter' synthesis tool be more likely to do a good job of handling a bunch of smaller ones, or vice-versa? I usually like to lean towards trusting the compiler (or synthesizer) since low-level tools are always improving.

22:07 <daveshah> In general combined is going to be better, as that way no information is lost

22:10 <Vinalon> so it'd be okay to have a FIFO that's around 1024-2048 bits wide?

22:10 <ZirconiumX> Widths like that scare me :P

22:10 <Vinalon> that's what I was thinking, but it would be nice...

22:13 <daveshah> I can't see any particular problem with that

22:15 <Vinalon> Cool, I'll give it a try - thanks for the information!

22:18 <daveshah> Report back any bugs!

22:19 <daveshah> I'm presuming this isn't on the LP383 btw

22:21 <Vinalon> No, I'm hoping to start with an UP5K but I might have to move up to an ECP5...

22:21 <daveshah> I think this is going to need an ECP5

22:22 <Vinalon> I'm trying to implement a simple RISC-V CPU and this is how I'm planning to store CPU registers for trap handlers.

22:22 <daveshah> Is that something that RISC-V even needs?

22:23 <ZirconiumX> Yeah, you can spill the registers to stack if you need to

22:23 <Vinalon> Not that I'm aware of, but I really like how ARM Cortex-M cores do all of that in hardware

22:23 <ZirconiumX> Actually there's a privileged register for that

22:23 <daveshah> That's going to be a very inefficient way to do it

22:24 <daveshah> It would force the register file to be implemented using FFs rather than BRAM too

22:24 <ZirconiumX> Actually a full spill presents a lot of issues

22:24 <daveshah> As there will be spare space at the end of BRAM for a typical register file implementation a shadow register approach would be much more efficient

22:25 <daveshah> But I'm not sure if RV actually needs this at all

22:25 <ZirconiumX> Like environment calls being completely unimplementable

22:25 <Vinalon> Oh...yeah, I'm sure that the whole CPU design is very inefficient, considering how little I know about the tooling and FPGA resources.

22:27 <Vinalon> I guess I'm barking up the wrong tree with using FIFOs like this, then, thanks.

22:27 <ZirconiumX> FIFOs - async FIFOs especially - are for clock-domain crossing

22:28 <Vinalon> yeah, my plan was to use different clock edges for reads and writes, but actually, this brings up another question I had about clock domains

22:28 <daveshah> The main problem with this approach is that it would need to access every register bit at once

22:29 <daveshah> Which forces a much less efficient implementation than using the dedicated RAM

22:29 <daveshah> Unless you really know what you are doing using different clock edges is going to cause more problems, and worse performance, than it solves

22:29 <Vinalon> oh; it's better to always use the same edge and wait more cycles?

22:30 <daveshah> In general, yes

22:30 <Vinalon> and here I thought I was being clever...oh well, thanks.

22:32 <Vinalon> Anyways, it sounds like if I want to store a couple of sets of 32 32-bit values, 'Memory' objects might be better than 'FIFO's?

22:32 <daveshah> The best bet would be to use one deeper memory and just control the upper address bits

22:33 Asu has quit [Remote host closed the connection]

22:35 <Vinalon> ohhhh, that would make a lot of sense - thanks! I'll have to stop using an Array of Signals for the main CPU registers, then...how's that for an inefficient design? :)

22:43 <ZirconiumX> Oh, yeah, that's gonna be awful

22:46 <Vinalon> but it still works better than a Python array of Signals with 'for' loops to generate long 'if/elif' blocks in the 'elaborate' method. Sometimes I'm a little slow on the uptake :P

22:56 <awygle> :) we're all learning

23:23 <Vinalon> The Python syntax makes these sorts of changes really easy, though. Since the '[]' operators work the same, I only needed to change how the objects were initialized.

23:24 <Vinalon> ("Array(Signal(x) for i in range(y))" -> "Memory(width=x, depth=y)")