#nmigen on 2020-05-19 — irc logs at freenode.irclog.whitequark.org

2020-01-27 18:31 ChanServ changed the topic of #nmigen to: nMigen hardware description language · code at https://github.com/nmigen · logs at https://freenode.irclog.whitequark.org/nmigen

00:15 cr1901_modern has joined #nmigen

00:24 <awygle> yes, i like the idea of attaching it to the platform

00:25 <awygle> it seems the logical place

00:25 <awygle> but we may run into limitations of course, we'll see

00:33 <tpw_rules> is there a way to make the verilog export add some kind of prefix to all the module names (except the top)? for various reasons i need to generate verilog for inclusion in larger projects and there can be name conflicts since all the exported modules have the names used in the python code

00:35 <whitequark> connect_rpc already does that

00:35 <whitequark> but other than that, i don't know of an easy way to do it

00:35 <tpw_rules> i don't know what connect_rpc means

00:36 <whitequark> oh

00:36 <whitequark> https://github.com/YosysHQ/yosys/pull/1406

00:37 <tpw_rules> i see. i don't think that would work for my application

00:37 <whitequark> wait, why not?

00:37 <whitequark> lots of toplevel python code?

00:37 <tpw_rules> no, it needs to spit out a verilog file that gets dumped into another fpga project

00:39 <whitequark> sure

00:39 <whitequark> you can just do write_verilog after yosys is done importing

00:39 <whitequark> like it already happens in nmigen anyways

00:41 <tpw_rules> i'm confused. i have a large fpga project which is not mine and to which i want to contribute a module. if any of my submodules are called the name of a module already in that project, it won't work. so i'd like to prefix all my submodule names with something that uniquifies them with respect to the rest of the project.

00:49 <whitequark> yes

00:50 <whitequark> hang on, i'll approach this a bit differently

00:51 <whitequark> so you know how nmigen currently outputs verilog? it emits rtlil, imports it via read_ilang, then exports via write_verilog

00:52 <whitequark> what i'm suggesting is that you could have nmigen emit rtlil, import it via connect_rpc, then export via write_verilog. as a side effect of how connect_rpc works, this will add prefixes to the modules, exactly like you want

00:52 <whitequark> without any changes to nmigen or yosys or anything else

00:52 <whitequark> it's just that i wrote this exact code for connect_rpc, but it's not accessible in any other way.

00:52 <whitequark> maybe it should become a separate pass
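
For comparison, a cruder text-level workaround can be sketched in Python: rename every module except the top directly in the exported Verilog. This is not how connect_rpc works and it is not an nMigen or Yosys feature. `prefix_modules`, the `mylib_` prefix and the sample Verilog are hypothetical, and the regexes only cover plain identifiers and simple `name inst (` instantiations (no parameter overrides, no escaped identifiers).

```python
import re


def prefix_modules(verilog: str, prefix: str, top: str) -> str:
    """Prefix every module name except `top` in a Verilog source string."""
    # Collect the names of all modules declared in the file.
    names = set(re.findall(r"^\s*module\s+([A-Za-z_]\w*)", verilog, flags=re.M))
    names.discard(top)
    if not names:
        return verilog
    # Longest names first so a short name never shadows a longer one.
    alt = "|".join(sorted(map(re.escape, names), key=len, reverse=True))
    # Declarations: "module NAME"
    decl = re.compile(rf"^(\s*module\s+)({alt})\b", flags=re.M)
    # Instantiations: "NAME instance_name (" at the start of a line
    inst = re.compile(rf"^(\s*)({alt})(\s+[A-Za-z_]\w*\s*\()", flags=re.M)
    out = decl.sub(lambda m: m.group(1) + prefix + m.group(2), verilog)
    out = inst.sub(lambda m: m.group(1) + prefix + m.group(2) + m.group(3), out)
    return out


if __name__ == "__main__":
    # Hypothetical two-module design used only for illustration.
    example = (
        "module top(input a, output y);\n"
        "  alu u0 (.a(a), .y(y));\n"
        "endmodule\n"
        "module alu(input a, output y);\n"
        "  assign y = a;\n"
        "endmodule\n"
    )
    print(prefix_modules(example, "mylib_", "top"))
```

A netlist-level rename, as described in the channel, avoids the parsing pitfalls this text rewrite has.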

00:58 <whitequark> tpw_rules: does that help?

01:39 <tpw_rules> oh i see, i thought it had to communicate with the rest of the synthesis chain or something. i'll try that out if it becomes a significant problem

01:39 <tpw_rules> i was out walking the doggers

01:42 <whitequark> np

02:07 <whitequark> awygle (and everyone else): https://people.eecs.berkeley.edu/~krste/papers/donggyu-phd-2019.pdf

02:24 winocm has joined #nmigen

02:25 winocm has quit [Client Quit]

02:25 winocm has joined #nmigen

02:28 guan has joined #nmigen

03:16 Degi has quit [Ping timeout: 265 seconds]

03:17 Degi has joined #nmigen

03:24 zkms has joined #nmigen

03:24 <zkms> hi

04:06 <whitequark> hi!

04:20 <awygle> hello

04:46 <bubble_buster> Fun to see people use Twitter to coordinate their irc activity :D

05:36 Guest30583 has joined #nmigen

05:48 _whitelogger has joined #nmigen

05:48 chipmuenk has joined #nmigen

05:58 peepsalot has joined #nmigen

06:38 thinknok has joined #nmigen

07:06 <MadHacker> o/

07:15 <whitequark> awygle: so about the paper i just linked (which zkms discovered)

07:15 <whitequark> do you need an ILA specifically? or do you want introspectability in general?

07:16 <daveshah> Eddie has done some work on introspectability/debug too

07:16 <whitequark> and if the latter, can it be invasive? because we can implement adding scan chains in yosys

07:16 <whitequark> the main thing that's missing there is mapping of registers back to original wires

07:17 <daveshah> The big problem with that is can it run fast enough and how much complexity does it add

07:17 <daveshah> e.g. if you want to capture raw data coming out of a serdes at full rate then an ILA with a buffer is needed

07:17 <whitequark> sure

07:17 <whitequark> but for that application you probably aren't going to use microscope-like ILA

07:17 <daveshah> If you are on Xilinx you don't even need a scan chain

07:17 <daveshah> You can just use readback

07:17 <whitequark> well

07:17 <whitequark> that ties you to vendor tools hard

07:18 <daveshah> Not if someone implements that separately

07:18 <whitequark> and prevents you from using generic nmigen code that can map readback back to signals

07:18 <daveshah> Yes

07:19 <daveshah> fwiw, I have used litescope to look at DDR3 transactions before (just before the gearboxes)

07:19 <daveshah> So there are definitely use cases where something faster than a scan chain is needed

07:21 <MadHacker> Don't you end up getting dangerously close to something like the openbench logic sniffer if you're trying to extract that much info? You're going to need to buffer like crazy.

07:21 <MadHacker> + may as well have triggering conditions too at that point.

07:22 <whitequark> MadHacker: depends on what you're doing

07:22 <daveshah> A buffer of 32 cycles was enough for this case

07:22 <daveshah> Then read it out at your leisure
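
The "small buffer, read out later" ILA idea can be sketched as a behavioural Python model. This is only a software model under assumptions, not litescope or any nMigen code: `TinyILA`, its `clock` method and the trigger handling are hypothetical names chosen for illustration. The depth of 32 comes from the buffer size mentioned above.

```python
class TinyILA:
    """Behavioural model: capture `depth` samples starting at the trigger."""

    def __init__(self, depth=32):
        self.depth = depth
        self.samples = []   # captured data, oldest first
        self.armed = True   # still waiting for the trigger

    def clock(self, value, trigger):
        """Call once per (modelled) clock cycle with the probed value."""
        if self.armed and trigger:
            self.armed = False
        # After the trigger, record at full rate until the buffer is full.
        if not self.armed and len(self.samples) < self.depth:
            self.samples.append(value)

    @property
    def done(self):
        return len(self.samples) == self.depth


if __name__ == "__main__":
    ila = TinyILA(depth=4)
    for cycle in range(10):
        ila.clock(value=cycle, trigger=(cycle == 3))
    print(ila.samples, ila.done)
```

Once `done` is set the buffer no longer changes, so it can be drained over a slow link.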

07:22 <whitequark> if you trigger once per second it's pretty easy

07:22 <whitequark> if you trigger at kHz you probably need an ILA

07:22 <whitequark> there is much value in using a number of approaches

07:22 <whitequark> for example scan chains don't really work for non fully static designs

07:23 <whitequark> hm

07:23 <whitequark> wait, no, i'm wrong

07:23 <whitequark> scan chains don't work for them if you reuse the register bits for the chain (or use readback?)

07:23 <whitequark> (not sure about readback, does xilinx have shadow registers?)

07:24 <MadHacker> Unless what you're scanning is just a buffer. Snapshot state from read signals into shift register, or even a stack of shift registers?

07:24 <daveshah> mwk: ^

07:24 <daveshah> Questions about xilinx readback

07:24 <whitequark> MadHacker: yeah you can definitely buffer the scan chain

07:24 <whitequark> and really nice thinking on using a multilevel shift register

07:24 pdp7 has quit [Ping timeout: 252 seconds]

07:25 <daveshah> That works well on Xilinx with hard shift registers (using LUTs as them)

07:25 <MadHacker> So, trigger (or system clock) clocks snapshot of state into shift reg and push of stack of shift regs, read out at your leisure?

07:26 <whitequark> yep
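
A minimal Python model of the snapshot-then-shift idea, assuming a single shadow shift register that is loaded in parallel on a trigger and then shifted out one bit per clock. `SnapshotScanChain` and its methods are hypothetical names; this models the behaviour only and says nothing about how it would be implemented in Yosys or nMigen.

```python
class SnapshotScanChain:
    """Shadow shift register: parallel capture, serial read-out."""

    def __init__(self, width):
        self.width = width
        self.shadow = [0] * width

    def capture(self, state_bits):
        """Trigger: copy the design's register bits into the shadow chain."""
        if len(state_bits) != self.width:
            raise ValueError("state width does not match chain width")
        self.shadow = list(state_bits)

    def shift(self, scan_in=0):
        """One scan clock: emit the last bit, move everything along by one."""
        scan_out = self.shadow[-1]
        self.shadow = [scan_in] + self.shadow[:-1]
        return scan_out

    def read_out(self):
        """Shift out the whole snapshot; the last chain bit comes out first."""
        return [self.shift() for _ in range(self.width)]


if __name__ == "__main__":
    chain = SnapshotScanChain(width=4)
    chain.capture([1, 0, 1, 1])
    print(chain.read_out())
```

Because the design's own registers are only copied, never reused, the design can keep running while the snapshot is read out.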

07:26 <whitequark> the more i think about this design the more i like it

07:26 <whitequark> on xilinx you could actually capture "last n states"

07:26 guan has quit [Ping timeout: 252 seconds]

07:26 <whitequark> last... 32?

07:26 <daveshah> It's mostly the extra size and routing that worries me

07:26 <daveshah> Yes

07:26 <daveshah> Or 16 if you use the smaller SRLs

07:26 <whitequark> that's quite a lot

07:26 <daveshah> Indeed
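
Capturing the "last n states" can be modelled in Python as a fixed-depth history per probed word, which is roughly what a bank of SRLs would hold. The depths 32 and 16 are taken from the discussion; `StateHistory` and `tap` are hypothetical names, and the model ignores clock enables and addressing details of real SRL primitives.

```python
from collections import deque


class StateHistory:
    """Keeps the most recent `depth` values of a probed state word."""

    def __init__(self, depth=32):
        # deque(maxlen=...) silently drops the oldest entry when full.
        self.taps = deque(maxlen=depth)

    def clock(self, state_word):
        """One design clock: push the current state in at tap 0."""
        self.taps.appendleft(state_word)

    def tap(self, cycles_ago):
        """Read the state from `cycles_ago` clocks back (0 = most recent)."""
        return self.taps[cycles_ago]


if __name__ == "__main__":
    history = StateHistory(depth=4)
    for value in range(6):
        history.clock(value)
    print(list(history.taps))
```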

07:26 <whitequark> for routing, hm, you could cascade the registers in hard routing, right?

07:27 pdp7 has joined #nmigen

07:27 <whitequark> so a smart enough pnr could probably lay out the scan chain adjacent to actual registers

07:27 <daveshah> You can't load them then

07:27 <whitequark> ah

07:27 <whitequark> right ok, so needs to be tested. still it seems very promising to me

07:27 <daveshah> The other problem is that you really want to be doing the scan chain ordering in PnR

07:27 <daveshah> When you know how everything is laid out

07:28 <daveshah> Yes, definitely for many cases it seems like a good approach

07:28 <MadHacker> OK, but the chain is always going to affect PNR anyway, since you're going to tie up routing resources at a minimum.

07:28 <MadHacker> There's no point pretending you can just add it in afterwards.

07:28 guan has joined #nmigen

07:28 <whitequark> so what i'm thinking here is we need some sort of solution for mapping bits of registers at yosys input to bits of registers at pnr output

07:28 <whitequark> cxxrtl needs this too

07:29 <whitequark> this will also allow yosys to merge or remove registers
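
The register-bit mapping can be pictured as a table from scan-chain position to (signal name, bit index). The sketch below assumes such a table already exists; producing it from Yosys and PnR is exactly the missing piece discussed here. `decode_scan` and the example signal names are hypothetical.

```python
def decode_scan(bits, mapping):
    """Rebuild source-level signal values from raw scan-chain bits.

    bits:    list of 0/1 values, indexed by scan-chain position
    mapping: {chain_position: (signal_name, bit_index)}
    """
    values = {}
    for position, (name, bit_index) in mapping.items():
        values[name] = values.get(name, 0) | (bits[position] << bit_index)
    return values


if __name__ == "__main__":
    # Hypothetical mapping: the chain order need not follow the source order,
    # which is why a table is needed once PnR has reordered the chain.
    mapping = {
        0: ("counter", 1),
        1: ("state", 0),
        2: ("counter", 0),
    }
    print(decode_scan([1, 1, 0], mapping))
```

Registers that were merged or removed during synthesis would simply have no entry, or share one, in such a table.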

07:29 <daveshah> MadHacker: yeah but if you connect it up after placement then at least you avoid it having to go all over the place, you can reorder it into a neat line

07:30 <daveshah> Anyway, that is a nice extra at some point

07:31 <daveshah> Yeah, mapping register bits would be useful for anything readback based too

07:31 <MadHacker> OK, but again that's still going to affect the original placement. If a chunk of logic is in a region that's tight on long-range routing then suddenly it'll shift when you add in the extra wires. I get your general point that it'd be better to allow it to reorder arbitrarily, but sometimes you've just got to accept that it's a little invasive.

07:31 <daveshah> yes definitely

07:31 <whitequark> MadHacker: there's going to be at least a small impact

07:31 <whitequark> and hopefully a small impact

07:32 <whitequark> it's not the right solution for designs that push the FPGA to limits, but many don't

07:32 <whitequark> and it only really impacts you routing-wise, not critical path wise

07:32 <daveshah> It might result in a more spaced out placement but shouldn't be too big a timing impact

07:32 <whitequark> yep

07:32 <MadHacker> It's like any scope probe, there's no getting away from the fact that a fast probe is going to dump a 20k load onto your signal. The equivalent applies.

07:32 <whitequark> yep

07:33 <daveshah> It would also be possible to do a combinational simulation to recover all combinational signals too

07:33 <whitequark> yep.

07:33 <whitequark> have you seen the pdf i linked earlier?

07:33 <daveshah> Not fully yet no

07:33 <whitequark> they're using some sort of C++ backend, which i hope can be cxxrtl in our case as it already exists

07:33 <whitequark> well, ok, not quite as it exists, since it needs mapping too

07:33 <daveshah> Nice

07:34 <whitequark> here's something fun i had in mind for cxxrtl

07:34 <whitequark> using high -O levels removes the calculations for internal signals, right? that's the point

07:34 <whitequark> but you still want them in VCD

07:35 <whitequark> so i thought i'd generate "debug info" inspired by DWARF that contains all the elided calculations over, using the state bits

07:35 <whitequark> meaning you can get 100% visibility *and* 100% performance with no recompilation of the model

07:35 <whitequark> then you could just use the same thing but initialize it with a scan chain

07:35 <daveshah> Yep
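
Recovering combinational signals from captured state bits can be sketched in Python as re-evaluating each combinational expression over the register values. This is an illustration only, not cxxrtl's debug-info format (which did not exist at the time of this log): `recover_signals`, the signal names and the expressions are hypothetical, and the expressions are assumed to be listed in dependency order.

```python
def recover_signals(state, comb):
    """Recompute combinational signals from register state.

    state: {register_name: value} as read from a scan chain or a simulation
    comb:  list of (signal_name, function) in dependency order; each function
           takes the dict of all values known so far
    """
    values = dict(state)
    for name, expr in comb:
        values[name] = expr(values)
    return values


if __name__ == "__main__":
    # Hypothetical design: a counter compared against a limit.
    state = {"count": 5, "limit": 5, "enable": 1}
    comb = [
        ("at_limit", lambda v: int(v["count"] == v["limit"])),
        ("wrap", lambda v: v["at_limit"] & v["enable"]),
    ]
    print(recover_signals(state, comb))
```

The same function works whether `state` comes from an optimized simulation or from hardware, which is the reuse suggested above.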

07:43 <daveshah> FWIW, this is some of Eddie's observability work https://dl.acm.org/doi/10.1145/2435264.2435272

07:44 <daveshah> This uses partial reconfiguration to connect a subset of signals to a deeper BRAM based buffer but partially preroutes the signals

07:44 <daveshah> Interesting but a lot more arch and tool dependent

07:48 futarisIRCcloud has quit [Quit: Connection closed for inactivity]

07:49 <whitequark> interesting!

07:49 <MadHacker> The "route everything interesting into a region" and then separately "build an analyser in the region" steps are nice for that reordering point you made earlier.

07:51 <MadHacker> Does nextpnr already let us exclude a chunk from use?

07:52 <daveshah> Sort of but it's not exposed in the ideal way for something like this

07:59 <awygle> dang things got interesting right after I left to go to sleep

07:59 <awygle> I've had that paper in my 'to read' pile for weeks, shoulda read it oops

08:00 <MadHacker> Damn timezones, why can't people on the Wrong Side of the planet be awake when I am?

08:00 <MadHacker> UTC+0 is the only valid timezone, right?

08:00 <awygle> Scan chain would be fine in this specific case definitely. Eventually I'd like to look at DDR2 transactions, so a true ILA might be necessary

08:01 <awygle> Agree that there's no reason to limit ourselves to one approach, in fact I strongly believe we shouldn't

08:02 <whitequark> MadHacker: i just stay awake when i need to talk to someone on the other side of planet

08:02 <whitequark> my sleep schedule isn't sun-synchronized so it's ok

08:02 <whitequark> awygle: so we can build these in parallel, maybe

08:02 <Sarayan> wq: The "sun-" part feels unnecessary ;-)

08:03 <whitequark> Sarayan: well it can be synchronized to US time occasionally

08:03 <whitequark> which is distinct from being synchronized to local time

08:04 <whitequark> awygle: i'd be happy if you took care of ILA and i could take a look at register mapping and scan chains

08:04 <whitequark> unless you have a burning desire to dig into some C++ code

08:04 <awygle> Insert some anime gif subtitled "no no no no no"

08:05 <awygle> I feel much more comfortable with an ILA style approach in terms of implementation anyway, feels like less to fill in before I can be useful.

08:06 <Sarayan> Fuck, I'm reading a project proposal draft I'm supposed to work on, and I don't understand it

08:07 <Sarayan> I'm not sure whether it's genius or bullshit, but I tend towards the latter

08:08 <Sarayan> The scientific vision of the AÇAÍ project is that applying, in a new interdisciplinary synergy, principles from diverse fields of computer science and administrative law can lead to discover a canonical core of essential Artificial Intelligence (AI) methods that is simultaneously (a) maximally parsimoniously versatile, (b) meta-circularly autonomic and (c) cost-effectively certifiable for practical use in critical economic sectors

08:08 <Sarayan> Not sure if serious

08:09 <awygle> So far my personal record for "not sure if genius or talking nonsense" is like four years, held by someone most of you probably follow on Twitter

08:09 <awygle> My jury is still out

08:09 <awygle> My point being it can be hard to tell

08:10 <Sarayan> To refine the sensation, I'm not sure if it's "makes sense in his head" or "makes sense in his research domain"

08:10 <whitequark> awygle: cool!

08:11 <whitequark> i find the C++ parts pretty easy to do, actually

08:11 <Sarayan> C++ can be easy

08:12 <awygle> It's not the c++ that I find intimidating, it's the rest of it

08:12 <awygle> As usual it's all about knowing what code to write lol

08:12 <Sarayan> whitequark, you who knows magic and python and C++

08:14 <Sarayan> I'm building a python module in C++, interfacing with a big program/library we've made. I'd like the install to be single-file (e.g. a .so), but I'd also like to have part of the interfacing to be actually written in python. Do you know how much hybridization can be done? I have no issue with embedding python source code in the .so

08:14 <awygle> Somebody debug my endocrine(?) system and figure out why falling asleep is such a chore lately... sigh. gonna go give it another shot, goodnight

08:15 <Sarayan> awygle: I have tricks for that, but I have no idea whether they'd work on you

08:16 <Sarayan> I have a feeling that the limit of hybridization is that classes can either be full-C++ or full-python, but outside that you can actually mix stuff

08:27 <whitequark> Sarayan: take a look at cython, perhaps

08:27 <whitequark> but i haven't done much with it

08:28 Asu has joined #nmigen

09:25 <mwk> daveshah: what do you want to know about xilinx readback?

09:26 <daveshah> the main question was whether there is a shadow register

09:26 <mwk> depends

09:26 <mwk> for a plain FF, yes

09:26 <mwk> for SRL/RAMs, no

09:26 <daveshah> That makes sense

09:26 <mwk> also you cannot look into DSPs

09:27 <mwk> so if you have a pipelined multiplier, forget about introspecting its state

09:27 <mwk> I don't quite recall the blockram output register rules, I think if you're using the pipelined version you're likewise screwed

09:28 <daveshah> Hmm, that's an advantage for soft scan chain insertion too (which is what most of the discussion was about), probing the DSP/BRAM output should always be fine

09:28 <mwk> and, of course, the whole readback-from-ff thing is completely gone on ultrascale

09:28 <mwk> well the output is easy

09:29 <mwk> the problem is internal pipeline stages

09:29 <mwk> I suppose you'd have to replicate them in an introspectable way somehow

09:29 <daveshah> Other than the input registers, the area cost of that seems quite high

09:30 <mwk> yes

09:31 <mwk> one possibility would be to just record inputs in SRLs and recover state in sw


09:32 <mwk> ... or not, clock enables mean that DSP can hold state arbitrarily long, ugh

09:33 <daveshah> Using SRLs was the plan

09:33 <daveshah> in general

09:33 <daveshah> Oh yeah CE is a pain

09:55 futarisIRCcloud has joined #nmigen

10:00 <Sarayan> wq: ok

10:28 thinknok has quit [Quit: Leaving]

10:28 thinknok has joined #nmigen

11:05 * zignig has an output pin with an LED on it.

11:05 <zignig> I would like to attach N elaboratables to the pin with a muxy thing.

11:05 <zignig> what is the best way to do break before make ?

11:06 <ZirconiumX> You don't need to

11:06 <ZirconiumX> If you have "a muxy thing", then just use it

11:06 <zignig> no , I don't need , I want to.

11:07 <zignig> the muxy thing is the issue , I can do a 2X with a Mux(switch,a,b) , but I would like an N way.

11:25 <hell__> mux the muxes?

11:26 <whitequark> Array, perhaps?
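
A behavioural Python sketch of an N-way select built from chained 2-way muxes, plus one possible reading of "break before make": the output idles for one cycle whenever the selected source changes. This is a plain software model, not nMigen code; in nMigen the selection itself would presumably be written with `Array` or `Switch`/`Case` as suggested in the channel. `mux_chain`, `BreakBeforeMakeMux` and the idle value are hypothetical.

```python
def mux_chain(sel, inputs):
    """N-way select as a chain of 2-way muxes; the last input is the default."""
    out = inputs[-1]
    for index in reversed(range(len(inputs) - 1)):
        out = inputs[index] if sel == index else out  # one 2-way mux per step
    return out


class BreakBeforeMakeMux:
    """Drives the idle value for one cycle whenever the selection changes."""

    def __init__(self, idle=0):
        self.idle = idle
        self.active = None  # source currently connected, None = none yet

    def step(self, sel, inputs):
        """One clock cycle: return the value driven onto the pin."""
        if sel != self.active:
            # Break: disconnect the old source for this cycle,
            # the new source is connected from the next cycle on.
            self.active = sel
            return self.idle
        return mux_chain(self.active, inputs)


if __name__ == "__main__":
    mux = BreakBeforeMakeMux()
    sources = [1, 1, 1]  # three sources, all currently driving 1
    print([mux.step(sel, sources) for sel in [0, 0, 1, 1, 2]])
```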

11:52 <MadHacker> whitequark: +1 for portrait mode LCD.

11:53 <MadHacker> (sorry, reading tweets on phone and it's easier to type here)

12:09 chipmuenk has quit [Quit: chipmuenk]

12:13 Asuu has joined #nmigen

12:14 Asu has quit [Ping timeout: 260 seconds]

12:19 <ZirconiumX> zignig: That actually seems like a useful thing, hmm

12:20 <ZirconiumX> Though I guess the correct tool for the job is probably switch/case

12:48 <zignig> ZirconiumX: not sure, but I think it has a multitude of applications , hence my question

12:48 <ZirconiumX> Sure, but generally switch/case *is* a multiplexer

12:49 <zignig> it is also applicable if you have a spi interface and you request a new device, declare a new CS pin and "muxy thing" between devices.

12:52 * zignig continues to battle argparse.

13:03 _whitelogger has joined #nmigen

13:07 <hell__> zignig: right, so if you have three SPI devices, you need a mux that can choose one of them at a time?

13:07 <hell__> I'd just chain two muxes

13:26 Asuu has quit [Read error: Connection reset by peer]

13:29 Asuu has joined #nmigen

14:31 thinknok has quit [Quit: Leaving]

15:01 chipmuenk has joined #nmigen

15:39 <zignig> hell__: I'm thinking so, just stack the muxes, I also think that ZirconiumX is right that a switch/case will elaborate to a mux stack anyway.

15:39 <zignig> Sarayan: me for now. mWHAHAHAHA !

15:48 <Sarayan> hmmm what?

15:51 * hell__ panics and hides under dozens of mainboards

15:56 <cr1901_modern> >(8:57:02 AM) sarayan: who wins?

16:11 thinknok has joined #nmigen

16:15 <Sarayan> Oh :-)

16:15 <Sarayan> Forgot by now

16:55 <awygle> Morning

16:56 <daveshah> afternoon awygle

17:01 <ZirconiumX> Evening awygle

17:05 <awygle> Whelp I guess the day is over, back to bed...

18:18 <ktemkin> mood

18:40 chipmuenk has quit [Quit: chipmuenk]

19:10 cr1901_modern1 has joined #nmigen

19:12 cr1901_modern has quit [Ping timeout: 256 seconds]

19:15 cr1901_modern1 has quit [Quit: Leaving.]

19:16 cr1901_modern has joined #nmigen

19:21 Guest30583 has quit [Quit: Nettalk6 - www.ntalk.de]

20:00 chipmuenk has joined #nmigen

20:02 thinknok has quit [Read error: Connection reset by peer]

20:10 <chipmuenk> Hi,

20:10 <chipmuenk> I've started a little project on DSP using (n)migen at https://github.com/chipmuenk/dsp_nmigen. To keep me from doing real work, I updated the migen logo a little for the nmigen logo at

20:10 <chipmuenk> https://github.com/chipmuenk/dsp_nmigen/blob/master/doc/img/nmigen_logo.svg

20:30 <ZirconiumX> I'm not a huge fan of the Migen logo to begin with >.>

20:43 chipmuenk has quit [Quit: chipmuenk]

20:46 * hell__ gets scared and runs away

21:17 Asuu has quit [Ping timeout: 260 seconds]

23:33 <_whitenotifier-c> [nmigen/nmigen] whitequark pushed 1 commit to master [+0/-0/±1] https://git.io/Jfz4R

23:33 <_whitenotifier-c> [nmigen/nmigen] whitequark fbf9e1f - back.rtlil: handle signed and large Instance parameters correctly.

23:33 <_whitenotifier-c> [nmigen] whitequark closed issue #388: Integer parameters over 32 bits - https://git.io/JfEJd

23:40 <_whitenotifier-c> [nmigen/nmigen] whitequark pushed 1 commit to master [+0/-0/±2] https://git.io/Jfz4i

23:41 <_whitenotifier-c> [nmigen/nmigen] whitequark 404b2e0 - hdl.dsl: check for unique domain name.

23:41 <_whitenotifier-c> [nmigen] whitequark closed issue #385: Bad error message for duplicate ClockDomain - https://git.io/Jf8gV