#m-labs on 2018-12-21 — irc logs at freenode.irclog.whitequark.org

2018-12-09 21:21 sb0_ changed the topic of #m-labs to: https://m-labs.hk :: Logs http://irclog.whitequark.org/m-labs

01:02 <GitHub-m-labs> [artiq] drewrisinger opened issue #1221: No ARTIQ 4 Manual Link on m-labs website https://github.com/m-labs/artiq/issues/1221

01:08 <GitHub-m-labs> [artiq] drewrisinger opened issue #1222: Rust manual paths broken https://github.com/m-labs/artiq/issues/1222

01:53 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 3 commits to master [+0/-0/±7] https://git.io/fhv1l

01:53 <_whitenotifier-6> [m-labs/nmigen] whitequark f7fec80 - ir: allow non-Signals in Instance ports.

01:53 <_whitenotifier-6> [m-labs/nmigen] whitequark 221f108 - back.rtlil: fix translation of Cat.

01:53 <_whitenotifier-6> [m-labs/nmigen] whitequark 6672ab2 - back.rtlil: explicitly pad constants with zeroes.

01:59 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470800049?utm_source=github_status&utm_medium=notification

01:59 <_whitenotifier-6> [nmigen] Failure. 78.31% (-0.37%) compared to 8cc900c - https://codecov.io/gh/m-labs/nmigen/commit/6672ab2e3fd78d539c0efe1f13f301d5ce0aee3e

01:59 <_whitenotifier-6> [nmigen] Success. Coverage not affected when comparing 8cc900c...6672ab2 - https://codecov.io/gh/m-labs/nmigen/commit/6672ab2e3fd78d539c0efe1f13f301d5ce0aee3e

02:07 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 2 commits to master [+2/-0/±5] https://git.io/fhv1d

02:08 <_whitenotifier-6> [m-labs/nmigen] whitequark 6d9a6b5 - hdl.mem: implement memories.

02:08 <_whitenotifier-6> [m-labs/nmigen] whitequark 2b4a851 - back.rtlil: implement memories.

02:09 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470802855?utm_source=github_status&utm_medium=notification

02:09 <_whitenotifier-6> [nmigen] Failure. 77.25% (-1.07%) compared to 6672ab2 - https://codecov.io/gh/m-labs/nmigen/commit/2b4a8510ca3653e796b0a6f8ac0d7868cfdf9a9b

02:09 <_whitenotifier-6> [nmigen] Success. Coverage not affected when comparing 6672ab2...2b4a851 - https://codecov.io/gh/m-labs/nmigen/commit/2b4a8510ca3653e796b0a6f8ac0d7868cfdf9a9b

02:37 <GitHub-m-labs> [artiq] sbourdeauducq pushed 1 new commit to master: https://github.com/m-labs/artiq/commit/e80d80f13323536840dcb1b1806d378ae1026648

02:37 <GitHub-m-labs> artiq/master e80d80f Sebastien Bourdeauducq: manual: move to correct directory for building rust crates. Closes #1222

02:37 <GitHub-m-labs> [artiq] sbourdeauducq pushed 1 new commit to release-4: https://github.com/m-labs/artiq/commit/a53065ffc870d848973dbf663a381fd1d6a11132

02:37 <GitHub-m-labs> artiq/release-4 a53065f Sebastien Bourdeauducq: manual: move to correct directory for building rust crates. Closes #1222

02:38 <GitHub-m-labs> [artiq] sbourdeauducq pushed 1 new commit to release-3: https://github.com/m-labs/artiq/commit/af1f87833a080dad4d126426129416854db71f81

02:38 <GitHub-m-labs> artiq/release-3 af1f878 Sebastien Bourdeauducq: manual: move to correct directory for building rust crates. Closes #1222

02:40 <GitHub-m-labs> [artiq] sbourdeauducq closed issue #1221: No ARTIQ 4 Manual Link on m-labs website https://github.com/m-labs/artiq/issues/1221

02:43 <sb0> byte == deserialized word (10-bit, 20-bit, ...)?

02:48 <whitequark> sb0: byte == word

02:48 <whitequark> inconsistent use of terms

02:53 <sb0> cr1901_modern: yes, the xilinx stuff does only barrel-shifting. there are features to shift the clock instead, but they are complex and broken.

02:53 <sb0> the hack I'm using is to reset this crap until the comma is aligned

02:54 <sb0> you get a random clock phase at each reset

02:55 <whitequark> with ecp5 you just get a bitslip input you can strobe

02:55 <whitequark> doesn't really get simpler than that

02:55 <whitequark> i have only used the barrel shifting mode with pcie though, since it's the only mode supported with native pcie mode for some reason

02:56 <sb0> someone wrote a paper on how determining the clock/comma phase, then using a MMCM to compensate, but it's quite a mess

02:56 <sb0> "for some reason" - sounds like xilinx :)

02:56 <whitequark> i think it's related to the custom pcie features like receiver detectin

02:57 <whitequark> not sure though

02:57 <whitequark> anyway, i never had to do comma alignment manually in the first place

02:57 <whitequark> with ecp5 you put the commas (normal and inverted) into the serdes as parameters

02:57 <whitequark> and it automatically aligns to them and locks

02:57 <whitequark> that actually works well

02:58 <whitequark> sb0: oh, i know why it does barrel shifting in pcie mode, actually

02:58 <whitequark> the UG says that it can align to commas in some low time interval, like 4 symbol times

02:58 <whitequark> you cannot do that with bitslip

02:59 <whitequark> so i think what it does is it simultaneously matches commas in several different alignments

02:59 <whitequark> probably not all 10

02:59 <whitequark> but maybe 5 or 4

03:00 <whitequark> and then reconfigures the barrel shifter after each symbol

03:00 <whitequark> instead of doing bitslip 10 times

03:00 <sb0> yeah well, that's not very hard to do and you can do it in fabric

03:00 <whitequark> i know why they did it in the serdes

03:00 <sb0> I implemented it for HDMI (with barrel shift)

03:01 <whitequark> when you use 5 Gbps mode of SERDES, it only works with 1:2 gearing, because the fabric is not fast enough for 500 Msps rate

03:01 <whitequark> actually it isn't really fast enough for 250 Msps rate either, you really want to use a PSCLKDIV primitive to do a 1:4 total gearing

03:01 <whitequark> and then process it at 125 Msps

03:02 <whitequark> with a very small gearbox in 250 MHz domain

03:02 <whitequark> so, if you were doing bitslip in fabric, you'd probably have to do it post gearboxes, and if you do that, i think you blow timing constraints of some protocols it supports

03:03 <sb0> what's the latency of those things?

03:03 <sb0> with xilinx it's very high, 100-300ns, for some reason

03:03 <whitequark> there is not enough info in UG for me to give you a precise number

03:04 <whitequark> I can measure it for you though

03:04 <whitequark> what methodology should i use?

03:04 <whitequark> send a stream of 0 symbols, put a pulse in it, compare transmitted pulse time to received pulse time?

03:05 <sb0> yes, with 8b10b disabled

03:05 <sb0> or well, you can keep it enabled

03:05 <sb0> just connect transceiver TX with RX, send something, and measure latency inside the fpga

03:06 <sb0> they don't document the latency?

03:06 <whitequark> maybe I can't read

03:06 <whitequark> but they are definitely better at making bug-free IP than at documenting it

03:07 <whitequark> remember when i said about several departments that dont talk to each other?

03:08 <whitequark> sb0: oh, found the table

03:08 <whitequark> they do document latenc

03:08 <whitequark> sb0: what specific configuration are you interested in

03:08 <whitequark> speed, encoding, preemphasis/deemphasis

03:09 <whitequark> equalization

03:09 <whitequark> do you want CTC or not

03:10 <sb0> no CTC

03:10 <sb0> 3gbps

03:10 <whitequark> sb0: interestingly, other than the FPGA bridge (which you can bypass), all latencies they give are exact

03:10 <sb0> 8b10b (does this take more than a few cycles?)

03:10 <whitequark> 8b10b takes exactly 2 symbol clocks

03:11 <bb-m-labs> build #2158 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/2158

03:13 <whitequark> ok, with 3 Gbps, you need at least 1:2 gearbox in SERDES

03:13 <whitequark> or in fabric I guess, 300 MHz might be okay

03:13 <whitequark> assuming you go for a fabric gearbox...

03:14 <whitequark> sb0: transmit latency will be 34.433 ns

03:15 <whitequark> er, sorry, I miscounted

03:17 <whitequark> transmit 28.345ns, receive 37.519ns

03:17 <whitequark> this is with pre-emphasis off, equalization off

03:17 <whitequark> and with SERDES word aligner enabled

03:17 <whitequark> without SERDES word aligner enabled, receive latency is 24.186ns

03:18 <whitequark> sb0: anyway, i can make you a demo design based on ecp5-5g versa board if you want

03:18 <whitequark> that's like half a hour of work

03:28 <bb-m-labs> build #2159 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/2159

03:30 <sb0> ah, that's much better than xilinx

03:30 <sb0> I don't need a demo design, don't even have a lattice board

03:43 <bb-m-labs> build #973 of artiq-win64-test is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/973

03:44 <bb-m-labs> build #2772 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2772

03:47 <whitequark> sb0: https://eu.mouser.com/ProductDetail/Lattice/LFE5UM5G-45F-VERSA-EVN

03:57 rohitksingh_work has joined #m-labs

04:01 hartytp has joined #m-labs

04:01 <hartytp> sb0: "why does it need rtm drtio?"

04:03 <hartytp> I'd like to do the sync from the kernels, which means accessing DAC SPI from kernels, so DRTIO seems nicer than serwb

04:03 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 1 commit to master [+0/-0/±2] https://git.io/fhvyB

04:03 <_whitenotifier-6> [m-labs/nmigen] whitequark b0bd7bf - hdl.ir: correctly handle named output and inout ports.

04:03 <hartytp> also, I'd prefer to hook the sysref gen up to the RTM fpga

04:03 <whitequark> ok, nmigen has working memories in verilog...

04:03 <hartytp> means we don't have to worry about lack of pins on the AMC<->RTM connector

04:04 <hartytp> if you think that RTM DRTIO is going to be an issue then we can do things differently

04:04 <whitequark> now i need to add memory in simulation

04:04 <hartytp> "we can probably move to kernels, the reason it's done in rust was the potential dependency of drtio on the clock tree"

04:04 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470823885?utm_source=github_status&utm_medium=notification

04:04 <_whitenotifier-6> [nmigen] Success. 77.75% (+0.5%) compared to 2b4a851 - https://codecov.io/gh/m-labs/nmigen/commit/b0bd7bfaca2a549358b740d6325780b51fb7f26c

04:04 <_whitenotifier-6> [nmigen] Success. 100% of diff hit (target 77.25%) - https://codecov.io/gh/m-labs/nmigen/commit/b0bd7bfaca2a549358b740d6325780b51fb7f26c

04:05 <hartytp> ack. that made sense then, but now kernels seem like a nicer way of doing things. I'm happy to handle the port

04:16 <whitequark> sb0: attie: is it intended that re does nothing for transparent read ports?

04:16 <whitequark> i.e. re only has meaning for synchronous, non-transparent read ports, and not for anything else?

04:18 <bb-m-labs> build #2160 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/2160

04:19 <hartytp> sb0: also, if I configure the new PLL via a kernel then we can re-use it as a Mirny driver, which seems nice and avoids duplication of work

04:24 <hartytp> whitequark: btw, I've been following progress on nmigen, it looks really cool! nice work

04:25 <whitequark> thanks

04:25 <whitequark> hopefully artiq will run on nmigen (in compat mode) before end of year

04:26 <whitequark> not much remains to be done, really

04:29 <whitequark> it's remarkable how each time i begrudgingly write a test to get coverage like 0.5% up, i find a bug

04:29 hartytp has quit [Ping timeout: 256 seconds]

04:36 <bb-m-labs> build #2161 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/2161

04:47 <bb-m-labs> build #974 of artiq-win64-test is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/974

04:47 <bb-m-labs> build #2773 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2773

04:58 <attie> whitequark: what does "transparent" mean for a port?

04:59 <cr1901_modern> the address currently being written on a given clk is available on the read port

04:59 <whitequark> attie: WRITE_FIRST ~ transparent

04:59 <whitequark> xilinx even uses them alternately

05:00 <cr1901_modern> yosys only uses transparent in its cell lib if memory serves

05:00 <sb0> the only potential issue I see with RTM DRTIO is the size of the satman firmware that has to fit in BRAM

05:00 <sb0> the rest should work fine

05:01 <attie> sounds more like a footgun to me, tbh

05:01 <attie> when I use an re it's usually because I want the read output to be stable until the next re

05:02 <whitequark> attie: well, migen has that footgun right now

05:02 <whitequark> it ignores re even if you specify it

05:02 <attie> it wouldn't be the only one...

05:02 <whitequark> interestingly, so does yosys

05:02 <whitequark> $memrd cells will ignore re unless they're configured as non-transparent

05:02 <whitequark> anyway, in nmigen i will make it a hard error

05:02 <attie> I mean, I guess you're assumed to think about what mode it's in first

05:02 <whitequark> re will be hardwired to a constant 1 unless it will be used

05:04 <attie> although I can think of contexts where I might want to keep the read address stable and see changes in content reflected

05:05 <attie> is it possible to make the difference between those two somehow explicit?

05:09 <whitequark> hm

05:09 <whitequark> attie: which two?

05:10 <attie> latch address vs. latch output

05:11 <attie> I'm still not entirely clear, what exactly does it do if you put has_re=True on a WRITE_FIRST port?

05:11 <whitequark> migen latches address, nmigen doesn't latch anything

05:11 <whitequark> nmigen doesn't have has_re.

05:12 <attie> ok, so it is what I was thinking of.

05:12 <attie> there's a use case for both modes, latch address and latch output.

05:13 <attie> but it should be a bit clearer which one you're getting, it being a combination of has_re and mode is very cryptic

05:14 <whitequark> yosys doesn't currently support latch address directly afaict

05:16 <attie> I guess if you use async_read you can manually implement the address register.

05:16 <whitequark> well to me it seems like while re is low, output should never change

05:17 <whitequark> so while latch address mode is useful, it should not be controlled by re

05:17 <attie> conceptually, absolutely those are two different things and they shouldn't both be called re.

05:18 <bb-m-labs> build #2162 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/2162

05:18 <attie> but they use the same signal in xilinx bram so you have to make the output look like that eventually.

05:20 <whitequark> strictly speaking, xilinx synthesizer has to fold unnecessary registers in

05:20 <attie> although I guess it doesn't matter if it's a different signal, as long as they're not both present simultaneously?

05:21 <attie> actually idk what vivado does if you put two, isn't that the optional output register

05:24 <attie> hmm actually latch address mode is only useful if you are writing on another port and waiting to see those changes. and unless you are in read first mode, that's undefined behavior.

05:25 <attie> no wait, unless *the other port* is in read first mode

05:26 <attie> this whole component is a footgun.

05:26 <bb-m-labs> build #975 of artiq-win64-test is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/975

05:26 <bb-m-labs> build #2774 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2774

05:27 <whitequark> indeed

05:27 <attie> maybe just wrapping it into something that disallows all the more complicated options is fine...

05:28 <attie> would it be possible to have a lower-level "expert mode" module that has all the options, and a higher level one that offers the most sensible options sans footgun?

05:31 <whitequark> attie: i think a set of warnings (on by default) would be useful

05:31 <whitequark> can you look at uh

05:31 <whitequark> give me a moment to push

05:32 <attie> btw PR #90 was exactly that footgun, and the bug was in migen for years before anyone noticed

05:33 <whitequark> yeah...

05:33 <whitequark> and my ghetto solution (making re a constant) would have caught the bug

05:34 <attie> for extra fun there was also a bug in the sim implementation that hid this bug

05:35 <sb0> hartytp: so all SPI would be RTIO/realtime then?

05:35 <sb0> then it's exactly the same driver as mirny

05:36 <sb0> btw when will you test the PLL chip for phase determinism? that would be a big problem if it doesn't work

05:38 <sb0> rewriting into kernels will add some more development time before the board does anything, though, and joe will likely be unhappy with that. and it seems joe still hasn't learned from what happens when he pushes unrealistic deadlines

05:43 <sb0> actually, if we do that then we can strip the non-RT SPI support from the DRTIO firmware on the RTM side, and gain some memory

05:44 <sb0> all satman would do then is basically program the si5324 and answer aux ping, moninj, and a few minor things

05:49 <sb0> we can strip routing management since it's always a leaf node

05:54 <whitequark> attie: sb0: do asynchronous memory write ports ever make sense?

05:54 <whitequark> how would that even work?

05:54 <whitequark> technically, yosys implements them, though i don't think it actually does anything meaningful with the cells

05:55 <whitequark> write_verilog definitely emits invalid verilog for asynchronous write ports

05:57 <whitequark> oh, nevermind, I've realized

05:57 <whitequark> asynchronous write ports use enable as a strobe

05:57 <whitequark> it definitely never makes sense to instantiate it in an nmigen design

05:58 <whitequark> but the semantics of a $memwr cell like that is at least defined

06:04 <whitequark> wow, there are *so many* errors migen isn't checking when using memories

06:07 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 3 commits to master [+1/-0/±6] https://git.io/fhvHB

06:07 <_whitenotifier-6> [m-labs/nmigen] whitequark 8d58cbf - back.rtlil: more consistent prefixing for subfragment port wires.

06:07 <_whitenotifier-6> [m-labs/nmigen] whitequark a061bfa - hdl.mem: tie rdport.en high for asynchronous or transparent ports.

06:07 <_whitenotifier-6> [m-labs/nmigen] whitequark c49211c - hdl.mem: add tests for all error conditions.

06:07 <whitequark> attie: okay, can you go over the code i just pushed and see if you can find any combinations that are definitely undefined?

06:09 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470845762?utm_source=github_status&utm_medium=notification

06:09 <_whitenotifier-6> [nmigen] Success. 79.27% (+1.51%) compared to b0bd7bf - https://codecov.io/gh/m-labs/nmigen/commit/c49211c76ab18762039918c41384435ac793943a

06:09 <_whitenotifier-6> [nmigen] Success. 100% of diff hit (target 77.75%) - https://codecov.io/gh/m-labs/nmigen/commit/c49211c76ab18762039918c41384435ac793943a

06:20 <attie> whitequark: I think only combinations of two ports are undefined, no? if you have a transparent write port, a read from the same address is undefined.

06:21 <attie> you have only write or read ports now, no ports that can do both?

06:23 <whitequark> attie: hmm

06:23 <whitequark> but you can request several ports from the same memory

06:24 <whitequark> I'm confused as to how "transparent" is a feature of combinations of two ports

06:25 <whitequark> "transparent" means that if a write (somehow) touches a memory location, it is immediately reflected at the output of this read port

06:25 <attie> yeah, but your write port doesn't even have a read data signal any more

06:25 <attie> so where would that even be reflected?

06:25 <whitequark> on a different read port of the same memory

06:25 <whitequark> that (dynamically) reads the same address

06:25 <attie> yeah, except no because that's not how xilinx bram works

06:26 <attie> it literally cannot offer that feature, and gives you undefined instead

06:26 <attie> (IIRC in practice comes out to value changes next cycle)

06:27 <attie> xilinx bram is transparent *only* on the same port you are writing from

06:28 <whitequark> hm

06:29 <whitequark> so how does their synthesizer know when to assemble read and write ports into a single bram port?

06:29 <whitequark> does it look at the address syntactically?

06:29 <attie> yeah one address signal = one port i think

06:30 <cr1901_modern> pretty sure any pipelined softcore relies on a write being immediately available to read on a second port... and well, I've seen plenty of soft cores work on Xilinx

06:32 <whitequark> (drive them to the same address)

06:32 <whitequark> is you request a read port and a write port with the same address

06:32 <whitequark> attie: right so the way you get a read/write port in nmigen

06:32 <whitequark> since this *can* represent xilinx (with caveats) but the opposite cannot represent e.g. ice40

06:32 <attie> cr1901_modern: well, once you start pipelining you have to detect hazards anyway. so you can wait one clock cycle longer.

06:33 <cr1901_modern> attie: Fair, wasn't meant to derail, just I could've sworn I've relied on the exact behavior to work that Xilinx claims doesn't work. Hmmm...

06:34 <attie> cr1901_modern: did you check that the component did end up in bram and not in dram?

06:34 <attie> the *same* verilog description will infer to either, and merrily change behavior, merely based on size and the mood of the synthesizer that day.

06:35 <cr1901_modern> attie: Will check tomorrow/when I'm a bit more focused. Tbh, I don't remember where the reg file goes on Xilinx devices

06:35 <cr1901_modern> (same thing of course works on ice40, where only BRAM is present)

06:35 <attie> unless you have an extremely large reg file, likely to be dram.

06:35 <whitequark> 06:34 < attie> the *same* verilog description will infer to either, and merrily change behavior, merely based on size and the mood of the

06:35 <whitequark> wtf

06:36 <whitequark> is this ise or vivado?

06:36 <attie> vivado

06:36 <attie> that's what PR #105 is about and why I run my own fork of migen atm

06:36 <whitequark> ugh

06:37 <attie> because I have a SyncFIFO, and depending on its depth and width, sometimes it works and sometimes it doesn't

06:37 <whitequark> incredible

06:37 <whitequark> i hate this

06:37 <whitequark> i hate this passionately

06:37 <whitequark> why do we even *use* xilinx

06:37 <attie> personally, I do because someone gave me free FPGAs

06:37 <cr1901_modern> attie: Is PR #105 your PR?

06:38 <attie> yes, I'm nakengelhardt on github

06:38 <_whitenotifier-6> [nmigen] whitequark opened issue #12: Implement a sanitizer for memory port combinations - https://git.io/fhvQl

06:38 <whitequark> attie: ok so

06:38 <whitequark> i've opened #12

06:39 <whitequark> do you think you can write in the most painfully verbose way the exact requirements xilinx has for its bram and dram

06:39 <whitequark> because i can implement the sanitizer but i find the requiremnts extremely confusing

06:39 <whitequark> there will probably also have to be a "mode" parameter for Memory, to select between bram and dram and possibly dff

06:40 <cr1901_modern> attie: Btw, since you wrote mist https://twitter.com/cr1901/status/1064996800988438528 (everything in this tweet is out of date now that nmigen targets RTLIL, btw)

06:40 <cr1901_modern> s/targets RTLIL/devel is underway/

06:41 <attie> honestly, I've come to the conclusion that 'portable' verilog is a lie, and you need to manually instantiate xilinx BRAM to have any behavior guarantees.

06:41 <whitequark> or at least stuff it full of attributes

06:41 <whitequark> which is my plan here

06:41 <attie> yeah that would work

06:42 <attie> could also extend in the future to ultraram, which is only inferred with the attribute

06:42 <attie> (haven't used it at all yet, no idea what new and exciting pitfalls await there :)

06:42 <cr1901_modern> That's what attributes are for, no? :)

06:43 <whitequark> no

06:43 <whitequark> attributes are not to keep the synthesizer from illegally converting valid behavioral verilog to completely different gateware

06:44 <cr1901_modern> I meant using attributes in the context of representing inherently non-portable HDL in a unified manner

06:45 <cr1901_modern> If you can't reliably get xilinx BRAM to behave in the context of nmigen, using attributes is, well IMO, a good solution

06:47 <cr1901_modern> which vivado output file would have the "this signal X mapped to Y primitive" information?

06:48 <attie> At one point last year I tried to experiment with what verilog descriptions would synthesize into what BRAM settings, but I could not because *there was not enough other crap* in my design and so it never mapped to BRAM at all

06:55 <whitequark> daveshah: question, TN1250 says that RE is only available for 256x16 BRAM configuration

06:55 <whitequark> however, all the other primitives still have RE

06:55 <whitequark> what is the reality like

06:57 <cr1901_modern> attie: http://ix.io/1wul Gonna go ahead and guess Xilinx has been using dist ram all this time

06:57 <_whitenotifier-6> [nmigen] whitequark commented on issue #12: Implement a sanitizer for memory port combinations - https://git.io/fhv73

07:00 <attie> yep, that one has the behavior you'd expect (writes are immediately visible on the read port).

07:00 <attie> hope you don't have a generic somewhere to change the size of the register file, or it might suddenly give you stale data without notice :)

07:01 <cr1901_modern> nope, for lm32 it's set

07:02 <attie> I think it's fairly rare to run into these problems because the usual use cases map to the appropriate sizes

07:02 <cr1901_modern> I did some experiments w/ ice40 lm32 early this year; transparent reads are implemented using $memrd bypass circuitry that compare the write address to the current read address and enable a mux if they match.

07:03 <attie> yeah, I guess they added that to the hard logic in xilinx bram but didn't add the cross-check to the other port.

07:03 <cr1901_modern> IIRC, yosys does this at the RTLIL stage though (so the other backends might have the same impl w/ $memrd bypass as well)

07:03 <cr1901_modern> I've never used the xilinx backend of yosys to check tho

07:04 <attie> since they could be both write addresses and then you are personally responsible for ensuring no conflict anyway.

07:04 <cr1901_modern> one write port, two read ports- write conflict isn't possible in this case (unless 2am me is more tired than usual)

07:06 <sb0> whitequark: afaict intel isn't better and there are no alternatives to xilinx and intel for large and less-slow FPGAs

07:06 <whitequark> ok

07:06 <attie> yeah, if each of the ports only has one direction then there are fewer problematic cases and interactions.

07:06 <sb0> might have used ECP5 on kasli though, if the speed is acceptable

07:06 <cr1901_modern> sb0: pretty sure you've said intel is even worse. Like their high speed transceivers are completely unusable

07:07 <cr1901_modern> does this ring a bell/like something you've said?

07:07 <whitequark> it's not that they are unusable

07:07 <whitequark> it's that they kill themselves if you stop the clock

07:07 <whitequark> which is a hilarious silicon bug if you aren't trying to use that silicon

07:07 <attie> ahaha that was also my discovery :D

07:07 <attie> november last year was a fun month!

07:07 <whitequark> i mean transceivers are fun in every family

07:08 <sb0> yeah, other than that they just seem roughly as crappy as the xilinx transceiver

07:08 <whitequark> ah.

07:08 <whitequark> fun fact: ECP5 was likely going to be all 5G, but they fucked up

07:08 <whitequark> with transceivers

07:08 <whitequark> so they have to heavily bin them and *then* they overvolt them by 100 mV to actually get them to work at 5G

07:08 <attie> They do give you a big critical warning about it though so at least I didn't fry my hosts' brand-new cards before they even got to use them.

07:09 <sb0> I guess we can put a ECP5 on the sayma rtm too

07:09 <whitequark> and the entire ECP4 series got cancelled because they couldn't get transceivers to behave at all

07:09 <whitequark> sb0: this doesn't apply to operating at 3G btw

07:09 <whitequark> you can use the cheaper non-5G parts at normal Vcore

07:09 <whitequark> Vtr* even

07:10 <whitequark> ECP4 was also going to get much larger fabric

07:10 <whitequark> as it is, ECP3 has FPGAs with more LUTs than the largest ECP5 in existence

07:11 <whitequark> which is pretty weird

07:11 <sb0> what about speed?

07:11 <sb0> this is usally the FPGA metric that sucks the most

07:11 <whitequark> ECP5 has a smaller node so it's generally faster than ECP3

07:11 <whitequark> do you want any specific numbers I can find?

07:12 <sb0> how fast does misoc run on it?

07:12 <whitequark> I haven't run misoc on it

07:12 <whitequark> but I can try

07:12 <sb0> also, artiq rtio

07:12 <whitequark> oh something interesting wrt RTIO on ECP5

07:15 <whitequark> many ECP5 pins (I think half of them? this part is a bit confusing) have an integrated gearbox

07:15 <whitequark> so you can select 1:2, 1:4 or 1:7

07:16 <whitequark> is this useful for RTIO maybe?

07:16 <sb0> is it like a small SERDES?

07:16 <sb0> Xilinx has this too, no?

07:16 <whitequark> essentially

07:16 <sb0> isn't that used for the SDRAM?

07:16 <sb0> and 1:2 is a DDR register, right?

07:17 <whitequark> yes and yes

07:17 <whitequark> well, they group them all together

07:17 <sb0> yeah we use this for the TTL PHY on Xilinx to get 1ns I/O resolution

07:17 <whitequark> ah ok

07:17 <whitequark> ECP5 clocking for this feature is very complicated and obnoxious

07:18 <whitequark> I am not sure if it really is so complicated, or it is just because it is made for MIPI and SDRAM

07:18 <whitequark> which are by themselves complicated and obnoxious

07:18 <sb0> SDRAM is relatively reasonable given the constraints

07:18 <whitequark> you can also align the data to clock edge or between clock transitions

07:18 <sb0> it's basically designed like that to be cheap

07:18 <whitequark> without having to mess with PLLs

07:19 <whitequark> ah I see

07:19 <sb0> low pin count, high bandwidth, high SDRAM chip yields

07:20 <sb0> the problem is most SDRAM PHYs are fucked, but this is the case for most commercial IPs that have CDCs or asynchronous parts, especially when it's from xilinx

07:20 <sb0> and SDRAM has a lot of async stuff

07:22 <sb0> in misoc we're doing a bit of a hack for reading (not using DQS), and there's another hack I want to try to clean that up

07:22 <sb0> the main issue is DQS is not free-toggling, so some data gets stuck unless you have some async logic that isn't implementable on xilinx fabric

07:23 <whitequark> hm this is definitely complicated and obnoxious

07:23 <sb0> what I want to do is have the controller issue 1 dummy-read after each legitimate read (just repeat the last read command) to make the SDRAM chips toggle DQS and pump data out of the FPGA I/O cell and into the elastic buffer

07:23 <whitequark> as a matter of fact

07:23 <whitequark> i should study how SDRAM works

07:24 <sb0> then everything is implementable cleanly and without any xilinx hard-IP bullshit

07:24 <sb0> well, yes, but it's done like that because DQS is bidirectional

07:24 <sb0> doing it otherwise requires more pins, and SDRAM is optimized for rock-bottom cost

07:27 <sb0> those pins have to be routed on every DIMM, connector, motherboard etc.

07:28 <sb0> also if it's a free-running clock, you can't mux it. with the current design you can connect DQS pins of several DRAM ranks on a bus, and use chip select

07:29 <whitequark> so why is DQS a thing? don't you already have a clock?

07:29 <whitequark> or is that clock used only as reference and DQS for actually transferring data?

07:29 <sb0> yes, but the skew isn't known and can vary from chip to chip and with PVT

07:29 <whitequark> ok

07:30 <sb0> with DQS all data transfers are source-synchronous

07:30 <whitequark> i see, your workaround makes sense

07:31 <sb0> we need some sort of FIFO that can write on both clock edges

07:32 <sb0> this isn't doable on xilinx fabric

07:32 <sb0> so we need IDDR + regular FIFO, which needs some cycles to flush the pipeline

07:32 <whitequark> what if you use two FIFOs?

07:34 <sb0> maybe, but matching the delays (since this will be coming straight from the I/O pin at >1Gb/s) will be very tricky

07:34 <whitequark> hmm okay

07:34 <sb0> also, more routing delays inside the FPGA = more absolute VT drift

07:36 <sb0> the "repeat last read command" hack seems much more reliable and easier, and also should have a negligible impact on performance

07:36 <whitequark> i mean you only need that before going to a different state, right?

07:36 <whitequark> back to back reads should be fine

07:36 <sb0> the main issue with it is the additional cycle it would take for read-to-write turnaround

07:36 <sb0> absolutely, it's a pipeline

07:37 <whitequark> yeah, it definitely sounds much more reliable

07:45 <daveshah> whitequark: I'm pretty sure the RE/WE pins on the ice40 are subtly broken in various ways and both Yosys and icecube use RCLKE/WCLKE

07:45 <whitequark> wtf

07:46 <whitequark> ok good old fpga bugs

07:46 <daveshah> I think tying WCLKE high and using WE instead breaks initialised BRAM

07:47 <daveshah> https://github.com/YosysHQ/yosys/issues/101

07:48 <whitequark> thanks

07:48 <whitequark> this is definitely what i see in techmap files

07:49 <daveshah> ad DQS BTW, the ECP5 has a hard FIFO on all the left and right IO pins for DQS CDC

07:50 <whitequark> daveshah: any idea if iCECube infers transparent ports?

07:50 <daveshah> whitequark: no, not sure

07:51 <whitequark> their example goes like:

07:51 <whitequark> always @(posedge clk)

07:51 <whitequark> begin

07:51 <whitequark> if (write_en)

07:51 <whitequark> mem[(addr)] <= din;

07:51 <whitequark> dout = mem[addr]; // Output reend

07:51 <whitequark> ... did they really mix <= and = here?

07:51 <whitequark> what semantics is this *supposed* to have?

07:52 <whitequark> I *think* this is read-before-write?

07:53 <daveshah> Mixed <= and = is the sort of thing that breaks stuff

07:54 <daveshah> But yes, that is read before write

07:58 <whitequark> this is depressing

07:58 <whitequark> can FPGA companies please refrain from ever writing any verilog

07:58 <whitequark> they do not appear capable of doing it

07:59 <whitequark> daveshah: why do they even have RCLKE/WCLKE?

07:59 <daveshah> idek

07:59 <whitequark> is it for power saving?

07:59 <daveshah> yes

08:00 <daveshah> except they seem to be the only read or write enable icecube uses

08:00 <whitequark> ... why do they even have RE and WE then?..

08:00 <daveshah> That's the bigger mystery

08:01 <whitequark> maybe they had a macro from the fab or something

08:01 <whitequark> I have no better ideas

08:02 <daveshah> If you look at the SPRAM, you can actually find out which macro they used

08:02 <whitequark> ha, really?

08:02 <daveshah> The user guide inexplicably gives you a mapping from Verilog port names to RAM macro port names

08:02 <whitequark> like physically look or at the interface?

08:02 <daveshah> Without even telling you what half the power control ports do

08:03 <whitequark> lol yeah it does

08:03 <daveshah> So you have to Google those port names and find the macro

08:03 <whitequark> it refers to "RAM datasheet"

08:03 <daveshah> Or look at the sim model in Radiant which also came from the memory compiler

08:03 <daveshah> It's from Virage btw

08:04 <daveshah> who now seem to be Synopsys Designware

08:06 <whitequark> so

08:06 <whitequark> daveshah: do you think you can add info about ECP5 RAM similar to https://github.com/m-labs/nmigen/issues/12#issuecomment-449280778 ?

08:07 <sb0> whitequark: hey maybe the new laser PDH locker can use Lattice

08:08 <whitequark> PDH locker?

08:08 <sb0> pound-drever-hall

08:08 <whitequark> what does the FPGA need to do?

08:09 <sb0> https://github.com/sinara-hw/meta/issues/28

08:10 <whitequark> so this is ECP5 territory

08:10 <whitequark> okay

08:10 <whitequark> >

08:10 <whitequark> Is setting up PDH locks really something you want/need to do with ARTIQ, or is this better done as a simple "standalone" board of some sort (e.g. an Arduino shield) where you write some quick and dirty GUI and then set-and-forget?

08:10 <whitequark> arduino shield...

08:11 <whitequark> really?

08:11 <sb0> probably even less than ECP5, maybe even MCU

08:11 <whitequark> ah, then put iCE40 there

08:12 <sb0> though a nice FPGA potentially allows fancier locks, but the NIST folks know a lot more than I do...

08:15 <whitequark> I should see how fast picorv32 can get on UP5K

08:16 <whitequark> remember those numbers you said are shit? well I looked closer and we've actually improved the toolchain quite a bit since then

08:16 <whitequark> there was a lot of low-hanging fruit in yosys and of course arachne was not timing-driven

08:16 <sb0> I have very little experience with lasers, chiefly because doing it with trashed equipment from eBay (which is the only thing I can afford) is a pain and time sink

08:19 <whitequark> well I know iCE40 practically inside out at this point

08:20 <sb0> it's much worse than vacuum where parts are more repairable (give them a good cleanup), cheaper, and more generally available

08:20 <whitequark> it's easy to design for, easy to write HDL for, and generally not a pain

08:20 <whitequark> there are a few stupid gotchas with pin assignment, mainly

08:21 <whitequark> the way they do clocks and PLLs is just not very good and requires forethought at PCB design phase

08:21 <whitequark> each PLL is assigned to an IO buffer and if you feed a PLL to a GB then it eats the I part of IO (?!)

08:22 <whitequark> this is of course only mentioned in the footnotes and there is no table anywhere that says which PLL goes where

08:22 <_whitenotifier-6> [nmigen] nakengelhardt commented on issue #12: Implement a sanitizer for memory port combinations - https://git.io/fhvFk

08:22 <whitequark> you have to look at bondouts and such

08:22 <whitequark> again the department htat made silicon did not talk to the one making documentation

08:22 <attie> ok, I've added what I think I know about the xilinx BRAM behavior.

08:23 <whitequark> thanks!

08:24 <whitequark> >sometimes it will find a read register somewhere even if you intended to write an asynchronous read port

08:24 <whitequark> ...

08:24 <whitequark> sb0: what do you think about running *only* $mem cells through Yosys Xilinx techmapping in nMigen?

08:24 <whitequark> because this kind of shit is just absurd

08:24 <whitequark> are they even trying?

08:25 <attie> that one isn't really a "bad xilinx" though

08:26 <attie> it's just that if your address didn't have to go through many combinatorial stages, the resulting verilog looks the same

08:26 <whitequark> syntactically or semantically?

08:26 <attie> both

08:27 <whitequark> hm ok

08:27 <attie> I mean, if you got the address from some module that has an output register

08:27 <attie> and then feed it to your asynchronous read port

08:27 <attie> and migen puts the whole thing in a single file

08:28 <whitequark> ah I think I see

08:28 <whitequark> so what does Yosys do here...

08:28 <attie> how would it know that the register was meant to be associated with *this* bit of code rather than *that* bit of code

08:28 <_whitenotifier-6> [nmigen] daveshah1 commented on issue #12: Implement a sanitizer for memory port combinations - https://git.io/fhvFg

08:30 <whitequark> hm, Yosys always infers 7-Series BRAMs in READ_FIRST moed.

08:30 <daveshah> whitequark: a small picorv32 design can do about 26MHz with nextpnr

08:30 <whitequark> daveshah: iirc when I last tried it was about 13 MHz with arachne

08:30 <whitequark> but it was mor1kx

08:30 <whitequark> I should try misoc again

08:30 <daveshah> That was on up5k

08:31 <whitequark> oh it was on hx8k

08:31 <daveshah> Probably 60MHz on hx8k is doable

08:31 <whitequark> sb0: ^ that's far more reasonable than arachne numbers

08:31 <whitequark> though picorv32 is still weirdly slow

08:31 <whitequark> I should definitely try mor1kx again

08:32 <daveshah> I think nextpnr is at least 30% better than arachne on picorv32

08:32 <daveshah> Plus there have been the recent improvements to Yosys by you and tnt

08:37 <whitequark> attie: do I understand it right that READ_FIRST/WRITE_FIRST/NO_CHANGE is primarily to determine what happens with data out of the writing port?

08:40 <whitequark> effectively that configures some sort of mux and register of the read port, right?

08:40 <whitequark> in READ_FIRST it just does a read, in WRITE_FIRST it muxes data in to data out, in NO_CHANGE, WE gates the clock to that register

08:40 <whitequark> this probably makes more sense if you're looking at the bitstream...

08:41 <daveshah> At least on ECP5 it's very uninformative - just one random bit for either READBEFOREWRITE or WRITETHROUGH modes

08:41 <daveshah> Which I think behave quite similarly

08:41 <whitequark> hm, I think I get it

08:42 <whitequark> >one port is writing AND it is in READ_FIRST mode, in which case the other port will read the old data. (Note that it is the setting of the write port that defines the behavior of the read port. Setting of the read port is irrelevant.)

08:42 <whitequark> this part is just what it would do anyway, isn't it

08:43 <whitequark> like, you are configuring the read behavior through the write port because the read mux/register is fed by the WE signal

08:43 <whitequark> of the write port

08:43 <whitequark> daveshah: does this make sense to you?

08:43 <whitequark> it feels like a really sloppy abstraction to me, barely covering the underlying macro

08:44 <whitequark> which is why it's so weird

08:44 <daveshah> Interestingly the ECP5 also has a NORMAL mode where the read port is undefined while writing

08:44 <whitequark> what bit does that set?

08:44 <whitequark> or

08:44 <daveshah> No bits

08:44 <daveshah> It's the default

08:44 <whitequark> does that just remove a false path between DIN and DOUT?

08:44 <whitequark> ah

08:45 <whitequark> so you have two bits for READBEFOREWRITE and WRITETHROUGH

08:45 <whitequark> interesting

08:45 <daveshah> Yes

08:45 <whitequark> I wonder what the former bit does

08:45 <daveshah> I'm not fully sure how they behave when dealing with both ports

08:45 <whitequark> yeah I'm coming to conclusion that we need some sort of testbench

08:46 <whitequark> to validate whether we actually instantiated memory in a sane way

08:46 <daveshah> I wonder if they have to have explicit bypass logic even for the READBEFOREWRITE case

08:46 <whitequark> this is very obnoxious

08:46 <daveshah> I dare say no one has mentioned Intel yet either

08:46 <daveshah> God knows what cursed shit they do

08:59 <attie> I didn't really think about the underlying hardware yet.

09:00 <attie> But to sum up my feeling about xilinx bram handling, "I can see how you got here but maybe you should reevaluate your life choices."

09:00 <whitequark> yeah i definitely concur

09:00 <whitequark> i sort of understand it now but it's still a nightmare

09:03 <attie> oh since I just had to scroll through another hundred of this delightful message, Xilinx's official advice on this: "It is suggested to confirm via simulation that an address collision never occurs and if so it is suggested to try and avoid this situation."

09:04 <whitequark> wonderful.

09:22 <whitequark> yeah there's definitely a huge sim/synth mismatch in migen wrt RE

09:34 <whitequark> actually, this doesn't even make any sense

09:35 <whitequark> WRITE_FIRST merely latches address

09:35 <whitequark> so in case of address conflict MemoryToArray is just wrong

09:35 <whitequark> but it's wrong in a yet another way, different from Xilinx

09:35 <whitequark> this is a horrible mess

09:43 <whitequark> i think MemoryToArray is just mostly completely wrong

09:43 <whitequark> this needs to be simulated in a completely different way

09:50 <attie> I thought I made it match with what BRAM is doing at least. What else is wrong?

09:51 <whitequark> it might match the Xilinx behavior, I don't understand it that well

09:51 <whitequark> but it's wrong in general

09:51 <whitequark> well

09:52 <whitequark> ohhhh I see

09:52 <whitequark> WRITE_FIRST effectively implements an async read port

09:53 <whitequark> except the address is latched

09:53 <whitequark> this is horrible but it should be correct

09:53 <whitequark> ah, no, it's not exactly correct

09:53 <whitequark> attie: so, let's say you have a WRITE_FIRST port with has_re=True

09:53 <whitequark> now let's say you set re low

09:53 <attie> mm yeah that case might be badly handled

09:54 <whitequark> this is the general problem with the address latching trick

09:54 <whitequark> I thought it was more broken than it is, but it's still broken

09:54 <whitequark> can't exactly blame you

09:54 <attie> it's not "address latching trick" so much as "this is the xilinx macro for write first"

09:54 <whitequark> attie: what I do in nMigen simulator is I abuse one of the simulator implementation details to do actual forwarding

09:55 <whitequark> abuse in the sense that this is an implementation detail no nMigen consumer may rely on

09:55 <whitequark> but it will work correctly with any number and any kind of ports

09:55 <whitequark> even multiple write ports

09:55 <whitequark> let me write the tests for it and push so you can see

09:56 <attie> but the re signal is not conforming to the xilinx macros, so I'm not sure exactly how the xilinx tools interpret it

09:56 <whitequark> yes, this thing also bothers me about Migen's Memory primitive

09:56 <whitequark> it's tailored to Xilinx enough to make it badly fit other FPGAs, but at the same time not enough that you won't get UB

09:56 <whitequark> like half of the possible combinations give you nonsensical behavior on Xilinx

09:56 <whitequark> honestly, the more i look at it, the more unhappy i am about it

09:57 <whitequark> and xilinx FPGAs

09:57 <whitequark> and Verilog

09:57 <whitequark> really, fuck all of this shit, who thought synthesis from Verilog was a good idea in the first place

09:57 <attie> that's a permanent way of life if you work in this area :D

09:57 <whitequark> well ice40 does not give me this kind of pain, and ecp5 also seems to behave sanely

09:58 <attie> we practically have weekly bitching sessions about "why are our tools so terrible"

09:58 <whitequark> "weekly" implies this ever stops

09:58 <attie> well in a 20 person office you have to shut up sometimes or the people from other departments will be cross.

09:58 <whitequark> lol

10:25 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 1 commit to master [+0/-0/±1] https://git.io/fhffT

10:25 <_whitenotifier-6> [m-labs/nmigen] whitequark a40e2ca - back.pysim: fix an issue with too few funclet slots.

10:27 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470913575?utm_source=github_status&utm_medium=notification

10:27 <_whitenotifier-6> [nmigen] Success. Absolute coverage decreased by -0.01% but relative coverage increased by +20.72% compared to c49211c - https://codecov.io/gh/m-labs/nmigen/commit/a40e2cac4bb5d082d2c2c754a43cce5c8563b061

10:27 <_whitenotifier-6> [nmigen] Success. 100% of diff hit (target 79.27%) - https://codecov.io/gh/m-labs/nmigen/commit/a40e2cac4bb5d082d2c2c754a43cce5c8563b061

11:47 rohitksingh_work has quit [Read error: Connection reset by peer]

11:52 <whitequark> attie: can you take a look at my simulation models and tests?

11:52 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 1 commit to master [+0/-0/±2] https://git.io/fhfIy

11:52 <_whitenotifier-6> [m-labs/nmigen] whitequark fbb5eab - hdl.mem: add simulation model for memory.

11:53 <whitequark> basically, in nmigen, there is no MemoryToArray; it directly emits $memrd and $memwr cells for Yosys while providing a behavioral model for the simulator. the simulator actually doesn't understand memory at all.

11:53 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470944339?utm_source=github_status&utm_medium=notification

11:53 <_whitenotifier-6> [nmigen] Success. 79.67% (+0.41%) compared to a40e2ca - https://codecov.io/gh/m-labs/nmigen/commit/fbb5eab18353ef872ed62861138b37bb970d622d

11:53 <_whitenotifier-6> [nmigen] Success. 96.77% of diff hit (target 79.25%) - https://codecov.io/gh/m-labs/nmigen/commit/fbb5eab18353ef872ed62861138b37bb970d622d

11:54 <whitequark> and I model transparent ports by combining an async port with a latch that prevents any changes while clock is high

11:54 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 1 commit to master [+0/-0/±2] https://git.io/fhfIN

11:54 <_whitenotifier-6> [m-labs/nmigen] whitequark e58d9ec - hdl.mem: add simulation model for memory.

11:55 <whitequark> the *only* case where you could observe mismatch is if you gate a clock to some submodule while it is low and start changing the address and look at the outout

11:55 <whitequark> but this is sufficiently pathological that it's not worth fixing I think

11:56 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470945317?utm_source=github_status&utm_medium=notification

11:56 <_whitenotifier-6> [nmigen] Success. 79.67% (+0.41%) compared to a40e2ca - https://codecov.io/gh/m-labs/nmigen/commit/e58d9ec74d1bbb17b3214514cea72d97b9d41955

11:56 <_whitenotifier-6> [nmigen] Success. 96.77% of diff hit (target 79.25%) - https://codecov.io/gh/m-labs/nmigen/commit/e58d9ec74d1bbb17b3214514cea72d97b9d41955

12:05 <attie> whitequark: I have to go now, can you remind me again when you next see me around?

12:06 <whitequark> sure

12:32 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 3 commits to master [+0/-0/±4] https://git.io/fhfqP

12:32 <_whitenotifier-6> [m-labs/nmigen] whitequark af7db88 - hdl.mem: use different naming for array signals.

12:32 <_whitenotifier-6> [m-labs/nmigen] whitequark 7ae7683 - back.pysim: give numeric names to unnamed subfragments in VCD.

12:32 <_whitenotifier-6> [m-labs/nmigen] whitequark 48d13e4 - back.pysim: handle out of bounds ArrayProxy indexes.

12:34 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470959501?utm_source=github_status&utm_medium=notification

12:34 <_whitenotifier-6> [nmigen] Success. 79.67% (+<.01%) compared to e58d9ec - https://codecov.io/gh/m-labs/nmigen/commit/48d13e47ec085bb8921bf7bff77803a17cab3fe1

12:34 <_whitenotifier-6> [nmigen] Success. 100% of diff hit (target 79.67%) - https://codecov.io/gh/m-labs/nmigen/commit/48d13e47ec085bb8921bf7bff77803a17cab3fe1

13:05 <whitequark> sb0: all of glasgow gateware tests pass with nmigen.

13:05 <whitequark> ok, almost all, one test uses FIFO, which isn't shimmed yet...

13:16 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 2 commits to master [+0/-0/±4] https://git.io/fhfOF

13:16 <_whitenotifier-6> [m-labs/nmigen] whitequark fa2af27 - hdl.mem: ensure transparent read port model has correct latency.

13:16 <_whitenotifier-6> [m-labs/nmigen] whitequark 568d3c5 - compat: provide Memory shim.

13:18 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470976473?utm_source=github_status&utm_medium=notification

13:24 hartytp has joined #m-labs

13:25 <hartytp> white quark: "hopefully artiq will run on nmigen (in compat mode) before end of year " cool!

13:25 <hartytp> sb0: "the only potential issue I see with RTM DRTIO is the size of the satman firmware that has to fit in BRAM " shall we stick a bigger Artix on the RTM?

13:26 <hartytp> we can always go back to a smaller one in a future design revision once we've finished debugging

13:26 <hartytp> but right now I need something that works and FPGAs are cheaper than code optimization

13:28 <hartytp> "hartytp: so all SPI would be RTIO/realtime then? "

13:28 <hartytp> "then it's exactly the same driver as mirny "

13:28 <hartytp> yes and yes

13:28 <hartytp> seems like a much nicer way of doing things so long as RTM DRTIO works

13:28 <hartytp> if it doesn't then we can fall back to doing things in fw

13:28 <hartytp> but that feels like a hack

13:29 <hartytp> "btw when will you test the PLL chip for phase determinism? that would be a big problem if it doesn't work "

13:29 <hartytp> soon. The eval board is waiting for me in Ox, but I'm in the US now for Christmas

13:29 <hartytp> will test it in first week of Jan

13:30 <hartytp> but, even if it doesn't work, it's still a better choice than the HMC830. We'd just need to add some extra logic to measure the phase and reset until it gives us what we want

13:30 <hartytp> "rewriting into kernels will add some more development time before the board does anything, though, and joe will likely be unhappy with that. and it seems joe still hasn't learned from what happens when he pushes unrealistic deadlines "

13:30 <hartytp> not necessarily

13:31 <hartytp> my plan would be that I do the port to kernels in a branch

13:31 <hartytp> you can focus on getting the existing system working without synchronisation (which no one needs straight away anyway)

13:32 <hartytp> and I'll get the kernel port working in parallel. Also a good time to clean up some of the code which is getting to be a bit of a mess

13:32 <hartytp> "actually, if we do that then we can strip the non-RT SPI support from the DRTIO firmware on the RTM side, and gain some memory "

13:32 <hartytp> that sounds like a good plan! Let's just make the RTM a minimal satman

13:38 <hartytp> "whitequark: hey maybe the new laser PDH locker can use Lattice "

13:38 <hartytp> why would that card need an FPGA? I'd assumed it would be a simple micro controller to just configure some SPI settings and do some non-realtime ADC readout for diagnostics

13:39 <hartytp> the PDH modulation/demodulation would usually be on a separate card to the servo. the servo might want an FPGA if you really want to push BW to the few MHz level (max you can get out of a diode)

13:40 <hartytp> but that would be on a different board, or could just use stabilizer if you are happy with BW in the hundreds of kHz range (which one usually is)

13:55 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 2 commits to master [+2/-0/±1] https://git.io/fhfGg

13:55 <_whitenotifier-6> [m-labs/nmigen] whitequark fc7da1b - hdl.ir: do not flatten instances or collect ports from their statements.

13:55 <_whitenotifier-6> [m-labs/nmigen] whitequark 00ef7a7 - compat: provide verilog.convert shim.

13:58 hartytp has quit [Ping timeout: 256 seconds]

13:58 <_whitenotifier-6> [nmigen] Success. The Travis CI build passed - https://travis-ci.org/m-labs/nmigen/builds/470992071?utm_source=github_status&utm_medium=notification

13:58 <_whitenotifier-6> [nmigen] Failure. 78.7% (-0.98%) compared to 48d13e4 - https://codecov.io/gh/m-labs/nmigen/commit/00ef7a78d38f542c39488df918e42013bbc65c56

13:59 <_whitenotifier-6> [nmigen] Success. Coverage not affected when comparing 48d13e4...00ef7a7 - https://codecov.io/gh/m-labs/nmigen/commit/00ef7a78d38f542c39488df918e42013bbc65c56

14:50 d_n|a has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]

15:06 key2 has joined #m-labs

15:10 rohitksingh has joined #m-labs

15:51 <sb0> hartytp: the 15t is the same silicon as the 50t :)

15:52 <sb0> but we can put devices that say 50t on the package and the idcode if that's safer...

15:54 <sb0> 26MHz or even 60 is much slower than xilinx. and I don't know why everyone uses picorv32, it's a pretty bad CPU

15:55 <sb0> whitequark: synthesizing the memory cells is a good idea. i was considering doing it within migen, but it's even better if we can reuse the yosys code

15:56 hartytp has joined #m-labs

15:56 <sb0> note that SyncFIFOBuffered uses an external register to turn the RAM read from async into sync

15:56 <sb0> so it will need rewriting

15:56 <hartytp> sb0: you mean that we can probably tell vivid we have a 50T and flash that onto a 15T and it will probably work?

15:56 <sb0> iirc that's the only significant place where this trick is used

15:57 <hartytp> s /vivid /vivado

15:57 <sb0> hartytp: no, it will fail idcode check. so that and the bitstream CRC need to be edited after synthesis.

16:01 <hartytp> so, question is how many Sayma boards we'll make in the next few years and the cost of FPGAs versus cost of paying someone to hack bitstreams

16:01 <hartytp> my guess is the FPGAs are cheaper ;)

16:04 <sb0> oh it's a trivial hack. i've been wondering if it should be enabled in migen by default.

16:04 <sb0> that and smashing other obnoxious vivado features such as webtalk

16:06 <hartytp> well, ultimately I'll leave FPGA-related decisions up to you and rjo

16:07 <hartytp> my only comment is that Sayma working quickly is quite valuable to me, so if we're going to rely on doing this then it had better work with minimal fuss

16:10 <sb0> yeah, maybe we can stuff 50t on the protos and use 15t later...

16:11 <sb0> if that needs to be done in a hurry then i'd say go for 50t

16:11 <sb0> they're pin compatible

16:12 hartytp has quit [Ping timeout: 256 seconds]

16:12 <sb0> but then those protos will have a small compatibility issue

16:12 hartytp has joined #m-labs

16:12 <hartytp> sb0: ack

16:13 <hartytp> the real question is how many Sayma boards are ever going to be produced

16:13 <hartytp> if they become widely used then there will be plenty of incentive to improve things

16:14 <hartytp> but, right now, we have a really expensive RF card produced in small quantities and we're worrying about like $30 worth of additional FPGA costs. Unless the batch size increases that FPGA cost is nothing compared to even relatively trivial gateware work

16:15 <hartytp> anyway, as I said, your call.

16:16 <hartytp> if we need to, TS/Creotech also offer FPGA replacement pretty cheaply (we've had some dead Kaslis fixed that way).

16:16 <sb0> so the 15T has ~100kbytes of BRAM

16:16 <sb0> (usable)

16:17 <sb0> and the 50T has 300

16:18 <sb0> without any optimization and with switching support, satman is at 88K of code

16:20 <sb0> you need some memory on top of that for bss and stack

16:21 <sb0> if we really want to spend the absolute minimum amount of time on this, then the 50t is a safer choice

16:24 <hartytp> let's assume Sayma is >$500 per channel (so $4k for the AMC + RTM), which would be really good going for an RF card like this and is almost certainly an underestimate of actual costs

16:25 <hartytp> so adding $40 for the RTM is a 1% increase on the cost.

16:25 <hartytp> obviously it's a slippery slope if one takes that attitude everywhere, but in this case it seems like a no brainer to me

16:26 <hartytp> ultimately, the thing that is most likely to kill Sayma as a project is lots of small issues causing delays until everyone gets fed up and gives up

16:26 <hartytp> anywhere where we can inject a small amount of cash to reduce time/risks seems worth doing to me

16:28 <sb0> oh but hacking xilinx fpgas is fun

16:28 <sb0> unlike e.g. microtca supplies not working

16:29 <hartytp> haha

16:29 <hartytp> well, up to you

16:29 <sb0> let's go for 50t, we can populate the other one later, and we have sayma v1 to test

16:30 <hartytp> really nothing more I can add to this other than to reiterate that currently my plans for Sayma rely on DRTIO so I need it to work fairly fast

16:30 <hartytp> :)

16:34 hartytp has quit [Ping timeout: 256 seconds]

16:49 <cr1901_modern> sb0: The idea behind picorv32 is that it can in most designs it can be run as a control CPU without requiring a separate clock domain between your speed-sensitive logic and the CPU. It runs at like 700MHz on Virtex devices.

16:50 <cr1901_modern> But of course to meet that goal, everything is registered, and CPI is > 1. The fact that it is small is a nice side effect.

17:00 m4ssi has quit [Remote host closed the connection]

17:25 <key2> whitequark: we managed to port minerva to nMigen

17:25 <key2> but that crashes yosys

17:26 <key2> (have not tested it yet tho, just generated)

17:26 <key2> so we generate the rtlil, and use yosys to do opt before generating the verilog

17:31 <GitHub-m-labs> [artiq] drewrisinger opened issue #1223: Docs: wrong path for https://github.com/m-labs/artiq/issues/1223

17:51 <GitHub-m-labs> [artiq] drewrisinger opened issue #1224: Docs: artiq_flash has no -m option https://github.com/m-labs/artiq/issues/1224

17:53 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1224: Yes, send a PR. https://github.com/m-labs/artiq/issues/1224#issuecomment-449455028

18:07 rohitksingh has quit [Ping timeout: 250 seconds]

18:28 key2_ has joined #m-labs

18:29 key2 has quit [Quit: Page closed]

18:29 key2_ is now known as key2

19:06 rohitksingh has joined #m-labs

19:17 nurelin_ has quit [Ping timeout: 245 seconds]

19:17 nurelin_ has joined #m-labs

20:47 rohitksingh has quit [Remote host closed the connection]

22:57 <whitequark> key2: ah nice

22:59 <whitequark> 15:56 < sb0> note that SyncFIFOBuffered uses an external register to turn the RAM read from async into sync

22:59 <whitequark> Yosys synthesis is smart enough to fold that into memory cells.

22:59 <whitequark> 15:54 < sb0> 26MHz or even 60 is much slower than xilinx. and I don't know why everyone uses picorv32, it's a pretty bad CPU

23:00 <whitequark> iCE40 is the slowest FPGA series currently on market

23:00 <whitequark> it's optimized for power consumption not fabric speed

23:52 mumptai has joined #m-labs

23:53 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 1 commit to master [+0/-0/±2] https://git.io/fhfp0

23:53 <_whitenotifier-6> [m-labs/nmigen] whitequark 913339c - hdl.ir: fix port propagation between siblings.

23:55 <_whitenotifier-6> [m-labs/nmigen] whitequark pushed 1 commit to master [+0/-0/±1] https://git.io/fhfpz

23:55 <_whitenotifier-6> [m-labs/nmigen] whitequark a4183eb - hdl.mem: use more informative signal naming for ports.