#m-labs on 2018-06-04 — irc logs at freenode.irclog.whitequark.org

2015-03-04 14:45 sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

00:06 <GitHub168> [smoltcp] jhwgh1968 commented on issue #106: I've been thinking about using smoltcp for a project of mine. If no one else has started on this task yet, I could give it a try over the next week or two. https://github.com/m-labs/smoltcp/issues/106#issuecomment-394203335

01:14 <sb0> tpw_rules, go ahead

01:15 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #998: I don't think this has anything to do with the LOCs. https://github.com/m-labs/artiq/issues/998#issuecomment-394209765

01:15 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1043: Yes. https://github.com/m-labs/artiq/issues/1043#issuecomment-394209776

02:04 kaolpr has quit [Ping timeout: 256 seconds]

02:04 kaolpr has joined #m-labs

03:29 rohitksingh_work has joined #m-labs

03:47 futarisIRCcloud has joined #m-labs

04:28 rohitksingh_work has quit [Ping timeout: 260 seconds]

04:34 rohitksingh_work has joined #m-labs

04:38 sb0 has quit [Quit: Leaving]

04:52 mumptai has joined #m-labs

04:59 sb0 has joined #m-labs

06:27 sb0 has quit [Quit: Leaving]

06:36 futarisIRCcloud has quit [Quit: Connection closed for inactivity]

06:45 <rjo> hartytp: it's good modulo a few details.

06:50 <GitHub-m-labs> [artiq] jordens opened pull request #1046: Suservo docs (master...suservo_docs) https://github.com/m-labs/artiq/pull/1046

06:51 <GitHub-m-labs> [artiq] jordens closed pull request #1046: Suservo docs (master...suservo_docs) https://github.com/m-labs/artiq/pull/1046

07:02 mumptai has quit [Quit: Verlassend]

07:19 sb0 has joined #m-labs

07:28 <GitHub-m-labs> [artiq] jordens pushed 1 new commit to master: https://github.com/m-labs/artiq/commit/bb87976d4fdb6163c9a7fecad980630df9e1d3ae

07:28 <GitHub-m-labs> artiq/master bb87976 Robert Jordens: suservo: docstring fixes, revert parametrization of r_rtt

07:34 FabM has quit [Quit: ChatZilla 0.9.93 [Firefox 52.8.0/20180509233012]]

07:38 <bb-m-labs> build #1607 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1607

07:42 <bb-m-labs> build #2416 of artiq is complete: Failure [failed python_unittest_2] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2416 blamelist: Robert J?rdens <rj@quartiq.de>

07:51 <GitHub53> [smoltcp] jD91mZM2 opened issue #226: Ethernet::poll infinite loop https://github.com/m-labs/smoltcp/issues/226

08:10 <GitHub-m-labs> [artiq] hartytp commented on issue #1043: @sbourdeauducq my question was addressed to @jbqubit (see the text I quoted) as he is still using 2017.4, where Sayma didn't meet timing for me... https://github.com/m-labs/artiq/issues/1043#issuecomment-394269914

08:11 <GitHub-m-labs> [artiq] hartytp commented on commit bb87976: Thanks.... https://github.com/m-labs/artiq/commit/bb87976d4fdb6163c9a7fecad980630df9e1d3ae#commitcomment-29229123

08:25 <bb-m-labs> build #1608 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1608

08:29 <bb-m-labs> build #2417 of artiq is complete: Failure [failed python_unittest_2] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2417 blamelist: Robert Jordens <jordens@gmail.com>

08:57 cjbe_ has joined #m-labs

09:02 cjbe_ has quit [Ping timeout: 256 seconds]

09:42 hartytp has joined #m-labs

09:42 <hartytp> remind me what you use to reset Sayma USB JTAG lockups

09:42 <hartytp> (sb0,whitequark: ^)

09:43 <sb0> hartytp, usbreset.c

09:46 <hartytp> thanks!¬

09:46 <hartytp> thanks!

09:59 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1045: > As a bonus this will make the compiler faster (though IIRC I didn't use any particularly expensive asserts).... https://github.com/m-labs/artiq/issues/1045#issuecomment-394300809

10:24 <sb0> is there still no website, cfp, etc. for ecti5?

10:54 sb0 has quit [Quit: Leaving]

11:21 sb0 has joined #m-labs

11:24 <_florent_> hartytp: we can discuss here if you want for the hmc7043 traces

11:25 hartytp_ has joined #m-labs

11:25 <hartytp_> _florent_ sounds good

11:26 <_florent_> ok, so it's always crashing after https://github.com/m-labs/artiq/blob/master/artiq/firmware/libboard_artiq/hmc830_7043.rs#L121?

11:26 <_florent_> hmm sorry

11:26 <_florent_> that's the hmc830

11:26 hartytp_ has quit [Client Quit]

11:26 <hartytp> seenms si

11:26 <hartytp> so

11:26 <_florent_> ok so here: https://github.com/m-labs/artiq/blob/master/artiq/firmware/libboard_artiq/hmc830_7043.rs#L279

11:27 <hartytp> https://github.com/m-labs/artiq/blob/bb87976d4fdb6163c9a7fecad980630df9e1d3ae/artiq/firmware/libboard_artiq/hmc830_7043.rs#L279

11:27 <hartytp> yes

11:27 <_florent_> can we try to do the hmc7043 configuration but mute the outputs?

11:29 <_florent_> so set bit3 of register 1

11:29 <hartytp> yep, can try that

11:29 <_florent_> https://github.com/m-labs/artiq/blob/master/artiq/firmware/libboard_artiq/hmc830_7043.rs#L242

11:29 <hartytp> maybe even better would be to just leave it shutdown?

11:30 <_florent_> change that to write(0x1, 0x48);

11:30 <GitHub-m-labs> [artiq] cjbe opened pull request #1047: correct documented siphaser VCO frequency [NFC] (master...siphaserdoc) https://github.com/m-labs/artiq/pull/1047

11:32 <_florent_> if it stops crashing, maybe something we can try next is to enable the outputs only when the rest of the configuration has been done

11:34 <hartytp> yes, I did implement that before (change the shutdown function to mute and then only call unmute after the init)

11:34 <hartytp> it didn't help then, but we've fixed a few things since, so maybe now it will do something

11:34 <hartytp> I'll try

11:34 <_florent_> ok thanks

11:34 <hartytp> but, might be shutting the proverbial stable door after the horse has kicked the shit out of our FPGA

11:35 <hartytp> i.e. we can't mute the 7043 until after boot has been completed, so maybe enough time for it to cause memory corruption, etc that only shows up later on?

11:36 <_florent_> ah yes indeed...

11:36 <hartytp> hmmm what about using the reset line

11:38 <_florent_> now that you are able to see the broadband noise, do you see if we only have it on the first start after power on, or if we have it at each restart?

11:39 <hartytp> each restart

11:39 <hartytp> (each time I call artiq_flash ... load)

11:39 <_florent_> ok interesting, i was thinking it was only at the first power on

11:39 <hartytp> nope

11:40 <hartytp> I guess that loading the RTM FPGA resets things

11:40 <_florent_> ok

11:40 <hartytp> (regulators?)

11:40 <hartytp> do you remember how the resets work

11:40 <hartytp> ?

11:40 <hartytp> looking on schematic sheet 9, it looks like the Si5324 and HMC7043 reset lines are tied together

11:41 <hartytp> I guess we don't need the SI5324 atm, so I can hold both in reset and see what happens

11:41 <_florent_> no, but i'm going to look too.

11:43 <hartytp> yes, it resets both chips

11:44 <hartytp> okay, I'll add a CSR to control the HMC7043 reset and see what happens if I keep it disabled until after HMC830 boot...

11:45 <_florent_> yes we can do that

11:46 <_florent_> do you want i add this?

11:47 <hartytp> I think I'm fine doing it

11:49 <_florent_> ok good

11:51 <hartytp> remind me: on the AMC, where do you disable the inputs from the HMC7043 during boot?

11:52 <_florent_> ah, i was also thinking about disabling this feature to be sure we really eliminate the broadband noise :)

11:52 <_florent_> i'm looking at the code

11:53 <_florent_> so here: https://github.com/m-labs/artiq/blob/master/artiq/gateware/targets/sayma_amc.py#L62

11:53 <_florent_> and here: https://github.com/m-labs/artiq/blob/master/artiq/gateware/targets/sayma_amc.py#L72

11:53 <_florent_> if you regenerate the AMC, you can replace the self.jreset.storage with 1

11:55 <hartytp> 1 or 0?

11:56 <hartytp> 1 would keep it disabled

11:56 <hartytp> if you want to really test for noise then 0 might be better....

11:56 <_florent_> ah yes sorry, 0

12:04 <GitHub-m-labs> [artiq] enjoy-digital pushed 1 new commit to master: https://github.com/m-labs/artiq/commit/925b47b077b5f84e2ec7a0c1b222afe6f7b249f9

12:04 <GitHub-m-labs> artiq/master 925b47b Florent Kermarrec: firmware/ad9154: reset the dac between each configuration attempt

12:04 <_florent_> hartytp: ^ it will may be help for the serdes pll timeout issue

12:06 <_florent_> hartytp: to get the serdes pll to lock, the dac only needs a correct clock and configuration.

12:07 <_florent_> hartytp: i don't explain why it fails getting serdes pll to lock, but that' probably a different issues than the hang/crash.

12:09 <hartytp> agreed

12:30 rohitksingh_work has quit [Read error: Connection reset by peer]

12:30 <rjo> bb-m-labs: force build --props=package=artiq-board,artiq_target=kasli,artiq_variant=ptb artiq-board

12:30 <bb-m-labs> The build has been queued, I'll give a shout when it starts

12:48 <bb-m-labs> build #1609 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1609

12:48 <bb-m-labs> build forced [ETA 34m41s]

12:48 <bb-m-labs> I'll give a shout when the build finishes

12:54 <GitHub-m-labs> [artiq] sbourdeauducq pushed 1 new commit to master: https://github.com/m-labs/artiq/commit/07d4145a35c73978b2ef4b648bcf8630ea43a1e1

12:54 <GitHub-m-labs> artiq/master 07d4145 Chris Ballance: correct documented siphaser VCO frequency [NFC]

12:58 <bb-m-labs> build #2418 of artiq is complete: Failure [failed python_unittest_2] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2418 blamelist: Florent Kermarrec <florent@enjoy-digital.fr>

13:00 <hartytp> _florent_: building this atm https://github.com/hartytp/artiq/tree/hmc7043_rst

13:02 <_florent_> hartytp: ok that seems fine

13:02 <GitHub-m-labs> [artiq] jbqubit commented on issue #1043: > FWIW, with 2018.1 I've run two different Sayma boards (after the various fixes for bugs like SDRAM, HMC7043 noise, 1V8, etc.) continuously for days without any bug of this sort.... https://github.com/m-labs/artiq/issues/1043#issuecomment-394346575

13:05 <hartytp> (building without sawg because life is too short)

13:07 <_florent_> hartytp: yes, if rtm is already build, you can probably also start doing some tests without the input buffers always enabled on the AMC.

13:08 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1043: Always running the same kernel that uses SAWG, but there was Ethernet traffic processed by the comms CPU due to TCP keepalive and network broadcasts.... https://github.com/m-labs/artiq/issues/1043#issuecomment-394348143

13:23 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #984: This is also affecting 3.6. https://github.com/m-labs/artiq/issues/984#issuecomment-394352659

13:24 <bb-m-labs> build #1610 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1610

13:27 <hartytp> uurgh, looking closely at my HMC7043, it seems that one of Greg's bits of rework has come loose

13:30 <_florent_> ah, what was this rework supposed to do?

13:30 <hartytp> I actually don't know

13:30 <hartytp> looks like it was pulling reset low

13:31 <hartytp> fyi, the dac reset patch didn't help the pll lock issues I see atm

13:32 <hartytp> yes, see the erata

13:32 <hartytp> damn

13:33 <hartytp> I think he cut the f****** trace

13:33 <_florent_> sorry, which trace?

13:33 <hartytp> HMC7043 reset

13:33 rohitksingh has joined #m-labs

13:33 <_florent_> ah ok

13:34 <hartytp> one big mess up with Sayma is using the smallest traces and vias allowed with all routing on internal layers and no test points

13:34 <_florent_> maybe that's when he was investigating on the hmc7043

13:34 <hartytp> I think this was right when the boards first arrived

13:35 <hartytp> https://github.com/sinara-hw/sinara/issues/216

13:35 <hartytp> near the top

13:35 <GitHub-m-labs> [artiq] jbqubit commented on issue #1043: I'll use 2018.1 going forward. --without-sawg builds with problem. Now building with SAWG.... https://github.com/m-labs/artiq/issues/1043#issuecomment-394356634

13:35 <hartytp> I'll have a look, but I'm not really set up for that kind of rework

13:36 <GitHub-m-labs> [artiq] jbqubit commented on issue #1044: @sbourdeauducq Do you see this on your hardware? https://github.com/m-labs/artiq/issues/1044#issuecomment-394356809

13:36 <hartytp> haven't the optics or the soldering tips

13:36 <_florent_> i see, so the hmc7043 reset is floating?

13:37 <hartytp> just on my board right now yes

13:37 <hartytp> on other boards, it's tied to gnd afaict

13:37 <hartytp> there should be a small wire that Greg added on pin 5

13:37 <hartytp> mine has travelled so much that it's come loose at some point

13:38 <hartytp> there comes a point where one has to spin up new boards, rather than dealing with too many fragile hacks

13:38 <_florent_> hartytp: could this explain the broadband noise you see and that don't seems to happen on others boards (except maybe joe's board)

13:39 <hartytp> probably not

13:39 <hartytp> but who knows

13:39 <hartytp> let me try to fix it

13:39 <_florent_> ok

13:44 <hartytp> re grounded it

13:44 <hartytp> noise is still there

13:45 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1043: > --without-sawg builds with problem.... https://github.com/m-labs/artiq/issues/1043#issuecomment-394359722

13:46 <_florent_> ok

13:46 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1044: As pointed out elsewhere by Tom and Robert, SAWG tests are not relevant until further JESD204 testing and debugging. https://github.com/m-labs/artiq/issues/1044#issuecomment-394360087

13:46 <_florent_> and are you able to reconnect it to the fpga trace?

13:52 <hartytp> hmmm right now, I'm getting a lot of serwb issues and the 830 is not identifying

13:52 <hartytp> ffs

13:55 <_florent_> then maybe you should use your last amc bistream with the enable on the buffers

13:55 <GitHub-m-labs> [artiq] jbqubit commented on issue #1043: > --without-sawg builds with problem. ... https://github.com/m-labs/artiq/issues/1043#issuecomment-394363070

13:55 <hartytp> I didn't change the input buffer enables yet

13:56 <_florent_> so you only changed the rtm?

13:57 <hartytp> and the AMC runtime https://github.com/hartytp/artiq/tree/hmc7043_rst

13:59 <bb-m-labs> build #1611 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1611

14:01 <hartytp> okay either serwb init fails or hmc830 acks 0

14:03 <bb-m-labs> build #2419 of artiq is complete: Failure [failed python_unittest_2] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2419 blamelist: Chris Ballance <chris.ballance@physics.ox.ac.uk>

14:05 <_florent_> and the hmc7043 rst is floating or connected to the trace?

14:05 <hartytp> was grounded

14:05 <hartytp> now tied to 3v3

14:06 <hartytp> will tell you what happens when I get JTAG to stop playing silly buggers

14:09 <hartytp> okay, tying the reset high does stop the noise

14:18 <_florent_> ok, and are you able to connect it to the trace or is it complicated?

14:20 <hartytp> hmc830 is just not acking

14:20 <hartytp> I saw this once before where it stopped responding

14:20 <hartytp> I left it for an hour and it started again

14:20 <hartytp> may be some thermal thing...

14:20 <hartytp> **shudder**

14:23 <_florent_> ok, so maybe you should power off the board, try to connect the rst to the trace, and continue the test in one hour

14:27 hartytp has quit [Ping timeout: 260 seconds]

14:31 hartytp has joined #m-labs

14:31 <hartytp> I'll have another quick go in the morning.

14:32 <hartytp> but, I'm beginning to think that it's better to just focus on the boards that work

14:32 <GitHub-m-labs> [artiq] jbqubit commented on issue #1043: Using latest from master 20180604 with SAWG vivado 2018.1 07d4145a35c739. Meets timing. I've run 25 scripts involving SAWG via Ethernet. I don't see any errors on UART. https://github.com/m-labs/artiq/issues/1043#issuecomment-394375956

14:32 <hartytp> the board to board variation could well be some piece of rework that's failed

14:42 <GitHub-m-labs> [artiq] jbqubit commented on issue #1026: Using latest from master 20180604 with SAWG vivado 2018.1 07d4145a35c739. Meets timing. I've run 25 scripts involving SAWG via Ethernet. No panics. https://github.com/m-labs/artiq/issues/1026#issuecomment-394379327

14:44 hartytp has quit [Ping timeout: 260 seconds]

14:55 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1026: So you fixed Ethernet? https://github.com/m-labs/artiq/issues/1026#issuecomment-394384373

14:58 hartytp has joined #m-labs

14:59 <hartytp> _florent_ did the JTAG rework that Greg suggested (short pins 11 and 13 with a solder blob)

14:59 <hartytp> board seems much happier now

15:00 <hartytp> with the HMC7043 enable tied high, 5 out of 5 times I get to SERDES PLL lock timeout (now expected since there is no output from the HMC7043)

15:00 <hartytp> scope verifies that there is no noise on the HMC7043 output during boot

15:01 <hartytp> spoke too soon, now I have 2/2 serwb init failed, so that's definitely not connected to the 7043

15:01 <GitHub81> [smoltcp] pothos commented on issue #224: Hello,... https://github.com/m-labs/smoltcp/issues/224#issuecomment-394386392

15:01 <hartytp> in that case, this doesn't seem so different to having the 7043 enabled

15:02 <hartytp> well, not quite true, it has not crashed after HMC7043 init even once, so maybe that was a noise issue from the 7043

15:02 <hartytp> I'll see if I can do the 7043 rework

15:03 <GitHub40> [smoltcp] pothos commented on issue #224: Ah, I applied the proposed patch with a saturating add, btw. So the numbers jumping back again should not be caused by overflows. https://github.com/m-labs/smoltcp/issues/224#issuecomment-394387036

15:07 <GitHub-m-labs> [artiq] jbqubit commented on issue #1026: Yes. https://github.com/sinara-hw/sinara/issues/553#issuecomment-394362405 https://github.com/m-labs/artiq/issues/1026#issuecomment-394388599

15:09 rohitksingh has quit [Ping timeout: 255 seconds]

15:12 rohitksingh has joined #m-labs

15:14 <_florent_> hartytp: yes we have many problems together, so that's a bit difficult

15:15 <GitHub-m-labs> [artiq] jbqubit commented on issue #1022: What I assume:... https://github.com/m-labs/artiq/issues/1022#issuecomment-394391309

15:16 <_florent_> hartytp: i would not focus too much on serwb for now

15:16 <_florent_> hartytp: for now let's try to get rid of the crashes

15:18 <_florent_> hartytp: if you are able to do the 7043 rework and see that crashes stop, then i would recommend regenerating serwb with 1gbps linerate

15:21 <_florent_> hartytp: also if you no longer have noise due to hmc7043, there is no reason to use low speed serwb, we should be able to use the 1gbps version on all boards

15:24 rohitksingh has quit [Read error: Connection reset by peer]

15:27 <hartytp> how do I enable 1GSPS line rate

15:28 <hartytp> okay, that seems to have worked!

15:32 <rjo> hartytp: github annoyingly put me as the author of that commit of yours. sorry about that.

15:33 <hartytp> np

15:33 <hartytp> thanks again for all the work on the servo

15:34 <hartytp> it's really lovely

15:34 <sb0> rjo, I still find it a bit strange that the various sawg problems would stem from jesd breakage... https://github.com/m-labs/artiq/issues/1022#issuecomment-394391309

15:35 <sb0> there is definitely jesd breakage but I'm not sure if that explains everything

15:36 <hartytp> sb0: probably not, but shall we try to fight one fire at a time?

15:36 <sb0> I've done one test where I set a small amplitude in the SAWG, but the generated signal would still be full-range; samples getting swapped all over the place would not explain that

15:41 rohitksingh has joined #m-labs

15:41 <sb0> whitequark, ping

15:42 <rjo> jesd (at least the core) splits it up into nibbles.

15:48 <rjo> i'd debug sawg right now if i knew where to look. imho the proper way to approach this is with prbs (check), stpl, and then the ramp generator.

15:50 <GitHub-m-labs> [artiq] hartytp commented on issue #794: To look at this, I changed the FPGA_CLOCK divider to 12 (100MHz output) and looked at J61 on a fast scope triggered from my 100MHz reference. I can confirm that the HMC7043 configuration currently used in ARTIQ master does not provide deterministic latency. I'll apply the patch I proposed above and recheck.... https://github.com/m-labs/artiq/issues/794#issuecomment-3

15:56 <_florent_> hartytp: ok, now that you no longer have crashes, can you use this patch to use 1gbps serwb?:

15:56 <_florent_> hartytp: https://hastebin.com/huxokiwowu.rb

15:57 <GitHub-m-labs> [artiq] hartytp commented on issue #794: Nope, even with that patch, the 7043 outputs don't have deterministic latency w.r.t. the reference. https://github.com/m-labs/artiq/issues/794#issuecomment-394406731

15:57 <hartytp> will try that tomorrow

15:57 <GitHub-m-labs> [artiq] hartytp commented on issue #794: Will dig into this further tomorrow. https://github.com/m-labs/artiq/issues/794#issuecomment-394406793

15:59 <_florent_> rjo: i'm going to add the stpl test

16:04 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #794: Great, thanks for all your help! https://github.com/m-labs/artiq/issues/794#issuecomment-394409078

16:08 <rjo> _florent_: thanks. just a quick q: what is the smallest granularity that jesd could end up doing wrong ordering or misalignment at? octets? nibbles?

16:09 <rjo> or maybe larsc ^^^

16:10 <_florent_> rjo: here we are in mode 0: http://www.analog.com/media/en/technical-documentation/data-sheets/AD9154.pdf p49

16:11 <_florent_> rjo: so i would say octets, but i have to have a closer look

16:13 <rjo> _florent_: ack.

16:14 <larsc> rjo: what do you mean with wrong ordering?

16:20 <larsc> if you have problems with amplitude, maybe MS octet and LS octet are swapped?

16:20 <larsc> although that should not be random

16:22 <larsc> when I look at your broken waveform I'd say offset binary vs two's complement problem, no idea how that would happen though

16:22 <rjo> larsc: well. i am not certain i'm asking the right question. lmfc alignment granularity is frames, frame alignment granularity is octets, right?

16:23 <larsc> I don't thing lmfc matters here, lmfc is just for determinisitc latency

16:24 <larsc> lane alignment is done based on the first non /K/ character that is received

16:25 <larsc> so unless you have a underflow/overflow in the transceiver after the link has been established things should stay aligned

16:27 <rjo> right. underflows is one thing.

16:28 <rjo> but https://pasteboard.co/HnSYE20.jpg that's from a counter that wraps around and outputs the same sample 4 times into the jesd core.

16:28 <rjo> that is a sample ordering issue.

16:28 <larsc> yes

16:29 mumptai has joined #m-labs

16:29 <larsc> but that kind of reordering would only happen in the FPGA

16:29 <larsc> never seen this in a DAC

16:30 <larsc> any CDC FIFOs?

16:30 <larsc> that almost looks like a gray counter

16:31 <rjo> larsc: hmm. yes.

16:31 <_florent_> rjo: could it be related to the elastic buffers?

16:32 <larsc> any 3 bit gray counters in your system?

16:34 <rjo> larsc: sure. EB depth comes to mind.

16:36 <rjo> larsc: why 3 bit?

16:37 <larsc> looks like 3-bit, I don't know

16:37 <rjo> _florent_: i don't know whether any of the sc1 changes are now "actually making use" of the EB.

16:39 <rjo> larsc: there is the 4-periodicity. that matches both the EB depth and the "samples-per-fabric-clock" number.

16:40 <_florent_> rjo: do you mean we should remove the EB?

16:41 <rjo> _florent_: i have no idea. in general: if the two sides of the EB always have the same phase then it can be removed.

16:44 <rjo> but i'd probably debug this with stpl first (assuming the EB is after the STPL gen).

16:44 <_florent_> rjo: no, the EB are not used for STPL

16:47 <rjo> ok. then let's still do stpl to ddx between upstream/downstream of the stpl injection point.

16:47 <swivel> win 11

16:47 <larsc> your data path width is 4? you always process 4 samples in 1 clock cycle?

16:47 <swivel> oops

16:49 <larsc> do you have different elastic buffers for different samples?

16:50 <larsc> or all samples through the same elastic buffer?

16:53 <rjo> iirc data path is 4 samples (certainly in the fabric up to the jesd core). i forget whether the jesd core continues then at 4 samples or at 2 samples. and then i don't know it goes. _florent_ is the man.

16:53 <larsc> it looks like half the sample are one clock cycle late/early, which makes no sense if they are always processed 4 at a time

16:53 <rjo> but i think it is 4 throughout including the EBs

16:56 <larsc> even if the order gets messed up, with a data path width of 4 and 4 consecutive samples with the same value there should at least be 2 consective samples in the output that have the same value

16:56 <rjo> sorry, 2 sample wide EB. from the looks of it.

16:57 <rjo> 2 EBs per channel. 1 per lane.

16:59 <rjo> and the eb is 4 entries deep.

17:01 <larsc> but 4 conecutive samples would be 1 entry in the EB

17:02 <rjo> yes.

17:03 <larsc> and if you generate 4 samples with the same value the only patterns we'd see are 0000 0001 0011 0111

17:03 mumptai_ has joined #m-labs

17:03 <larsc> so at least 2 samples with the same value

17:03 <larsc> even if the order in the EB is messed up

17:03 cjbe_ has joined #m-labs

17:03 mumptai_ has quit [Remote host closed the connection]

17:03 <larsc> but the pattern we see in the scope is 0101

17:04 <larsc> or rather 1010

17:04 <rjo> yes. what we are seeing is 1010 2121 3232...

17:04 cjbe has quit [Ping timeout: 256 seconds]

17:05 <larsc> makes no sense :)

17:05 <rjo> well. i don't know about the octets. that's a binary counter that has the lowest octet 0.

17:07 <rjo> and i don't know if the sequence assignment between the EBs is 02/13 or 01/23.

17:09 <larsc> and mode 0 is confirmed?

17:10 <rjo> i hope so.

17:10 <larsc> I could see this happen with mode 1

17:10 <larsc> where is the framer?

17:11 <rjo> url? or logically?

17:12 <larsc> url

17:12 <rjo> m-labs/jesd204b somewhere.

17:13 <larsc> https://github.com/m-labs/jesd204b/blob/master/jesd204b/transport.py

17:13 <rjo> but i think digging into this too much right now is pointless. let's wait for new data with fixed clock fanout and stpl.

17:16 <larsc> aha!

17:16 <larsc> you are using mode 1

17:16 <larsc> https://github.com/m-labs/artiq/blob/77fc5c599fd53fc7dcc9f48925def5c592618f6b/artiq/gateware/targets/sayma_amc.py#L80

17:17 rohitksingh has quit [Quit: Leaving.]

17:17 <larsc> in mode one if the entries in the EB are swapped the output makes sense again

17:17 <larsc> cause it is only 2 samples per 4 octets

17:18 <larsc> lane 0 has sample 0 and 1 and lane 1 has sample 2 and 3

17:18 <larsc> rather than each lane having 1 octet of each sample

17:26 <travis-ci> batonius/smoltcp#131 (master - 91f5891 : Dan Robertson): The build passed.

17:26 <travis-ci> Change view : https://github.com/batonius/smoltcp/compare/3fb0c22fd4ed...91f5891dbbea

17:26 <travis-ci> Build details : https://travis-ci.org/batonius/smoltcp/builds/387899076

17:27 <rjo> larsc: nice find! thanks

17:27 <rjo> _florent_: ^^

17:28 <rjo> indeed.

17:29 <rjo> would stpl have found that?

17:29 <_florent_> sorry, with family, i'll be back later

17:29 <travis-ci> batonius/smoltcp#133 (layers - 7d37aa0 : Egor Karavaev): The build is still failing.

17:29 <travis-ci> Change view : https://github.com/batonius/smoltcp/compare/f0497536802a...7d37aa0b5ecb

17:29 <travis-ci> Build details : https://travis-ci.org/batonius/smoltcp/builds/387899568

17:29 <rjo> _florent_: ok.

17:29 <larsc> I assume that one of the EBs glitches by 1 clock cycle relative to the other

17:30 <rjo> "glitches"? or "ends up being ahead of the other"?

17:31 <rjo> and in addition to the mode1/mode0 error?

17:33 <GitHub-m-labs> [artiq] jbqubit commented on issue #794: Analog Devices talks about deterministic latency of HMC7043 this in a report on a huge clock tree.... https://github.com/m-labs/artiq/issues/794#issuecomment-394435901

17:37 sb0 has quit [Ping timeout: 245 seconds]

17:37 <larsc> rjo: here is what happens https://image.ibb.co/jW2vQ8/jesd.png

17:38 <larsc> I'm pretty sure the DAC is also configured for mode 1, otherwise the output would never be correct

17:39 <rjo> yes. mode1 might be fine per se.

17:39 <rjo> larsc: i need a legend to read that figure ;) but i think i know what you mean.

17:41 <larsc> https://image.ibb.co/j9vbXo/jesd.png

17:41 <larsc> this might be clearer

17:41 <rjo> larsc: the way our EB is implemented is also without any flow control other than reset. it assumes that after a reset the phase won't make excursions beyond depth/2

17:41 <larsc> top is when it works

17:41 <larsc> bottom is when it doesn't

17:41 <larsc> and you can see in the bottom half lane 1 is 1 clock cycle behind lane 0

17:41 <larsc> samples are read in the order A, B, C, D

17:41 <rjo> yes. one lane+EB being one sample deeper than the other

17:42 <rjo> well beyond depth/2 - 1 = 1

17:46 <GitHub-m-labs> [artiq] hartytp commented on issue #794: Joe I'm being daft and measuring the wr on thing. That was never going to work as I was measuring ref to hmc output phase which we can't control. Should have measured phase between hmc7043 outputs! https://github.com/m-labs/artiq/issues/794#issuecomment-394439986

17:47 <GitHub-m-labs> [artiq] hartytp commented on issue #794: Well can't = can't via SPI alone https://github.com/m-labs/artiq/issues/794#issuecomment-394440352

17:49 <larsc> I'd assume the EB doesn't get it's reset properly or the reset has a asynchronous de-assert or something like that

18:41 <GitHub-m-labs> [artiq] whitequark commented on issue #1007: Yes. I tried to fix this purely in the ARTIQ compiler, and it didn't work. Specifically, hoisting invariant loads requires inlining, which requires devirtualization, which is quite hard to implement due to Python's semantics. (We had devirtualization to support compiler-assisted interleaving, but it broke a while ago, and I wasn't successful in fixing it).... htt

18:53 <larsc> does each for your EBs has its own reset synchronizer?

18:53 <larsc> that would explain it

18:53 <larsc> https://github.com/m-labs/migen/blob/02bccefbb7072f0ca953d86cca7783689a150e0e/migen/genlib/cdc.py#L174-L176

18:54 <larsc> so, fix is one EB to rule them all

18:56 <_florent_> larsc: i'm back, thanks a lot for looking at this and for the infos

18:57 <_florent_> so you are suggesting to have the same write/read reset for all the EBs?

18:58 <larsc> where is the /K/ generated? before or after the EB?

19:00 <larsc> if I read the source correctly it is after the EB

19:00 <larsc> so what is happening is that your EBs introduce a random offset of one clock cycle on each lane for the data stream

19:01 <larsc> but at the same time /K/ and ILAS do not have this random offset since they are after the EB and have the same control signals

19:01 <larsc> so two options

19:01 <larsc> either place EB after the link

19:01 <larsc> or use the same EB for all lanes

19:01 <larsc> I'd do the later

19:01 <larsc> no need to have a read/write pointer for each EB

19:03 <larsc> ah I see

19:03 <_florent_> yes but the EB are here because all lanes are operating with their own clocks (same frequency but phase can vary)

19:03 <larsc> you want to have the option to have one clock for each phy

19:04 <larsc> you need to swap the order of link and EB here https://github.com/m-labs/jesd204b/blob/c4632479eaf2c5f436ed7ac89dde59d2782ecd7a/jesd204b/core.py#L76-L79

19:04 <_florent_> larsc: ideally, we should run all the phy with the same clock and use the "tx multilane alignement" of the transceiver

19:05 <_florent_> larsc: but that's not what we did here, maybe we'll do that later, but i see what you mean and i'll probably implement that first

19:05 <larsc> the other alternative is to make sure that all the EBs use the same reset on the write side

19:10 <larsc> but this is a bit tricky to implement using the current code

19:10 <larsc> I think swapping the order and run all of the link in the same domain will work

19:11 <larsc> the jesd204 protocol will then filter out that additional skew from the EBs

19:13 <_florent_> thanks, i'll try that

20:46 <GitHub-m-labs> [artiq] jbqubit commented on issue #1022: @sbourdeauducq @jordens What's the next step here? It looks to me like there may be both JESD and SAWG bugs here. https://github.com/m-labs/artiq/issues/1022#issuecomment-394493440

21:25 <GitHub-m-labs> [artiq] hartytp commented on issue #1022: Probably just a small JESD issue. Fix on the way. See IRC logs... https://github.com/m-labs/artiq/issues/1022#issuecomment-394504890

21:26 jbqubit has joined #m-labs

22:21 <GitHub-m-labs> [artiq] cjbe opened issue #1048: SUServo channel number confusion https://github.com/m-labs/artiq/issues/1048

22:40 <GitHub-m-labs> [migen] enjoy-digital pushed 1 new commit to master: https://github.com/m-labs/migen/commit/33bb06ab3a61f803d5ae94acd14bea855615db39

22:40 <GitHub-m-labs> migen/master 33bb06a Florent Kermarrec: genlib/cdc: add optional master parameter to ElasticBuffer to allow sharing write reset between ElasticBuffers

22:44 <bb-m-labs> build #276 of migen is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/migen/builds/276

23:34 <GitHub50> [smoltcp] barskern commented on issue #224: @pothos I found a bug in the fast retransmit code based on your logs! ... https://github.com/m-labs/smoltcp/issues/224#issuecomment-394533109

23:35 mumptai has quit [Remote host closed the connection]