#m-labs on 2018-03-03 — irc logs at freenode.irclog.whitequark.org

2015-03-04 14:45 sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

00:48 marmelada has quit [Quit: Page closed]

01:49 attie has quit [Ping timeout: 245 seconds]

01:49 attie has joined #m-labs

02:40 attie has quit [Ping timeout: 240 seconds]

02:40 attie has joined #m-labs

02:43 <GitHub-m-labs> [artiq] klickverbot opened issue #943: try/finally on kernel corrupts passed-through host exception name https://github.com/m-labs/artiq/issues/943

02:45 <GitHub-m-labs> [artiq] whitequark commented on issue #943: Probably [this](https://github.com/m-labs/artiq/blob/master/artiq/firmware/ksupport/eh.rs#L403-L405). https://github.com/m-labs/artiq/issues/943#issuecomment-370110454

02:48 <GitHub-m-labs> [artiq] whitequark commented on issue #943: Probably [this](https://github.com/m-labs/artiq/blob/master/artiq/firmware/ksupport/eh.rs#L403-L405) in combination with [this](https://github.com/m-labs/artiq/blob/master/artiq/firmware/ksupport/lib.rs#L148-L167). https://github.com/m-labs/artiq/issues/943#issuecomment-370110454

02:48 <GitHub-m-labs> [artiq] whitequark commented on issue #943: Probably [this](https://github.com/m-labs/artiq/blob/ddcc68cff9f52224150f00e42e46fdb107d24e49/artiq/firmware/ksupport/eh.rs#L403-L405) in combination with [this](https://github.com/m-labs/artiq/blob/ddcc68cff9f52224150f00e42e46fdb107d24e49/artiq/firmware/ksupport/lib.rs#L148-L167). https://github.com/m-labs/artiq/issues/943#issuecomment-370110454

02:50 <GitHub-m-labs> [artiq] whitequark commented on issue #943: This is exceptionally (no pun intended) annoying to fix. We already take a copy of the exception itself but it's not easily possible to take a copy of data without an allocation of some sort. Maybe put the data into runtime (not kernel) memory... https://github.com/m-labs/artiq/issues/943#issuecomment-370110743

02:54 <GitHub-m-labs> [artiq] klickverbot commented on issue #943: (cc @kesht123) https://github.com/m-labs/artiq/issues/943#issuecomment-370111025

03:36 attie has quit [Read error: Connection reset by peer]

03:36 attie has joined #m-labs

04:00 mumptai_ has joined #m-labs

04:03 mumptai has quit [Ping timeout: 240 seconds]

04:06 attie has quit [Ping timeout: 240 seconds]

04:07 attie has joined #m-labs

04:14 <sb0> so the kernel module is there to provide a NIH socket interface?

04:37 <davidc__> sb0: usually the GIGE kernel modules are to work around shitty prioritization / crappy userspace code

04:37 <davidc__> er, GIGE vision

04:37 <davidc__> basically, GIGE vision just dumps the frame to you over UDP. If you get all the packets, good for you. If not, your loss

04:38 <davidc__> so for shitty devices, or programmers who write shitty code, its easier to provide a kernel module that just grabs the frame when requested

04:38 <davidc__> no idea if thats what this particular GIGE vision driver does, but I've seen similar for other machine vision cameras

04:43 <sb0> huh. why not a shared library, or subprocess?

04:46 <davidc__> sb0: there are many questions with no good answers :)

04:46 <davidc__> sb0: in all seriousness, probably because somebody wrote that library $X years ago for $Y shitty oversubscribed hardware

04:47 <davidc__> and thats the way they've always done it. Or, it guarantees that valid frames are grabbed even under worst case loading conditions, regardless of whether the user tunes their code/system right

04:47 <davidc__> so they get less support calls

04:48 <davidc__> (really, I got no idea why they'd do it that way in particular, all I know is that its not particularly uncommon)

04:54 <davidc__> sb0: FWIW, I have a GIGE vision camera kicking around the lab if you need another GIGE vision device to test against, but its probably not useful unless you are writing a universal GIGE vision driver

05:04 <whitequark> pretty sure kernel can still drop packets with a module

05:06 <davidc__> whitequark: sure it can, but depending how you hook it in and depending on the system loading conditions (and depending how how well the userspace code is written), it might drop packets less

05:06 <davidc__> to be clear, I'm not saying its a good or valid design decision

05:20 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #942: Planning to introduce a ``RTIOLinkError`` exception when attempting a RTIO operation that involves a link that is down. It cannot be precise (since we usually don't wait for feedback from the satellite devices for latency/throughput reasons) but it should catch most cases. https://github.com/m-labs/artiq/issues/942#issuecomment-370120046

05:21 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #941: Yes, maybe the startup kernel can explicitly wait for all relevant links to be up. I propose introducing a separate API call to check for link status, which will be cleaner than attempting RTIO operations until ``RTIOLinkError`` (#942) is no longer raised. https://github.com/m-labs/artiq/issues/941#issuecomment-370120134

05:23 <GitHub-m-labs> [artiq] sbourdeauducq pushed 2 new commits to master: https://github.com/m-labs/artiq/compare/ddcc68cff9f5...ba74013e3e82

05:23 <GitHub-m-labs> artiq/master ba74013 Sebastien Bourdeauducq: runtime: add a missing overflow flag reset

05:23 <GitHub-m-labs> artiq/master abfbade Sebastien Bourdeauducq: doc: DMA can also raise RTIOUnderflow

05:55 <bb-m-labs> build #1304 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1304

06:01 <bb-m-labs> build #743 of artiq-win64-test is complete: Warnings [warnings python_unittest] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/743 blamelist: Sebastien Bourdeauducq <sb@m-labs.hk>

06:04 <bb-m-labs> build #2143 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2143

06:09 attie has quit [Ping timeout: 265 seconds]

06:09 attie has joined #m-labs

06:28 <whitequark> sb0: wtf, tests pass now?!

06:28 <whitequark> why?

06:29 <whitequark> the failing test didn't use DMA...

06:37 attie has quit [Ping timeout: 256 seconds]

06:37 attie has joined #m-labs

06:59 <GitHub51> [smoltcp] LuoZijun opened issue #174: Add Ipv4Range on wire module ? https://github.com/m-labs/smoltcp/issues/174

07:02 <GitHub177> [smoltcp] whitequark commented on issue #174: Why should this functionality be in smoltcp? Is it going to be used inside it? https://github.com/m-labs/smoltcp/issues/174#issuecomment-370125435

07:10 attie has quit [Ping timeout: 256 seconds]

07:11 attie has joined #m-labs

07:35 <GitHub187> [smoltcp] LuoZijun commented on issue #174: > Why should this functionality be in smoltcp?... https://github.com/m-labs/smoltcp/issues/174#issuecomment-370127107

07:35 <GitHub38> [smoltcp] LuoZijun commented on issue #174: > Why should this functionality be in smoltcp?... https://github.com/m-labs/smoltcp/issues/174#issuecomment-370127107

07:35 <GitHub164> [smoltcp] LuoZijun commented on issue #174: > Why should this functionality be in smoltcp?... https://github.com/m-labs/smoltcp/issues/174#issuecomment-370127107

07:37 <GitHub27> [smoltcp] LuoZijun commented on issue #174: > Why should this functionality be in smoltcp?... https://github.com/m-labs/smoltcp/issues/174#issuecomment-370127107

07:55 <GitHub19> [smoltcp] whitequark commented on issue #174: I think the implementation you propose (with `Iterator<Ipv4Cidr>`) is too niche, I don't see a lot of uses for it. Since it can be freely implemented outside of smoltcp I don't think it should be included in smoltcp. https://github.com/m-labs/smoltcp/issues/174#issuecomment-370128369

08:53 attie has quit [Ping timeout: 265 seconds]

08:53 attie has joined #m-labs

09:12 attie has quit [Ping timeout: 240 seconds]

09:12 attie has joined #m-labs

09:22 <sb0> whitequark, i didn't touch DMA. cache effect?

09:33 <rjo> sb0: could you look at slave_fpga when you get a chance?

09:33 <sb0> rjo, let me finish the two drtio issues that chris reported first

09:34 <rjo> whitequark: was a firewall missconfig

09:35 <rjo> sb0: ack. the positive slack with external clock thing is also mysterious.

09:36 <rjo> i'll be afk today.

09:38 <rjo> sb0: on slave_fpga, i also tested various pullups/downs, slew=fast on cclk, drive=16, driving done high (and checking serwb afterwards), various speed changes, slower cclk, traced the individual bits being sent, confirmed sync word bit order, tested just for fun byte-swapping the rtm bitstream, compared with u-boot xilinx slave serial code, linux kernel xilinx slave serial code.

09:39 <rjo> and more things i don't remember right now.

10:12 _whitelogger has joined #m-labs

10:21 mithro has quit [Ping timeout: 240 seconds]

10:22 mithro has joined #m-labs

10:35 reportings has joined #m-labs

10:37 reportingsjr has quit [Ping timeout: 256 seconds]

10:38 attie has quit [Ping timeout: 256 seconds]

10:39 attie has joined #m-labs

10:59 <GitHub22> [smoltcp] LuoZijun closed issue #174: Add Ipv4Range on wire module ? https://github.com/m-labs/smoltcp/issues/174

11:24 attie has quit [Ping timeout: 256 seconds]

11:24 attie has joined #m-labs

13:27 attie has quit [Ping timeout: 256 seconds]

13:27 attie has joined #m-labs

14:13 futarisIRCcloud has quit [Quit: Connection closed for inactivity]

14:13 <cjbe> sb0, a thought: rather than reworking on a si5326, could we just turn up the si5324 bandwidth, then use a MMCM to phase shift the recovered clock that feeds the si5324

14:14 <cjbe> then phase lock the si5324 output to the recovered clock inside the FPGA using a DDMTD method like White Rabbit (http://ieeexplore.ieee.org/document/5556289/)

14:15 <sb0> cjbe, that won't correct the phase wander

14:16 <cjbe> as long as the si5324 lock b/w to the input clock, it will

14:16 <cjbe> worst case requires a 150 MHz reference crystal bodged onto the clkin2 so we can use a higher PFD frequency and hence higher lock b/w

14:17 <cjbe> So minimal hardware rework in the short term

14:18 <sb0> if there is higher bandwidth (which is also what the 5326 is providing) then there is no need for phase lock, just measure the skew and compensate for it with either a mmcm or (for the 5326) the skew adjust feature

14:18 <cjbe> Also, if we decide to go for an TCVCXO and DAC, like White Rabbit, the 'only' gateware we change out is replacing the interface to the MMCM with a DAC SPI interface

14:20 <sb0> my understanding is the wander is due to a combination of low bandwidth + unstability of the reference. input corrections to the wander would be outside the bandwidth.

14:21 <cjbe> sb0: as the input-output phase stabilty is not specced on the si5326, I expect this to drift over time, hence the 'measure and correct the skew' operation will need to be repeated frequently - I would call this a phase lock

14:24 <sb0> cjbe, what kind of wander do you get with the 5324 and a good quality 150MHz clkin2?

14:24 <cjbe> sb0, yes - the input correction would be attenuated by the si bandwidth, but we could turn up the gain of the feedback and precompensate for this

14:25 <cjbe> sb0, I have not measured that carefully - will measure that today

14:26 <sb0> I'm afraid turning up the gain too much will lead to oscillations of the loop, or maybe problems with the MMCM

14:26 <cjbe> another possiblity is to go straight to the White Rabbit TCVCXO + DAC solution - we could bodge this on pretty easily onto the clkin1 of the si chip, then run that in bypass mode

14:29 <cjbe> sb0, possibly - I am not sure entirely what is going on inside the si to cause this in the first place

15:13 <cjbe> sb0: just had a look at the si5324 phase offsets

15:14 <cjbe> using a good clock on clkin2, looking at phase shift between this clock and the si5324 output (MMCX)

15:16 <cjbe> using the default Artiq settings (PFD at 16 kHz, BWsel=3) I see phase wander at the ~10 Hz timescale with pk-pk of 4ns over a minute. Touching the si reference crystal gives many many cycles of phase shift

15:17 <cjbe> using the PFD at 2 MHz with 540 Hz bandwidth (BWsel=4) I see a jitter stddev of 7ps, and a pk-pk over a minute of 75ps. Touching the reference crystal gives a pk-pk of ~150ps

15:20 <cjbe> using the si in bypass mode, I see a jitter stddev of 8ps, and a pk-pk of 70ps

15:21 <cjbe> and the jitter of my nice clock against itself (from a power splitter) I see a jitter of 8ps stddev and pk-pk 72ps

15:23 <cjbe> (this is all using the stock Kasli, without a nicer si reference crystal)

15:23 <cjbe> so this is not consistent with my earlier measurements, where I stated that even at this higher bandwidth the si was not locking to the recovered clock properly

15:29 <sb0> cjbe, okay. turns out I was having a 2MHz PFD and BWSEL=4 with the initial kc705 tests (at 62.5MHz)

15:34 <cjbe> sb0, aha! that makes sense

15:34 <sb0> https://github.com/m-labs/drtio_transceiver_test/blob/master/si5324_kc705.py

15:36 <cjbe> so with the master and satellite running at 2 MHz PFD and BWSEL=4 (using an external 150 MHz clock on both), the satellite si output has a good phase lock to the master si output - (6ps stddev, 57ps pk-pk)

15:37 <cjbe> once everything is up, I can disconnect the external clock from both master and satellite, and everything still is phaselocked nicely (so no funny business going on)

15:38 <sb0> alright! so it was basically a si5324 config error. what about the skew between power-ups?

15:39 <sb0> in the kc705 tests we did that was constant, even though the DS says it's not

15:40 <cjbe> If I disconnect and reconnect the fiber (so reset the DRTIO link) I see the skew varying

15:40 <cjbe> it may be quantised, but it appears to vary over a full turn

15:41 <sb0> okay. so we can add some simple FPGA calibration

15:42 <cjbe> indeed

15:43 <sb0> what were your 150MHz sources?

15:44 <cjbe> the only remaining issue is how to get the PFD frequency up - we cannot do this using the si in free run mode with a ~114.3 MHz crystal.

15:44 <sb0> OCXO-grade?

15:45 <cjbe> But we could generate a 150 MHz / 125 MHz clock from a MMCM on the FPGA and switch it into the si input (instead of rtio_rx0)

15:45 <cjbe> I am using a synth and a splitter to generate the two 150 MHz external references

15:45 <sb0> well if the references are from the same oscillator, it's cheating

15:46 <cjbe> Or we could replace the ~114.3 MHz crystal with (say) 125 MHz to make nice numbers

15:46 <cjbe> sb0, I need the external clock to startup the si - I can startup the master, then disconnect the external clock, and startup the slave and everything still works

15:47 <sb0> oh ok, I see

15:47 <sb0> it still has the original crystal as reference

15:47 <cjbe> yep

15:48 <sb0> okay, good. there's hope we can get the hardware to <<1ns with just gateware

15:52 <cjbe> indeed - hallelujah

16:40 attie has quit [Ping timeout: 256 seconds]

16:41 attie has joined #m-labs

16:59 <GitHub125> [smoltcp] dlrobertson opened pull request #175: Add has_solicited_node to EthernetInterface (master...solicited_node) https://github.com/m-labs/smoltcp/pull/175

17:00 <GitHub100> [smoltcp] dlrobertson commented on issue #175: Adding IPv6 address resolution to `EthernetInterface` will take quite a bit of work. I'll try my best to break it down into small bite-sized chunks like this, when possible. https://github.com/m-labs/smoltcp/pull/175#issuecomment-370162840

17:04 <GitHub-m-labs> [artiq] sbourdeauducq pushed 1 new commit to master: https://github.com/m-labs/artiq/commit/928d5dc9b30e9db1329db76cd2b8b60f79d36dcc

17:04 <GitHub-m-labs> artiq/master 928d5dc Sebastien Bourdeauducq: drtio: raise RTIOLinkError if operation fails due to link lost (#942)

17:05 <GitHub-m-labs> [artiq] sbourdeauducq pushed 2 new commits to release-3: https://github.com/m-labs/artiq/compare/232940e17fc5...7337842ff927

17:05 <GitHub-m-labs> artiq/release-3 87b51cb Sebastien Bourdeauducq: doc: DMA can also raise RTIOUnderflow

17:05 <GitHub-m-labs> artiq/release-3 7337842 Sebastien Bourdeauducq: runtime: add a missing overflow flag reset

17:13 <bb-m-labs> build #2144 of artiq is complete: Failure [failed python_unittest] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2144 blamelist: Sebastien Bourdeauducq <sb@m-labs.hk>

17:43 <bb-m-labs> build #1305 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1305

17:48 <bb-m-labs> build #744 of artiq-win64-test is complete: Warnings [warnings python_unittest] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/744 blamelist: Sebastien Bourdeauducq <sb@m-labs.hk>

17:52 <bb-m-labs> build #2145 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2145

18:35 RexOrCine has quit [Ping timeout: 240 seconds]

18:36 RexOrCine has joined #m-labs

18:49 attie has quit [Read error: Connection reset by peer]

18:49 attie has joined #m-labs

19:23 <GitHub104> [smoltcp] whitequark commented on issue #175: @m-labs-homu r+... https://github.com/m-labs/smoltcp/pull/175#issuecomment-370172919

19:23 <GitHub70> [smoltcp] m-labs-homu commented on issue #175: :pushpin: Commit 5cb8717 has been approved by `whitequark`

19:23 <GitHub139> [smoltcp] m-labs-homu pushed 1 new commit to auto: https://github.com/m-labs/smoltcp/commit/89999a69812659e5241f8561d1ecb3fd1c949a9f

19:23 <GitHub176> [smoltcp] m-labs-homu commented on issue #175: :hourglass: Testing commit 5cb8717fd2caa17d9ed853bc212a32a5bf141b77 with merge 89999a69812659e5241f8561d1ecb3fd1c949a9f... https://github.com/m-labs/smoltcp/pull/175#issuecomment-370172936

19:23 <GitHub139> smoltcp/auto 89999a6 Dan Robertson: Add has_solicited_node to EthernetInterface...

19:32 <GitHub191> [smoltcp] m-labs-homu commented on issue #175: :sunny: Test successful - [status-travis](https://travis-ci.org/m-labs/smoltcp/builds/348720074?utm_source=github_status&utm_medium=notification)

19:32 <travis-ci> m-labs/smoltcp#787 (auto - 89999a6 : Dan Robertson): The build passed.

19:32 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/2d2b90fd0469...89999a698126

19:32 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/348720074

19:32 <GitHub180> [smoltcp] m-labs-homu closed pull request #175: Add has_solicited_node to EthernetInterface (master...solicited_node) https://github.com/m-labs/smoltcp/pull/175

19:32 <GitHub10> [smoltcp] m-labs-homu merged auto into master: https://github.com/m-labs/smoltcp/compare/2d2b90fd0469...89999a698126

19:41 <travis-ci> m-labs/smoltcp#788 (master - 89999a6 : Dan Robertson): The build passed.

19:41 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/2d2b90fd0469...89999a698126

19:41 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/348722428

22:17 mumptai_ has quit [Quit: Verlassend]

22:17 mumptai has joined #m-labs