#m-labs on 2017-06-26 — irc logs at freenode.irclog.whitequark.org

2015-03-04 14:45 sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

00:06 <whitequark> rjo: about delayed ACKs, I see the requirement to send an ACK every other full-size packet

00:06 <whitequark> is that really necessary? it seems to negate most of the advantage delayed ACK offers

00:06 <whitequark> let me see what linux does I guess

01:27 _whitelogger has joined #m-labs

02:27 <sb0> hmm, I'd consider putting all those ad-hoc scripts in a separate repos

02:32 <whitequark> sb0: why?

02:33 <whitequark> developing artiq remotely is painful enough as it is, and the scripts work on more than lab.m-labs.hk

02:33 <sb0> it's noise for the users

02:34 <whitequark> does no one outside of m-labs flash core device remotely? or pull logs from it etc

02:37 <GitHub109> [artiq] sbourdeauducq commented on issue #524: Fixed. https://github.com/m-labs/artiq/issues/524#issuecomment-310950275

02:37 <GitHub129> [artiq] sbourdeauducq closed issue #524: Browser crashes if maximised when no pyon file exists https://github.com/m-labs/artiq/issues/524

02:37 <sb0> probably not often enough to figure out those ad-hoc scripts

03:18 rohitksingh_work has joined #m-labs

03:48 <GitHub128> [smoltcp] whitequark pushed 1 new commit to master: https://git.io/vQ3it

03:48 <GitHub128> smoltcp/master 534842b whitequark: Make sure representation emission covers every octet of the buffer....

03:51 <travis-ci> m-labs/smoltcp#130 (master - 534842b : whitequark): The build passed.

03:51 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/8ace8cd94e33...534842b929ce

03:51 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/246951875

04:35 <GitHub164> [smoltcp] whitequark commented on issue #19: Looks all good! I have a few nits but no real issues. https://git.io/vQ3PF

05:09 sb0 has quit [Quit: Leaving]

05:35 rohitksingh_work has quit [Ping timeout: 260 seconds]

05:48 sb0 has joined #m-labs

05:50 rohitksingh_work has joined #m-labs

05:57 <GitHub47> [smoltcp] whitequark pushed 2 new commits to master: https://git.io/vQ3MF

05:57 <GitHub47> smoltcp/master 921f2e3 Egor Karavaev: Don't reply to a TCP RST packet with another TCP RST packet.

05:57 <GitHub47> smoltcp/master feccd96 Egor Karavaev: Refactor EthernetInterface::poll....

05:57 <GitHub113> [smoltcp] whitequark commented on issue #19: In fact I went ahead and merged it. https://git.io/vQ3Mb

06:00 <travis-ci> m-labs/smoltcp#131 (master - 921f2e3 : Egor Karavaev): The build passed.

06:00 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/534842b929ce...921f2e36c839

06:00 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/246974729

06:03 <GitHub92> [smoltcp] whitequark commented on issue #19: > I propose I rename that I called SocketDispatcher to SocketDispatchTable, put it inside SocketSet... https://git.io/vQ3Ds

06:09 <GitHub79> [smoltcp] whitequark pushed 1 new commit to master: https://git.io/vQ3DV

06:09 <GitHub79> smoltcp/master b3e3554 whitequark: Add missing #[derive]s on wire::IpVersion.

06:13 <travis-ci> m-labs/smoltcp#132 (master - b3e3554 : whitequark): The build passed.

06:13 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/921f2e36c839...b3e355437f68

06:13 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/246976887

06:42 <GitHub109> [smoltcp] whitequark commented on issue #19: Moving forward, do you think you can cover EthernetInterface with tests? A basic coverage of every `Ok` and `Err` returned from the `process_*` functions would be a great start, I'll chime in then, adding support for the newly landed rustc support for gcov. https://git.io/vQ3SL

06:56 <rjo> i am ok with the scripts as long as they are clean, documented, and maintained.

07:24 <GitHub65> [smoltcp] batonius commented on issue #19: >Moving forward, do you think you can cover EthernetInterface with tests?... https://git.io/vQ3H6

07:25 attie has quit [Remote host closed the connection]

07:25 <GitHub55> [smoltcp] batonius commented on issue #19: >Moving forward, do you think you can cover EthernetInterface with tests?... https://git.io/vQ3H6

07:26 <travis-ci> batonius/smoltcp#13 (master - b3e3554 : whitequark): The build passed.

07:26 <travis-ci> Change view : https://github.com/batonius/smoltcp/compare/a4b33c916638...b3e355437f68

07:26 <travis-ci> Build details : https://travis-ci.org/batonius/smoltcp/builds/246991895

07:29 attie has joined #m-labs

07:32 <whitequark> rjo: did I already ask you as to why you're not using them btw?

07:32 <whitequark> do you have your own scripts?

07:32 <whitequark> or just do everything on lab.* via ssh?

08:07 rohitksingh_work has quit [Ping timeout: 246 seconds]

08:14 <rjo> whitequark: a mixture of git, rsync, and mosh/tmux. yes. but i always wanted to give your scripts a try.

08:15 <whitequark> rjo: ack.

08:15 <whitequark> rjo: I believe I found the root cause behind all our throughput issues btw.

08:15 <whitequark> smoltcp did not send duplicate ACKs when it detected a missing segment

08:16 <whitequark> this meant that every missing segment incurred at least 0.5s of delay waiting for the host to retransmi

08:16 <whitequark> if I send duplicate ACKs this gets resolved within milliseconds in my testing (not on the core device yet)

08:17 <whitequark> I've missed this because this behavior of duplicate ACKs is an implementation detail of congestion control algorithms and isn't in RFC793, though it is in RFC1122

08:28 rohitksingh_work has joined #m-labs

08:31 <GitHub77> [smoltcp] whitequark pushed 3 new commits to master: https://git.io/vQ3F2

08:31 <GitHub77> smoltcp/master ac6efbf whitequark: Try to trigger fast retransmit when we detect a missing TCP segment....

08:31 <GitHub77> smoltcp/master a2f233e whitequark: In examples, trace the packets being dropped by the fault injector.

08:31 <GitHub77> smoltcp/master 86c1cba whitequark: In examples, print packet dumps with timestamps, too....

08:37 <travis-ci> m-labs/smoltcp#133 (master - ac6efbf : whitequark): The build was broken.

08:37 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/b3e355437f68...ac6efbf99945

08:37 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/247009825

08:38 <GitHub143> [smoltcp] whitequark force-pushed master from ac6efbf to a7da851: https://git.io/vMLjV

08:38 <GitHub143> smoltcp/master a7da851 whitequark: Try to trigger fast retransmit when we detect a missing TCP segment....

08:38 <GitHub143> smoltcp/master 1a3a6b2 whitequark: In examples, trace the packets being dropped by the fault injector.

08:44 sb0 has quit [Quit: Leaving]

08:45 <travis-ci> m-labs/smoltcp#134 (master - a7da851 : whitequark): The build is still failing.

08:45 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/ac6efbf99945...a7da851c5dc7

08:45 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/247011784

08:47 <GitHub50> [smoltcp] whitequark force-pushed master from a7da851 to 1746702: https://git.io/vMLjV

08:47 <GitHub50> smoltcp/master 1746702 whitequark: Try to trigger fast retransmit when we detect a missing TCP segment....

08:47 <GitHub50> smoltcp/master 103fca2 whitequark: In examples, trace the packets being dropped by the fault injector.

08:56 <travis-ci> m-labs/smoltcp#135 (master - 1746702 : whitequark): The build was fixed.

08:56 <travis-ci> Change view : https://github.com/m-labs/smoltcp/compare/a7da851c5dc7...1746702f53a7

08:56 <travis-ci> Build details : https://travis-ci.org/m-labs/smoltcp/builds/247014272

08:57 <GitHub71> [artiq] whitequark commented on issue #685: (Commenting here, since #732 was closed.)... https://github.com/m-labs/artiq/issues/685#issuecomment-301056599

09:00 <GitHub6> [artiq] whitequark pushed 3 new commits to master: https://github.com/m-labs/artiq/compare/f4624e086af5...f36f00a83d6d

09:00 <GitHub6> artiq/master f36f00a whitequark: artiq_devtool: do not chop up the TCP stream into 1024 byte chunks....

09:00 <GitHub6> artiq/master 282f425 whitequark: artiq_pcap: atomically replace pcap file....

09:00 <GitHub6> artiq/master d6f4f1f whitequark: artiq_devtool, artiq_pcap: better option naming.

09:03 <GitHub144> [artiq] whitequark commented on issue #685: (Commenting here, since #732 was closed.)... https://github.com/m-labs/artiq/issues/685#issuecomment-301056599

09:03 <rjo> whitequark: nice.

09:09 rohitksingh_wor1 has joined #m-labs

09:10 rohitksingh_work has quit [Ping timeout: 260 seconds]

09:18 <GitHub163> [artiq] whitequark commented on issue #685: > It should be perfectly workable to keep a free list backed by a static pool around to store per-segment metadata (sequence number ranges, …), while building up the payload directly in the circular buffer.... https://github.com/m-labs/artiq/issues/685#issuecomment-311006676

09:19 <whitequark> sb0: rjo: I get 1 Mbps of throughput consistently, up to 1.8 Mbps of throughput in good conditions

09:19 <whitequark> so this actually exceeds lwip I believe

09:19 <whitequark> that's host pushing data to the core device

09:20 <whitequark> wait, no

09:21 <whitequark> 1 mega*byte* per second

09:21 <rjo> whitequark: by the way, could you or sb0, look at upgrading the rigol's firmware? it hangs much more than mine on the same commands.

09:21 <whitequark> is there an upgrade?

09:21 <GitHub99> [artiq] whitequark commented on issue #685: > It should be perfectly workable to keep a free list backed by a static pool around to store per-segment metadata (sequence number ranges, …), while building up the payload directly in the circular buffer.... https://github.com/m-labs/artiq/issues/685#issuecomment-311006676

09:22 <rjo> iirc there have been some in the last months.

09:22 <whitequark> ah.

09:22 <whitequark> rjo: I'm not in HK

09:23 <rjo> whitequark: ok. 1 MB/s is what i remember from lwip as well. good if we can go faster, better if we can identify how we can go even faster then ;)

09:23 <rjo> whitequark: ok.

09:25 <bb-m-labs> build #670 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/670

09:33 <whitequark> rjo: so the way we can go faster is if we don't sit there all time with a full window

09:34 <whitequark> there are two things we can do

09:35 <whitequark> first, we can enable window scaling and have really huge buffers for kernel comms to absorb bursty traffic

09:35 <whitequark> second, I'm pretty sure we can eliminate the last copy that's happening with RPCs, which will free up the CPU to do useful things

09:37 sb0 has joined #m-labs

09:38 <whitequark> the reason we *already* win against lwip is that smoltcp itself is zero-copy

09:38 <bb-m-labs> build #503 of artiq-win64-test is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/503

09:38 <bb-m-labs> build #1586 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1586

09:42 <sb0> rjo, I'll have a look this week

09:42 <rjo> whitequark: all ack.

09:42 <sb0> whitequark, excellent

09:43 <rjo> sb0: no urgency though, i can work around it.

09:45 <whitequark> sb0: rjo: there's another issue though, because of a silly bug the throughput in the *other* direction is 28 kBps

09:45 <whitequark> but that's even easier to fix and it's completely obvious why that happens.

09:45 <whitequark> (smoltcp waits for an ACK after sending exactly one packet)

09:48 <whitequark> rjo: another issue with sitting there with a full window is it destroys throughput through artiq_devtool

09:49 <whitequark> not sure exactly why, it might be something about the way ssh does forwarding

09:49 <whitequark> but via artiq_devtool I only get about half that

09:49 <whitequark> over a long fat pipe, I mean

09:49 <whitequark> this *shouldn't* matter since the transfer only goes in one direction...

09:51 <GitHub147> [artiq] whitequark commented on issue #685: > It should be perfectly workable to keep a free list backed by a static pool around to store per-segment metadata (sequence number ranges, …), while building up the payload directly in the circular buffer.... https://github.com/m-labs/artiq/issues/685#issuecomment-311006676

09:52 <GitHub38> [artiq] whitequark commented on issue #685: Here are the TCP throughput graphs for the test above.... https://github.com/m-labs/artiq/issues/685#issuecomment-311014537

10:02 <whitequark> tried to implement klickverbot's suggestion of not artificially limiting the receive window size to limit bursty traffic

10:03 <whitequark> the core device ends up sending bursts of exactly 38 duplicate ACKs

10:03 <whitequark> not just *around* 38, *exactly* 38

10:04 <GitHub49> [artiq] whitequark commented on issue #685: @klickverbot ... https://github.com/m-labs/artiq/issues/685#issuecomment-311017131

10:06 <GitHub199> [artiq] whitequark commented on issue #685: @klickverbot ... https://github.com/m-labs/artiq/issues/685#issuecomment-311017131

10:06 <rjo> whitequark: just in case you don't know it already, tcptrace seemed a really good tool to debug TCP. better than the tools in wireshark.

10:07 <GitHub0> [artiq] jordens commented on issue #685: The two pcaps (1.1M and 1.8M) are exactly the same. https://github.com/m-labs/artiq/issues/685#issuecomment-311017706

10:08 <GitHub61> [artiq] whitequark commented on issue #685: > The two pcaps (1.1M and 1.8M) are exactly the same.... https://github.com/m-labs/artiq/issues/685#issuecomment-311017874

10:08 <GitHub51> [artiq] whitequark commented on issue #685: > The two pcaps (1.1M and 1.8M) are exactly the same.... https://github.com/m-labs/artiq/issues/685#issuecomment-311017874

10:09 <whitequark> rjo: never heard of tcptrace, actually

10:09 <whitequark> but I'm fine with wireshark

10:15 <GitHub26> [artiq] whitequark commented on issue #685: Here are the TCP throughput graphs for the test above.... https://github.com/m-labs/artiq/issues/685#issuecomment-311014537

10:16 <GitHub2> [artiq] whitequark commented on issue #685: @jordens Packet captures reacquired and comment above updated. https://github.com/m-labs/artiq/issues/685#issuecomment-311019570

10:16 <whitequark> rjo: that throughput difference is very weird.

10:16 <whitequark> it looks like the captures are identical in every respect except one of them is slower

10:17 <whitequark> the % of window full packets is also the same

10:17 <whitequark> well, 28% vs 30%, not much of a difference

10:19 <GitHub28> [artiq] whitequark commented on issue #685: @klickverbot ... https://github.com/m-labs/artiq/issues/685#issuecomment-311017131

10:24 <GitHub103> [artiq] whitequark commented on issue #731: @cjbe As a matter of fact, with the current master the entire experiment finishes in under 1s. https://github.com/m-labs/artiq/issues/731#issuecomment-311021303

10:28 <GitHub178> [artiq] whitequark commented on issue #732: @cjbe As a matter of fact, with the current master the entire experiment finishes in under 100ms. https://github.com/m-labs/artiq/issues/732#issuecomment-311022070

10:43 <whitequark> rjo: ah no unfortunately we already do as few copies as possible

10:44 <whitequark> there's copy #1 that takes the ethmac buffer and puts it into the TCP circular buffer

10:44 <whitequark> and there's copy #2 that takes the TCP circular buffer and puts it into an allocation owned by kernel CPU

10:44 <whitequark> neither can *really* be eliminated

10:45 <whitequark> then it looks like the only option here is implementing proper window management

10:46 <whitequark> we essentially get permanently stuck with a MTU*4 window right now despite the TCP buffer being almost entirely empty

10:47 <whitequark> er, correction. MTU*4 window regardless of the state of the buffer so long as it isn't almost entirely full

11:18 <GitHub77> [artiq] klickverbot commented on issue #685: @sbourdeauducq: It wasn't better at the time of your message, but the duplicate ACK handling from earlier today should indeed make a difference. I'll have a go at reproducing the results soon.... https://github.com/m-labs/artiq/issues/685#issuecomment-311031321

11:29 <rjo> whitequark: ethernet DMA...

11:38 <cr1901_modern> Does misoc support a DMA controller (I suppose you could just implement the ethernet core as a WB master for DMA if not)

11:44 <sb0> what is a DMA controller? hardware memcpy? no

11:47 <cr1901_modern> sb0: Yes, basically. But I recall you saying something else a while back: that a DMA controller shouldn't use the same bus as the CPU to read/dump data to memory.

11:48 <cr1901_modern> ^So I was asking if that was supported (do other SoCs use this approach?) either

12:10 <sb0> rjo, did your modified vivado script improve the dma/rtio timing, or is that still an important issue?

12:33 <rjo> it did have a small impact but the path is still there. and it is extremely long and i expect it to cause problems soon again.

12:35 <GitHub39> [smoltcp] whitequark opened issue #20: TCP reset generation is not quite correct https://git.io/vQsn5

12:36 <GitHub57> [smoltcp] whitequark opened issue #21: Challenge ACKs are not always generated https://git.io/vQsnb

12:39 <GitHub51> [smoltcp] whitequark opened issue #22: ACKs are not generated when receiving segments and the window is zero https://git.io/vQscm

12:44 rohitksingh_wor1 has quit [Read error: Connection reset by peer]

13:06 <sb0> rjo, okay, it's the ack (flow control) path

13:06 <sb0> it's long because many components have combinatorial logic in flow control

13:06 <sb0> that can be broken with a 2-entry FIFO

13:10 <whitequark> I'm not sure how much sense it makes to have a hardware memcpy

13:10 <whitequark> or1k is a pipelined CPU with prefetch, right?

13:11 <sb0> or, more simply, by having a component that reads/writes sequentially. unlike the FIFO this limits the throughput but should not be the bottleneck

13:12 <sb0> that maybe can be combined with the time offset stage, which would use a negligible amount of FPGA resources (the wide data makes buffers expensive)

13:12 <whitequark> if you unroll the memcpy loop then it can spend many of its cycles actually doing copying

13:12 <sb0> whitequark, you can access the SDRAM with a wider bus than the CPU

13:13 <whitequark> if you go the hardware memcpy route though then you will pay the cache flush penalty

13:13 <whitequark> unless you make it cache-coherent which is not gonna happen for ARTIQ

13:13 <sb0> cache coherency isn't *that* bad

13:14 <whitequark> well can you justify implementing MOESI just to get faster memcpy/

13:14 <whitequark> ?

13:14 <sb0> even in FPGAs it can work, for example milkymist had a VGA framebuffer that had cache coherency with the L2 cache

13:14 <whitequark> well

13:14 <whitequark> it will also give us faster kernel/comms CPU data transfer

13:14 <whitequark> right now every RPC is a slog

13:14 <whitequark> so maybe it can be justified after all

13:15 <sb0> are cache misses the main slowdown for RPCs?

13:15 <whitequark> they are a significant slowdown iirc the last time I was measuring that

13:16 <whitequark> first, you have this massive loop that iterates through entire l2 cache, as a fixed penalty

13:16 <whitequark> and then you get all of your working set evicted

13:16 <whitequark> also does or1k really have no way to flush *specific* dcache lines?

13:17 <whitequark> it already has the CAM...

13:17 <sb0> access another address with an offset at a multiple of the cache size

13:17 <whitequark> oh, you don't flush l2 cache for RPCs, my bad

13:18 <whitequark> sb0: that's a waste of time

13:18 <whitequark> well I suppose since we have no MMU we could calculate it from the way/set/block count

13:19 <whitequark> and do an appropriate SPR_DCBIR write

13:19 <sb0> if you access in the on-chip SRAM it's a lesser waste of time

13:24 <GitHub184> [smoltcp] whitequark commented on issue #19: Something I just remembered that might be very relevant to your work is that having several open sockets in LISTEN state with the same local endpoint is perfectly legal. That's how listen backlog is implemented (by a layer on top of smoltcp). https://git.io/vQs4Q

13:41 <GitHub44> [smoltcp] whitequark opened issue #23: Revise errors returned from `TcpSocket::process()` https://git.io/vQs0K

13:48 <GitHub78> [smoltcp] batonius commented on issue #19: Right, I somehow missed that point completely, it's not enough to dispatch tcp packets based on the dst endpoint, a server can have several clients connected to it, we need to consider src endpoint as well, and we don't know it until we established a connection. This means we need another layer of dispatching and a way for a socket to report the fact it has established a connection to a remote endpoint.

14:03 <GitHub106> [smoltcp] batonius commented on issue #19: Now I think if it, it should be easy enough to do by checking if socket's `remote_endpoit` has changed after `process` in `process_tcpv4`. https://git.io/vQsgX

14:22 rohitksingh has joined #m-labs

15:22 hartytp has joined #m-labs

15:35 <hartytp> sbo: DRTIO switching

15:35 <hartytp> why do you need to store an entry for each DRTIO channel in a table?

15:36 <hartytp> Isn’t 1 entry per satellite device is enough?

15:42 <hartytp> what is the planned implementation of the DRTIO switching funded by ARL?

15:59 <sb0> hartytp, the ARL design is for the Sayma RTM FPGA, and unlike Kasli, the two ends of the switch operate at different data rates

16:00 <sb0> currently the DRTIO master needs to store how much space is available in the RTIO FIFO of each channel, to avoid querying the satellite every time which would cause poor performance

16:01 <sb0> with the current switch support plan, there is only one hop at most, and the number of RTIO channel on Sayma RTM is rather small. this makes the DRTIO master block RAM more manageable ...

16:02 <hartytp> Okay, so this is about keeping track of room in FIFOs, rather than about constructing a routing table?

16:03 <hartytp> and, we can't just use overflow errors for flow control?

16:10 <hartytp> "If we design the route->index mapper in a naive and trivial way (encode each hop with 2 bits, concatenate the results, and multiply by the memory allocation for one device) then the required amount of memory is very high at 10 megabytes, with a tree 5 layers deep."

16:11 <hartytp> I assumed we'd store a list of DRTIO devices and, for each one, store the route information.

16:12 <hartytp> Thus, it's only a few extra bits of information for each DRTIO slave we add, rather than an exponentially increasing amount of data

16:12 <hartytp> "hartytp, the ARL design is for the Sayma RTM FPGA, and unlike Kasli, the two ends of the switch operate at different data rates"

16:14 <hartytp> If you can do different data rates, don't you get switching with the same data rate more or less for free? (different data rates sounds like a general case)

16:19 <sb0> the different data rate design will have less performance (needs to buffer whole packets etc.). same data rate, you can do cut-through switching

16:20 <sb0> no, we can't just use overflow errors for flow control

16:22 <sb0> having a list of drtio devices and storing route information in gateware is an option, yes. but it needs to be done...

16:23 <sb0> and even with that, it's still a 200KB table

16:26 <hartytp> sb0, okay so the latency is quite high for the current ARL funded DRTIO switch (how high?). The estimate you gave me is for reducing the latency in the same-speed case by implementing cut-through switching, right?

16:26 <hartytp> "having a list of drtio devices and storing route information in gateware is an option, yes. but it needs to be done..." would that be a lot simpler to implement?

16:27 <sb0> it's much simpler in gateware than encoding the route in the RTIO channel numbers and then having to map that efficiently to table addresses

16:27 <sb0> but then there is the problem of loading the route table.

16:28 <sb0> I suppose the only option is to put it as a config option in the core device flash, otherwise startup/idle kernels would not run properly

16:28 <sb0> yes, that estimate is implementing cut-through switching

16:28 <hartytp> "and even with that, it's still a 200KB table" true, assuming we need to store 10 bytes per DRTIO channel (what are they for?) and we want to support 1024 RTIO channels per device (256 seems plenty IMO)....

16:29 <hartytp> "I suppose the only option is to put it as a config option in the core device flash, otherwise startup/idle kernels would not run properly" that doesn't sound too bad to me

16:29 <sb0> well, the user interface needs a bit of thought

16:30 <hartytp> yes

16:30 <sb0> 10 bytes = last timestamp (64 bits) + FIFO space (16 bits)

16:31 <hartytp> remind me what last timestamp is needed for

16:31 <sb0> sequence error detection

16:31 <hartytp> detecting out-of-order events?

16:32 <sb0> trying to post an event on a channel with a timestamp smaller than the previous one

16:32 <hartytp> okay

16:33 <sb0> with SRTIO it is generally OK to do that, so this "sequence error" doesn't exist anymore

16:34 <hartytp> can the error detection be done by the DRTIO slave, rather than the master?

16:34 <hartytp> That way, you're down to 2 bytes per DRTIO channel on the master

16:36 <sb0> then either you lose precise exceptions, or performance, since a round-trip would be required for every event

16:37 <sb0> if you add a microsecond of latency by crossing switches, then the event rate really drops...

16:37 <hartytp> precise exceptions? The DRTIO slave can raise an exception over the DRTIO aux channel, telling you which instruction caused the error. What other information do you need?

16:38 <sb0> there are two problems with that:

16:39 <sb0> 1) it cannot work like a Python exception, e.g. the CPU may already be out of the "except:" clause when the error arrives

16:40 <hartytp> yes, it'd be more like an underflow error

16:40 <sb0> 2) the kernel may even have already terminated by the time the error arrives, so if you store just a program counter value it still is a bit tricky to know where the error came from

16:40 <sb0> underflow errors also use precise exceptions.

16:40 <sb0> you can catch them etc

16:42 <sb0> "try: ttl.on() except RTIOUnderflow: ..." has precisely defined behavior

16:42 <hartytp> okay.

16:43 <hartytp> how are underflows handled if not via drtio aux?

16:45 <sb0> locally by looking at the local timestamp counter, and checking that there is enough time considering the various latencies into account

16:45 <sb0> drtio switches also complicate that, by the way

16:45 <sb0> contrary to what you think, they are not easy, even for small networks

16:46 <hartytp> Never thought this was easy

16:47 <hartytp> just trying to understand the issues

16:49 <hartytp> (sorry, just re-read the drtio docs and noticed some of my questions were answered there)

16:52 <hartytp> In general:

16:53 <hartytp> - Kasli as master is something I'm keen on, as some of our experiments will only need minimal uTCA hardware (others will need lots of it, so will use Metlino).

16:54 <hartytp> - but, I am a bit concerned with potential resource/speed limitations of Kasli (as discussed previously)

16:55 <hartytp> - we can potentially fund the SRTIO proposal, depending on the costs

16:57 <sb0> ok, good :)

16:59 <rjo> hartytp: are you guys doing CameraLink-based readout from (Andor?) (EM) CCDs? just heard it mentioned here (PTB) that someone from oxford had that in a thesis.

16:59 <hartytp> cjbe is the person to ask about that

17:00 <rjo> his thesis?

17:00 <hartytp> https://www2.physics.ox.ac.uk/sites/default/files/Burrell_Thesis.pdf

17:00 <rjo> thanks

17:01 <hartytp> that's from quite a while ago

17:02 <hartytp> Chris has done some (unpublished stuff). IIRC, triggering the camera via TTL and then reading back later via a PC card

17:04 <rjo> oh. i remember that. ack.

17:04 <hartytp> real-time readout via CameraLink is something we're keen on/thinking about. Are PTB considering funding it?

17:07 <hartytp> sb0: the simpler/cheaper we can keep the switching proposal, the easier it will be for us to fund; even if it doesn't address all the issues required for long-term scalability, at least it would be a start

17:08 <hartytp> other than that, I'll wait to hear from you re Kasli speed and a more firm estimate of switching costs.

17:09 cjbe has joined #m-labs

17:12 <cjbe> rjo: I have looked into the Andor EMCCD CameraLink implementation, and stuck a scope on it to confirm the protocol and latency is not crazy, but have not written any gateware for this (yet...)

17:15 hartytp has quit [Quit: Page closed]

18:26 <GitHub91> [smoltcp] whitequark opened issue #24: Use timestamp for TCP initial sequence number https://git.io/vQGLb

18:28 <whitequark> sb0: I have an idea for handling exceptions

18:28 <whitequark> we could add a hook so that before the try: block is exited, the compiler issues an exception barrier, and pulls in any that might have arised

18:29 <whitequark> basically, mark the RTIOUnderflow (or whichever) exception as "this needs additional code emitted before try: block that catches it finishes"

18:29 <whitequark> could be even just python code to be fully generic

18:29 <whitequark> easy to implement, seems pretty ergonomic

18:37 <travis-ci> batonius/smoltcp#14 (master - 1746702 : whitequark): The build passed.

18:37 <travis-ci> Change view : https://github.com/batonius/smoltcp/compare/b3e355437f68...1746702f53a7

18:37 <travis-ci> Build details : https://travis-ci.org/batonius/smoltcp/builds/247210822

18:42 <travis-ci> batonius/smoltcp#15 (packet_dispatch - e4e0f1e : Egor Karavaev): The build is still failing.

18:42 <travis-ci> Change view : https://github.com/batonius/smoltcp/compare/1aef68a611c2...e4e0f1e1cc15

18:42 <travis-ci> Build details : https://travis-ci.org/batonius/smoltcp/builds/247211547

18:47 <GitHub93> [smoltcp] whitequark pushed 2 new commits to master: https://git.io/vQGYB

18:47 <GitHub93> smoltcp/master 5c3fc49 whitequark: Discard packets with non-unicast source addresses at IP level....

18:47 <GitHub93> smoltcp/master e47e94e whitequark: Transmit actual UDP checksum of all-zeroes as all-ones instead.

18:48 <GitHub168> [smoltcp] klickverbot commented on issue #24: Also see RFC 1948/6528 – timestamps have been augmented by a PRNG since to avoid sequence number attacks. https://git.io/vQGY2

18:51 <GitHub133> [smoltcp] whitequark commented on issue #24: @klickverbot Is there some source of truth for which RFCs are actually authoritative for TCP? RFC 793 is hopelessly outdated and has errata, RFC 1122 fixes some of that, highlights a few common errors, many of which I did make, but also piles completely useless junk on top of it (I think every ICMP message it specifically mentions except unreachables and echo request/reply is deprecated, strongly disco

18:55 <GitHub110> [smoltcp] klickverbot commented on issue #24: @whitequark: Unfortunately, I don't know of any up to date list of RFCs relevant for the various areas, but I found the review in RFC 7414 to be quite useful (from 2015). https://git.io/vQGOa

18:56 raghu has joined #m-labs

18:57 <raghu> bb-m-labs: force build --props=package=artiq-kc705-nist_qc2 artiq-board

18:57 <bb-m-labs> build forced [ETA 16m43s]

18:57 <bb-m-labs> I'll give a shout when the build finishes

18:59 raghu has quit [Client Quit]

19:00 mumptai has joined #m-labs

19:00 <GitHub118> [smoltcp] whitequark commented on issue #24: @klickverbot Thanks https://git.io/vQG3m

19:00 <GitHub78> [smoltcp] whitequark commented on issue #19: Yeah that works. https://git.io/vQG3Y

19:05 rohitksingh has quit [Quit: Leaving.]

19:20 <bb-m-labs> build #671 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/671

19:30 ChanServ has quit [shutting down]

19:35 ChanServ has joined #m-labs

20:13 Gurty has quit [Ping timeout: 255 seconds]

20:14 Gurty has joined #m-labs

20:14 Gurty has quit [Changing host]

21:45 mumptai has quit [Quit: Verlassend]

22:17 <travis-ci> batonius/smoltcp#16 (master - 5c3fc49 : whitequark): The build passed.

22:17 <travis-ci> Change view : https://github.com/batonius/smoltcp/compare/1746702f53a7...5c3fc4935eb0

22:17 <travis-ci> Build details : https://travis-ci.org/batonius/smoltcp/builds/247290619

22:18 <travis-ci> batonius/smoltcp#17 (packet_dispatch - ab18a49 : Egor Karavaev): The build is still failing.

22:18 <travis-ci> Change view : https://github.com/batonius/smoltcp/compare/e4e0f1e1cc15...ab18a49e26a9

22:18 <travis-ci> Build details : https://travis-ci.org/batonius/smoltcp/builds/247290698

22:30 <GitHub177> [smoltcp] batonius commented on issue #19: I've updated https://github.com/m-labs/smoltcp/compare/master...batonius:packet_dispatch .... https://git.io/vQGM2

22:34 <GitHub187> [smoltcp] whitequark commented on issue #19: Hmm, I'm not sure if I like this idea very much, we already have drop magic in Device and that's pretty bad already. But I can give it a look. https://git.io/vQGDU

23:03 cjbe has quit [Ping timeout: 260 seconds]

23:37 Gurty has quit [Excess Flood]

23:37 Gurty has joined #m-labs