#glasgow on 2020-10-05 — irc logs at freenode.irclog.whitequark.org

2020-08-09 01:18 ChanServ changed the topic of #glasgow to: glasgow interface explorer · code https://github.com/GlasgowEmbedded/glasgow · logs https://freenode.irclog.whitequark.org/glasgow · discord https://1bitsquared.com/pages/chat · production https://www.crowdsupply.com/1bitsquared/glasgow · no ETAs at the moment

00:00 <whitequark> Attie: in_fifo.read() is already non-blocking

00:00 <ebb> Ah, temple-tap moment

00:00 <whitequark> the part that makes it feel like it blocks is the `await`

00:00 <d1b2> <Attie> ... is it?

00:00 <d1b2> <Attie> oh, well.. yes

00:00 <whitequark> you probably want to use some kind of asyncio construct

00:01 <d1b2> <Attie> i've reworked it a bit

00:01 <d1b2> <Attie> would you mind looking over?

00:01 <d1b2> <Attie> (when you have a chance)

00:01 <whitequark> re "I can't quite figure out if the system will bring as much data into the host as possible"

00:01 <whitequark> it will if you ever yield control to the scheduler

00:01 <whitequark> in practical terms, if you have a compute-bound loop it won't

00:01 <d1b2> <Attie> https://github.com/attie/glasgow/blob/i2s/software/glasgow/applet/audio/i2s_capture/__init__.py

00:01 <d1b2> <Attie> right... so I'm trying to be "nice" here, but seeing odd things

00:02 <d1b2> <Attie> this sleep seems to make things on the FPGA side unhappy https://github.com/attie/glasgow/blob/i2s/software/glasgow/applet/audio/i2s_capture/__init__.py#L193

00:02 <d1b2> <Attie> the larger it is, the larger the retrieved buffers get, which is fine... but we also miss out on more samples

00:02 <whitequark> yeah don't do that

00:02 <d1b2> <Attie> then here, i'm yielding to let other tasks run: https://github.com/attie/glasgow/blob/i2s/software/glasgow/applet/audio/i2s_capture/__init__.py#L260

00:03 <whitequark> don't try to be "nice"

00:03 <whitequark> also, checksum?

00:03 <whitequark> what?

00:04 <whitequark> i think the reason your code doesn't work that well is that you're doing a lot more than necessary

00:04 <d1b2> <Attie> the plan with the checksum was to be more confident in what's going on

00:04 <whitequark> don't add checksums (USB has them already), don't process sample buffers in Python, *especially* don't copy samples (line 227)

00:04 <d1b2> <Attie> i currently have low confidence that data is put into the FIFO when it needs to be

00:05 <whitequark> don't try to guess when f.write will write to disk

00:05 <whitequark> for streaming samples to disk, the following should be sufficient:

00:05 <whitequark> while True: f.write(await iface.read())

00:06 <whitequark> now, it's true that you do need some framing, so you can't do *quite* that

00:07 <whitequark> hm

00:07 <whitequark> you have inputs that are not synchronized to the FPGA clock

00:07 <whitequark> you need some FFSynchronizers to do that, or your gateware will misbehave

00:08 <whitequark> that might be the cause of the issue you're trying to find with checksums

00:08 FFY00 has quit [Remote host closed the connection]

00:08 <whitequark> to handle framing, I strongly suggest that you (a) rely on `iface.read()`'s internal buffering, and (b) avoid auto-flush

00:08 FFY00 has joined #glasgow

00:09 <d1b2> <Attie> okay, i'll take a look at these things

00:09 <whitequark> if you flush once you finish a frame, you will, in most circumstances, greatly reduce the amount of work that has to be done in Python

00:10 <d1b2> <Attie> good to know - as in line 176?

00:10 <whitequark> yes

00:10 <d1b2> <Attie> i presume there'll be a flush signal i need to assert then

00:10 <whitequark> yes

00:10 <whitequark> in_fifo.flush

00:11 <d1b2> <Attie> yup

00:11 <whitequark> though in this case, it's not strictly necessary--if you write enough bytes to the buffer it'll flush all on its own

00:11 <d1b2> <Attie> (i added auto_flush while working on the not-resetting-properly issue, thinking it was a startup flush / purge)

00:12 <whitequark> True is the default

00:12 <whitequark> you have to use False explicitly

00:12 <d1b2> <Attie> ok

00:13 <d1b2> <Attie> would you mind filling me in a bit on when w_rdy is set?

00:13 <d1b2> <Attie> if I don't wait for it, then things go really bad, which i imagine is to be expected

00:14 <whitequark> when the FIFO is not full

00:16 <whitequark> and yeah, if you write to the FIFO when it's not ready, it'll just drop that byte on the floor

00:16 <d1b2> <Attie> yeah

00:16 <whitequark> on the other hand, if you wait instead of writing, you'll lose sync

00:16 <d1b2> <Attie> yip

00:16 <d1b2> <Attie> which i think is where lots of my issues are

00:16 <d1b2> <Attie> i'll look into FFSynchronizer() and this more tomorrow

00:17 <d1b2> <Attie> thanks very much for your input!

00:17 <whitequark> currently the most reasonable way to handle this is to add another FIFO in front of the FIFO you get from the interface

00:17 <whitequark> hm, wait, no

00:17 <d1b2> <Attie> i was wondering about something like that

00:17 <d1b2> <Attie> oh

00:17 <whitequark> there are more ways than that

00:17 <whitequark> hmm

00:18 <whitequark> i would suggest adding a counter for the skipped samples that you have nowhere to write

00:18 <whitequark> thus splitting your subtarget into two FSMs

00:18 <whitequark> the first one either writes a sample or records the fact that it can't write a sample, the second one actually writes things

00:18 <d1b2> <Attie> I was going to add a register to hold that info, but didn't yet

00:19 <whitequark> so I don't recall how I2S works exactly

00:19 <d1b2> <Attie> can you calrify "writes a sample" vs "actually writes things"?

00:19 <whitequark> but I'm thinking something like the following might work...

00:19 <d1b2> <Attie> i2s is fundamentally: clock, word clock, and data... each channel is framed by the word clock, to give you sync

00:20 <whitequark> you encounter a new frame. while there is space in FIFO, you keep writing. once there is no more space in FIFO, you transition to a new, special, state where first you count how many samples are there in the frame that is in progress

00:20 <d1b2> <Attie> https://cdn.discordapp.com/attachments/613461005392936961/762469238493020200/unknown.png

00:20 <whitequark> it would work something like this

00:22 <whitequark> if stb_sample and !w_rdy then skipped++; if stb_sample and w_rdy then w_data=0; w_en=1; if !stb_sample and w_rdy then w_data=0; w_en=1; skipped--;

00:22 <whitequark> basically you ensure that the frame you already started will be padded with zeroes

00:22 <whitequark> since by now you have no way to escape from the frame, but you gotta finish it *somehow*

00:23 <d1b2> <Attie> okay

00:23 <whitequark> then you have a different issue, new frames starting when the FIFO is full

00:23 <whitequark> i *think* the most reasonable way to solve that one is as follows

00:24 <d1b2> <Attie> [side note: i just tried again with auto_flush=False, and got a perfect 5sec capture, and much more sensibly sized reads]

00:24 <d1b2> <Attie> [...and zero blinks out of LED0]

00:25 <whitequark> hm

00:25 <whitequark> actually the second issue is pretty tricky

00:26 <whitequark> honestly

00:26 <whitequark> if with auto_flush=False it works fine in practice, you might get away with "on overflow the applet borks itself and stays borked until restart"

00:26 <whitequark> this is what i did in a few other places

00:26 <d1b2> <Attie> i think it might be the way for this

00:26 <whitequark> handling overflow gracefully is really hard and the payoff can often be minimal

00:27 <d1b2> <Attie> yeah

00:27 <d1b2> <Attie> when flushing the fifo... what does that actually mean?

00:27 <whitequark> since that code is not only complex but is bug-prone as well

00:27 <d1b2> <Attie> it pushes data over USB?

00:27 <whitequark> so, glasgow applets use a stream-based interface, like TCP

00:27 <whitequark> but USB is packet-based, like UDP

00:27 <whitequark> you have to packetize somehow

00:27 <d1b2> <Attie> i was seeing incoming buffers of ~300 bytes, which feels small and poor wrt overhead

00:27 <whitequark> i didn't want the packetization to be directly user-visible because that sucks for a variety of reasons

00:28 <whitequark> for OUT packets i just use a FIFO

00:28 <d1b2> <Attie> now i see stable ~19kiB buffers

00:28 <whitequark> for IN packets, no dice; ideally you want sending full-sized packets, but you also sometimes want non-full-sized packets

00:28 <whitequark> the way it works is:

00:28 <d1b2> <Attie> *16KiB

00:29 <whitequark> with auto_flush=True, the FX2 crossbar stuffs the FX2-side buffers from the applet-side buffers while the latter fill up. when the applet-side buffer becomes empty, the crossbar cuts a new packet.

00:29 <whitequark> this works "the way people expect" in that it never results in in_fifo writes getting stuck forever in the FX2

00:29 <whitequark> so it's good for novices who program their first applet. but performance sucks.

00:30 <whitequark> with auto_flush=False, the FX2 crossbar always cuts a new packet when the FX2 512-byte buffer fills up

00:30 <whitequark> and also when flush is 1

00:30 <whitequark> (it also sends ZLPs as necessary)

00:30 <whitequark> in your case, since you never explicitly flush, you just get a stream of full-sized packets

00:30 <whitequark> then, the PC-side code has two more layers of buffering

00:31 <whitequark> first, it aggregates the 512-byte packets it gets from the device into something more substantial, because if it doesn't, throughput and latency would suck

00:31 <whitequark> that's the 16 KiB buffers you're seeing

00:32 <whitequark> it can only aggregate until it receives a non-full-sized packet (basically until the applet flushes), so maybe you *shouldn't* explicitly assert .flush here

00:32 <whitequark> the next layer of buffering is something similar to what you tried to do with your _read_fifo_in function

00:32 <d1b2> <Attie> ok... so when the fifo is "being flushed", is there a hold put on it or similar? could this explain why i was dropping so many samples while transferring relatively tiny packets?

00:33 <whitequark> well

00:33 <whitequark> the answer to your question as asked is "no", but basically yes

00:33 <d1b2> <Attie> [i just captured what "audibly sounded" like a perfect ~3.20 song]

00:33 <d1b2> <Attie> heh - a good answer then 🙂

00:33 <whitequark> and the answer is "yes" because, well, how do i put it

00:33 <whitequark> because USB is kinda badly designed

00:34 <whitequark> the HCD has to poll the device for each IN packet, and we use bulk transfers for reasons too complex to go into here

00:34 <whitequark> the HCD will repeatedly poll the device, modern ones even several times per microframe, as long as the device is sending full length packets

00:35 <whitequark> but when you don't send a full length packet, the HCD will not poll the device again in the same microframe, and in general it assumes, when scheduling, that you're done for a while

00:35 <whitequark> when this happens, the FX2-side buffers, which are fairly small (1K or 2K, depending on the OS, with one applet), fill up

00:35 <d1b2> <Attie> i see, yes that makes sense

00:35 <whitequark> when that happens, the FPGA-side buffer fills up

00:36 <whitequark> when that happens, you drop samples

00:37 <whitequark> the FPGA-side buffer is just 512 bytes by default

00:37 <whitequark> extending it is *usually* a sign something went wrong elsewhere

00:37 <whitequark> but it's occasionally truly necessary

00:38 <sorear> well you have 16KB total on the hx8k

00:38 <whitequark> yes, which is why it's so small by default

00:38 <whitequark> you might want to use those for other things. so one BRAM per EP it is

00:39 <d1b2> <Attie> great, thanks very much for the support wq!

00:39 <whitequark> np

00:39 <d1b2> <Attie> i seem to now have what is a "reasonably" working applet... starting to remove some of the crud

00:39 <whitequark> congrats

00:39 <d1b2> <Attie> and i should look into FFSynchronizer()

00:49 FFY00 has quit [Read error: Connection reset by peer]

00:53 <d1b2> <Attie> ... i should go, thanks again wq

00:53 <d1b2> <Attie> if you wanted to take a look over it as it stands, then please feel free - i just cleaned and pushed, TODOs at the top

00:54 <whitequark> tomorrow prob

00:54 <d1b2> <Attie> np... bye

01:44 _whitelogger has joined #glasgow

02:41 <_whitenotifier-f> [glasgow] brainstorm deleted a comment on issue #151: False positive results in selftest - https://git.io/JU5xM

03:18 electronic_eel has quit [Ping timeout: 265 seconds]

03:18 electronic_eel has joined #glasgow

03:52 PyroPeter_ has joined #glasgow

03:56 PyroPeter has quit [Ping timeout: 258 seconds]

03:56 PyroPeter_ is now known as PyroPeter

03:59 balrog has quit [Ping timeout: 260 seconds]

04:14 Stormwind_mobile has quit [Remote host closed the connection]

04:20 Stormwind_mobile has joined #glasgow

04:26 balrog has joined #glasgow

04:29 ma1 has quit [Quit: ma1]

04:47 _whitelogger has joined #glasgow

04:50 ma1 has joined #glasgow

05:07 jevinskie[m] has joined #glasgow

06:29 _whitelogger has joined #glasgow

08:19 <_whitenotifier-f> [glasgow] russss commented on pull request #210: Move factory flashing instructions and add basic example to README - https://git.io/JUFaR

08:48 Stormwind_mobile has quit [Read error: Connection reset by peer]

08:50 Stormwind_mobile has joined #glasgow

08:55 samlittlewood has joined #glasgow

09:56 Stormwind_mobile has quit [Ping timeout: 265 seconds]

10:02 _whitelogger has joined #glasgow

12:06 Stormwind_mobile has joined #glasgow

12:56 FFY00 has joined #glasgow

14:02 Sellerie has quit [Quit: The Lounge - https://thelounge.chat]

14:03 Sellerie has joined #glasgow

16:26 Stormwind_mobile has quit [Ping timeout: 260 seconds]

17:00 bvernoux has joined #glasgow

20:13 XgF has quit [Remote host closed the connection]

20:15 XgF has joined #glasgow

20:21 StM has joined #glasgow

20:23 whitequa1k has joined #glasgow

20:25 fridtjof[m] has quit [Ping timeout: 244 seconds]

20:25 smkz has quit [Ping timeout: 244 seconds]

20:25 russss has quit [Ping timeout: 244 seconds]

20:25 midnight has quit [Ping timeout: 244 seconds]

20:25 ZerataX1 has quit [Ping timeout: 244 seconds]

20:25 whitequark has quit [Ping timeout: 244 seconds]

20:25 StM_ has quit [Ping timeout: 244 seconds]

20:27 ZerataX1 has joined #glasgow

20:27 russss_ has joined #glasgow

20:27 russss_ has quit [Changing host]

20:27 smkz has joined #glasgow

20:27 russss_ is now known as russss

20:27 russss has joined #glasgow

20:27 russss has quit [Changing host]

20:27 russss has joined #glasgow

20:28 tomtastic has quit [Ping timeout: 246 seconds]

20:28 midnight has joined #glasgow

20:28 tomtastic has joined #glasgow

20:28 fridtjof[m] has joined #glasgow

21:33 bvernoux has quit [Quit: Leaving]

22:39 Stormwind_mobile has joined #glasgow

23:41 _whitelogger has joined #glasgow