##openfpga on 2019-07-21 — irc logs at freenode.irclog.whitequark.org

00:41 _whitelogger has joined ##openfpga

00:46 emeb has quit [Quit: Leaving.]

00:51 emeb_mac has joined ##openfpga

03:05 Maylay has quit [Quit: Pipe Terminated]

03:07 Maylay has joined ##openfpga

04:18 conmega has joined ##openfpga

04:21 <conmega> Hey everyone o/

04:48 <conmega> Got a TinyFPGA BX a while ago and decided to port a project I was working on with an Altera board in VHDL over to Verilog with this board. It's to drive an EL display, so I have a video memory. I need 9600 bytes of storage to fit the video RAM, and the BX should have 16KB, so I should be within the realm of fitting on the board. But the video data is in 4-bit chunks and I'd like to be able to write to the display in bytes. So to start I made an 8-bit wide reg with 9600 positions, then just added a statement to address either the top nibble or bottom nibble based off the LSB of the address for the video generation logic. This gets stuck in PNR for hours... Tried changing it to a 4-bit register with 19200 positions and just addressing it straight, works fine. Any ideas?

04:48 <conmega> https://pastebin.com/h75F8nrB

05:04 <tnt> conmega: idea about what ?

05:05 <tnt> conmega: RAM inference is often very dumb, so unless your Verilog matches the very precise pattern it's designed to recognize, it won't infer a RAM and will use flip-flops instead.

05:12 <conmega> Well, OK, I suppose I should be clearer: it just fails to route if I try to access only 4 bits of an 8-bit wide memory. If I access all 8 bits of an 8-bit reg or 4 bits of a 4-bit reg, it can route. Also, I was wondering about that, but I can't find any other way people are defining block RAM other than just a massive reg statement. I checked the datasheet and there are a few ways to address the block RAM, down to 4Kb chunks in either 16, 8, 4, or 2-bit wide configurations. So it seems to have difficulty routing my if statement, basically.
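
As a rough check of the sizes in this exchange (assuming "4Kb" means 4096 bits per block and "16KB" means 16 × 1024 bytes; neither is spelled out in the log), the byte-wide and nibble-wide layouts hold the same number of bits, and either one needs about 19 of the 32 blocks:

$$
\begin{aligned}
9600 \times 8 &= 19200 \times 4 = 76800 \text{ bits} \\
\lceil 76800 / 4096 \rceil &= \lceil 18.75 \rceil = 19 \text{ blocks} \\
(16 \times 1024 \times 8) / 4096 &= 32 \text{ blocks available}
\end{aligned}
$$

Under those assumptions capacity alone does not explain the failure, since both layouts fit.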

05:16 <conmega> So I suppose you're saying that unless the Verilog is written in a specific way to address the reg, the place and router won't know to use BRAM? I suppose I could write it to just be an 8-bit block RAM accessible via all 8 bits, then have another module that splits the data up. Maybe separating it like that will make it happier?

05:19 <tnt> What's wrong with the "THIS WORKS" version in your pastebin ?

05:20 Miyu has joined ##openfpga

05:21 <conmega> It's not the format of data I want. I excluded some other code for a second port which I hadn't gotten to yet. But I'm basically trying to get a RAM that allows writing data 8 bits at a time and reading 4 bits at a time. I figured the easiest way to do this would be to just grab either the top nibble or bottom nibble based on the LSB of the address in. Then I was going to add write logic to write 8 bits into the 8-bit wide RAM.

05:22 <conmega> I simply made the part that works as a simple test to ensure it will actually allow me to use that much BRAM and actually route and place on the chip.

05:28 <tnt> Then yeah, describe the ram as 8 bits, and on the side, in another process, take the 8 bits you read from the RAM and add a mux to select which nibble to use.

05:31 <conmega> Ok, I'll give that a go, thanks.

05:33 <tnt> conmega: https://pastebin.com/YSmFmhbk

05:33 <tnt> this works

05:34 <tnt> (oh it's missing a "reg lsb;" declaration)
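
The pastebin contents are not reproduced in this log, so the following is only a minimal sketch of the structure tnt describes at 05:28: a plain byte-wide memory with a synchronous read, and a separate mux that picks a nibble from the byte that was read. The module name, port names, address widths, and the choice that an LSB of 1 selects the top nibble are all assumptions made for illustration.

```verilog
// Hypothetical names throughout; this is not the code from either pastebin.
module vram_nibble_read (
    input  wire        clk,
    input  wire        we,      // write enable for the byte-wide port
    input  wire [13:0] waddr,   // byte address, 0..9599
    input  wire [7:0]  wdata,
    input  wire [14:0] raddr,   // nibble address, 0..19199
    output wire [3:0]  rdata    // valid one clock after raddr is presented
);
    // Plain byte-wide memory: the part meant to be recognized as block RAM.
    reg [7:0] mem [0:9599];
    reg [7:0] rbyte;
    reg       lsb;

    always @(posedge clk) begin
        if (we)
            mem[waddr] <= wdata;
        rbyte <= mem[raddr[14:1]];  // synchronous, full-width read
        lsb   <= raddr[0];          // delay the select so it lines up with rbyte
    end

    // Nibble select kept outside the memory access itself.
    // Assumption: LSB = 1 selects the top nibble.
    assign rdata = lsb ? rbyte[7:4] : rbyte[3:0];
endmodule
```

Registering `lsb` alongside the read is what keeps the select aligned with the data, which is presumably why the `reg lsb;` declaration mentioned at 05:34 is needed.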

05:38 <conmega> Hah, awesome, works beautifully! Thanks a bunch. I can see how the way you wrote it better describes the logic I meant to define...

05:43 <tnt> Yeah, that's why it's called 'Hardware Description Language', you should always try to describe the hardware you want built rather than the behavior you want.

05:45 <conmega> Yea... Bit of a learning curve heh... I've dabbled in a little bit of everything from ASM to Python but still quite new to HDL.

05:47 <conmega> https://media.discordapp.net/attachments/424014648757452800/602375900960522284/DSC_2339.JPG

05:47 <conmega> This is what that is for by the way :)

05:47 <conmega> An older EL display with a 4-bit data, Video clock, Hsync, Vsync interface. Seemed like a reasonable project to get started with some HDL.

05:48 <tnt> That's a fancy EL display :)

05:49 <tnt> the 4 bits are just consecutive pixels right ? not intensity ?

05:55 <conmega> Yea, this EL has no concept of intensity, but it can be driven at a 120 Hz refresh, so you could theoretically get intensity by toggling the bit across 4 consecutive refreshes to get 30 FPS with maybe a few levels of intensity. But it'll probably not look quite right and require a lot more RAM.
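
The intensity idea above amounts to lighting a pixel in some number of the 4 refreshes that make up one 30 FPS frame (120 / 4 = 30). A minimal sketch, assuming a hypothetical `subframe` counter that steps 0..3 once per refresh and a per-pixel `level` of 0..4:

```verilog
// Hypothetical sketch; signal names are illustrative, not from conmega's design.
module subframe_intensity (
    input  wire [1:0] subframe,  // 0..3, advances once per 120 Hz refresh
    input  wire [2:0] level,     // requested intensity, 0 (off) to 4 (always on)
    output wire       pixel_on
);
    // The pixel is lit in the first `level` subframes of each group of four.
    // Values of level above 4 behave the same as 4.
    assign pixel_on = (subframe < level);
endmodule
```

Five levels need 3 bits per pixel instead of 1, which is where the extra RAM goes. Whether the perceived brightness tracks the number of lit subframes is not something the log establishes.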

05:57 <conmega> But yea, really loving HDL once I figured out how to write test benches haha. Before that it's like ripping your hair out poking at pins with the logic analyzer/scope heh...

06:40 Thorn has quit [Read error: Connection reset by peer]

06:43 Thorn has joined ##openfpga

07:04 emeb_mac has quit [Ping timeout: 248 seconds]

08:06 <xobs> Update on valentyusb: I think I found the source of metastability. Possibly in the io -> usb_48 crossing, and almost certainly in the usb_12 -> usb_48 crossing. I'm synthesizing a bunch of tests now to make sure it'll work with these fixes.

09:16 <tnt> xobs: did you ever switch to using IO registers ?

09:17 <xobs> tnt: I'm not quite sure what you mean, but I'm using a two-register chain on the inputs now.

09:18 <tnt> IO blocks have input / output registers built-in.
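
For reference, the "two-register chain" xobs mentions at 09:17 is commonly written along the following lines. This is a generic single-bit sketch with made-up names, not code from valentyusb, and it does not show the IO-block registers tnt is asking about.

```verilog
// Generic two-flop synchronizer; module and signal names are hypothetical.
module sync_2ff (
    input  wire clk,       // destination clock domain
    input  wire async_in,  // single-bit signal from a pin or another domain
    output wire sync_out
);
    reg [1:0] chain = 2'b00;

    always @(posedge clk)
        chain <= {chain[0], async_in};  // stage 0 samples the input, stage 1 resamples stage 0

    assign sync_out = chain[1];
endmodule
```

A chain like this is only suitable for single-bit signals; a multi-bit value crossing between domains needs a different scheme, because each bit may settle on a different clock edge.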

09:19 <whitequark> < tnt> Yeah, that's why it's called 'Hardware Description Language'

09:19 <whitequark> but in a Verilog, you describe hardware "behaviorally"..

09:19 <whitequark> ... or are supposed to, anyway, I agree that it doesn't work very well :)

09:20 <tnt> whitequark: yup, it sucks :) You basically play a game of writing the behavioral patterns that you know will match the pattern recognition of the synthesizer to generate the hardware you want.

09:21 <whitequark> ... or you use nMigen, which outputs a structural description instead

09:22 <tnt> well that's the input that matters, not the output.

09:22 <whitequark> oh?

09:24 <tnt> I might just be mis-expressing what I mean, but I'm busy atm, gotta finish something.

09:24 <whitequark> yeah sure

09:45 <daveshah> The biggest problem with BRAM inference is not that you need to match an exact pattern, but that there is no standard for what that pattern is, imo

09:45 <daveshah> Anything remotely non-standard is unlikely to work in anything other than the synthesis tool it was written for

09:46 <whitequark> yeah

09:46 <whitequark> if the pattern is standardized then the behavioral and structural verilog that only use standard patterns are semantically close to equivalent

09:47 Asu has joined ##openfpga

09:49 <daveshah> This is quite an FPGA-specific problem (not unimportant though), I don't think the ASIC world really has BRAM inference

09:49 <daveshah> Small RAMs are bit-blasted anyway, large RAMs are built with a memory compiler and dropped in

09:58 <mwk> bit-blasted?

09:58 <daveshah> Converted to logic and DFFs

09:58 <mwk> ah

10:36 bibor has quit [Ping timeout: 268 seconds]

12:44 _whitelogger has joined ##openfpga

13:03 <xobs> "ICESTORM_LC: 5278/ 5280 99%" <--- How did this manage to route /and/ meet timing?

13:05 <whitequark> lots of registers?

13:07 <xobs> Must be.

13:08 <daveshah> iCE40 also have more routing resources per LC than many other FPGAs

13:08 <xobs> Yeah. It really is impressive.

13:09 <whitequark> they're so stringy on SEM

13:09 <tnt> Well your target is 12 MHz ...

13:09 <xobs> tnt: the domain that usually has trouble meeting timing is the 48 MHz one.

13:09 <daveshah> Routing resource density is mostly a function of how much confidence the vendor has in their PnR flow

13:09 <whitequark> ha

13:09 lutsabound has joined ##openfpga

13:09 <daveshah> Hence the Ultrascale having ridiculously sparse routing compared to iCE40

13:10 <tnt> lol

13:10 <daveshah> For a startup like SiliconBlue a bit more VC cash on a larger die is much easier than finding coders for better PnR algorithms

13:38 pie_ has quit [Ping timeout: 276 seconds]

13:38 pie_ has joined ##openfpga

13:39 pie_ has quit [Remote host closed the connection]

13:40 pie_ has joined ##openfpga

13:41 pie_ has quit [Remote host closed the connection]

13:41 pie_ has joined ##openfpga

13:44 Asu has quit [Remote host closed the connection]

13:45 Asu has joined ##openfpga

13:51 <ZirconiumX> daveshah: tangentially related, why was nextpnr created if arachne-pnr already existed?

13:52 <daveshah> ZirconiumX: To add support for multiple architectures (eg ECP5), timing driven pnr, and a framework for developing more advanced pnr algorithms

13:53 <ZirconiumX> So arachne-pnr wasn't flexible enough?

13:54 <daveshah> No it was really hard coded as a first pnr for the iCE40

13:54 <ZirconiumX> Okay, that's fair

13:55 <daveshah> There was nowhere near enough of arachne-pnr that would be kept to be worth using it as a base

13:55 wpwrak has quit [Ping timeout: 245 seconds]

13:58 wpwrak has joined ##openfpga

14:10 Miyu has quit [Read error: Connection reset by peer]

14:10 Miyu has joined ##openfpga

14:18 Asu has quit [Ping timeout: 245 seconds]

14:18 Asu` has joined ##openfpga

15:03 cr1901 has quit [Quit: Leaving.]

15:03 cr1901 has joined ##openfpga

16:02 Jybz has joined ##openfpga

16:17 Jybz has quit [Quit: Konversation terminated!]

16:36 cr1901 has quit [Quit: Leaving.]

16:36 cr1901 has joined ##openfpga

17:23 Asu` has quit [Remote host closed the connection]

17:24 Asu has joined ##openfpga

18:07 anuejn has quit [Quit: No Ping reply in 180 seconds.]

18:11 tnt has quit [Ping timeout: 248 seconds]

18:12 tnt has joined ##openfpga

18:12 vup has quit [Ping timeout: 272 seconds]

18:13 vup has joined ##openfpga

18:13 anuejn has joined ##openfpga

18:19 vup has quit [Remote host closed the connection]

18:21 vup has joined ##openfpga

18:22 anuejn has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

18:22 vup has quit [Client Quit]

18:24 anuejn has joined ##openfpga

18:24 vup has joined ##openfpga

18:25 Zorix has quit [Quit: Leaving]

18:29 Zorix has joined ##openfpga

19:00 <azonenberg> daveshah: yeah i have yet to see an asic tool that can infer srams from rtl

19:00 <azonenberg> What i do in FPGA is have a "MemoryMacro" module for any nontrivial size ram

19:01 <azonenberg> that has parameters to specify target as block, ultraram, lutram, etc

19:01 <azonenberg> width, depth, number of pipeline stages

19:01 <azonenberg> inside is a behavioral description of a ram that is designed to match the synthesis tool's patterns

19:02 <azonenberg> then you can tweak that one module as needed if you move to a new fpga
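
The wrapper approach described above can be sketched as follows. This is a hypothetical, simplified module in the spirit of the "MemoryMacro" idea; the name, parameters, and ports are assumptions rather than azonenberg's actual code, and the target selection (block, UltraRAM, LUT RAM) is left out because it relies on tool-specific attributes.

```verilog
// Hypothetical wrapper; everything here is illustrative.
module memory_macro #(
    parameter WIDTH     = 8,
    parameter DEPTH     = 512,
    parameter ADDR_BITS = 9,   // must satisfy 2**ADDR_BITS >= DEPTH
    parameter OUT_REG   = 0    // 1 adds one extra output pipeline stage
) (
    input  wire                 clk,
    input  wire                 wr_en,
    input  wire [ADDR_BITS-1:0] wr_addr,
    input  wire [WIDTH-1:0]     wr_data,
    input  wire [ADDR_BITS-1:0] rd_addr,
    output wire [WIDTH-1:0]     rd_data
);
    reg [WIDTH-1:0] mem [0:DEPTH-1];
    reg [WIDTH-1:0] rd_stage0;
    reg [WIDTH-1:0] rd_stage1;

    always @(posedge clk) begin
        if (wr_en)
            mem[wr_addr] <= wr_data;
        rd_stage0 <= mem[rd_addr];  // synchronous read
        rd_stage1 <= rd_stage0;     // optional second stage, unused if OUT_REG = 0
    end

    assign rd_data = OUT_REG ? rd_stage1 : rd_stage0;
endmodule
```

The point of the pattern is that the rest of the design instantiates only this module, so when a new toolchain expects a different inference template, only the body of the wrapper has to change.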

19:09 Miyu has quit [Read error: Connection reset by peer]

19:09 Miyu has joined ##openfpga

19:10 <whitequark> daveshah: do you remember if there's some FPGA family that has native latches?

19:11 <whitequark> i remember there was some weird family of FPGAs (chinese maybe?) that was LUT-only and used latches as storage

19:11 <whitequark> but i can't find it

19:12 <whitequark> wait. is that just efinix trion

19:13 <tnt> You mean modern ones? I know coolrunner had latches (although I guess they're cpld not really fpga)

19:13 <whitequark> yeah modern

19:15 <whitequark> hm, no, trion is a regular 4-lut arch

19:15 <whitequark> 4-lutff

19:23 <daveshah> whitequark: all xilinx FPGAs have native latches too

19:23 <daveshah> Technically speaking ECP5 does too (asynchronous data load mode on the FFs), but it's not exposed in the vendor tools or really in nextpnr either

19:23 <daveshah> As the vendor tools don't use it I suspect it is broken in some subtle way

19:24 <whitequark> daveshah: so the story goes is emily wants some FPGA family to target with asynchronous logic

19:24 <whitequark> and i remember there was *some* family with guaranteed glitchless LUTs

19:24 <whitequark> but i can't find out what it is

19:24 <azonenberg> i've never seen one that specifies that

19:24 <azonenberg> maybe an older achronix part?

19:24 <azonenberg> that was their big thing for a while

19:24 <whitequark> rings a bell

19:25 <azonenberg> they were trying to be the async fpga

19:25 <azonenberg> But these days I think their chips are more general purpose, they realized async doesn't really work with current EDA tools

19:25 <whitequark> well the point here is to develop better eda tools

19:32 <daveshah> I suspect glitch free logic should be doable even with any FPGA arch LUTs if you are careful about input wire delays

19:32 <daveshah> But some characterisation might be needed first

19:33 <whitequark> yeah, ice40 would be interesting

19:34 <daveshah> iCE40 probably makes most sense because of its low static power

19:34 <azonenberg> I want to spend more time characterizing greenpak first because it's slow and cheap

19:34 <azonenberg> then move up to something higher end like ice40

19:34 <whitequark> sentence no one in the world has ever said before

19:39 <tnt> Damn, I was surprised measuring HFOSC of UP5k on a random board, it was within 3ppm of 48 MHz. Then I tried on another board and it's like 2000ppm off. Random sampling ...

19:39 <daveshah> Still, I've never found one near the 5% or whatever the datasheet says

19:39 <daveshah> But I guess that has to cover aging and temperature
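
To put the figures in this exchange on one scale (taking the 5% as daveshah's recollection, not a checked datasheet value), the offsets from a nominal 48 MHz work out to:

$$
\begin{aligned}
48\,\text{MHz} \times 3 \times 10^{-6} &= 144\,\text{Hz} \\
48\,\text{MHz} \times 2000 \times 10^{-6} &= 96\,\text{kHz} \quad (0.2\%) \\
48\,\text{MHz} \times 0.05 &= 2.4\,\text{MHz} \quad (50000\,\text{ppm})
\end{aligned}
$$

So even the worse of the two boards is about 25 times tighter than a 5% tolerance would allow.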

19:50 Miyu has quit [Ping timeout: 276 seconds]

20:10 Richard_Simmons has joined ##openfpga

20:13 Bob_Dole has quit [Ping timeout: 264 seconds]

20:41 emeb_mac has joined ##openfpga

20:49 lutsabound has quit [Quit: Connection closed for inactivity]

21:47 Asu has quit [Quit: Konversation terminated!]

21:52 feuerrot has quit [Ping timeout: 245 seconds]

21:57 feuerrot has joined ##openfpga