##openfpga on 2019-07-04 — irc logs at freenode.irclog.whitequark.org

00:00 emeb_mac has joined ##openfpga

00:00 azonenberg_work has quit [Ping timeout: 272 seconds]

00:06 Richard_Simmons2 has joined ##openfpga

00:09 Richard_Simmons has quit [Ping timeout: 252 seconds]

00:28 dj_pi has quit [Ping timeout: 245 seconds]

00:28 gsi__ has joined ##openfpga

00:29 dj_pi has joined ##openfpga

00:31 gsi_ has quit [Ping timeout: 245 seconds]

01:07 <mithro> How have I not come across wikichip before? https://en.wikichip.org/wiki/WikiChip -- lots of interesting information there...

01:07 dj_pi has quit [Ping timeout: 246 seconds]

01:07 <whitequark> yeah, nice website

01:08 <mithro> I'm assuming it's not new?

01:08 dj_pi has joined ##openfpga

01:10 <mithro> I never realized that the metal layers have reduced pitch as they go up..... - https://en.wikichip.org/wiki/14_nm_lithography_process#IBM

01:13 <whitequark> my understanding is that the fine pitch masks cost so much that the fewer of them you need the better

01:14 oeuf has quit [Read error: Connection reset by peer]

01:15 oeuf has joined ##openfpga

01:18 dj_pi has quit [Ping timeout: 248 seconds]

01:26 <sorear> that’s my guess as well

01:27 <sorear> consider also that a long thin trace would have awful RC delays

01:37 <whitequark> indeed

02:07 <mithro> I mean it makes a lot of sense and also makes sense why top metal only changes are cheaper too....

02:14 * zignig reads the new boneless v3 code.

02:23 <whitequark> note that it is only a prototype yet

02:23 <whitequark> the general shape will be the same, the exact code not so much

02:38 <zignig> whitequark: understood, the structure is very different to v2.

02:39 <zignig> I have been working on a serial bootloader ( for the v2 core ) , that I will need to port once v3 has settled.

02:43 <azonenberg> mithro: there's other reasons too

02:43 <azonenberg> it also allows you to have two metal etch lines

02:43 <azonenberg> one for fine pitch and one for coarse

02:44 <azonenberg> you can run both in parallel with wafers at different steps of fab, but you can use older, less expensive hardware for the upper layers

02:44 <azonenberg> or even move wafers to an older fab for the upper layers

02:45 <azonenberg> some classified gov chips do exactly this, they outsource the std cells to e.g. TSMC because no US fabs can go that small, but in an obfuscated pattern so you can't infer much/anything about the chip's functionality

02:45 <azonenberg> then they use a less fancy but trusted fab, say IBM, to do the upper layers

02:46 <sorear> i wonder how "obfuscated pattern" compares to "standard gate array"

02:47 <sorear> gate arrays (non-programmable, field- or otherwise) are apparently still a thing but not a market I have much visibility into

03:10 flea86 has joined ##openfpga

03:31 <mithro> sorear: I think I have heard that called a "sea of gates"?

03:31 <sorear> I've heard that

03:49 vonnieda has joined ##openfpga

04:09 flea86 has quit [Quit: Goodbye and thanks for all the dirty sand ;-)]

04:55 rohitksingh_work has joined ##openfpga

05:08 _whitelogger has joined ##openfpga

05:12 OmniMancer has joined ##openfpga

05:32 gsi__ is now known as gsi_

06:13 craigjb has joined ##openfpga

06:14 craigjb_ has quit [Ping timeout: 246 seconds]

06:40 Maya-sama has joined ##openfpga

06:41 m4ssi has joined ##openfpga

07:02 emeb_mac has quit [Ping timeout: 245 seconds]

07:31 Maya-sama has quit [Ping timeout: 258 seconds]

07:33 <azonenberg> my understanding is that there is actual research into cryptographic means of doing exactly this

07:33 <azonenberg> basically figure out how to perturb a physical netlist to make it as hard as possible to figure out connectivity

07:34 <azonenberg> while not hurting timing more than X amount

07:35 <sorear> "logic encryption" is what usually gets studied and it's a harder problem where the adversary gets the complete netlist but not a small number of key bits that get fused later

07:43 Maya-sama has joined ##openfpga

07:54 Asu has joined ##openfpga

08:11 Maya-sama is now known as Miyu

08:40 Miyu has quit [Ping timeout: 244 seconds]

09:24 linzhi-sonia has joined ##openfpga

10:17 Miyu has joined ##openfpga

10:26 rohitksingh_work has quit [Ping timeout: 258 seconds]

10:37 rohitksingh_work has joined ##openfpga

10:45 rohitksingh_work has quit [Ping timeout: 245 seconds]

10:47 rohitksingh_work has joined ##openfpga

10:51 rohitksingh_wor1 has joined ##openfpga

10:53 rohitksingh_work has quit [Ping timeout: 246 seconds]

11:03 Dolu has quit [Ping timeout: 244 seconds]

11:11 rohitksingh_wor1 has quit [Ping timeout: 245 seconds]

11:38 rohitksingh_work has joined ##openfpga

11:41 Dolu has joined ##openfpga

12:21 Bike has joined ##openfpga

12:22 rohitksingh_work has quit [Ping timeout: 245 seconds]

12:25 rohitksingh_work has joined ##openfpga

12:56 cr1901_modern1 has joined ##openfpga

12:57 rohitksingh_work has quit [Read error: Connection reset by peer]

12:59 cr1901_modern has quit [Ping timeout: 258 seconds]

13:11 OmniMancer has quit [Quit: Leaving.]

13:46 rohitksingh has joined ##openfpga

13:46 rohitksingh_ has joined ##openfpga

13:49 rohitksingh_ has quit [Client Quit]

13:49 rohitksingh has quit [Client Quit]

13:49 rohitksingh has joined ##openfpga

13:57 renze has quit [Ping timeout: 245 seconds]

14:01 renze has joined ##openfpga

14:07 rohitksingh has quit [Ping timeout: 272 seconds]

14:16 rohitksingh has joined ##openfpga

14:41 rohitksingh has quit [Ping timeout: 258 seconds]

14:41 rohitksingh has joined ##openfpga

15:09 genii has joined ##openfpga

16:12 Asu has quit [Remote host closed the connection]

16:20 Asu has joined ##openfpga

16:56 emeb has joined ##openfpga

17:00 <emeb> fun getting an ILI9341 color LCD working on a picorv32 in a up5k using the hard SPI core.

17:28 <Xark> emeb: Neat. What kind of SPI clock speed can that use?

17:32 <emeb> Xark: the SPI IP core really needs at least a 1/3 baud rate divide from whatever the system clock is. I'm running it from 24MHz right now, so 8MHz sclk I guess. Plus, with the wishbone overhead and the status polling it slows down a bit more.

17:33 <Xark> emeb: Nice. IIRC the datasheep on those displays says 20Mhz (but that seems to be conservative).

17:34 <emeb> A full screen refresh on this 320x240 LCD is not instantaneous - you can see it wipe, but it's not awful. I suspect that if the SPI core were clocked at 48MHz it would be nicer, but the picorv32 doesn't really want to run faster than 24MHz on the up5k with the conservative timing that nextpnr uses.

17:36 <Xark> emeb: With 8Mhz the max speed is not super impressive (but decent). I did a fair bit of work to see how fast these could go on 16Mhz AVR (8Mhz SPI clock). However, a lot of "compute delays" between SPI data on AVR (a buffer helps a lot). Here is my AVR "speedy LCD" project (with benchmarks at the bottom): https://hackaday.io/project/6038-pdqgfx-optimzed-avr-lcd-graphics/details

17:36 m4ssi has quit [Remote host closed the connection]

17:37 <emeb> Cool. It's fun trying to get the most out of these constrained systems.

17:38 <emeb> Fun part of this is that I'm using both SPI cores - one for the flash memory and the other for the LCD. Next thing to try is copying images from flash to LCD.

17:38 <Xark> emeb: I got my 6502 firmware back operational (with color+gplyh output/scroll). However, now I notice top scan line isn't "right" (like line 0 shows data from middle of the screen - my fixes aren't quite perfect).

17:40 <emeb> Hmm... Well, debugging hardware is part of the fun. I need to study that design and think about how I'd fix the issue you discovered.

17:42 Bike has quit [Quit: Lost terminal]

17:50 <Xark> emeb: Of course. I had lots of these issues in my VGA text module for f32c when I was developing it (https://github.com/f32c/f32c/blob/master/rtl/soc/vgahdmi/VGA_textmode.vhd). I'll look at the code a bit more, but I need to get Verilog simulation set up.

17:51 degasus has left ##openfpga ["WeeChat 2.3"]

18:01 <Xark> emeb: I only notice top line issue with my other "fixes" and when scrolling entire 100x75 screen (like with LOAD 11 "maze").

18:04 <Xark> emeb: https://justpaste.it/2jy1v

18:05 cr1901_modern1 has quit [Quit: Leaving.]

18:05 cr1901_modern has joined ##openfpga

18:06 * Xark decides things are working well enough to think about putting an initial IceBreaker version on GitHub....

18:07 <emeb> Xark: Great - looking forward to seeing what you've done with it.

18:15 rohitksingh has quit [Ping timeout: 258 seconds]

18:15 emeb has quit [Quit: Leaving.]

18:20 emeb_mac has joined ##openfpga

19:41 Miyu has quit [Ping timeout: 258 seconds]

20:17 <tnt> ZipCPU: if you want to also debug my spi slave (well, AFAIK it works fine), feel free to :p https://github.com/smunaut/ice40-playground/blob/master/cores/spi_slave/rtl/spi_fast_core.v

20:18 <ZipCPU> ;)

20:18 <tnt> ZipCPU: Having it work on real hw rather than simulation is the trick though because you have in/out delays to/from outside of the FPGA that are in the order of magnitude of the clock period ...

20:18 <tnt> so unless you account for that, proving it in sim doesn't mean much.

20:19 <ZipCPU> Hmmm ....

20:23 <ZipCPU> tnt: The difficult part of the slave core you propose is that ... it's so hardware dependent

20:23 <ZipCPU> I'd rather create something that was a bit more hardware independent ... pure Verilog if you will

20:24 <tnt> ZipCPU: sure. I just couldn't come up with one that worked reliably at high SPI frequency.

20:25 <ZipCPU> The reality is that I'd love to place this on an iCE40 as well, so ... there is that in common

20:25 <ZipCPU> How fast was the SPI frequency you were working with?

20:25 <tnt> ~ 50 MHz

20:25 <ZipCPU> HX8k?

20:25 <tnt> UP5k

20:26 <ZipCPU> 50MHz should be reasonable

20:26 <ZipCPU> Indeed, you should be able to run at up to 100MHz SPI clock

20:26 <tnt> Yeah, the frequency isn't so much the problem. The two reasons I had to resort to hw dependent things are :

20:27 <whitequark> ZipCPU: this is why nMigen has a platform layer providing an abstraction over DDR primitives that is the same on every family.

20:27 <whitequark> pure Verilog just isn't enough for things like that

20:27 <tnt> (1) next pnr doesn't support delay constraint across clock domain and I needed to limit the possible skew between bits during CDC.

20:27 <ZipCPU> Technically, DDR from Verilog works just fine. Practically, its abysmal.

20:28 <tnt> (2) Couldn't use the IOB register (for various reason), so to get deterministick Clock-to-out and setup-hold times vs spi-clk pad, I need to manually lock where those signals go to / come from.

20:28 <tnt> Which again, you can only do by manual instanciations of low level primitives.

20:28 <ZipCPU> tnt: (1) Why? Why not just guarantee that there will be enough skew by holding the source constant while the valid goes through the synchronizer?

20:29 <whitequark> ZipCPU: the nMigen platform layer provides more guarantees about DDR than you can do in pure Verilog

20:29 <ZipCPU> (2) ... sadly, I understand this one. I keep "finding" bugs in code sent to me--things that work in version XYZ or Yosys but not xyz, and it usually turns out to be that clock-to-output issue

20:29 <whitequark> specifically, it synchronizes both samples to the next clk posedge

20:30 <tnt> ZipCPU: if I want to support system clock of a picrorv 32 at 12 MHz with a SPI at 60 MHz (untested yet, but that was the design target), I need to pass 1 data almost every 12 MHz clock ...

20:30 <whitequark> (by instantiating the appropriately configured primitive)

20:30 <ZipCPU> whitequark: I believe it, and I can understand why you do it as well. Tell, me though, can you do this across multiple hardware architectures (yet)?

20:30 <whitequark> ZipCPU: yes. this is currently done across ECP5, iCE40, Xilinx S7, Xilinx Spartan6, and there is a prototype for MachXO2 (not upstream yet)

20:31 <cr1901_modern> Spartan3 will also prob follow this afternoon

20:31 * cr1901_modern waves

20:31 <ZipCPU> tnt: All of my clock passing primitives are failing in formal for SPI speeds > 2x my system clock. I just can't seem to get the data through the 2ff synchronizers fast enough

20:31 <ZipCPU> whitequark: Forgive me for saying, but that's awesome!

20:32 <whitequark> ZipCPU: I have spent a lot of time looking at various FPGA primitives and found the configuration that is either natively supported, or can be very easily and reliably provided, on pretty much every interesting architecture.

20:32 <cr1901_modern> whitequark: I have an overwhelming urge to actually finish something today. So why not choose the task w/ that seems like the least amount of work and watch it gloriously blow up in my face :P

20:32 <whitequark> you *do* have to consider pipelining in the input buffer

20:32 <cr1901_modern> (there will be a PR hopefully tonight)

20:32 <whitequark> i.e. both posedge and negedge sample are re-registered, delaying them by 1 clock.

20:33 <whitequark> but that is a small price to pay for I/O that is (a) portable, (b) is entirely in a single clock domain

20:33 <ZipCPU> The input buffer ... that's one thing I'm not examining at this point

20:33 <tnt> whitequark: I don't really see why you couldn't write a HAL layer in verilog ... (my last company had one in VHDL that abstracted BRAMS / Multipliers / ... to allow instanciation and napping to different vendor/architecture depending on the compile time option).

20:33 <whitequark> tnt: you could, absolutely

20:34 <whitequark> but you can't *rely* on it if you're shipping Verilog code

20:34 <whitequark> whereas nMigen guarantees that you can use that, and also gives you a specific interface to write code against

20:35 <tnt> Sure, you'd need to ship it with the code ... (or as a library/dependency whatever). And I agress it's very nice than nmigen comes with "batteries included" with things like this HAL or FIFOs or memories or that kind of stuff "ready out of the box".

20:35 <whitequark> ZipCPU: here by "input buffer" I mean something more abstract than the physical buffer on the FPGA.

20:35 <ZipCPU> Ok, then ... I'm confused

20:36 <ZipCPU> That's what I had thought you were talking about

20:36 <whitequark> let me try to explain

20:36 * ZipCPU remembers Prince Bride, "No, there is too much. Let me sum up." :D

20:37 <whitequark> if you ask nMigen for a DDR I/O, it essentially gives you a bunch of wires (like an SV struct, maybe?) that follow a specific contract. for example, if you ask it for a DDR input, it gives you "clk", "i0" and "i1".

20:38 <ZipCPU> ... and it automatically maps it to a hardware abstraction layer, sort of like tnt was suggesting?

20:38 <whitequark> the contract is that, provided that you drive "clk" from the same domain as your logic is in, on each "clk" posedge, "i1" contains the value at the previous negedge, and "i0" at the posedge before that (i.e. with 1 clock delay).

20:38 <whitequark> yes. in practice, nMigen has a hardware abstraction layer it uses to implement that.

20:39 <whitequark> but that should not matter very much if you are following its contract.

20:39 <tnt> I might have lost a step here, but why are DDR primitives needed at all here ?

20:39 <whitequark> on twitter, ZipCPU was talking about MOSI and MISO changing on different clock edges

20:39 <whitequark> SCK edges*

20:40 <whitequark> of course nMigen provides SDR primitives as well

20:40 <ZipCPU> Ooohhh, I had missed the connection ... that makes a lot more sense now, thanks!

20:40 <ZipCPU> The problem, though, is that ... I'm not driving SCK. This is a slave SPI port

20:40 <ZipCPU> SCK is being driven externally

20:41 <whitequark> I see. I would use a similar approach here as well.

20:41 Asu has quit [Remote host closed the connection]

20:42 <whitequark> I would use a DDR primitive configured such that the input data is valid on SCK posedge, and have a domain clocked by SCK. then any desirable CDC primitive transferring data to the system domain.

20:42 <tnt> sure, but (1) you're outputting on one and capturing on the other so that's not really DDR. (2) in the IO timing analysis I did, when you account for the skew between clock path and data path, you're better off capturing MOSI on the rising edge as well (the data capture valid window will be somewhere between falling and rising edge)

20:43 <ZipCPU> ^ +1

20:44 OmniMancer has joined ##openfpga

20:45 <ZipCPU> Where I could see DDR primitives becoming valuable here is if you took all of the incoming SCK signals and sent them through a DDR primitive

20:45 <ZipCPU> ... that is, with the system clock controlling the DDR primitive

20:45 <ZipCPU> That would spare you a cycle of clock synchronization time, would it not?

20:45 <whitequark> you could oversample too.

20:46 <ZipCPU> That's essentially what I'm describing, only using the DDR primitive to do it

20:46 <whitequark> (side note: one thing I want to provide in nMigen is an abstraction for low-speed SERDES... like Xilinx ISERDES and Lattice IDDRX2)

20:47 <whitequark> ZipCPU: yeah, I meant that you're describing oversampling :)

21:09 Asu has joined ##openfpga

21:15 Asu has quit [Remote host closed the connection]

21:37 Asu has joined ##openfpga

21:39 genii has quit [Remote host closed the connection]

21:42 Asu has quit [Remote host closed the connection]

21:44 Asu has joined ##openfpga

21:49 Asu has quit [Remote host closed the connection]

21:50 Asu has joined ##openfpga

21:51 Asu has quit [Client Quit]

22:43 Richard_Simmons2 has quit [Ping timeout: 252 seconds]

22:51 Richard_Simmons2 has joined ##openfpga

22:51 Richard_Simmons2 has quit [Remote host closed the connection]

22:51 Richard_Simmons2 has joined ##openfpga

22:59 sgstair has quit [Read error: Connection reset by peer]

22:59 sgstair has joined ##openfpga

23:12 Zorix has quit [Read error: Connection reset by peer]

23:29 <Xark> emeb_mac: Hi. I was thinking I found another minor issue with PS/2 ASCII decoding with "|" and "}" keys, but I found out that OSI messed up the ASCII character order in the font (D'oh). :)

23:30 <Xark> emeb_mac: I am tempted to "fix" the font - since "OSI compatiblity" is mostly gone with larger screen (and other differences).

23:35 * Xark does note that OSI didn't even have keys for those characters (in their defense). Few C programmers on OSI back then... :)

23:35 Zorix has joined ##openfpga

23:36 <Xark> emeb_mac: I also found that "--timing-allow-fail" is handy for your "false path" timing error (since typing make twice gets old).

23:49 <emeb_mac> Xark: That's weird - hadn't noticed it

23:50 <emeb_mac> but then those characters don't often come up in Basic

23:51 <emeb_mac> also, thanks for the --timing... suggestion. Will add that to the make