##openfpga on 2019-10-20 — irc logs at freenode.irclog.whitequark.org

01:36 X-Scale` has joined ##openfpga

01:38 X-Scale has quit [Ping timeout: 268 seconds]

01:38 X-Scale` is now known as X-Scale

01:49 inoor has joined ##openfpga

01:58 emeb_mac has joined ##openfpga

02:17 X-Scale has quit [Ping timeout: 265 seconds]

02:22 X-Scale has joined ##openfpga

02:31 finsternis has quit [Excess Flood]

02:31 finsternis has joined ##openfpga

03:45 OmniMancer has joined ##openfpga

03:47 rohitksingh has quit [Ping timeout: 264 seconds]

04:08 Bike has quit [Quit: Lost terminal]

04:17 _whitelogger has joined ##openfpga

04:30 freemint has quit [Ping timeout: 268 seconds]

04:53 _whitelogger has joined ##openfpga

04:57 rohitksingh has joined ##openfpga

05:22 rohitksingh has quit [Ping timeout: 250 seconds]

05:23 rohitksingh has joined ##openfpga

05:25 jemk has quit [Ping timeout: 246 seconds]

05:25 jemk has joined ##openfpga

05:56 Jybz has joined ##openfpga

06:05 _whitelogger has joined ##openfpga

06:37 _whitelogger has joined ##openfpga

06:51 inoor has quit [Quit: inoor]

06:59 emeb_mac has quit [Quit: Leaving.]

08:03 Asu has joined ##openfpga

09:49 bwidawks has joined ##openfpga

09:49 bwidawsk has quit [Ping timeout: 250 seconds]

09:50 bwidawks is now known as bwidawsk

09:52 SpaceCoaster has quit [Ping timeout: 250 seconds]

09:52 SpaceCoaster has joined ##openfpga

10:07 freemint has joined ##openfpga

10:34 rohitksingh has quit [Ping timeout: 250 seconds]

11:16 rombik_su has joined ##openfpga

11:42 <ZirconiumX> I think one of the most important skills for FPGA work is having good internal estimates for how big something should be

11:43 <daveshah> 10% bigger than the FPGA you are using :p

11:46 <ZirconiumX> Like, I can synthesise a pixel pipeline for ECP5 and get a design with 736 LUT4s and 1489 FFs, but I have no idea if that's good or inefficient for this

11:51 <rombik_su> Do you mean, like, how efficient your RTL code is? Or how well it can be inferred by a particular tool?

11:52 <qu1j0t3> daveshah: :-)

11:53 <ZirconiumX> rombik_su: The former

12:13 _whitelogger has joined ##openfpga

12:25 Bike has joined ##openfpga

14:51 freemint has quit [Ping timeout: 245 seconds]

14:57 lutsabound has joined ##openfpga

15:44 OmniMancer has quit [Quit: Leaving.]

15:45 emeb has joined ##openfpga

16:31 balrog has quit [Quit: Bye]

16:37 balrog has joined ##openfpga

16:48 <sorear> “pixel pipeline” is not a single well defined thing

17:05 <ZirconiumX> Sure, but I can send you the reference I'm working from plus the source code

17:33 rohitksingh has joined ##openfpga

17:46 <hackerfoo> There's probably not a good answer for how big something should be because there are many tradeoffs.

17:47 <hackerfoo> And it doesn't matter as long as it does what you want running on the hardware you have.

17:53 <ZirconiumX> Given that there is barely any size difference between one pipeline and sixteen, I'm pretty sure I have a bug somewhere...

19:01 show has quit [Quit: WeeChat 2.5]

19:03 forksand has quit [Ping timeout: 265 seconds]

19:04 show has joined ##openfpga

19:17 forksand has joined ##openfpga

19:28 Jybz has quit [Quit: Konversation terminated!]

19:42 mumptai has joined ##openfpga

20:39 emeb_mac has joined ##openfpga

20:39 Asu has quit [Ping timeout: 268 seconds]

20:49 rombik_su has quit [Quit: Leaving]

20:57 Bob_Dole has joined ##openfpga

21:09 zkms has quit [Quit: zkms]

21:10 zkms has joined ##openfpga

21:16 Asu has joined ##openfpga

21:16 <ZirconiumX> So, I'm presuming when your design synthesises to 24k DFFs, you need to look at optimising it

21:17 <ZirconiumX> I mean, that's "only" 1.5k DFFs per pipeline, but still

21:18 <daveshah> Well, depends what your design is

21:18 <daveshah> If a Core i9 CPU synthesised to 24k DFFs then I'd say there'd be little need to optimise further

21:19 <ZirconiumX> 16-pixel GPU pipeline

21:19 <ZirconiumX> Really I should build a front-end for it (so that Quartus won't yell at me for it not fitting into the I/O pins of my chip) and then try to meet timing

21:20 Asu has quit [Remote host closed the connection]

21:20 <daveshah> Does Intel have hard shift registers?

21:20 <ZirconiumX> Not that I know of

21:20 <daveshah> Xilinx has them and they make pipeline delays much more efficient subject to some constraints (eg no reset)

21:22 <ZirconiumX> I suppose we shall see

21:23 zkms has quit [Ping timeout: 276 seconds]

21:25 <sorear> so lattice gives you a single nearly-free DFF after each LUT

21:26 <sorear> I think, retiming would allow pipeline stages to be hidden in the preceding or following logic in many cases?

21:28 zkms has joined ##openfpga

21:33 <whitequark> yes

21:36 <daveshah> I was thinking about ZirconiumX's previous comment about 736 LUT4s and 1489 FFs so that wouldn't work

21:36 <ZirconiumX> That's for a single pipeline (so far)

21:36 <daveshah> And indeed on ECP5 in theory the LUTs and FFs are usable separately

21:36 <daveshah> So no need for retiming

21:36 <daveshah> Although routing congestion requires some care as to packing density

21:37 <ZirconiumX> On Cyclone V, you can use LUTs and spare DFFs separately

21:38 <sorear> can you say what exactly you're counting when you say "16 pixels"?

21:39 <sorear> was this the PS2 emulator or am I thinking of someone else's project?

21:45 <ZirconiumX> This is part of the PS2 GPU emulator, yeah

21:47 <ZirconiumX> sorear: each pipeline is RGBA32, plus X/Y/Z, texture coordinates and channel culling

21:47 <ZirconiumX> (I'm calling it culling; I don't know the correct term for it, but you can selectively not overwrite certain channels)

21:50 wpwrak has quit [Ping timeout: 240 seconds]

21:50 wpwrak has joined ##openfpga

21:53 <mwk> masking?

22:13 lopsided98 has quit [Quit: Disconnected]

22:18 lopsided98 has joined ##openfpga

22:22 mumptai has quit [Remote host closed the connection]

22:28 freemint has joined ##openfpga

22:39 emeb has quit [Quit: Leaving.]

22:46 <adamgreig> what document is best for understanding ecp5 I/O pads? specifically the available primitives and their attributes etc. I've got tn 2032 but it seems a bit lacking...

22:47 <adamgreig> i'd really like the ecp5 equivalent of the ice40 "technology library" doc

22:47 <adamgreig> or if anyone knows if lvds pairs (non-serdes) can be runtime swapped from in to out, that would be good too :p

22:50 <daveshah> adamgreig: the direct equivalent is http://www.latticesemi.com/view_document?document_id=52656

22:51 <daveshah> Just using a regular bidirectional IO in LVDS mode is fine

22:51 <daveshah> Unlike Xilinx there's no need to use a different primitive for differential IOs

22:55 <adamgreig> ah cool, thanks!

22:55 <adamgreig> that doc looks perfect

22:56 <daveshah> The primitive you want to use is probably "BB"

22:58 <adamgreig> I see there's ILVDS and OLVDS

23:01 zkms has quit [Quit: zkms]

23:02 zkms has joined ##openfpga

23:03 <daveshah> Those primitives are redundant though

23:03 <daveshah> There's no reason to use them over a single ended primitive and IOSTANDARD of LVDS

23:03 <adamgreig> doesn't look like they're mentioned in any of the ecp5 specific docs either

23:03 <adamgreig> got you, cool