#litex on 2020-05-14 — irc logs at freenode.irclog.whitequark.org

2020-02-07 11:13 _florent_ changed the topic of #litex to: LiteX FPGA SoC builder and Cores / Github : https://github.com/enjoy-digital, https://github.com/litex-hub / Logs: https://freenode.irclog.whitequark.org/litex

00:00 tpb has quit [Remote host closed the connection]

00:00 tpb has joined #litex

00:26 Skip has joined #litex

00:36 CarlFK has quit [Ping timeout: 260 seconds]

00:47 rohitksingh has quit [Ping timeout: 260 seconds]

01:05 _whitelogger has joined #litex

01:11 _franck_ has quit [Ping timeout: 256 seconds]

01:21 _franck_ has joined #litex

03:14 _whitelogger has joined #litex

03:27 Degi has quit [Ping timeout: 256 seconds]

03:27 Degi has joined #litex

03:33 Skip has quit [Remote host closed the connection]

03:56 _whitelogger has joined #litex

04:32 <kgugala> CarlFK I pasted a link where you can find prebuilt toolchains

05:02 CarlFK has joined #litex

05:13 <CarlFK> kgugala: 1. Connect to NeTV2 board using JTAG (ARM-USB-TINY-H JTAG adapter was used)

05:14 <CarlFK> I don't have a jtag anything, but I do have a pi and some jumpers

05:14 <kgugala> that should also work

05:14 <kgugala> this step is for programming the bitstream

05:16 <kgugala> if you use rpi, skip the make gateware/reload step

05:17 <kgugala> this target is for programming FPGA from host PC via jtag

05:24 <CarlFK> do I need to hook up to the jtag headers on the netv2, or is the 20 pin connector do that too?

05:31 <kgugala> I never used rpi with netv2

05:31 <kgugala> I suppose the 20 pin connector is rpi format

05:32 <kgugala> if you have Xilinx platform cable you can connect it via jtag header

05:42 <CarlFK> I don't have that either

05:43 awordnot has quit [Ping timeout: 264 seconds]

05:43 awordnot has joined #litex

06:00 rohitksingh has joined #litex

06:14 kgugala__ has joined #litex

06:14 rohitksingh has quit [Ping timeout: 244 seconds]

06:15 kgugala97 has joined #litex

06:17 kgugala has quit [Ping timeout: 256 seconds]

06:19 kgugala__ has quit [Ping timeout: 265 seconds]

06:32 rohitksingh has joined #litex

06:46 kgugala97 is now known as kgugala

07:04 <bunnie> The 20-pin connector is designed to plug into an RPi so you can use openOCD running on the Pi to talk directly to the FPGA. It also maps the UART to the RPI's UART

07:05 <bunnie> Just make sure you plug in the RPi aligned to the board correctly. A few people have been off by one row and well, it didn't end well for their hardware.

08:27 kgugala has quit [Read error: Connection reset by peer]

08:27 kgugala has joined #litex

09:06 <benh> Folks that have a long FPGA experience around here... one thing that's been bugging me with microwatt on Arty for a while...

09:06 <benh> when building with litedram (it *looks* like it's only happening in that case so far, ie, clock comes from litedram's PLL), when starting up

09:07 <benh> the messages out of the UART are garbled for a few dozen/hundred characters then are ok

09:07 <benh> if I hold the core stopped for a second or so at reset then start it, the problem goes away

09:07 <benh> everything is fine in sim

09:08 <benh> if I make the core do a loop of a few dozen thousands of dummy reads from the UART status before printing anything out it's ok

09:08 <benh> Paul scoped the UART output and the bit duration looks fine

09:08 <benh> this has been eluding us for ages

09:09 <benh> (it's not a LiteX UART, some simple "potato" uart we picked up ages ago, we'll replace it eventually, but it seems ok, I fixed a bug or two in there but nothingg so far that had any impact on that phenomenon

09:09 <kgugala> benh looks like core's reset is reasserted before the pll is locked

09:09 <kgugala> *deasserted

09:09 <benh> kgugala: the core reset comes from the soc reset which comes from the reset controller which doesn't even start countingg until the pll_locked signal is 1

09:09 <benh> kgugala: but maybe we have an obscure bug in there

09:09 <kgugala> could be

09:10 <benh> kgugala: but yeahm that was my first reaction too...

09:10 <benh> I've never managed to use chipscope successfully but I can try routing those signals to pins and use an actual scope

09:10 <benh> I can probably borrow one and find a crappy uSB one somewhere

09:15 <kgugala> the other option is that the pll is reset incorrectly. AFAIK pll has some strict reset routine

09:16 <kgugala> I mean Xilinx 7 series pll

09:17 <benh> Ok. I wonder...

09:17 <benh> _florent_: I notice arty.py creates one S7PLL

09:18 <benh> _florent_: with all the clocks out of it, including iodelay via an S7DELAYCTL

09:18 <benh> _florent_: however, litedram_gen creates 2 PLLs, one sys_pll and one just for iodelay

09:19 <benh> (without specifying a speed grade for the second one)

09:19 <benh> Now .. I dont' know that much about Artix PLLs, but I wouldnt' mind knowing if there's a rationale for this ;-)

09:20 <_florent_> benh: i'll fix the missing speedgrade on the second PLL

09:21 <benh> _florent_: so why two ?

09:21 <benh> _florent_: the LiteX standard arty.py seems to create only one ...

09:21 <benh> also, am I getting lost in migen python or is LiteX not actually using pll.locked ?

09:22 <_florent_> benh: a second PLL is used in LiteDRAM to allow more frequency steps (since it's difficult to meet generate both sys_clk/iodelay_clk from a single PLL)

09:22 <benh> (note that I didn't see a problem with the LiteX generated microwatt, only with the standalone one + litedram, so I'm looking at differences)

09:22 <benh> _florent_: ok but on the limited "scope" of an Arty, a single is enough then ?

09:23 <_florent_> benh: yes, and we could eventually try to use only one if we are able to generate a working configuration with only 1 PPLL

09:24 <benh> _florent_: ok, not a big deal for me... unless you think that could be behind some of my weird issues above ...

09:25 <benh> _florent_: iodelay is only used internally to litedram right ?

09:26 <benh> so why is it ok for LiteX to not wait for pll.locked before lifting reset ?

09:26 <_florent_> benh: if you are using IODELAY primitives, you need to have at least one IDELAYCTRL in the design

09:26 <benh> _florent_: ok, I think litedram is the only one that does in my current design

09:26 <benh> _florent_: so that's probably not related to that weird issue

09:27 <_florent_> the targets in LiteX are using the pll.locked: https://github.com/enjoy-digital/litex/blob/master/litex/soc/cores/clock.py#L97

09:27 <tpb> Title: litex/clock.py at master · enjoy-digital/litex · GitHub (at github.com)

09:28 <_florent_> benh: if you share a project i can build easily, i could investigate a bit

09:30 <benh> _florent_: I would love that but beware, it's microwatt fusesoc project in vhdl :-)

09:30 <benh> _florent_: let me try to investigate a bit more first, esp. since you probably wont' be able to regenerate litedram on it with its current sccripts until we finalize our current work and I update microwatt to match it upstream

09:31 <benh> _florent_: but I'll definitely take your offer if I draw a blank in the next few days :)

09:31 <_florent_> benh: ok, it's easier for me to investigate if i can just have an archive with the sources and small script to build the design.

09:33 <benh> _florent_: yeah ... "small script" means install/run fusesoc sadly

09:33 <benh> _florent_: at least for now ... though that's not hard to pip install it

09:33 <benh> and fusesoc generates a xilinx prj

09:34 <benh> anyway, I'll dig a bit more.

09:34 <benh> _florent_: the user_reset that comes out of litedram standalone is what I feed as reset to the rest of my loggic

09:34 <benh> _florent_: does it wait for pll_locked already ?

09:36 <benh> hrm ... actually I don't .. I must have hacked that a while ago... I use pll_locked and feed that into the reset controller

09:36 <benh> but I use it as a sync signal... I dont' have synchronizers there, I sample pll_locked on a sys_clk edgge

09:37 <benh> maybe that's wrong...

09:37 <benh> should I treat pll_locked as asynchronous ?

09:38 <_florent_> pll_locked is currently asynchronous yes, but i could make it synchronous to the sys_clk

09:40 <_florent_> benh: you could also use user_rst that is synchronous to user_clk

09:40 <benh> _florent_: is user_rst guaranteed to be only deasserted until after pll is locked ?

09:40 <benh> I'm building a test with synchronizers on pll_locked see if that makes a difference

09:41 <_florent_> user_rst is only deasserted when the pll is locked yes

09:41 <benh> we have this crappy reset controller someone wrote (I forgot whome, maybe anton) which uses a counter to delay reset release

09:41 <benh> but it doesn't have synchronizers on the main reset and pll_locked inputs

09:41 <benh> despite being a synchronous circuit

09:41 <benh> maybe it's going a bit nuts

09:42 <benh> _florent_: so I could just use user_rst as a clean synchronous source of reset then, great

09:56 <benh> Hrm...

09:56 <benh> took out our custom reset controller and just used user_rst out of litedram as the SoC reset (core reset delayed 64K clocks), and the problem still occurs

09:56 <benh> fun .. :-)

09:57 <benh> I'm even wondering whether there's a voltage drop when everything comes up... I'm powering the Arty off USB

09:57 <benh> I should try an external psu at some point

09:57 <benh> anyway, dinner time :)

10:24 <_florent_> benh: is the behavior different at the first startup after loading the bitstream and with a manual reset?

10:36 scanakci has quit [Quit: Connection closed for inactivity]

10:46 <benh> _florent_: we don't have a manual reset but I'll add one and test

10:46 <benh> _florent_: we do have a manual soc/core reset but that doesn't reset litedram (somewhat on purpose)

10:47 <benh> I'll try something later, let me first test what you committed and the system.h include on csr.h, I want to get that stuff done and dusted :)

10:49 <benh> _florent_: is that safe to assume you'll eventually merge my rework-csr-accessors ?

10:50 <benh> or rather csr-access-rework :)

10:57 <benh> _florent_: so the whole inclusion of system.h gets a bit messy in the generated csr.h

10:58 <benh> _florent_: my thinking is that in gen_csr_header, if with_access_functions, I'll just unconditionally hw/common.h, which itself will include system.h

10:58 <benh> _florent_: with my rework, hw/common.h will do the right thing cs. CSR_ACCESSORS_DEFINE

10:58 <benh> _florent_: which is to define all the "new" fancy high level ones based on the simple ones and leave the simple one to the platofrm

11:00 <_florent_> benh: i'm not the best qualified to evaluate theses changes, so if somlo, xobs are happy with it, i'm fine merging it

11:00 <xobs> Sure, okay by me.

11:05 <benh> _florent_: wait

11:05 <benh> _florent_: let me merge the system.h addition into that patch

11:05 <benh> and fix some leftover commented out code that was in there while at it

11:11 <benh> xobs: somlo: Please re-check that csr-access-rework

11:11 <benh> I've folded in the suggestion of including system.h to give the platform/cpu a chance to override CSR_BASE and CSR_ACCESSORS_DEFINED and provide inline simple accessors

11:11 <benh> I'm hoping it won't break anything but I would appreciate your eyes (and possibly testing)

11:12 <benh> _florent_: if that passes muster I think that's all I need for standalone litedram on microwatt, we're good

11:17 <benh> _florent_: on another note...

11:17 <benh> paul noticed that loading a cache line from litedram seems to take

11:17 <benh> about 15 cycles for the first read and then about 12 cycles per read (64-bit)

11:18 <benh> at the moment, my wb<->litedram bridge is a bit dumb. Each 64-bit read is a complete litedram cycle where I use either the top or bottom half of the data

11:19 <benh> now I haven't looked too closely at how I could pipeline/stream the user port there ... would it be possible to send a single read command and then pump data out of the read port multiple times ?

11:19 <benh> or it's just the speed I should expect and the best I can get might be to cache the other 64-bit of data coming in for the next access ?

11:22 <benh> I've tried but I've found myself so far unable to understand the Hw design from reading the python mygen stuff :) It's ... very hard to parse for someone not experienced

11:24 <benh> ie. do you need a CMD phase for each transfer ?

11:24 <benh> is 12 cycles per transfer of 128 bytes something expected ?

11:26 <benh> or can I pipeline N commands and separately do N data cycles to get the data ?

11:27 <benh> (if yes, how much is N practically speaking ?)

11:33 <benh> xobs: somlo: doing some more changes to that branch... I don't like that csr.h includes hw/common.h... it might make things harder for your etherbone cases...

11:39 <benh> xobs: somlo: done. Pls check (and see my response on github)

11:49 <benh> _florent_: I think I get the gist of it ... with pipelined wb I should be able to turn each read or write on the wb into a "command" to the native port

11:50 <benh> _florent_: and separately handle the data... let's assume I keep write simple for now, ie, I send a write command and write data together when both ports are ready

11:50 <benh> _florent_: that mean for reads, I can send commands on each 'stb' from pipelined wb

11:50 <benh> _florent_: and return data+ack on each valid i get on the read port completely independently

11:50 <benh> _florent_: right ?

11:51 <benh> _florent_: now, my wb is 64bit wide, the native port is 128... is there a gain in caching the last read data/address in my wrapper

11:51 <benh> _florent_: to return the "other" half since that's typically the next thing happening on a cache line refill

11:51 <benh> _florent_: or I may as well send another read command down to litedram ?

11:52 <benh> if you don't have time to respond, I'll try this week-end to build some kind of test setup in verilog with a self-initializing litedram and a micron DDR model or something like that

11:52 <benh> to experiment with

11:53 Dolu has joined #litex

12:05 <_florent_> benh: the 12cycles per transfer if because of the latency, when pipelining and in the ideal case (sequential accesses), you should be able to nearly write/read 128 bytes/cycle (at least with a DMA).

12:06 <_florent_> benh: we are currently working on an adapter for the LiteDRAM native interface that will be able to do the data width adaptation for both reads and writes

12:07 <_florent_> so you will be able to request a 64-bit port directly from LiteDRAM and data width convertion will be handled internally

12:10 <benh> _florent_: ok. In the meantime, is my undertanding of how the port work correct ?

12:10 <benh> ie, setting aside the width issue

12:10 <benh> _florent_: and ignoring writes for which I'll, for now, just wait for both cmd and write port to be ready as today

12:10 <benh> _florent_: for reads, I can send all the commands as I get the stb's from the pipelined wb

12:11 <benh> _florent_: and separately retrieve the data & send data & acks to the wb as I get the valids from the read port ?

12:11 <benh> _florent_: ie, is my undertanding correct that I always need a command per access, but I can pipeline a bunch of commands and do the data transfers from the read/write ports separately ?

12:12 <benh> (in the order the commands were done of course, which gets messy if I mix up reads and writes but is nicely suitable for a string of reads such as a cache refill)

12:14 <_florent_> yes it's correct, but there is no data buffering with the native interface, so that's possible you'll have to add Write/Read FIFOs

12:15 <_florent_> for writes, once the command is accepted, write the data to the FIFO and LiteDRAM will request it when it will be able to tranfer it on the physical interface

12:16 <_florent_> for reads, LiteDRAM is not handling the ready of the rdata stream (which would require buffering), so the data should be accepted when valid is set to 1

12:27 <benh> _florent_: ok so as a first step, writes are simple as I have the command and data as one "stb' on the wb, I can just wait for both ready to be 1 before I send it

12:27 <benh> _florent_: (for now)

12:27 <benh> _florent_: for reads, I can arrange to always be ready.. the way pipeline wb works, I am in a cyc=1 cycle, I can always send ack+data back if I had commands

12:28 <benh> _florent_: the master won't send commands if it can't handle the data

12:28 <benh> _florent_: the only issue is the 64-bit vs. 128-bit but I can trivially handle that with a small latch until you have that sorted on your side

12:28 <benh> _florent_: improving writes might require a fifo indeed

12:31 <_florent_> benh: it will probably not work for writes, since the command will be acked before the write, so you need to be sure you won't send additional commands while waiting for the write to be acked.

12:32 <benh> _florent_: right, that's what I'm doing now aleady, it's not fast but works

12:32 <benh> _florent_: my plan is to eventually turn our BRAM into an L2 (with an option to keep it linearly mapped so we can still use it as boot "firmware")

12:33 <benh> _florent_: at which point I'll probably pipeline both reads and writes as whole cachelines with a direct 128-byte bus between litedram and L2

12:33 <benh> but right now it's more than my spare time can cope with, maybe in the next few weeks...

12:34 <benh> first, once you've merged all that csr gunk and I've pushed the microwatt updates to anton, I want to gggo back to baby sitting your LiteX/microwatt intgration

12:34 <benh> see if I can get the external interrupt going

12:34 <benh> etc...

12:47 <_florent_> that would be nice, i should spend time finishing the ghdl-synth/verilator support to ease this work (this would also help for litedram integration)

13:16 <benh> definitely

13:56 <Finde> benh: has anyone tried running the microwatt RTL through commercial tools? I tried to use vhdlan from vcs yesterday and it was really complaining a lot

14:01 <somlo> benh, _florent_: fwiw, commit 1e35b0e7 (still) works OK for me :)

15:43 scanakci has joined #litex

15:51 shuffle2 has quit [Quit: WeeChat 2.6]

16:39 acathla has quit [Quit: segfault]

16:39 lambda has quit [Quit: WeeChat 2.8]

16:41 lambda has joined #litex

16:41 Skip has joined #litex

16:53 acathla has joined #litex

16:53 Skip has quit [Remote host closed the connection]

17:24 Dolu has quit [Quit: Leaving]

17:34 kgugala__ has joined #litex

17:37 kgugala has quit [Ping timeout: 256 seconds]

18:08 Skip has joined #litex

18:09 _franck_8 has joined #litex

18:11 _franck_8 has quit [Client Quit]

18:11 _franck_ has quit [Ping timeout: 265 seconds]

18:18 fjullien has joined #litex

18:19 fjullien is now known as _franck_

18:30 acathla has joined #litex

18:30 acathla has quit [Changing host]

18:46 <CarlFK> bunnie: thanks.

19:09 kgugala has joined #litex

19:11 kgugala__ has quit [Ping timeout: 265 seconds]

20:01 darren099 has quit [Quit: Leaving]

20:21 Skip has quit [Remote host closed the connection]

21:40 <benh> Finde: Vivado for sure ;-) I think Mikey tried at some point a commercial tool for verilog conversion and it was .. painful

21:40 <benh> Finde: it looks like VHDL 2008 support in tools is somewhat flawky

21:40 <benh> somlo: great thanks !

22:01 <CarlFK> kgugala: http://paste.ubuntu.com/p/bV2nB8mysr/ application-specific initialization failed: couldn't load file "librdi_commontasks.so": libtinfo.so.5: cannot open shared object file: No such file or directory

22:01 <tpb> Title: Ubuntu Pastebin (at paste.ubuntu.com)

22:01 <Finde> benh: I submitted a PR for some issues I saw trying to run the core through VCS

22:06 <benh> I tried to use some more advanced VHDL features recently to define my wishbone as a record with unspecified address/data width

22:06 <benh> and use subtype specialisation in places

22:07 <benh> and/or generic types

22:07 <benh> these are quite nice features of vhdl 08 for making things a bit parametric... except they work with no tool

22:08 <benh> at least ghdl bugs get fixed quickly... Vivado bugs, you get CRin and you hope your grandchildren will have a fix

22:08 CarlFK has quit [Ping timeout: 256 seconds]

22:10 <benh> Finde: can you ammend your commit to add a DCO ? (ie. Signed-off-by:...)

22:10 <benh> Finde: I don't think Anton will merge anything without one bcs lawers

22:11 CarlFK has joined #litex

22:31 <CarlFK> kgugala: make clean does something alarming: ssh antmicro@10.0.0.2 "sudo rm /lib/modules/\`uname -r\`/litepcie.ko"

22:41 <CarlFK> fixed the No such file with sudo ln -s libtinfo.so.6 libtinfo.so.5

23:57 Skip has joined #litex