#milkymist on 2011-11-29 — irc logs at freenode.irclog.whitequark.org

2011-11-28 00:09 Topic for #milkymist is now Milkymist One, Milkymist SoC & Flickernoise development channel (LLHDL/Antares are welcome too) :: Logs: http://en.qi-hardware.com/mmlogs :: JFDI

00:58 aw_ joined #milkymist

00:58 aw joined #milkymist

01:16 wolfspraul joined #milkymist

01:31 xiangfu joined #milkymist

01:33 aw_ joined #milkymist

01:33 Guest42658 joined #milkymist

02:00 errordeveloper joined #milkymist

03:14 wolfspraul joined #milkymist

04:25 Gurty joined #milkymist

04:59 rejon joined #milkymist

06:19 <GitHub187> [scripts] xiangfu pushed 2 new commits to master: http://git.io/vmKSDw

06:19 <GitHub187> [scripts/master] reflash_m1.sh snapshot: don't flash data by default - Xiangfu Liu

06:19 <GitHub187> [scripts/master] update the power-on message - Xiangfu Liu

06:50 wolfspraul joined #milkymist

07:12 xiangfu joined #milkymist

07:13 aw joined #milkymist

07:13 mumptai joined #milkymist

07:13 aw_ joined #milkymist

07:43 Martoni joined #milkymist

07:52 lekernel_ joined #milkymist

08:02 nightlybuild joined #milkymist

08:19 <qi-bot> The Firmware build was successfull, see images here: http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-11292011-0735/

08:34 azonenberg joined #milkymist

09:11 <GitHub134> [flickernoise] sbourdeauducq pushed 1 new commit to master: http://git.io/JI_FKw

09:11 <GitHub134> [flickernoise/master] performance: fix unmapped key handling - Sebastien Bourdeauducq

09:37 <azonenberg> lekernel_: Any experience working with multiprocessor softcores?

09:38 <azonenberg> I'm particularly interested in the interconnect

09:39 <azonenberg> in terms of cache coherency and how multiple processors share the bus

09:39 <azonenberg> i'm working on a triple-core SoC from scratch

09:39 <azonenberg> and am designing the interconnect fabric now

09:39 <azonenberg> i'm using a shared bus (only one core can talk at a time, but it's full duplex)

09:40 <lekernel_> what's at the other end of the shared bus? DRAM?

09:40 <lekernel_> also, softcores are slow. why not use dedicated accelerators?

09:40 <azonenberg> A fixed-mapping MMU

09:40 <azonenberg> That splits the address bus between memory mapped IO and DDR2

09:40 <azonenberg> the DDR2 has an L2 cache in front of it

09:40 <lekernel_> MMU? you mean address decoder?

09:40 <azonenberg> each core has its own dedicated L1

09:41 <azonenberg> Basically, yes

09:41 <azonenberg> hardwired mapping

09:41 <azonenberg> The L1 is gonig to be structured in such a way as to be a passthrough for the IO address range and cache DRAM and flash addresses

09:41 <azonenberg> then DRAM and flash will have their own SoC-wide L2 caches

09:42 <azonenberg> I know i'm reinventing the wheel a bit, its mostly an educational exercise

09:42 * azonenberg is writing a dissertation on computer architecture soon and wants to sharpen his skills first

09:42 <azonenberg> But its actually going to be quite fast

09:42 <azonenberg> on spartan6 -2 speed i am shooting for 200 MHz

09:43 <azonenberg> * 2-way superscalar

09:43 <azonenberg> = 800 mflops for 2 cores

09:44 <azonenberg> I had to pipeline the heck out of it, but its looking feasible

09:44 <lekernel_> until you get timing paths into the bus arbiter? :)

09:46 <azonenberg> Actually, the bus arbiter is looking just fine

09:46 <azonenberg> i just did a standalone test of it at 200 mhz and it works just fine

09:46 <azonenberg> on hardware

09:47 <azonenberg> My solution to this thing is, pipeline it like crazy

09:47 <azonenberg> its a barrel processor

09:47 <lekernel_> with all the cores and memory controller connected to it?

09:47 <azonenberg> so a 16 stage pipeline means zero latency

09:47 <azonenberg> and 32 stages means one stall

09:47 <azonenberg> i run 16 threads and context switch every clock

09:47 <lekernel_> ah, i see

09:47 <azonenberg> Right now its looking like when running out of L1 cache with a 16 stage pipeline i will have no stalls

09:47 <azonenberg> despite not having any forwarding whatsoever

09:47 <azonenberg> an L1 cache miss that hits in L2 will most likely stall one instruction

09:48 <azonenberg> if i can fit the L1=>L2 and back in 16 clocks

09:48 <azonenberg> or 2 instructions if it takes me 32

09:48 <azonenberg> as long as i can keep the entire bus structure pipelined

09:48 <azonenberg> this is a very GPU-esque architecture

09:48 <azonenberg> hiding latency by multithreading

09:48 <lekernel_> what about cache miss rates when you have 16 threads switching so fast?

09:48 <azonenberg> I envision it being something like CUDA, each thread executing mostly the same instructions

09:48 <azonenberg> But they can branch as they see fit'

09:49 <azonenberg> The entire architecture is mostly an experiment

09:49 <lekernel_> you should compile dedicated hardware accelerators ...

09:49 <azonenberg> you mean, ASIC level?

09:49 <lekernel_> adding layers over layers makes things slow

09:49 <azonenberg> Sure, go get me $30K and i'll get it fabbed in MOSIS :p

09:49 <lekernel_> yes, generate VHDL from CUDA directly

09:49 <azonenberg> and no, this is mostly an educational exercise

09:49 <lekernel_> no, I mean use the FPGA fabric directly

09:49 <azonenberg> The goal is to see how many flops i can pull out of a softcore CPU

09:50 <azonenberg> running real code

09:50 <lekernel_> softcores are only good to run housekeeping or legacy software

09:50 <azonenberg> also i have a project in mind that will involve me working with non-hardware people

09:50 <azonenberg> I have dedicated accelerators for stuff like JPEG encoding that i'm working on

09:50 <azonenberg> But the flight control code has to be in C

09:50 <azonenberg> or C++

09:50 <azonenberg> or assembly

09:51 <azonenberg> since i am working with CS people who dont knowh hardware

09:51 <azonenberg> So i want to design a nice powerful architecture for them to run it on

09:51 <azonenberg> the other motivation as i said is just cutting my teeth on computer architecture

09:51 <azonenberg> this is not something i envision being a softcore forever, but custom ASICs are not cheap

09:52 <azonenberg> if things go well and it works as planned i might try sending it out to mosis eventually

09:52 <azonenberg> i would love to have a laptop running a CPU i designed

09:52 <azonenberg> in 180nm TSMC or something

09:52 <azonenberg> But i'm not that advanced yet :p

09:53 <azonenberg> I read your post about the latticemico32 synthesis lol

09:53 <azonenberg> and i think my processor will be faster

09:54 <azonenberg> But i'd have to reimplement some of the xilinx hard IP cores like the memory controller

09:54 <azonenberg> and their soft FPU

09:54 <azonenberg> I'm pretty sure i can write a better FPU but i havent gotten around to it yet, and as long as it's interface-compatible with theirs it'd be a drop-in replacement

09:56 <lekernel_> their soft fpu? what's that?

09:56 <lekernel_> you're using coregen for a fpu?

09:56 <azonenberg> Yes, for now

09:56 <azonenberg> i wanted to focus on the datapath and interconnect first

09:56 <azonenberg> then go and write myself an FPU when i had all of the surrounding stuff done

09:56 <azonenberg> in the meantime i have theirs because it tells me an FPU of that size and speed is possible

09:56 <azonenberg> iow, setting a lower bound

09:56 <azonenberg> then i can try and outperform it with an open one

09:57 <azonenberg> Coregen lets you generate floating point add/sub, multiply, divide, and sqrt units separately

09:57 <azonenberg> So i'll replace them with my own one by one

09:58 <azonenberg> But again the focus for now is on the datapath and microarchitecture more than implementation

09:59 <lekernel_> you can use the milkymist pfpu pipelines btw ...

10:00 <azonenberg> The goal here is to practice efficient pipelined architecture

10:00 <azonenberg> So i want to use as little premade code as possible

10:00 <azonenberg> like i said i'm doing a thesis on computer architecture soon and i want practice

10:01 <lekernel_> but you reused the coregen pipelines already :-)

10:01 <azonenberg> Temporarily, so i could build the other stuff around them

10:01 <azonenberg> its not expected to stay

10:01 <azonenberg> if i had used a free one i'd have less incentive to replace it :p

10:04 <lekernel_> so that's what I get for developing free hardware ...

10:04 <azonenberg> production project? Sure

10:04 <azonenberg> But for educational value sometimes its better to reimplement

10:05 <azonenberg> Once i build mine, i'll compare it to yours and any other open ones i find

10:05 <azonenberg> and use the best one in real projects

10:18 Gurty joined #milkymist

10:51 Thihi_ joined #milkymist

10:56 Thihi joined #milkymist

11:20 Thihi_ joined #milkymist

11:28 <qi-bot> The Firmware build was successfull, see images here: http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-11292011-1026/

11:36 Thihi joined #milkymist

11:59 aw_ joined #milkymist

11:59 aw joined #milkymist

12:22 rejon joined #milkymist

12:38 <GitHub122> [flickernoise] sbourdeauducq pushed 5 new commits to master: http://git.io/va42-g

12:38 <GitHub122> [flickernoise/master] Do not create ramdisk folder - Sebastien Bourdeauducq

12:38 <GitHub122> [flickernoise/master] filedialog: lock in ssd - Sebastien Bourdeauducq

12:38 <GitHub122> [flickernoise/master] filedialog: prevent slash in filenames - Sebastien Bourdeauducq

12:45 gbraad joined #milkymist

13:17 Martoni joined #milkymist

13:38 sh4rm4 joined #milkymist

13:44 xiangfu joined #milkymist

13:49 Gurty joined #milkymist

13:52 <GitHub168> [flickernoise] sbourdeauducq pushed 1 new commit to master: http://git.io/TnE0CQ

13:52 <GitHub168> [flickernoise/master] shutdown: rename button - Sebastien Bourdeauducq

13:59 <GitHub120> [flickernoise] sbourdeauducq pushed 1 new commit to master: http://git.io/qJvhUw

13:59 <GitHub120> [flickernoise/master] png: enable loading of RGBA images - Sebastien Bourdeauducq

14:06 xiangfu joined #milkymist

14:09 r33p joined #milkymist

14:18 Martoni joined #milkymist

14:25 <qi-bot> The Firmware build was successfull, see images here: http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-11292011-1343/

14:30 wolfspraul joined #milkymist

14:30 <GitHub36> [flickernoise] sbourdeauducq pushed 1 new commit to master: http://git.io/ND-mFA

14:30 <GitHub36> [flickernoise/master] New patch - Sebastien Bourdeauducq

14:31 DJTachyon joined #milkymist

14:36 r33p joined #milkymist

14:36 azonenberg joined #milkymist

14:59 zer1her1 joined #milkymist

15:03 Martoni joined #milkymist

15:13 Gurty joined #milkymist

15:22 wolfspraul joined #milkymist

15:27 wolfspraul joined #milkymist

15:43 wolfspraul joined #milkymist

16:11 <xiangfu> Hi

16:12 <xiangfu> what is the different between MicroBlaze and LM32.

16:12 <xiangfu> is that same thing in one SOC system. on LM32 is open but MicroBlaze?

16:14 <xiangfu> s/on/only

16:15 <wpwrak> kinda like MIPS vs. ARM. same purpose, different origin, different style, etc.

16:16 <xiangfu> wpwrak, got it.

16:18 <qi-bot> The Firmware build was successfull, see images here: http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-11292011-1535/

16:21 wolfspraul joined #milkymist

16:22 <lekernel_> http://www.milkymist.org/flickernoise.html

16:22 <lekernel_> new screenshots

16:33 <kristianpaul> MMM... :-)

16:34 <kristianpaul> too much zoomed effects i think

16:38 <wpwrak> bah. the end of the year is nearing. fireworks !! :)

16:38 wolfspraul joined #milkymist

16:38 <kristianpaul> :-)

16:38 <kristianpaul> yeah , fireworks are nice

16:39 <lekernel_> kristianpaul: if you design new patches that look better, there's no reason I would refuse them...

16:40 * wpwrak is amazed by how well USB can work even though he completely misunderstood the handshake between fpga and navre ...

16:40 <wpwrak> let's see if anything still works after fixing that

16:40 <lekernel_> ?

16:41 <wpwrak> i thought the SYNC would also set rx_pending ...

16:41 <lekernel_> no, rx_pending is only set after the first byte is completely received

16:41 <wpwrak> (but i never tried to retrieve it. sometimes, two wrongs make an almost right :)

16:41 <lekernel_> but it doesn't make much change, does it?

16:42 <lekernel_> (I mean the first byte of "payload" after the sync, ofc)

16:42 <wpwrak> yeah, just means that my loop was a little late

16:43 <wpwrak> and unnecessarily complicated, too

16:44 <kristianpaul> lekernel: i dont wanted to mean that, i just a comment (from what i like) no rush :-)

16:45 <kristianpaul> and no i dont imaging designing patches soon

16:46 r333p joined #milkymist

17:12 Alarm joined #milkymist

17:38 <Alarm> What is the best way to load the latest binary M1.?

17:38 <lekernel> Alarm: as I said, web update

17:38 <lekernel> http://www.xilinx.com/about/customer-innovation/index.htm

17:41 <Alarm> no with the jtag ?

17:41 <lekernel> no, JTAG is for developers

17:42 <lekernel> and generally slower and harder to use than the web update if you just want a release upgrade

17:43 <lekernel> http://www.linux-kvm.org/wiki/images/1/1f/2011-forum-usb.pdf "Remove funky (ab-)use of the usb devices in bluetooth and milkymist." wtf?

17:48 <kristianpaul> lekernel: nice !!!)

17:48 <kristianpaul> ""

17:48 <kristianpaul> Once an application for custom ASIC cores, this demanding computer graphics process is now the province of low-cost FPGAs.

17:49 <Alarm> The problem is to download the latest version I'm using wget but it's not great for a set of files

17:50 <lekernel> the M1 downloads the latest version itself

17:50 <lekernel> just connect it to your internet router ...

17:59 <wpwrak> lekernel: (ab-use) what on earth is that presentation about anyway ?

18:01 <lekernel> USB in QEMU it seems

18:01 <lekernel> but I asked myself the same question for a while ;)

18:01 Alarm joined #milkymist

18:01 <wpwrak> ;-))

18:01 <Alarm> I want to do the update by the jtag for pedagogic reasons. The method "WebUpdate" has no interest for me

18:03 <Alarm> my problem is basic. I am looking for a simple command to download binaries

18:04 <Alarm> "wget-r" aspire all files

18:16 DJTachyon joined #milkymist

18:39 Gurty` joined #milkymist

18:39 mumptai joined #milkymist

18:51 * lekernel is giving orcc a try. of course, hundreds of MB of java bloat to install ...

19:23 Alarm_ joined #milkymist

19:24 mumptai joined #milkymist

19:55 Alarm joined #milkymist

20:06 errordeveloper joined #milkymist

20:13 juliusb joined #milkymist

20:56 mumptai joined #milkymist

21:03 <kristianpaul> some comments from a friend "you can get video switch for 8usd, but mixer.. as minimun do fading from one picture to another"

21:04 <kristianpaul> and please dont be angry with me for posting this, i'm just replying comments

21:05 <lekernel> the M1 isn't a video switch or mixer. the switch functionality is just a little add-on. you can also get an arduino led blinker for $25 which can do the same as the front panel LEDs on the M1... same kind of stupid comparison

21:05 <wpwrak> mixer may be tricky: you need two codecs for that

21:05 <wpwrak> and i'm not sure if the chip we use has multiple codecs inside

21:06 <lekernel> it does not

21:06 <lekernel> M1 was never intended as a video mixer

21:06 <kristianpaul> i'm very exited to bug other friends about M1/FN new features also bring back some feedback

21:06 <kristianpaul> sure not

21:08 <lekernel> the main feature of this software update is image support - and stress that it can be used with MIDI controllers. the rest is secondary.

21:08 juliusb_ joined #milkymist

21:11 <kristianpaul> sure sure

21:12 <kristianpaul> and for you hapiness he really likes the pacman video from wpwrak

21:12 <wpwrak> and one more device enumerates :)

21:13 <wpwrak> hehe ;-)

21:13 <wpwrak> we need a few more images per patch. then we can have real games :)

21:14 <kristianpaul> wee :)

21:14 <wpwrak> C64 retro style :)

21:14 <wpwrak> of course, the LV3 is still mute. that one's a tough cookie

21:22 juliusb joined #milkymist

21:33 antgreen joined #milkymist

21:45 <wpwrak> stekern: the latest patch set may also fix the low-speed regression you experienced.

21:46 <wpwrak> stekern: at least it removes quite a bit of confusion i had added before :)

21:58 <stekern> wpwrak: cool, do you keep those patches in a git repo somewhere?

21:59 <wpwrak> only locally

22:00 <stekern> ok, well, lekernel seems to be quite quick to apply them anyways

22:01 <stekern> I need to sign up on the ML

22:01 <wpwrak> yeah. he probably has his alarm clock connected to "grep PATCH" :)

22:50 <mwalle> lekernel: (usb abuse) thats qemu and it used the hid layer in a strange way

22:50 <mwalle> gerd and i fixed that some time ago ;)