#milkymist on 2012-07-30 — irc logs at freenode.irclog.whitequark.org

2012-06-17 20:21 lekernel changed the topic of #milkymist to: Milkymist One, Migen, Milkymist SoC & Flickernoise :: Logs: http://en.qi-hardware.com/mmlogs :: EHSM Berlin Dec 28-30 http://ehsm.eu :: latest video http://www.youtube.com/playlist?list=PL181AAD8063FCC9DC

00:15 voidcoder has quit [Read error: Connection reset by peer]

00:15 voidcoder has joined #milkymist

00:25 xiangfu has joined #milkymist

00:30 Jia has joined #milkymist

01:02 rejon has joined #milkymist

01:07 rejon has quit [Ping timeout: 240 seconds]

01:10 rejon has joined #milkymist

01:36 rejon has quit [Ping timeout: 252 seconds]

02:48 rejon has joined #milkymist

03:53 xiangfu has quit [Ping timeout: 244 seconds]

05:08 xiangfu has joined #milkymist

05:14 xiangfu has quit [Ping timeout: 272 seconds]

05:21 jimmythehorn has joined #milkymist

05:38 xiangfu has joined #milkymist

05:39 voidcoder has quit [Read error: Connection reset by peer]

05:43 xiangfu has quit [Ping timeout: 272 seconds]

05:46 voidcoder has joined #milkymist

05:58 xiangfu has joined #milkymist

06:03 xiangfu has quit [Ping timeout: 272 seconds]

06:05 sh4rm4 has quit [Ping timeout: 276 seconds]

07:07 rejon has quit [Ping timeout: 252 seconds]

07:29 Martoni has joined #milkymist

07:32 rejon has joined #milkymist

07:42 rejon has quit [Ping timeout: 244 seconds]

07:54 rejon has joined #milkymist

08:49 kilae has joined #milkymist

08:50 xiangfu has joined #milkymist

08:55 <lekernel> https://www.coursera.org/course/vlsicad

08:56 <lekernel> A modern VLSI chip has a zillion parts -- logic, control, memory, interconnect, etc. How do we design these complex chips? Answer: CAD software tools. Learn how to build these tools in this class.

09:00 <lekernel> wolfspraul :)

09:02 rejon has quit [Ping timeout: 272 seconds]

09:02 rejon has joined #milkymist

09:04 voidcoder has quit [Remote host closed the connection]

09:05 voidcoder has joined #milkymist

09:13 rejon has quit [Ping timeout: 244 seconds]

09:17 Jia has quit [Remote host closed the connection]

09:18 Jia has joined #milkymist

09:26 rejon has joined #milkymist

09:41 jimmythehorn has quit [Read error: Connection reset by peer]

09:42 jimmythehorn has joined #milkymist

09:56 kilae_ has joined #milkymist

09:58 kilae has quit [Ping timeout: 246 seconds]

10:03 Jia has quit [Quit: Konversation terminated!]

10:09 rejon has quit [Ping timeout: 250 seconds]

10:55 mumptai_ has joined #milkymist

11:21 sh4rm4 has joined #milkymist

13:08 voidcoder has quit [Read error: No route to host]

13:13 voidcoder has joined #milkymist

13:42 mumptai_ has quit [Ping timeout: 248 seconds]

13:47 voidcoder has quit [Quit: See you next time]

13:48 voidcoder has joined #milkymist

14:11 antgreen has joined #milkymist

14:15 <wpwrak> thinking of how to overcome the LM32's slowness ... if we'd have several lm32 cores, complete with cache and tlb, and ignoring cache coherence for a moment, in M1, room for how many such cores would there be in M1 ?

14:18 <lekernel> since most software is single-threaded, you won't overcome slowness this way

14:19 <lekernel> and if you have to rewrite software to make it parallel, then you're better off designing proper hardware accelerators instead instead of introducing the CPU overhead

14:24 <wpwrak> lekernel: think concurrent but loosely related programs. or different layers working concurrently. there, you can get a speedup.

14:25 <wpwrak> so, how many cores would fit ? 2 ? 4 ? 10 ?

14:27 <lekernel> maybe 8 or so

14:27 <wpwrak> wow, great.

14:27 <lekernel> perhaps even more, but I'm not sure about the block RAM for the caches

14:29 <wpwrak> i think something around 4 may be interesting for a general-purpose workload. one for the kernel, one for the main application, one for background tasks, and one for whatever else comes along.

14:31 <lekernel> doesn't sound too good... imo the real way out of the CPU slowness is ASIC

14:31 <wpwrak> may need a bit of kernel tuning because the kernel tries to keep related things on the same cpu, assuming cycles are cheap but memory accesses (i.e., moving data accessed by one core to another) aren't. in our case, it's almost the opposite.

14:31 <wpwrak> if you have the money ... ;-)

14:32 <lekernel> well, aerospace institutes do. they're paying a lot of money for eg LEON chips.

14:32 <wpwrak> not that i'd disagree with the technical merit of having the core in a dedicated asic ...

14:33 <wpwrak> do you have any that would finance such work ?

14:33 <lekernel> and maybe those parts that don't pass space qualification could still be used elsewhere

14:33 <wpwrak> it's not only what they'd pay for the chip, but also what they'd pay to have it developed ...

14:33 <lekernel> (or don't need, since a lot of the radiation hardening stuff is in the package, not the silicon)

14:35 <lekernel> sounds much easier to me to get aerospace funding than anything else for this purpose

14:35 <wpwrak> well, if you have the contacts ...

14:36 <lekernel> you only need a few. not 11k.

14:36 <lekernel> otherwise you're running rat races like http://www.kickstarter.com/projects/joylabs/makey-makey-an-invention-kit-for-everyone vs. http://www.kickstarter.com/projects/1091976372/open-source-5-axis-cnc-router-and-plasma-machine-p

14:38 <wpwrak> the numbers for makey makey look rather happy

14:38 <lekernel> ...which is my point

14:39 <wpwrak> the monster cnc machine .. well, consider how many people would even have the room for such a monster :)

14:39 <lekernel> yes, about the same number of people who'd buy a free CPU instead of a $35 rasperry pi or similar piece of crap

14:40 <wpwrak> well, but with your aerospace contacts, you'd probably not go to kickstarter anyway

14:41 <wpwrak> and the rpi will be a victim of its own success anyway. i wouldn't worry too much about them.

14:59 <kristianpaul> lekernel: cparty, are you giving a talk about overclocking fpgas? :-)

15:06 <kristianpaul> what aditional hw besides adding the other lm32 cores to the SoC is required to get SMP?

15:07 <Fallenou> adapt wishbone code maybe to have one more master

15:09 <kristianpaul> ah well conbus said upto 8 both master and slaves...

15:12 <wpwrak> you also need to consider cache coherency. that can be done in hw or in sw, though.

15:13 <wpwrak> of course, doing it in sw can make things slow. and limits the type of tasks you can use it for.

15:25 <Fallenou> and for now lm32 caches are not doing any kind of bus snooping :(

15:27 <lekernel> be happy that since they are write-through, you only need bus snooping and not relatively complicated protocols like MSI or its variants

15:27 <wpwrak> yeah :)

15:30 <Fallenou> hehe sure

15:33 lekernel_ has joined #milkymist

15:33 lekernel has quit [Ping timeout: 272 seconds]

15:34 lekernel_ is now known as lekernel

15:40 wpwrak has quit [Remote host closed the connection]

15:43 wolfspra1l has joined #milkymist

15:45 wolfspraul has quit [Ping timeout: 250 seconds]

15:49 rejon has joined #milkymist

15:58 rz2k has joined #milkymist

15:58 rz2k has left #milkymist [#milkymist]

16:20 jimmythehorn has quit [Quit: jimmythehorn]

16:30 hypermodern has joined #milkymist

16:42 Martoni has quit [Quit: ChatZilla 0.9.88.2 [Firefox 14.0.1/20120713225625]]

17:06 rejon has quit [Ping timeout: 264 seconds]

17:06 jimmythehorn has joined #milkymist

17:36 xiangfu has quit [Ping timeout: 252 seconds]

17:48 xiangfu has joined #milkymist

17:51 xiangfu has quit [Client Quit]

18:22 voidcoder has quit [Read error: Connection reset by peer]

18:23 voidcoder has joined #milkymist

18:25 hypermodern has left #milkymist [#milkymist]

19:35 wpwrak has joined #milkymist

20:01 Gurty` has quit [Ping timeout: 265 seconds]

20:02 Gurty` has joined #milkymist

20:06 <mwalle> lekernel: i bought an rpi for me, so i'm banned now in this channel? ;)

20:07 <mwalle> but actually, i didnt use it yet, just installed an xbmc distro, took ages and didnt work in the end..

20:08 <mwalle> lm32/milkymist/qemu is better to hack on ;)

20:32 antgreen has quit [Remote host closed the connection]

20:34 <kristianpaul> better and still a long/lot to :-)

20:37 <larsc> mwalle: I have some issues with the latest qemu. Whenever I send a multi-character key (like the arrow keys) all further keypresses get delayed by the number of extra characters in a multi-character key sequence

20:38 <larsc> e.g. press left and then type "Hello" the H will appear when you press the e, the e will appear when you press the l and so on

20:38 <larsc> if i press left twice, the H appears when I press the l and so on

20:41 kilae_ has quit [Quit: ChatZilla 0.9.88.2 [Firefox 14.0.1/20120713134347]]

20:45 stekern has quit [Ping timeout: 272 seconds]

20:47 <Fallenou> mwalle: I bought one as well

20:48 <mwalle> larsc: yeah i noticed that too

20:48 <Fallenou> as I told wolfspra1l , I turned it on for 5 minutes, it booted on my TV, and then back in the box ^^

20:48 <Fallenou> "cool it boots", boxed

20:49 <mwalle> larsc: once the input buffer overflows, the ringbuffer will always be N characters 'behind'..

21:07 mumptai has joined #milkymist

22:03 jimmythehorn has quit [Read error: Connection reset by peer]

22:04 jimmythehorn has joined #milkymist

22:07 <mwalle> larsc: bug fixed in my qemu repository

22:12 <Fallenou> :)

22:31 <mwalle> Fallenou: how hard is it to add a cache inhibit bit to the tlb?

22:32 <Fallenou> a cache inhibit bit ?

22:33 <mwalle> non-cacheable

22:33 <mwalle> eg, set this bit to bypass the d/icache

22:34 <Fallenou> maybe a simple way would be to trigger a cache miss when this is is set ?

22:34 <Fallenou> so that it fetches from main memory anyway

22:34 <Fallenou> when this bit is set*

22:35 <wpwrak> that would also make sure the cache is kept in sync

22:36 <Fallenou> yes

22:36 <Fallenou> but is this happens too often the cache is like useless

22:36 <Fallenou> very suboptimal

22:36 <mumptai> and you don't need an ugly bypass

22:37 <mwalle> mh but it will replace other entries, yes..

22:37 <Fallenou> and it would just be a single ( && tlb_lookup_bypass) addition to the assign miss = line

22:37 aeris has quit [Ping timeout: 276 seconds]

22:38 <mwalle> Fallenou: there should be already some logic to bypass the cache, theres some define

22:38 <mwalle> CFG_ICACHE_LIMIT

22:38 <mwalle> CFG_DCACHE_LIMIT

22:42 <mwalle> have to go

22:42 <mwalle> gn8

22:42 <mumptai> gute nacht

22:42 aeris has joined #milkymist

22:50 <Fallenou> ah yes you're right

22:50 <Fallenou> it's even easier

23:03 <Fallenou> https://github.com/milkymist/milkymist/blob/master/cores/lm32/rtl/lm32_load_store_unit.v#L413

23:03 <Fallenou> wishbone is selected if dcache is not, if address < base or address > upper_limit

23:06 <wpwrak> so would we even need a special bit ?

23:07 <Fallenou> I don't exactly know why mwalle asked that

23:07 <Fallenou> what does he wants to do ?

23:07 <Fallenou> what did he have in mind ? :)

23:08 * Fallenou cannot locate the similar trick for icache though, wonders if icache has limit/base stuff working

23:08 <Fallenou> datasheet seems to say "yes"

23:08 <Fallenou> but cannot locate it in the code

23:08 <wpwrak> you don't need this for icache. i think he's after data access to memory-mapped devices

23:08 <Fallenou> oh, right

23:09 <Fallenou> then yes it's limit/base stuff to have memory mapped regions non cachable

23:09 voidcoder has quit [Remote host closed the connection]

23:09 voidcoder has joined #milkymist

23:12 <Fallenou> If an instruction cache is used, attempts to fetch instructions from outside of the range of cacheable addresses result in undefined behavior, so only one cached region is supported.

23:12 <Fallenou> hum ok

23:12 <Fallenou> so basically the BASE/LIMIT stuff does not work for icache :)

23:13 <Fallenou> gute Nacht :)

23:16 <wpwrak> you don't need that for icache anyway

23:18 <Fallenou> well you could want to fetch from wishbone directly in case of DMA containing code :p

23:19 <Fallenou> but then maybe it's best to just invalidate Icache

23:19 mumptai has quit [Quit: Verlassend]