#m-labs on 2014-04-17 — irc logs at freenode.irclog.whitequark.org

2013-12-11 12:34 lekernel changed the topic of #m-labs to: Mixxeo, Migen, MiSoC & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

01:21 ohama has quit [Disconnected by services]

01:22 ohama has joined #m-labs

02:55 nicksydney has quit [Remote host closed the connection]

06:20 stekern has quit [Read error: Connection reset by peer]

06:23 stekern has joined #m-labs

07:50 sb0 has joined #m-labs

07:58 <ysionneau> mwalle: in NetBSD generic tlb code, ASID 0 is only for kernel, so he is proposing that a tlb entry whose ASID is 0 would always match if we are in kernel mode (PSW.USR == 0)

08:01 <ysionneau> then you can directly change the ASID while still executing a bit of kernel code

08:02 <ysionneau> then I am just wondering how to handle tlb misses for kernel pages when the asid is switched and the page table is switched as well ...

08:04 nicksydney has joined #m-labs

08:07 Alain_ has joined #m-labs

09:51 zeiris_ has quit [Ping timeout: 240 seconds]

10:17 zeiris has joined #m-labs

11:34 <ysionneau> sb0: wow, seems like a big piece of work

11:34 <ysionneau> (github link)

11:38 <sb0> yeah, his blog is quite interesting too

11:43 <ysionneau> oh and the kickstarter project is not bad also

11:43 <ysionneau> cool to see at least $13k

11:44 <ysionneau> ah but it's already failed

11:44 <ysionneau> too bad :(

11:55 <sb0> https://github.com/Florent-Kermarrec/misoc-de0nano

11:55 <sb0> port to altera board using the new build system

12:03 sb0 has quit [Ping timeout: 245 seconds]

12:16 sb0 has joined #m-labs

12:58 <ysionneau> nice, the port is pretty small with this new build system

14:11 Alain_ has quit [Remote host closed the connection]

14:12 <mwalle> ysionneau: i see, so if USR=0 and ITLB=1 then ASID is always 0, regardless whats written to the ASID field

14:14 <mwalle> but USR=1 ITLB=1 and ASID=0 is also valid, right?

14:14 <mwalle> eg that would be user space program running with asid 0

14:15 <mwalle> (of course not in netbsd but maybe some standalone stuff)

14:27 <ysionneau> 16:12 < mwalle> ysionneau: i see, so if USR=0 and ITLB=1 then ASID is always 0, regardless whats written to the ASID field < we could see it that way, but I was more seeing it like "even if asid is not 0, tlb entries with asid 0 match anyway, if USR==0 and *TLB==1"

14:27 <ysionneau> so that would mean that if for instance current ASID is 1

14:27 <ysionneau> tlb entries with asid 1 would match as well

14:28 <ysionneau> 16:14 < mwalle> but USR=1 ITLB=1 and ASID=0 is also valid, right? < it's valid, but NetBSD will not do it, but yes I think it should stay valid
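
The matching rule proposed above can be sketched as a small predicate. This is only an illustration of the idea under discussion, not the LM32 MMU implementation; the function and parameter names are hypothetical.

```python
def tlb_entry_matches(entry_asid, current_asid, psw_usr, tlb_enabled=True):
    """Return True if a TLB entry's ASID tag is accepted for this lookup.

    Hypothetical model of the proposal: an entry tagged with the current
    ASID always matches, and an entry tagged with ASID 0 also matches
    when the CPU is in kernel mode (PSW.USR == 0).
    """
    if not tlb_enabled:
        return False
    if entry_asid == current_asid:
        return True
    # Proposed special case: ASID 0 entries stay visible to the kernel.
    return entry_asid == 0 and psw_usr == 0
```

With current ASID 1 in kernel mode, entries tagged 0 and entries tagged 1 both match, which is the "two ASID matches" situation questioned later in the log.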

14:34 <ysionneau> mwalle: about qemu, when you are in the

14:34 <ysionneau> translate.c (for instance dec_b)

14:34 <ysionneau> you cannot do "flush_tlb()" directly, right?

14:34 <ysionneau> you need to generate code that flushes the tlb

14:34 <ysionneau> and then you need to wrap it inside a helper, right?

14:37 <ysionneau> I guess that's why you used gen_helper_wcsr_psw () inside the dec_wcsr() in translate.c

14:45 <mwalle> (two asid matches) mhh, is this a good idea?

14:46 <ysionneau> honestly, I don't know

14:46 <ysionneau> but it just seems natural, at first, that when current asid is "1" then it is matched in tlb

14:48 <ysionneau> and it seems that it would simplify copyout code as Matt said

14:48 <ysionneau> copyout is just copyout(void *uaddr, void *kaddr, size_t len)

14:48 <ysionneau> you copy from kernel address space to user address space

14:49 <ysionneau> mwalle: I guess it's not an issue as long as user and kernel spaces are disjoint

14:50 <ysionneau> like [0->MAX_USER] ; [MIN_KERNEL ; 0xffff.ffff]

14:50 <ysionneau> like linux 3G/1G

14:54 <mwalle> and if they are not?

14:54 <mwalle> eg. the worst case :)

14:55 <mwalle> translate.c: this is only executed on code translation

14:56 <mwalle> eg in the best case only once

14:57 <mwalle> so if you do a flush_tlb() there this will only be executed during translation time

14:57 <ysionneau> ok so that's not what I wanted to do :p

14:57 <mwalle> translation time: the time when the binary is translated to intermediate operation which will then be executed on the host

14:57 <ysionneau> for instance if we switch ASID upon eret, we want to flush tlb upon eret

14:58 <ysionneau> but not during "translation" of an eret

14:58 <ysionneau> so I was right about needing to wrap the tlb_flush inside a "helper" ?
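
The difference between translation time and execution time can be shown with a toy translator. This is a sketch of the general idea only and does not use the QEMU API; ToyCPU, translate, execute and helper_tlb_flush are hypothetical names.

```python
class ToyCPU:
    """Minimal stand-in for an emulated CPU with a translation cache."""

    def __init__(self):
        self.tlb_flushes = 0
        self.cache = {}  # pc -> list of callables (the "translated block")


def helper_tlb_flush(cpu):
    # Runs every time the translated code is executed.
    cpu.tlb_flushes += 1


def translate(cpu, pc, insn, log):
    # Anything done directly here happens once, at translation time.
    log.append("translate " + insn)
    ops = []
    if insn == "eret":
        # Emit a call to the helper instead of flushing right now.
        ops.append(helper_tlb_flush)
    return ops


def execute(cpu, pc, insn, log):
    # Translate only on a cache miss, then run the cached operations.
    if pc not in cpu.cache:
        cpu.cache[pc] = translate(cpu, pc, insn, log)
    for op in cpu.cache[pc]:
        op(cpu)
```

Executing the same eret three times translates it once but calls the helper three times, which is why a flush placed directly in the translator would not do what is wanted.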

14:59 <mwalle> do we want this to be implicit on eret/bret or may it also be explicitly done with wcsr?

14:59 <ysionneau> what do you mean?

14:59 <mwalle> mhh, flush_tlb() means actually flush the tlb on the (actual) hardware, right?

14:59 <mwalle> erm

15:00 <mwalle> actual=emulated

15:00 <ysionneau> no, flushing the qemu tlb

15:00 <mwalle> ah

15:00 <ysionneau> in hw no need to flush tlb when switching asid, it's the whole purpose of asid

15:00 <mwalle> sorry, then, forgot what i was saying ;)

15:00 <ysionneau> ok :)

15:00 <ysionneau> sorry I must have badly explained

15:01 <mwalle> nah i have to refresh my qemu wisdom, i guess ;)

15:02 <ysionneau> ah, it's tlb_flush(env, 1), not flush_tlb

15:04 <ysionneau> maybe that will ring a bell :p

15:05 <mwalle> nah :b

15:06 <ysionneau> ok, it was just to make sure what I understood was OK

15:06 <ysionneau> but after what you described about the translation process I think it's ok :)

15:06 <ysionneau> thanks for the details

15:07 <ysionneau> 16:54 < mwalle> and if they are not? < if they are not, then ... the user space mapping can be used instead of the kernel one :x

15:07 <ysionneau> and it can start to get messy

15:08 <ysionneau> I have the feeling that all of this would be much less trouble prone if we would just add a "global" bit in tlb entries

15:08 <ysionneau> and not treat ASID 0 in any special manner

15:08 <ysionneau> so that if you know that your user space/kernel space are disjoint, you can always set the Global bit on kernel tlb entries

15:10 <ysionneau> and then ... maybe indeed switch only asid upon eret with the usual latching (BASID, EASID)

15:11 <ysionneau> well, switching asid upon eret would be enough to solve our problem in fact

15:13 <ysionneau> that would mean that the kernel will need to map user space pages to its own address space upon copyin / copyout but that's not so big an issue

15:13 <mwalle> btw the index to the CAM is still the lower address bits, right?

15:14 <ysionneau> we don't use a CAM but yes, it's the lowest address bits that are not in the page offset part
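
Indexing a block-RAM TLB with the lowest address bits above the page offset can be sketched as follows. The page size and entry count are assumptions chosen for illustration; the log does not state them.

```python
PAGE_SHIFT = 12      # assumption: 4 KiB pages
TLB_ENTRIES = 1024   # assumption: power-of-two number of entries
INDEX_BITS = TLB_ENTRIES.bit_length() - 1


def tlb_index(vaddr):
    """Lowest virtual address bits above the page offset select the entry."""
    return (vaddr >> PAGE_SHIFT) & (TLB_ENTRIES - 1)


def tlb_tag(vaddr):
    """Remaining upper bits are stored in the entry and compared on lookup."""
    return vaddr >> (PAGE_SHIFT + INDEX_BITS)
```

Two addresses with the same index but different tags compete for the same entry, which is what separates this direct-mapped layout from a real CAM.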

15:15 <mwalle> well isnt your block ram sth like a CAM? :)

15:15 <mwalle> yes it is not really a cam

15:15 <ysionneau> well I guess we can say it's a very special case of CAM

15:16 <ysionneau> where it's index addressed

15:16 <ysionneau> nevermind

15:16 <mwalle> but the global bit would have the same "there might be multiple matches" problem, right?

15:17 <mwalle> only that the user is in control of it

15:17 <ysionneau> yes, but in this case, the user controls it

15:17 <mwalle> which is fine

15:17 <ysionneau> so the user knows what he is doing

15:17 <mwalle> yeah

15:17 <ysionneau> I would prefer ASID0 to stay generic enough

15:17 <ysionneau> not to bind the MMU to some OS

15:18 <mwalle> me too

15:18 <mwalle> but instead of using a global bit for each entry

15:18 <mwalle> we could use just one

15:18 <mwalle> which says ASID0 is 'global'

15:18 <ysionneau> good idea :)

15:19 <ysionneau> a 5 bit register somewhere, where you can put the ID of the global asid

15:19 <ysionneau> then we take for granted we only want one ASID to be global

15:19 <ysionneau> and that we don't have global entries among several asids

15:19 <mwalle> or just pin it to ASID0

15:19 <mwalle> or does that need to change at runtime

15:19 <mwalle> ?

15:20 <ysionneau> I don't know where to put the threshold between genericity and hardware optimization choices :p

15:20 <ysionneau> for NetBSD , just having ASID0 global is fine

15:20 <ysionneau> but we could imagine some other application having different needs

15:20 <ysionneau> dunno

15:20 <ysionneau> but I like the idea of saving RAM

15:21 nicksydney has quit [Remote host closed the connection]

15:21 nicksydney has joined #m-labs

15:22 <ysionneau> on the other hand, by using "ASID0 global or not controlled by one bit" the code would be like

15:23 <ysionneau> if ( ( current_asid == tlbe[ASID] ) OR ( tlbe[ASID] == 5'b0 and asid0_global ) )

15:24 <ysionneau> by using just a global bit it becomes

15:24 <ysionneau> if ( ( current_asid == tlbe[ASID] ) OR ( tlbe[g] ) )

15:24 <ysionneau> so there is one less 5-bit comparator in the path

15:24 <ysionneau> don't know if it is important or not

15:26 <mwalle> well actually, asid0 is global and one global bit for every entry is the same, isnt it?

15:26 <mwalle> because if the global bit is set, the asid doesnt matter anymore

15:27 <mwalle> so you could say, your asid range is 1..0x1f and asid = 0 is global bit

15:28 <mwalle> so having an unconditional asid0 seems equal to having global bits to me
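
The equivalence argued here can be checked exhaustively in a few lines. This is a sketch under the assumption that processes only use ASIDs 1..0x1f and that a global entry is encoded by storing ASID 0; the function names are hypothetical.

```python
ASID_BITS = 5  # the log mentions a 5-bit ASID


def match_global_bit(current_asid, entry_asid, entry_global):
    """Scheme with a per-entry global bit."""
    return current_asid == entry_asid or entry_global


def match_asid0_global(current_asid, entry_asid):
    """Scheme where entries tagged with ASID 0 match unconditionally."""
    return current_asid == entry_asid or entry_asid == 0


def schemes_agree():
    """Compare both schemes for every non-zero current ASID and entry."""
    user_asids = range(1, 1 << ASID_BITS)
    for current in user_asids:
        for asid in user_asids:
            for is_global in (False, True):
                # A global entry is stored with ASID 0 in the second scheme.
                encoded = 0 if is_global else asid
                if match_global_bit(current, asid, is_global) != \
                        match_asid0_global(current, encoded):
                    return False
    return True
```

The check leaves out any PSW.USR condition, so it only covers the unconditional variant of ASID 0 matching.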

15:29 <mwalle> if you agree, i'd prefer the unconditional asid0 maching

15:29 <mwalle> matching

15:29 <mwalle> bbl

15:29 <ysionneau> so, asid0 would be reserved for global mappings?

15:30 <ysionneau> only if PSW.USR == 0

16:24 FabM has quit [Quit: ChatZilla 0.9.90.1 [Iceweasel 24.4.0/20140319080549]]

17:40 <GitHub161> [misoc] sbourdeauducq pushed 2 new commits to master: http://git.io/LGppdA

17:40 <GitHub161> misoc/master 97311fc Florent Kermarrec: make: add clean action

17:40 <GitHub161> misoc/master 2fca8d4 Florent Kermarrec: programmer: add USBBlaster and use platform.bitstream_ext in make

17:48 <GitHub176> [misoc] sbourdeauducq pushed 2 new commits to master: http://git.io/kLGmlA

17:48 <GitHub176> misoc/master 41c35e7 Florent Kermarrec: simple: create PowerOnRst and use it (remove vendor-dependent code)

17:48 <GitHub176> misoc/master 1adceb8 Florent Kermarrec: sdramphy: move and clean up s6ddrphy, add generic SDRAM PHY

17:50 <GitHub102> [migen] sbourdeauducq pushed 2 new commits to master: http://git.io/EbVdjA

17:50 <GitHub102> migen/master 8c03cb0 Florent Kermarrec: mibuild: force shell script generation to unix format (will be executed with cygwin's bash on windows)

17:50 <GitHub102> migen/master d1a96bc Florent Kermarrec: mibuild/altera_quartus: enforce use of SystemVerilog in Quartus (Verilog does not support global parameters)

19:43 mumptai has joined #m-labs

21:49 <rjo> sb0, florent: nice work! would the sdram phy work with the sdram on the papilio pro?

21:55 <sb0> rjo, yes, it should

21:56 <sb0> haven't tested it yet, but it should be fine

21:56 <sb0> but one patch is missing - the WB/LASMI bridge won't handle the 16-bit LASMI bus

21:57 <sb0> there are a few things to clean up before I commit it

21:57 <sb0> also, the PPro SDRAM could use IDDR/ODDR and run at 2x the system clock rate :) but of course, it's a bit harder than just reusing that PHY

22:05 <rjo> sb0: i see. thanks.

22:07 <rjo> sb0: did i understand that correctly: you are working on the wb/lasmi bridge currently thus pushing to papilio pro to complete support?

22:08 <sb0> Florent did most of the wb/lasmi bridge work, but yes

22:09 <rjo> by the way. i have tried to get higher uart speeds to work but failed so far. anyone have success with that? i know the rates come out slightly off (more than the few percent that are usually ok).

22:10 <rjo> sb0, Florent: excellent.

22:11 <sb0> got it to work at 230400bps, failed at higher speeds, and I didn't insist

22:12 <sb0> maybe it's a good idea to generate the rate signal with this technique: http://hamsterworks.co.nz/mediawiki/index.php/FM_SOS

22:13 <sb0> which gives high jitter, but high precision

22:17 <rjo> sb0: yes. but in that case one could just have the dcm generate the fastest sensible uart clock (probably 16MHz) and then divide jitter-free down from that.

22:18 <rjo> sb0: i also tried to use the custom baud rate stuff to match the actual rate but that did not work either. ftdi broken or custom baud rate bugs.

22:18 <sb0> yeah, but this needs several clocks (and the associated ISE bugs and other mishaps), and an asynchronous design

22:19 <rjo> custom baud rate stuff in flterm, that is.

22:20 <sb0> jitter at the ~80MHz system clock rate should actually be rather small compared to a transmission rate of a few Mbps at most

22:20 <rjo> sb0: but the DDS-style solution from that web link also needs those, doesn't it?

22:21 <rjo> sb0: oh. you want to use sys_clk

22:21 <sb0> yes

22:21 <rjo> then not.

22:21 <rjo> sb0: max jitter will be one sys_clk period. not much, yes.

22:24 <rjo> sb0: and it doesn't really matter for a UART anyway AFAICT
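
The DDS-style rate generation discussed above can be sketched as a phase accumulator clocked by sys_clk. This is a software model only, not MiSoC code; the 32-bit accumulator width, the 921600 baud target and the exact 80 MHz clock are assumptions picked for illustration.

```python
def tuning_word(baud, sys_clk_hz, acc_bits=32):
    """Increment added to the accumulator on every sys_clk cycle."""
    return round(baud * (1 << acc_bits) / sys_clk_hz)


def count_ticks(word, cycles, acc_bits=32):
    """Count accumulator overflows (baud ticks) over a number of cycles."""
    mask = (1 << acc_bits) - 1
    acc = 0
    ticks = 0
    for _ in range(cycles):
        acc += word
        if acc > mask:
            # Each overflow is one baud tick, late by less than one cycle.
            ticks += 1
            acc &= mask
    return ticks


# Assumed example: 921600 baud from an 80 MHz system clock, over 1 ms.
word = tuning_word(921600, 80_000_000)
ticks_per_ms = count_ticks(word, 80_000)
```

The average tick rate tracks the requested baud rate closely, while each individual tick can be off by up to one sys_clk period, which is the jitter bound mentioned in the log.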

22:53 mumptai has quit [Ping timeout: 240 seconds]