#milkymist on 2011-02-28 — irc logs at freenode.irclog.whitequark.org

04:58 <wpwrak> kristianpaul: (tuner on lpt) UBB ?!?

04:59 <wpwrak> (age of tuner circuit) the html files are from 2007. also the PDF of the schematics says january 2007. amazing, all this looks more 1970-ish ;-)

05:02 <roh> i dont get whats new there

05:02 <roh> isnt it a dead boring default cable tuner?

05:06 <wpwrak> dunno. maybe it has an unusually wide range or such ?

05:23 <roh> not that i can see

05:33 <wpwrak> ah well. maybe just good marketing then ;-)

09:55 <terpstra> hello all!

09:58 <terpstra> I've ported the LM32 from milkymist to a small SoC system for use on our Altera FPGAs. In the process I got the JTAG working and wrote a little tool that talks to the FPGA over the USB Blaster's JTAG. I've been able to happily load and execute small programs via this tool into the SRAM. As my next step I wanted to try to get gdb working (as a pre-step to configuring a kernel for our SoC). I've seen that milkymist somehow uses gdb with the LM32 already.

09:58 <terpstra> Where can I find out more about this?

10:45 <Fallenou> Hi terpstra really nice job :)

10:45 <Fallenou> is your project open source ? do you have a web page or something ?

10:45 <Fallenou> I would like to have a look :)

10:46 <terpstra> it is open source

10:46 <terpstra> but it's not got a project page per-se

10:47 <terpstra> at the moment i've been tasked to evaluate the alternative soft CPUs, and part of that is determining what they can do---jtag, simulation, debugger, toolchain, LUT size, speed, etc

10:47 <terpstra> the contenders are: leon3, openrisc1000, zpu, and lm32

10:48 <terpstra> so if we end up picking the lm32, then it would end up as a visible part of the project

10:48 <terpstra> i'd be happy to send you a tarball tho

10:49 <terpstra> Fallenou, am i mistaken about milkymist and gdb? all i've found is the page about using gdb with qemu, but i'm trying to get it to talk directly to the CPU in-chip

10:49 <Fallenou> I really don't know, sorry

10:50 <Fallenou> but mwalle or lekernel would know

10:50 <terpstra> i already have a working connection to the CPU via JTAG and was just thinking of whipping up a small debug ROM on the instruction bus and a register dump WOM on the data bus

10:50 <terpstra> but i don't want to reinvent the wheel

10:50 <terpstra> especially the wheel that implements the gdbserver protocol

10:50 <Fallenou> You can send an e-mail to the Milkymist mailing list if you want, it would be great to say a little "hello I am doing this and this about lm32" :)

10:50 <Fallenou> it's always good to know what others are doing

10:50 <kristianpaul> wpwrak: (UBB) yes, well, i need the 3V3 to 5V shifter, but is on my todo once i tune and can listen something..

10:51 <terpstra> ok, i'll do that

10:51 <lekernel> terpstra: hi

10:51 <lekernel> what you're looking for is there: http://git.serverraum.org/?p=mw/openocd-lm32.git;a=summary

10:51 <terpstra> btw, where can i find a "clean" copy of the milkymist lm32? i've fixed a few bugs in the copy i got from your tree and probably should give you the patches

10:51 <lekernel> it's undocumented however

10:52 <lekernel> and, afaik, not thoroughly tested

10:52 <terpstra> these things are not roadblocks ;)

10:53 <lekernel> terpstra: hmm, there is no copy of the "milkymist lm32" other than in the milkymist github repository

10:53 <lekernel> what kind of bugs have you fixed?

10:53 <terpstra> the jtag tweaks you guys did had problems with clock domains (for altera at least)

10:54 <terpstra> i 'fixed it' by making it use the capture JTAG state instead of just grabbing data on e1dr

10:54 <terpstra> and thus removed sensitivity to stuff that wasn't the clock

10:54 <terpstra> the other problem was that you couldn't flush the icache over jtag

10:54 <terpstra> the jtag write csr was just ignored

10:54 <lekernel> interesting

10:55 <terpstra> this is needed if you load your firmware over jtag (as i do) and then want the cpu to execute it

10:55 <terpstra> also i changed the register file

10:55 <terpstra> you had it using actual registers

10:55 <terpstra> i switched it to use the positive edge EBR implementation

10:55 <terpstra> which costs 2k of memory bits but saves like 1-1.5k LUTs

10:55 <lekernel> mh, last time I checked Xst was able to synthesize the LM32 register file on distributed RAM

10:56 <terpstra> (making the LM32 only 3k instead of 4.5k on cyclone3 and 1.5k on arria2)

10:56 <terpstra> the code i copied was still using lattice blackbox logic for this.. .?

10:56 <terpstra> i switched it to inferred

10:57 <lekernel> ha, so you're not using the milkymist code I think... I stripped out all the lattice logic

10:57 <terpstra> not all

10:57 <terpstra> oh: i also pipelined the multiplier

10:57 <terpstra> that let me get it up to 175MHz

10:57 <Fallenou> wow :)

10:57 <lekernel> in cyclone?

10:57 <Fallenou> nice !

10:57 <terpstra> you stripped out a lot of the ram stuff in the i/d-cache

10:57 <terpstra> cyclone3 is 125MHz

10:57 <terpstra> arria2 is 175

10:57 <lekernel> ah, yes :)

10:58 <lekernel> that's still pretty fast

10:58 <terpstra> i am pretty happy with it

10:58 <lekernel> even without a multiplier at all spartan6 barely reaches 100MHz

10:58 <terpstra> it seems to run stable (and quartus timequest was happy anyway)

10:58 <terpstra> i have everything except DIV enabled

10:59 <terpstra> the DIV seems quite expensive for very poor performance gain :P

10:59 <lekernel> where is the remaining lattice blackbox logic? i'm checking the code atm

10:59 <lekernel> it's probably `ifdef'd out anyway, since Xst doesn't whine

10:59 <terpstra> you need dual port ram

10:59 <terpstra> that's why you didn't do it

10:59 <terpstra> you made a single port lm32_ram

10:59 <terpstra> but the register file needs dual port

11:00 <terpstra> look in lm32_cpu.v

11:00 <terpstra> search down to

11:00 <lekernel> lm32_ram is always supposed to be single port, no?

11:00 <terpstra> search to: 'Register file instantiation as Pseudo-Dual Port EBRs.'

11:00 <terpstra> that code is disabled b/c you don't set:

11:01 <terpstra> `define CFG_EBR_POSEDGE_REGISTER_FILE

11:01 <terpstra> in your include.v

11:01 <terpstra> that saved me a lot of area and might be why mine is faster than yours

11:01 <terpstra> i made a lm32_dp_ram.v

11:02 <terpstra> for inferring dual-port memory and plopped it on top of the lattice blackbox

11:02 <terpstra> they use 2x dual-port for the register file as follows:

11:02 <terpstra> each cycle, the target register in both is updated

11:02 <terpstra> each cycle, the source register r0 is read from the copy0 and r1 from copy1

11:02 <terpstra> so single-port won't cut it
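
The two-copy scheme terpstra describes can be sketched as a small behavioural model. This is only an illustration, assuming 32 registers of 32 bits and read-before-write behaviour within a cycle; the class and method names (`DualPortRam`, `RegisterFile`, `cycle`) are hypothetical and are not taken from the LM32 sources. Synchronous read latency and bypass logic are not modelled.

```python
class DualPortRam:
    """One write port plus one read port, standing in for an inferred dual-port RAM."""

    def __init__(self, depth):
        self.mem = [0] * depth

    def write(self, addr, value):
        self.mem[addr] = value

    def read(self, addr):
        return self.mem[addr]


class RegisterFile:
    """Two-read, one-write register file built from two mirrored RAM copies."""

    def __init__(self, num_regs=32):
        self.copy0 = DualPortRam(num_regs)
        self.copy1 = DualPortRam(num_regs)

    def cycle(self, rs0, rs1, rd=None, value=0):
        # Each copy serves exactly one source operand per cycle.
        a = self.copy0.read(rs0)
        b = self.copy1.read(rs1)
        # The target register is written into both copies so they stay identical.
        if rd is not None:
            self.copy0.write(rd, value & 0xFFFFFFFF)
            self.copy1.write(rd, value & 0xFFFFFFFF)
        return a, b


rf = RegisterFile()
rf.cycle(0, 0, rd=3, value=42)   # write r3
rf.cycle(0, 0, rd=5, value=7)    # write r5
operands = rf.cycle(3, 5)        # read r3 and r5 in the same cycle
```

Each RAM copy only ever sees one read and one write per cycle, which is why a single-port memory is not enough but a pair of simple dual-port memories is.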

11:02 <lekernel> iirc now the register file is implemented on asynchronous distributed RAM. yeah, maybe putting it into the block RAM might improve things a bit

11:03 <terpstra> like i said, it cut 150% to 100% area for me

11:03 <lekernel> though I doubt it would be as much as 125MHz

11:03 <lekernel> yeah of course, if you had it on pure LUTs in the beginning, it becomes slow and bloated

11:03 <lekernel> with distributed RAM, it's not as bloated as pure LUTs

11:03 <terpstra> afaik, that's what you're doing atm?

11:03 <terpstra> you're using the registers[] array

11:04 <Fallenou> the array if inferred in blockram i guess

11:04 <Fallenou> is*

11:04 <terpstra> reg [`LM32_WORD_RNG] registers[0:(1<<`LM32_REG_IDX_WIDTH)-1];  // Register file

11:04 <terpstra> that didn't become inferred blockram

11:04 <terpstra> and i doubt it can

11:04 <terpstra> since it's used with multiple access

11:04 <lekernel> because the LUT becomes used as an optimized RAM

11:04 <lekernel> i.e. the portion of the LUT that is normally used for configuration stores instead the RAM data

11:04 <lekernel> it's a special mode of the Xilinx LUTs
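
A rough model of the idea lekernel describes: a LUT is just a small truth table, and in RAM mode the same table becomes writable. The sketch below assumes a 4-input LUT for brevity (Spartan-6 LUTs have six inputs), and the names `Lut4` and `DistributedRam` are hypothetical, not Xilinx primitives.

```python
class Lut4:
    """A 4-input LUT modelled as a 16-entry truth table."""

    def __init__(self, init=0):
        # In logic mode these bits come from the configuration bitstream.
        self.bits = [(init >> i) & 1 for i in range(16)]

    def read(self, addr):
        # Evaluating the logic function and reading the RAM are the same lookup.
        return self.bits[addr & 0xF]

    def write(self, addr, bit):
        # Only available in RAM mode: the truth table is updated at run time.
        self.bits[addr & 0xF] = bit & 1


class DistributedRam:
    """A 16-word memory, one LUT per data bit."""

    def __init__(self, width=32):
        self.luts = [Lut4() for _ in range(width)]

    def write(self, addr, value):
        for i, lut in enumerate(self.luts):
            lut.write(addr, (value >> i) & 1)

    def read(self, addr):
        return sum(lut.read(addr) << i for i, lut in enumerate(self.luts))


and4 = Lut4(0x8000)            # logic mode: output is 1 only when all inputs are 1
ram = DistributedRam(width=32)
ram.write(5, 0xDEADBEEF)       # RAM mode: same structure, writable contents
```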

11:05 <terpstra> i've not used xilinx yet

11:05 <lekernel> and, iirc, Xst infers this mode for the LM32 register file

11:05 <kristianpaul> (125MHz yay !)

11:05 <terpstra> kristianpaul, it gets slower once you hook the WB up to a crossbar

11:05 <terpstra> atm my design only hits 124.6MHz (which is extremely frustrating)

11:05 <kristianpaul> ah :--/

11:05 <lekernel> terpstra: still it'd be interesting to examine the possibility to map it to block RAM

11:06 <lekernel> could you send your patch to the mailing list, along with the jtag fixes?

11:06 <terpstra> lekernel, like i said, i just made an inferred dual port memory and plopped it in

11:06 <Fallenou> please share your code ;)

11:06 <terpstra> give me a pointer to where your clean tree is

11:06 <terpstra> and i'll break my tree into patches wrt. it

11:06 <lekernel> https://github.com/lekernel/milkymist/tree/master/cores/lm32/rtl

11:07 <lekernel> cyclone 3 seems pretty fast anyway

11:07 <terpstra> (*runs off to find documentation for git*)

11:07 <lekernel> how much % of it is used for LM32? and how much does your chip cost?

11:07 <terpstra> uhm

11:07 <terpstra> i'm using the 'cyclone3 starter board'

11:07 <terpstra> it was just lying around our office

11:08 <terpstra> i don't know how much we paid for it

11:08 <terpstra> i've heard pricing varies a lot based on who you buy it from though

11:08 <terpstra> my design atm is 4.6k LUTs

11:08 <terpstra> sorry

11:08 <lekernel> http://www.altera.com/products/devkits/altera/kit-cyc3-starter.html ?

11:08 <terpstra> 3.4k

11:08 <terpstra> at 14% of chip

11:09 <lekernel> he, pretty good

11:09 <terpstra> yes

11:09 <terpstra> that's the one

11:09 <terpstra> we also have a few arria2 PCIe boards

11:09 <terpstra> on the arria2 it was 1.5k LUTs -- or 1.5% area

11:09 <terpstra> (ie: could fit tons of them on it)

11:09 <lekernel> but arria2 is expensive, no?

11:09 <terpstra> probably

11:09 <terpstra> i don't have to buy these things, so don't know ;)

11:10 <terpstra> for our SoC arria2 is the eventual end chip

11:10 <Fallenou> cyclone3 starter kit seems a nice board :) not so expensive

11:10 <lekernel> the same design uses 3.4k Cyclone 3 LUTs and only 1.5k Arria2 LUTs?

11:10 <terpstra> yes

11:10 <terpstra> arria2 seems much smaller

11:10 <Fallenou> definitely less expensive than spartan6 xilinx boards

11:11 <terpstra> fatal: https://github.com/lekernel/milkymist/tree/master/cores/lm32/rtl/info/refs not found: did you run git update-server-info on the server?

11:12 <terpstra> i'm not a git user

11:12 <lekernel> you should use git://github.com/lekernel/milkymist.git as git clone URL

11:12 <lekernel> not the HTTP link

11:12 <terpstra> -.-

11:12 <terpstra> thanks

11:14 <terpstra> you guys use pure verilog, yeah?

11:14 <lekernel> yes

11:14 <terpstra> our interconnect is all vhdl so i guess you don't want that

11:14 <roh> uh. 200$ is surprisingly cheap for a develboard

11:14 <Fallenou> yes

11:14 <lekernel> when we move to our own synthesis technology, it'll be easier if we have only one language :)

11:15 <terpstra> it's quite nice to use, the cyclone3

11:15 <roh> terpstra: well.. does it have nicer tools than quartus ;)

11:15 <terpstra> what's wrong with quartus?

11:15 <terpstra> you can still use joe and make ;)

11:16 <terpstra> ok

11:17 <terpstra> got a patch file with all the whitespace and/or warning-silencing edits removed

11:17 <terpstra> but it kinda rolls all the changes together

11:18 <terpstra> where to send it?

11:18 <lekernel> devel at lists.milkymist.org

11:18 <roh> terpstra: well.. its the same crap as xilinx tools.. in 'getting them' as well as 'installing them'

11:19 <roh> they are _huuuughe_ and the vendors are a pain in the ass with only for account users and stuff..

11:19 <lekernel> roh: you are welcome to send me llhdl contributions ;)

11:20 <lekernel> let's replace this crap

11:21 <roh> lekernel: heh... first i need to be able to WRITE code in verilog. currently i am quite happy to make sense of it when reading. knowing electronics and C helps a lot tho

11:21 <roh> my last attempt to install ISE was prohibited by available diskspace *sigh*

11:22 <roh> what the f*ck do they need 2-digit gbytes for?

11:22 <lekernel> own copy of the C libraries, own copy of the C++ libraries, JVM, own copy of Perl, ....

11:23 <roh> last time i installed quartus i needed win32 for it and it crashed on a (guided) attempt to compile something.. i guess because it was a guide and code for another version or so... sigh.

11:23 <lekernel> plus a ton of bloated pseudo-cross-platform libraries

11:23 <roh> jvm? wtf?

11:23 <lekernel> yeah, some parts of the xilinx toolchain are in java

11:23 <terpstra> ok, email sent

11:23 <roh> if their licence wouldnt suck someone could make it _very_ small i guess.. removing all the stuff already packaged in the distro (if you got an os with packaging and not win32)

11:24 <lekernel> also some of their executables aren't stripped

11:24 <terpstra> i suppose i should subscribe to this list

11:24 <lekernel> terpstra: i'll moderate your message

11:24 <larsc> roh: i guess you are lucky that it doesn't bring its own apache with php

11:25 <lekernel> terpstra: so, you're doing heavy ion research? interesting :)

11:26 <terpstra> i apologize for writing the firmware loader in tcl.... altera still won't give me the headers to talk jtag via C

11:26 <terpstra> that's what GSI has done in the past

11:26 <roh> larsc: these would be actually small ;)

11:26 <terpstra> we will be producing positrons and stuff now

11:26 <terpstra> we need a softcore as part of the control system that runs the accelerator

11:26 <scrts`> how much did You pay for that arriaII pci-e board?

11:27 <scrts`> I am looking for a pci-e board :)

11:27 <lekernel> terpstra: there's also Uwe Bonnes from the Institut für Kernphysik in Darmstadt who's doing similar things

11:28 <lekernel> with softcores I mean

11:28 <terpstra> my colleague says we paid 1500EUR

11:28 <terpstra> for the PCIe

11:28 <lekernel> he's on the milkymist list

11:28 <terpstra> but that's the more expensive development one we use for prototyping

11:29 <wolfspraul> lekernel: do you have some Milkymist news I should mention in the qi february community update?

11:32 <lekernel> wolfspraul: on http://en.qi-hardware.com/wiki/Community_news_2011-02-01 ?

11:32 <lekernel> JTAG cable fix, power-up fix

11:33 <wolfspraul> 03-01

11:33 <wolfspraul> http://en.qi-hardware.com/wiki/Community_news_2011-03-01

11:34 <lekernel> so, power-up fix

11:34 <wolfspraul> I'll move Milkymist up (before NanoNote), we have to push it more :-)

11:34 <lekernel> he, I didn't see that http://en.qi-hardware.com/wiki/File:Sigemm1.jpg

11:34 <wolfspraul> anything on the rtems or flickernoise side?

11:35 <lekernel> flickernoise improvements: PDF-based online help system, GUI usability improvements (like mouse wheel scrolling support)

11:35 <lekernel> on rtems, there has been some bugfixing on Ethernet

11:35 <lekernel> but it'd still need work

11:36 <Fallenou> yes ethernet is improving, but there is much more to do

11:36 <wolfspraul> ok this is helpful

11:37 <lekernel> wolfspraul: also you can post about the french linuxmag article that I released

11:37 <wolfspraul> but the article is a year old, no?

11:37 <terpstra> this openocd project is not affiliated with milkymist?

11:37 <lekernel> wolfspraul: yeah, but it's a technical article anyway (on MM SoC)

11:38 <lekernel> it's still valuable content I think

11:38 <lekernel> some people still don't get what mm soc is about

11:38 <wolfspraul> why did you publish it now?

11:38 <wolfspraul> you had to wait 12 months?

11:38 <lekernel> yeah

11:38 <lekernel> it was published on paper before

11:38 <wolfspraul> sure

11:39 <wolfspraul> so the news is 'after a 12 months freeze period, Sebastien was able to publish...'

11:39 <wolfspraul> or I leave the freeze out, need to shorten it

11:39 <wolfspraul> ok I'll mention it

11:39 <lekernel> just leave the freeze out

11:40 <lekernel> most of the technical info in there is still accurate

11:41 <lekernel> also, mwalle posted some Linux patches in order to have clean bFLT binary support

11:45 <lekernel> you may want to put those forward, Linux support is often what rings bells in the open source community

11:47 <terpstra> !

11:47 <terpstra> http://git.serverraum.org/?p=mw/openocd-lm32.git;a=blob;f=src/target/lm32.c;h=8eb141561ed3f3e10b981be28c21c257a6b016bd;hb=1373f61cacc2e3012e58dd2083254dc198126ffb

11:47 <terpstra> i wish i'd seen this file earlier -.-

11:47 <larsc> i'm hoping that i'll find some time this week to finally turn the kernel and uclibc patch into an actual linux distro using openwrt

11:47 <terpstra> would've saved me quite some wirj\

11:47 <terpstra> work*

11:48 <lekernel> terpstra: openocd isn't part of milkymist, it's a generic JTAG debugger that mwalle (temporarily) forked and added milkymist support to

11:49 <terpstra> so the jtag changes you guys made to the lm32 were actually working for you?

11:49 <lekernel> as I said, this wasn't thoroughly tested

11:49 <lekernel> but many parts of it worked, yes

11:50 <terpstra> you are using the uart to communicate with your debug rom

11:50 <terpstra> i'd hoped to avoid doing that

11:52 <terpstra> where is the rom that goes with this port?

11:52 <lekernel> terpstra: https://github.com/lekernel/milkymist/tree/master/software/monitor

11:53 <terpstra> thanks

11:53 <terpstra> wow. quite small!

11:53 <terpstra> i'll look into it after lunch.

11:53 <terpstra> thank you very much for all these links.

11:54 <terpstra> i hope my patch might prove useful in speeding up your LM32 too :)

11:56 <lekernel> hopefully :)

11:56 <lekernel> but meeting timing in mm soc is a bloody quagmire

11:56 <lekernel> sometimes, _removing_ logic breaks timing

11:57 <lekernel> just because the placer's heuristic algorithm then picks the wrong numbers

11:59 <lekernel> and so far I haven't really found a better way of getting things to work than to run multiple instances of the place and route on a multicore machine each with a different PRNG sequence

11:59 <lekernel> until one happens to work
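
The seed sweep lekernel describes can be sketched as below. This is only an outline: `run_place_and_route` is a hypothetical stand-in that returns a made-up worst slack in nanoseconds, and a real flow would invoke the vendor place-and-route tool with the given seed and parse its timing report instead. For simplicity the sketch waits for all runs and then picks the first passing seed, rather than stopping as soon as one succeeds.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def run_place_and_route(seed):
    """Hypothetical placeholder for one P&R run; returns worst slack in ns."""
    rng = random.Random(seed)
    return rng.uniform(-0.5, 0.2)


def first_passing_seed(seeds, run=run_place_and_route, workers=4):
    """Run P&R once per seed in parallel and return (seed, slack) of the first pass."""
    seeds = list(seeds)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        slacks = list(pool.map(run, seeds))
    for seed, slack in zip(seeds, slacks):
        if slack >= 0.0:   # non-negative slack: the timing model reports the design as met
            return seed, slack
    return None            # no seed met timing


# Deterministic demonstration with a fake runner whose slack is simply seed - 3.
result = first_passing_seed(range(6), run=lambda seed: seed - 3)
```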

12:00 <roh> eh. how do you know which one 'works' ?

12:00 <lekernel> there's a timing model that tells you if the design is ok or not

12:01 <lekernel> this, of course, is also subject to bugs

12:01 <roh> ouch.

12:01 <lekernel> especially with spartan6 it seems

12:01 <lekernel> i've overclocked designs without problems by as much as one nanosecond

12:01 <roh> the more i learn about fpga and their develtools.. the more frightened i am about the wicked state of affairs. man thats dirty and bloody.

12:02 <lekernel> and, on the other hand, designs that were supposed to meet timing exhibited intermittent issues until I froze the FPGA to some -40C

12:03 <lekernel> fortunately it's rare, but it happens

12:03 <lekernel> bottom line, a good freeze spray is sometimes handy when you're tracking down weird FPGA bugs

12:05 <roh> lekernel: hehe. yes. also when debugging other electronics.

12:05 <roh> even on repairing analog electronics

12:06 <lekernel> roh: all silicon compilers are dirty and bloody. you can read even better stories e.g. on http://deepchip.com/

12:20 <terpstra> lekernel, why are you using such a bad tool for place and route?

12:20 <terpstra> if it's so unreliable, pick another

12:21 <lekernel> there's no other

12:21 <terpstra> or is this the xilinx tools you speak about?

12:21 <lekernel> yeah, it's the xilinx place and route

12:21 <terpstra> quartus seems quite reliable when it comes to synthesis

12:21 <terpstra> and it can show you nicely where and why your design is slow

12:22 <lekernel> well, I had my share of Altera bugs too

12:22 <terpstra> i'm not exactly experienced with these things... only learned vhdl and verilog three months ago ;)

12:23 <lekernel> while I think their software is slightly less bad than Xilinx's, I wouldn't be surprised if such p&r woes also happen with Altera

12:24 <terpstra> you mean the inconsistent timings that come out of placed and routed logic?

12:24 <terpstra> supposedly altera can use timing-driven synthesis to help here

12:25 <lekernel> no, I'm talking about timing model bugs

12:25 <terpstra> i see

12:26 <terpstra> well altera has two different timing analysis tools

12:26 <terpstra> hopefully at least one of them will work ;)

12:26 <terpstra> (tho so far i've only seen both work)

12:26 <lekernel> that sounds quite painful too :)

12:26 <terpstra> nah

12:26 <terpstra> the newer chips use the newer tool "time quest"

12:26 <lekernel> i'd say: let's simply develop our own open source timing engine

12:26 <terpstra> the older ones used some other tool

12:27 <terpstra> lekernel, i think you underestimate how hard it is to do this well

12:27 <lekernel> oh, I never said it was easy

12:28 <terpstra> it's my understanding also that the bitstream you need to program FPGAs is a closely guarded trade secret

12:28 <terpstra> so making a new synthesis tool would be a PITA

12:29 <lekernel> i'll probably get the complete (reverse engineered) spartan6 xilinx bitstream format in my mailbox during the next weeks :)

12:29 <terpstra> hah

12:29 <lekernel> and btw, it's not that hard to reverse engineer

12:30 <terpstra> are you seriously planning on building your own synthesis tool?

12:30 <lekernel> I wouldn't say it's easy, but it's not impossible or super-hard either

12:30 <terpstra> i didn't say it would be hard, i said it would be a PITA. expect lawsuits coming your way if you use that reverse engineered info.

12:30 <lekernel> phew. that's what everyone says. but i've heard a lot of rumors and all turned out to be false

12:30 <terpstra> as part of installing quartus, i agreed not to reverse engineer it

12:31 <terpstra> (in the licence text)

12:31 <terpstra> don't get me wrong, though: building an open source synthesis tool would be a great project!

12:31 <terpstra> just like gcc is the cornerstone of open source

12:31 <lekernel> iirc the xilinx license prohibits decompilation and disassembly. the bitstream format is recovered using black box techniques

12:32 <lekernel> the guy runs the toolchain the normal way, and then uses custom binary analysis tools on the result

12:33 <lekernel> and btw, I'm not even sure Xilinx would go after a project that produces bitstreams for their devices

12:33 <lekernel> there are tons of rumors depicting FPGA companies as "evil guys", but an astonishing amount of them are pure bullshit

12:34 <terpstra> i've been pretty impressed by how open altera has been with me

12:34 <terpstra> i asked them for the header files for their jtag client library

12:34 <terpstra> and mentioned that an NDA would be a problem

12:34 <lekernel> Xilinx even provides (unsupported) lists of all the interconnect in their chips

12:34 <terpstra> ... and they sent me complete documentation and a whole whack of source

12:35 <lekernel> i'm even writing a parser for them atm ;)

12:35 <lekernel> those are multi-GB text files

12:36 <lekernel> what's missing is 1. timing information (this will be hard, probably need to build a chip characterization system) and 2. how this information relates to bitstream content (not extremely hard to find out)

12:36 <terpstra> this debug ROM is evil. r0!=0. bad. :)

12:36 <lekernel> terpstra: it's done on purpose... and works nicely

12:36 <terpstra> i know

12:36 <terpstra> i was planning on doing it too ;)

12:37 <terpstra> it's the only available register that one can smash to point to the register save region

12:37 <lekernel> the characterization system would probably involve building various ring oscillators with the elements to characterize in the loop

12:37 <lekernel> then measuring the resulting frequency

12:37 <terpstra> eh?

12:38 <lekernel> and finally solving the system of equations to find out the timing property of each element
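
A minimal numerical sketch of the characterization idea, under stated assumptions: each ring oscillator's period is taken to be twice the sum of the delays of the elements in its loop, there are exactly as many independent oscillators as unknown element delays, and the measurements are noise-free. The function names and the example numbers are made up for illustration; a real system would need many more measurements and a least-squares fit.

```python
def solve_linear(a, b):
    """Solve a square linear system by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [list(map(float, row)) + [float(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("oscillator set does not determine all delays")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def element_delays(counts, periods_ns):
    """counts[k][j] = how many times element j appears in oscillator k's loop."""
    # One period is two trips around the loop, so half a period is the loop delay.
    loop_delays = [p / 2.0 for p in periods_ns]
    return solve_linear(counts, loop_delays)


# Three made-up oscillators built from three element types.
counts = [[3, 1, 0],
          [1, 2, 1],
          [1, 0, 4]]
periods_ns = [2.8, 3.0, 2.2]   # hypothetical measurements
delays = element_delays(counts, periods_ns)
```

With these made-up numbers the solver recovers per-element delays of roughly 0.3 ns, 0.5 ns and 0.2 ns.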

12:38 <lekernel> this sounds like a lot of fun

12:38 <terpstra> wouldn't you just trace the signal from inputs to outputs?

12:38 <terpstra> some sort of graph traversal algorithm

12:38 <terpstra> with weights based on chip-specific timing information
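
The graph traversal terpstra has in mind is, in its simplest form, a longest-path search over a weighted DAG. The sketch below assumes purely combinational logic between one launch point and one capture point and ignores setup time, clock skew and routing; the node names and delays are invented for illustration.

```python
from collections import defaultdict, deque


def critical_path_delay(edges):
    """edges: iterable of (src, dst, delay_ns) describing an acyclic netlist graph."""
    successors = defaultdict(list)
    indegree = defaultdict(int)
    nodes = set()
    for src, dst, delay in edges:
        successors[src].append((dst, delay))
        indegree[dst] += 1
        nodes.update((src, dst))

    arrival = {node: 0.0 for node in nodes}
    ready = deque(node for node in nodes if indegree[node] == 0)
    visited = 0
    # Visit nodes in topological order, keeping the latest arrival time at each.
    while ready:
        node = ready.popleft()
        visited += 1
        for dst, delay in successors[node]:
            arrival[dst] = max(arrival[dst], arrival[node] + delay)
            indegree[dst] -= 1
            if indegree[dst] == 0:
                ready.append(dst)
    if visited != len(nodes):
        raise ValueError("graph contains a combinational loop")
    return max(arrival.values(), default=0.0)


# Two paths from "in" to "out"; the slower one sets the critical delay.
worst = critical_path_delay([
    ("in", "a", 1.0),
    ("in", "b", 2.5),
    ("a", "out", 3.0),
    ("b", "out", 1.0),
])
```

The per-edge weights are exactly the chip-specific timing data that has to come from somewhere, whether from the vendor tools or from measurement.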

12:38 <lekernel> yes, but it's easier to do that automatically and on-chip with a ring oscillator

12:38 <terpstra> why would you want to do timing analysis on-chip?

12:39 <lekernel> oh, the purpose is to recover that timing information

12:39 <terpstra> i'd rather be running it on my fat intel devel system

12:39 <terpstra> oh

12:39 <terpstra> sorry, i gotcha

12:39 <lekernel> it's built into the xilinx software atm

12:39 <lekernel> and we can either reverse engineer the software, or measure it ourselves

12:39 <terpstra> so you want to make what amounts to a bitstream that can measure the delays in the chip

12:39 <lekernel> imo the second technique is more fun, accurate and legal

12:39 <terpstra> and then use that to feed into the timing analysis program

12:39 <lekernel> yes

12:39 <terpstra> nifty

12:40 <terpstra> you would be able to account for per-chip variability that way

12:40 <lekernel> yeah, we'll probably need to run measurements on many chips and at different voltages and temperatures

12:40 <lekernel> but it's easy. all it would need is a jtag probe and a on-board stable clock source

12:40 <terpstra> in terms of open source synthesis tools

12:40 <lekernel> the rest is automated

12:41 <terpstra> i'd be more interested in the front-end stuff

12:41 <lekernel> the front end stuff is already working to some extent

12:41 <lekernel> https://github.com/lekernel/llhdl/wiki

12:42 <terpstra> you do the necessary optimizations/etc already?

12:42 <lekernel> this works for example:

12:42 <lekernel> https://github.com/lekernel/llhdl/blob/master/designs/blinker/blinker.v

12:42 <lekernel> no... I spent less than two months on that

12:42 <lekernel> at first I focus on producing working netlists

12:42 <lekernel> not necessarily fully optimized

12:42 <terpstra> yep

12:42 <lekernel> though it's already capable of using carry chains and such

12:43 <lekernel> missing optimizations are a good "random logic" LUT mapper (I'm thinking of using the BDS-PGA algorithm)

12:43 <terpstra> i like the idea of a LLVM for hardware a lot

12:43 <lekernel> FSM re-encoding

12:44 <lekernel> and a couple of smaller things, like shift register extraction, large mux extraction, large comparator extraction, ...

12:44 <lekernel> most can be implemented with the current architecture as Mapkit "plug-ins"

12:45 <lekernel> also, there are a couple of things that the Verilog front end doesn't support, e.g. instantiations, parameters, case statements and generate

12:46 <lekernel> lots of work :p

12:46 <terpstra> indeed

12:46 <lekernel> but still not too bad for < 2 months

12:46 <terpstra> i'm somewhat surprised no one else has started doing this already?

12:46 <lekernel> well, there have been attempts

12:46 <lekernel> but usually they all degenerate into sterile debate and often undue trolling towards FPGA companies

12:47 <terpstra> hah

12:47 <lekernel> and, sometimes, fail because of mere technical incompetence

12:47 <lekernel> but I think the main factor is trolling and other management problems

12:48 <terpstra> if i had more time, i'd be interested in helping out

12:48 <terpstra> maybe in a few months i'll be back

12:49 <lekernel> http://www.fpgarelated.com/usenet/fpga/show/36355-1.php is a good example of what typically happens in this field

12:50 <lekernel> and the funniest thing is JHDLBits never got squashed by Xilinx

12:50 <lekernel> it's all rumors

12:51 <terpstra> by not interfacing with their code at all

12:51 <terpstra> but building from bitstream+jtag up, you avoid a lot of the stickiness

12:51 <lekernel> LLHDL puts out EDIF, which is a standard format...

12:52 <lekernel> which is then read by the xilinx p&r (for now)

12:52 <lekernel> LLHDL is just the front end, it doesn't do any physical implementation

12:53 <lekernel> this will be handled by a separate project (and now by the fpga vendor's tools, through the standard EDIF interface)

12:53 <terpstra> so you can already run your compiled llhdl?

12:53 <terpstra> that's nice!

12:53 <lekernel> yeah, the verilog file I posted works nicely on the MM1 board

12:53 <terpstra> nice

12:53 <lekernel> with llhdl synthesis

12:54 <lekernel> I expect that in some other two months, it'll be with open source antares p&r and bitstream generation :)

12:58 <terpstra> do i understand this rom that on a breakpoint it saves everything to ram, reports via uart that its ready and then reads the offset (!) of the command to execute?

12:59 <lekernel> mh... I don't know... mwalle wrote this

12:59 <terpstra> you guys are french, yeah?

13:00 <lekernel> not everyone

13:01 <lekernel> actually most people here are German

13:01 <lekernel> and I live in Berlin, though I'm French :)

13:01 <terpstra> i think i understand this ROM enough now to use it. now to try and compile this openocd-lm32 :)

13:01 <terpstra> i live in darmstadt, though i'm canadian

13:04 <lekernel> he, we can visit GSI?

13:04 <lekernel> http://gsi.de/informationen/visitors/index_e.html

13:05 <lekernel> sounds fun :)

13:05 <terpstra> sure

13:05 <terpstra> they have tour guides and everything

13:05 <lekernel> I was at LHC last year and ILL this summer

13:05 <terpstra> then they lead you into the accelerator room, lock the door, and turn on the beam!

13:05 <lekernel> http://lekernel.net/summer2010

13:05 <lekernel> haha

13:05 <roh> .oO(we need a particle accelerator at the camp!11!)

13:06 <roh> http://events.ccc.de/2010/08/10/chaos-communication-camp-2011/

13:06 <roh> must-be for hackers this summer

13:06 <lekernel> roh: we can take stuff out of that X-Ray system I told you about. last attempt stopped when I ran into the probably PCB-contaminated oil cooling system

13:06 <lekernel> maybe i'll come later with appropriate gloves etc.

13:06 <roh> *g*

13:10 <lekernel> The number of participants should exceed 10 persons and stay below a maximum of 60 persons.

13:10 <lekernel> ok, who's in?

13:13 <roh> lekernel: ccc berlin does 'hackertours' sometimes.. maybe you should ask for participants there.

13:15 <terpstra> lol

13:15 <terpstra> if you do come to the GSI, let me know

13:15 <terpstra> you can meet the real hardware hackers here

13:15 <roh> terpstra: bring people to the camp 2011 ;)

13:15 <roh> we always want to meet people working on the interesting stuff nobody else understands

13:16 <roh> http://chaosradio.ccc.de/ctv113.html explains the camp

13:17 <terpstra> hmm

13:21 <scrts`> whats the camp 2011? :)

13:22 <scrts`> link?

13:22 <lekernel> terpstra: i'm driving to Paris in May. Darmstadt is a very small detour.

13:24 <terpstra> cool

13:27 <roh> scrts`: 2 lines above your question

13:27 <lekernel> roh: btw, are there lectures there?

13:27 <lekernel> like HAR

13:27 <roh> lekernel: yes. ofcourse. also workshops.

13:28 <lekernel> well, from now on I'll avoid CCC workshops

13:28 <lekernel> but perhaps a LLHDL talk would be nice

13:28 <roh> naah. dont be a little girl... self-organisation also doesnt work sometimes

13:29 <roh> if you really want to do something.. just do. nobody will hinder you. you could even set up your own big tent if you like

13:29 <roh> from a certain size up we ask to announce that in advance for planning and reserving the space

13:31 <lekernel> roh: do they send a cfp and when?

13:32 <roh> we are working on that the coming weeks

13:32 <terpstra> openocd-lm32 doesn't build cleanly as version.texi is missing

13:32 <terpstra> git add? :)

13:32 <roh> i think for the camp it will be rather a 'call for participation' since people really seldom send in papers lately

13:34 <lekernel> I must say that after the milkymist workshop quagmire, this doesn't really encourage me to submit :)

13:34 <terpstra> which one of you is sebastien?

13:35 <lekernel> it's me

13:35 <terpstra> ok :)

13:35 <terpstra> i realized i forgot to send you lm32_dp_ram.v -.-

13:35 <lekernel> send it to the list

13:36 <lekernel> open reviews and horizontal communication are good

13:42 <roh> lekernel: maybe you should rather submit a talk and do the workshop less planned

13:43 <terpstra> lm32> load milkymist/software/monitor/monitor.elf

13:43 <terpstra> Capturing the CPU at address 0x0: done

13:43 <terpstra> Loading 0x000054+0x00554 to 0x10000000

13:43 <terpstra> section .text: 1364 bytes - complete

13:43 <terpstra> Releasing CPU: done

13:43 <terpstra> :)

13:43 <roh> that always worked good afaik. also we know that the workshop-orga was not good at the congress (there wasnt any) and we need to get better there (wanna help with it? ;)

13:44 <terpstra> doh. my initial DEBA dosen't match

13:45 <lekernel> well, maybe :)

13:58 <lekernel> terpstra: out of those 1.5K LUTs on Arria2, how many of those LUTs use the "fracturing" feature?

13:58 <terpstra> i don't know what that is ;)

13:58 <lekernel> i'm quite amazed that this FPGA architecture cuts the LUT count in more than half

13:58 <terpstra> yeah

13:58 <lekernel> and I even suspect some figure manipulation ;)

13:59 <terpstra> what is the fracturing feature?

13:59 <lekernel> make two LUTs with one

13:59 <terpstra> i've never seen it mentioned in the consumed resources reports

13:59 <lekernel> with fewer inputs each

13:59 <lekernel> http://www.altera.com/products/devices/stratix-fpgas/stratix-ii/stratix-ii/features/architecture/st2-lut.html

14:00 <terpstra> i'll copy-paste the relevant bits from the report

14:01 <terpstra> ; Family                            ; Cyclone III             ;

14:01 <terpstra> ; Device                            ; EP3C25F324C6            ;

14:01 <terpstra> ; Timing Models                     ; Final                   ;

14:01 <terpstra> ; Total logic elements              ; 3,571 / 24,624 ( 15 % ) ;

14:01 <terpstra> ;     Total combinational functions ; 3,330 / 24,624 ( 14 % ) ;

14:01 <terpstra> ;     Dedicated logic registers     ; 1,650 / 24,624 ( 7 % )  ;

14:02 <terpstra> ; Total registers                   ; 1650                    ;

14:02 <terpstra> that's for the cyclone3

14:02 <terpstra> i'll rebuild it now for arria2

14:02 <lekernel> ah, it's "logic elements"

14:02 <lekernel> so a LUT fractured in two would count as one

14:02 <lekernel> (imo)
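
A rough illustration of what "fracturing" means may help here. The sketch below is a simplified model and not Altera's actual ALM (the linked page describes the real structure, which is more elaborate): a 6-input LUT is treated as a 64-bit truth table, and splitting that table into two 32-bit halves yields two independent 5-input functions of the same five inputs. The names `lut_eval` and `fractured_eval` are made up for this example.

```python
def lut_eval(mask, inputs):
    """Evaluate a LUT: 'mask' is the truth table, inputs[0] is the least significant index bit."""
    index = sum(bit << i for i, bit in enumerate(inputs))
    return (mask >> index) & 1


def fractured_eval(mask64, inputs5):
    """Use one 64-bit (6-input) table as two 5-input LUTs sharing the same inputs."""
    low_half = mask64 & 0xFFFFFFFF   # truth table of the first 5-input function
    high_half = mask64 >> 32         # truth table of the second 5-input function
    return lut_eval(low_half, inputs5), lut_eval(high_half, inputs5)


# Example contents: low half implements a 5-input AND, high half a 5-input XOR (parity).
AND5 = 1 << 31
XOR5 = sum(1 << i for i in range(32) if bin(i).count("1") % 2)
mask = (XOR5 << 32) | AND5

print(fractured_eval(mask, [1, 1, 1, 1, 1]))  # both functions evaluated from one table
print(lut_eval(mask, [1, 0, 0, 0, 0, 1]))     # same table used as a single 6-input LUT
```

In this model a resource report that counts the physical table once would show one element where a non-fracturable architecture would show two, which is the effect being discussed.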

14:06 <lekernel> who, are they shipping cyclone 5 now, or is it the same vaporware as xilinx 7 series?

14:06 <terpstra> argh

14:06 <terpstra> i can't compile for arria2 under linux

14:06 <terpstra> i forgot

14:06 <lekernel> so, you see, software problems with altera too :)

14:07 <terpstra> the stupid parallel port dongle only works under windows :P

14:07 <terpstra> well, if we'd bought linux-friendly licences...

14:07 <terpstra> it will let me target a generic arria2

14:08 <terpstra> but that won't give an accurate fill %age and it picks the smallest that works

14:08 <terpstra> (i don't want to reboot)

14:08 <lekernel> kk never mind

14:08 <terpstra> ; Family                        ; Arria II GX ;

14:08 <terpstra> ; Met timing requirements       ; N/A         ;

14:08 <terpstra> ; Logic utilization             ; N/A         ;

14:08 <terpstra> ;     Combinational ALUTs       ; 1,805       ;

14:08 <terpstra> ;     Memory ALUTs              ; 0           ;

14:08 <terpstra> ;     Dedicated logic registers ; 1,650       ;

14:08 <terpstra> ; Total registers               ; 1650        ;

14:08 <terpstra> ; Total pins                    ; 4           ;

14:08 <terpstra> ; Total virtual pins            ; 0           ;

14:08 <terpstra> ; Total block memory bits       ; 126,976     ;

14:09 <terpstra> here it talks about ALUTs instead of logic elements

14:11 <lekernel> seems detailed here: http://www.altera.com/literature/wp/wpstxiiple.pdf

14:12 <terpstra> ... lol at figure 1

14:14 <terpstra> in my design, i use a full crossbar interconnect

14:14 <terpstra> so their little example showing a 2* savings is somewhat relevant

14:14 <terpstra> but that's not the majority of the used area ...

14:15 <lekernel> "The benchmark comparison uses 80 real customer designs." ...does the Altera software includes, like the Xilinx one, a mandatory phone home "feature" to gather those statistics?

14:16 <terpstra> it's opt-in

14:16 <terpstra> but, yes

14:17 <lekernel> if you take the free of charge version of the xilinx tool, it's always enabled and you can't "opt out"

14:17 <terpstra> the web edition version of quartus is quite nice

14:17 <terpstra> i use it under linux even though i have a fully licenced windows version

14:17 <lekernel> well, the way to opt out anyway is to delete its curl library, so it isn't too hard

14:18 <terpstra> i just miss the signaltap2 logic analyzer (which to be fair is quite essential)

14:18 <terpstra> and synthesis to the higher end fpgas

14:18 <lekernel> yeah... one should design an open replacement to signaltap/chipscope

14:18 <lekernel> preferably platform independent

14:18 <terpstra> doesn't seem that hard a task really

14:18 <lekernel> no, it isn't

14:19 <terpstra> it could be done as a compiler pass in your llhdl

14:19 <lekernel> sure

14:19 <terpstra> just a bit of tooling of the llhdl to add hooks and a capture logic

14:19 <lekernel> well, in LLHDL you can write the IR to files and manipulate that in custom applications

14:19 <lekernel> you won't even need to touch the core code, just develop an independent utility

14:20 <terpstra> i doubt that would work as cleanly as you envision

14:20 <terpstra> you need access to the original symbol names and hierarhcy

14:20 <terpstra> otherwise the user won't be able to say what signals he wants

14:20 <lekernel> those are accessible from the "external" flow

14:20 <terpstra> (i assume your llhdl will perform optimizations which rename the signals during their work)

14:20 <lekernel> only in the last passes

14:20 <lekernel> but you can hook before aht

14:20 <lekernel> that

14:21 <terpstra> regardless, not a difficult task

14:21 <lekernel> the first passes compile Verilog (and maybe VHDL) without any optimization

14:21 <terpstra> just some work

14:21 <lekernel> and directly write LLDHL interchange files that you could pass to linker, optimizers and mappers

14:21 <lekernel> or fancy things like logic analyzer insertion utilities

14:22 <terpstra> anyway, the moral of that document you sent me seems to be this:

14:22 <terpstra> stratix 2 ALUTs are bigger than stratix 1 / cyclone 3 LEs

14:22 <terpstra> so it's apples and oranges
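
For reference, the two reports pasted above can be put side by side with a few lines of arithmetic. This only restates the numbers from the log; the Arria II build targeted a generic device, so the figures are indicative at best, and the variable names are invented for this note.

```python
# Figures copied from the two Quartus reports in the log.
cyclone3_combinational = 3330   # "Total combinational functions" (Cyclone III LEs)
arria2_aluts = 1805             # "Combinational ALUTs" (Arria II GX)
cyclone3_registers = 1650
arria2_registers = 1650

ratio = cyclone3_combinational / arria2_aluts
print(f"Cyclone III functions per Arria II ALUT: {ratio:.2f}")
print(f"Registers identical: {cyclone3_registers == arria2_registers}")
```

The register count is the same in both builds, so the difference is entirely in how the combinational logic is counted, which fits the "apples and oranges" conclusion.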

14:25 <terpstra> don't suppose you know where the openocd.cfg for the lm32 is?

14:27 <lekernel> iirc there were some threads on the mailing list about that some months ago

14:27 <lekernel> but i'm not sure

14:33 <larsc> terpstra: thats what i have in my milkymist openocd.cfg: http://pastebin.com/ghq1wVKU

14:34 <terpstra> :-/

14:34 <terpstra> it claims to support usb-blaster

14:34 <terpstra> but doesn't seem to work

14:35 <lekernel> larsc: does openocd work for you?

14:35 <lekernel> (I have never tried it)

14:35 <larsc> lekernel: it was unstable the last time i tried it

14:36 <larsc> too unstable to be useful

14:36 <terpstra> my tcl script is stable, but doesn't talk to gdb :/

14:37 <terpstra> fairly certain that my problem is that openocd doesn't support usb-blaster properly, despite its claim to the contrary

14:37 <terpstra> Error: IR capture error at bit 2, saw 0x3FFFFFFFFFFFFD55 not 0x...3

14:38 <larsc> terpstra: you enabled it in ./configure and changed the interface in the config file?

14:38 <terpstra> ofc

14:39 <terpstra> got it

14:39 <terpstra> Info : JTAG tap: ep3c25f324.tap tap/device found: 0x020f30dd (mfg: 0x06e, part: 0x20f3, ver: 0x0)

14:39 <terpstra> :)

14:40 <larsc> nice
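
The fields in that OpenOCD message follow the standard JTAG IDCODE layout (bit 0 is always 1, bits 11:1 are the manufacturer ID, bits 27:12 the part number, bits 31:28 the version). A minimal sketch of the decoding, with `decode_idcode` being a name chosen for this example rather than anything from OpenOCD:

```python
def decode_idcode(idcode):
    """Split a 32-bit JTAG IDCODE into its standard fields."""
    return {
        "marker": idcode & 0x1,           # fixed to 1 for a valid IDCODE
        "mfg": (idcode >> 1) & 0x7FF,     # 11-bit manufacturer ID
        "part": (idcode >> 12) & 0xFFFF,  # 16-bit part number
        "ver": (idcode >> 28) & 0xF,      # 4-bit version
    }


fields = decode_idcode(0x020F30DD)
print({name: hex(value) for name, value in fields.items()})
```

Applied to the value in the log, this gives the same mfg, part and ver fields that OpenOCD printed.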

14:42 <terpstra> aha :)

14:42 <terpstra>         if (strcmp(variant, "xc6s") == 0)

14:42 <terpstra>         {

14:42 <terpstra> needs some generalization ;)

14:44 <lekernel> terpstra: feel free to send contribs to the same ML ;)

14:44 <terpstra> don't have it working yet

14:44 <terpstra> it sees the jtag chain, but not the lm32

14:45 <terpstra> but will get there :)

14:45 <terpstra> i also know my jtag chain is perfectly stable as i've loaded and read multiple MBs of firmware with my tcl tool

14:45 <terpstra> so whatever problems i have are just openocd specific, which should help narrow it down

15:15 <lekernel> terpstra: how do you make 92+ uranium?!

15:15 <terpstra> don't ask me stuff like that

15:15 <terpstra> i just work here!

15:15 <lekernel> well, you happen to make this stuff at GSI :)

15:16 <terpstra> yes

15:16 <terpstra> the physicists make the machine go

15:16 <lekernel> the GSI website says "Fully stripped U92+ ions from the heavy-ion synchrotron SIS"

15:16 <terpstra> they make it go round and roung

15:16 <terpstra> round*

15:16 <terpstra> vroom vroom

15:16 <lekernel> but from what I know about ion sources, it's pretty difficult to strip more than a dozen electrons off an atom

15:17 <lekernel> so 92... wow

15:17 <terpstra> (they strip the ions by hitting them through some sort of thin metal i believe)

15:17 <terpstra> and they do it multiple times

15:17 <lekernel> mh, maybe... must be very small quantities then

15:17 <terpstra> well, they start with very big quantities

15:17 <terpstra> and they get smaller and smaller ;)

15:21 <terpstra> hm

15:21 <terpstra> i've noticed a strange behaviour with the lm32

15:21 <terpstra> issuing a 'break' over jtag has no affect until you do either a uart recv or memory read

15:21 <terpstra> wtf :P

15:22 <terpstra> my led blink keeps running until then

16:37 <lekernel> http://www.efyexpo.com/category/programmesactivities/design-engineers-conference/speakers/

16:37 <lekernel> Marcus Erlandsson, Chief Technology Officer and Founder, OpenCores

16:37 <lekernel> Abstract: Open-source hardware IP-cores is today the only efficient way of developing the next generation of products. A problem today with product development is that when product complexity increases, the verification workload increases exponentially, which leads to significant higher development costs. Open-source hardware enables companies to significantly reduce verification costs and therefore allow a more cost-effective development method.

16:37 <lekernel> oh my...

16:39 <lekernel> there are more bugs in a opencores project than in the average rainforest, and they invite _him_ to talk about _verification_

16:39 <lekernel> omg

16:40 <lekernel> oh, and Arduino "the father of Open Source hardware"

16:40 <lekernel> ok, got it

16:47 <lekernel> I wonder what percentage of that opencores bullshit talk is delusions and what is outright lies to please some investor or (poor) ORSoC customer

16:49 <wpwrak> lekernel: maybe you should speak at such conferences, too ? ;-)

16:49 <lekernel> I don't know

16:50 <kristianpaul> i agree with wpwrak

16:51 <wpwrak> show people that there's life beyond the bovine feces :)

16:54 <lekernel> do they want to know? everyone feels good about blinking LEDs...

16:57 <lekernel> the only way to pull that off is to make big lectures at large/central conferences, preferably where there are lots of journalists and well-known people

16:58 <lekernel> otherwise you're just gesticulating

16:59 <tuxbrain_away> lekernel: If it wasn't for you and for your work I had still part of this bovine feces.... well in fact I'm still there but at least you let me know there are higher grounds out there to walktrough time to time :)

17:08 <kristianpaul> bovine = moo ? ;-)

17:16 <scrts> hm, there are workng cores on opencores!

17:16 <scrts> I mean exists

17:16 <scrts> :))

17:19 <lekernel> yeah, some 0.1%, sadly not including their flagship openrisc

17:24 <lekernel> how many products can you count that reliably use opencores IPs? except the cases where ORSoC claims one is used for "a large customer" which is never named?

17:27 <lekernel> that, and the "tracking everything" e.g. compulsory registration - about which they have double standards, they whine because lattice does the same - "because download statistics are essential to build credibility" (well, they should quit FPGAs and do lolcats and pr0n then)

17:29 <kristianpaul> lekernel: usrp2 i think uses zpu, not sure is that is on opencores

17:29 <lekernel> zpu is on opencores, and zpu is crap

17:29 <lekernel> well, it's not an "official" opencores project

17:29 <kristianpaul> that 0.1% is your sdram controller? ;-)

17:30 <kristianpaul> I saw it on opencores last tiem..

17:30 <kristianpaul> also navre

17:30 <lekernel> nah, there are also some other decent designs there - aeMB for example

17:30 <kristianpaul> who else

17:30 <kristianpaul> ?

17:30 <terpstra> hey, jumping in here

17:30 <terpstra> could you be more specific in your rant against openrisc and zpu :)

17:31 <lekernel> well, openrisc uses 3 times as many LUTs as LM32 for half the speed

17:31 <kristianpaul> ah, i remenber you are benchmarking those too, isnt terpstra ?

17:31 <terpstra> yes

17:32 <terpstra> i already more-or-less rejected openrisc as fatter than the leon3 with less functionality

17:32 <lekernel> and, last time I checked, the design contained latches

17:32 <terpstra> but thought maybe you lot had more to say

17:32 <lekernel> which are 1. surprising in a design I thought would be synchronous and 2. undocumented

17:32 <terpstra> some people at CERN really like the ZPU and i need more ammunition against it

17:33 <lekernel> so I guess they come from the usual beginner's HDL pitfall

17:33 <kristianpaul> you fint the right place for that ;-)

17:33 <kristianpaul> find**

17:33 <terpstra> found* ;)

17:33 <kristianpaul> oh, sorry

17:33 <lekernel> I concede the ZPU isn't as crappy as OpenRISC, it's only problem is it's ridiculously slow

17:33 <kristianpaul> what are doing with zpu at CERN?

17:33 <terpstra> same thing we are looking at the LM32 for

17:33 <kristianpaul> oh

17:34 <terpstra> use to run DHCP/ARP/PTP inside a timing controller to coordinate devices

17:34 <kristianpaul> zpu have his own ethernet controller?

17:34 <terpstra> they like the ZPU because it's small. and it is. about 1/3rd the LM32

17:34 <terpstra> nope.

17:34 <lekernel> that's not counting the microcode ROM

17:34 <terpstra> the ethernet core will be a custom wishbone device from us

17:35 <lekernel> though it can be OK in a FPGA if you have spare block RAMs you wouldn't use otherwise

17:35 <terpstra> the microcode is less than 4k i think?

17:35 <terpstra> using a LM32 means you have fat icache and dcache

17:35 <terpstra> which clock in at more

17:35 <lekernel> 4K is already a lot of area in an ASIC

17:36 <lekernel> you can disable the LM32 caches, can't you?

17:36 <terpstra> in theory

17:36 <lekernel> and it still would be faster than ZPU

17:36 <terpstra> in practice, without icache you have no JTAG

17:36 <scrts> lekernel, regarding question about used cores from opencores: my collegues use i2c core from opencores in our company, he said it works :)

17:36 <terpstra> speed is clearly in favour of the LM32... but we don't need soooo much speed from something just running dhcp/arp/etc

17:37 <lekernel> terpstra: there's also the navre (AVR core) that I made for the USB controller

17:37 <terpstra> i guess i could fix jtag for the lm32 without icache. it does seem strange that it doesn't have the hooks in the instruction_unit.v

17:37 <terpstra> i have that in my list... i rejected it, because ...

17:38 <lekernel> iirc it's some 1k Spartan-6 LUTs

17:39 <terpstra> things i listed against navre: only 1 committer/he could die (i guess that's you), self-reported status: beta, # of pages documentation: 0, tested # of FPGAs: 2, debug support/JTAG: no, no wishbone bus

17:40 <lekernel> well, yes. I only wanted those damn USB ports to work, not make a softcore

17:40 <lekernel> unfortunately they all were unusable

17:40 <terpstra> it was 1k LUTs on an arria2

17:40 <terpstra> for the navre

17:40 <terpstra> compared to 2-3k for the LM32

17:40 <kristianpaul> terpstra: how fast is zpu in your cyclone?

17:40 <terpstra> and 500 for the ZPU

17:41 <terpstra> my table must be wrong

17:41 <terpstra> it says 300MHz

17:41 <terpstra> but i don't believe that

17:41 <lekernel> seriously ZPU is super-slow. but if you can live with that slowness, good for you

17:41 <kristianpaul> what? ;)

17:41 <lekernel> oh, it's perhaps really 300MHz

17:41 <terpstra> the instructions do almost nothing tho

17:41 <lekernel> but ZPU takes some 50-100 cycles to do what another processor would do in one

17:41 <terpstra> this was all timed on an arria2

17:41 <terpstra> where LM32 is 175

17:42 <terpstra> so it's 'possible'

17:42 <lekernel> yeah, I'm not surprised

17:42 <lekernel> in terms of clock speed, ZPU was also very fast for me

17:42 <terpstra> i haven't done an in-depth test of the ZPU yet tho

17:42 <terpstra> so take those #s with a grain of saly

17:42 <terpstra> salt*

17:42 <terpstra> the leon3 is pretty nice too

17:42 <lekernel> so your 300MHz ZPU might perform like a 3MHz LM32

17:43 <terpstra> the code is hideous tho

17:43 <lekernel> maybe even worse, depending on code

17:43 <terpstra> i know the ZPU is much slower than the LM32

17:43 <terpstra> but is that the only bad thing i can say?

17:43 <terpstra> i've seen several implementations of the ZPU floating around

17:43 <terpstra> do any of them have JTAG?

17:43 <terpstra> i didn't find any

17:43 <lekernel> if you do the speed/LUT ratio, the ZPU doesn't look that good
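
The speed/LUT argument can be made concrete with the rough numbers quoted in this conversation. This is a back-of-the-envelope sketch only: terpstra himself doubts the 300 MHz figure, the 50-100 cycle penalty is lekernel's estimate, and treating the LM32 as the processor that does the same work in one cycle is an assumption made here for the comparison.

```python
def effective_mhz(clock_mhz, cycles_per_op):
    """Clock rate divided by the cycles spent per unit of useful work."""
    return clock_mhz / cycles_per_op


# Figures quoted in the log (Arria II).
ZPU_CLOCK_MHZ, ZPU_LUTS = 300, 500
LM32_CLOCK_MHZ, LM32_LUTS_MAX = 175, 3000  # upper end of the "2-3k" range

# Assumption: the LM32 stands in for the "other processor" needing one cycle.
zpu_best = effective_mhz(ZPU_CLOCK_MHZ, 50)
zpu_worst = effective_mhz(ZPU_CLOCK_MHZ, 100)
lm32 = effective_mhz(LM32_CLOCK_MHZ, 1)

zpu_best_per_lut = zpu_best / ZPU_LUTS
lm32_worst_per_lut = lm32 / LM32_LUTS_MAX

print(f"ZPU : {zpu_worst:.0f}-{zpu_best:.0f} MHz-equivalent, "
      f"{zpu_best_per_lut:.4f} per LUT at best")
print(f"LM32: {lm32:.0f} MHz-equivalent, "
      f"{lm32_worst_per_lut:.4f} per LUT at worst")
```

Under these assumptions the 100-cycle case reproduces the "3MHz LM32" estimate, and the LM32 still comes out ahead per LUT even when charged the full 3k LUTs.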

17:44 <terpstra> my table is at the bottom of this page btw:

17:44 <terpstra> http://www.ohwr.org/projects/white-rabbit/wiki/EmbeddedCPU

17:44 <terpstra> it's not 100% up-to-date tho

17:44 <lekernel> ha, funny I posted the OHWR link on the mailing list a few days ago

17:44 <kristianpaul> yeap

17:44 <rjeffries> is there a qi-bot log for this channel? URL pls?

17:45 <lekernel> rjeffries: it's in the topic

17:45 <kristianpaul> rjeffries: en.qi-hardware.com/mmlogs

17:45 <rjeffries> thx

17:45 <rjeffries> in smuxi the topic seems hidden

17:45 <lekernel> terpstra: and no, I didn't find any ZPU with JTAG

17:45 <lekernel> afaik only LM32 and LEON3 have it

17:46 <terpstra> the 'documentation' made vague promises about jtag, is all

17:46 <terpstra> eh?

17:46 <terpstra> openrisc too

17:46 <kristianpaul> terpstra: typo latticemico23

17:46 <lekernel> openrisc is a no-go

17:46 <terpstra> at least via simulation

17:46 <lekernel> do you really trust something from people who have undocumented latches in their design?

17:47 <kristianpaul> I still wonder how leon3 can be shiped to a spacecraft

17:47 <lekernel> kristianpaul: leon3 is a good design

17:47 <terpstra> the code is pretty awful, but i guess very well tested

17:47 <terpstra> and they have a redundant version

17:47 <lekernel> yeah, except the coding style :)

17:47 <terpstra> where any single bit error in the chip is corrected

17:47 <terpstra> that's something no other softcore can do afaik

17:47 <kristianpaul> redundtant is a good point :-)

17:48 <terpstra> so cosmic rays != dead spaceship

17:48 <terpstra> plus it has a fully working MMU

17:48 <terpstra> that's the big downside to the LM32

17:48 <lekernel> yeah... only softcore to date which have it iirc

17:48 <terpstra> openrisc has MMU

17:48 <lekernel> and please don't tell me about openrisc, I even managed to get the orsoc people to admit it doesn't work

17:48 <kristianpaul> are you planing run linux on zpu?

17:48 <terpstra> (and several closed source ones)

17:48 <scrts> microblaze? nios? :)

17:49 <terpstra> nios is closed ;)

17:49 <terpstra> kristianpaul, no

17:49 <terpstra> i hope!

17:49 <terpstra> at least you lot have gotten uclinux to work already on lm32

17:49 <terpstra> so it's less risky even with our own SoC than the ZPU in that regard

17:49 <scrts> btw, noone ever tried to add MMU tu LM32?

17:50 <terpstra> not as far as i know

17:50 <terpstra> i actually think you could probably make a wishbone MMU adapter ;)

17:50 <lekernel> terpstra: here's a more accurate openrisc project page: http://www.beyondsemi.com/page/products/processor_cores/openrisc

17:50 <lekernel> made by Damjan Lampret, the original Openrisc developer

17:51 <terpstra> lekernel, is that for real?

17:51 <lekernel> of course it is :)

17:51 <terpstra> how is it still listed as the 'flagship product' of opencores??

17:51 <kristianpaul> lekernel: you mentioned a ppc softcore some time ago, what is it?

17:51 <lekernel> basically, Damjan took the personal challenge to build a CPU, and was somehow sucessful at it

17:51 <lekernel> that became the openrisc, but it was a "draft" design

17:52 <lekernel> then he gave up his open source hardware activities and went on to found Beyond Semi, with a complete redesign of the CPU

17:52 <kristianpaul> terpstra: (flagship product) asic proven, already made baords for it,i bet some marketing too

17:52 <lekernel> since then, Opencores and Openrisc have been taken over by ORSoC, but they barely did anything to improve OpenRISC

17:52 <scrts> kristianpaul that ppc softcore, isn't it LEON3?

17:52 <scrts> or LEON4 now afaik

17:52 <terpstra> at any rate, openrisc is pretty much inferior to the leon3 in every way

17:53 <lekernel> so that's what it is, a zombie draft design

17:53 <terpstra> so it was never a serious contender

17:53 <kristianpaul> scrts:i want to confirm ;-)

17:53 <scrts> it is a confirm :)

17:53 <lekernel> http://www.lampret.com/

17:54 <lekernel> you can find some untold Opencores stories now and then...

17:55 <lekernel> yup. leon3 is a lot more serious than openrisc. not only technically-wise

17:55 <kristianpaul> I guess moxiecpu is not ready for production yet..

17:56 <lekernel> moxie looks promising, but it's not finished yet

17:57 <terpstra> what's this about moxie saying WB doesn't do pipelined reads/writes?

17:57 <terpstra> B.4 supports that just fine

17:58 <lekernel> B.4 was released after Moxie development began I think

17:58 <terpstra> not so hard to refactor

17:58 <terpstra> our LM32 speaks B.4 :)

17:58 <lekernel> dunno. maybe lack of time. moxie has been moving rather slowly

17:59 <lekernel> ah? interesting :)

17:59 <terpstra> what i don't much like about B.4 is you can't tell ahead of time if it will pay off to do a burst transfer

17:59 <terpstra> i read someone's thesis in the milkymist project and they talk about their SDRAM controller always doing bursts

17:59 <terpstra> and that's fine

17:59 <terpstra> but it would be nice if you had some warning still in pipelined mode that sequential access is coming up

18:00 <terpstra> so as not to waste the work

18:00 <lekernel> well, for the SDRAM, going out of burst mode requires a lengthy reload of the mode register

18:00 <lekernel> iirc you can't do isolated single-word transfers

18:01 <terpstra> sure

18:01 <terpstra> but for SRAM, say, would be nice.

18:01 <lekernel> and newer SDRAM chips do not support non-burst mode

18:01 <terpstra> anyway, not a deal breaker

18:01 <terpstra> we are using WB4 and that's not up to me

18:01 <lekernel> also, doing those optimizations need extra logic. need to determine if it's worth the deal :)

18:02 <lekernel> (and development+debug time too)

18:03 <terpstra> yup

18:04 <lekernel> there is something that allows you to explore this design space in high level simulations but for CPUs only

18:04 <lekernel> http://www.virtutech.com/products/ I think

18:04 <lekernel> (it's proprietary w/license fee)

18:05 <lekernel> but you can simulate some software and tune the number of CPU pipeline stages, enable/disable out of order execution, make the CPU superscalar with different issue widths, etc.

18:05 <lekernel> and it would tell you in minutes how fast your software would go

18:05 <terpstra> nios has such configurability as well

18:06 <lekernel> yup. but you need to go through a lengthy logic synthesis and, for unimplemented features, weeks or more of development time

18:07 <terpstra> so, i guess i have finally figured out why openocd doesn't work for me

18:07 <terpstra> it uses the usb blaster jtag directly

18:08 <terpstra> to access jtag devices in the core logic you need to go through the 'jtag sld hub' indirection

18:08 <terpstra> and openocd doesn't know how

18:08 <terpstra> guess i have to teach it. :-/

18:11 <lekernel> scrts: we _might_ participate in GSoC this year. it could _maybe_ be a good opportunity to get that LM32 MMU done

18:13 <roh> eeh. wasnt LEON softcores sparc?

18:13 <scrts> heh, would be cool :)

18:13 <terpstra> roh, yes

18:13 <terpstra> leon* is sparc compatible

18:13 <terpstra> and thus rather fat ;)

18:14 <roh> jap. http://en.wikipedia.org/wiki/LEON

18:14 <terpstra> it takes just hours to get linux running on your fpga tho

18:14 <terpstra> which is pretty nice

18:15 <lekernel> I never managed to get it to work... the LEON3/GRLIB code breaks Xst all the time

18:15 <lekernel> the answer was "use synplify"

18:15 <terpstra> hmm

18:15 <terpstra> well, we only have altera chips

18:15 <terpstra> they all works ;)

18:16 <terpstra> lekernel, please don't commit my jtag patch to your tree just yet

18:16 <terpstra> i may need to tweak it a tad bit for openocd yet

18:16 <lekernel> kk

18:19 <terpstra> it's a real shame there are only 2 unused opcodes in the LM32

18:20 <terpstra> that one of them is 42 softens the blow, i suppose

18:21 <lekernel> yup... maybe they did that so the instructions can be decoded with few levels of logic

18:21 <terpstra> it's definitely why

18:22 <terpstra> just a shame that you can't add much to it

18:22 <terpstra> you guys hooked up some odd vector floating point processor to it, yeah?

18:22 <lekernel> yeah, but it's only using DMA buffers and CSRs

18:22 <terpstra> ah

18:22 <lekernel> no special CPU interface

18:23 <terpstra> i want to add 'branch both ways' :)

18:25 <kristianpaul> oh, Etherbone just uses UDP for sending data..

18:26 <terpstra> kristianpaul, yup

18:27 <terpstra> we assume a reliable medium

18:27 <terpstra> (which we will have)

18:27 <kristianpaul> So what you to dont lost packages?

18:27 <kristianpaul> ah, assume..

18:27 <terpstra> we have FEC at the ethernet layer

18:28 <kristianpaul> You made your own swcihes/hubs ?

18:28 <terpstra> yes

18:28 <kristianpaul> What is FEC?

18:28 <terpstra> forward error correction

18:28 <kristianpaul> k

18:28 <terpstra> http://en.wikipedia.org/wiki/Forward_error_correction
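
As a toy illustration of forward error correction in general: the sender adds redundant bits so the receiver can repair a damaged word without asking for a retransmission. The log does not say which code the White Rabbit Ethernet layer uses, so the textbook Hamming(7,4) code below is purely an assumption chosen for brevity; it corrects any single flipped bit in a 7-bit codeword.

```python
def encode(data):
    """Hamming(7,4): 4 data bits -> 7-bit codeword laid out as p1 p2 d1 p3 d2 d3 d4."""
    d1, d2, d3, d4 = data
    p1 = d1 ^ d2 ^ d4
    p2 = d1 ^ d3 ^ d4
    p3 = d2 ^ d3 ^ d4
    return [p1, p2, d1, p3, d2, d3, d4]


def decode(codeword):
    """Correct up to one flipped bit, then return the 4 data bits."""
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    error_position = s1 + 2 * s2 + 4 * s3  # 1-based position, 0 means no error seen
    if error_position:
        c[error_position - 1] ^= 1
    return [c[2], c[4], c[5], c[6]]


sent = encode([1, 0, 1, 1])
received = list(sent)
received[4] ^= 1  # simulate one bit corrupted on the wire
print(decode(received) == [1, 0, 1, 1])
```

The price is bandwidth (7 bits on the wire for 4 bits of payload in this toy code), which is the usual trade for not needing retransmissions.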

18:29 <terpstra> the main reason for our custom network cards will be to get the clocks phase aligned to much better than 8ns

18:29 <lekernel> terpstra: what is your system doing, exactly?

18:29 <terpstra> so we have distributed and precise timing

18:30 <kristianpaul> Your work sounds amazing :-)

18:30 <terpstra> it's not really my work

18:30 <terpstra> it's the groups work

18:30 <terpstra> lekernel, we need to control devices that direct the beam

18:30 <terpstra> light moves quite fast

18:30 <kristianpaul> ah, WhiteRabbit is the Swich and the zpu will go in there?

18:30 <terpstra> so we need the devices to be precisely coordinated

18:31 <terpstra> kristianpaul, correct

18:31 <kristianpaul> groups work, yes

18:31 <terpstra> also in the endpoints

18:31 <terpstra> the ZPU is just what the CERN guys have been messing around with

18:31 <kristianpaul> s/zpu/lm32 ;-)

18:31 <roh> too bad the stuff from maintech is closed http://www.maintech.de/produkte/ip-cores/

18:31 <roh> but well.. i guess thats their cashcow

18:31 <terpstra> that would be pretty cool :)

18:32 <terpstra> before a really big open hardware revolution can begin---where vendors end up opensourcing b/c they used quality opensource IPcores ... we need a gcc for hardware.

18:32 <terpstra> looks at lekernel.

18:33 <roh> gcc? naaah. someting free and open.. just not so bitrotten ;) better compare it to clang

18:33 <terpstra> gcc's bitrot is the proof of its success ;)

18:34 <roh> and a problem for everybody wanting to do fancy experiments and develop new stuff for compilers

18:34 <terpstra> sure, llvm is great

18:34 <terpstra> but if there had not been gcc, i doubt there'd have been llvm

18:35 <roh> a friend of mine will travel to nasa soon.. and give a talk at JPL about his 'theorem proover'

18:35 <terpstra> also know as an ML compiler? ;)

18:35 <terpstra> known*

18:36 <roh> sure. my guess is gcc will stay a macroassembler, while llvm will be the future of c compilers

18:36 <roh> and a backend for lots of other languages

18:36 <terpstra> at this point, that's a pretty safe "guess" to make :)

18:37 <kristianpaul> all altera?, "Virtex 6 GTX simulation (Nikhef)." :-)

18:37 <terpstra> cern uses xilinx

18:37 <terpstra> we use altera

18:37 <kristianpaul> haha

18:37 <kristianpaul> nice

18:37 <terpstra> so our stuff has to work on both

18:39 <lekernel> it'll be used at CERN as well?

18:39 <terpstra> yes

18:39 <terpstra> the LHC

18:40 <lekernel> mh? I thought they were done with the design

18:40 <terpstra> to be honest, i'm not entirely clear why they need it either

18:40 <terpstra> we're still in the design phase for the new accelerator here

18:40 <terpstra> i've even heard that want to use it for their RF devices

18:40 <terpstra> which seems quite perplexing!

18:44 <kristianpaul> reads http://silicone.homelinux.org/category/electronics/open-source-cpu/

18:45 <terpstra> ciao

18:45 <kristianpaul> chao

18:49 <lekernel> bye, thanks for passing by!

18:49 <kristianpaul> yeah :-)

18:52 <kristianpaul> imagines if *RF* devices at CERN have a ohwr like project

18:53 <lekernel> kristianpaul: there is a surprising amount of this stuff being published. only you need to go looking for it

18:55 <kristianpaul> lekernel: it seems, i still amazed with ohwr

19:11 <antgreen> lekernel: re: "moxie looks promising, but it's not finished yet". Had to take long break. Back at it this week!

19:13 <antgreen> as soon as I wrap up the libffi 3.0.10 release.

20:05 <antgreen> what is Antares?

20:06 <antgreen> oh, a mentor spin-off

20:07 <lekernel> used to

20:07 <lekernel> but it no longer exists... this refers to https://github.com/lekernel/antares

20:08 <antgreen> you are evicting xilinx's tools from your workflow?

20:08 <lekernel> yes

20:09 <lekernel> but it will take time

20:09 <antgreen> everything worthwhile takes time!

20:09 <antgreen> cool stuff.

20:49 <Fallenou> Applications for mentoring organization for gsoc are now being accepted

21:24 <lekernel> yup. Jon is taking care of it this year

21:25 <lekernel> after last year's experience I'm not that much into applying myself

22:18 <mwalle> terpstra: (debug rom) bascially thats a reverse engineered version of lattice original rom, with some tweaks regarding its size and added 16 bit and 32 bit access

22:21 <mwalle> terpstra: (openocd) my lm32 port is wip (at least it was in progress .. ;) it should be at least possible to set some breakpoints and stepping

22:26 <mwalle> terpstra: BREAK is sent as a JTAG DP (lattice calls it debug protocol, the real jtag commands, not the JTAG UART), isnt it?

22:27 <mwalle> so i guess it should jump to DEBA right after issuing the command

22:31 <mwalle> terpstra: (jtag core) iirc i just coded the (xilinx) jtag core according to xilinx schematics in some user guide. i dont know for sure if capture and reset are synchronous to tck for the xilinx BSCAN cell

22:34 <mwalle> gn8 :)