#milkymist on 2011-09-22 — irc logs at freenode.irclog.whitequark.org

01:31 <wolfspraul> hey btw, I just did a little counting

01:32 <wolfspraul> the rc3 yield as of right now is 49 100% perfect units

01:32 <wolfspraul> 49 out of 90

01:32 <wolfspraul> the original goal was 80 (out of 90)

01:34 <wolfspraul> adam takes a few days off from tomorrow (friday) to tuesday, and then he's back at bringing this up more

01:34 <wolfspraul> next target: 60

01:34 <wolfspraul> :-)

01:35 <wolfspraul> who knows maybe in the end we can get close to the original 80... too early to tell now, have to wait and see which troublemakers remain at the end and what analysis shows for them

01:38 <wolfspraul> aw: thanks a lot for the hard and persistent work!

01:38 <wolfspraul> http://en.qi-hardware.com/wiki/Milkymist_One_run_3_schedule#Test_Results

01:40 <aw> wolfspraul, you are welcome, i ought to. will back to continue rest.

01:42 <wolfspraul> he :-)

01:42 <wolfspraul> werner would love that typo

01:42 <wolfspraul> aw: you probably mean "back to continue test" not "back to continue rest"

01:42 <wolfspraul> but if you want to rest a little - GO AHEAD! :-)

01:44 <aw> wolfspraul, oah~ yes, typed wrong...is "continue to test"..:-)

03:04 <wpwrak> good typo indeed :)

03:05 <wpwrak> btw, good news: http://downloads.qi-hardware.com/people/werner/m1/perf/chart-20110921b

03:06 <wpwrak> i now made the test for equivalent output more comprehensive/strict. and the latest version(s) produce perfect matches for all patches we have.

03:06 <wolfspraul> wow

03:08 <wpwrak> the generated code is now a bit less efficient than what i had on monday. so now there are four patches that get longer even with optimization. all the rest is about the same and some significantly more compact.

03:10 <wpwrak> without optimization, it still gets a bit worse. in all cases, the new scheduler is a lot faster than the original one. with profiling, no optimization (-O), on x86-64, and including parsing and all other compilation steps, on average about 10x faster.

03:11 <kristianpaul> wow indeed

03:12 <wpwrak> the next step is to build flickernoise and see how it works in its native context. i would expect the new scheduler to optimize (-O) better than the old one, because all frequently traveled code paths are in the same compilation unit and there are no terms greater than O(n^2) in any of the processing.

03:14 <wpwrak> and even O(n^2) would be a degenerate case. things like foo = <expression> and then gazillions of varN = fn(foo), i.e., everything becomes executable after the first operation

03:16 <wpwrak> the O(n^2) would hit the optimizer (the longest critical path first algorithm) hardest, because it always considers all available choices. (i could defang it with a merge sort, but that's probably excessive)
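
A rough sketch of what "longest critical path first" list scheduling can look like. This is not the code from the new scheduler: the instruction names, the latencies and the one-issue-per-cycle model are all made up for illustration.

```python
from functools import lru_cache

# Hypothetical program: name -> (latency in cycles, names it depends on).
PROGRAM = {
    "a": (2, []),
    "b": (5, []),
    "c": (2, ["a"]),
    "d": (5, ["b"]),
    "e": (2, ["c", "d"]),
}


def critical_path_lengths(program):
    """Length of the longest latency chain starting at each instruction."""
    users = {name: [] for name in program}
    for name, (_, deps) in program.items():
        for dep in deps:
            users[dep].append(name)

    @lru_cache(maxsize=None)
    def length(name):
        latency = program[name][0]
        return latency + max((length(u) for u in users[name]), default=0)

    return {name: length(name) for name in program}


def schedule_lcpf(program):
    """Issue at most one instruction per cycle, always picking the ready
    instruction with the longest remaining critical path.
    Returns {name: issue cycle}."""
    prio = critical_path_lengths(program)
    issued = {}
    cycle = 0
    while len(issued) < len(program):
        ready = [
            name
            for name, (_, deps) in program.items()
            if name not in issued
            and all(d in issued and issued[d] + program[d][0] <= cycle for d in deps)
        ]
        if ready:
            # Looking at every available choice in every cycle is the part
            # that can degenerate to O(n^2).
            best = max(ready, key=lambda n: prio[n])
            issued[best] = cycle
        cycle += 1
    return issued


if __name__ == "__main__":
    print(critical_path_lengths(PROGRAM))
    print(schedule_lcpf(PROGRAM))
```

In this toy program "b" goes first because the chain b, d, e is the longest one, even though "a" comes earlier in the source.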

03:21 <wpwrak> of the bad things that may still happen, we have the relatively inefficient parser. it relies heavily on string compares and identifier lookups are always O(n*m) while they could be O(m) or at least O(m+C*log n). n would be the number of known identifiers, m the average length of an identifier, C a constant.
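
To illustrate the difference in lookup cost, here is a minimal sketch, not taken from the flickernoise parser. The identifier names and values are invented. The linear version compares the wanted name against up to n stored strings, the hashed version does one hash over the m characters plus (typically) a single compare.

```python
def lookup_linear(table, name):
    """table is a list of (identifier, value) pairs.
    Worst case: n string compares of up to m characters each, O(n*m)."""
    for ident, value in table:
        if ident == name:
            return value
    return None


def build_index(table):
    """One-time conversion into a hash table keyed by identifier."""
    return dict(table)


def lookup_hashed(index, name):
    """Expected O(m): hash the name once, then one compare on a hit."""
    return index.get(name)


if __name__ == "__main__":
    # Hypothetical identifiers, only for demonstration.
    table = [("var%d" % i, i) for i in range(1000)]
    index = build_index(table)
    print(lookup_linear(table, "var999"), lookup_hashed(index, "var999"))
```

Whether this matters in practice depends on how many identifiers a patch really has, which is the open question in the discussion above.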

03:23 <wpwrak> but we'll see. maybe it doesn't matter so much and the scheduler dominates.

03:33 <wpwrak> wolfspraul: in what categories do the remaining gremlins in M1 fall ? i know we have a few (2 ?) "NOR suddenly going completely mad" cases, which may be damaged chips, but what are the rest ?

03:39 <wolfspraul> haven't tried to categorize yet

03:40 <wolfspraul> sorry bbiab

03:45 <wpwrak> then i guess, while aw is taking his days off, wolfgang will be data mining in the mines of mordor :)

03:51 <larsc> one bug to break them all!

03:54 <wpwrak> that would be too easy ;-)

03:57 <wpwrak> sometimes i hate statistics. after realizing that the NOR corruption distribution seems to lack a very "late" corruption, my M1 promptly proceeded to have a run that takes forever to have a corruption. now in the 4th day ...

03:58 <kristianpaul> :/

03:58 <wpwrak> i somehow suspect that temperature does have an effect after all. the last days were relatively warm (today, the first day of spring brought excellent weather)

03:59 <kristianpaul> (warm) good !!

03:59 <larsc> time to make room in the fridge

04:00 <wpwrak> i might have to return to the idea of putting the M1 into the fridge ... well, i could also cool my guest room down to 18 C and move things over there

04:00 <wpwrak> larsc: yeah :)

04:01 <larsc> any suspicions what might cause the corruption?

04:01 <wpwrak> kristianpaul: of course, outdoors events in winter may be about the last places where you want a surprise NOR corruption :) at least they should be less common than hot indoors events

04:02 <wpwrak> i suspect it's some glitches caused by power ramping down unevenly

04:02 <kristianpaul> there is no winter here ;)

04:02 <kristianpaul> ramping?

04:02 <kristianpaul> leakage?

04:03 <wpwrak> e.g., I/O power still good but FPGA core power dropping out of range, and the core then acting crazy

04:03 <wpwrak> after cutting power

04:04 <wpwrak> adam documented this here: http://en.qi-hardware.com/wiki/File:M1rc2_powerOnOff_sequences_manuscript.jpg

04:04 <larsc> but it should be possible to powerdown the flash before the fpga, or not?

04:05 <wpwrak> with a little hardware change, we could hold it in reset, yes

04:05 <wpwrak> alas, the current reset circuit only does this reliably when powering up, not when powering down

04:10 <larsc> but there is nothing on the board which asserts reset globally if the voltage drops below a certain threshold?

04:13 <wpwrak> a) there's no global reset, and b) the voltage the reset monitors is 3V3, not the 5 V input. so if the input drops but one rail stays up longer than the others, things can get nasty

04:41 <wpwrak> http://milkymist.org/wiki/index.php?title=RTEMS_build_instructions "This page has been accessed 2,936 times." wow !

05:20 <wolfspraul> wpwrak: back. I think it's too early to categorize already. let's wait until more dust settles.

05:21 <wolfspraul> there are probably a lot more low hanging fruits in terms of boards that will pass all fixes and tests just fine

05:21 <wolfspraul> then we focus on analyzing the rest

05:22 <wpwrak> good. let's hope for the best then :)

05:22 <wpwrak> the ones with mad NOR will be tricky

05:23 <wpwrak> we should try to see if we can detect them reliably with a boundary scan

05:23 <wolfspraul> let's see what you find in the end

05:24 <wolfspraul> if you feel you can reliably reproduce the nor corruption, one angle is to manually rework a board with the planned rc4 design (if that is possible)

05:24 <wpwrak> no, i mean the ones where the NOR gets some oscillations. not the single-word corruption

05:24 <wolfspraul> including gate and 4.4v reset ic

05:24 <wolfspraul> if that proves that the nor corruption becomes unreproducible, at least we have an exit path

05:24 <wpwrak> yup. i think for NOR corruption the path is clear. just need to find the hidden variables :)

05:24 <wolfspraul> I'm not clear about the difference between nor oscillation and single-word corruption right now

05:25 <wolfspraul> waiting for dust to settle...

05:25 <wolfspraul> meanwhile I feel good about the rc3 we sell

05:25 <wolfspraul> Adam hasn't seen a single problem in 49 boards now

05:27 <wpwrak> (the two types of NOR corruptions) i think they're radically different. single-word corruption seems to affect all boards and the cause appears to be relatively benign (some glitch). the oscillation is signals that should be unrelated getting synchronized, with a strong smell of chip-level damage.

05:27 <wolfspraul> which chip? the nor chip?

05:27 <wolfspraul> then we just replace it :-)

05:27 <wpwrak> well, NOR problems actually. i'm not sure if we've actually seen corruption connected to the oscillation

05:27 <wpwrak> probably the FPGA

05:27 <wolfspraul> ah ok

05:28 <wpwrak> trickier :)

05:28 <wolfspraul> then replace that, or write off the board

05:28 <wolfspraul> xray etc will also still come

05:28 <wolfspraul> what do you mean with "some glitch"?

05:28 <wpwrak> let's hope the number of such boards stays around 2. i wouldn't really trust a board where the FPGA has been reworked.

05:28 <wolfspraul> you mean something fixable in software/soc ?

05:29 <wolfspraul> why not [fpga rework]

05:29 <wpwrak> according to joerg, the smt fab grade xray won't show anything. but we can of course try. maybe there are surprises.

05:29 <wolfspraul> the smt fab xray is only good for checking the soldering joints

05:29 <wpwrak> (49 good boards) yes, it seems we have a neat division between trouble boards and regular boards. that's encouraging.

05:30 <wolfspraul> it cannot see much inside a chip, unless a really big burn maybe

05:30 <wpwrak> (fpga rework) seems difficult -> good chance of creating new/more problems

05:31 <wolfspraul> nah, that's why we run tests afterwards

05:31 <wolfspraul> I can still use such boards, for example for internal units (like my own, xiangfu, sebastien), or for journalist review units, etc.

05:31 <wpwrak> which may or may not catch them :)

05:31 <wpwrak> of course, yes

05:31 <wolfspraul> then the test needs to be improved

05:31 <wolfspraul> I trust the test, by definition

05:32 <wpwrak> the problem with this testing approach is that you drive the bugs into corners your tests don't reach

05:32 <wolfspraul> well let's see

05:32 <wolfspraul> I have no problem reworking the fpga, if the smt fab (who would do it) thinks they can do it then why not

05:32 <wpwrak> i think the tests are still relatively narrowly focused. you'd need very broad coverage to catch truly exotic bugs that way.

05:33 <wolfspraul> maybe but everybody is testing

05:33 <wolfspraul> we don't need to be worried about ghosts or invisible things, I am not

05:33 <wolfspraul> let's just see

05:34 <wolfspraul> also the smt fab etc. have a lot of rework experience. they can give us advice what makes sense and what not.

05:34 <wpwrak> (some glitch) my current pet theory is that, when powering down, the FPGA core loses power before the I/O (3V3) does. then the core may act funny but since the I/O is still sufficiently powered, it would send out all the dying spasms of the core at full power. some of that may be write pulses to the NOR

05:37 <wpwrak> (everybody is testing) with tests designed for broad coverage you have a reasonable chance. but i don't think the current process is very strong there. i'm not saying that it's bad but that the tests are fairly narrow and probably high-level, too. e.g., some glitches may even get auto-corrected without you noticing.

05:37 <wolfspraul> that glitch would be fixed with the gate+4.4v reset ic solution planned for rc4, no?

05:38 <wpwrak> (4V4 reset) i would expect that, yes

05:39 <wpwrak> meanwhile, my suspicion grows that temperature has an effect, too. the last few days and nights have been quite warm and lo and behold, i've had a run that's been free of NOR corruption for > 3 days. and still counting. it could of course be coincidence, but ... :)

05:41 <wpwrak> i wonder if i already have enough data points for a frequency domain analysis ... a temperature pattern should show up as a ~24 hours cycle, too
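
A minimal sketch of how a ~24 hour cycle could be looked for, assuming the corruption events have first been binned into hourly counts. The data below is synthetic, not the measurements from the M1 runs, and the function name is invented. It evaluates a single Fourier component at a chosen period instead of computing a full spectrum.

```python
import cmath
import math


def power_at_period(samples, period):
    """Squared magnitude of the Fourier component with the given period
    (measured in samples), after removing the mean, divided by N."""
    mean = sum(samples) / len(samples)
    acc = sum(
        (x - mean) * cmath.exp(-2j * math.pi * t / period)
        for t, x in enumerate(samples)
    )
    return abs(acc) ** 2 / len(samples)


# Synthetic stand-in data: hourly event counts over 10 days with a daily swing.
HOURS = 24 * 10
counts = [5 + 3 * math.cos(2 * math.pi * t / 24) for t in range(HOURS)]

daily = power_at_period(counts, 24)   # component at the 24 hour period
other = power_at_period(counts, 7)    # arbitrary comparison period

if __name__ == "__main__":
    print("24 h:", daily, " 7 h:", other)
```

With real data the peak would be much less clean, and a few days of measurements give only a handful of 24 hour periods, so any result would be tentative.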

05:42 <wpwrak> anyway, i don't think we're in a great hurry with this (yet :). so i'm taking my time to collect data and improve my analysis methods. will be handy when a real emergency hits.

05:45 <wpwrak> ah, and after the latest firmware improvements, i haven't seen a single glitch of labsw. so i think my sw-based debouncing/denoising works well. the next hw revision will also have analog filters, for even better interference hardening.
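
For readers unfamiliar with software debouncing, a minimal counter-based sketch follows. It is not the labsw firmware. The class name, the threshold of 3 and the sample data are assumptions chosen for illustration: the reported state only changes after the raw input has disagreed with it for a number of consecutive samples.

```python
class Debouncer:
    """Accept a new input level only after `threshold` consecutive
    samples that differ from the current debounced state."""

    def __init__(self, threshold=3, initial=0):
        self.threshold = threshold
        self.state = initial
        self.count = 0

    def sample(self, raw):
        if raw == self.state:
            # Input agrees with the current state: forget any partial run.
            self.count = 0
        else:
            self.count += 1
            if self.count >= self.threshold:
                self.state = raw
                self.count = 0
        return self.state


def debounce(samples, threshold=3):
    """Run a whole sequence of raw samples through a fresh Debouncer."""
    d = Debouncer(threshold)
    return [d.sample(s) for s in samples]


if __name__ == "__main__":
    noisy = [0, 1, 0, 1, 1, 1, 0, 0, 1, 1, 1]
    print(debounce(noisy))
```

Short spikes (the lone 1 at the start, the two 0s near the end) are ignored. The price is a delay of `threshold` samples before a real change is reported.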

05:46 <wpwrak> my plan is, once the new scheduler is done, to update the labsw design, make another prototype, and if it behaves well, also make one for adam. then he can do his testing in his sleep, much like i do ;-)

05:49 <wpwrak> then i should document the new schedule while my memory is still reasonably fresh. the efficiency comes at the price of some non-obvious dependencies. maybe i'll also find some bugs when documenting. wouldn't be the first time :)

05:50 <wpwrak> (document) white paper style. like this critter: http://abiss.sourceforge.net/doc/elv-wp-0.ps

05:55 <wolfspraul> hmm

05:55 <wolfspraul> do you think tuxbrain can sell labsw boards?

05:55 <wolfspraul> or anybody?

05:56 <wolfspraul> maybe that is something sparkfun/adafruit and friends could be interested in...

05:56 <wpwrak> i don't know. if there's enough interest, it would make sense to make proper boards, yes. also for internal use.

05:56 <wpwrak> you could then add all the loose components and sell it as a kit. save assembly ;-)

05:56 <wolfspraul> sure sure

05:56 <wolfspraul> I'd leave that to sparkfun :-)

05:57 <wolfspraul> I haven't looked at the tech details of labsw at all yet, I confess

05:57 <wpwrak> one problem is the case. i use a locally sourced case and replace the front (and later also the rear) plate. works great but may not be very portable.

05:57 <wolfspraul> maybe as part of the 10-01 news

05:58 <wpwrak> hehe ;-)

05:58 <wpwrak> ah yes, just a few days left. time flies :)

06:23 <wpwrak> scheduler comparison chart with highlighting: http://downloads.qi-hardware.com/people/werner/m1/perf/chart-20110921b.html

06:25 <roh> hm.. we were looking into making 'kits'

06:26 <wpwrak> "kits" of what ?

06:30 <roh> well.. in that case its difficult.. maybe all non-smt parts

06:31 <roh> kits are not really much trouble weee wise... devices are more complicated

06:33 <wpwrak> labsw is a bit messy to build, yes. a bit of smt at the bottom, but then plenty of through-hole on top. and then a lot more items on the front panel.

06:33 <wpwrak> (weee) is see ;-)

06:33 <wpwrak> s/is/i/

06:44 <wpwrak> hmm, the flickernoise build instructions imply removal of flex and bison. very funny :-(

06:53 <xiangfu_> wpwrak, Hi if you want build flickernoise. you may want use the old SDK. : http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-07062011-0000/Flickernoise-lm32-rtems-4.11-SDK-for-Linux-x86_64.tar.bz2

06:54 <wolfspraul> roh: kits of what?

06:54 <wolfspraul> kits of labsw?

06:54 <xiangfu_> wpwrak, all recently build use the latest RTEMS code. which have some problem now. flickernoise became very slow. let me find the IRC log.

06:55 <xiangfu_> wpwrak, or build from source: check those file may some help: https://github.com/milkymist/scripts

06:55 <wpwrak> xiangfu_: (sdk) ah, thanks ! does that have the slowdown issue too ?

06:55 <wpwrak> yes, i'm currently on my way through README.html :)

06:55 <xiangfu_> wpwrak, "http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-07062011-0000/Flickernoise-lm32-rtems-4.11-SDK-for-Linux-x86_64.tar.bz2" this one works fine.

06:56 <xiangfu_> all the build start "09082011-1842/" to "09182011-1746/" have the slow problem.

06:56 <wpwrak> so if i build from source, i would also have the slowdown ?

06:57 <xiangfu_> wpwrak, yes.

06:58 <wpwrak> ah, then i better don't do that. my objective is to make it faster, not slower ;-)

06:58 <xiangfu_> wpwrak, check here: http://en.qi-hardware.com/mmlogs/milkymist_2011-09-07.log.html#t08:50

06:59 <wpwrak> does the slowdown also happen if i rebuild the things in the milkymist (for libfpvm) and flickernoise (for src/compiler.c) repos ?

07:00 <xiangfu_> wpwrak, no. it's RTEMS bug.

07:00 <wpwrak> excellent ;-)

07:00 <xiangfu_> wpwrak, unless there is a new bug in your new libfpvm or compiler.c :D

07:01 <wpwrak> we'll see :)

07:01 <xiangfu_> wpwrak, what can I do for help you about speed up compiler.c?

07:04 <wpwrak> let's see how the SDK and then the build goes

07:04 <wpwrak> if i'm lucky, i can just drop in the new scheduler and things will fly

07:04 <wpwrak> of course, i think murphy won't agree with this plan, as usual :)

07:22 <wpwrak> now the moment of truth ... compiling flickernoise ...

07:24 <wolfspra1l> if we have a mmu and really good Linux support one day, what are the reasons that still go for rtems then?

07:24 <wolfspra1l> or is rtems just a temporary placeholder because Linux is harder to pull off?

07:25 <wolfspra1l> will rtems always be smaller and easier to customize?

07:26 <wpwrak> it'll probably be smaller. but i think switching to linux would make a lot of sense. more drivers and protocols, widely known environment, standard tools, and so on.

07:27 <wpwrak> of course, all the RT aspects need to be handled as well. not sure how demanding flickernoise is in that regard. and i also don't know the status of the RT extensions (some RT features are in the standard kernel, but there's more stuff)

07:29 <wpwrak> hmm, make -C compile-flickernoise flickernoise.fbi still seems to build the SDK. let's see how this goes ...

07:30 <wpwrak> or maybe not ... confusing :)

07:31 <xiangfu__> not SDK. but all depends libs. like RTEMS, gtk etc..

07:31 <xiangfu__> those libs + cross toolchain is the SDK :)

07:32 <wpwrak> okay. now i get a build failure:

07:32 <wpwrak> target architecture: lm32.

07:32 <wpwrak> make[3]: Entering directory `/home/qi/m1/scripts/compile-flickernoise/build_dir/bsp-milkymist/tools/build'

07:32 <wpwrak> configure: configuring in ../../cpukit

07:32 <wpwrak> checking whether CLOCK_PROCESS_CPUTIME_ID is declared... no

07:33 <wpwrak> configure: error: missing define CLOCK_PROCESS_CPUTIME_ID

07:33 <wpwrak> configure: error: /bin/bash '/opt/milkymist/rtems/c/src/../../cpukit/configure' failed for ../../cpukit

07:33 <xiangfu__> wpwrak, what toolchain are you using?

07:33 <wpwrak> whatever "make -C compile-flickernoise flickernoise.fbi" uses :)

07:33 <wpwrak> i see --target=lm32-rtems4.11

07:34 <xiangfu__> wpwrak, do you have 'lm32-rtems4.11-gcc' installed?

07:34 <wpwrak> yes. and it's in the PATH

07:34 <xiangfu__> where you get it? compiled from scripts.git?

07:34 <wpwrak> which lm32-rtems4.11-gcc

07:34 <wpwrak> /opt/rtems-4.11/bin/lm32-rtems4.11-gcc

07:34 <wpwrak> from the SDK

07:35 <xiangfu__> the ***-0000 sdk is gcc 4.5.2 and old newlib code, which can not build the latest source code :(

07:35 <xiangfu__> you have to use 4.5.3

07:36 <wpwrak> so the SDK is useless ?

07:36 <xiangfu__> wpwrak, if you already have SDK and you want compile flickernoise. no needs the script.git

07:36 <xiangfu__> just clone the flickernoise.git and compile it

07:37 <wpwrak> ah :) okay. let's see how this goes ...

07:37 <xiangfu__> wpwrak, make -C compile-flickernoise flickernoise.fbi will compile all from 0

07:37 <wpwrak> cd /opt/milkymist/flickernoise.git/src#

07:37 <wpwrak> make

07:38 <wpwrak> yaffs.c:27:31: fatal error: yaffs/rtems_yaffs.h: No such file or directory

07:38 <xiangfu__> wpwrak, checkout to 'stable_1.0' branch

07:38 <xiangfu__> the compiler.c is same in 'master' and 'stable_1.0'

07:39 <wpwrak> much better :)

07:39 <xiangfu__> too many update here and there. :)

07:40 <wpwrak> now i have a bin/flickernoise

07:41 <xiangfu__> 'make load' will compile a bin and copy it to the tftp folder for netboot. then you don't need to reflash

07:41 <xiangfu__> if you want reflash, there is a flickernoise.git/flash/flash.sh

07:42 <xiangfu__> wpwrak, for netboot, the m1 will setup ip address 192.168.0.42, and try to fetch 'boot.bin' from 192.168.0.14

07:44 <wpwrak> hmm, but i didn't see it rebuild milkymist/software/libfpvm/ i guess i need to build there too ...

07:44 <wpwrak> i'll try flashing via jtag. haven't even set up ether yet.

07:45 <wpwrak> ah, nice. make bin/flickernoise.fbi that was easy :)

07:45 <wpwrak> now the milkymist libs ...

07:46 <xiangfu__> wpwrak, libfpvm. you can manually compile it. and copy the 'libfpvm.a' to '/opt/rtems-4.11/lm32-rtems4.11/milkymist/lib' then recompile flickernoise by 'make clean load'

07:49 <wpwrak> ... make bin/flickernoise.fbi works. excellent :)

07:50 <wpwrak> does stable_1.0 already have the skipping of patches that use the camera if there's no camera connected ?

07:51 <xiangfu__> wpwrak, no.

07:51 <xiangfu__> but cherry-pick should works fine.

07:51 <wpwrak> pity. that'll be a great feature to have.

07:51 <wpwrak> pheew. make the tree grow stranger and stranger ;-)

07:52 <xiangfu__> wpwrak, for libfpvm, just found there the Makefile is under: 'milkymist.git/software/libfpvm/lm32-rtems'

07:52 <xiangfu__> 'make clean install' should works fine at that folder.

07:53 <wpwrak> cool, even better

07:56 <xiangfu__> (skip video) needs those three commits: 'f8e9008016285560e1826a48e0716d719d330387' '2776d11c50d88aa88f4372adabace3803004779e' '9523567c8bdc07b750e6b922e5c0d3acf90865f6', just run git cherry-pick ... should ok.

07:57 <xiangfu__> it will also skip MIDI, OSC, DMX

08:00 <wpwrak> writing it down ...

08:01 <wpwrak> would there be an easy way to make the master branch compile ? i'd rather be at the current head, also for making patches

08:01 <xiangfu__> it not skip compile them, it skip render those patches. just fyi

08:02 <xiangfu__> wpwrak, I guess recompile and install the new yaffs libs should ok.

08:02 <wpwrak> heh, skipping compilation would be something ;-)

08:03 <xiangfu__> try 'make -C compile-flickernoise rtems-yaffs2' under scripts.git

08:03 <xiangfu__> then compile the 'master' branch

08:05 <xiangfu__> I am trying now. :)

08:06 <wpwrak> hmm, it's unhappy. after make -C compile...

08:06 <xiangfu__> not working, the new yaffs2 needs new RTEMS API.

08:06 <wpwrak> rtems/rtems_yaffs.c:839:13: error: 'rtems_filesystem_default_write' undeclared here (not in a function)

08:06 <wpwrak> (and more errors)

08:06 <xiangfu__> yes.

08:06 <xiangfu__> wpwrak, there are not much update in 'master'

08:07 <xiangfu__> there are new 'yaffs' api and new 'skip video' code in 'master'

08:07 <xiangfu__> we should fix the rtems bug fast.

08:08 <xiangfu__> then we will happy on 'master' branch

08:10 <wpwrak> do you already know what the rtems bug is ? in the irc log, it looks as if it hadn't been quite identified yet

08:12 <xiangfu__> wpwrak, I don't know that. :(

08:16 <wpwrak> for further study then :)

08:20 <xiangfu__> yes.

09:12 <xiangfu__> wpwrak, I meet the NOR error again, here is the standby diff: http://pastebin.com/rvu4kGyz

09:14 <wpwrak> funny address. but indeed, it's our old friend. is that an rc2 or an rc3 board ?

09:14 <xiangfu__> rc2 board.

09:16 <wpwrak> seems that rc2 gets NOR corruption more often than rc3. the reset circuit in rc3 probably does help a bit..

09:17 <wpwrak> (at least my rc3 needs on average 500 power cycles)

09:17 <xiangfu__> wpwrak, I will reflash standby.bin now. there is not much info from me. just normal use. normal reflash

09:20 <wpwrak> yes. i think that's "normal" for rc2. if it was as rare as in rc3, it may have gone unnoticed.

09:24 <lekernel> xiangfu__, just a quick test. can you take FN 1.0RC1 (the old RTEMS/YAFFS and such) and verify that the flash write function does not get called at inappropriate times in YAFFS?

09:25 <lekernel> xiangfu__, there's a task that periodically flushes the YAFFS cache every 10 seconds or so

09:26 <lekernel> maybe it triggers flash writes everytime, even when the cache is in sync. that would be bad in every case, not only because it can cause unexpected corruption when the power is shut down in the middle, but also because it wears out the flash
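
The behaviour one would want from such a periodic flush can be sketched as follows. This is an illustration with invented names, not the YAFFS cache code or its API: a write-back cache tracks which entries are dirty and the flush only touches the backing store for those.

```python
class WriteBackCache:
    """Toy write-back cache: flush() writes only what changed since the
    previous flush, so a periodic flush of a clean cache costs no writes."""

    def __init__(self, backend_write):
        self._write = backend_write   # callable(key, value) doing the real write
        self._data = {}
        self._dirty = set()

    def put(self, key, value):
        # Storing an unchanged value does not mark the entry dirty.
        if key not in self._data or self._data[key] != value:
            self._data[key] = value
            self._dirty.add(key)

    def flush(self):
        """Meant to be called periodically. Returns the number of writes."""
        written = 0
        for key in sorted(self._dirty):
            self._write(key, self._data[key])
            written += 1
        self._dirty.clear()
        return written


if __name__ == "__main__":
    log = []   # stands in for the flash; records every write
    cache = WriteBackCache(lambda k, v: log.append((k, v)))
    cache.put("page0", b"abc")
    print(cache.flush(), cache.flush(), log)
```

Counting calls to the backend write function, as in the `log` list here, is the same kind of check as putting a printf into the flash write function.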

09:27 <xiangfu__> lekernel, ok. I will do that. I have old stuff in my system.

09:31 <lekernel> in every case I made a binary snapshot of everything except Flickernoise: http://milkymist.org/3rdparty/rtems-fn1.0.tar.bz2

09:31 <lekernel> you'd just need to recompile YAFFS and relink/recompile FN

09:32 <xiangfu__> downloading.

09:32 <xiangfu__> yes.

09:32 <lekernel> well, if you still have the old stuff, just use it :)

09:32 <lekernel> I'm providing the binary snapshot just in case you have something missing ...

09:33 <xiangfu__> I have the SDK. like: http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-07062011-0000/

09:33 <xiangfu__> this is the old yaffs2 right? : https://github.com/milkymist/rtems-yaffs2-old

09:34 <xiangfu__> I will try to use the SDK and rtems-yaffs2-old.

09:34 <lekernel> yes

09:35 <wpwrak> lekernel: btw, latest scheduler comparison: http://downloads.qi-hardware.com/people/werner/m1/perf/chart-20110921b.html

09:35 <xiangfu__> I will do that today(if I have time), I will on the train to ChangeChun, 3 hours later. I will do this today or next few days.

09:35 <xiangfu__> I ask some days off, but will read email as always :-)

09:35 <lekernel> ha wow

09:35 <wpwrak> now produces functionally identical code to your scheduler for all the patches. i'll give it a spin on the M1 after a nap.

09:35 <lekernel> LCPF = long latency instructions first?

09:36 <wpwrak> Longest Critical Path First. also considers the dependant operations.

09:37 <lekernel> and "new (no optimizer)" is the same as my basic scheduler, except that it schedule the instruction with the largest latency when there are several choices?

09:37 <wpwrak> realizes that "longest" and "critical path" are a bit redundant

09:38 <wpwrak> it schedules the one that comes first in fpvm

09:39 <GitHub152> [rtems-yaffs2] sebhub pushed 1 new commit to master: https://github.com/milkymist/rtems-yaffs2/commit/734a675a484bed7fd870048d15e4a6ab8370b83a

09:39 <GitHub152> [rtems-yaffs2/master] Fixed return value of ycb_file_lseek(). - Sebastian Huber

09:39 <lekernel> this is cool, I never thought someone would contribute something on PFPU. I'm impressed :)

09:40 <xiangfu__> indeed.

09:41 <wpwrak> well, no, one more level: in each cycle, it adds the instructions that become available to its "to do" list. each list of additions is sorted by fpvm position. but the to do list doesn't get reordered. so the ones that become available first usually get issued first (unless the destination slot is blocked)
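
The non-optimizing behaviour described in the message above can be sketched roughly like this. It is a toy model, not the actual scheduler: the program encoding, the latencies and the rule that only one result may arrive per cycle (the "destination slot") are assumptions made for the example.

```python
def schedule_fifo(program):
    """program: list of (latency, deps) in fpvm order, deps are indices.
    Returns the issue cycle of each instruction."""
    n = len(program)
    issue = [None] * n
    todo = []            # never reordered, new entries are only appended
    queued = set()
    busy_slots = set()   # cycles in which a result is already due to arrive
    cycle = 0
    while any(c is None for c in issue):
        # Instructions whose operands are all available in this cycle.
        newly = [
            i for i in range(n)
            if i not in queued
            and all(
                issue[d] is not None and issue[d] + program[d][0] <= cycle
                for d in program[i][1]
            )
        ]
        todo.extend(newly)   # range(n) order is fpvm position order
        queued.update(newly)
        for i in todo:
            done = cycle + program[i][0]
            if done not in busy_slots:   # destination slot is free
                busy_slots.add(done)
                issue[i] = cycle
                todo.remove(i)
                break                    # at most one issue per cycle
        cycle += 1
    return issue


# Hypothetical four-instruction program: (latency, dependencies).
EXAMPLE = [(2, []), (1, []), (2, [0]), (1, [1])]

if __name__ == "__main__":
    print(schedule_fifo(EXAMPLE))
```

In the example, instruction 1 is ready in cycle 0 but cannot go out in cycle 1 because its result would arrive in the same cycle as the result of instruction 0, so it is delayed by one cycle.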

09:42 <wpwrak> the pfpu design is quite nice and straightforward. maybe with a bit of tweaking of the handling of static registers, even my scheduler could get a bit simpler.

09:51 <lekernel> wpwrak, "the flickernoise build instructions imply removal of flex and bison"?

09:51 <lekernel> mh?

09:51 <lekernel> you can install both flex/bison and re2c/lemon at the same time

10:00 <xiangfu__> I added some 'printf' to : https://github.com/milkymist/rtems-yaffs2-old/blob/master/direct/rtems/rtems.c#L111

10:00 <xiangfu__> my_write()

10:03 <wpwrak> lekernel: for ubuntu, there's deinstallation of M4, which in turn deinstalls flex and bison. i could of course manually install them again later. but it's not so nice if you're forced to have a lot of manually installed packages.

10:08 <lekernel> seems it's an ubuntu-specific bug

10:08 <wpwrak> sigh. cycles without trouble. time to make room in the fridge ...

10:08 <wpwrak> s/cycles/4000 cycles/

10:12 <wpwrak> hmm, does RTEMS/FN show the build date somewhere ?

10:13 <xiangfu__> the "About"

10:14 <xiangfu__> in "Control Panel"

10:14 <wpwrak> great, thanks !

10:21 <xiangfu__> now add some printf to 'yaffs_flush_whole_cache'

10:21 <wpwrak> grmbl. it hits an assertion. now .. where did i go wrong ...

10:22 <lekernel> xiangfu__, add it to the flash write function

10:23 <lekernel> and check that the flash doesn't get written when it's only rendering

10:24 <lekernel> you shouldn't need to dive into the cache flushing code unless you see that it's writing the flash when it should not

10:27 <xiangfu__> lekernel, yes. I have added some printf to my_write.

10:27 <xiangfu__> there is no 'my_write' called when rendering.

10:29 <lekernel> ok. then it's not the problem

10:29 <lekernel> bbl

10:29 <lekernel> thanks for testing

10:49 <qi-bot> The Firmware build was successfull, see images here: http://fidelio.qi-hardware.com/~xiangfu/build-milkymist/milkymist-firmware-09222011-1112/

14:03 <wpwrak> hmm, rtems isn't very smart when it comes to concurrent console output, is it ? even an abort concurrent with a printf from the same program yields alphabet soup :)

14:04 <wpwrak> s/abort/assert/

14:48 <kristianpaul> wpwrak: rtems have some surprises :)

14:49 <kristianpaul> i basically use it because of network support, but i'm actually using a hacked version of m1 bios, as i dont need too much throughput

14:50 <kristianpaul> (labsw) for my personal case, will be nice for full control of a milkymist one remotely

14:50 <kristianpaul> so far i always leave my m1 turned on, and thanks to jtag-boot i can remotely do something that before i needed to do in place (push button ;))

14:52 <kristianpaul> also your neocon logging support is extremely useful for me now :)

14:54 <wpwrak> yeah, with jtag control, you need to power cycle only in some rare cases. i like how well urjtag works. at openmoko, we used openocd. that was a complete and utter nightmare. locked up at the slightest difficulty. and because it was daemon plus client, it wasn't easily scripted either.

14:54 <wpwrak> (neocon logging) sometimes it's the simplest things ... ;-)

14:56 <wpwrak> hmm, the wayward compilation spits out slightly different registers. interesting. let's see if the code input is the same ... about the only thing i can think of that might seriously derail the scheduler would be an incorrect value in nbindings.

14:57 <wpwrak> valgrind is very happy with my code. also -O9 doesn't have any complaints. this normally means that it should work ;-)

16:20 <lekernel> wolfspraul, who's Christiaan Virant ?

16:23 <wpwrak> very interesting. same source, but scheduler input is a little different on lm32 and on x86-64. hmm ...

16:27 <lekernel> input or output?

16:28 <wpwrak> input ! and the output (of my scheduler) has troubles, too. not sure yet whether that's because of something weird in the input or a yet undiscovered bug.

16:52 <wpwrak> Lekernel & Rovastar & Fvese - Subconscious Objects.fnp, lm32 has 108 bindings, x86 has 106 bindings (just unnamed constants). how peculiar. maybe this has something to do with the conversion in get_registers. the rest of the code looks pretty tame, though.

16:52 <lekernel> hm, maybe FN feeds the patch code differently to libfpvm than your x86 test program?

16:52 <lekernel> does it work?

16:55 <wpwrak> nope. my scheduler trips over an assert. so there's something it doesn't like. still searching ...

17:01 <wpwrak> hmm, but nourishment first. this feels as if it may take a while to sort out ...

19:54 <GitHub23> [llvm-lm32] jpbonn pushed 659 new commits to master: http://git.io/GJe5RA

19:54 <GitHub23> [llvm-lm32/master] Make IC_VEX* not inherit from IC_*. Prevents instructions with no VEX form from disassembling to their non-VEX form. Also prevents weak filter collisons that were keeping valid VEX instructions from decoding properly. Make VEX_L* not inherit from VEX_* because the VEX.L bit always important. This stops packed int VEX encodings from being disassembled when specified with VEX.L=1. Fixes PR10831 and PR10806. - Craig Topper

19:54 <GitHub23> [llvm-lm32/master] Pass signed (not unsigned) 10 bit field to SPU 'ori' instruction. - Kalle Raiskila

19:54 <GitHub23> [llvm-lm32/master] Compare type size instead of type _store_ size to make sure that BitCastInst - Jakub Staszak

23:49 <wpwrak> hmm, -DPRINTF_FLOAT ... where on earth would not setting it make sense, ever ? :)