#m-labs on 2016-11-01 — irc logs at freenode.irclog.whitequark.org

2015-03-04 14:45 sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

00:09 stekern has quit [Ping timeout: 245 seconds]

00:14 stekern has joined #m-labs

00:41 stekern has quit [Ping timeout: 265 seconds]

00:54 stekern has joined #m-labs

01:21 stekern has quit [Ping timeout: 268 seconds]

01:34 stekern has joined #m-labs

02:33 <GitHub46> [artiq] whitequark pushed 1 new commit to master: https://git.io/vXYGm

02:33 <GitHub46> artiq/master 18ae8d5 whitequark: gateware: fix mailbox.

02:35 <whitequark> doh, migen arrays don't wrap

02:35 <whitequark> tookme a few hours...

02:35 <cr1901_modern> what do you mean by wrap?

02:36 <whitequark> well, I expected them to just drop the high bits

02:38 <cr1901_modern> I still don't understand. s.eq(b[1:8]), where s and b are 10 bits, doesn't connect bits 1 to 7 of "b" to bits 0 to 6 of "s"?

02:38 <whitequark> that doesn't involve any arrays

02:40 <cr1901_modern> Oh, right... Array(). Erm, I thought they did wrap too.

02:42 <GitHub75> [migen] whitequark pushed 1 new commit to master: https://git.io/vXYGw

02:42 <GitHub75> migen/master b8000db whitequark: doc: explain what happens to an Array on out-of-bounds access.

02:45 stekern has quit [Ping timeout: 245 seconds]

02:46 <bb-m-labs> build #152 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/152

02:48 <bb-m-labs> build #1049 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1049 blamelist: whitequark <whitequark@whitequark.org>

02:52 <sb0> whitequark, why does the mailbox have so many slots?

02:52 <sb0> and for large storage, use Memory which maps to more efficient FPGA resources

02:53 <whitequark> so many? it doesn't. it has three

02:54 <whitequark> I just didn't feel like messing with log2, which would probably have some subtle bugs

02:57 <sb0> I'm not sure how "& 0xff" plays out...

02:58 <sb0> how about i.adr[:bits_for(size-1)] ?

02:58 <sb0> bits_for(x) returns the number of bits you need for representing x. harder to get wrong than log2 equations...

02:59 stekern has joined #m-labs

03:01 <whitequark> oh, I thought something bits_for would be convenient, but didn't know

03:08 <sb0> hm, so you renamed them "async" rpcs...

03:08 <sb0> fire-and-forget -> background -> async ...

03:08 <whitequark> yes, "background" seems wrong

03:12 <sb0> what the fuck

03:13 <sb0> http://hastebin.com/foxodilosu.sql

03:13 <whitequark> protected branch?

03:14 <whitequark> [remote rejected] sounds like a protected branch

03:14 <whitequark> oh

03:14 <sb0> it's just a regular push

03:14 <whitequark> wait, fatal error...

03:14 <whitequark> try git gc

03:14 <sb0> didn't help

03:14 <sb0> this sort of shit was why i stopped using subversion...

03:14 <sb0> now it happens again

03:16 <GitHub145> [artiq] sbourdeauducq pushed 1 new commit to master: https://git.io/vXYZV

03:16 <GitHub145> artiq/master 43cd970 Sebastien Bourdeauducq: make set_dataset and mutate_dataset async RPCs

03:16 <sb0> unprotecting the branch solved it, even though it was not a force push or anything

03:20 <whitequark> bizarre

03:29 <bb-m-labs> build #153 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/153

03:31 <bb-m-labs> build #1050 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1050 blamelist: Sebastien Bourdeauducq <sb@m-labs.hk>

03:47 kuldeep has quit [Ping timeout: 244 seconds]

03:48 sandeepkr has quit [Ping timeout: 260 seconds]

03:56 stekern has quit [Ping timeout: 268 seconds]

04:00 sandeepkr has joined #m-labs

04:03 stekern has joined #m-labs

04:17 sandeepkr has quit [Ping timeout: 260 seconds]

04:23 fengling has joined #m-labs

04:47 sandeepkr has joined #m-labs

05:21 stekern has quit [Ping timeout: 244 seconds]

05:31 <whitequark> sb0: ValueError: Vivado requires period constraints on all clocks used in false paths

05:33 stekern has joined #m-labs

05:45 kuldeep has joined #m-labs

06:02 stekern has quit [Ping timeout: 265 seconds]

06:19 stekern has joined #m-labs

06:34 stekern has quit [Ping timeout: 265 seconds]

06:38 stekern has joined #m-labs

06:47 mumptai has joined #m-labs

06:51 stekern has quit [Ping timeout: 244 seconds]

06:52 <GitHub125> [artiq] whitequark pushed 2 new commits to master: https://git.io/vXYRp

06:52 <GitHub125> artiq/master c1e6d4b whitequark: runtime: fix multiple async RPC bugs.

06:52 <GitHub125> artiq/master 636d4ef whitequark: gateware: rewrite mailbox to use bits_for.

06:52 stekern has joined #m-labs

07:10 <bb-m-labs> build #154 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/154

07:32 <bb-m-labs> build #1051 of artiq is complete: Failure [failed python_unittest_1] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1051 blamelist: whitequark <whitequark@whitequark.org>

08:01 rohitksingh has joined #m-labs

08:04 stekern has quit [Ping timeout: 244 seconds]

08:15 <whitequark> ugh, this is extremely obnoxious

08:16 <whitequark> rust stdlib was *really* not made for 0-allocation io

08:16 <whitequark> i'm not sure when it will really be done because all of the options here i have are bad,

08:18 <whitequark> ok well I'll go for the dirty hack.

08:19 <whitequark> (by 0-allocation here I mean 0-allocation *ever*, including on I/O errors; not on the fast path, which is of course provided)

08:24 <whitequark> maybe in a year something will officially be done about this, from the looks of PR discussions. i am very unhappy

08:30 stekern has joined #m-labs

08:56 <GitHub75> [artiq] whitequark pushed 1 new commit to master: https://git.io/vXYwh

08:56 <GitHub75> artiq/master 2095d01 whitequark: runtime: dirty hacks to remove allocations in ksupport.

09:15 <bb-m-labs> build #155 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/155

09:17 <bb-m-labs> build #1052 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1052 blamelist: whitequark <whitequark@whitequark.org>

09:20 stekern has quit [Ping timeout: 256 seconds]

09:21 <sb0> whitequark, how did you get that error?

09:21 <whitequark> trying to build, with --no-compile-gateware even

09:21 <sb0> build what?

09:21 <whitequark> artiq. with updated migen/misoc

09:21 <sb0> why does it work on the buildserver then?

09:21 <whitequark> noidea?

09:22 <whitequark> well, i commented that out meanwhile.

09:22 <sb0> it works on my machine too

09:22 <sb0> do you have all the commits etc.?

09:23 <whitequark> should have

09:23 <sb0> building with the same options as the buildserver?

09:23 <whitequark> python3.5 -m artiq.gateware.targets.kc705 -H nist_clock --no-compile-gateware

09:23 <sb0> what branch?

09:23 <whitequark> master

09:37 stekern has joined #m-labs

09:43 <sb0> whitequark, yup works fine here

09:43 <sb0> migen b8000db81e26b4ad947110cae89a2cd2b265e530

09:44 <sb0> misoc ad414a72cab2a5301bf5e3579659ae136831f1be

09:44 <sb0> artiq 2095d01b84ebc018f2bd438dd8da5a8d846eb9fa

09:44 <whitequark> okay, will investigate

09:45 <whitequark> current status: all plumbing in place, simple async RPCs work, complex ones fail in mysterious ways

09:45 <whitequark> File "/home/whitequark/Work/artiq-dev/artiq/artiq/test/coredevice/test_portability.py", line 221, in test_misc

09:45 <whitequark> self.assertEqual(uut.acc, sum(uut.al))

09:45 <whitequark> AssertionError: 0 != 15

09:45 <whitequark> should be the last bug...

09:47 <whitequark> ah, looks like 2nd+ async RPC gets clobbered.

10:04 <whitequark> sb0: oh.

10:10 <whitequark> there's a race condition between async and sync RPCs on the comms CPU side

10:10 fengling has quit [Ping timeout: 268 seconds]

10:31 <sb0> don't you put them all into the same fifo?

10:33 <whitequark> no?

10:33 <whitequark> sync RPCs don't have to fit into the fifo chunks

10:33 <whitequark> anyway I fixed that

10:34 <whitequark> I believe async RPCs are finally done

10:34 <GitHub60> [artiq] whitequark pushed 1 new commit to master: https://git.io/vXY1r

10:34 <GitHub60> artiq/master 6fcd57a whitequark: runtime: fix remaining async RPC bugs.

10:34 <sb0> well yes, but isn't a single fifo simpler?

10:35 <whitequark> not really. I try to write them into a chunk, then just fall back to the old path if I can

10:35 <whitequark> I suppose I could move serialization completely to ksupport

10:35 <sb0> yes, but I'd have a single path...

10:35 <sb0> replace the mailbox completely with one FIFO

10:35 <sb0> then synchronization is trivial

10:36 <whitequark> meh, it wasn't very complex in the first place

10:36 <sb0> when you do async_rpc(); normal_rpc(); does it execute normal_rpc() only after async_rpc() has completed?

10:36 <whitequark> it does

10:36 <whitequark> session.rs:486

10:38 <sb0> bah that races?

10:39 <sb0> what if the kernel CPU posts both an async RPC and a normal RPC between lines 489 and 493?

10:48 stekern has quit [Ping timeout: 252 seconds]

10:49 fengling has joined #m-labs

10:53 <bb-m-labs> build #156 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/156

10:54 fengling has quit [Ping timeout: 268 seconds]

10:55 <bb-m-labs> build #1053 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1053 blamelist: whitequark <whitequark@whitequark.org>

10:56 stekern has joined #m-labs

11:04 _whitelogger has joined #m-labs

11:14 stekern has quit [Ping timeout: 268 seconds]

11:19 stekern has joined #m-labs

11:34 stekern has quit [Ping timeout: 265 seconds]

11:44 rohitksingh has quit [Ping timeout: 250 seconds]

11:50 stekern has joined #m-labs

11:50 fengling has joined #m-labs

11:55 fengling has quit [Ping timeout: 268 seconds]

12:51 fengling has joined #m-labs

12:53 <whitequark> hm, correct

12:56 fengling has quit [Ping timeout: 268 seconds]

12:59 stekern has quit [Ping timeout: 260 seconds]

13:04 rohitksingh has joined #m-labs

13:10 <rjo> sb0: we'll have to do 'Alternate synchronization procedure #1'

13:11 <rjo> sb0: a) we can't arbitrarily move sysref relative to dac_clk and adc_clk since it needs to meet setup-hold there. that means the "[with fine delay]" should be removed.

13:12 <rjo> b) we can't use the fine delay on dac_clk or adc_clk. that would increase noise too much.

13:14 <rjo> basically we can't "fine-delay the entire clock tree to meet rtio". it also would completely sabotage reproducible (fine) phase. we'll have to coarse delay sysref (by integer dac_clk cycles) and track the fractional delay across reboots.

13:16 <rjo> but yes. we can use the hmc7044 to do the delay scan (on FPGA SYSREF only). but there will be some state saving needed.

13:18 stekern has joined #m-labs

13:21 <whitequark> sb0: oh, I remember one reason I didn't go with full on mailbox communication

13:21 <whitequark> er

13:21 <whitequark> full on queue communication

13:21 <whitequark> I thought it would make sense to have stuff like cache and watchdog requests be processed independently of the state of async RPCs

13:22 <whitequark> but now that I look at my current implementation, it doesn't take advantage of that anyway, so meh

13:22 <GitHub22> [artiq] whitequark pushed 1 new commit to master: https://git.io/vXOej

13:22 <GitHub22> artiq/master b30734a whitequark: runtime: fix a race condition with async RPCs....

13:29 <whitequark> sb0: ok that will do for now i think. i'll rewrite it in the way you want if we ever find another concurrency bug

13:29 <whitequark> what's the next priority?

13:30 <sb0> all mailbox RPCs will block the kernel CPU until the RPC is completed, correct?

13:30 <whitequark> yup

13:30 <sb0> #589

13:30 <whitequark> ah, the si5324 thing

13:30 <sb0> yes

13:30 <whitequark> ok that should be easy at least

13:31 <whitequark> do I recall it correctly that you want it to lock to clock input 1, or clock input 2, or if none of those are available, generate a clock itself?

13:31 <whitequark> and switch to whatever becomes available?

13:32 <sb0> whitequark, lock to clock input 1 if it's available, otherwise generate a clock itself (freerun)

13:32 <sb0> all inputs and outputs 62.5MHz

13:35 <sb0> rjo, first method still works, it's just that the FPGA has to know where the clock lands on each DAC, and delay sysref accordingly

13:37 <sb0> that's basically compensating for the clock/sysref skew on the PCB

13:40 <rjo> sb0: clk and sysrefs are trace-length matched anyway to each chip.

13:41 <rjo> sb0: but the state saving from the second method needs to be there. and fine-delay on dac_sysref and adc_sysref is neither useful nor possible.

13:42 <bb-m-labs> build #157 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/157

13:43 <sb0> why does it need to be there? the delay method gives you finer resolution than a cycle

13:43 <sb0> so there will be an uncertainty of one (or maybe more) delay taps, but it won't matter

13:44 <rjo> sb0: what do you want to delay?

13:44 <sb0> sysref

13:44 <rjo> to what chip?

13:44 <bb-m-labs> build #1054 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1054 blamelist: whitequark <whitequark@whitequark.org>

13:44 <sb0> and since sysref is a synchronous signal, that uncertainty is absorbed

13:44 <sb0> to the dacs

13:45 <rjo> as long as you stay with in setup-hold that doesn't change anything. if you get out of setup hold, first you violate them and then you add/sub one dac_clk cycle.

13:45 <sb0> yes

13:45 <rjo> so. you don't need the fine delay.

13:45 <sb0> the delay resolution is 25ps, which is way below the setup/hold window

13:46 <sb0> i need the fine delay on sysref

13:46 <rjo> no. it doesn't give you anything. the trace lengths are matched.

13:46 <sb0> it does

13:47 <sb0> we measure the rtio/sysref phase in fine delay taps, then adjust sysref

13:47 <rjo> as i said. you can only ever delay sysref by integer multiples of dac_clk.

13:48 <sb0> no, you can move it within its s/h window

13:48 <rjo> that is "to the dacs" as you said.

13:48 <rjo> to the fpga you can move it as much as you wwant to do the scan. yes.

13:48 <rjo> but to the dac moving it within s/h doesn't change anything.

13:48 <sb0> I mean to the DACs

13:49 <rjo> it is smack in the middle already by design.

13:49 <sb0> and the scheme I propose does rely on a few delay taps giving the same result at the DAC as it will be within the s/h window

13:50 rohitksingh1 has joined #m-labs

13:50 <rjo> iiuyc, all you are trying to do is to compensate for pcb skew. but that's not needed and not the primary concern.

13:50 <rjo> (i.e. skew between dac_sysref and dac_clk).

13:51 rohitksingh has quit [Ping timeout: 250 seconds]

13:52 fengling has joined #m-labs

13:55 <sb0> hm, yes you're right, there's a problem

13:56 <rjo> sb0: ? just needs a bit of nudging.

13:57 fengling has quit [Ping timeout: 268 seconds]

13:58 <sb0> I really dislike the state store across reboots

13:58 <sb0> it looks like microsoft or xilinx design

13:59 <sb0> rjo, what's the problem with fine-delaying the DAC clock exactly?

13:59 <rjo> well. alternatively we can leave it to the user to "make sure that dac_clk meets rtio_clk setup-hold".

14:00 <rjo> sb0: the fine delay taps are very noisy.

14:00 <sb0> where do you find that spec'd?

14:01 <rjo> "Causes phase noise degradation of up to 12 dB; therefore, do not use on noise sensitive"

14:01 <rjo> DCLK channels.

14:01 <rjo> and in the model

14:01 <sb0> ah, found it.

14:01 <sb0> well

14:01 <sb0> we can delay the rtio clock then

14:02 stekern has quit [Ping timeout: 260 seconds]

14:02 <sb0> let me think how that would work...

14:02 <rjo> but you need to store something.

14:05 <rjo> and using the fine delay on dac_clk is conceptually flawed.

14:05 <rjo> even if it was noiseless.

14:09 <whitequark> bb-m-labs: force build artiq

14:09 <bb-m-labs> build #1055 forced

14:09 <bb-m-labs> I'll give a shout when the build finishes

14:16 stekern has joined #m-labs

14:16 <sb0> leave it to the user to "make sure that dac_clk meets rtio_clk setup-hold" << how, without delaying dac_clk?

14:17 <sb0> delaying rtio clock by some fixed amount?

14:29 <rjo> they can phase delay the 100 MHz into the rack.

14:30 <bb-m-labs> build #158 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/158

14:31 <rjo> the the serial links metlino-sayma would need to be delay matched.

14:31 <sb0> sure, but the rtio clock at each sayma depends on backplane skew

14:31 <sb0> and in some configurations, the rtio clock will be derived from the same 100MHz, though we can just add a delay there

14:32 <sb0> well we can have a programmable rtio clock delay in each sayma, which is tuned for a given backplane

14:33 <sb0> doing that in the FPGA is just a MMCM and a handful of FFs

14:34 <bb-m-labs> build #1055 of artiq is complete: Failure [failed python_unittest_1] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1055

14:53 fengling has joined #m-labs

14:58 fengling has quit [Ping timeout: 268 seconds]

14:59 stekern has quit [Ping timeout: 256 seconds]

15:12 stekern has joined #m-labs

15:17 <whitequark> sb0: do you care about only loss of signal, or also frequency offset?

15:22 <sb0> there shouldn't be a long term frequency offset, though the FPGA may glitch for a while

15:22 <whitequark> it only does very crude ranges

15:22 <sb0> (the input)

15:22 <whitequark> 50-105 MHz

15:23 <sb0> what does that mean exactly?

15:23 <whitequark> wait

15:24 <whitequark> nvm, I was wrong

15:25 <whitequark> the FOS alarm ranges are 11-12ppm, 48-49ppm, 30ppm, 200ppm

15:25 <whitequark> all +/-

15:26 <whitequark> sb0: so I am looking over it again

15:26 <whitequark> and to me it looks that "switch to freerun if CKIN1 is lost" is the default behavior

15:26 <sb0> okay, but freerun needs setup, and you need to connect the xtal to clkin2

15:28 <whitequark> can you elaborate?

15:29 <sb0> freerun is about connecting the xtal (internally) to clkin22

15:29 <sb0> *clkin2

15:29 <whitequark> that's the wrong freerun mode

15:30 <whitequark> that's the bit that lets you use it in freerun mode without ever locking to an external signal

15:30 <whitequark> if you simply set it up locked to a clock and then remove the clock, it will free run by itself

15:30 <sb0> appropriate clkin1/clkin2 divider settings need to be set so that they both result in approximately the same output freq

15:30 <whitequark> are you sure?

15:30 <sb0> I know, but I want the freerun mode with the xtal on clkin2

15:30 <sb0> yes

15:30 <whitequark> ah ok

15:30 <sb0> when the system boots it hasn't seen any clock

15:30 <sb0> digital hold will fail

15:31 <whitequark> and you want clock then, right?

15:31 <whitequark> ok

15:31 <sb0> but this mode is designed for dealing with that

15:31 <whitequark> then I misunderstood

15:32 <sb0> the chip should be configured to output 62.5MHz at all times, synchronized to clkin1 whenever possible, with hitless switching when clkin1 appears/disappears, in the most automous way possible

15:32 <sb0> *autonomous

15:32 <whitequark> so automated revertive mode

15:32 <whitequark> ok

15:33 <sb0> yes

15:33 <sb0> it can do that, it's actually designed to handle this very problem that appears in some high speed serial systems like SDI

15:33 <whitequark> yeah, I know it can do that

15:33 <whitequark> I just misunderstood my task

15:34 <whitequark> ok, so we have 62.5 MHz CKIN1 and 114.285 MHz XA/XB and always 62.5 MHz CKOUT1

15:35 <whitequark> 114.285, why that crystal specifically?

15:35 <whitequark> idle question relaly

15:46 stekern has quit [Ping timeout: 260 seconds]

15:48 <sb0> whitequark, precisely because that frequency is weird. for some reason, the PLL performs better when the output frequency is not a multiple of the crystal frequency

15:48 <sb0> thanks

15:51 <whitequark> so dspllsim shows me a 0.03ppm deviation

15:52 <whitequark> I assume that's completely benign

15:54 fengling has joined #m-labs

15:59 fengling has quit [Ping timeout: 268 seconds]

16:01 <whitequark> sb0: done

16:44 stekern has joined #m-labs

16:55 fengling has joined #m-labs

17:01 fengling has quit [Ping timeout: 268 seconds]

17:30 rohitksingh1 has quit [Ping timeout: 268 seconds]

17:43 rohitksingh has joined #m-labs

17:56 fengling has joined #m-labs

18:01 stekern has quit [Ping timeout: 260 seconds]

18:02 fengling has quit [Ping timeout: 268 seconds]

18:19 stekern has joined #m-labs

18:24 stekern has quit [Ping timeout: 260 seconds]

18:44 rohitksingh has quit [Quit: Leaving.]

18:48 stekern has joined #m-labs

18:57 fengling has joined #m-labs

19:03 fengling has quit [Ping timeout: 268 seconds]

19:14 rohitksingh has joined #m-labs

19:15 rohitksingh has quit [Client Quit]

19:58 fengling has joined #m-labs

20:04 fengling has quit [Ping timeout: 268 seconds]

20:18 stekern has quit [Ping timeout: 260 seconds]

20:54 stekern has joined #m-labs

20:59 fengling has joined #m-labs

21:00 stekern has quit [Ping timeout: 245 seconds]

21:04 MiW has quit [Ping timeout: 252 seconds]

21:05 fengling has quit [Ping timeout: 268 seconds]

21:10 MiW has joined #m-labs

21:15 stekern has joined #m-labs

21:21 stekern has quit [Ping timeout: 250 seconds]

21:38 stekern has joined #m-labs

21:42 MiW has quit [Ping timeout: 250 seconds]

21:45 MiW has joined #m-labs

21:57 kuldeep has quit [Ping timeout: 245 seconds]

21:58 sandeepkr has quit [Ping timeout: 260 seconds]

22:00 fengling has joined #m-labs

22:06 fengling has quit [Ping timeout: 268 seconds]

22:09 mumptai has quit [Quit: Verlassend]

22:17 sandeepkr has joined #m-labs

23:01 fengling has joined #m-labs

23:03 stekern has quit [Ping timeout: 268 seconds]

23:06 stekern has joined #m-labs

23:07 fengling has quit [Ping timeout: 268 seconds]

23:19 stekern has quit [Ping timeout: 250 seconds]

23:54 stekern has joined #m-labs

23:54 kuldeep has joined #m-labs