#linux-amlogic on 2018-04-02 — irc logs at freenode.irclog.whitequark.org

2018-03-27 08:09 narmstrong changed the topic of #linux-amlogic to: Amlogic mainline kernel development discussion - our wiki http://linux-meson.com/ - ml linux-amlogic@lists.infradead.org - Publicly Logged on https://irclog.whitequark.org/linux-amlogic

00:04 aballier has quit [Ping timeout: 264 seconds]

00:05 aballier has joined #linux-amlogic

00:14 <ndufresne> Ely, we'll have to study drivers/amlogic/amports/vmh264.c (the multi-instance variant for H264)

00:14 <ndufresne> I bet this is only on GXL+

00:15 <ndufresne> it seems like with that we can do 12 decoding instances in parallel

00:15 <ndufresne> I doubt it's real-time, but much more flexible for userspace

00:17 <ndufresne> and this one has a lot of if (!mmu_enable)

00:21 <ndufresne> interesting, this is a totally different codec, it can decode to I420

00:59 cthugha has joined #linux-amlogic

01:02 default__ has quit [Ping timeout: 248 seconds]

01:24 commavir has quit [Remote host closed the connection]

01:25 commavir has joined #linux-amlogic

02:01 commavir has quit [Remote host closed the connection]

02:01 commavir has joined #linux-amlogic

02:13 commavir has quit [Remote host closed the connection]

02:13 commavir has joined #linux-amlogic

02:21 commavir has quit [Remote host closed the connection]

02:21 commavir has joined #linux-amlogic

02:43 steamport is now known as steamport|sleep

02:46 vagrantc has quit [Quit: leaving]

03:42 distemper has quit [Quit: bye]

03:42 distemper has joined #linux-amlogic

03:49 distemper has quit [Quit: bye]

03:49 distemper has joined #linux-amlogic

03:51 distemper has quit [Client Quit]

03:52 distemper has joined #linux-amlogic

05:13 Barada has joined #linux-amlogic

05:20 Elpaulo_m has joined #linux-amlogic

06:29 Barada has quit [Quit: Barada]

07:08 Barada has joined #linux-amlogic

07:21 <narmstrong> Beware, they have the same « plenty of alternative paths/registers » but they may not have all HW actually in the silicon, for instance they can have 3 different scalers for vpu primary plane, only one is actually present... having the driver does not mean it’s actually present and working ! The 4.9 branch may be more representative of what is actually in HW

08:25 trem has joined #linux-amlogic

08:30 tingoose has joined #linux-amlogic

09:31 Elpaulo_m2 has joined #linux-amlogic

09:34 Elpaulo_m has quit [Ping timeout: 255 seconds]

09:47 <Ely> ndufresne: Yes I studied the multi decoders briefly but I don't have much intel on it.

09:50 <Ely> I can't find any trace of "mmu_enable" in 3.14/4.9 vmh264 driver though

09:59 <TobiasTh1Viking> xdarklight: i have some time :)

10:00 <Ely> ndufresne: for hevc, vh265.c bundles both the single and multi driver (amvdec_h265_driver, ammvdec_h265_driver)

10:06 <xdarklight> TobiasTh1Viking: then let's go :)

10:07 <TobiasTh1Viking> we are hooking up meson6_cbus_banks yes?

10:07 <xdarklight> let's start with the aobus pins

10:07 <xdarklight> (it'll be easier, but cbus follows the same schema)

10:07 <xdarklight> let's look at the MESON_BANK preprocessor macro to understand its parameters: https://github.com/torvalds/linux/blob/master/drivers/pinctrl/meson/pinctrl-meson.h#L134

10:07 <TobiasTh1Viking> oki.

10:08 <xdarklight> .name is easy, it's the name of the bank (any name you want in theory, we typically go with the letters in the GPIO name, "A", "X", ...)

10:08 <TobiasTh1Viking> is open. i feel like i kinda understand it. (playing a bit with arduinos seems to overlap a bit)

10:09 <TobiasTh1Viking> still, not enough to manage on my own.

10:09 <xdarklight> .first and .last are the first and last GPIO in the bank

10:09 <xdarklight> example from Meson8 so far: BANK("CARD", CARD_0, CARD_6, ...

10:10 <xdarklight> after that there's always a pair of register offset and bit offset in this register for: REG_PULLEN, REG_PULL, REG_DIR, REG_OUT, REG_IN

10:10 <TobiasTh1Viking> specifically that is where i feel like i'm lacking some knowledge.

10:10 <xdarklight> the meanings of these registers (I might be wrong for some, but you'll get the basic idea): REG_DIR = direction of the pin (input or output)

10:11 <xdarklight> REG_OUT = if the pin is an output pin then this defines if it should output HIGH or LOW

10:11 <xdarklight> REG_IN = if the pin is an input pin you can read the value (HIGH or LOW)

10:12 <TobiasTh1Viking> and what about PULLEN/PULL?

10:12 <xdarklight> REG_PULL = defines whether pull up or pull downs are enabled

10:12 <xdarklight> or wait, that might be REG_PULLEN

10:13 <Ely> ndufresne: From the looks of it, it looks like >= GXTVBB can do H.264 4K on the regular driver. Will have to test :)

10:13 <xdarklight> TobiasTh1Viking: hmm, one defines if the pin is in pull up or pull down mode, then the other defines if the whole pull-* logic is enabled at all

10:13 <TobiasTh1Viking> so, some of this data i already have (first/last pin). but the register and bit offset. and irq's. i'll have to look that up somewhere. yes? is it in the 3.10 source for me to locate?

10:13 <xdarklight> yep, that is the next step :)

10:14 <xdarklight> one more hint (that I find important): the REG_* values always specify the register and bit offset of the *first* pin in a bank

10:14 <TobiasTh1Viking> ok

10:14 <xdarklight> so let's keep going with the Meson8 example: direction register = 0 direction bit = 22

10:15 <xdarklight> this means that CARD_0 will use register 0 and bit 22, CARD_1 will use register 0 and bit 23, ...
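
The "first pin" rule above can be sketched in a few lines. This is only an illustration of the arithmetic described in the conversation; the function name `pin_reg_and_bit` is hypothetical and not part of any driver.

```python
def pin_reg_and_bit(first_reg, first_bit, pin_index):
    """Return (register, bit) for the pin at pin_index inside a bank.

    first_reg/first_bit describe the *first* pin of the bank, as in the
    BANK() entry; every following pin uses the next bit of that register.
    """
    return first_reg, first_bit + pin_index


# Meson8 CARD bank example from the log: direction register 0, bit 22.
card_0 = pin_reg_and_bit(0, 22, 0)  # CARD_0
card_1 = pin_reg_and_bit(0, 22, 1)  # CARD_1
print(card_0, card_1)
```

Only the starting pair is stored per bank, which is why the BANK() entry stays short even for banks with many pins.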

10:15 <TobiasTh1Viking> ok. i'm not good with registers. but makes sense.

10:16 <xdarklight> next is where you find this information in the 3.10 kernel: https://github.com/endlessm/linux-meson/blob/master/arch/arm/mach-meson6/gpio.c#L303

10:16 <TobiasTh1Viking> for reference, you said we should do the meson6_aobus_banks. which only has BANK("AO", GPIOAO_0, GPIO_TEST_N, everything else is in cbus bank

10:16 <TobiasTh1Viking> so, that's first and last pin?

10:17 <TobiasTh1Viking> oh, AOMAP, has more values.

10:17 <xdarklight> let's start with what you have: BANK("AO", GPIOAO_0, GPIO_TEST_N,

10:17 <xdarklight> that looks good so far, now we need to fill the remaining values

10:18 <xdarklight> for the two IRQ fields you can go with -1 for now, they're not used by the pinctrl driver yet

10:18 <TobiasTh1Viking> sorry, i have some confusion, let me reread

10:19 <TobiasTh1Viking> i think i'm just being confused by the comment block. it says "irq" once. but there are two irq variables.

10:20 <xdarklight> ah yes, the actual comment should be "irq first" and "irq last"

10:20 <TobiasTh1Viking> good.

10:20 <TobiasTh1Viking> changing from 0, 13 to -1, -1

10:20 <xdarklight> yep

10:20 <Ely> ndufresne: haha yup, H.264 4K works on GXL out of the box with the current driver. Decoding happens at 24-25fps

10:22 <TobiasTh1Viking> xdarklight: so, now i need to parse register and bit from the "PIN_AOMAP" yes?

10:22 <xdarklight> TobiasTh1Viking: yep, I'm slightly confused currently since the 3.10 kernel doesn't give the PULL* offsets. let's fill these with -1 for now (we'll set them later)

10:24 <TobiasTh1Viking> done

10:24 <xdarklight> next is the "direction" register and bit

10:25 <xdarklight> check gpio_amlogic_direction_input and gpio_amlogic_direction_output in the 3.10 kernel, they use the "out_en_reg_bit" struct member to configure the direction

10:27 <TobiasTh1Viking> define PIN_AOMAP(pin,en_reg,en_bit,out_reg,out_bit,in_reg,in_bit) <- seems to be what i need. yes?

10:27 <xdarklight> yep

10:28 <TobiasTh1Viking> en_reg,en_bit <- pullen?

10:28 <TobiasTh1Viking> en(able)

10:28 <xdarklight> no, they're for the direction register

10:28 <xdarklight> REG_DIR

10:28 <xdarklight> (REG_DIR in the mainline kernel code)

10:28 <TobiasTh1Viking> oh wait, i could see that later in code.

10:28 <TobiasTh1Viking> ok, let me try

10:30 <TobiasTh1Viking> so, dir, in and out, all use register 7. bit 0 for dir and in. bit 16 for out. ?

10:31 <xdarklight> seems so :)

10:31 <TobiasTh1Viking> BANK("AO", GPIOAO_0, GPIO_TEST_N, -1, -1, -1, -1, -1, -1, 7, 0, 7, 16, 7, 0),
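
To make the positional arguments easier to read, here is a small sketch that labels them. The field order is taken from the descriptions in this conversation (name, first, last, two IRQ fields, then register/bit pairs for pullen, pull, dir, out, in); the `Bank` type and its field names are hypothetical helpers, not the kernel's struct.

```python
from collections import namedtuple

# Hypothetical labels for the BANK() arguments, in the order discussed above.
Bank = namedtuple("Bank", [
    "name", "first", "last",
    "irq_first", "irq_last",
    "pullen_reg", "pullen_bit",
    "pull_reg", "pull_bit",
    "dir_reg", "dir_bit",
    "out_reg", "out_bit",
    "in_reg", "in_bit",
])

# The AO entry exactly as written in the log (-1 marks "not filled in yet").
ao = Bank("AO", "GPIOAO_0", "GPIO_TEST_N",
          -1, -1, -1, -1, -1, -1,
          7, 0, 7, 16, 7, 0)

print(ao.dir_reg, ao.dir_bit)  # direction: register 7, bit 0
print(ao.out_reg, ao.out_bit)  # output: register 7, bit 16
print(ao.in_reg, ao.in_bit)    # input: register 7, bit 0
```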

10:32 <xdarklight> (when we add the cbus bank you'll see why they are separated)

10:32 <TobiasTh1Viking> and the code automatically +1 on the bit.

10:32 <xdarklight> right!

10:32 <TobiasTh1Viking> k. so now i do this for cbus, compile, and check sdcard gpio?

10:32 <xdarklight> let's fill the PULL and PULLEN bits also

10:33 <TobiasTh1Viking> oki. not irq?

10:33 <xdarklight> on Meson8 and Meson8b this simply uses the "direction" values for "pullen" and "out" values for "pull"

10:33 <xdarklight> IRQ is not used in the pinctrl drivers internally (yet). so you don't lose functionality if you don't add it for now

10:34 <TobiasTh1Viking> well, would have to go back and fix it later. might as well do it correctly now.

10:35 <xdarklight> I need to check where the IRQs are defined first

10:36 <xdarklight> before we do that we need one more step

10:36 <xdarklight> https://github.com/endlessm/linux-meson/blob/master/arch/arm/mach-meson6/gpio.c#L567

10:36 <xdarklight> the register offsets (7 in case of the AO bank) in the 3.10 kernel are going through some lookup table (p_gpio_oen_addr)

10:37 <xdarklight> index 7 ends up in the P_AO_GPIO_O_EN_N register, which in mainline register offsets is simply 0

10:39 <xdarklight> (amlogic's driver only uses one pinctrl devicetree node for both, AOBUS and CBUS - but the mainline drivers are much cleaner: they use one devicetree node per controller, each with the "correct" base addresses already specified in devicetree)

10:39 trem_ has joined #linux-amlogic

10:40 <xdarklight> it's a bit weird, not sure if you understand my (attempt at an) explanation

10:41 <TobiasTh1Viking> a bit too much info i don't know where to fit. i'll reread all of this a couple of times. i'm not sure if you gave me all the data needed to fill out the pullen and pull values. if you did you need to be a bit more specific.

10:41 <TobiasTh1Viking> hopefully i'll catch up

10:41 <TobiasTh1Viking> in p_gpio_oen_addr in 3.10, i find aobus at 7. not 0. :/

10:41 trem has quit [Ping timeout: 246 seconds]

10:41 <xdarklight> ok, for pull and pullen I'll rephrase:

10:42 <xdarklight> pullen -> use same register and bit offset as for "direction"

10:42 <xdarklight> pull -> use same register and bit offset as for "out"
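
The rule of thumb just stated (pullen copies "direction", pull copies "out") could be applied mechanically as below. This only restates the suggestion from the chat; it is not verified against the hardware, and the dictionary layout and the name `fill_pull_fields` are made up for the illustration.

```python
def fill_pull_fields(bank):
    """Return a copy of bank with pullen/pull filled from dir/out.

    bank maps a field name to a (register, bit) pair.
    """
    filled = dict(bank)
    filled["pullen"] = bank["dir"]  # pullen -> same as "direction"
    filled["pull"] = bank["out"]    # pull   -> same as "out"
    return filled


# AO values from the log, before any register renumbering.
ao = {
    "pullen": (-1, -1),
    "pull": (-1, -1),
    "dir": (7, 0),
    "out": (7, 16),
    "in": (7, 0),
}
print(fill_pull_fields(ao))
```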

10:42 <TobiasTh1Viking> ah. oki.

10:43 <TobiasTh1Viking> for now it just feels like black magic numbers. but i'm sure i'll get it later.

10:43 <xdarklight> regarding P_AO_GPIO_O_EN_N

10:43 <TobiasTh1Viking> so besides for irq. everything is 7,0, or 7,16

10:43 <xdarklight> it's defined here: https://github.com/endlessm/linux-meson/blob/master/arch/arm/mach-meson6/include/mach/register.h#L5709

10:44 <xdarklight> (0x00 << 10) results in 0

10:44 <xdarklight> (0x09 << 2) results in 0x24

10:44 <xdarklight> so the actual register address of P_AO_GPIO_O_EN_N is wherever AOBUS is plus 0x24

10:45 <TobiasTh1Viking> yeah, i have never understood the << thing. and i don't know what it is called, so i can't google it.

10:45 <xdarklight> now look at your .dts: we already specify register 0x24 there :)

10:45 <xdarklight> ah, it's bit shifting

10:45 <xdarklight> shift 0x09 two bits to the left

10:45 <xdarklight> (shift by 1 also means: multiply with 2, so in this case you can simply multiply with 4 ;))
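
A quick way to convince yourself of the shift arithmetic above is to compute it directly. The variable names are just for this illustration.

```python
# The two parts of the register definition discussed above.
bus_part = 0x00 << 10     # shifting zero gives zero
offset_part = 0x09 << 2   # shift left by two bits

# Shifting left by n bits is the same as multiplying by 2**n.
same_as_multiply = 0x09 * 4

print(hex(bus_part), hex(offset_part), hex(same_as_multiply))
# prints: 0x0 0x24 0x24
```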

10:46 <TobiasTh1Viking> oki. got a nice wikipedia open, thx. i'll read up.

10:46 <TobiasTh1Viking> not getting it now. but bits are easy. so no problem. :)

10:46 <xdarklight> :)

10:47 <TobiasTh1Viking> yes, i see "0x24 0x8" in dts.

10:47 <xdarklight> that was a lot of text just for saying: we have to replace register "7" with register "0" (only for the AO bank)
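
As a sketch of that last step: the vendor kernel's lookup-table index 7 becomes register 0 for the AO controller, while the bit offsets stay as they are. The mapping table and function name here are hypothetical; only the 7 to 0 replacement comes from the conversation.

```python
# Vendor lookup-table index -> mainline register index, AO bank only.
AO_REG_REMAP = {7: 0}


def remap_ao(pairs):
    """Replace the vendor register index in each (register, bit) pair."""
    return [(AO_REG_REMAP.get(reg, reg), bit) for reg, bit in pairs]


# dir, out, in pairs for the AO bank as read from PIN_AOMAP.
vendor_pairs = [(7, 0), (7, 16), (7, 0)]
print(remap_ao(vendor_pairs))
# prints: [(0, 0), (0, 16), (0, 0)]
```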

10:47 <TobiasTh1Viking> bahahaha

10:47 <TobiasTh1Viking> nice

10:48 <xdarklight> now onto the IRQ numbers

10:48 <xdarklight> they look them up here: https://github.com/endlessm/linux-meson/blob/master/arch/arm/mach-meson6/gpio.c#L531

10:49 <xdarklight> the first parameter to PIN_AOMAP is pin, they store it in the .num field (and read that in gpio_amlogic_to_irq)

10:50 <xdarklight> the GPIO* definitions are from: https://github.com/endlessm/linux-meson/blob/master/arch/arm/mach-meson6/include/mach/gpio.h#L186

10:50 <xdarklight> in this case you can take the values 1:1 :)

10:51 <TobiasTh1Viking> why 1:1?

10:52 <xdarklight> the GPIO numbers in the Amlogic driver are identical with the GPIO numbers

10:52 <xdarklight> sorry, GPIO numbers are identical with the IRQ numbers

10:52 <xdarklight> ^that's what I wanted to say

10:52 <TobiasTh1Viking> doesn't it say 182

10:52 <xdarklight> yep

10:53 <xdarklight> that's where the mainline driver differs from Amlogic's driver

10:54 <xdarklight> mainline: each pin controller starts counting the GPIO at 0, Amlogic: they count the overall pins in the order that some chip designer has defined

10:54 <TobiasTh1Viking> oh. so because AO is on its own pinctrl devicetree, it becomes 1:1 ?

10:55 <xdarklight> it'll be the same for CBUS: the IRQ numbers are always the ones from that gpio.h file

10:55 <TobiasTh1Viking> still kinda confused. what would the irq be for, lets say card and boot?

10:56 <xdarklight> CARD_0=121 (that would be irq start)

10:56 <xdarklight> CARD_8=129 (that would be irq end)

10:57 <TobiasTh1Viking> so, just 121 and 129, no conversion needed?

10:57 <xdarklight> indeed

10:58 <TobiasTh1Viking> only AO needed conversion. But why isn't it 1:13 for AO?

10:58 <TobiasTh1Viking> wait, why isn't it 0:13 (starting at 0).

10:58 <xdarklight> ah, the reason for that is they have *one* GPIO interrupt controller but two pin controllers. the GPIO interrupt controller takes the pins from both pin controllers

10:59 <xdarklight> so they can't use 0 for GPIOAO_0 and 0 for GPIOZ_0 again because then the interrupt controller couldn't differentiate the pins

10:59 <TobiasTh1Viking> ah. oki. so 0 is reserved.

10:59 <xdarklight> not really, it's just used for GPIOZ_0 :)

10:59 <xdarklight> and GPIOAO_0 uses 182
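
The two numbering schemes can be put side by side in a short sketch. The three vendor numbers below are the ones quoted in this conversation; the helper names are hypothetical, and the pin count of 9 for CARD follows from CARD_0 through CARD_8.

```python
# Vendor (gpio.h) numbers quoted above; these double as the IRQ numbers.
VENDOR_NUMBER = {"GPIOZ_0": 0, "CARD_0": 121, "GPIOAO_0": 182}


def irq_range(first_pin, pin_count):
    """Return (irq_first, irq_last) for a bank starting at first_pin."""
    irq_first = VENDOR_NUMBER[first_pin]
    return irq_first, irq_first + pin_count - 1


def controller_offset(vendor_number, bank_first_pin):
    """Offset of a pin inside its own bank, counted from the bank's first pin."""
    return vendor_number - VENDOR_NUMBER[bank_first_pin]


print(irq_range("CARD_0", 9))              # CARD_0..CARD_8
print(controller_offset(182, "GPIOAO_0"))  # GPIOAO_0 is the first AO pin
```

The interrupt controller needs one global number per pin, so the IRQ fields keep the vendor numbering even though each mainline pin controller counts its own GPIOs from 0.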

11:00 <TobiasTh1Viking> ok. i have enough data now that i should be able to finish meson6_cbus_banks on my own?

11:00 <TobiasTh1Viking> for verification, just (un)plug sdcard, and check dmesg for card plugged unplugged. or is there a better way i can directly check the GPIO pin? (like in sys or proc)

11:02 <xdarklight> let me check the CBUS banks first

11:05 <xdarklight> ok, for CBUS it's similar but not identical

11:06 <xdarklight> let's take Meson8 as an example: https://github.com/endlessm/linux-meson/blob/master/arch/arm/mach-meson8/gpio.c#L368 let's go with PIN_MAP(GPIOX_0,0,0)

11:06 <xdarklight> in the mainline driver this bank looks like: BANK("X", GPIOX_0, GPIOX_21, 112, 133, 4, 0, 4, 0, 0, 0, 1, 0, 2, 0),

11:07 <xdarklight> (let's ignore IRQ, pull and pullen for now again)

11:07 <xdarklight> you can see that "dir" is 0, 0 in the mainline driver, just like in Amlogic's code

11:07 Barada has quit [Quit: Barada]

11:07 <TobiasTh1Viking> yes

11:07 <xdarklight> BUT: out is "1, 0" in the mainline driver (and in is "2, 0" in mainline)

11:08 Barada has joined #linux-amlogic

11:08 <xdarklight> while there doesn't seem to be a difference in Amlogic's code

11:08 Barada has quit [Client Quit]

11:09 Barada has joined #linux-amlogic

11:09 <xdarklight> this is because they use these p_gpio_oen_addr, p_gpio_output_addr and p_gpio_input_addr lookup tables that map indices to registers

11:10 <TobiasTh1Viking> and mainline uses pinctrl and device tree. for which i need to define it?

11:10 <xdarklight> so in Amlogic's code PIN_MAP(GPIOX_0,0,0) means: "for GPIO_X look up the registers (direction, input and output) at index 0, then use bit 0 in the resulting register"

11:11 <xdarklight> index 0 maps to P_PREG_PAD_GPIO0_EN_N (for the direction register), P_PREG_PAD_GPIO0_O (output register) and P_PREG_PAD_GPIO0_I (input register)

11:12 <xdarklight> in register.h (https://github.com/endlessm/linux-meson/blob/master/arch/arm/mach-meson6/include/mach/register.h#L38) these are: 0x200c, 0x200d and 0x200e

11:12 <xdarklight> only the first of these 3 addresses is defined in your .dts

11:13 <xdarklight> to find it: multiply 0x200c by 4

11:14 <xdarklight> found it?

11:14 <TobiasTh1Viking> i'm getting way wrong numbers out.

11:14 <TobiasTh1Viking> i've found the things. but i'm not comprehending.

11:14 <xdarklight> 0x200c * 4 = 0x8030

11:15 <xdarklight> you find this in your cbus pinctrl node, it matches the "reg-name" "gpio"

11:15 Barada has quit [Ping timeout: 246 seconds]

11:15 <TobiasTh1Viking> isn't 0x8030 == 32816 ?

11:15 <xdarklight> yep

11:15 <xdarklight> but the numbers in .dts are all hex
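
The conversion from the register.h values to the byte offsets seen in the .dts is a multiplication by 4, shown here for the three registers named above. The function name is hypothetical.

```python
def cbus_byte_offset(word_index):
    """Convert a register.h word index into a byte offset."""
    return word_index * 4


for word_index in (0x200c, 0x200d, 0x200e):
    offset = cbus_byte_offset(word_index)
    # Print both notations: the .dts uses hex, a calculator shows decimal.
    print(hex(word_index), "->", hex(offset), "=", offset)
# first line printed: 0x200c -> 0x8030 = 32816
```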

11:15 <TobiasTh1Viking> ah, it's for the dts.

11:16 <TobiasTh1Viking> ok, see it in dts.

11:16 <xdarklight> ok

11:16 Barada has joined #linux-amlogic

11:16 <xdarklight> so long story short: this is how I would translate the cbus banks:

11:16 * TobiasTh1Viking thought you were explaining how to calculate out and in for mainline.

11:18 <xdarklight> BANK("X", GPIOX_0, GPIOX_NN, <take the original numbering for IRQ first>, <take the original numbering for IRQ last>, <ignore pullen reg for now>, <ignore pullen bit for now>, <ignore pull reg for now>, <ignore pull bit for now>, <register from PIN_MAP>, <bit from PIN_MAP>, <register from PIN_MAP + 1>, <bit from PIN_MAP>, <register from PIN_MAP + 2>, <bit from PIN_MAP>),

11:19 <xdarklight> why + 1 and why + 2? well, it gets you from the 0x8030 register to the next register (0x200d, or to the one after that: 0x200e)
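
The translation recipe for the cbus banks can be written as a small helper. This is a sketch of the recipe as described in the chat, with pullen and pull left at -1 as placeholders; the function name is hypothetical.

```python
UNKNOWN = -1  # placeholder for the pullen/pull fields ignored for now


def bank_from_pin_map(name, first, last, irq_first, irq_last, reg, bit):
    """Build the BANK() argument tuple from a vendor PIN_MAP(pin, reg, bit)."""
    return (
        name, first, last,
        irq_first, irq_last,
        UNKNOWN, UNKNOWN,   # pullen reg, bit
        UNKNOWN, UNKNOWN,   # pull reg, bit
        reg, bit,           # dir: register from PIN_MAP
        reg + 1, bit,       # out: next register
        reg + 2, bit,       # in: the register after that
    )


# Meson8 example from the log: PIN_MAP(GPIOX_0, 0, 0), IRQs 112..133.
x_bank = bank_from_pin_map("X", "GPIOX_0", "GPIOX_21", 112, 133, 0, 0)
print(x_bank[9:])
# prints: (0, 0, 1, 0, 2, 0)
```

The dir/out/in part of the result matches the mainline Meson8 entry quoted earlier in the log.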

11:20 <TobiasTh1Viking> oki. i think i got it.

11:20 <TobiasTh1Viking> will that be enough to check the sdcard inserted GPIO pin?

11:21 <xdarklight> should be

11:21 <xdarklight> show the diff before you boot something

11:22 <xdarklight> just so someone else can check - then hope that it works :)

11:22 <xdarklight> in /sys/class/gpio you can export gpios

11:22 <xdarklight> so you can see that state in sysfs
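
A minimal sketch of the sysfs approach, assuming the legacy /sys/class/gpio interface is enabled in the kernel. The GPIO number passed in is the global Linux GPIO number, which is an assumption you have to look up for your board; the helper names are hypothetical.

```python
import posixpath

SYSFS_GPIO_ROOT = "/sys/class/gpio"


def export_path(root=SYSFS_GPIO_ROOT):
    """Path of the file that exposes a GPIO when its number is written to it."""
    return posixpath.join(root, "export")


def value_path(gpio, root=SYSFS_GPIO_ROOT):
    """Path of the file holding the current level of an exported GPIO."""
    return posixpath.join(root, "gpio%d" % gpio, "value")


def read_gpio(gpio, root=SYSFS_GPIO_ROOT):
    """Export the GPIO, then return its level (0 or 1).

    Needs root privileges; the export write fails if the pin is already
    exported or claimed by a driver.
    """
    with open(export_path(root), "w") as export_file:
        export_file.write(str(gpio))
    with open(value_path(gpio, root)) as value_file:
        return int(value_file.read().strip())
```

Reading the card-detect pin before and after inserting the SD card should show the level change, provided no driver has already claimed the pin.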

11:23 <TobiasTh1Viking> ok. sounds good. And definitely don't boot without showing code first. i can fry something?

11:24 <TobiasTh1Viking> if there is no risk of frying something, i don't see why i don't just boot and try.

11:25 <xdarklight> I've not fried a board with "just" software yet

11:25 <xdarklight> you can probably just try it

11:26 <TobiasTh1Viking> i'll probably try then. thanks for help. i'll post a patch when i'm done with the cbus bank.

11:27 <TobiasTh1Viking> but i'll start compiling and do a test run once i've posted the patch.

11:27 Barada has quit [Quit: Barada]

11:27 Barada has joined #linux-amlogic

11:28 <xdarklight> good luck!

11:28 <TobiasTh1Viking> thx

11:52 <TobiasTh1Viking> reading through this, you said dts file like 5 times where i missed it looking at code. sorry bout that.

12:51 dsd_ has joined #linux-amlogic

13:05 <ndufresne> Ely, nice, makes sense, it's basically 1080p115 / 4
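
The "1080p115 / 4" estimate can be checked with plain arithmetic, assuming "4K" here means 3840x2160. The 115 fps figure is the one quoted in the message above; everything else is a back-of-the-envelope sketch.

```python
def pixels(width, height):
    """Number of pixels in one frame."""
    return width * height


# Assumption: 4K means 3840x2160 (UHD), i.e. four 1080p frames per frame.
ratio = pixels(3840, 2160) / pixels(1920, 1080)

# If the decoder tops out at 1080p115, the same pixel rate gives at 4K:
fps_at_4k = 115 / ratio

print(ratio, fps_at_4k)
# prints: 4.0 28.75
```

That upper bound is in the same range as the 24-25fps reported earlier for H.264 4K on GXL.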

13:06 <ndufresne> I wonder what's the diff with the other ucode ?

13:06 <Ely> ndufresne: I spoke a tad too soon, the picture looks heavily corrupted, but at least the decoder doesn't complain and "decodes" 4K. I've seen some "if (is_4k" conditions in the original vh264.c so I'll have to check further. But without a doubt it's possible.

13:07 <ndufresne> yeah, well I notice we have broken frame with bbb 1080p60, I think one of our internal buffer is too small

13:08 <Ely> Ah, great that you figured that out. I noticed that some frames in my test files always got corrupted (buffer decode error 00200), but I never knew why. Most likely the bframes.

13:10 <Ely> They do have a define "DROP_B_FRAME_FOR_1080P_50_60FPS" but it's only for <= meson6, so yeah probably we do something else wrong.

13:12 <ndufresne> I see a lot of comments in the code that seem to indicate they add hacks like this, or bump some buffer size in a test driven way

13:12 <ndufresne> instead of trying to figure-out from the spec

13:15 <ndufresne> will be around later

13:26 tingoose has quit [Remote host closed the connection]

13:50 Elpaulo_m has joined #linux-amlogic

13:53 Elpaulo_m2 has quit [Ping timeout: 276 seconds]

13:54 Elpaulo_m2 has joined #linux-amlogic

13:57 Elpaulo_m has quit [Ping timeout: 264 seconds]

13:57 Elpaulo_m2 has quit [Client Quit]

14:27 Elpaulo has quit [Quit: Elpaulo]

14:31 Elpaulo has joined #linux-amlogic

14:42 Barada has quit [Quit: Barada]

14:55 tingoose has joined #linux-amlogic

15:38 fedux has joined #linux-amlogic

16:27 vagrantc has joined #linux-amlogic

17:16 BlueMatt has quit [Excess Flood]

17:17 BlueMatt has joined #linux-amlogic

17:21 fedux has quit []

18:04 Ntemis has joined #linux-amlogic

18:08 Guest8491 has quit [Quit: Bye]

18:10 mag has joined #linux-amlogic

18:13 <ndufresne> Ely, I'm trying to understand the purpose of load_extended_firmware(), the h264 implementation seems to allocate cma, and copy into it, and later free it, I don't see any use of ext_fw_vaddr. Am I missing something ?

18:14 <Ely> writel_relaxed(h264->ext_fw_paddr, core->dos_base + AV_SCRATCH_G);

18:14 <Ely> you had me worried for a second :D

18:16 <ndufresne> I'm blind ...

18:17 <Ely> Regarding what's inside that extended firmware, I have no idea. There are 5 sections of microcode named "header, data, mmco, list, slice"

18:17 <Ely> well you probably already know about them

18:19 <ndufresne> yeah, no idea

18:20 <ndufresne> I was trying to learn something about MDEC_PIC_DC_CTRL

18:20 <Ely> ah, the magic NV21 register..

18:20 <ndufresne> all I know is that (1 << 17) enables NV21 (some kind of), but what happens if you don't enable that ?

18:21 <Ely> then you get a 3-plane format and you have to map 3 canvases per buffer

18:21 <Ely> at least in 3.10/S805. The more recent the code, the more everything is hardcoded to nv21

18:23 <Ely> No idea about the specific 3-plane pixfmt. With your canvas changes maybe you get yuv420p.

18:24 <ndufresne> ok, sounds like I420, or YUV420P in v4l2 terms

18:24 <ndufresne> well, no idea which plane will be U and which one V ;-P

18:25 <Ely> #else

18:25 <Ely> buffer_spec[i].v_canvas_index = 128 + i * 3 + 2;

18:25 <Ely> buffer_spec[i].u_canvas_index = 128 + i * 3 + 1;

18:25 <Ely> buffer_spec[i].y_canvas_index = 128 + i * 3;

18:25 <Ely> so.. yeah probably I420
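
The canvas numbering in the pasted snippet follows a simple pattern: three consecutive canvas indices per buffer, starting at 128, in Y, U, V order. A sketch of that pattern, with a hypothetical function name:

```python
CANVAS_BASE = 128       # first canvas index used in the pasted snippet
CANVASES_PER_BUFFER = 3  # one canvas each for the Y, U and V planes


def canvas_indices(buffer_index):
    """Return the Y/U/V canvas indices for one decode buffer."""
    base = CANVAS_BASE + buffer_index * CANVASES_PER_BUFFER
    return {"y": base, "u": base + 1, "v": base + 2}


for i in range(3):
    print(i, canvas_indices(i))
# first line printed: 0 {'y': 128, 'u': 129, 'v': 130}
```

With U placed before V, the plane order matches I420 (YUV420P), which is what the messages above conclude.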

18:26 <dsd_> i guess that is the setup we have in endless 3.10 kernel

18:26 <dsd_> unfortunately its not quite a standard format

18:26 <Ely> Don't you use ge2d to convert the buffer to a standard pixfmt?

18:26 <dsd_> but you can use ge2d to convert it to a standard format

18:27 <Ely> ndufresne recently discovered that you can configure the canvas in a way that you get proper NV21 or NV12 out of the decoder

18:27 <Ely> no tiling, no endianness..

18:28 <dsd_> nice

18:28 <dsd_> got any details there?

18:29 <dsd_> i wonder if that is S805 or S905+ only. we did ask amlgog

18:29 <dsd_> i think we asked amlogic at the time and they pointed us to ge2d only

18:30 <Ely> https://paste.fedoraproject.org/paste/85DDK1SfDDK0sSS5HRGa9g

18:30 <Ely> What you're looking for to get NV12 is linear mode + the proper bits in endian control

18:31 <Ely> https://github.com/Elyotna/linux/commit/c7978cb8ae88f71a12574f0111986cab44e389c2

18:34 <dsd_> thanks a lot, very cool

18:38 <ndufresne> the canvas has an extra bit mask (generally unused in the numerous copies of the canvas implementation)

18:39 <ndufresne> counting from the right, each bit does one swap: bytes within each 16 bits, then 16-bit words within each 32 bits, then 32-bit words within each 64 bits, then 64-bit words within each 128 bits

18:41 <ndufresne> we saw that bytes were swapped across 64 bits, so setting the 3 least significant bits (0x7) will reverse the order back, then on the uv plane, if you set 0x6, you get NV21

18:42 <ndufresne> Ely, btw, I found that in hevc driver

18:42 <narmstrong> i will need to take care of these bits in the drm driver

18:42 <dsd_> NV21 or NV12?

18:42 <narmstrong> both !

18:42 <ndufresne> yes, we can select both

18:43 <Ely> Isn't it "then on uv plane, if you set 0x6, you get NV12" ?

18:44 <ndufresne> I'm quite sure having 0x7 0x7 from both canvas produced NV12, because I had the color swapped at first

18:44 <ndufresne> so 0x7 0x6 should be NV21

18:44 <Ely> Oh I see

18:44 <ndufresne> it's natively NV21, but as we said, all bytes over 64bit are in reversed order
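
To see what the 0x7 and 0x6 masks do to the byte order, here is a software model of the swap bits as they are described in this conversation. It is a sketch of the described behaviour, not of the hardware itself, and the function names are hypothetical.

```python
def swap_halves(data, unit):
    """Swap the two unit-byte halves inside every group of 2*unit bytes."""
    out = bytearray()
    for start in range(0, len(data), 2 * unit):
        group = data[start:start + 2 * unit]
        out += group[unit:] + group[:unit]
    return bytes(out)


def apply_endian_mask(data, mask):
    """Apply the swaps selected by mask, lowest bit first.

    bit 0: bytes within 16 bits, bit 1: 16-bit words within 32 bits,
    bit 2: 32-bit words within 64 bits, bit 3: 64-bit words within 128 bits.
    """
    for bit, unit in enumerate((1, 2, 4, 8)):
        if mask & (1 << bit):
            data = swap_halves(data, unit)
    return data


sample = bytes(range(8))  # one 64-bit word: 00 01 02 03 04 05 06 07
print(apply_endian_mask(sample, 0x7).hex())  # prints: 0706050403020100
print(apply_endian_mask(sample, 0x6).hex())  # prints: 0607040502030001
```

With 0x7 the whole 64-bit word is reversed byte by byte; with 0x6 the 16-bit pairs are reversed but each pair keeps its internal order. On an interleaved chroma plane that difference swaps the two chroma bytes of every pair, which is the NV12 versus NV21 distinction being discussed.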

18:46 <ndufresne> dsd_, so it's likely we'll be able to expose NV12, NV21 and I420, + 32x32 tiling

18:46 <ndufresne> when we get there ;-P

18:47 <ndufresne> but it's not clear yet if tiling improves performance

18:48 * narmstrong trying the last version, working !!

18:48 <narmstrong> Ely: what are your options to kmssink to make it smooth ?

18:49 <Ely> sync=false helps a ton at the end iirc

18:50 <Ely> Also the perf seems better since we switched to NV12

18:50 <narmstrong> ok, but it plays at 1/2 speed, I need to apply the patch ndufresne pointed us

18:51 <narmstrong> Ely: did you apply the patch to kmssink ?

18:51 <Ely> gst-launch-1.0 souphttpsrc location=<loc> ! parsebin ! v4l2video0dec ! videoconvert n-threads=4 ! kmssink driver-name=meson force-modesetting=true connector-id=31 max_lateness=-1 sync=false

18:51 <Ely> No I didn't apply any gst patch

18:52 <ndufresne> I also get steady 15fps

18:53 <ndufresne> Ely, the reason for the sync=0 being better is because of how v4l2video0dec guesses the decoder latency

18:53 <Ely> huh. 720p30/1080p30 plays almost smoothly on my TV

18:54 <ndufresne> I'll need to figure-out something

18:54 <narmstrong> bbb sunflower 1080p30 ?

18:54 <ndufresne> Ely, with sync=0, max-lateness should have no effect, also max_lateness is not a property of kmssink

18:54 <Ely> ah okay, I'll remove it then.

18:55 <Ely> narmstrong: my kernel isn't in a state where I can play a H.264 video right now (I'm getting at HEVC), sorry :S

18:55 <ndufresne> narmstrong, I was surprised, but here, after fresh boot, I don't need to pass the connector and force the mode

18:55 <narmstrong> Ely: don't worry !

18:56 <ndufresne> narmstrong, I'm running bbb_sunflower_1080p_30fps_normal.mp4 here, it breaks at two spots

18:56 <ndufresne> first one is a usual spot, as I don't think the file is totally legit, the second spot was new to me

18:57 <narmstrong> ndufresne: it's because the HDMI connector is linked to an encoder since the fbcon uses it, but when kmssink stops it disconnects the connector and at the next run, HDMI and CVBS are equal and CVBS is selected

18:59 <narmstrong> kodi has the same issue...

18:59 <ndufresne> fair

19:00 <ndufresne> would be nice if kmssink also supported connectors by name, which I guess is what we have in old things like xrandr

19:03 <narmstrong> indeed, the number is board dependent

19:04 * ndufresne taking notes ;-P

19:05 Ntemis has quit [Remote host closed the connection]

19:26 <narmstrong> this `gst-launch-1.0 souphttpsrc location=http://download.blender.org/peach/bigbuckbunny_movies/big_buck_bunny_720p_h264.mov ! parsebin ! v4l2video0dec ! videoconvert n-threads=4 ! kmssink driver-name=meson force-modesetting=true connector-id=31 sync=false` is really smooth

19:27 <narmstrong> seems 1080p frames are too big to render in SW...

19:39 <narmstrong> Ely: your 648MHz fix is not good, you must force FCLK_DIV4 and use 666MHz like //termbin.com/r731

19:39 <narmstrong> *http://termbin.com/r731

19:40 <Ely> narmstrong oh, oops. I thought the clock framework automatically found the best parent to use

19:40 <narmstrong> Ely: the lowest, 500MHz is the lowest closest

19:40 <narmstrong> or you must add some other flags

19:41 tingoose has quit [Ping timeout: 256 seconds]

19:41 <Ely> okay

19:42 <Ely> but are you sure the clk framework doesn't try all the possible parents to see which one will give the best result ?

19:42 <Ely> closest* result

19:42 <Ely> well, lowest closest, whatever

19:42 <Ely> :D

19:42 <narmstrong> yes, it should if you have the right flags

19:42 <Ely> alright

19:43 <Ely> I think they use something else than DIV4 for 648 though let me check

19:43 <narmstrong> you will need to remove CLK_SET_RATE_NO_REPARENT then !

19:43 <narmstrong> they must use DIV1 and a divider on the VDEC1_DIV

19:44 <narmstrong> hmm, it can only use DIV3, 4, 5 & 7

19:44 <Ely> yes it's just like the vpu clks

19:45 <narmstrong> set to 666MHz, it's the max clock rate

19:45 <Ely> nvm the code says 648M but it's div4 with divider=1

19:45 <Ely> so indeed 666

19:55 Ivanovic has quit [Quit: Caught sigterm, terminating...]

19:55 Ivanovic has joined #linux-amlogic

19:55 Ivanovic has quit [Changing host]

19:55 Ivanovic has joined #linux-amlogic

19:56 <Ely> what's the clock of xtal ?

19:58 <Ely> "but it's div3* with divider=1"

19:58 <Ely> and nevermind, found it in the HK datasheets
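
The "lowest closest" behaviour discussed above can be modelled with a few lines. This is a simplified sketch, not the clk framework's code: it assumes the fixed PLL runs at 2 GHz (a figure not stated in this log) and only considers the DIV3, 4, 5 and 7 parents with a post-divider of 1.

```python
FCLK_HZ = 2_000_000_000     # assumption: fixed PLL rate, not from the log
PARENT_DIVS = (3, 4, 5, 7)  # the FCLK dividers mentioned in the conversation


def best_parent(target_hz):
    """Pick the highest parent rate that does not exceed target_hz.

    Returns (rate_hz, fclk_divider), or None if every parent is too fast.
    """
    candidates = [
        (FCLK_HZ // div, div)
        for div in PARENT_DIVS
        if FCLK_HZ // div <= target_hz
    ]
    return max(candidates) if candidates else None


print(best_parent(648_000_000))  # prints: (500000000, 4)
print(best_parent(667_000_000))  # prints: (666666666, 3)
```

Under these assumptions a 648MHz request rounds down to the 500MHz parent, while asking for roughly 666MHz lands on FCLK_DIV3, consistent with the correction to div3 a few messages earlier.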

20:05 <Ely> narmstrong: but I think it makes sense to remove the clk flag, I'll want to not care about what the parent is in the future

20:06 <Ely> What's the flag to get closest instead of lowest ?

20:20 <narmstrong> Hmm, none, just put 666 you won’t get upper anyway

20:26 <Ely> Sure, but how do you tell the clk framework to autoselect the parent without specifically telling it ? Like, I don't want to have to put this entry: assigned-clock-parents

20:58 <narmstrong> Remove the flag on clk_sel

21:03 <Ely> Thank you!

21:06 dsd_ has quit [Quit: Lost terminal]

21:32 trem_ has quit [Quit: Leaving]

23:44 <chewitt> xdarklight: someone was asking about the meson6 HDMI IP (not being the same Synopsys part as S805/905). Contacts say it's from Transwitch