diff mbox

ARM: exynos_defconfig: disable CONFIG_EXYNOS5420_MCPM; not stable

Message ID 1414796382-5447-1-git-send-email-khilman@kernel.org (mailing list archive)
State New, archived
Headers show

Commit Message

Kevin Hilman Oct. 31, 2014, 10:59 p.m. UTC
From: Kevin Hilman <khilman@linaro.org>

The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
during boot testing, causing various userspace startup failures.

Disable until it has gotten more testing.

Cc: Kukjin Kim <kgene.kim@samsung.com>,
Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
Cc: Sachin Kamat <sachin.kamat@samsung.com>,
Cc: Doug Anderson <dianders@chromium.org>,
Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
Cc: Tushar Behera <tushar.behera@linaro.org>,
Cc: stable@vger.kernel.org # v3.17+
Signed-off-by: Kevin Hilman <khilman@linaro.org>
---
This has been reported by a few people[1], but not investigated or fixed, so it's
time to disable this feature until it can be fixed.

[1] http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/288344.html

 arch/arm/configs/exynos_defconfig | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

Comments

kgene@kernel.org Nov. 8, 2014, 9:50 a.m. UTC | #1
Kevin Hilman wrote:
> 
> From: Kevin Hilman <khilman@linaro.org>
> 
> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
> during boot testing, causing various userspace startup failures.
> 
> Disable until it has gotten more testing.
> 
> Cc: Kukjin Kim <kgene.kim@samsung.com>,
> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
> Cc: Doug Anderson <dianders@chromium.org>,
> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
> Cc: Tushar Behera <tushar.behera@linaro.org>,
> Cc: stable@vger.kernel.org # v3.17+
> Signed-off-by: Kevin Hilman <khilman@linaro.org>
> ---
> This has been reported by a few people[1], but not investigated or fixed, so it's
> time to disable this feature until it can be fixed.
> 
Hi Kevin,

Yeah I agree with your opinion.

But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
be shown in -next in this weekend. Anyway let me apply this into -fixes and
then let's enable after test its functionality in -next in a couple of days.

Thanks,
Kukjin


> [1] http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/288344.html
> 
>  arch/arm/configs/exynos_defconfig | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/arch/arm/configs/exynos_defconfig b/arch/arm/configs/exynos_defconfig
> index 72058b8a6f4d..a250dcbf34cd 100644
> --- a/arch/arm/configs/exynos_defconfig
> +++ b/arch/arm/configs/exynos_defconfig
> @@ -10,7 +10,7 @@ CONFIG_MODULE_UNLOAD=y
>  CONFIG_PARTITION_ADVANCED=y
>  CONFIG_ARCH_EXYNOS=y
>  CONFIG_ARCH_EXYNOS3=y
> -CONFIG_EXYNOS5420_MCPM=y
> +CONFIG_EXYNOS5420_MCPM=n
>  CONFIG_SMP=y
>  CONFIG_BIG_LITTLE=y
>  CONFIG_BL_SWITCHER=y
> --
> 2.1.0
Kevin Hilman Nov. 10, 2014, 7:35 p.m. UTC | #2
Kukjin Kim <kgene@kernel.org> writes:

> Kevin Hilman wrote:
>> 
>> From: Kevin Hilman <khilman@linaro.org>
>> 
>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>> during boot testing, causing various userspace startup failures.
>> 
>> Disable until it has gotten more testing.
>> 
>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>> Cc: Doug Anderson <dianders@chromium.org>,
>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>> Cc: stable@vger.kernel.org # v3.17+
>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>> ---
>> This has been reported by a few people[1], but not investigated or fixed, so it's
>> time to disable this feature until it can be fixed.
>> 
> Hi Kevin,
>
> Yeah I agree with your opinion.
>
> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
> be shown in -next in this weekend. 

Which of the recently queued patches are expected to address the
imprecise abort issue?  I'd be happy to test them out.  

> Anyway let me apply this into -fixes and
> then let's enable after test its functionality in -next in a couple of days.

Yes, I think this needs to be applied until these aborts are understood
and fixed.

Thanks,

Kevin
Kevin Hilman Nov. 24, 2014, 7:51 p.m. UTC | #3
Kukjin,

On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
> Kukjin Kim <kgene@kernel.org> writes:
>
>> Kevin Hilman wrote:
>>>
>>> From: Kevin Hilman <khilman@linaro.org>
>>>
>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>>> during boot testing, causing various userspace startup failures.
>>>
>>> Disable until it has gotten more testing.
>>>
>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>>> Cc: Doug Anderson <dianders@chromium.org>,
>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>>> Cc: stable@vger.kernel.org # v3.17+
>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>>> ---
>>> This has been reported by a few people[1], but not investigated or fixed, so it's
>>> time to disable this feature until it can be fixed.
>>>
>> Hi Kevin,
>>
>> Yeah I agree with your opinion.
>>
>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
>> be shown in -next in this weekend.
>
> Which of the recently queued patches are expected to address the
> imprecise abort issue?  I'd be happy to test them out.

Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.

What is the status of $SUBJECT patch?

>> Anyway let me apply this into -fixes and
>> then let's enable after test its functionality in -next in a couple of days.
>
> Yes, I think this needs to be applied until these aborts are understood
> and fixed.

Is anyone at Samsung actually looking into these MCPM issues?

Kevin
Olof Johansson Nov. 25, 2014, 12:25 a.m. UTC | #4
On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
> Kukjin,
>
> On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> Kukjin Kim <kgene@kernel.org> writes:
>>
>>> Kevin Hilman wrote:
>>>>
>>>> From: Kevin Hilman <khilman@linaro.org>
>>>>
>>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>>>> during boot testing, causing various userspace startup failures.
>>>>
>>>> Disable until it has gotten more testing.
>>>>
>>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>>>> Cc: Doug Anderson <dianders@chromium.org>,
>>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>>>> Cc: stable@vger.kernel.org # v3.17+
>>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>>>> ---
>>>> This has been reported by a few people[1], but not investigated or fixed, so it's
>>>> time to disable this feature until it can be fixed.
>>>>
>>> Hi Kevin,
>>>
>>> Yeah I agree with your opinion.
>>>
>>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
>>> be shown in -next in this weekend.
>>
>> Which of the recently queued patches are expected to address the
>> imprecise abort issue?  I'd be happy to test them out.
>
> Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.
>
> What is the status of $SUBJECT patch?
>
>>> Anyway let me apply this into -fixes and
>>> then let's enable after test its functionality in -next in a couple of days.
>>
>> Yes, I think this needs to be applied until these aborts are understood
>> and fixed.
>
> Is anyone at Samsung actually looking into these MCPM issues?

Hi Kevin,

What hardware are you having problems with? 5420 or 5422/5800?


-Olof
Kevin Hilman Nov. 25, 2014, 1:35 a.m. UTC | #5
On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> Kukjin,
>>
>> On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>> Kukjin Kim <kgene@kernel.org> writes:
>>>
>>>> Kevin Hilman wrote:
>>>>>
>>>>> From: Kevin Hilman <khilman@linaro.org>
>>>>>
>>>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>>>>> during boot testing, causing various userspace startup failures.
>>>>>
>>>>> Disable until it has gotten more testing.
>>>>>
>>>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>>>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>>>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>>>>> Cc: Doug Anderson <dianders@chromium.org>,
>>>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>>>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>>>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>>>>> Cc: stable@vger.kernel.org # v3.17+
>>>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>>>>> ---
>>>>> This has been reported by a few people[1], but not investigated or fixed, so it's
>>>>> time to disable this feature until it can be fixed.
>>>>>
>>>> Hi Kevin,
>>>>
>>>> Yeah I agree with your opinion.
>>>>
>>>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
>>>> be shown in -next in this weekend.
>>>
>>> Which of the recently queued patches are expected to address the
>>> imprecise abort issue?  I'd be happy to test them out.
>>
>> Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.
>>
>> What is the status of $SUBJECT patch?
>>
>>>> Anyway let me apply this into -fixes and
>>>> then let's enable after test its functionality in -next in a couple of days.
>>>
>>> Yes, I think this needs to be applied until these aborts are understood
>>> and fixed.
>>
>> Is anyone at Samsung actually looking into these MCPM issues?
>
> Hi Kevin,
>
> What hardware are you having problems with? 5420 or 5422/5800?

Yes.  :)

exynos5420-arndale-octa:
http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
exynos5422-odroid-xu3:
http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-odroid-xu3.html

My boot tests seem to pass fine because I have such a minimal
userspace, but Tyler Baker reported that with a "real" userspace, he
can't boot to a shell:

  http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html

Kevin
Olof Johansson Nov. 25, 2014, 1:37 a.m. UTC | #6
On Mon, Nov 24, 2014 at 5:35 PM, Kevin Hilman <khilman@kernel.org> wrote:
> On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
>> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>> Kukjin,
>>>
>>> On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>>> Kukjin Kim <kgene@kernel.org> writes:
>>>>
>>>>> Kevin Hilman wrote:
>>>>>>
>>>>>> From: Kevin Hilman <khilman@linaro.org>
>>>>>>
>>>>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>>>>>> during boot testing, causing various userspace startup failures.
>>>>>>
>>>>>> Disable until it has gotten more testing.
>>>>>>
>>>>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>>>>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>>>>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>>>>>> Cc: Doug Anderson <dianders@chromium.org>,
>>>>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>>>>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>>>>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>>>>>> Cc: stable@vger.kernel.org # v3.17+
>>>>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>>>>>> ---
>>>>>> This has been reported by a few people[1], but not investigated or fixed, so it's
>>>>>> time to disable this feature until it can be fixed.
>>>>>>
>>>>> Hi Kevin,
>>>>>
>>>>> Yeah I agree with your opinion.
>>>>>
>>>>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
>>>>> be shown in -next in this weekend.
>>>>
>>>> Which of the recently queued patches are expected to address the
>>>> imprecise abort issue?  I'd be happy to test them out.
>>>
>>> Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.
>>>
>>> What is the status of $SUBJECT patch?
>>>
>>>>> Anyway let me apply this into -fixes and
>>>>> then let's enable after test its functionality in -next in a couple of days.
>>>>
>>>> Yes, I think this needs to be applied until these aborts are understood
>>>> and fixed.
>>>
>>> Is anyone at Samsung actually looking into these MCPM issues?
>>
>> Hi Kevin,
>>
>> What hardware are you having problems with? 5420 or 5422/5800?
>
> Yes.  :)
>
> exynos5420-arndale-octa:
> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
> exynos5422-odroid-xu3:
> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-odroid-xu3.html
>
> My boot tests seem to pass fine because I have such a minimal
> userspace, but Tyler Baker reported that with a "real" userspace, he
> can't boot to a shell:
>
>   http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html

I'm not surprised that 5420 has issues, but I have not seen any
external aborts on neither Chromebook that I have in my farm.

Sounds like the secondary cpus should be disabled on those device
trees instead, doesn't it?


-Olof
Olof Johansson Nov. 25, 2014, 1:38 a.m. UTC | #7
On Mon, Nov 24, 2014 at 5:37 PM, Olof Johansson <olof@lixom.net> wrote:
> On Mon, Nov 24, 2014 at 5:35 PM, Kevin Hilman <khilman@kernel.org> wrote:
>> On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
>>> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>>> Kukjin,
>>>>
>>>> On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>>>> Kukjin Kim <kgene@kernel.org> writes:
>>>>>
>>>>>> Kevin Hilman wrote:
>>>>>>>
>>>>>>> From: Kevin Hilman <khilman@linaro.org>
>>>>>>>
>>>>>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>>>>>>> during boot testing, causing various userspace startup failures.
>>>>>>>
>>>>>>> Disable until it has gotten more testing.
>>>>>>>
>>>>>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>>>>>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>>>>>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>>>>>>> Cc: Doug Anderson <dianders@chromium.org>,
>>>>>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>>>>>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>>>>>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>>>>>>> Cc: stable@vger.kernel.org # v3.17+
>>>>>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>>>>>>> ---
>>>>>>> This has been reported by a few people[1], but not investigated or fixed, so it's
>>>>>>> time to disable this feature until it can be fixed.
>>>>>>>
>>>>>> Hi Kevin,
>>>>>>
>>>>>> Yeah I agree with your opinion.
>>>>>>
>>>>>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
>>>>>> be shown in -next in this weekend.
>>>>>
>>>>> Which of the recently queued patches are expected to address the
>>>>> imprecise abort issue?  I'd be happy to test them out.
>>>>
>>>> Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.
>>>>
>>>> What is the status of $SUBJECT patch?
>>>>
>>>>>> Anyway let me apply this into -fixes and
>>>>>> then let's enable after test its functionality in -next in a couple of days.
>>>>>
>>>>> Yes, I think this needs to be applied until these aborts are understood
>>>>> and fixed.
>>>>
>>>> Is anyone at Samsung actually looking into these MCPM issues?
>>>
>>> Hi Kevin,
>>>
>>> What hardware are you having problems with? 5420 or 5422/5800?
>>
>> Yes.  :)
>>
>> exynos5420-arndale-octa:
>> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>> exynos5422-odroid-xu3:
>> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-odroid-xu3.html
>>
>> My boot tests seem to pass fine because I have such a minimal
>> userspace, but Tyler Baker reported that with a "real" userspace, he
>> can't boot to a shell:
>>
>>   http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html
>
> I'm not surprised that 5420 has issues, but I have not seen any
> external aborts on neither Chromebook that I have in my farm.
>
> Sounds like the secondary cpus should be disabled on those device
> trees instead, doesn't it?

Er, cluster, not cpus.


-Olof
kgene@kernel.org Nov. 25, 2014, 1:50 a.m. UTC | #8
Olof Johansson wrote:
> 
> On Mon, Nov 24, 2014 at 5:37 PM, Olof Johansson <olof@lixom.net> wrote:
> > On Mon, Nov 24, 2014 at 5:35 PM, Kevin Hilman <khilman@kernel.org> wrote:
> >> On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
> >>> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
> >>>> Kukjin,
> >>>>
> >>>> On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
> >>>>> Kukjin Kim <kgene@kernel.org> writes:
> >>>>>
> >>>>>> Kevin Hilman wrote:
> >>>>>>>
> >>>>>>> From: Kevin Hilman <khilman@linaro.org>
> >>>>>>>
> >>>>>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
> >>>>>>> during boot testing, causing various userspace startup failures.
> >>>>>>>
> >>>>>>> Disable until it has gotten more testing.
> >>>>>>>
> >>>>>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
> >>>>>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
> >>>>>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
> >>>>>>> Cc: Doug Anderson <dianders@chromium.org>,
> >>>>>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
> >>>>>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
> >>>>>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
> >>>>>>> Cc: stable@vger.kernel.org # v3.17+
> >>>>>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
> >>>>>>> ---
> >>>>>>> This has been reported by a few people[1], but not investigated or fixed, so it's
> >>>>>>> time to disable this feature until it can be fixed.
> >>>>>>>
> >>>>>> Hi Kevin,
> >>>>>>
> >>>>>> Yeah I agree with your opinion.
> >>>>>>
> >>>>>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
> >>>>>> be shown in -next in this weekend.
> >>>>>
> >>>>> Which of the recently queued patches are expected to address the
> >>>>> imprecise abort issue?  I'd be happy to test them out.
> >>>>
> >>>> Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.
> >>>>
> >>>> What is the status of $SUBJECT patch?
> >>>>
> >>>>>> Anyway let me apply this into -fixes and
> >>>>>> then let's enable after test its functionality in -next in a couple of days.
> >>>>>
> >>>>> Yes, I think this needs to be applied until these aborts are understood
> >>>>> and fixed.
> >>>>
> >>>> Is anyone at Samsung actually looking into these MCPM issues?
> >>>
> >>> Hi Kevin,
> >>>
> >>> What hardware are you having problems with? 5420 or 5422/5800?
> >>
> >> Yes.  :)
> >>
> >> exynos5420-arndale-octa:
> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-
> arndale-octa.html
> >> exynos5422-odroid-xu3:
> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-
> odroid-xu3.html
> >>
> >> My boot tests seem to pass fine because I have such a minimal
> >> userspace, but Tyler Baker reported that with a "real" userspace, he
> >> can't boot to a shell:
> >>
> >>   http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html
> >
Hmm...his report was in Sep...I think it should be fine with current -next?

To be honest, since I don't have the exynos5420 arndale, chromebook...but smdk
which has different bootloader, I couldn't test it...I'll try to make a test
farm like you guys...

> > I'm not surprised that 5420 has issues, but I have not seen any

Sorry.

> > external aborts on neither Chromebook that I have in my farm.
> >
> > Sounds like the secondary cpus should be disabled on those device
> > trees instead, doesn't it?
> 
> Er, cluster, not cpus.

- Kukjin
Kevin Hilman Nov. 25, 2014, 2:13 a.m. UTC | #9
On Mon, Nov 24, 2014 at 5:37 PM, Olof Johansson <olof@lixom.net> wrote:
> On Mon, Nov 24, 2014 at 5:35 PM, Kevin Hilman <khilman@kernel.org> wrote:
>> On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
>>> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>>> Kukjin,
>>>>
>>>> On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>>>> Kukjin Kim <kgene@kernel.org> writes:
>>>>>
>>>>>> Kevin Hilman wrote:
>>>>>>>
>>>>>>> From: Kevin Hilman <khilman@linaro.org>
>>>>>>>
>>>>>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>>>>>>> during boot testing, causing various userspace startup failures.
>>>>>>>
>>>>>>> Disable until it has gotten more testing.
>>>>>>>
>>>>>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>>>>>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>>>>>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>>>>>>> Cc: Doug Anderson <dianders@chromium.org>,
>>>>>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>>>>>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>>>>>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>>>>>>> Cc: stable@vger.kernel.org # v3.17+
>>>>>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>>>>>>> ---
>>>>>>> This has been reported by a few people[1], but not investigated or fixed, so it's
>>>>>>> time to disable this feature until it can be fixed.
>>>>>>>
>>>>>> Hi Kevin,
>>>>>>
>>>>>> Yeah I agree with your opinion.
>>>>>>
>>>>>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
>>>>>> be shown in -next in this weekend.
>>>>>
>>>>> Which of the recently queued patches are expected to address the
>>>>> imprecise abort issue?  I'd be happy to test them out.
>>>>
>>>> Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.
>>>>
>>>> What is the status of $SUBJECT patch?
>>>>
>>>>>> Anyway let me apply this into -fixes and
>>>>>> then let's enable after test its functionality in -next in a couple of days.
>>>>>
>>>>> Yes, I think this needs to be applied until these aborts are understood
>>>>> and fixed.
>>>>
>>>> Is anyone at Samsung actually looking into these MCPM issues?
>>>
>>> Hi Kevin,
>>>
>>> What hardware are you having problems with? 5420 or 5422/5800?
>>
>> Yes.  :)
>>
>> exynos5420-arndale-octa:
>> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>> exynos5422-odroid-xu3:
>> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-odroid-xu3.html
>>
>> My boot tests seem to pass fine because I have such a minimal
>> userspace, but Tyler Baker reported that with a "real" userspace, he
>> can't boot to a shell:
>>
>>   http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html
>
> I'm not surprised that 5420 has issues, but I have not seen any
> external aborts on neither Chromebook that I have in my farm.
>
> Sounds like the secondary cpus should be disabled on those device
> trees instead, doesn't it?

That's possible.

Unfortunately, I've gotten very little support from Samsung on this
and it was originally reported 2.5 months ago[2], so I think that the
5420 MCPM should be disabled until they can propose the right fix, and
actually test it.

Also, I tried disabling some CPUs at boot time, but the
exynos5422-odroid-xu3 wont' even boot with nr_cpus=1 or 4 (or anything
less than 8)

Kevin

[1]  http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html
Kevin Hilman Nov. 25, 2014, 3:20 a.m. UTC | #10
On Mon, Nov 24, 2014 at 5:50 PM, Kukjin Kim <kgene@kernel.org> wrote:
> Olof Johansson wrote:
>>
>> On Mon, Nov 24, 2014 at 5:37 PM, Olof Johansson <olof@lixom.net> wrote:
>> > On Mon, Nov 24, 2014 at 5:35 PM, Kevin Hilman <khilman@kernel.org> wrote:
>> >> On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
>> >>> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> >>>> Kukjin,
>> >>>>
>> >>>> On Mon, Nov 10, 2014 at 11:35 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> >>>>> Kukjin Kim <kgene@kernel.org> writes:
>> >>>>>
>> >>>>>> Kevin Hilman wrote:
>> >>>>>>>
>> >>>>>>> From: Kevin Hilman <khilman@linaro.org>
>> >>>>>>>
>> >>>>>>> The option CONFIG_EXYNOS5420_MCPM is causing imprecise external aborts
>> >>>>>>> during boot testing, causing various userspace startup failures.
>> >>>>>>>
>> >>>>>>> Disable until it has gotten more testing.
>> >>>>>>>
>> >>>>>>> Cc: Kukjin Kim <kgene.kim@samsung.com>,
>> >>>>>>> Cc: Javier Martinez Canillas <javier.martinez@collabora.co.uk>,
>> >>>>>>> Cc: Sachin Kamat <sachin.kamat@samsung.com>,
>> >>>>>>> Cc: Doug Anderson <dianders@chromium.org>,
>> >>>>>>> Cc: Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
>> >>>>>>> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com>,
>> >>>>>>> Cc: Tushar Behera <tushar.behera@linaro.org>,
>> >>>>>>> Cc: stable@vger.kernel.org # v3.17+
>> >>>>>>> Signed-off-by: Kevin Hilman <khilman@linaro.org>
>> >>>>>>> ---
>> >>>>>>> This has been reported by a few people[1], but not investigated or fixed, so it's
>> >>>>>>> time to disable this feature until it can be fixed.
>> >>>>>>>
>> >>>>>> Hi Kevin,
>> >>>>>>
>> >>>>>> Yeah I agree with your opinion.
>> >>>>>>
>> >>>>>> But as you can see my tree, I've queued regarding mcpm patches for 3.19 will
>> >>>>>> be shown in -next in this weekend.
>> >>>>>
>> >>>>> Which of the recently queued patches are expected to address the
>> >>>>> imprecise abort issue?  I'd be happy to test them out.
>> >>>>
>> >>>> Exynos5 MCPM is still broken in linux-next and still causing an imprecise abort.
>> >>>>
>> >>>> What is the status of $SUBJECT patch?
>> >>>>
>> >>>>>> Anyway let me apply this into -fixes and
>> >>>>>> then let's enable after test its functionality in -next in a couple of days.
>> >>>>>
>> >>>>> Yes, I think this needs to be applied until these aborts are understood
>> >>>>> and fixed.
>> >>>>
>> >>>> Is anyone at Samsung actually looking into these MCPM issues?
>> >>>
>> >>> Hi Kevin,
>> >>>
>> >>> What hardware are you having problems with? 5420 or 5422/5800?
>> >>
>> >> Yes.  :)
>> >>
>> >> exynos5420-arndale-octa:
>> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-
>> arndale-octa.html
>> >> exynos5422-odroid-xu3:
>> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-
>> odroid-xu3.html
>> >>
>> >> My boot tests seem to pass fine because I have such a minimal
>> >> userspace, but Tyler Baker reported that with a "real" userspace, he
>> >> can't boot to a shell:
>> >>
>> >>   http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html
>> >
> Hmm...his report was in Sep...I think it should be fine with current -next?

No, it is still broken in linux-next (as I stated above.)

Moreover, earlier in this thread you mentioned you were merging some
MCPM patches that should address this, but did not respond when I
asked which patches you thing should address this issue

> To be honest, since I don't have the exynos5420 arndale, chromebook...but smdk
> which has different bootloader, I couldn't test it...I'll try to make a test
> farm like you guys...

Do you have some colleagues with any other 542x hardware?  I had
assumed that linux-next was being better tested on the publicaly
available, and widely available boards like odroid-xu3 and
Chromebook2, but I've come to realize the hard way that that is not
the case.  You mention your board has a different bootloader.  Do you
suspect there's a bootloader issue on these other platforms?  If so,
could you elaborate on possible fixes?  I'm more than willing to test
any proposed fixes, but I'm not familiar enough yet with these SoCs to
figure out the underlying issues alone.

Until you have a working board farm, you could start having a closer
look at the boot logs we're already producing.  Admittedly linux-next
broken in many ways besides this one for exynos currently, but it has
been having these imprecise aborts well before the other recent
issues.

Also, It's very possible that this issue is not even MCPM related at
all, and MCPM is just uncovering a previously hidden bug.  It would be
very helpful if people more familiar with this hardware and SoC would
investigate bug reports like these.

Kevin
Krzysztof Kozlowski Nov. 25, 2014, 8:47 a.m. UTC | #11
On pon, 2014-11-24 at 19:20 -0800, Kevin Hilman wrote:
> On Mon, Nov 24, 2014 at 5:50 PM, Kukjin Kim <kgene@kernel.org> wrote:
> > Olof Johansson wrote:
> >>
> >> On Mon, Nov 24, 2014 at 5:37 PM, Olof Johansson <olof@lixom.net> wrote:
> >> > On Mon, Nov 24, 2014 at 5:35 PM, Kevin Hilman <khilman@kernel.org> wrote:
> >> >> On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
> >> >>> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
> >> >>>>
> >> >>>> Is anyone at Samsung actually looking into these MCPM issues?
> >> >>>
> >> >>> Hi Kevin,
> >> >>>
> >> >>> What hardware are you having problems with? 5420 or 5422/5800?
> >> >>
> >> >> Yes.  :)
> >> >>
> >> >> exynos5420-arndale-octa:
> >> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-
> >> arndale-octa.html
> >> >> exynos5422-odroid-xu3:
> >> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-
> >> odroid-xu3.html
> >> >>
> >> >> My boot tests seem to pass fine because I have such a minimal
> >> >> userspace, but Tyler Baker reported that with a "real" userspace, he
> >> >> can't boot to a shell:
> >> >>
> >> >>   http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html
> >> >
> > Hmm...his report was in Sep...I think it should be fine with current -next?
> 
> No, it is still broken in linux-next (as I stated above.)
> 
> Moreover, earlier in this thread you mentioned you were merging some
> MCPM patches that should address this, but did not respond when I
> asked which patches you thing should address this issue
> 
> > To be honest, since I don't have the exynos5420 arndale, chromebook...but smdk
> > which has different bootloader, I couldn't test it...I'll try to make a test
> > farm like you guys...
> 
> Do you have some colleagues with any other 542x hardware?  I had
> assumed that linux-next was being better tested on the publicaly
> available, and widely available boards like odroid-xu3 and
> Chromebook2, but I've come to realize the hard way that that is not
> the case.  You mention your board has a different bootloader.  Do you
> suspect there's a bootloader issue on these other platforms?  If so,
> could you elaborate on possible fixes?  I'm more than willing to test
> any proposed fixes, but I'm not familiar enough yet with these SoCs to
> figure out the underlying issues alone.
> 
> Until you have a working board farm, you could start having a closer
> look at the boot logs we're already producing.  Admittedly linux-next
> broken in many ways besides this one for exynos currently, but it has
> been having these imprecise aborts well before the other recent
> issues.
> 
> Also, It's very possible that this issue is not even MCPM related at
> all, and MCPM is just uncovering a previously hidden bug.  It would be
> very helpful if people more familiar with this hardware and SoC would
> investigate bug reports like these.

Interesting thing can be found in exynos5420.dtsi:
mdma1: mdma@11C10000 {
	...
	/*
	 * MDMA1 can support both secure and non-secure
	 * AXI transactions. When this is enabled in the kernel
	 * for boards that run in secure mode, we are getting
	 * imprecise external aborts causing the kernel to oops.
	 */
	status = "disabled";
};

I am booting Arndale Octa on some other config and exynos. However with
or without MCPM the imprecise aborts are still present (but not fatal,
shell comes up).

My board boots also under secure firmware (I am using Linaro's ubuntu
image). Maybe that is the cause?

Best regards,
Krzysztof
Krzysztof Kozlowski Nov. 25, 2014, 9:07 a.m. UTC | #12
On wto, 2014-11-25 at 09:47 +0100, Krzysztof Kozlowski wrote:
> On pon, 2014-11-24 at 19:20 -0800, Kevin Hilman wrote:
> > On Mon, Nov 24, 2014 at 5:50 PM, Kukjin Kim <kgene@kernel.org> wrote:
> > > Olof Johansson wrote:
> > >>
> > >> On Mon, Nov 24, 2014 at 5:37 PM, Olof Johansson <olof@lixom.net> wrote:
> > >> > On Mon, Nov 24, 2014 at 5:35 PM, Kevin Hilman <khilman@kernel.org> wrote:
> > >> >> On Mon, Nov 24, 2014 at 4:25 PM, Olof Johansson <olof@lixom.net> wrote:
> > >> >>> On Mon, Nov 24, 2014 at 11:51 AM, Kevin Hilman <khilman@kernel.org> wrote:
> > >> >>>>
> > >> >>>> Is anyone at Samsung actually looking into these MCPM issues?
> > >> >>>
> > >> >>> Hi Kevin,
> > >> >>>
> > >> >>> What hardware are you having problems with? 5420 or 5422/5800?
> > >> >>
> > >> >> Yes.  :)
> > >> >>
> > >> >> exynos5420-arndale-octa:
> > >> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5420-
> > >> arndale-octa.html
> > >> >> exynos5422-odroid-xu3:
> > >> >> http://storage.armcloud.us/kernel-ci/mainline/v3.18-rc6/arm-exynos_defconfig/boot-exynos5422-
> > >> odroid-xu3.html
> > >> >>
> > >> >> My boot tests seem to pass fine because I have such a minimal
> > >> >> userspace, but Tyler Baker reported that with a "real" userspace, he
> > >> >> can't boot to a shell:
> > >> >>
> > >> >>   http://lists.infradead.org/pipermail/linux-arm-kernel/2014-September/286203.html
> > >> >
> > > Hmm...his report was in Sep...I think it should be fine with current -next?
> > 
> > No, it is still broken in linux-next (as I stated above.)
> > 
> > Moreover, earlier in this thread you mentioned you were merging some
> > MCPM patches that should address this, but did not respond when I
> > asked which patches you thing should address this issue
> > 
> > > To be honest, since I don't have the exynos5420 arndale, chromebook...but smdk
> > > which has different bootloader, I couldn't test it...I'll try to make a test
> > > farm like you guys...
> > 
> > Do you have some colleagues with any other 542x hardware?  I had
> > assumed that linux-next was being better tested on the publicaly
> > available, and widely available boards like odroid-xu3 and
> > Chromebook2, but I've come to realize the hard way that that is not
> > the case.  You mention your board has a different bootloader.  Do you
> > suspect there's a bootloader issue on these other platforms?  If so,
> > could you elaborate on possible fixes?  I'm more than willing to test
> > any proposed fixes, but I'm not familiar enough yet with these SoCs to
> > figure out the underlying issues alone.
> > 
> > Until you have a working board farm, you could start having a closer
> > look at the boot logs we're already producing.  Admittedly linux-next
> > broken in many ways besides this one for exynos currently, but it has
> > been having these imprecise aborts well before the other recent
> > issues.
> > 
> > Also, It's very possible that this issue is not even MCPM related at
> > all, and MCPM is just uncovering a previously hidden bug.  It would be
> > very helpful if people more familiar with this hardware and SoC would
> > investigate bug reports like these.
> 
> Interesting thing can be found in exynos5420.dtsi:
> mdma1: mdma@11C10000 {
> 	...
> 	/*
> 	 * MDMA1 can support both secure and non-secure
> 	 * AXI transactions. When this is enabled in the kernel
> 	 * for boards that run in secure mode, we are getting
> 	 * imprecise external aborts causing the kernel to oops.
> 	 */
> 	status = "disabled";
> };
> 
> I am booting Arndale Octa on some other config and exynos. However with
> or without MCPM the imprecise aborts are still present (but not fatal,
> shell comes up).
> 
> My board boots also under secure firmware (I am using Linaro's ubuntu
> image). Maybe that is the cause?

One update: some fatal imprecise aborts (hanging boot) happen also on
exynos_defconfig without MCPM. It looks random... one boot fails, next
succeeds (however also with "imprecise external abort" message but shell
comes up).

Best regards,
Krzysztof
diff mbox

Patch

diff --git a/arch/arm/configs/exynos_defconfig b/arch/arm/configs/exynos_defconfig
index 72058b8a6f4d..a250dcbf34cd 100644
--- a/arch/arm/configs/exynos_defconfig
+++ b/arch/arm/configs/exynos_defconfig
@@ -10,7 +10,7 @@  CONFIG_MODULE_UNLOAD=y
 CONFIG_PARTITION_ADVANCED=y
 CONFIG_ARCH_EXYNOS=y
 CONFIG_ARCH_EXYNOS3=y
-CONFIG_EXYNOS5420_MCPM=y
+CONFIG_EXYNOS5420_MCPM=n
 CONFIG_SMP=y
 CONFIG_BIG_LITTLE=y
 CONFIG_BL_SWITCHER=y