mbox series

[0/3] On AMD platforms only offer s2idle w/ proper FW

Message ID 20220105193910.25678-1-mario.limonciello@amd.com (mailing list archive)
Headers show
Series On AMD platforms only offer s2idle w/ proper FW | expand

Message

Mario Limonciello Jan. 5, 2022, 7:39 p.m. UTC
Currently the Linux kernel will offer s2idle regardless of whether the FADT
indicates the system should use or on X86 if the LPS0 ACPI device has been
activated.

On some non-AMD platforms s2idle can be offered even without proper
firmware support.  The power consumption may be higher in these instances
but the system otherwise properly suspends and resumes.

On AMD platforms however when the FW has been configured not to offer
s2idle some different hardware initialization has occurred such that the
system won't properly resume.

This patch series changes the kernel suspend code to conditionally decide
whether to offer s2idle.  When an AMD system is encountered all the
prerequisites will be checked before s2idle is offered to the user.

Mario Limonciello (3):
  PM: suspend: Move some structure declarations around
  PM: sleep: Don't always assume s2idle will be enabled
  acpi: sleep: Don't offer s2idle on AMD platforms without LPS0

 drivers/acpi/sleep.c      |  6 ++++++
 drivers/acpi/x86/s2idle.c | 36 ++++++++++++++++++++++++----------
 include/linux/suspend.h   |  1 +
 kernel/power/suspend.c    | 41 ++++++++++++++++++++-------------------
 4 files changed, 54 insertions(+), 30 deletions(-)

Comments

Rafael J. Wysocki Jan. 11, 2022, 3:52 p.m. UTC | #1
On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
<mario.limonciello@amd.com> wrote:
>
> Currently the Linux kernel will offer s2idle regardless of whether the FADT
> indicates the system should use or on X86 if the LPS0 ACPI device has been
> activated.
>
> On some non-AMD platforms s2idle can be offered even without proper
> firmware support.  The power consumption may be higher in these instances
> but the system otherwise properly suspends and resumes.

Well, the idea is that s2idle should not require FW support at all.

It may not be possible to reach the minimum power level of the
platform without FW support, but that should not prevent s2idle from
being used.

> On AMD platforms however when the FW has been configured not to offer
> s2idle some different hardware initialization has occurred such that the
> system won't properly resume.

That's rather unfortunate.

Can you please share some details on what's going on in those cases?

Technically, without FW support there should be no difference between
the platform state reachable via s2idle and the platform state
reachable via runtime idle.
Mario Limonciello Jan. 11, 2022, 4:23 p.m. UTC | #2
+Alex

On 1/11/2022 09:52, Rafael J. Wysocki wrote:
> On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
> <mario.limonciello@amd.com> wrote:
>>
>> Currently the Linux kernel will offer s2idle regardless of whether the FADT
>> indicates the system should use or on X86 if the LPS0 ACPI device has been
>> activated.
>>
>> On some non-AMD platforms s2idle can be offered even without proper
>> firmware support.  The power consumption may be higher in these instances
>> but the system otherwise properly suspends and resumes.
> 
> Well, the idea is that s2idle should not require FW support at all. >

May I ask - why?  It's an intentional design decision?

> It may not be possible to reach the minimum power level of the
> platform without FW support, but that should not prevent s2idle from
> being used.
> 
>> On AMD platforms however when the FW has been configured not to offer
>> s2idle some different hardware initialization has occurred such that the
>> system won't properly resume.
> 
> That's rather unfortunate.
> 
> Can you please share some details on what's going on in those cases?
> 
> Technically, without FW support there should be no difference between
> the platform state reachable via s2idle and the platform state
> reachable via runtime idle.

During resume there is a number of page faults that occur and during 
initialization the ring tests fail.  The graphics is unusable at this 
time as a result.

The amdgpu code actually *does* distinguish between the 3 different 
cases of S3, S0ix, and runtime suspend.

The function "amdgpu_acpi_is_s0ix_active" causes different codepaths to 
be used during the suspend routine.

In this particular case that FADT doesn't set the low power idle bit
and that function returns false meaning the s3 codepath is taken but
the hardware didn't go through a reset.

It *might* also be possible to solve this by mandating an ASIC reset in 
such a case (we didn't try).

However it comes back to my first upleveveled question - is this a case 
we really want to support and encourage?  This type of bug and 
combination of codepaths is not a case that is going to be well tested. 
This patch series will align the kernel behavior to only what AMD validates.
Rafael J. Wysocki Jan. 11, 2022, 5:05 p.m. UTC | #3
On Tue, Jan 11, 2022 at 5:23 PM Limonciello, Mario
<mario.limonciello@amd.com> wrote:
>
> +Alex
>
> On 1/11/2022 09:52, Rafael J. Wysocki wrote:
> > On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
> > <mario.limonciello@amd.com> wrote:
> >>
> >> Currently the Linux kernel will offer s2idle regardless of whether the FADT
> >> indicates the system should use or on X86 if the LPS0 ACPI device has been
> >> activated.
> >>
> >> On some non-AMD platforms s2idle can be offered even without proper
> >> firmware support.  The power consumption may be higher in these instances
> >> but the system otherwise properly suspends and resumes.
> >
> > Well, the idea is that s2idle should not require FW support at all. >
>
> May I ask - why?  It's an intentional design decision?

Yes, it is.

> > It may not be possible to reach the minimum power level of the
> > platform without FW support, but that should not prevent s2idle from
> > being used.
> >
> >> On AMD platforms however when the FW has been configured not to offer
> >> s2idle some different hardware initialization has occurred such that the
> >> system won't properly resume.
> >
> > That's rather unfortunate.
> >
> > Can you please share some details on what's going on in those cases?
> >
> > Technically, without FW support there should be no difference between
> > the platform state reachable via s2idle and the platform state
> > reachable via runtime idle.
>
> During resume there is a number of page faults that occur and during
> initialization the ring tests fail.  The graphics is unusable at this
> time as a result.
>
> The amdgpu code actually *does* distinguish between the 3 different
> cases of S3, S0ix, and runtime suspend.

But s2idle doesn't guarantee S0ix in any case.

> The function "amdgpu_acpi_is_s0ix_active" causes different codepaths to
> be used during the suspend routine.

Well, as I said, s2idle need not mean S0ix.

> In this particular case that FADT doesn't set the low power idle bit
> and that function returns false meaning the s3 codepath is taken but
> the hardware didn't go through a reset.

If there is a separate S3 code path, taking it when
pm_suspend_target_state == PM_SUSPEND_TO_IDLE is incorrect.

> It *might* also be possible to solve this by mandating an ASIC reset in
> such a case (we didn't try).

I'd rather do a PM-runtime path equivalent if the target sleep state
is PM_SUSPEND_TO_IDLE and there is no FW support for S0ix.

> However it comes back to my first upleveveled question - is this a case
> we really want to support and encourage?  This type of bug and
> combination of codepaths is not a case that is going to be well tested.
> This patch series will align the kernel behavior to only what AMD validates.

But this does not follow the definition of s2idle and its documentation.
Mario Limonciello Jan. 11, 2022, 5:32 p.m. UTC | #4
On 1/11/2022 11:05, Rafael J. Wysocki wrote:
> On Tue, Jan 11, 2022 at 5:23 PM Limonciello, Mario
> <mario.limonciello@amd.com> wrote:
>>
>> +Alex
>>
>> On 1/11/2022 09:52, Rafael J. Wysocki wrote:
>>> On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
>>> <mario.limonciello@amd.com> wrote:
>>>>
>>>> Currently the Linux kernel will offer s2idle regardless of whether the FADT
>>>> indicates the system should use or on X86 if the LPS0 ACPI device has been
>>>> activated.
>>>>
>>>> On some non-AMD platforms s2idle can be offered even without proper
>>>> firmware support.  The power consumption may be higher in these instances
>>>> but the system otherwise properly suspends and resumes.
>>>
>>> Well, the idea is that s2idle should not require FW support at all. >
>>
>> May I ask - why?  It's an intentional design decision?
> 
> Yes, it is.
> 
>>> It may not be possible to reach the minimum power level of the
>>> platform without FW support, but that should not prevent s2idle from
>>> being used.
>>>
>>>> On AMD platforms however when the FW has been configured not to offer
>>>> s2idle some different hardware initialization has occurred such that the
>>>> system won't properly resume.
>>>
>>> That's rather unfortunate.
>>>
>>> Can you please share some details on what's going on in those cases?
>>>
>>> Technically, without FW support there should be no difference between
>>> the platform state reachable via s2idle and the platform state
>>> reachable via runtime idle.
>>
>> During resume there is a number of page faults that occur and during
>> initialization the ring tests fail.  The graphics is unusable at this
>> time as a result.
>>
>> The amdgpu code actually *does* distinguish between the 3 different
>> cases of S3, S0ix, and runtime suspend.
> 
> But s2idle doesn't guarantee S0ix in any case.
> 
>> The function "amdgpu_acpi_is_s0ix_active" causes different codepaths to
>> be used during the suspend routine.
> 
> Well, as I said, s2idle need not mean S0ix.
> 
>> In this particular case that FADT doesn't set the low power idle bit
>> and that function returns false meaning the s3 codepath is taken but
>> the hardware didn't go through a reset.
> 
> If there is a separate S3 code path, taking it when
> pm_suspend_target_state == PM_SUSPEND_TO_IDLE is incorrect.
> 
>> It *might* also be possible to solve this by mandating an ASIC reset in
>> such a case (we didn't try).
> 
> I'd rather do a PM-runtime path equivalent if the target sleep state
> is PM_SUSPEND_TO_IDLE and there is no FW support for S0ix.
> 
>> However it comes back to my first upleveveled question - is this a case
>> we really want to support and encourage?  This type of bug and
>> combination of codepaths is not a case that is going to be well tested.
>> This patch series will align the kernel behavior to only what AMD validates.
> 
> But this does not follow the definition of s2idle and its documentation.

Very well, we'll work out in amdgpu what we can do.  Please close the 
kernel bugzilla (I don't have permissions).
Alex Deucher Jan. 11, 2022, 5:32 p.m. UTC | #5
[AMD Official Use Only]

> -----Original Message-----
> From: Rafael J. Wysocki <rafael@kernel.org>
> Sent: Tuesday, January 11, 2022 12:06 PM
> To: Limonciello, Mario <Mario.Limonciello@amd.com>
> Cc: Rafael J. Wysocki <rafael@kernel.org>; Rafael J . Wysocki
> <rjw@rjwysocki.net>; ACPI Devel Maling List <linux-acpi@vger.kernel.org>;
> S-k, Shyam-sundar <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
> <Basavaraj.Natikar@amd.com>; Deucher, Alexander
> <Alexander.Deucher@amd.com>; bjoern.daase@gmail.com
> Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/ proper FW
> 
> On Tue, Jan 11, 2022 at 5:23 PM Limonciello, Mario
> <mario.limonciello@amd.com> wrote:
> >
> > +Alex
> >
> > On 1/11/2022 09:52, Rafael J. Wysocki wrote:
> > > On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
> > > <mario.limonciello@amd.com> wrote:
> > >>
> > >> Currently the Linux kernel will offer s2idle regardless of whether
> > >> the FADT indicates the system should use or on X86 if the LPS0 ACPI
> > >> device has been activated.
> > >>
> > >> On some non-AMD platforms s2idle can be offered even without
> proper
> > >> firmware support.  The power consumption may be higher in these
> > >> instances but the system otherwise properly suspends and resumes.
> > >
> > > Well, the idea is that s2idle should not require FW support at all.
> > > >
> >
> > May I ask - why?  It's an intentional design decision?
> 
> Yes, it is.
> 
> > > It may not be possible to reach the minimum power level of the
> > > platform without FW support, but that should not prevent s2idle from
> > > being used.
> > >
> > >> On AMD platforms however when the FW has been configured not to
> > >> offer s2idle some different hardware initialization has occurred
> > >> such that the system won't properly resume.
> > >
> > > That's rather unfortunate.
> > >
> > > Can you please share some details on what's going on in those cases?
> > >
> > > Technically, without FW support there should be no difference
> > > between the platform state reachable via s2idle and the platform
> > > state reachable via runtime idle.
> >
> > During resume there is a number of page faults that occur and during
> > initialization the ring tests fail.  The graphics is unusable at this
> > time as a result.
> >
> > The amdgpu code actually *does* distinguish between the 3 different
> > cases of S3, S0ix, and runtime suspend.
> 
> But s2idle doesn't guarantee S0ix in any case.
> 
> > The function "amdgpu_acpi_is_s0ix_active" causes different codepaths
> > to be used during the suspend routine.
> 
> Well, as I said, s2idle need not mean S0ix.
> 
> > In this particular case that FADT doesn't set the low power idle bit
> > and that function returns false meaning the s3 codepath is taken but
> > the hardware didn't go through a reset.
> 
> If there is a separate S3 code path, taking it when pm_suspend_target_state
> == PM_SUSPEND_TO_IDLE is incorrect.
> 
> > It *might* also be possible to solve this by mandating an ASIC reset
> > in such a case (we didn't try).
> 
> I'd rather do a PM-runtime path equivalent if the target sleep state is
> PM_SUSPEND_TO_IDLE and there is no FW support for S0ix.
> 
> > However it comes back to my first upleveveled question - is this a
> > case we really want to support and encourage?  This type of bug and
> > combination of codepaths is not a case that is going to be well tested.
> > This patch series will align the kernel behavior to only what AMD validates.
> 
> But this does not follow the definition of s2idle and its documentation.

At least for devices integrated into the SoC, the power rails are controlled by the firmware in the SoC.  For S3, the power rails are cut by the FW when the platform enters S3.  For S0ix, the power rails are cut when all of the devices on the rail suspended and various conditions are met.  Also, in the case of some devices, the device has to be in a very specific state for s0ix to work properly.  The GPU is the big one here.  For S3, the entire GPU has to be re-initialized at resume.  For S0ix, the GPU's state is largely handled by the firmware and attempting to re-initialize it won't work unless you reset it.  Integrated AMD graphics don't support runtime power management, only dGPUs do.  For integrated graphics the firmware dynamically controls the power at runtime so there is no need to do anything special for runtime pm.  For dGPUs we support d3cold either via ACPI on platforms like all-in-ones and laptops or via a driver initiated sequence for add-in-cards.

What does S2idle ultimately do when all devices have suspended?  Does it enter S0ix or S3 at the end when it want to ultimately suspend the platform, or is the assumption that if all devices have suspended, the that is equivalent to S0ix or S3?  For AMD platforms, either S3 or S0ix needs to be entered for the platform to actually power down most of the power rails.  It's not clear to me what we should do for s2idle.

Alex
Rafael J. Wysocki Jan. 11, 2022, 5:44 p.m. UTC | #6
On Tue, Jan 11, 2022 at 6:32 PM Deucher, Alexander
<Alexander.Deucher@amd.com> wrote:
>
> [AMD Official Use Only]
>
> > -----Original Message-----
> > From: Rafael J. Wysocki <rafael@kernel.org>
> > Sent: Tuesday, January 11, 2022 12:06 PM
> > To: Limonciello, Mario <Mario.Limonciello@amd.com>
> > Cc: Rafael J. Wysocki <rafael@kernel.org>; Rafael J . Wysocki
> > <rjw@rjwysocki.net>; ACPI Devel Maling List <linux-acpi@vger.kernel.org>;
> > S-k, Shyam-sundar <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
> > <Basavaraj.Natikar@amd.com>; Deucher, Alexander
> > <Alexander.Deucher@amd.com>; bjoern.daase@gmail.com
> > Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/ proper FW
> >
> > On Tue, Jan 11, 2022 at 5:23 PM Limonciello, Mario
> > <mario.limonciello@amd.com> wrote:
> > >
> > > +Alex
> > >
> > > On 1/11/2022 09:52, Rafael J. Wysocki wrote:
> > > > On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
> > > > <mario.limonciello@amd.com> wrote:
> > > >>
> > > >> Currently the Linux kernel will offer s2idle regardless of whether
> > > >> the FADT indicates the system should use or on X86 if the LPS0 ACPI
> > > >> device has been activated.
> > > >>
> > > >> On some non-AMD platforms s2idle can be offered even without
> > proper
> > > >> firmware support.  The power consumption may be higher in these
> > > >> instances but the system otherwise properly suspends and resumes.
> > > >
> > > > Well, the idea is that s2idle should not require FW support at all.
> > > > >
> > >
> > > May I ask - why?  It's an intentional design decision?
> >
> > Yes, it is.
> >
> > > > It may not be possible to reach the minimum power level of the
> > > > platform without FW support, but that should not prevent s2idle from
> > > > being used.
> > > >
> > > >> On AMD platforms however when the FW has been configured not to
> > > >> offer s2idle some different hardware initialization has occurred
> > > >> such that the system won't properly resume.
> > > >
> > > > That's rather unfortunate.
> > > >
> > > > Can you please share some details on what's going on in those cases?
> > > >
> > > > Technically, without FW support there should be no difference
> > > > between the platform state reachable via s2idle and the platform
> > > > state reachable via runtime idle.
> > >
> > > During resume there is a number of page faults that occur and during
> > > initialization the ring tests fail.  The graphics is unusable at this
> > > time as a result.
> > >
> > > The amdgpu code actually *does* distinguish between the 3 different
> > > cases of S3, S0ix, and runtime suspend.
> >
> > But s2idle doesn't guarantee S0ix in any case.
> >
> > > The function "amdgpu_acpi_is_s0ix_active" causes different codepaths
> > > to be used during the suspend routine.
> >
> > Well, as I said, s2idle need not mean S0ix.
> >
> > > In this particular case that FADT doesn't set the low power idle bit
> > > and that function returns false meaning the s3 codepath is taken but
> > > the hardware didn't go through a reset.
> >
> > If there is a separate S3 code path, taking it when pm_suspend_target_state
> > == PM_SUSPEND_TO_IDLE is incorrect.
> >
> > > It *might* also be possible to solve this by mandating an ASIC reset
> > > in such a case (we didn't try).
> >
> > I'd rather do a PM-runtime path equivalent if the target sleep state is
> > PM_SUSPEND_TO_IDLE and there is no FW support for S0ix.
> >
> > > However it comes back to my first upleveveled question - is this a
> > > case we really want to support and encourage?  This type of bug and
> > > combination of codepaths is not a case that is going to be well tested.
> > > This patch series will align the kernel behavior to only what AMD validates.
> >
> > But this does not follow the definition of s2idle and its documentation.
>
> At least for devices integrated into the SoC, the power rails are controlled by the firmware in the SoC.  For S3, the power rails are cut by the FW when the platform enters S3.  For S0ix, the power rails are cut when all of the devices on the rail suspended and various conditions are met.  Also, in the case of some devices, the device has to be in a very specific state for s0ix to work properly.  The GPU is the big one here.  For S3, the entire GPU has to be re-initialized at resume.  For S0ix, the GPU's state is largely handled by the firmware and attempting to re-initialize it won't work unless you reset it.  Integrated AMD graphics don't support runtime power management, only dGPUs do.  For integrated graphics the firmware dynamically controls the power at runtime so there is no need to do anything special for runtime pm.  For dGPUs we support d3cold either via ACPI on platforms like all-in-ones and laptops or via a driver initiated sequence for add-in-cards.
>
> What does S2idle ultimately do when all devices have suspended?  Does it enter S0ix or S3 at the end when it want to ultimately suspend the platform, or is the assumption that if all devices have suspended, the that is equivalent to S0ix or S3?  For AMD platforms, either S3 or S0ix needs to be entered for the platform to actually power down most of the power rails.  It's not clear to me what we should do for s2idle.

s2idle will never enter S3, because that requires platform support and
generally is a different thing (eg. some devices need special upfront
preparation for S3, wakeup may need to be configured in a special way
etc.).

It will attempt to enter S0ix if possible, but otherwise it will just
put CPUs into the deepest available idle state and stay there until an
interrupt (or equivalent) triggers.

Physically, at a device level, s2idle is more similar to runtime
suspend than to S3, but it uses system-wide suspend callbacks and it
requires wakeup to be disabled for the devices that are not allowed to
wake up the system by user space, that is device_may_wakeup(0 returns
false (PM-runtime assumes wakeup to always be enabled for the
suspended devices that can signal wakeup).

In any case, the ACPI system state for s2idle is always S0.
Alex Deucher Jan. 11, 2022, 6:30 p.m. UTC | #7
[Public]

> -----Original Message-----
> From: Rafael J. Wysocki <rafael@kernel.org>
> Sent: Tuesday, January 11, 2022 12:45 PM
> To: Deucher, Alexander <Alexander.Deucher@amd.com>
> Cc: Rafael J. Wysocki <rafael@kernel.org>; Limonciello, Mario
> <Mario.Limonciello@amd.com>; Rafael J . Wysocki <rjw@rjwysocki.net>;
> ACPI Devel Maling List <linux-acpi@vger.kernel.org>; S-k, Shyam-sundar
> <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
> <Basavaraj.Natikar@amd.com>; bjoern.daase@gmail.com
> Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/ proper FW
> 
> On Tue, Jan 11, 2022 at 6:32 PM Deucher, Alexander
> <Alexander.Deucher@amd.com> wrote:
> >
> > [AMD Official Use Only]
> >
> > > -----Original Message-----
> > > From: Rafael J. Wysocki <rafael@kernel.org>
> > > Sent: Tuesday, January 11, 2022 12:06 PM
> > > To: Limonciello, Mario <Mario.Limonciello@amd.com>
> > > Cc: Rafael J. Wysocki <rafael@kernel.org>; Rafael J . Wysocki
> > > <rjw@rjwysocki.net>; ACPI Devel Maling List
> > > <linux-acpi@vger.kernel.org>; S-k, Shyam-sundar
> > > <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
> > > <Basavaraj.Natikar@amd.com>; Deucher, Alexander
> > > <Alexander.Deucher@amd.com>; bjoern.daase@gmail.com
> > > Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/
> > > proper FW
> > >
> > > On Tue, Jan 11, 2022 at 5:23 PM Limonciello, Mario
> > > <mario.limonciello@amd.com> wrote:
> > > >
> > > > +Alex
> > > >
> > > > On 1/11/2022 09:52, Rafael J. Wysocki wrote:
> > > > > On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
> > > > > <mario.limonciello@amd.com> wrote:
> > > > >>
> > > > >> Currently the Linux kernel will offer s2idle regardless of
> > > > >> whether the FADT indicates the system should use or on X86 if
> > > > >> the LPS0 ACPI device has been activated.
> > > > >>
> > > > >> On some non-AMD platforms s2idle can be offered even without
> > > proper
> > > > >> firmware support.  The power consumption may be higher in these
> > > > >> instances but the system otherwise properly suspends and
> resumes.
> > > > >
> > > > > Well, the idea is that s2idle should not require FW support at all.
> > > > > >
> > > >
> > > > May I ask - why?  It's an intentional design decision?
> > >
> > > Yes, it is.
> > >
> > > > > It may not be possible to reach the minimum power level of the
> > > > > platform without FW support, but that should not prevent s2idle
> > > > > from being used.
> > > > >
> > > > >> On AMD platforms however when the FW has been configured not
> to
> > > > >> offer s2idle some different hardware initialization has
> > > > >> occurred such that the system won't properly resume.
> > > > >
> > > > > That's rather unfortunate.
> > > > >
> > > > > Can you please share some details on what's going on in those cases?
> > > > >
> > > > > Technically, without FW support there should be no difference
> > > > > between the platform state reachable via s2idle and the platform
> > > > > state reachable via runtime idle.
> > > >
> > > > During resume there is a number of page faults that occur and
> > > > during initialization the ring tests fail.  The graphics is
> > > > unusable at this time as a result.
> > > >
> > > > The amdgpu code actually *does* distinguish between the 3
> > > > different cases of S3, S0ix, and runtime suspend.
> > >
> > > But s2idle doesn't guarantee S0ix in any case.
> > >
> > > > The function "amdgpu_acpi_is_s0ix_active" causes different
> > > > codepaths to be used during the suspend routine.
> > >
> > > Well, as I said, s2idle need not mean S0ix.
> > >
> > > > In this particular case that FADT doesn't set the low power idle
> > > > bit and that function returns false meaning the s3 codepath is
> > > > taken but the hardware didn't go through a reset.
> > >
> > > If there is a separate S3 code path, taking it when
> > > pm_suspend_target_state == PM_SUSPEND_TO_IDLE is incorrect.
> > >
> > > > It *might* also be possible to solve this by mandating an ASIC
> > > > reset in such a case (we didn't try).
> > >
> > > I'd rather do a PM-runtime path equivalent if the target sleep state
> > > is PM_SUSPEND_TO_IDLE and there is no FW support for S0ix.
> > >
> > > > However it comes back to my first upleveveled question - is this a
> > > > case we really want to support and encourage?  This type of bug
> > > > and combination of codepaths is not a case that is going to be well
> tested.
> > > > This patch series will align the kernel behavior to only what AMD
> validates.
> > >
> > > But this does not follow the definition of s2idle and its documentation.
> >
> > At least for devices integrated into the SoC, the power rails are controlled
> by the firmware in the SoC.  For S3, the power rails are cut by the FW when
> the platform enters S3.  For S0ix, the power rails are cut when all of the
> devices on the rail suspended and various conditions are met.  Also, in the
> case of some devices, the device has to be in a very specific state for s0ix to
> work properly.  The GPU is the big one here.  For S3, the entire GPU has to
> be re-initialized at resume.  For S0ix, the GPU's state is largely handled by the
> firmware and attempting to re-initialize it won't work unless you reset it.
> Integrated AMD graphics don't support runtime power management, only
> dGPUs do.  For integrated graphics the firmware dynamically controls the
> power at runtime so there is no need to do anything special for runtime pm.
> For dGPUs we support d3cold either via ACPI on platforms like all-in-ones
> and laptops or via a driver initiated sequence for add-in-cards.
> >
> > What does S2idle ultimately do when all devices have suspended?  Does it
> enter S0ix or S3 at the end when it want to ultimately suspend the platform,
> or is the assumption that if all devices have suspended, the that is equivalent
> to S0ix or S3?  For AMD platforms, either S3 or S0ix needs to be entered for
> the platform to actually power down most of the power rails.  It's not clear to
> me what we should do for s2idle.
> 
> s2idle will never enter S3, because that requires platform support and
> generally is a different thing (eg. some devices need special upfront
> preparation for S3, wakeup may need to be configured in a special way etc.).
> 
> It will attempt to enter S0ix if possible, but otherwise it will just put CPUs into
> the deepest available idle state and stay there until an interrupt (or
> equivalent) triggers.
> 
> Physically, at a device level, s2idle is more similar to runtime suspend than to
> S3, but it uses system-wide suspend callbacks and it requires wakeup to be
> disabled for the devices that are not allowed to wake up the system by user
> space, that is device_may_wakeup(0 returns false (PM-runtime assumes
> wakeup to always be enabled for the suspended devices that can signal
> wakeup).
> 
> In any case, the ACPI system state for s2idle is always S0.

Thanks.  In that case, it's kind of pointless on AMD platforms then since the power rails will never be turned off for most devices on the system.  Does it even make sense to expose it?  It just gives users a false sense of suspend and then they will probably complain that it uses too much power when in s2idle.

Alex
Mario Limonciello Jan. 11, 2022, 6:36 p.m. UTC | #8
On 1/11/2022 12:30, Deucher, Alexander wrote:
> [Public]
> 
>> -----Original Message-----
>> From: Rafael J. Wysocki <rafael@kernel.org>
>> Sent: Tuesday, January 11, 2022 12:45 PM
>> To: Deucher, Alexander <Alexander.Deucher@amd.com>
>> Cc: Rafael J. Wysocki <rafael@kernel.org>; Limonciello, Mario
>> <Mario.Limonciello@amd.com>; Rafael J . Wysocki <rjw@rjwysocki.net>;
>> ACPI Devel Maling List <linux-acpi@vger.kernel.org>; S-k, Shyam-sundar
>> <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
>> <Basavaraj.Natikar@amd.com>; bjoern.daase@gmail.com
>> Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/ proper FW
>>
>> On Tue, Jan 11, 2022 at 6:32 PM Deucher, Alexander
>> <Alexander.Deucher@amd.com> wrote:
>>>
>>> [AMD Official Use Only]
>>>
>>>> -----Original Message-----
>>>> From: Rafael J. Wysocki <rafael@kernel.org>
>>>> Sent: Tuesday, January 11, 2022 12:06 PM
>>>> To: Limonciello, Mario <Mario.Limonciello@amd.com>
>>>> Cc: Rafael J. Wysocki <rafael@kernel.org>; Rafael J . Wysocki
>>>> <rjw@rjwysocki.net>; ACPI Devel Maling List
>>>> <linux-acpi@vger.kernel.org>; S-k, Shyam-sundar
>>>> <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
>>>> <Basavaraj.Natikar@amd.com>; Deucher, Alexander
>>>> <Alexander.Deucher@amd.com>; bjoern.daase@gmail.com
>>>> Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/
>>>> proper FW
>>>>
>>>> On Tue, Jan 11, 2022 at 5:23 PM Limonciello, Mario
>>>> <mario.limonciello@amd.com> wrote:
>>>>>
>>>>> +Alex
>>>>>
>>>>> On 1/11/2022 09:52, Rafael J. Wysocki wrote:
>>>>>> On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
>>>>>> <mario.limonciello@amd.com> wrote:
>>>>>>>
>>>>>>> Currently the Linux kernel will offer s2idle regardless of
>>>>>>> whether the FADT indicates the system should use or on X86 if
>>>>>>> the LPS0 ACPI device has been activated.
>>>>>>>
>>>>>>> On some non-AMD platforms s2idle can be offered even without
>>>> proper
>>>>>>> firmware support.  The power consumption may be higher in these
>>>>>>> instances but the system otherwise properly suspends and
>> resumes.
>>>>>>
>>>>>> Well, the idea is that s2idle should not require FW support at all.
>>>>>>>
>>>>>
>>>>> May I ask - why?  It's an intentional design decision?
>>>>
>>>> Yes, it is.
>>>>
>>>>>> It may not be possible to reach the minimum power level of the
>>>>>> platform without FW support, but that should not prevent s2idle
>>>>>> from being used.
>>>>>>
>>>>>>> On AMD platforms however when the FW has been configured not
>> to
>>>>>>> offer s2idle some different hardware initialization has
>>>>>>> occurred such that the system won't properly resume.
>>>>>>
>>>>>> That's rather unfortunate.
>>>>>>
>>>>>> Can you please share some details on what's going on in those cases?
>>>>>>
>>>>>> Technically, without FW support there should be no difference
>>>>>> between the platform state reachable via s2idle and the platform
>>>>>> state reachable via runtime idle.
>>>>>
>>>>> During resume there is a number of page faults that occur and
>>>>> during initialization the ring tests fail.  The graphics is
>>>>> unusable at this time as a result.
>>>>>
>>>>> The amdgpu code actually *does* distinguish between the 3
>>>>> different cases of S3, S0ix, and runtime suspend.
>>>>
>>>> But s2idle doesn't guarantee S0ix in any case.
>>>>
>>>>> The function "amdgpu_acpi_is_s0ix_active" causes different
>>>>> codepaths to be used during the suspend routine.
>>>>
>>>> Well, as I said, s2idle need not mean S0ix.
>>>>
>>>>> In this particular case that FADT doesn't set the low power idle
>>>>> bit and that function returns false meaning the s3 codepath is
>>>>> taken but the hardware didn't go through a reset.
>>>>
>>>> If there is a separate S3 code path, taking it when
>>>> pm_suspend_target_state == PM_SUSPEND_TO_IDLE is incorrect.
>>>>
>>>>> It *might* also be possible to solve this by mandating an ASIC
>>>>> reset in such a case (we didn't try).
>>>>
>>>> I'd rather do a PM-runtime path equivalent if the target sleep state
>>>> is PM_SUSPEND_TO_IDLE and there is no FW support for S0ix.
>>>>
>>>>> However it comes back to my first upleveveled question - is this a
>>>>> case we really want to support and encourage?  This type of bug
>>>>> and combination of codepaths is not a case that is going to be well
>> tested.
>>>>> This patch series will align the kernel behavior to only what AMD
>> validates.
>>>>
>>>> But this does not follow the definition of s2idle and its documentation.
>>>
>>> At least for devices integrated into the SoC, the power rails are controlled
>> by the firmware in the SoC.  For S3, the power rails are cut by the FW when
>> the platform enters S3.  For S0ix, the power rails are cut when all of the
>> devices on the rail suspended and various conditions are met.  Also, in the
>> case of some devices, the device has to be in a very specific state for s0ix to
>> work properly.  The GPU is the big one here.  For S3, the entire GPU has to
>> be re-initialized at resume.  For S0ix, the GPU's state is largely handled by the
>> firmware and attempting to re-initialize it won't work unless you reset it.
>> Integrated AMD graphics don't support runtime power management, only
>> dGPUs do.  For integrated graphics the firmware dynamically controls the
>> power at runtime so there is no need to do anything special for runtime pm.
>> For dGPUs we support d3cold either via ACPI on platforms like all-in-ones
>> and laptops or via a driver initiated sequence for add-in-cards.
>>>
>>> What does S2idle ultimately do when all devices have suspended?  Does it
>> enter S0ix or S3 at the end when it want to ultimately suspend the platform,
>> or is the assumption that if all devices have suspended, the that is equivalent
>> to S0ix or S3?  For AMD platforms, either S3 or S0ix needs to be entered for
>> the platform to actually power down most of the power rails.  It's not clear to
>> me what we should do for s2idle.
>>
>> s2idle will never enter S3, because that requires platform support and
>> generally is a different thing (eg. some devices need special upfront
>> preparation for S3, wakeup may need to be configured in a special way etc.).
>>
>> It will attempt to enter S0ix if possible, but otherwise it will just put CPUs into
>> the deepest available idle state and stay there until an interrupt (or
>> equivalent) triggers.
>>
>> Physically, at a device level, s2idle is more similar to runtime suspend than to
>> S3, but it uses system-wide suspend callbacks and it requires wakeup to be
>> disabled for the devices that are not allowed to wake up the system by user
>> space, that is device_may_wakeup(0 returns false (PM-runtime assumes
>> wakeup to always be enabled for the suspended devices that can signal
>> wakeup).
>>
>> In any case, the ACPI system state for s2idle is always S0.
> 
> Thanks.  In that case, it's kind of pointless on AMD platforms then since the power rails will never be turned off for most devices on the system.  Does it even make sense to expose it?  It just gives users a false sense of suspend and then they will probably complain that it uses too much power when in s2idle.
> 
> Alex

That's why I thought my patch series made sense.

I guess another (antisocial) response would be to to return false when 
the suspend callback for amdgpu happens and dev_err mentioning that 
firmware support is needed for suspend.
Rafael J. Wysocki Jan. 11, 2022, 7:20 p.m. UTC | #9
On Tue, Jan 11, 2022 at 7:36 PM Limonciello, Mario
<mario.limonciello@amd.com> wrote:
>
> On 1/11/2022 12:30, Deucher, Alexander wrote:
> > [Public]
> >
> >> -----Original Message-----
> >> From: Rafael J. Wysocki <rafael@kernel.org>
> >> Sent: Tuesday, January 11, 2022 12:45 PM
> >> To: Deucher, Alexander <Alexander.Deucher@amd.com>
> >> Cc: Rafael J. Wysocki <rafael@kernel.org>; Limonciello, Mario
> >> <Mario.Limonciello@amd.com>; Rafael J . Wysocki <rjw@rjwysocki.net>;
> >> ACPI Devel Maling List <linux-acpi@vger.kernel.org>; S-k, Shyam-sundar
> >> <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
> >> <Basavaraj.Natikar@amd.com>; bjoern.daase@gmail.com
> >> Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/ proper FW
> >>
> >> On Tue, Jan 11, 2022 at 6:32 PM Deucher, Alexander
> >> <Alexander.Deucher@amd.com> wrote:
> >>>
> >>> [AMD Official Use Only]
> >>>
> >>>> -----Original Message-----
> >>>> From: Rafael J. Wysocki <rafael@kernel.org>
> >>>> Sent: Tuesday, January 11, 2022 12:06 PM
> >>>> To: Limonciello, Mario <Mario.Limonciello@amd.com>
> >>>> Cc: Rafael J. Wysocki <rafael@kernel.org>; Rafael J . Wysocki
> >>>> <rjw@rjwysocki.net>; ACPI Devel Maling List
> >>>> <linux-acpi@vger.kernel.org>; S-k, Shyam-sundar
> >>>> <Shyam-sundar.S-k@amd.com>; Natikar, Basavaraj
> >>>> <Basavaraj.Natikar@amd.com>; Deucher, Alexander
> >>>> <Alexander.Deucher@amd.com>; bjoern.daase@gmail.com
> >>>> Subject: Re: [PATCH 0/3] On AMD platforms only offer s2idle w/
> >>>> proper FW
> >>>>
> >>>> On Tue, Jan 11, 2022 at 5:23 PM Limonciello, Mario
> >>>> <mario.limonciello@amd.com> wrote:
> >>>>>
> >>>>> +Alex
> >>>>>
> >>>>> On 1/11/2022 09:52, Rafael J. Wysocki wrote:
> >>>>>> On Wed, Jan 5, 2022 at 8:39 PM Mario Limonciello
> >>>>>> <mario.limonciello@amd.com> wrote:
> >>>>>>>
> >>>>>>> Currently the Linux kernel will offer s2idle regardless of
> >>>>>>> whether the FADT indicates the system should use or on X86 if
> >>>>>>> the LPS0 ACPI device has been activated.
> >>>>>>>
> >>>>>>> On some non-AMD platforms s2idle can be offered even without
> >>>> proper
> >>>>>>> firmware support.  The power consumption may be higher in these
> >>>>>>> instances but the system otherwise properly suspends and
> >> resumes.
> >>>>>>
> >>>>>> Well, the idea is that s2idle should not require FW support at all.
> >>>>>>>
> >>>>>
> >>>>> May I ask - why?  It's an intentional design decision?
> >>>>
> >>>> Yes, it is.
> >>>>
> >>>>>> It may not be possible to reach the minimum power level of the
> >>>>>> platform without FW support, but that should not prevent s2idle
> >>>>>> from being used.
> >>>>>>
> >>>>>>> On AMD platforms however when the FW has been configured not
> >> to
> >>>>>>> offer s2idle some different hardware initialization has
> >>>>>>> occurred such that the system won't properly resume.
> >>>>>>
> >>>>>> That's rather unfortunate.
> >>>>>>
> >>>>>> Can you please share some details on what's going on in those cases?
> >>>>>>
> >>>>>> Technically, without FW support there should be no difference
> >>>>>> between the platform state reachable via s2idle and the platform
> >>>>>> state reachable via runtime idle.
> >>>>>
> >>>>> During resume there is a number of page faults that occur and
> >>>>> during initialization the ring tests fail.  The graphics is
> >>>>> unusable at this time as a result.
> >>>>>
> >>>>> The amdgpu code actually *does* distinguish between the 3
> >>>>> different cases of S3, S0ix, and runtime suspend.
> >>>>
> >>>> But s2idle doesn't guarantee S0ix in any case.
> >>>>
> >>>>> The function "amdgpu_acpi_is_s0ix_active" causes different
> >>>>> codepaths to be used during the suspend routine.
> >>>>
> >>>> Well, as I said, s2idle need not mean S0ix.
> >>>>
> >>>>> In this particular case that FADT doesn't set the low power idle
> >>>>> bit and that function returns false meaning the s3 codepath is
> >>>>> taken but the hardware didn't go through a reset.
> >>>>
> >>>> If there is a separate S3 code path, taking it when
> >>>> pm_suspend_target_state == PM_SUSPEND_TO_IDLE is incorrect.
> >>>>
> >>>>> It *might* also be possible to solve this by mandating an ASIC
> >>>>> reset in such a case (we didn't try).
> >>>>
> >>>> I'd rather do a PM-runtime path equivalent if the target sleep state
> >>>> is PM_SUSPEND_TO_IDLE and there is no FW support for S0ix.
> >>>>
> >>>>> However it comes back to my first upleveveled question - is this a
> >>>>> case we really want to support and encourage?  This type of bug
> >>>>> and combination of codepaths is not a case that is going to be well
> >> tested.
> >>>>> This patch series will align the kernel behavior to only what AMD
> >> validates.
> >>>>
> >>>> But this does not follow the definition of s2idle and its documentation.
> >>>
> >>> At least for devices integrated into the SoC, the power rails are controlled
> >> by the firmware in the SoC.  For S3, the power rails are cut by the FW when
> >> the platform enters S3.  For S0ix, the power rails are cut when all of the
> >> devices on the rail suspended and various conditions are met.  Also, in the
> >> case of some devices, the device has to be in a very specific state for s0ix to
> >> work properly.  The GPU is the big one here.  For S3, the entire GPU has to
> >> be re-initialized at resume.  For S0ix, the GPU's state is largely handled by the
> >> firmware and attempting to re-initialize it won't work unless you reset it.
> >> Integrated AMD graphics don't support runtime power management, only
> >> dGPUs do.  For integrated graphics the firmware dynamically controls the
> >> power at runtime so there is no need to do anything special for runtime pm.
> >> For dGPUs we support d3cold either via ACPI on platforms like all-in-ones
> >> and laptops or via a driver initiated sequence for add-in-cards.
> >>>
> >>> What does S2idle ultimately do when all devices have suspended?  Does it
> >> enter S0ix or S3 at the end when it want to ultimately suspend the platform,
> >> or is the assumption that if all devices have suspended, the that is equivalent
> >> to S0ix or S3?  For AMD platforms, either S3 or S0ix needs to be entered for
> >> the platform to actually power down most of the power rails.  It's not clear to
> >> me what we should do for s2idle.
> >>
> >> s2idle will never enter S3, because that requires platform support and
> >> generally is a different thing (eg. some devices need special upfront
> >> preparation for S3, wakeup may need to be configured in a special way etc.).
> >>
> >> It will attempt to enter S0ix if possible, but otherwise it will just put CPUs into
> >> the deepest available idle state and stay there until an interrupt (or
> >> equivalent) triggers.
> >>
> >> Physically, at a device level, s2idle is more similar to runtime suspend than to
> >> S3, but it uses system-wide suspend callbacks and it requires wakeup to be
> >> disabled for the devices that are not allowed to wake up the system by user
> >> space, that is device_may_wakeup(0 returns false (PM-runtime assumes
> >> wakeup to always be enabled for the suspended devices that can signal
> >> wakeup).
> >>
> >> In any case, the ACPI system state for s2idle is always S0.
> >
> > Thanks.  In that case, it's kind of pointless on AMD platforms then since the power rails will never be turned off for most devices on the system.  Does it even make sense to expose it?  It just gives users a false sense of suspend and then they will probably complain that it uses too much power when in s2idle.
> >
> > Alex
>
> That's why I thought my patch series made sense.
>
> I guess another (antisocial) response would be to to return false when
> the suspend callback for amdgpu happens and dev_err mentioning that
> firmware support is needed for suspend.

If you want to avoid exposing s2idle at all in some cases, you need to
change the sysfs I/F in /sys/power/ so that it doesn't expose "s2idle"
in mem_sleep or "freeze" in state at all.  That's much more intrusive
than just failing every s2idle attempt (which is what the patches do
AFAICS).

Do you want to do that for platforms supporting S3, or for platforms
that don't support any kind of suspend at all?
Mario Limonciello Jan. 11, 2022, 7:35 p.m. UTC | #10
>>
>> That's why I thought my patch series made sense.
>>
>> I guess another (antisocial) response would be to to return false when
>> the suspend callback for amdgpu happens and dev_err mentioning that
>> firmware support is needed for suspend.
> 
> If you want to avoid exposing s2idle at all in some cases, you need to
> change the sysfs I/F in /sys/power/ so that it doesn't expose "s2idle"
> in mem_sleep or "freeze" in state at all.  

Right now they make it so that s2idle isn't exposed in "mem_sleep", but 
"freeze" is still exposed in "state".

I'd be worried about breaking userspace with taking "freeze" out of state.

> That's much more intrusive
> than just failing every s2idle attempt (which is what the patches do
> AFAICS)
As they stand right now when the BIOS is set to S3:
# cat /sys/power/mem_sleep
[deep]

# echo freeze > /sys/power/state
Returns a failure though.

And then when it's set to Modern Standby :

# cat /sys/power/mem_sleep
[s2idle]

# echo freeze > /sys/power/state
Works

> 
> Do you want to do that for platforms supporting S3, or for platforms
> that don't support any kind of suspend at all?

You know since this series went out 
6dc8265f9803ccb7e5da804e01601f0c14f270e0 was merged.  This will probably
have cleaned up the problem state that Bjoern found in the first place.

So this is really a philosophical discussion now if it's worth offering 
s2idle in /sys/power/mem_sleep on AMD systems.

Admittedly this has been an APU notebook oriented discussion.  Platforms 
that don't support any kind of suspend (like servers) will get caught up 
in this and could no longer do s2idle either.

With the assumption that 6dc8265f9803ccb7e5da804e01601f0c14f270e0 helps 
the problem state I'm leaning back on we should add some warnings to let 
people know how to help themselves for power consumption if they did this.

We can probably put those in amdgpu though.
Rafael J. Wysocki Jan. 11, 2022, 7:48 p.m. UTC | #11
On Tue, Jan 11, 2022 at 8:35 PM Limonciello, Mario
<mario.limonciello@amd.com> wrote:
>
> >>
> >> That's why I thought my patch series made sense.
> >>
> >> I guess another (antisocial) response would be to to return false when
> >> the suspend callback for amdgpu happens and dev_err mentioning that
> >> firmware support is needed for suspend.
> >
> > If you want to avoid exposing s2idle at all in some cases, you need to
> > change the sysfs I/F in /sys/power/ so that it doesn't expose "s2idle"
> > in mem_sleep or "freeze" in state at all.
>
> Right now they make it so that s2idle isn't exposed in "mem_sleep", but
> "freeze" is still exposed in "state".
>
> I'd be worried about breaking userspace with taking "freeze" out of state.
>
> > That's much more intrusive
> > than just failing every s2idle attempt (which is what the patches do
> > AFAICS)
> As they stand right now when the BIOS is set to S3:
> # cat /sys/power/mem_sleep
> [deep]
>
> # echo freeze > /sys/power/state
> Returns a failure though.
>
> And then when it's set to Modern Standby :
>
> # cat /sys/power/mem_sleep
> [s2idle]
>
> # echo freeze > /sys/power/state
> Works
>
> >
> > Do you want to do that for platforms supporting S3, or for platforms
> > that don't support any kind of suspend at all?
>
> You know since this series went out
> 6dc8265f9803ccb7e5da804e01601f0c14f270e0 was merged.  This will probably
> have cleaned up the problem state that Bjoern found in the first place.
>
> So this is really a philosophical discussion now if it's worth offering
> s2idle in /sys/power/mem_sleep on AMD systems.

Well, if "freeze" is present in "state", then "s2idle" needs to be
there in "mem_sleep", because they are clearly documented as
alternative paths to the same functionality.

> Admittedly this has been an APU notebook oriented discussion.  Platforms
> that don't support any kind of suspend (like servers) will get caught up
> in this and could no longer do s2idle either.
>
> With the assumption that 6dc8265f9803ccb7e5da804e01601f0c14f270e0 helps
> the problem state I'm leaning back on we should add some warnings to let
> people know how to help themselves for power consumption if they did this.
>
> We can probably put those in amdgpu though.

Sounds like a better direction to me, honestly.