pciehp 0000:00:1c.0:pcie004: Timeout on hotplug command 0x1038 (issued 65284 msec ago)

On Thu, May 03, 2018 at 10:49:24AM +0200, Paul Menzel wrote:
> On 04/27/18 21:22, Bjorn Helgaas wrote:
> > [+cc Lukas, Sinan]
> 
> > On Thu, Apr 26, 2018 at 12:17:53PM +0200, Paul Menzel wrote:
> 
> > > On the Lenovo X60t, during resume from ACPI suspend and during shutdown, the
> > > message below is shown in the logs.
> > > 
> > >      pciehp 0000:00:1c.0:pcie004: Timeout on hotplug command 0x1038 (issued
> > > 65284 msec ago)
> > 
> > This is an Intel root port:
> > 
> >    00:1c.0 PCI bridge: Intel Corporation NM10/ICH7 Family PCI Express Port 1 (rev 02) (prog-if 00 [Normal decode])
> > 
> > and probably has the CF118 erratum (see
> > http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=3461a068661c
> > for details).  I bet if you changed "msecs" in pcie_wait_cmd() to 30000
> > you'd see a 30 second delay during shutdown because we write a command to
> > tell the port not to generate any more hotplug interrupts, and we wait for
> > that command to complete, but the port never tells us it has completed.
> > 
> > Lukas reported a similar issue in
> > https://lkml.kernel.org/r/20180112104929.GA10599@wunner.de, which we sort
> > of worked around by assuming that Thunderbolt controllers never support
> > that "command complete" interrupt (see
> > http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=493fb50e958c)
> > 
> > Sinan mooted the idea of using a "no-wait" path of sending the "don't
> > generate hotplug interrupts" command.  I think we should work on this
> > idea a little more.  If we're shutting down the whole system, I can't
> > believe there's much value in *anything* we do in the pciehp_remove()
> > path.
> > 
> > Maybe we should just get rid of pciehp_remove() (and probably
> > pcie_port_remove_service() and the other service driver remove methods)
> > completely.  That dates from when the service drivers could be modules that
> > could be potentially unloaded, but unloading them hasn't been possible for
> > years.
> > 
> > As far as the resume path, my guess is that in pciehp_resume(), we
> > write a command to enable interrupts, then it looks like we get a
> > PCI_EXP_SLTSTA_DLLSC "Link Up" interrupt, and apparently we issue
> > another command.  Not sure exactly what's going on here.

> Thank you for the quick reply and sorry for only being able to test it now.
> Please find the relevant bits from the ACPI S3 suspend “action” below. The
> full log is attached.

No problem.  I think we need to bite the bullet and just do a quirk
for the Intel erratum.  I tried to avoid it by waiting for command
completion lazily, but I think that ended up being unnecessarily
clever and it didn't even solve the whole problem.

Can you try the patch below?  I think it should solve the problem
you're seeing.

commit ec48a1e0b91ce68903c8ea4dce659d4fdf17ad06
Author: Bjorn Helgaas <bhelgaas@google.com>
Date:   Thu May 3 18:39:38 2018 -0500

    PCI: pciehp: Add quirk for Intel Command Completed erratum

    The Intel CF118 erratum means the controller does not set the Command
    Completed bit unless writes to the Slot Command register change "Control"
    bits.  Command Completed is never set for writes that only change software
    notification "Enable" bits.  This results in timeouts like this:

      pciehp 0000:00:1c.0:pcie004: Timeout on hotplug command 0x1038 (issued 65284 msec ago)

    When this erratum is present, avoid these timeouts by marking commands
    "completed" immediately unless they change the "Control" bits.

    Here's the text of the erratum from the Intel document:

      CF118        PCIe Slot Status Register Command Completed bit not always
                   updated on any configuration write to the Slot Control
                   Register

      Problem:     For PCIe root ports (devices 0 - 10) supporting hot-plug,
                   the Slot Status Register (offset AAh) Command Completed
                   (bit[4]) status is updated under the following condition:
                   IOH will set Command Completed bit after delivering the new
                   commands written in the Slot Controller register (offset
                   A8h) to VPP. The IOH detects new commands written in Slot
                   Control register by checking the change of value for Power
                   Controller Control (bit[10]), Power Indicator Control
                   (bits[9:8]), Attention Indicator Control (bits[7:6]), or
                   Electromechanical Interlock Control (bit[11]) fields. Any
                   other configuration writes to the Slot Control register
                   without changing the values of these fields will not cause
                   Command Completed bit to be set.

                   The PCIe Base Specification Revision 2.0 or later describes
                   the “Slot Control Register” in section 7.8.10, as follows
                   (Reference section 7.8.10, Slot Control Register, Offset
                   18h). In hot-plug capable Downstream Ports, a write to the
                   Slot Control register must cause a hot-plug command to be
                   generated (see Section 6.7.3.2 for details on hot-plug
                   commands). A write to the Slot Control register in a
                   Downstream Port that is not hotplug capable must not cause a
                   hot-plug command to be executed.

                   The PCIe Spec intended that every write to the Slot Control
                   Register is a command and expected a command complete status
                   to abstract the VPP implementation specific nuances from the
                   OS software. IOH PCIe Slot Control Register implementation
                   is not fully conforming to the PCIe Specification in this
                   respect.

      Implication: Software checking on the Command Completed status after
                   writing to the Slot Control register may time out.

      Workaround:  Software can read the Slot Control register and compare the
                   existing and new values to determine if it should check the
                   Command Completed status after writing to the Slot Control
                   register.

    Link: http://www.intel.com/content/www/us/en/processors/xeon/xeon-e7-v2-spec-update.html
    Link: https://lkml.kernel.org/r/8770820b-85a0-172b-7230-3a44524e6c9f@molgen.mpg.de
    Reported-by: Paul Menzel <pmenzel+linux-pci@molgen.mpg.de>
    Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>

Message ID	20180504024527.GE15790@bhelgaas-glaptop.roam.corp.google.com (mailing list archive)
State	New, archived
Delegated to:	Bjorn Helgaas
Headers	show Return-Path: <linux-pci-owner@kernel.org> Date: Thu, 3 May 2018 21:45:27 -0500 From: Bjorn Helgaas <helgaas@kernel.org> To: Paul Menzel <pmenzel+linux-pci@molgen.mpg.de> Cc: Bjorn Helgaas <bhelgaas@google.com>, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org, Lukas Wunner <lukas@wunner.de>, Sinan Kaya <okaya@codeaurora.org> Subject: Re: pciehp 0000:00:1c.0:pcie004: Timeout on hotplug command 0x1038 (issued 65284 msec ago) Message-ID: <20180504024527.GE15790@bhelgaas-glaptop.roam.corp.google.com> References: <8770820b-85a0-172b-7230-3a44524e6c9f@molgen.mpg.de> <20180427192207.GG8199@bhelgaas-glaptop.roam.corp.google.com> <43b8ab4a-f8ee-dc96-40ec-e6fdfedd8309@molgen.mpg.de> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <43b8ab4a-f8ee-dc96-40ec-e6fdfedd8309@molgen.mpg.de> User-Agent: Mutt/1.9.2 (2017-12-15) Sender: linux-pci-owner@vger.kernel.org Precedence: bulk

pciehp 0000:00:1c.0:pcie004: Timeout on hotplug command 0x1038 (issued 65284 msec ago)

Commit Message

Comments

Patch