diff mbox

PCI / ACPI: Do not set ACPI companions for host bridges with parents

Message ID 2625308.mOex6suvmm@vostro.rjw.lan (mailing list archive)
State New, archived
Delegated to: Bjorn Helgaas
Headers show

Commit Message

Rafael J. Wysocki May 26, 2015, 2:17 a.m. UTC
On Tuesday, May 26, 2015 03:08:17 AM Rafael J. Wysocki wrote:
> On Tuesday, May 26, 2015 01:42:16 AM Rafael J. Wysocki wrote:
> > On Tuesday, May 26, 2015 01:22:12 AM Rafael J. Wysocki wrote:
> > > On Friday, May 22, 2015 09:53:37 PM Boris Ostrovsky wrote:
> > > > On 05/22/2015 04:11 AM, Sander Eikelenboom wrote:
> > > > > Hello Sander,
> > > > >
> > 
> > [cut]
> > 
> > > > (+Rafael again)
> > > > 
> > > > So the immediate cause of those errors is that pdev->evtchn is 0. 
> > > > Backend is not notified and things not go well then.
> > > > 
> > > > And it is indeed caused by 97badf873ab60e841243b66133ff9eff2a46ef29:
> > > > 
> > > > We allocate pcifront_sd in pcifront_scan_root() and then pass it to 
> > > > pci_scan_bus_parented() as sysdata. Eventually this sysdata is used in 
> > > > pcibios_root_bridge_prepare() as pci_sysdata. It is dereferenced as 
> > > > pci_sysdata->companion (which I believe is aliased to pcifront_sd->pdev)
> > 
> > Well, there is an int node field between them, so I'm not sure.
> > 
> > > > and then set_primary_fwnode() writes it, thus corrupting 
> > > > pcifront_sd->pdev (and I think this is what sets evtchn to zero).
> > 
> > So the corruption happens when set_primary_fwnode() writes NULL to the
> > 'secondary' field of object pointed to by 'fwnode'.
> > 
> > This isn't strictly necessary and we might avoid the crash by only
> > writing to fwnode->secondary if fn is not NULL.
> > 
> > So, Sander please test the patch below too if possible.
> > 
> > Of course, that doesn't solve a problem of passing an incorrect pointer
> > to ACPI_COMPANION_SET() in pcibios_root_bridge_prepare().
> 
> And here's one more thing to test.

And the below is how I'd fix it, so you can simply test this patch and skip the
previous ones.

---
From: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Subject: PCI / ACPI: Do not set ACPI companions for host bridges with parents

Commit 97badf873ab6 (device property: Make it possible to use
secondary firmware nodes) uncovered a bug in the x86 (and ia64) PCI
host bridge initialization code that assumes bridge->bus->sysdata
to always point to a struct pci_sysdata object which need not be
the case (in particular, the Xen PCI frontend driver sets it to point
to a different data type).  If it is not the case, an incorrect
pointer (or a piece of data that is not a pointer at all) will be
passed to ACPI_COMPANION_SET() and that may cause interesting
breakage to happen going forward.

To work around this problem use the observation that the ACPI
host bridge initialization always passes NULL as parent to
pci_create_root_bus(), so if pcibios_root_bridge_prepare() sees
a non-NULL parent of the bridge, it should not attempt to set
an ACPI companion for it, because that means that
pci_create_root_bus() has been called by someone else.

Reported-by: Sander Eikelenboom <linux@eikelenboom.it>
Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
---
 arch/ia64/pci/pci.c |   13 ++++++++++---
 arch/x86/pci/acpi.c |   13 ++++++++++---
 2 files changed, 20 insertions(+), 6 deletions(-)


--
To unsubscribe from this list: send the line "unsubscribe linux-pci" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Comments

Sander Eikelenboom May 26, 2015, 7:53 a.m. UTC | #1
Tuesday, May 26, 2015, 4:17:05 AM, you wrote:

> On Tuesday, May 26, 2015 03:08:17 AM Rafael J. Wysocki wrote:
>> On Tuesday, May 26, 2015 01:42:16 AM Rafael J. Wysocki wrote:
>> > On Tuesday, May 26, 2015 01:22:12 AM Rafael J. Wysocki wrote:
>> > > On Friday, May 22, 2015 09:53:37 PM Boris Ostrovsky wrote:
>> > > > On 05/22/2015 04:11 AM, Sander Eikelenboom wrote:
>> > > > > Hello Sander,
>> > > > >
>> > 
>> > [cut]
>> > 
>> > > > (+Rafael again)
>> > > > 
>> > > > So the immediate cause of those errors is that pdev->evtchn is 0. 
>> > > > Backend is not notified and things not go well then.
>> > > > 
>> > > > And it is indeed caused by 97badf873ab60e841243b66133ff9eff2a46ef29:
>> > > > 
>> > > > We allocate pcifront_sd in pcifront_scan_root() and then pass it to 
>> > > > pci_scan_bus_parented() as sysdata. Eventually this sysdata is used in 
>> > > > pcibios_root_bridge_prepare() as pci_sysdata. It is dereferenced as 
>> > > > pci_sysdata->companion (which I believe is aliased to pcifront_sd->pdev)
>> > 
>> > Well, there is an int node field between them, so I'm not sure.
>> > 
>> > > > and then set_primary_fwnode() writes it, thus corrupting 
>> > > > pcifront_sd->pdev (and I think this is what sets evtchn to zero).
>> > 
>> > So the corruption happens when set_primary_fwnode() writes NULL to the
>> > 'secondary' field of object pointed to by 'fwnode'.
>> > 
>> > This isn't strictly necessary and we might avoid the crash by only
>> > writing to fwnode->secondary if fn is not NULL.
>> > 
>> > So, Sander please test the patch below too if possible.
>> > 
>> > Of course, that doesn't solve a problem of passing an incorrect pointer
>> > to ACPI_COMPANION_SET() in pcibios_root_bridge_prepare().
>> 
>> And here's one more thing to test.

> And the below is how I'd fix it, so you can simply test this patch and skip the
> previous ones.

Hi Rafael,

Just tested it, works for me, thanks !

--
Sander


> ---
> From: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
> Subject: PCI / ACPI: Do not set ACPI companions for host bridges with parents

> Commit 97badf873ab6 (device property: Make it possible to use
> secondary firmware nodes) uncovered a bug in the x86 (and ia64) PCI
> host bridge initialization code that assumes bridge->bus->sysdata
> to always point to a struct pci_sysdata object which need not be
> the case (in particular, the Xen PCI frontend driver sets it to point
> to a different data type).  If it is not the case, an incorrect
> pointer (or a piece of data that is not a pointer at all) will be
> passed to ACPI_COMPANION_SET() and that may cause interesting
> breakage to happen going forward.

> To work around this problem use the observation that the ACPI
> host bridge initialization always passes NULL as parent to
> pci_create_root_bus(), so if pcibios_root_bridge_prepare() sees
> a non-NULL parent of the bridge, it should not attempt to set
> an ACPI companion for it, because that means that
> pci_create_root_bus() has been called by someone else.

> Reported-by: Sander Eikelenboom <linux@eikelenboom.it>
> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
> ---
>  arch/ia64/pci/pci.c |   13 ++++++++++---
>  arch/x86/pci/acpi.c |   13 ++++++++++---
>  2 files changed, 20 insertions(+), 6 deletions(-)

> Index: linux-pm/arch/x86/pci/acpi.c
> ===================================================================
> --- linux-pm.orig/arch/x86/pci/acpi.c
> +++ linux-pm/arch/x86/pci/acpi.c
> @@ -482,9 +482,16 @@ struct pci_bus *pci_acpi_scan_root(struc
>  
>  int pcibios_root_bridge_prepare(struct pci_host_bridge *bridge)
>  {
> -       struct pci_sysdata *sd = bridge->bus->sysdata;
> -
> -       ACPI_COMPANION_SET(&bridge->dev, sd->companion);
> +       /*
> +        * We pass NULL as parent to pci_create_root_bus(), so if it is not NULL
> +        * here, pci_create_root_bus() has been called by someone else and
> +        * sysdata is likely to be different from what we expect.  Let it go in
> +        * that case.
> +        */
> +       if (!bridge->dev.parent) {
> +               struct pci_sysdata *sd = bridge->bus->sysdata;
> +               ACPI_COMPANION_SET(&bridge->dev, sd->companion);
> +       }
>         return 0;
>  }
>  
> Index: linux-pm/arch/ia64/pci/pci.c
> ===================================================================
> --- linux-pm.orig/arch/ia64/pci/pci.c
> +++ linux-pm/arch/ia64/pci/pci.c
> @@ -478,9 +478,16 @@ struct pci_bus *pci_acpi_scan_root(struc
>  
>  int pcibios_root_bridge_prepare(struct pci_host_bridge *bridge)
>  {
> -       struct pci_controller *controller = bridge->bus->sysdata;
> -
> -       ACPI_COMPANION_SET(&bridge->dev, controller->companion);
> +       /*
> +        * We pass NULL as parent to pci_create_root_bus(), so if it is not NULL
> +        * here, pci_create_root_bus() has been called by someone else and
> +        * sysdata is likely to be different from what we expect.  Let it go in
> +        * that case.
> +        */
> +       if (!bridge->dev.parent) {
> +               struct pci_controller *controller = bridge->bus->sysdata;
> +               ACPI_COMPANION_SET(&bridge->dev, controller->companion);
> +       }
>         return 0;
>  }
>  



--
To unsubscribe from this list: send the line "unsubscribe linux-pci" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Bjorn Helgaas May 27, 2015, 10:58 p.m. UTC | #2
On Tue, May 26, 2015 at 04:17:05AM +0200, Rafael J. Wysocki wrote:
> On Tuesday, May 26, 2015 03:08:17 AM Rafael J. Wysocki wrote:
> > On Tuesday, May 26, 2015 01:42:16 AM Rafael J. Wysocki wrote:
> > > On Tuesday, May 26, 2015 01:22:12 AM Rafael J. Wysocki wrote:
> > > > On Friday, May 22, 2015 09:53:37 PM Boris Ostrovsky wrote:
> > > > > On 05/22/2015 04:11 AM, Sander Eikelenboom wrote:
> > > > > > Hello Sander,
> > > > > >
> > > 
> > > [cut]
> > > 
> > > > > (+Rafael again)
> > > > > 
> > > > > So the immediate cause of those errors is that pdev->evtchn is 0. 
> > > > > Backend is not notified and things not go well then.
> > > > > 
> > > > > And it is indeed caused by 97badf873ab60e841243b66133ff9eff2a46ef29:
> > > > > 
> > > > > We allocate pcifront_sd in pcifront_scan_root() and then pass it to 
> > > > > pci_scan_bus_parented() as sysdata. Eventually this sysdata is used in 
> > > > > pcibios_root_bridge_prepare() as pci_sysdata. It is dereferenced as 
> > > > > pci_sysdata->companion (which I believe is aliased to pcifront_sd->pdev)
> > > 
> > > Well, there is an int node field between them, so I'm not sure.
> > > 
> > > > > and then set_primary_fwnode() writes it, thus corrupting 
> > > > > pcifront_sd->pdev (and I think this is what sets evtchn to zero).
> > > 
> > > So the corruption happens when set_primary_fwnode() writes NULL to the
> > > 'secondary' field of object pointed to by 'fwnode'.
> > > 
> > > This isn't strictly necessary and we might avoid the crash by only
> > > writing to fwnode->secondary if fn is not NULL.
> > > 
> > > So, Sander please test the patch below too if possible.
> > > 
> > > Of course, that doesn't solve a problem of passing an incorrect pointer
> > > to ACPI_COMPANION_SET() in pcibios_root_bridge_prepare().
> > 
> > And here's one more thing to test.
> 
> And the below is how I'd fix it, so you can simply test this patch and skip the
> previous ones.
> 
> ---
> From: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
> Subject: PCI / ACPI: Do not set ACPI companions for host bridges with parents
> 
> Commit 97badf873ab6 (device property: Make it possible to use
> secondary firmware nodes) uncovered a bug in the x86 (and ia64) PCI
> host bridge initialization code that assumes bridge->bus->sysdata
> to always point to a struct pci_sysdata object which need not be
> the case (in particular, the Xen PCI frontend driver sets it to point
> to a different data type).  If it is not the case, an incorrect
> pointer (or a piece of data that is not a pointer at all) will be
> passed to ACPI_COMPANION_SET() and that may cause interesting
> breakage to happen going forward.
> 
> To work around this problem use the observation that the ACPI
> host bridge initialization always passes NULL as parent to
> pci_create_root_bus(), so if pcibios_root_bridge_prepare() sees
> a non-NULL parent of the bridge, it should not attempt to set
> an ACPI companion for it, because that means that
> pci_create_root_bus() has been called by someone else.
> 
> Reported-by: Sander Eikelenboom <linux@eikelenboom.it>
> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>

Do you want to merge this, Rafael?

Acked-by: Bjorn Helgaas <bhelgaas@google.com>

> ---
>  arch/ia64/pci/pci.c |   13 ++++++++++---
>  arch/x86/pci/acpi.c |   13 ++++++++++---
>  2 files changed, 20 insertions(+), 6 deletions(-)
> 
> Index: linux-pm/arch/x86/pci/acpi.c
> ===================================================================
> --- linux-pm.orig/arch/x86/pci/acpi.c
> +++ linux-pm/arch/x86/pci/acpi.c
> @@ -482,9 +482,16 @@ struct pci_bus *pci_acpi_scan_root(struc
>  
>  int pcibios_root_bridge_prepare(struct pci_host_bridge *bridge)
>  {
> -	struct pci_sysdata *sd = bridge->bus->sysdata;
> -
> -	ACPI_COMPANION_SET(&bridge->dev, sd->companion);
> +	/*
> +	 * We pass NULL as parent to pci_create_root_bus(), so if it is not NULL
> +	 * here, pci_create_root_bus() has been called by someone else and
> +	 * sysdata is likely to be different from what we expect.  Let it go in
> +	 * that case.
> +	 */
> +	if (!bridge->dev.parent) {
> +		struct pci_sysdata *sd = bridge->bus->sysdata;
> +		ACPI_COMPANION_SET(&bridge->dev, sd->companion);
> +	}
>  	return 0;
>  }
>  
> Index: linux-pm/arch/ia64/pci/pci.c
> ===================================================================
> --- linux-pm.orig/arch/ia64/pci/pci.c
> +++ linux-pm/arch/ia64/pci/pci.c
> @@ -478,9 +478,16 @@ struct pci_bus *pci_acpi_scan_root(struc
>  
>  int pcibios_root_bridge_prepare(struct pci_host_bridge *bridge)
>  {
> -	struct pci_controller *controller = bridge->bus->sysdata;
> -
> -	ACPI_COMPANION_SET(&bridge->dev, controller->companion);
> +	/*
> +	 * We pass NULL as parent to pci_create_root_bus(), so if it is not NULL
> +	 * here, pci_create_root_bus() has been called by someone else and
> +	 * sysdata is likely to be different from what we expect.  Let it go in
> +	 * that case.
> +	 */
> +	if (!bridge->dev.parent) {
> +		struct pci_controller *controller = bridge->bus->sysdata;
> +		ACPI_COMPANION_SET(&bridge->dev, controller->companion);
> +	}
>  	return 0;
>  }
>  
> 
--
To unsubscribe from this list: send the line "unsubscribe linux-pci" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Rafael J. Wysocki May 27, 2015, 11:18 p.m. UTC | #3
On Thu, May 28, 2015 at 12:58 AM, Bjorn Helgaas <bhelgaas@google.com> wrote:
> On Tue, May 26, 2015 at 04:17:05AM +0200, Rafael J. Wysocki wrote:
>> On Tuesday, May 26, 2015 03:08:17 AM Rafael J. Wysocki wrote:
>> > On Tuesday, May 26, 2015 01:42:16 AM Rafael J. Wysocki wrote:
>> > > On Tuesday, May 26, 2015 01:22:12 AM Rafael J. Wysocki wrote:
>> > > > On Friday, May 22, 2015 09:53:37 PM Boris Ostrovsky wrote:
>> > > > > On 05/22/2015 04:11 AM, Sander Eikelenboom wrote:
>> > > > > > Hello Sander,
>> > > > > >
>> > >
>> > > [cut]
>> > >
>> > > > > (+Rafael again)
>> > > > >
>> > > > > So the immediate cause of those errors is that pdev->evtchn is 0.
>> > > > > Backend is not notified and things not go well then.
>> > > > >
>> > > > > And it is indeed caused by 97badf873ab60e841243b66133ff9eff2a46ef29:
>> > > > >
>> > > > > We allocate pcifront_sd in pcifront_scan_root() and then pass it to
>> > > > > pci_scan_bus_parented() as sysdata. Eventually this sysdata is used in
>> > > > > pcibios_root_bridge_prepare() as pci_sysdata. It is dereferenced as
>> > > > > pci_sysdata->companion (which I believe is aliased to pcifront_sd->pdev)
>> > >
>> > > Well, there is an int node field between them, so I'm not sure.
>> > >
>> > > > > and then set_primary_fwnode() writes it, thus corrupting
>> > > > > pcifront_sd->pdev (and I think this is what sets evtchn to zero).
>> > >
>> > > So the corruption happens when set_primary_fwnode() writes NULL to the
>> > > 'secondary' field of object pointed to by 'fwnode'.
>> > >
>> > > This isn't strictly necessary and we might avoid the crash by only
>> > > writing to fwnode->secondary if fn is not NULL.
>> > >
>> > > So, Sander please test the patch below too if possible.
>> > >
>> > > Of course, that doesn't solve a problem of passing an incorrect pointer
>> > > to ACPI_COMPANION_SET() in pcibios_root_bridge_prepare().
>> >
>> > And here's one more thing to test.
>>
>> And the below is how I'd fix it, so you can simply test this patch and skip the
>> previous ones.
>>
>> ---
>> From: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
>> Subject: PCI / ACPI: Do not set ACPI companions for host bridges with parents
>>
>> Commit 97badf873ab6 (device property: Make it possible to use
>> secondary firmware nodes) uncovered a bug in the x86 (and ia64) PCI
>> host bridge initialization code that assumes bridge->bus->sysdata
>> to always point to a struct pci_sysdata object which need not be
>> the case (in particular, the Xen PCI frontend driver sets it to point
>> to a different data type).  If it is not the case, an incorrect
>> pointer (or a piece of data that is not a pointer at all) will be
>> passed to ACPI_COMPANION_SET() and that may cause interesting
>> breakage to happen going forward.
>>
>> To work around this problem use the observation that the ACPI
>> host bridge initialization always passes NULL as parent to
>> pci_create_root_bus(), so if pcibios_root_bridge_prepare() sees
>> a non-NULL parent of the bridge, it should not attempt to set
>> an ACPI companion for it, because that means that
>> pci_create_root_bus() has been called by someone else.
>>
>> Reported-by: Sander Eikelenboom <linux@eikelenboom.it>
>> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
>
> Do you want to merge this, Rafael?

I can do that.

> Acked-by: Bjorn Helgaas <bhelgaas@google.com>

Thanks!
--
To unsubscribe from this list: send the line "unsubscribe linux-pci" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
diff mbox

Patch

Index: linux-pm/arch/x86/pci/acpi.c
===================================================================
--- linux-pm.orig/arch/x86/pci/acpi.c
+++ linux-pm/arch/x86/pci/acpi.c
@@ -482,9 +482,16 @@  struct pci_bus *pci_acpi_scan_root(struc
 
 int pcibios_root_bridge_prepare(struct pci_host_bridge *bridge)
 {
-	struct pci_sysdata *sd = bridge->bus->sysdata;
-
-	ACPI_COMPANION_SET(&bridge->dev, sd->companion);
+	/*
+	 * We pass NULL as parent to pci_create_root_bus(), so if it is not NULL
+	 * here, pci_create_root_bus() has been called by someone else and
+	 * sysdata is likely to be different from what we expect.  Let it go in
+	 * that case.
+	 */
+	if (!bridge->dev.parent) {
+		struct pci_sysdata *sd = bridge->bus->sysdata;
+		ACPI_COMPANION_SET(&bridge->dev, sd->companion);
+	}
 	return 0;
 }
 
Index: linux-pm/arch/ia64/pci/pci.c
===================================================================
--- linux-pm.orig/arch/ia64/pci/pci.c
+++ linux-pm/arch/ia64/pci/pci.c
@@ -478,9 +478,16 @@  struct pci_bus *pci_acpi_scan_root(struc
 
 int pcibios_root_bridge_prepare(struct pci_host_bridge *bridge)
 {
-	struct pci_controller *controller = bridge->bus->sysdata;
-
-	ACPI_COMPANION_SET(&bridge->dev, controller->companion);
+	/*
+	 * We pass NULL as parent to pci_create_root_bus(), so if it is not NULL
+	 * here, pci_create_root_bus() has been called by someone else and
+	 * sysdata is likely to be different from what we expect.  Let it go in
+	 * that case.
+	 */
+	if (!bridge->dev.parent) {
+		struct pci_controller *controller = bridge->bus->sysdata;
+		ACPI_COMPANION_SET(&bridge->dev, controller->companion);
+	}
 	return 0;
 }