diff mbox

Linux 3.7-rc1 (nouveau_bios_score oops).

Message ID 20121020221948.GE5826@joi.lan (mailing list archive)
State New, archived
Headers show

Commit Message

Marcin Ślusarz Oct. 20, 2012, 10:19 p.m. UTC
On Sat, Oct 20, 2012 at 11:20:36PM +0200, Heinz Diehl wrote:
> On 20.10.2012, Marcin Slusarz wrote: 
> 
> > Try this one.
> 
> It works, now I can boot again. However, nouveau seems to be dead now.
> The dmesg output with your patch on top of 3.7-rc1 is:
> 
> [    3.685909] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0
> [    3.687784] nouveau  [  DEVICE][0000:01:00.0] BOOT0  : 0x0a8800b1
> [    3.689960] nouveau  [  DEVICE][0000:01:00.0] Chipset: GT218 (NVA8)
> [    3.692471] nouveau  [  DEVICE][0000:01:00.0] Family : NV50
> [    3.695716] nouveau  [   VBIOS][0000:01:00.0] checking PRAMIN for image...
> [    3.697087] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> [    3.698471] nouveau  [   VBIOS][0000:01:00.0] checking PROM for image...
> [    3.699838] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> [    3.701223] nouveau  [   VBIOS][0000:01:00.0] checking ACPI for image...
> [    3.702684] ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
> [    3.704139] ACPI Error: Method parse/execution failed[\_SB_.PCI0.PEG1.GFX0._ROM] (Node ffff880142e85cf8), AE_AML_REGION_LIMIT (20120913/psparse-536)
> [    3.716183] failed to evaluate ROM got AE_AML_REGION_LIMIT
> [    3.718776] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> [    3.721349] nouveau  [   VBIOS][0000:01:00.0] checking PCIROM for image...
> [    3.724111] nouveau 0000:01:00.0: Invalid ROM contents
> [    3.726663] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> [    3.729159] nouveau E[   VBIOS][0000:01:00.0] unable to locate usable image
> [    3.731677] nouveau E[  DEVICE][0000:01:00.0] failed to create 0x10000001, -22
> [    3.734231] nouveau E[     DRM] failed to create 0x80000080, -22
> [    3.736097] nouveau: probe of 0000:01:00.0 failed with error -22
> [    3.740523] dracut: Starting plymouth daemon

Hmm, maybe we can't fetch 3 bytes only...

Let's check this:

---
---

Please attach acpidump output.

Comments

Pawe? Sikora Oct. 21, 2012, 6:58 a.m. UTC | #1
On Sunday 21 of October 2012 00:19:48 Marcin Slusarz wrote:
> On Sat, Oct 20, 2012 at 11:20:36PM +0200, Heinz Diehl wrote:
> > On 20.10.2012, Marcin Slusarz wrote: 
> > 
> > > Try this one.
> > 
> > It works, now I can boot again. However, nouveau seems to be dead now.
> > The dmesg output with your patch on top of 3.7-rc1 is:
> > 
> > [    3.685909] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0
> > [    3.687784] nouveau  [  DEVICE][0000:01:00.0] BOOT0  : 0x0a8800b1
> > [    3.689960] nouveau  [  DEVICE][0000:01:00.0] Chipset: GT218 (NVA8)
> > [    3.692471] nouveau  [  DEVICE][0000:01:00.0] Family : NV50
> > [    3.695716] nouveau  [   VBIOS][0000:01:00.0] checking PRAMIN for image...
> > [    3.697087] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.698471] nouveau  [   VBIOS][0000:01:00.0] checking PROM for image...
> > [    3.699838] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.701223] nouveau  [   VBIOS][0000:01:00.0] checking ACPI for image...
> > [    3.702684] ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
> > [    3.704139] ACPI Error: Method parse/execution failed[\_SB_.PCI0.PEG1.GFX0._ROM] (Node ffff880142e85cf8), AE_AML_REGION_LIMIT (20120913/psparse-536)
> > [    3.716183] failed to evaluate ROM got AE_AML_REGION_LIMIT
> > [    3.718776] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.721349] nouveau  [   VBIOS][0000:01:00.0] checking PCIROM for image...
> > [    3.724111] nouveau 0000:01:00.0: Invalid ROM contents
> > [    3.726663] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.729159] nouveau E[   VBIOS][0000:01:00.0] unable to locate usable image
> > [    3.731677] nouveau E[  DEVICE][0000:01:00.0] failed to create 0x10000001, -22
> > [    3.734231] nouveau E[     DRM] failed to create 0x80000080, -22
> > [    3.736097] nouveau: probe of 0000:01:00.0 failed with error -22
> > [    3.740523] dracut: Starting plymouth daemon
> 
> Hmm, maybe we can't fetch 3 bytes only...
> 
> Let's check this:
> 
> ---
> diff --git a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> index 824eea0..8bd71aa 100644
> --- a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> +++ b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> @@ -192,14 +192,16 @@ nouveau_bios_shadow_acpi(struct nouveau_bios *bios)
>  {
>  	struct pci_dev *pdev = nv_device(bios)->pdev;
>  	int ret, cnt, i;
> -	u8  data[3];
> +	u8  *data;
>  
>  	if (!nouveau_acpi_rom_supported(pdev))
>  		return;
>  
>  	bios->size = 0;
> -	if (nouveau_acpi_get_bios_chunk(data, 0, 3) == 3)
> +	data = kmalloc(4096, GFP_KERNEL);
> +	if (data && nouveau_acpi_get_bios_chunk(data, 0, 4096) >= 3)
>  		bios->size = data[2] * 512;
> +	kfree(data);
>  	if (!bios->size)
>  		return;

with these both patches applied my laptop boots and gui works fine.
here's dmesg:

(...)
[    8.751795] nouveau  [   VBIOS][0000:01:00.0] ... appears to be valid
[    8.751798] nouveau  [   VBIOS][0000:01:00.0] using image from ACPI
[    8.751895] nouveau  [   VBIOS][0000:01:00.0] BIT signature found
[    8.751898] nouveau  [   VBIOS][0000:01:00.0] version 70.08.45.00
[    8.752366] nouveau  [ DEVINIT][0000:01:00.0] adaptor not initialised
[    8.752368] nouveau  [   VBIOS][0000:01:00.0] running init tables
[    8.867660] nouveau  [     MXM][0000:01:00.0] no VBIOS data, nothing to do
[    8.867690] nouveau  [     PFB][0000:01:00.0] RAM type: DDR3
[    8.867692] nouveau  [     PFB][0000:01:00.0] RAM size: 512 MiB
[    8.901523] vga_switcheroo: enabled
[    8.901979] [TTM] Zone  kernel: Available graphics memory: 6104282 kiB
[    8.901980] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
[    8.901982] [TTM] Initializing pool allocator
[    8.902014] [TTM] Initializing DMA pool allocator
[    8.902180] mtrr: type mismatch for c0000000,10000000 old: write-back new: write-combining
[    8.902184] nouveau  [     DRM] VRAM: 512 MiB
[    8.902185] nouveau  [     DRM] GART: 512 MiB
[    8.902188] nouveau  [     DRM] BIT BIOS found
[    8.902190] nouveau  [     DRM] Bios version 70.08.45.00
[    8.902192] nouveau  [     DRM] TMDS table version 2.0
[    8.902194] nouveau  [     DRM] DCB version 4.0
[    8.902196] nouveau  [     DRM] DCB outp 00: 02011300 00000000
[    8.902198] nouveau  [     DRM] DCB conn 01: 00000100
[    8.903540] [drm] Supports vblank timestamp caching Rev 1 (10.10.2010).
[    8.903541] [drm] No driver support for vblank timestamp query.
[    8.903545] nouveau  [     DRM] ACPI backlight interface available, not registering our own
[    8.903938] nouveau  [     DRM] 3 available performance level(s)
[    8.903941] nouveau  [     DRM] 0: core 50MHz shader 101MHz memory 135MHz voltage 830mV
[    8.903943] nouveau  [     DRM] 1: core 202MHz shader 405MHz memory 324MHz voltage 830mV
[    8.903946] nouveau  [     DRM] 3: core 672MHz shader 1344MHz memory 900MHz voltage 980mV
[    8.903948] nouveau  [     DRM] c: core 202MHz shader 405MHz memory 324MHz voltage 980mV
[    8.908895] nouveau  [     DRM] MM: using COPY1 for buffer copies
[    8.988695] nouveau  [     DRM] allocated 1024x768 fb: 0x60000, bo ffff88033b5f1800
[    8.988955] fb1: nouveaufb frame buffer device
[    8.988959] [drm] Initialized nouveau 1.1.0 20120801 for 0000:01:00.0 on minor 1
(...)

> Please attach acpidump output.

http://pluto.agmk.net/nv/acpidump.txt
Heinz Diehl Oct. 21, 2012, 9:26 a.m. UTC | #2
On 21.10.2012, Pawe? Sikora wrote: 

> with these both patches applied my laptop boots and gui works fine.

The same here. The two patches together, applied to 3.7-rc1 let my
machine boot again. Here's the relevant dmesg cut:

[    3.702671] nouveau  [  DEVICE][0000:01:00.0] BOOT0  : 0x0a8800b1
[    3.704643] nouveau  [  DEVICE][0000:01:00.0] Chipset: GT218 (NVA8)
[    3.707003] nouveau  [  DEVICE][0000:01:00.0] Family : NV50
[    3.711393] nouveau  [   VBIOS][0000:01:00.0] checking PRAMIN for image...
[    3.712793] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
[    3.714147] nouveau  [   VBIOS][0000:01:00.0] checking PROM for image...
[    3.715578] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
[    3.716969] nouveau  [   VBIOS][0000:01:00.0] checking ACPI for image...
[    4.207512] nouveau  [   VBIOS][0000:01:00.0] ... appears to be valid
[    4.209711] nouveau  [   VBIOS][0000:01:00.0] using image from ACPI
[    4.212469] nouveau  [   VBIOS][0000:01:00.0] BIT signature found
[    4.214855] nouveau  [   VBIOS][0000:01:00.0] version 70.18.5d.00
[    4.217554] nouveau  [ DEVINIT][0000:01:00.0] adaptor not initialised
[    4.219922] nouveau  [   VBIOS][0000:01:00.0] running init tables
[    4.342202] nouveau  [     MXM][0000:01:00.0] no VBIOS data, nothing to do
[    4.343590] nouveau  [     PFB][0000:01:00.0] RAM type: DDR3
[    4.344916] nouveau  [     PFB][0000:01:00.0] RAM size: 1024 MiB
[    4.991887] vga_switcheroo: enabled
[    4.993495] [TTM] Zone  kernel: Available graphics memory: 1917720 kiB
[    4.994911] [TTM] Initializing pool allocator
[    4.996228] [TTM] Initializing DMA pool allocator
[    4.998915] nouveau  [     DRM] VRAM: 1024 MiB
[    5.000251] nouveau  [     DRM] GART: 512 MiB
[    5.001579] nouveau  [     DRM] BIT BIOS found
[    5.002904] nouveau  [     DRM] Bios version 70.18.5d.00
[    5.004229] nouveau  [     DRM] TMDS table version 2.0
[    5.005545] nouveau  [     DRM] DCB version 4.0
[    5.006810] nouveau  [     DRM] DCB outp 00: 02014300 00000000
[    5.008115] nouveau  [     DRM] DCB conn 00: 00000040
[    5.009441] nouveau  [     DRM] DCB conn 01: 00410146
[    5.010745] nouveau  [     DRM] DCB conn 02: 00001261
[    5.012019] nouveau  [     DRM] DCB conn 03: 00002330
[    5.013255] nouveau  [     DRM] DCB conn 04: 00000400
[    5.014452] nouveau  [     DRM] DCB conn 05: 00000560
[    5.052344] [drm] Supports vblank timestamp caching Rev 1 (10.10.2010).
[    5.053486] [drm] No driver support for vblank timestamp query.
[    5.054590] nouveau  [     DRM] ACPI backlight interface available,not registering our own
[    5.356822] nouveau  [     DRM] 3 available performance level(s)
[    5.358682] nouveau  [     DRM] 0: core 135MHz shader 270MHz memory 135MHz voltage 850mV
[    5.360442] nouveau  [     DRM] 1: core 405MHz shader 810MHz memory 405MHz voltage 850mV
[    5.362215] nouveau  [     DRM] 3: core 606MHz shader 1468MHz memory 667MHz voltage 1000mV
[    5.363998] nouveau  [     DRM] c: core 405MHz shader 810MHz memory 405MHz
[    5.404143] nouveau  [     DRM] MM: using COPY for buffer copies
[    5.481593] No connectors reported connected with modes
[    5.482667] [drm] Cannot find any crtc or sizes - going 1024x768
[    5.517932] nouveau  [     DRM] allocated 1024x768 fb: 0x70000, bo ffff88013a6d3400
[    5.519236] fb1: nouveaufb frame buffer device
[    5.520368] [drm] Initialized nouveau 1.1.0 20120801 for 0000:01:00.0 on minor 1
[    5.527378] dracut: Starting plymouth daemon
Marcin Ślusarz Oct. 21, 2012, 12:09 p.m. UTC | #3
On Sun, Oct 21, 2012 at 08:58:07AM +0200, Pawe? Sikora wrote:
> On Sunday 21 of October 2012 00:19:48 Marcin Slusarz wrote:
> > On Sat, Oct 20, 2012 at 11:20:36PM +0200, Heinz Diehl wrote:
> > > On 20.10.2012, Marcin Slusarz wrote: 
> > > 
> > > > Try this one.
> > > 
> > > It works, now I can boot again. However, nouveau seems to be dead now.
> > > The dmesg output with your patch on top of 3.7-rc1 is:
> > > 
> > > [    3.685909] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0
> > > [    3.687784] nouveau  [  DEVICE][0000:01:00.0] BOOT0  : 0x0a8800b1
> > > [    3.689960] nouveau  [  DEVICE][0000:01:00.0] Chipset: GT218 (NVA8)
> > > [    3.692471] nouveau  [  DEVICE][0000:01:00.0] Family : NV50
> > > [    3.695716] nouveau  [   VBIOS][0000:01:00.0] checking PRAMIN for image...
> > > [    3.697087] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > > [    3.698471] nouveau  [   VBIOS][0000:01:00.0] checking PROM for image...
> > > [    3.699838] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > > [    3.701223] nouveau  [   VBIOS][0000:01:00.0] checking ACPI for image...
> > > [    3.702684] ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
> > > [    3.704139] ACPI Error: Method parse/execution failed[\_SB_.PCI0.PEG1.GFX0._ROM] (Node ffff880142e85cf8), AE_AML_REGION_LIMIT (20120913/psparse-536)
> > > [    3.716183] failed to evaluate ROM got AE_AML_REGION_LIMIT
> > > [    3.718776] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > > [    3.721349] nouveau  [   VBIOS][0000:01:00.0] checking PCIROM for image...
> > > [    3.724111] nouveau 0000:01:00.0: Invalid ROM contents
> > > [    3.726663] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > > [    3.729159] nouveau E[   VBIOS][0000:01:00.0] unable to locate usable image
> > > [    3.731677] nouveau E[  DEVICE][0000:01:00.0] failed to create 0x10000001, -22
> > > [    3.734231] nouveau E[     DRM] failed to create 0x80000080, -22
> > > [    3.736097] nouveau: probe of 0000:01:00.0 failed with error -22
> > > [    3.740523] dracut: Starting plymouth daemon
> > 
> > Hmm, maybe we can't fetch 3 bytes only...
> > 
> > Let's check this:
> > 
> > ---
> > diff --git a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> > index 824eea0..8bd71aa 100644
> > --- a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> > +++ b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> > @@ -192,14 +192,16 @@ nouveau_bios_shadow_acpi(struct nouveau_bios *bios)
> >  {
> >  	struct pci_dev *pdev = nv_device(bios)->pdev;
> >  	int ret, cnt, i;
> > -	u8  data[3];
> > +	u8  *data;
> >  
> >  	if (!nouveau_acpi_rom_supported(pdev))
> >  		return;
> >  
> >  	bios->size = 0;
> > -	if (nouveau_acpi_get_bios_chunk(data, 0, 3) == 3)
> > +	data = kmalloc(4096, GFP_KERNEL);
> > +	if (data && nouveau_acpi_get_bios_chunk(data, 0, 4096) >= 3)
> >  		bios->size = data[2] * 512;
> > +	kfree(data);
> >  	if (!bios->size)
> >  		return;
> 
> with these both patches applied my laptop boots and gui works fine.
> here's dmesg:
> 
> (...)
> [    8.751795] nouveau  [   VBIOS][0000:01:00.0] ... appears to be valid
> [    8.751798] nouveau  [   VBIOS][0000:01:00.0] using image from ACPI
> [    8.751895] nouveau  [   VBIOS][0000:01:00.0] BIT signature found
> [    8.751898] nouveau  [   VBIOS][0000:01:00.0] version 70.08.45.00
> [    8.752366] nouveau  [ DEVINIT][0000:01:00.0] adaptor not initialised
> [    8.752368] nouveau  [   VBIOS][0000:01:00.0] running init tables
> [    8.867660] nouveau  [     MXM][0000:01:00.0] no VBIOS data, nothing to do
> [    8.867690] nouveau  [     PFB][0000:01:00.0] RAM type: DDR3
> [    8.867692] nouveau  [     PFB][0000:01:00.0] RAM size: 512 MiB
> [    8.901523] vga_switcheroo: enabled
> [    8.901979] [TTM] Zone  kernel: Available graphics memory: 6104282 kiB
> [    8.901980] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
> [    8.901982] [TTM] Initializing pool allocator
> [    8.902014] [TTM] Initializing DMA pool allocator
> [    8.902180] mtrr: type mismatch for c0000000,10000000 old: write-back new: write-combining
> [    8.902184] nouveau  [     DRM] VRAM: 512 MiB
> [    8.902185] nouveau  [     DRM] GART: 512 MiB
> [    8.902188] nouveau  [     DRM] BIT BIOS found
> [    8.902190] nouveau  [     DRM] Bios version 70.08.45.00
> [    8.902192] nouveau  [     DRM] TMDS table version 2.0
> [    8.902194] nouveau  [     DRM] DCB version 4.0
> [    8.902196] nouveau  [     DRM] DCB outp 00: 02011300 00000000
> [    8.902198] nouveau  [     DRM] DCB conn 01: 00000100
> [    8.903540] [drm] Supports vblank timestamp caching Rev 1 (10.10.2010).
> [    8.903541] [drm] No driver support for vblank timestamp query.
> [    8.903545] nouveau  [     DRM] ACPI backlight interface available, not registering our own
> [    8.903938] nouveau  [     DRM] 3 available performance level(s)
> [    8.903941] nouveau  [     DRM] 0: core 50MHz shader 101MHz memory 135MHz voltage 830mV
> [    8.903943] nouveau  [     DRM] 1: core 202MHz shader 405MHz memory 324MHz voltage 830mV
> [    8.903946] nouveau  [     DRM] 3: core 672MHz shader 1344MHz memory 900MHz voltage 980mV
> [    8.903948] nouveau  [     DRM] c: core 202MHz shader 405MHz memory 324MHz voltage 980mV
> [    8.908895] nouveau  [     DRM] MM: using COPY1 for buffer copies
> [    8.988695] nouveau  [     DRM] allocated 1024x768 fb: 0x60000, bo ffff88033b5f1800
> [    8.988955] fb1: nouveaufb frame buffer device
> [    8.988959] [drm] Initialized nouveau 1.1.0 20120801 for 0000:01:00.0 on minor 1
> (...)
> 
> > Please attach acpidump output.
> 
> http://pluto.agmk.net/nv/acpidump.txt
> 

This looks like ACPI bug...

Nouveau calls this method:
Method (_ROM, 2, NotSerialized)// _ROM: Read-Only Memory
{
  Add (Arg0, RBUF, Local0)
  ShiftLeft (Arg1, 0x03, Local1)
  Name (VBUF, Buffer (Arg1) {})
  OperationRegion (VROM, SystemMemory, Local0, Local1)
  Field (VROM, ByteAcc, NoLock, Preserve)
  {
    ROMI, 65536
  }

  Store (ROMI, VBUF)
  Return (VBUF)
}

with Arg0 == 0, Arg1 == 3 and gets:

ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEGR.GFX0._ROM] (Node ffff88033e47fe88), AE_AML_REGION_LIMIT (20120913/psparse-536)

We can workaround it by aligning Arg1 to 4096 (I'm wondering what is the minimal
value), but do we really have to?

Marcin
Heinz Diehl Oct. 21, 2012, 1:31 p.m. UTC | #4
On 21.10.2012, Marcin Slusarz wrote: 

> > > Please attach acpidump output.
> > 
> > http://pluto.agmk.net/nv/acpidump.txt
> > 
> 
> This looks like ACPI bug...

I guess my acpidump didn't make it to the list. Anyway, here it is:

http://www.fritha.org/acpidump.gz
Linus Torvalds Oct. 21, 2012, 2:38 p.m. UTC | #5
On Sun, Oct 21, 2012 at 5:09 AM, Marcin Slusarz
<marcin.slusarz@gmail.com> wrote:
>
> This looks like ACPI bug...

I'm _shocked_ to hear that firmware would be fragile.

Anyway, here's the #1 thing to keep in mind about firmware:

 - firmware is *always* buggy.

It's that simple. Don't expect anything else. Firmware is written by
people who have lost the will to live (why? Because they do firmware
development for a living), and the only thing keeping them going is
the drugs. And they're not the "fun" kind of drugs. The end result is
predictable. In their drug-induced haze, they make a mess.

So saying "ACPI is buggy" is like saying "water is wet". Deal with it.

> Nouveau calls this method: [...]
>
> with Arg0 == 0, Arg1 == 3 and gets:
>
> ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
> ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEGR.GFX0._ROM] (Node ffff88033e47fe88), AE_AML_REGION_LIMIT (20120913/psparse-536)
>
> We can workaround it by aligning Arg1 to 4096 (I'm wondering what is the minimal
> value), but do we really have to?

Yes, you really have to. You need to understand that ACPI has been
tested with one thing, and one thing only: Windows. Clearly windows
doesn't ask for some three-byte region. So it doesn't work. Big
surprise. Untested code written by monkeys on crack - what did you
expect?

So don't do "clever" things. When it comes to firmware, you need to
expect it to be buggy, and try to access it the way Windows accesses
it.

Now, whether Windows realy always does things with a 4kB-aligned
access, or whether you just need to make sure that you're (say)
4-byte-aligned, I don't know. Maybe doing things four (or eight, or
sixteen) bytes at a time will work. You can try. But if we know from
past experience that a 4kB block-size will work, I'd suggest just
saying

  "It's stupid, but stupid is good - we're working with firmware"

Think of firmware the way you think of hardware: it's buggy and needs
workarounds.

                Linus
Marcin Ślusarz Oct. 21, 2012, 2:49 p.m. UTC | #6
On Sun, Oct 21, 2012 at 07:38:58AM -0700, Linus Torvalds wrote:
> On Sun, Oct 21, 2012 at 5:09 AM, Marcin Slusarz
> <marcin.slusarz@gmail.com> wrote:
> >
> > This looks like ACPI bug...
> 
> I'm _shocked_ to hear that firmware would be fragile.
> 
> Anyway, here's the #1 thing to keep in mind about firmware:
> 
>  - firmware is *always* buggy.

I know. But this bug is not about broken firmware. It's about Linux kernel
ACPI implementation, which (I think) wrongly interprets ACPI script.

Marcin
Linus Torvalds Oct. 21, 2012, 5:13 p.m. UTC | #7
On Sun, Oct 21, 2012 at 5:49 PM, Marcin Slusarz
<marcin.slusarz@gmail.com> wrote:
>
> I know. But this bug is not about broken firmware. It's about Linux kernel
> ACPI implementation, which (I think) wrongly interprets ACPI script.

Hmm. Len, care to comment? Marcin quoted the AML and our arguments in
an earlier thing. I don't read AML, so I was assuming it's the AML
itself that was buggy, not our interpretation of it..

               Linus
Peter Wu Oct. 21, 2012, 8:07 p.m. UTC | #8
On Sunday 21 October 2012 16:49:08 Marcin Slusarz wrote:
> On Sun, Oct 21, 2012 at 07:38:58AM -0700, Linus Torvalds wrote:
> > On Sun, Oct 21, 2012 at 5:09 AM, Marcin Slusarz
> > 
> > <marcin.slusarz@gmail.com> wrote:
> > > This looks like ACPI bug...
> >
> > I'm shocked to hear that firmware would be fragile.
> >
> > Anyway, here's the #1 thing to keep in mind about firmware:
> > 
> >  - firmware is always buggy.
> 
> I know. But this bug is not about broken firmware. It's about Linux kernel
> ACPI implementation, which (I think) wrongly interprets ACPI script.

The ACPI implementation is fine, it is just the fw engineers that suck. I see I 
have not cc'd the linux-vger ml when replying to a previous mail. I'll paste 
it below:

Since commit 9a334cd "drm/nouveau/bios: fix shadowing of ACPI ROMs larger than 
64KiB" chunks are not always read in multiples of 4KiB anymore (less is 
possible). That is the only obvious thing I can think of atm (besides the 
kmalloc(0) bug for which Martin submitted a patch in the previous mail).

Do you still still have an Asus laptop with the same BIOS as in 
https://bugzilla.kernel.org/show_bug.cgi?id=19702? (if yes, then the acpidump 
from that bug still applies).

This is the ACPI _ROM method that is being called:
Method (_ROM, 2, NotSerialized)  // _ROM: Read-Only Memory
{
    Add (Arg0, RBUF, Local0)
    ShiftLeft (Arg1, 0x03, Local1) // times 8, bytes to bits?
    Name (VBUF, Buffer (Arg1) {}) 
    OperationRegion (VROM, SystemMemory, Local0, Local1)
    Field (VROM, ByteAcc, NoLock, Preserve)
    {   
        ROMI,   65536
    }   

    Store (ROMI, VBUF)
    Return (VBUF)
}

Arg0 is the offset (0 when first reading the size), Arg1 is the number of read 
bytes (3). Note the use of Local1 in OperationRegion. The meaning there is 
bytes, but a multiple of the requested bytes is passed. So if we request 4096 
bytes, we end up with a VROM of size 32KiB. ROMI is 65536 bits (or 8192 
bytes), hence reading 4096 does not give errors. But reading only 3 bytes will 
fail. Martin, I saw your second patch (https://lkml.org/lkml/2012/10/20/150) 
which takes care of the first case, but it skips the case where the ROM is of 
an odd size (does that even happen, something like 64KiB+1 bytes? No idea.)

Addition: Conclusion: this means that the request must have a length must be 
at least 1 KiB or it will fail with the ACPI error that you have seen before. 
This AML snippet suck.

Regards,
Peter
Ben Skeggs Oct. 22, 2012, 12:07 a.m. UTC | #9
On Sun, 2012-10-21 at 00:19 +0200, Marcin Slusarz wrote:
> On Sat, Oct 20, 2012 at 11:20:36PM +0200, Heinz Diehl wrote:
> > On 20.10.2012, Marcin Slusarz wrote: 
> > 
> > > Try this one.
> > 
> > It works, now I can boot again. However, nouveau seems to be dead now.
> > The dmesg output with your patch on top of 3.7-rc1 is:
> > 
> > [    3.685909] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0
> > [    3.687784] nouveau  [  DEVICE][0000:01:00.0] BOOT0  : 0x0a8800b1
> > [    3.689960] nouveau  [  DEVICE][0000:01:00.0] Chipset: GT218 (NVA8)
> > [    3.692471] nouveau  [  DEVICE][0000:01:00.0] Family : NV50
> > [    3.695716] nouveau  [   VBIOS][0000:01:00.0] checking PRAMIN for image...
> > [    3.697087] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.698471] nouveau  [   VBIOS][0000:01:00.0] checking PROM for image...
> > [    3.699838] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.701223] nouveau  [   VBIOS][0000:01:00.0] checking ACPI for image...
> > [    3.702684] ACPI Error: Field [ROMI] Base+Offset+Width 0+24+1 is beyond end of region [VROM] (length 24) (20120913/exfldio-210)
> > [    3.704139] ACPI Error: Method parse/execution failed[\_SB_.PCI0.PEG1.GFX0._ROM] (Node ffff880142e85cf8), AE_AML_REGION_LIMIT (20120913/psparse-536)
> > [    3.716183] failed to evaluate ROM got AE_AML_REGION_LIMIT
> > [    3.718776] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.721349] nouveau  [   VBIOS][0000:01:00.0] checking PCIROM for image...
> > [    3.724111] nouveau 0000:01:00.0: Invalid ROM contents
> > [    3.726663] nouveau  [   VBIOS][0000:01:00.0] ... signature not found
> > [    3.729159] nouveau E[   VBIOS][0000:01:00.0] unable to locate usable image
> > [    3.731677] nouveau E[  DEVICE][0000:01:00.0] failed to create 0x10000001, -22
> > [    3.734231] nouveau E[     DRM] failed to create 0x80000080, -22
> > [    3.736097] nouveau: probe of 0000:01:00.0 failed with error -22
> > [    3.740523] dracut: Starting plymouth daemon
> 
> Hmm, maybe we can't fetch 3 bytes only...
Yeah, I noticed this issue myself on a machine here while backporting
the patch to another (older) kernel.  I assumed (wrongly, apparently)
that there was just a bug in the older kernel's ACPI implementation that
had been fixed upstream already (I didn't see it on the same machine
with the current kernel)...  So, I didn't send the patch.

I'll queue it up with a -fixes batch later today.

Sorry about the trouble,
Ben.

> 
> Let's check this:
> 
> ---
> diff --git a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> index 824eea0..8bd71aa 100644
> --- a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> +++ b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
> @@ -192,14 +192,16 @@ nouveau_bios_shadow_acpi(struct nouveau_bios *bios)
>  {
>  	struct pci_dev *pdev = nv_device(bios)->pdev;
>  	int ret, cnt, i;
> -	u8  data[3];
> +	u8  *data;
>  
>  	if (!nouveau_acpi_rom_supported(pdev))
>  		return;
>  
>  	bios->size = 0;
> -	if (nouveau_acpi_get_bios_chunk(data, 0, 3) == 3)
> +	data = kmalloc(4096, GFP_KERNEL);
> +	if (data && nouveau_acpi_get_bios_chunk(data, 0, 4096) >= 3)
>  		bios->size = data[2] * 512;
> +	kfree(data);
>  	if (!bios->size)
>  		return;
>  
> ---
> 
> Please attach acpidump output.
diff mbox

Patch

diff --git a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
index 824eea0..8bd71aa 100644
--- a/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
+++ b/drivers/gpu/drm/nouveau/core/subdev/bios/base.c
@@ -192,14 +192,16 @@  nouveau_bios_shadow_acpi(struct nouveau_bios *bios)
 {
 	struct pci_dev *pdev = nv_device(bios)->pdev;
 	int ret, cnt, i;
-	u8  data[3];
+	u8  *data;
 
 	if (!nouveau_acpi_rom_supported(pdev))
 		return;
 
 	bios->size = 0;
-	if (nouveau_acpi_get_bios_chunk(data, 0, 3) == 3)
+	data = kmalloc(4096, GFP_KERNEL);
+	if (data && nouveau_acpi_get_bios_chunk(data, 0, 4096) >= 3)
 		bios->size = data[2] * 512;
+	kfree(data);
 	if (!bios->size)
 		return;