From patchwork Wed Apr 20 14:15:35 2016 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Johannes Thumshirn X-Patchwork-Id: 8890231 Return-Path: X-Original-To: patchwork-linux-scsi@patchwork.kernel.org Delivered-To: patchwork-parsemail@patchwork2.web.kernel.org Received: from mail.kernel.org (mail.kernel.org [198.145.29.136]) by patchwork2.web.kernel.org (Postfix) with ESMTP id 288BEBF29F for ; Wed, 20 Apr 2016 14:16:00 +0000 (UTC) Received: from mail.kernel.org (localhost [127.0.0.1]) by mail.kernel.org (Postfix) with ESMTP id E33D8201C0 for ; Wed, 20 Apr 2016 14:15:58 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 95B1B20120 for ; Wed, 20 Apr 2016 14:15:57 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932152AbcDTOPl (ORCPT ); Wed, 20 Apr 2016 10:15:41 -0400 Received: from mx2.suse.de ([195.135.220.15]:42777 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753257AbcDTOPk (ORCPT ); Wed, 20 Apr 2016 10:15:40 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (charybdis-ext.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 73C65AB9D; Wed, 20 Apr 2016 14:15:35 +0000 (UTC) Date: Wed, 20 Apr 2016 16:15:35 +0200 From: Johannes Thumshirn To: Laura Abbott Cc: Michael Reed , "James E.J. Bottomley" , "Martin K. Petersen" , linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: alloc failure in qla1280 probe -- need to decrease can_queue? Message-ID: <20160420141535.GF28402@c203.arch.suse.de> References: <57155A93.7090604@redhat.com> MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: <57155A93.7090604@redhat.com> User-Agent: Mutt/1.6.0 (2016-04-01) Sender: linux-scsi-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-scsi@vger.kernel.org X-Spam-Status: No, score=-7.9 required=5.0 tests=BAYES_00, RCVD_IN_DNSWL_HI, RP_MATCHES_RCVD, UNPARSEABLE_RELAY autolearn=unavailable version=3.3.1 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on mail.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP [+Cc Michael Reed as get_maintainer.pl lists him as qla1280 maintainer ] On Mon, Apr 18, 2016 at 03:07:15PM -0700, Laura Abbott wrote: > Hi, > > We received a bug report https://bugzilla.redhat.com/show_bug.cgi?id=1321033 > of qla1280 scsi host failure on 4.4 based kernels that looks to be caused > by page alloc failure: > > [ 4.804166] scsi host0: QLogic QLA1040 PCI to SCSI Host Adapter > Firmware version: 7.65.06, Driver version 3.27.1 > [ 4.804174] ------------[ cut here ]------------ > [ 4.804184] WARNING: CPU: 2 PID: 305 at mm/page_alloc.c:2989 __alloc_pages_nodemask+0xae8/0xbc0() > [ 4.804186] Modules linked in: amdkfd amd_iommu_v2 radeon i2c_algo_bit drm_kms_helper ttm drm megaraid_sas serio_raw 8021q garp bnx2 stp llc mrp sunhme qla1280(+) fjes > [ 4.804208] CPU: 2 PID: 305 Comm: systemd-udevd Not tainted 4.4.6-201.fc22.x86_64 #1 > [ 4.804210] Hardware name: Google Enterprise Search Appliance/0DT021, BIOS 1.1.2 08/14/2006 > [ 4.804212] 0000000000000286 000000002f01064c ffff88042985b710 ffffffff813b542e > [ 4.804216] 0000000000000000 ffffffff81a75024 ffff88042985b748 ffffffff810a40f2 > [ 4.804220] 0000000000000000 0000000000000000 000000000000000b 0000000000000000 > [ 4.804223] Call Trace: > [ 4.804231] [] dump_stack+0x63/0x85 > [ 4.804236] [] warn_slowpath_common+0x82/0xc0 > [ 4.804239] [] warn_slowpath_null+0x1a/0x20 > [ 4.804242] [] __alloc_pages_nodemask+0xae8/0xbc0 > [ 4.804247] [] ? _raw_spin_unlock_irqrestore+0xe/0x10 > [ 4.804251] [] ? irq_work_queue+0x8e/0xa0 > [ 4.804256] [] ? console_unlock+0x20a/0x540 > [ 4.804262] [] alloc_pages_current+0x8c/0x110 > [ 4.804265] [] alloc_kmem_pages+0x19/0x90 > [ 4.804268] [] kmalloc_order_trace+0x2e/0xe0 > [ 4.804272] [] __kmalloc+0x232/0x260 > [ 4.804277] [] init_tag_map+0x3d/0xc0 > [ 4.804290] [] __blk_queue_init_tags+0x45/0x80 > [ 4.804293] [] blk_init_tags+0x14/0x20 > [ 4.804298] [] scsi_add_host_with_dma+0x80/0x300 > [ 4.804305] [] qla1280_probe_one+0x683/0x9ef [qla1280] > [ 4.804309] [] local_pci_probe+0x45/0xa0 > [ 4.804312] [] pci_device_probe+0xfd/0x140 > [ 4.804316] [] driver_probe_device+0x222/0x490 > [ 4.804319] [] __driver_attach+0x84/0x90 > [ 4.804321] [] ? driver_probe_device+0x490/0x490 > [ 4.804324] [] bus_for_each_dev+0x6c/0xc0 > [ 4.804326] [] driver_attach+0x1e/0x20 > [ 4.804328] [] bus_add_driver+0x1eb/0x280 > [ 4.804331] [] ? 0xffffffffa0015000 > [ 4.804333] [] driver_register+0x60/0xe0 > [ 4.804336] [] __pci_register_driver+0x4c/0x50 > [ 4.804339] [] qla1280_init+0x1ce/0x1000 [qla1280] > [ 4.804341] [] ? 0xffffffffa0015000 > [ 4.804345] [] do_one_initcall+0xb3/0x200 > [ 4.804348] [] ? kmem_cache_alloc_trace+0x196/0x210 > [ 4.804352] [] ? do_init_module+0x27/0x1cb > [ 4.804354] [] do_init_module+0x5f/0x1cb > [ 4.804358] [] load_module+0x2040/0x2680 > [ 4.804360] [] ? __symbol_put+0x60/0x60 > [ 4.804363] [] SYSC_init_module+0x149/0x190 > [ 4.804366] [] SyS_init_module+0xe/0x10 > [ 4.804369] [] entry_SYSCALL_64_fastpath+0x12/0x71 > [ 4.804371] ---[ end trace 0ea3b625f86705f7 ]--- > [ 4.804581] qla1280: probe of 0000:11:04.0 failed with error -12 > > This looks very similar to http://www.spinics.net/lists/linux-usb/msg136998.html > which was fixed by 55ff8cfbc4e1 ("USB: uas: Reduce can_queue to MAX_CMNDS"). > Does a similar fix need to be applied here? > > Thanks, > Laura Can you (or better the reporter) try below? Unfortunately I don't have a qla1280 setup here, so I couldn't test it myself. Byte, Johannes From f95e82e7e5f675c9869ea1da78021aa6abc7972b Mon Sep 17 00:00:00 2001 From: Johannes Thumshirn Date: Wed, 20 Apr 2016 16:07:37 +0200 Subject: [PATCH] qla1280: Reduce can_queue to 32 The qla1280 driver sets the scsi_host_template's can_queue field to 0xfffff which results in an allocation failure when allocating the block layer tags for the driver's queues like the one shown below: [ 4.804166] scsi host0: QLogic QLA1040 PCI to SCSI Host Adapter Firmware version: 7.65.06, Driver version 3.27.1 [ 4.804174] ------------[ cut here ]------------ [ 4.804184] WARNING: CPU: 2 PID: 305 at mm/page_alloc.c:2989 alloc_pages_nodemask+0xae8/0xbc0() [ 4.804186] Modules linked in: amdkfd amd_iommu_v2 radeon i2c_algo_bit m_kms_helper ttm drm megaraid_sas serio_raw 8021q garp bnx2 stp llc mrp nhme qla1280(+) fjes [ 4.804208] CPU: 2 PID: 305 Comm: systemd-udevd Not tainted 4.6-201.fc22.x86_64 #1 [ 4.804210] Hardware name: Google Enterprise Search Appliance/0DT021, OS 1.1.2 08/14/2006 [ 4.804212] 0000000000000286 000000002f01064c ffff88042985b710 ffffff813b542e [ 4.804216] 0000000000000000 ffffffff81a75024 ffff88042985b748 ffffff810a40f2 [ 4.804220] 0000000000000000 0000000000000000 000000000000000b 00000000000000 [ 4.804223] Call Trace: [ 4.804231] [] dump_stack+0x63/0x85 [ 4.804236] [] warn_slowpath_common+0x82/0xc0 [ 4.804239] [] warn_slowpath_null+0x1a/0x20 [ 4.804242] [] __alloc_pages_nodemask+0xae8/0xbc0 [ 4.804247] [] ? _raw_spin_unlock_irqrestore+0xe/0x10 [ 4.804251] [] ? irq_work_queue+0x8e/0xa0 [ 4.804256] [] ? console_unlock+0x20a/0x540 [ 4.804262] [] alloc_pages_current+0x8c/0x110 [ 4.804265] [] alloc_kmem_pages+0x19/0x90 [ 4.804268] [] kmalloc_order_trace+0x2e/0xe0 [ 4.804272] [] __kmalloc+0x232/0x260 [ 4.804277] [] init_tag_map+0x3d/0xc0 [ 4.804290] [] __blk_queue_init_tags+0x45/0x80 [ 4.804293] [] blk_init_tags+0x14/0x20 [ 4.804298] [] scsi_add_host_with_dma+0x80/0x300 [ 4.804305] [] qla1280_probe_one+0x683/0x9ef [qla1280] [ 4.804309] [] local_pci_probe+0x45/0xa0 [ 4.804312] [] pci_device_probe+0xfd/0x140 [ 4.804316] [] driver_probe_device+0x222/0x490 [ 4.804319] [] __driver_attach+0x84/0x90 [ 4.804321] [] ? driver_probe_device+0x490/0x490 [ 4.804324] [] bus_for_each_dev+0x6c/0xc0 [ 4.804326] [] driver_attach+0x1e/0x20 [ 4.804328] [] bus_add_driver+0x1eb/0x280 [ 4.804331] [] ? 0xffffffffa0015000 [ 4.804333] [] driver_register+0x60/0xe0 [ 4.804336] [] __pci_register_driver+0x4c/0x50 [ 4.804339] [] qla1280_init+0x1ce/0x1000 [qla1280] [ 4.804341] [] ? 0xffffffffa0015000 [ 4.804345] [] do_one_initcall+0xb3/0x200 [ 4.804348] [] ? kmem_cache_alloc_trace+0x196/0x210 [ 4.804352] [] ? do_init_module+0x27/0x1cb [ 4.804354] [] do_init_module+0x5f/0x1cb [ 4.804358] [] load_module+0x2040/0x2680 [ 4.804360] [] ? __symbol_put+0x60/0x60 [ 4.804363] [] SYSC_init_module+0x149/0x190 [ 4.804366] [] SyS_init_module+0xe/0x10 [ 4.804369] [] entry_SYSCALL_64_fastpath+0x12/0x71 [ 4.804371] ---[ end trace 0ea3b625f86705f7 ]--- [ 4.804581] qla1280: probe of 0000:11:04.0 failed with error -12 In qla1280_set_defaults() the maximum queue depth is set to 32 so adopt the scsi_host_template to it as well. Signed-off-by: Johannes Thumshirn --- drivers/scsi/qla1280.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/scsi/qla1280.c b/drivers/scsi/qla1280.c index 5d0ec42..6bd748e 100644 --- a/drivers/scsi/qla1280.c +++ b/drivers/scsi/qla1280.c @@ -4214,7 +4214,7 @@ static struct scsi_host_template qla1280_driver_template = { .eh_bus_reset_handler = qla1280_eh_bus_reset, .eh_host_reset_handler = qla1280_eh_adapter_reset, .bios_param = qla1280_biosparam, - .can_queue = 0xfffff, + .can_queue = 32, .this_id = -1, .sg_tablesize = SG_ALL, .use_clustering = ENABLE_CLUSTERING,