diff mbox series

KVM: x86: Increase KVM_MAX_VCPUS to 4096

Message ID 20230815153537.113861-1-kyle.meyer@hpe.com (mailing list archive)
State New, archived
Headers show
Series KVM: x86: Increase KVM_MAX_VCPUS to 4096 | expand

Commit Message

Kyle Meyer Aug. 15, 2023, 3:35 p.m. UTC
Increase KVM_MAX_VCPUS to 4096 when MAXSMP is enabled.

Notable changes (when MAXSMP is enabled):

* KMV_MAX_VCPUS will increase from 1024 to 4096.
* KVM_MAX_VCPU_IDS will increase from 4096 to 16384.
* KVM_HV_MAX_SPARSE_VCPU_SET_BITS will increase from 16 to 64.
* CPUID[HYPERV_CPUID_IMPLEMENT_LIMITS (0x40000005)].EAX will now be 4096.

* struct kvm will increase from 39408 B to 39792 B.
* struct kvm_ioapic will increase from 5240 B to 19064 B.

* The following (on-stack) bitmaps will increase from 128 B to 512 B:
	* dest_vcpu_bitmap in kvm_irq_delivery_to_apic.
	* vcpu_mask in kvm_hv_flush_tlb.
	* vcpu_bitmap in ioapic_write_indirect.
	* vp_bitmap in sparse_set_to_vcpu_mask.

Signed-off-by: Kyle Meyer <kyle.meyer@hpe.com>
---
Virtual machines with 4096 virtual CPUs have been created on 32 socket
Cascade Lake and Sapphire Rapids systems.

4096 is the current maximum value because of the Hyper-V TLFS. See
BUILD_BUG_ON in arch/x86/kvm/hyperv.c, commit 79661c3, and Vitaly's
comment on https://lore.kernel.org/all/87r136shcc.fsf@redhat.com.

 arch/x86/include/asm/kvm_host.h | 4 ++++
 1 file changed, 4 insertions(+)

Comments

Sean Christopherson Aug. 15, 2023, 5:11 p.m. UTC | #1
On Tue, Aug 15, 2023, Kyle Meyer wrote:
> Increase KVM_MAX_VCPUS to 4096 when MAXSMP is enabled.
> 
> Notable changes (when MAXSMP is enabled):
> 
> * KMV_MAX_VCPUS will increase from 1024 to 4096.
> * KVM_MAX_VCPU_IDS will increase from 4096 to 16384.
> * KVM_HV_MAX_SPARSE_VCPU_SET_BITS will increase from 16 to 64.
> * CPUID[HYPERV_CPUID_IMPLEMENT_LIMITS (0x40000005)].EAX will now be 4096.
> 
> * struct kvm will increase from 39408 B to 39792 B.
> * struct kvm_ioapic will increase from 5240 B to 19064 B.
> 
> * The following (on-stack) bitmaps will increase from 128 B to 512 B:
> 	* dest_vcpu_bitmap in kvm_irq_delivery_to_apic.
> 	* vcpu_mask in kvm_hv_flush_tlb.
> 	* vcpu_bitmap in ioapic_write_indirect.
> 	* vp_bitmap in sparse_set_to_vcpu_mask.
> 
> Signed-off-by: Kyle Meyer <kyle.meyer@hpe.com>
> ---
> Virtual machines with 4096 virtual CPUs have been created on 32 socket
> Cascade Lake and Sapphire Rapids systems.
> 
> 4096 is the current maximum value because of the Hyper-V TLFS. See
> BUILD_BUG_ON in arch/x86/kvm/hyperv.c, commit 79661c3, and Vitaly's
> comment on https://lore.kernel.org/all/87r136shcc.fsf@redhat.com.

Mostly out of curiosity, do you care about Hyper-V support?   If not, at some
point it'd probably be worth exploring a CONFIG_KVM_HYPERV option to allow
disabling KVM's Hyper-V support at compile time so that we're not bound by the
restrictions of the TLFS.

>  arch/x86/include/asm/kvm_host.h | 4 ++++
>  1 file changed, 4 insertions(+)
> 
> diff --git a/arch/x86/include/asm/kvm_host.h b/arch/x86/include/asm/kvm_host.h
> index 3bc146dfd38d..91a01fa17fa7 100644
> --- a/arch/x86/include/asm/kvm_host.h
> +++ b/arch/x86/include/asm/kvm_host.h
> @@ -39,7 +39,11 @@
>  
>  #define __KVM_HAVE_ARCH_VCPU_DEBUGFS
>  
> +#ifdef CONFIG_MAXSMP
> +#define KVM_MAX_VCPUS 4096
> +#else
>  #define KVM_MAX_VCPUS 1024
> +#endif

Rather than tightly couple this to MAXSMP, what if we add a Kconfig?  I know of
at least one scenario, SVM's AVIC/x2AVIC, where it would be desirable to configure
KVM to a much smaller maximum.  The biggest downside I can think of is that KVM
selftests would need to be updated (they assume the max is >=512), and some of the
tests might be completely invalid if KVM_MAX_VCPUS is too low (<256?).

E.g.

diff --git a/arch/x86/include/asm/kvm_host.h b/arch/x86/include/asm/kvm_host.h
index 60d430b4650f..8704748e35d9 100644
--- a/arch/x86/include/asm/kvm_host.h
+++ b/arch/x86/include/asm/kvm_host.h
@@ -39,7 +39,7 @@
 
 #define __KVM_HAVE_ARCH_VCPU_DEBUGFS
 
-#define KVM_MAX_VCPUS 1024
+#define KVM_MAX_VCPUS CONFIG_KVM_MAX_NR_VCPUS
 
 /*
  * In x86, the VCPU ID corresponds to the APIC ID, and APIC IDs
diff --git a/arch/x86/kvm/Kconfig b/arch/x86/kvm/Kconfig
index ed90f148140d..b0f92eb77f78 100644
--- a/arch/x86/kvm/Kconfig
+++ b/arch/x86/kvm/Kconfig
@@ -151,6 +151,17 @@ config KVM_PROVE_MMU
 
          If in doubt, say "N".
 
+config KVM_MAX_NR_VCPUS
+       int "Maximum vCPUs per VM"
+       default "4096" if MAXSMP
+       default "1024"
+       range 1 4096
+       depends on KVM
+       help
+         Set the maximum number of vCPUs for a single VM.  Larger values
+         increase the memory footprint of each VM regardless of how many vCPUs
+         are actually created (though the memory increase is relatively small).
+
 config KVM_EXTERNAL_WRITE_TRACKING
        bool
Kyle Meyer Aug. 17, 2023, 8:23 p.m. UTC | #2
> > 4096 is the current maximum value because of the Hyper-V TLFS. See
> > BUILD_BUG_ON in arch/x86/kvm/hyperv.c, commit 79661c3, and Vitaly's
> > comment on https://lore.kernel.org/all/87r136shcc.fsf@redhat.com.
>
> Mostly out of curiosity, do you care about Hyper-V support?   If not, at some
> point it'd probably be worth exploring a CONFIG_KVM_HYPERV option to allow
> disabling KVM's Hyper-V support at compile time so that we're not bound by the
> restrictions of the TLFS.

Yes, I care about Hyper-V support. I would like this limitation to be addressed
in the future.

> Rather than tightly couple this to MAXSMP, what if we add a Kconfig?  I know of
> at least one scenario, SVM's AVIC/x2AVIC, where it would be desirable to configure
> KVM to a much smaller maximum.  The biggest downside I can think of is that KVM
> selftests would need to be updated (they assume the max is >=512), and some of the
> tests might be completely invalid if KVM_MAX_VCPUS is too low (<256?).

That sounds good to me. I would prefer to set the range from 1024 to 4096 in
this patch.
Sean Christopherson Aug. 17, 2023, 11:44 p.m. UTC | #3
On Thu, Aug 17, 2023, Kyle Meyer wrote:
> > > 4096 is the current maximum value because of the Hyper-V TLFS. See
> > > BUILD_BUG_ON in arch/x86/kvm/hyperv.c, commit 79661c3, and Vitaly's
> > > comment on https://lore.kernel.org/all/87r136shcc.fsf@redhat.com.
> >
> > Mostly out of curiosity, do you care about Hyper-V support?   If not, at some
> > point it'd probably be worth exploring a CONFIG_KVM_HYPERV option to allow
> > disabling KVM's Hyper-V support at compile time so that we're not bound by the
> > restrictions of the TLFS.
> 
> Yes, I care about Hyper-V support. I would like this limitation to be addressed
> in the future.
> 
> > Rather than tightly couple this to MAXSMP, what if we add a Kconfig?  I know of
> > at least one scenario, SVM's AVIC/x2AVIC, where it would be desirable to configure
> > KVM to a much smaller maximum.  The biggest downside I can think of is that KVM
> > selftests would need to be updated (they assume the max is >=512), and some of the
> > tests might be completely invalid if KVM_MAX_VCPUS is too low (<256?).
> 
> That sounds good to me. I would prefer to set the range from 1024 to 4096 in
> this patch.

Yeah, that's probably for the best.
Vitaly Kuznetsov Sept. 6, 2023, 3:54 p.m. UTC | #4
Sean Christopherson <seanjc@google.com> writes:

> On Tue, Aug 15, 2023, Kyle Meyer wrote:
>> Increase KVM_MAX_VCPUS to 4096 when MAXSMP is enabled.
>> 
>> Notable changes (when MAXSMP is enabled):
>> 
>> * KMV_MAX_VCPUS will increase from 1024 to 4096.
>> * KVM_MAX_VCPU_IDS will increase from 4096 to 16384.
>> * KVM_HV_MAX_SPARSE_VCPU_SET_BITS will increase from 16 to 64.
>> * CPUID[HYPERV_CPUID_IMPLEMENT_LIMITS (0x40000005)].EAX will now be 4096.
>> 
>> * struct kvm will increase from 39408 B to 39792 B.
>> * struct kvm_ioapic will increase from 5240 B to 19064 B.
>> 
>> * The following (on-stack) bitmaps will increase from 128 B to 512 B:
>> 	* dest_vcpu_bitmap in kvm_irq_delivery_to_apic.
>> 	* vcpu_mask in kvm_hv_flush_tlb.
>> 	* vcpu_bitmap in ioapic_write_indirect.
>> 	* vp_bitmap in sparse_set_to_vcpu_mask.
>> 
>> Signed-off-by: Kyle Meyer <kyle.meyer@hpe.com>
>> ---
>> Virtual machines with 4096 virtual CPUs have been created on 32 socket
>> Cascade Lake and Sapphire Rapids systems.
>> 
>> 4096 is the current maximum value because of the Hyper-V TLFS. See
>> BUILD_BUG_ON in arch/x86/kvm/hyperv.c, commit 79661c3, and Vitaly's
>> comment on https://lore.kernel.org/all/87r136shcc.fsf@redhat.com.
>
> Mostly out of curiosity, do you care about Hyper-V support?   If not, at some
> point it'd probably be worth exploring a CONFIG_KVM_HYPERV option to allow
> disabling KVM's Hyper-V support at compile time so that we're not bound by the
> restrictions of the TLFS.
>

(sorry for necroposting)

While adding CONFIG_KVM_HYPERV to disable all-things-Hyper-V may make
sense for some deployments (and as we already have CONFIG_KVM_XEN), I
don't think we should forbid KVM_MAX_VCPUS > 4096 when it is enabled:
'general purpose' (distro) kernels are used both for hosting large Linux
guests and Windows guests. Instead, I'd suggest we define
KVM_MAX_HV_VCPUS as MIN(KVM_MAX_VCPUS, 4096) and then e.g. fail
KVM_SET_CPUID[,2] if we already have > 4096 vCPUs + fail
kvm_arch_vcpu_create() if we already have something-hyperv enabled on
the already created vCPUs.
Sean Christopherson Sept. 8, 2023, 5:44 p.m. UTC | #5
On Wed, Sep 06, 2023, Vitaly Kuznetsov wrote:
> Sean Christopherson <seanjc@google.com> writes:
> 
> > On Tue, Aug 15, 2023, Kyle Meyer wrote:
> >> Increase KVM_MAX_VCPUS to 4096 when MAXSMP is enabled.
> >> 
> >> Notable changes (when MAXSMP is enabled):
> >> 
> >> * KMV_MAX_VCPUS will increase from 1024 to 4096.
> >> * KVM_MAX_VCPU_IDS will increase from 4096 to 16384.
> >> * KVM_HV_MAX_SPARSE_VCPU_SET_BITS will increase from 16 to 64.
> >> * CPUID[HYPERV_CPUID_IMPLEMENT_LIMITS (0x40000005)].EAX will now be 4096.
> >> 
> >> * struct kvm will increase from 39408 B to 39792 B.
> >> * struct kvm_ioapic will increase from 5240 B to 19064 B.
> >> 
> >> * The following (on-stack) bitmaps will increase from 128 B to 512 B:
> >> 	* dest_vcpu_bitmap in kvm_irq_delivery_to_apic.
> >> 	* vcpu_mask in kvm_hv_flush_tlb.
> >> 	* vcpu_bitmap in ioapic_write_indirect.
> >> 	* vp_bitmap in sparse_set_to_vcpu_mask.
> >> 
> >> Signed-off-by: Kyle Meyer <kyle.meyer@hpe.com>
> >> ---
> >> Virtual machines with 4096 virtual CPUs have been created on 32 socket
> >> Cascade Lake and Sapphire Rapids systems.
> >> 
> >> 4096 is the current maximum value because of the Hyper-V TLFS. See
> >> BUILD_BUG_ON in arch/x86/kvm/hyperv.c, commit 79661c3, and Vitaly's
> >> comment on https://lore.kernel.org/all/87r136shcc.fsf@redhat.com.
> >
> > Mostly out of curiosity, do you care about Hyper-V support?   If not, at some
> > point it'd probably be worth exploring a CONFIG_KVM_HYPERV option to allow
> > disabling KVM's Hyper-V support at compile time so that we're not bound by the
> > restrictions of the TLFS.
> >
> 
> (sorry for necroposting)

There's a big difference between mostly dead and all dead.  Mostly dead is slightly
alive. ;-)

> While adding CONFIG_KVM_HYPERV to disable all-things-Hyper-V may make
> sense for some deployments (and as we already have CONFIG_KVM_XEN), I
> don't think we should forbid KVM_MAX_VCPUS > 4096 when it is enabled:
> 'general purpose' (distro) kernels are used both for hosting large Linux
> guests and Windows guests. Instead, I'd suggest we define
> KVM_MAX_HV_VCPUS as MIN(KVM_MAX_VCPUS, 4096) and then e.g. fail
> KVM_SET_CPUID[,2] if we already have > 4096 vCPUs + fail
> kvm_arch_vcpu_create() if we already have something-hyperv enabled on
> the already created vCPUs.

Ah, bummer.  I'd still like to add CONFIG_KVM_HYPERV at some point, but I agree
it should be orthogonal to KVM_MAX_VCPUS.
diff mbox series

Patch

diff --git a/arch/x86/include/asm/kvm_host.h b/arch/x86/include/asm/kvm_host.h
index 3bc146dfd38d..91a01fa17fa7 100644
--- a/arch/x86/include/asm/kvm_host.h
+++ b/arch/x86/include/asm/kvm_host.h
@@ -39,7 +39,11 @@ 
 
 #define __KVM_HAVE_ARCH_VCPU_DEBUGFS
 
+#ifdef CONFIG_MAXSMP
+#define KVM_MAX_VCPUS 4096
+#else
 #define KVM_MAX_VCPUS 1024
+#endif
 
 /*
  * In x86, the VCPU ID corresponds to the APIC ID, and APIC IDs