Message ID | 9-v7-cb149db3a320+3b5-smmuv3_newapi_p2_jgg@nvidia.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | Make the SMMUv3 CD logic match the new STE design (part 2a/3) | expand |
On Tue, Apr 16, 2024 at 04:28:20PM -0300, Jason Gunthorpe wrote: > Add tests for some of the more common STE update operations that we expect > to see, as well as some artificial STE updates to test the edges of > arm_smmu_write_entry. These also serve as a record of which common > operation is expected to be hitless, and how many syncs they require. > > arm_smmu_write_entry implements a generic algorithm that updates an STE/CD > to any other abritrary STE/CD configuration. The update requires a > sequence of write+sync operations with some invariants that must be held > true after each sync. arm_smmu_write_entry lends itself well to > unit-testing since the function's interaction with the STE/CD is already > abstracted by input callbacks that we can hook to introspect into the > sequence of operations. We can use these hooks to guarantee that > invariants are held throughout the entire update operation. > > Link: https://lore.kernel.org/r/20240106083617.1173871-3-mshavit@google.com > Signed-off-by: Michael Shavit <mshavit@google.com> > Signed-off-by: Jason Gunthorpe <jgg@nvidia.com> > --- > drivers/iommu/Kconfig | 12 +- > drivers/iommu/arm/arm-smmu-v3/Makefile | 2 + > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c | 6 +- > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c | 467 ++++++++++++++++++ > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 36 +- > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h | 30 ++ > +config ARM_SMMU_V3_KUNIT_TEST > + tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > + depends on KUNIT > + default KUNIT_ALL_TESTS > + help > + Enable this option to unit-test arm-smmu-v3 driver functions. > + > + If unsure, say N. Forgot that my SVA sanity doesn't cover this patch. And it looks like some problems here when building it with "=m": ERROR: modpost: missing MODULE_LICENSE() in drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o ERROR: modpost: "arm_smmu_make_cdtable_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! ERROR: modpost: "arm_smmu_make_bypass_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! ERROR: modpost: "arm_smmu_make_abort_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! ERROR: modpost: "arm_smmu_make_s2_domain_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! ERROR: modpost: "arm_smmu_get_ste_used" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! ERROR: modpost: "arm_smmu_write_entry" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! Likely needs MODULE_LICENSE and some EXPORT_SYMBOLs. With built-in '=y' and a MODULE_LICENSE, tests passed: [ 13.244780] KTAP version 1 [ 13.247542] 1..1 [ 13.249421] KTAP version 1 [ 13.252538] # Subtest: arm-smmu-v3-kunit-test [ 13.257344] 1..16 [ 13.259727] ok 1 arm_smmu_v3_write_ste_test_bypass_to_abort [ 13.259789] ok 2 arm_smmu_v3_write_ste_test_abort_to_bypass [ 13.265895] ok 3 arm_smmu_v3_write_ste_test_cdtable_to_abort [ 13.271988] ok 4 arm_smmu_v3_write_ste_test_abort_to_cdtable [ 13.278172] ok 5 arm_smmu_v3_write_ste_test_cdtable_to_bypass [ 13.284353] ok 6 arm_smmu_v3_write_ste_test_bypass_to_cdtable [ 13.290636] ok 7 arm_smmu_v3_write_ste_test_cdtable_s1dss_change [ 13.296917] ok 8 arm_smmu_v3_write_ste_test_s1dssbypass_to_stebypass [ 13.303464] ok 9 arm_smmu_v3_write_ste_test_stebypass_to_s1dssbypass [ 13.310357] ok 10 arm_smmu_v3_write_ste_test_s2_to_abort [ 13.317265] ok 11 arm_smmu_v3_write_ste_test_abort_to_s2 [ 13.323104] ok 12 arm_smmu_v3_write_ste_test_s2_to_bypass [ 13.328937] ok 13 arm_smmu_v3_write_ste_test_bypass_to_s2 [ 13.334861] ok 14 arm_smmu_v3_write_ste_test_s1_to_s2 [ 13.340787] ok 15 arm_smmu_v3_write_ste_test_s2_to_s1 [ 13.346364] ok 16 arm_smmu_v3_write_ste_test_non_hitless [ 13.351883] # arm-smmu-v3-kunit-test: pass:16 fail:0 skip:0 total:16 [ 13.357667] # Totals: pass:16 fail:0 skip:0 total:16 [ 13.364163] ok 1 arm-smmu-v3-kunit-test Once those are fixed, Tested-by: Nicolin Chen <nicolinc@nvidia.com>
On Wed, Apr 17, 2024 at 01:09:40AM -0700, Nicolin Chen wrote: > On Tue, Apr 16, 2024 at 04:28:20PM -0300, Jason Gunthorpe wrote: > > Add tests for some of the more common STE update operations that we expect > > to see, as well as some artificial STE updates to test the edges of > > arm_smmu_write_entry. These also serve as a record of which common > > operation is expected to be hitless, and how many syncs they require. > > > > arm_smmu_write_entry implements a generic algorithm that updates an STE/CD > > to any other abritrary STE/CD configuration. The update requires a > > sequence of write+sync operations with some invariants that must be held > > true after each sync. arm_smmu_write_entry lends itself well to > > unit-testing since the function's interaction with the STE/CD is already > > abstracted by input callbacks that we can hook to introspect into the > > sequence of operations. We can use these hooks to guarantee that > > invariants are held throughout the entire update operation. > > > > Link: https://lore.kernel.org/r/20240106083617.1173871-3-mshavit@google.com > > Signed-off-by: Michael Shavit <mshavit@google.com> > > Signed-off-by: Jason Gunthorpe <jgg@nvidia.com> > > --- > > drivers/iommu/Kconfig | 12 +- > > drivers/iommu/arm/arm-smmu-v3/Makefile | 2 + > > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c | 6 +- > > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c | 467 ++++++++++++++++++ > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 36 +- > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h | 30 ++ > > > +config ARM_SMMU_V3_KUNIT_TEST > > + tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > > + depends on KUNIT > > + default KUNIT_ALL_TESTS > > + help > > + Enable this option to unit-test arm-smmu-v3 driver functions. > > + > > + If unsure, say N. > > Forgot that my SVA sanity doesn't cover this patch. And it looks > like some problems here when building it with "=m": > > ERROR: modpost: missing MODULE_LICENSE() in drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o > ERROR: modpost: "arm_smmu_make_cdtable_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > ERROR: modpost: "arm_smmu_make_bypass_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > ERROR: modpost: "arm_smmu_make_abort_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > ERROR: modpost: "arm_smmu_make_s2_domain_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > ERROR: modpost: "arm_smmu_get_ste_used" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > ERROR: modpost: "arm_smmu_write_entry" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > Likely needs MODULE_LICENSE and some EXPORT_SYMBOLs. Oh! The kbuild never tested this kconfig combination... I think just this? Michael? diff --git a/drivers/iommu/Kconfig b/drivers/iommu/Kconfig index d03c729c4142dc..7b6a4e244e99cf 100644 --- a/drivers/iommu/Kconfig +++ b/drivers/iommu/Kconfig @@ -411,7 +411,7 @@ config ARM_SMMU_V3_SVA and PRI. config ARM_SMMU_V3_KUNIT_TEST - tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS + bool "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS depends on KUNIT default KUNIT_ALL_TESTS help diff --git a/drivers/iommu/arm/arm-smmu-v3/Makefile b/drivers/iommu/arm/arm-smmu-v3/Makefile index 014a997753a8a2..0b97054b3929b7 100644 --- a/drivers/iommu/arm/arm-smmu-v3/Makefile +++ b/drivers/iommu/arm/arm-smmu-v3/Makefile @@ -2,6 +2,5 @@ obj-$(CONFIG_ARM_SMMU_V3) += arm_smmu_v3.o arm_smmu_v3-objs-y += arm-smmu-v3.o arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_SVA) += arm-smmu-v3-sva.o +arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o arm_smmu_v3-objs := $(arm_smmu_v3-objs-y) - -obj-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o Jason
On Wed, Apr 17, 2024 at 11:16:18AM -0300, Jason Gunthorpe wrote: > On Wed, Apr 17, 2024 at 01:09:40AM -0700, Nicolin Chen wrote: > > On Tue, Apr 16, 2024 at 04:28:20PM -0300, Jason Gunthorpe wrote: > > > Add tests for some of the more common STE update operations that we expect > > > to see, as well as some artificial STE updates to test the edges of > > > arm_smmu_write_entry. These also serve as a record of which common > > > operation is expected to be hitless, and how many syncs they require. > > > > > > arm_smmu_write_entry implements a generic algorithm that updates an STE/CD > > > to any other abritrary STE/CD configuration. The update requires a > > > sequence of write+sync operations with some invariants that must be held > > > true after each sync. arm_smmu_write_entry lends itself well to > > > unit-testing since the function's interaction with the STE/CD is already > > > abstracted by input callbacks that we can hook to introspect into the > > > sequence of operations. We can use these hooks to guarantee that > > > invariants are held throughout the entire update operation. > > > > > > Link: https://lore.kernel.org/r/20240106083617.1173871-3-mshavit@google.com > > > Signed-off-by: Michael Shavit <mshavit@google.com> > > > Signed-off-by: Jason Gunthorpe <jgg@nvidia.com> > > > --- > > > drivers/iommu/Kconfig | 12 +- > > > drivers/iommu/arm/arm-smmu-v3/Makefile | 2 + > > > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c | 6 +- > > > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c | 467 ++++++++++++++++++ > > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 36 +- > > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h | 30 ++ > > > > > +config ARM_SMMU_V3_KUNIT_TEST > > > + tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > > > + depends on KUNIT > > > + default KUNIT_ALL_TESTS > > > + help > > > + Enable this option to unit-test arm-smmu-v3 driver functions. > > > + > > > + If unsure, say N. > > > > Forgot that my SVA sanity doesn't cover this patch. And it looks > > like some problems here when building it with "=m": > > > > ERROR: modpost: missing MODULE_LICENSE() in drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o > > ERROR: modpost: "arm_smmu_make_cdtable_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_make_bypass_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_make_abort_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_make_s2_domain_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_get_ste_used" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_write_entry" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > Likely needs MODULE_LICENSE and some EXPORT_SYMBOLs. > > Oh! The kbuild never tested this kconfig combination... > > I think just this? Michael? Verified with the following change. Thanks Nicolin > diff --git a/drivers/iommu/Kconfig b/drivers/iommu/Kconfig > index d03c729c4142dc..7b6a4e244e99cf 100644 > --- a/drivers/iommu/Kconfig > +++ b/drivers/iommu/Kconfig > @@ -411,7 +411,7 @@ config ARM_SMMU_V3_SVA > and PRI. > > config ARM_SMMU_V3_KUNIT_TEST > - tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > + bool "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > depends on KUNIT > default KUNIT_ALL_TESTS > help > diff --git a/drivers/iommu/arm/arm-smmu-v3/Makefile b/drivers/iommu/arm/arm-smmu-v3/Makefile > index 014a997753a8a2..0b97054b3929b7 100644 > --- a/drivers/iommu/arm/arm-smmu-v3/Makefile > +++ b/drivers/iommu/arm/arm-smmu-v3/Makefile > @@ -2,6 +2,5 @@ > obj-$(CONFIG_ARM_SMMU_V3) += arm_smmu_v3.o > arm_smmu_v3-objs-y += arm-smmu-v3.o > arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_SVA) += arm-smmu-v3-sva.o > +arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o > arm_smmu_v3-objs := $(arm_smmu_v3-objs-y) > - > -obj-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o > > > Jason
On Wed, Apr 17, 2024 at 10:16 PM Jason Gunthorpe <jgg@nvidia.com> wrote: > > On Wed, Apr 17, 2024 at 01:09:40AM -0700, Nicolin Chen wrote: > > On Tue, Apr 16, 2024 at 04:28:20PM -0300, Jason Gunthorpe wrote: > > > Add tests for some of the more common STE update operations that we expect > > > to see, as well as some artificial STE updates to test the edges of > > > arm_smmu_write_entry. These also serve as a record of which common > > > operation is expected to be hitless, and how many syncs they require. > > > > > > arm_smmu_write_entry implements a generic algorithm that updates an STE/CD > > > to any other abritrary STE/CD configuration. The update requires a > > > sequence of write+sync operations with some invariants that must be held > > > true after each sync. arm_smmu_write_entry lends itself well to > > > unit-testing since the function's interaction with the STE/CD is already > > > abstracted by input callbacks that we can hook to introspect into the > > > sequence of operations. We can use these hooks to guarantee that > > > invariants are held throughout the entire update operation. > > > > > > Link: https://lore.kernel.org/r/20240106083617.1173871-3-mshavit@google.com > > > Signed-off-by: Michael Shavit <mshavit@google.com> > > > Signed-off-by: Jason Gunthorpe <jgg@nvidia.com> > > > --- > > > drivers/iommu/Kconfig | 12 +- > > > drivers/iommu/arm/arm-smmu-v3/Makefile | 2 + > > > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c | 6 +- > > > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c | 467 ++++++++++++++++++ > > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 36 +- > > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h | 30 ++ > > > > > +config ARM_SMMU_V3_KUNIT_TEST > > > + tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > > > + depends on KUNIT > > > + default KUNIT_ALL_TESTS > > > + help > > > + Enable this option to unit-test arm-smmu-v3 driver functions. > > > + > > > + If unsure, say N. > > > > Forgot that my SVA sanity doesn't cover this patch. And it looks > > like some problems here when building it with "=m": > > > > ERROR: modpost: missing MODULE_LICENSE() in drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o > > ERROR: modpost: "arm_smmu_make_cdtable_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_make_bypass_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_make_abort_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_make_s2_domain_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_get_ste_used" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > ERROR: modpost: "arm_smmu_write_entry" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > Likely needs MODULE_LICENSE and some EXPORT_SYMBOLs. > > Oh! The kbuild never tested this kconfig combination... > > I think just this? Michael? Urhh I'm not sure... Should this also depend on ARM_SMMU_V3? Also what happens if ARM_SMMU_V3=m and ARM_SMMU_V3_KUNIT_TEST=y ? > > > diff --git a/drivers/iommu/Kconfig b/drivers/iommu/Kconfig > index d03c729c4142dc..7b6a4e244e99cf 100644 > --- a/drivers/iommu/Kconfig > +++ b/drivers/iommu/Kconfig > @@ -411,7 +411,7 @@ config ARM_SMMU_V3_SVA > and PRI. > > config ARM_SMMU_V3_KUNIT_TEST > - tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > + bool "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > depends on KUNIT > default KUNIT_ALL_TESTS > help > diff --git a/drivers/iommu/arm/arm-smmu-v3/Makefile b/drivers/iommu/arm/arm-smmu-v3/Makefile > index 014a997753a8a2..0b97054b3929b7 100644 > --- a/drivers/iommu/arm/arm-smmu-v3/Makefile > +++ b/drivers/iommu/arm/arm-smmu-v3/Makefile > @@ -2,6 +2,5 @@ > obj-$(CONFIG_ARM_SMMU_V3) += arm_smmu_v3.o > arm_smmu_v3-objs-y += arm-smmu-v3.o > arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_SVA) += arm-smmu-v3-sva.o > +arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o > arm_smmu_v3-objs := $(arm_smmu_v3-objs-y) > - > -obj-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o > > > Jason
On Thu, Apr 18, 2024 at 12:39:29PM +0800, Michael Shavit wrote: > > > Forgot that my SVA sanity doesn't cover this patch. And it looks > > > like some problems here when building it with "=m": > > > > > > ERROR: modpost: missing MODULE_LICENSE() in drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o > > > ERROR: modpost: "arm_smmu_make_cdtable_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > ERROR: modpost: "arm_smmu_make_bypass_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > ERROR: modpost: "arm_smmu_make_abort_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > ERROR: modpost: "arm_smmu_make_s2_domain_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > ERROR: modpost: "arm_smmu_get_ste_used" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > ERROR: modpost: "arm_smmu_write_entry" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > > > Likely needs MODULE_LICENSE and some EXPORT_SYMBOLs. > > > > Oh! The kbuild never tested this kconfig combination... > > > > I think just this? Michael? > > Urhh I'm not sure... Should this also depend on ARM_SMMU_V3? It does: if ARM_SMMU_V3 config ARM_SMMU_V3_SVA bool "Shared Virtual Addressing support for the ARM SMMUv3" select IOMMU_SVA select IOMMU_IOPF select MMU_NOTIFIER help Support for sharing process address spaces with devices using the SMMUv3. Say Y here if your system supports SVA extensions such as PCIe PASID and PRI. config ARM_SMMU_V3_KUNIT_TEST bool "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS depends on KUNIT default KUNIT_ALL_TESTS help Enable this option to unit-test arm-smmu-v3 driver functions. If unsure, say N. endif The 'if' creates an automatic dependency and groups things into a kconfig menu > Also what > happens if ARM_SMMU_V3=m and ARM_SMMU_V3_KUNIT_TEST=y ? Works fine, the kunit symbols are exported and we still build one module for smmu so we don't need to have internal cross-module stuff Jason
On Thu, Apr 18, 2024 at 8:49 PM Jason Gunthorpe <jgg@nvidia.com> wrote: > > On Thu, Apr 18, 2024 at 12:39:29PM +0800, Michael Shavit wrote: > > > > Forgot that my SVA sanity doesn't cover this patch. And it looks > > > > like some problems here when building it with "=m": > > > > > > > > ERROR: modpost: missing MODULE_LICENSE() in drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o > > > > ERROR: modpost: "arm_smmu_make_cdtable_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > ERROR: modpost: "arm_smmu_make_bypass_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > ERROR: modpost: "arm_smmu_make_abort_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > ERROR: modpost: "arm_smmu_make_s2_domain_ste" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > ERROR: modpost: "arm_smmu_get_ste_used" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > ERROR: modpost: "arm_smmu_write_entry" [drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.ko] undefined! > > > > > > > > Likely needs MODULE_LICENSE and some EXPORT_SYMBOLs. > > > > > > Oh! The kbuild never tested this kconfig combination... > > > > > > I think just this? Michael? > > > > Urhh I'm not sure... Should this also depend on ARM_SMMU_V3? > > It does: > > if ARM_SMMU_V3 > config ARM_SMMU_V3_SVA > bool "Shared Virtual Addressing support for the ARM SMMUv3" > select IOMMU_SVA > select IOMMU_IOPF > select MMU_NOTIFIER > help > Support for sharing process address spaces with devices using the > SMMUv3. > > Say Y here if your system supports SVA extensions such as PCIe PASID > and PRI. > > config ARM_SMMU_V3_KUNIT_TEST > bool "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > depends on KUNIT > default KUNIT_ALL_TESTS > help > Enable this option to unit-test arm-smmu-v3 driver functions. > > If unsure, say N. > endif > > The 'if' creates an automatic dependency and groups things into a > kconfig menu Ohhh, I should have looked more carefully :) . Thanks. > > > Also what > > happens if ARM_SMMU_V3=m and ARM_SMMU_V3_KUNIT_TEST=y ? > > Works fine, the kunit symbols are exported and we still build one > module for smmu so we don't need to have internal cross-module stuff > > Jason
Hi Jason, I am still reviewing the patch, however 2 quick notes. On Tue, Apr 16, 2024 at 04:28:20PM -0300, Jason Gunthorpe wrote: > Add tests for some of the more common STE update operations that we expect > to see, as well as some artificial STE updates to test the edges of > arm_smmu_write_entry. These also serve as a record of which common > operation is expected to be hitless, and how many syncs they require. > > arm_smmu_write_entry implements a generic algorithm that updates an STE/CD > to any other abritrary STE/CD configuration. The update requires a > sequence of write+sync operations with some invariants that must be held > true after each sync. arm_smmu_write_entry lends itself well to > unit-testing since the function's interaction with the STE/CD is already > abstracted by input callbacks that we can hook to introspect into the > sequence of operations. We can use these hooks to guarantee that > invariants are held throughout the entire update operation. > > Link: https://lore.kernel.org/r/20240106083617.1173871-3-mshavit@google.com > Signed-off-by: Michael Shavit <mshavit@google.com> > Signed-off-by: Jason Gunthorpe <jgg@nvidia.com> > --- > drivers/iommu/Kconfig | 12 +- > drivers/iommu/arm/arm-smmu-v3/Makefile | 2 + > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c | 6 +- > .../iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c | 467 ++++++++++++++++++ > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 36 +- > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h | 30 ++ > 6 files changed, 525 insertions(+), 28 deletions(-) > create mode 100644 drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c > > diff --git a/drivers/iommu/Kconfig b/drivers/iommu/Kconfig > index 0af39bbbe3a30e..2e597102baf6e5 100644 > --- a/drivers/iommu/Kconfig > +++ b/drivers/iommu/Kconfig > @@ -397,9 +397,9 @@ config ARM_SMMU_V3 > Say Y here if your system includes an IOMMU device implementing > the ARM SMMUv3 architecture. > > +if ARM_SMMU_V3 > config ARM_SMMU_V3_SVA > bool "Shared Virtual Addressing support for the ARM SMMUv3" > - depends on ARM_SMMU_V3 > select IOMMU_SVA > select IOMMU_IOPF > select MMU_NOTIFIER > @@ -410,6 +410,16 @@ config ARM_SMMU_V3_SVA > Say Y here if your system supports SVA extensions such as PCIe PASID > and PRI. > > +config ARM_SMMU_V3_KUNIT_TEST > + tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > + depends on KUNIT > + default KUNIT_ALL_TESTS > + help > + Enable this option to unit-test arm-smmu-v3 driver functions. > + > + If unsure, say N. > +endif > + > config S390_IOMMU > def_bool y if S390 && PCI > depends on S390 && PCI > diff --git a/drivers/iommu/arm/arm-smmu-v3/Makefile b/drivers/iommu/arm/arm-smmu-v3/Makefile > index 54feb1ecccad89..014a997753a8a2 100644 > --- a/drivers/iommu/arm/arm-smmu-v3/Makefile > +++ b/drivers/iommu/arm/arm-smmu-v3/Makefile > @@ -3,3 +3,5 @@ obj-$(CONFIG_ARM_SMMU_V3) += arm_smmu_v3.o > arm_smmu_v3-objs-y += arm-smmu-v3.o > arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_SVA) += arm-smmu-v3-sva.o > arm_smmu_v3-objs := $(arm_smmu_v3-objs-y) > + > +obj-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o > diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c > index 80a7d559ef2d3f..f56a2d38012b5c 100644 > --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c > +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c > @@ -120,9 +120,9 @@ static u64 page_size_to_cd(void) > return ARM_LPAE_TCR_TG0_4K; > } > > -static void arm_smmu_make_sva_cd(struct arm_smmu_cd *target, > - struct arm_smmu_master *master, > - struct mm_struct *mm, u16 asid) > +void arm_smmu_make_sva_cd(struct arm_smmu_cd *target, > + struct arm_smmu_master *master, struct mm_struct *mm, > + u16 asid) > { > u64 par; > > diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c > new file mode 100644 > index 00000000000000..14c8e40712a70e > --- /dev/null > +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c > @@ -0,0 +1,467 @@ > +// SPDX-License-Identifier: GPL-2.0 > +/* > + * Copyright 2024 Google LLC. > + */ > +#include <kunit/test.h> > +#include <linux/io-pgtable.h> > + > +#include "arm-smmu-v3.h" > + > +struct arm_smmu_test_writer { > + struct arm_smmu_entry_writer writer; > + struct kunit *test; > + const __le64 *init_entry; > + const __le64 *target_entry; > + __le64 *entry; > + > + bool invalid_entry_written; > + unsigned int num_syncs; > +}; > + > +#define NUM_ENTRY_QWORDS 8 > +#define NUM_EXPECTED_SYNCS(x) x > + > +static struct arm_smmu_ste bypass_ste; > +static struct arm_smmu_ste abort_ste; > +static struct arm_smmu_device smmu = { > + .features = ARM_SMMU_FEAT_STALLS | ARM_SMMU_FEAT_ATTR_TYPES_OVR > +}; > + > +static bool arm_smmu_entry_differs_in_used_bits(const __le64 *entry, > + const __le64 *used_bits, > + const __le64 *target, > + unsigned int length) > +{ > + bool differs = false; > + unsigned int i; > + > + for (i = 0; i < length; i++) { > + if ((entry[i] & used_bits[i]) != target[i]) > + differs = true; > + } > + return differs; > +} > + > +static void > +arm_smmu_test_writer_record_syncs(struct arm_smmu_entry_writer *writer) > +{ > + struct arm_smmu_test_writer *test_writer = > + container_of(writer, struct arm_smmu_test_writer, writer); > + __le64 *entry_used_bits; > + > + entry_used_bits = kunit_kzalloc( > + test_writer->test, sizeof(*entry_used_bits) * NUM_ENTRY_QWORDS, > + GFP_KERNEL); > + KUNIT_ASSERT_NOT_NULL(test_writer->test, entry_used_bits); > + > + pr_debug("STE value is now set to: "); > + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, > + test_writer->entry, > + NUM_ENTRY_QWORDS * sizeof(*test_writer->entry), > + false); > + > + test_writer->num_syncs += 1; > + if (!(test_writer->entry[0] & writer->ops->v_bit)) { > + test_writer->invalid_entry_written = true; > + } else { > + /* > + * At any stage in a hitless transition, the entry must be > + * equivalent to either the initial entry or the target entry > + * when only considering the bits used by the current > + * configuration. > + */ > + writer->ops->get_used(test_writer->entry, entry_used_bits); > + KUNIT_EXPECT_FALSE( > + test_writer->test, > + arm_smmu_entry_differs_in_used_bits( > + test_writer->entry, entry_used_bits, > + test_writer->init_entry, NUM_ENTRY_QWORDS) && > + arm_smmu_entry_differs_in_used_bits( > + test_writer->entry, entry_used_bits, > + test_writer->target_entry, > + NUM_ENTRY_QWORDS)); > + } > +} > + > +static void > +arm_smmu_v3_test_debug_print_used_bits(struct arm_smmu_entry_writer *writer, > + const __le64 *ste) > +{ > + __le64 used_bits[NUM_ENTRY_QWORDS] = {}; > + > + arm_smmu_get_ste_used(ste, used_bits); > + pr_debug("STE used bits: "); > + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, used_bits, > + sizeof(used_bits), false); > +} > + > +static const struct arm_smmu_entry_writer_ops test_ste_ops = { > + .v_bit = cpu_to_le64(STRTAB_STE_0_V), > + .sync = arm_smmu_test_writer_record_syncs, > + .get_used = arm_smmu_get_ste_used, > +}; > + > +static const struct arm_smmu_entry_writer_ops test_cd_ops = { > + .v_bit = cpu_to_le64(CTXDESC_CD_0_V), > + .sync = arm_smmu_test_writer_record_syncs, > + .get_used = arm_smmu_get_cd_used, > +}; > + > +static void arm_smmu_v3_test_ste_expect_transition( > + struct kunit *test, const struct arm_smmu_ste *cur, > + const struct arm_smmu_ste *target, unsigned int num_syncs_expected, > + bool hitless) > +{ > + struct arm_smmu_ste cur_copy = *cur; > + struct arm_smmu_test_writer test_writer = { > + .writer = { > + .ops = &test_ste_ops, > + }, > + .test = test, > + .init_entry = cur->data, > + .target_entry = target->data, > + .entry = cur_copy.data, > + .num_syncs = 0, > + .invalid_entry_written = false, > + > + }; > + > + pr_debug("STE initial value: "); > + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, cur_copy.data, > + sizeof(cur_copy), false); > + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, cur->data); > + pr_debug("STE target value: "); > + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, target->data, > + sizeof(cur_copy), false); > + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, > + target->data); > + > + arm_smmu_write_entry(&test_writer.writer, cur_copy.data, target->data); > + > + KUNIT_EXPECT_EQ(test, test_writer.invalid_entry_written, !hitless); > + KUNIT_EXPECT_EQ(test, test_writer.num_syncs, num_syncs_expected); > + KUNIT_EXPECT_MEMEQ(test, target->data, cur_copy.data, sizeof(cur_copy)); > +} > + > +static void arm_smmu_v3_test_ste_expect_hitless_transition( > + struct kunit *test, const struct arm_smmu_ste *cur, > + const struct arm_smmu_ste *target, unsigned int num_syncs_expected) > +{ > + arm_smmu_v3_test_ste_expect_transition(test, cur, target, > + num_syncs_expected, true); > +} > + > +static const dma_addr_t fake_cdtab_dma_addr = 0xF0F0F0F0F0F0; > + > +static void arm_smmu_test_make_cdtable_ste(struct arm_smmu_ste *ste, > + const dma_addr_t dma_addr) > +{ > + struct arm_smmu_master master = { > + .cd_table.cdtab_dma = dma_addr, > + .cd_table.s1cdmax = 0xFF, > + .cd_table.s1fmt = STRTAB_STE_0_S1FMT_64K_L2, > + .smmu = &smmu, > + }; > + > + arm_smmu_make_cdtable_ste(ste, &master); > +} > + > +static void arm_smmu_v3_write_ste_test_bypass_to_abort(struct kunit *test) > +{ > + /* > + * Bypass STEs has used bits in the first two Qwords, while abort STEs > + * only have used bits in the first QWord. Transitioning from bypass to > + * abort requires two syncs: the first to set the first qword and make > + * the STE into an abort, the second to clean up the second qword. > + */ > + arm_smmu_v3_test_ste_expect_hitless_transition( > + test, &bypass_ste, &abort_ste, NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_ste_test_abort_to_bypass(struct kunit *test) > +{ > + /* > + * Transitioning from abort to bypass also requires two syncs: the first > + * to set the second qword data required by the bypass STE, and the > + * second to set the first qword and switch to bypass. > + */ > + arm_smmu_v3_test_ste_expect_hitless_transition( > + test, &abort_ste, &bypass_ste, NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_ste_test_cdtable_to_abort(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &abort_ste, > + NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_ste_test_abort_to_cdtable(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &abort_ste, &ste, > + NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_ste_test_cdtable_to_bypass(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &bypass_ste, > + NUM_EXPECTED_SYNCS(3)); > +} > + > +static void arm_smmu_v3_write_ste_test_bypass_to_cdtable(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &bypass_ste, &ste, > + NUM_EXPECTED_SYNCS(3)); > +} > + > +static void arm_smmu_test_make_s2_ste(struct arm_smmu_ste *ste, > + bool ats_enabled) > +{ > + struct arm_smmu_master master = { > + .smmu = &smmu, > + .ats_enabled = ats_enabled, > + }; > + struct io_pgtable io_pgtable = {}; > + struct arm_smmu_domain smmu_domain = { > + .pgtbl_ops = &io_pgtable.ops, > + }; > + > + io_pgtable.cfg.arm_lpae_s2_cfg.vttbr = 0xdaedbeefdeadbeefULL; > + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.ps = 1; > + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.tg = 2; > + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.sh = 3; > + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.orgn = 1; > + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.irgn = 2; > + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.sl = 3; > + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.tsz = 4; > + > + arm_smmu_make_s2_domain_ste(ste, &master, &smmu_domain); > +} > + > +static void arm_smmu_v3_write_ste_test_s2_to_abort(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_s2_ste(&ste, true); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &abort_ste, > + NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_ste_test_abort_to_s2(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_s2_ste(&ste, true); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &abort_ste, &ste, > + NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_ste_test_s2_to_bypass(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_s2_ste(&ste, true); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &bypass_ste, > + NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_ste_test_bypass_to_s2(struct kunit *test) > +{ > + struct arm_smmu_ste ste; > + > + arm_smmu_test_make_s2_ste(&ste, true); > + arm_smmu_v3_test_ste_expect_hitless_transition(test, &bypass_ste, &ste, > + NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_test_cd_expect_transition( > + struct kunit *test, const struct arm_smmu_cd *cur, > + const struct arm_smmu_cd *target, unsigned int num_syncs_expected, > + bool hitless) > +{ > + struct arm_smmu_cd cur_copy = *cur; > + struct arm_smmu_test_writer test_writer = { > + .writer = { > + .ops = &test_cd_ops, > + }, > + .test = test, > + .init_entry = cur->data, > + .target_entry = target->data, > + .entry = cur_copy.data, > + .num_syncs = 0, > + .invalid_entry_written = false, > + > + }; > + > + pr_debug("CD initial value: "); > + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, cur_copy.data, > + sizeof(cur_copy), false); > + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, cur->data); > + pr_debug("CD target value: "); > + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, target->data, > + sizeof(cur_copy), false); > + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, > + target->data); > + > + arm_smmu_write_entry(&test_writer.writer, cur_copy.data, target->data); > + > + KUNIT_EXPECT_EQ(test, test_writer.invalid_entry_written, !hitless); > + KUNIT_EXPECT_EQ(test, test_writer.num_syncs, num_syncs_expected); > + KUNIT_EXPECT_MEMEQ(test, target->data, cur_copy.data, sizeof(cur_copy)); > +} > + > +static void arm_smmu_v3_test_cd_expect_non_hitless_transition( > + struct kunit *test, const struct arm_smmu_cd *cur, > + const struct arm_smmu_cd *target, unsigned int num_syncs_expected) > +{ > + arm_smmu_v3_test_cd_expect_transition(test, cur, target, > + num_syncs_expected, false); > +} > + > +static void arm_smmu_v3_test_cd_expect_hitless_transition( > + struct kunit *test, const struct arm_smmu_cd *cur, > + const struct arm_smmu_cd *target, unsigned int num_syncs_expected) > +{ > + arm_smmu_v3_test_cd_expect_transition(test, cur, target, > + num_syncs_expected, true); > +} > + > +static void arm_smmu_test_make_s1_cd(struct arm_smmu_cd *cd, unsigned int asid) > +{ > + struct arm_smmu_master master = { > + .smmu = &smmu, > + }; > + struct io_pgtable io_pgtable = {}; > + struct arm_smmu_domain smmu_domain = { > + .pgtbl_ops = &io_pgtable.ops, > + .cd = { > + .asid = asid, > + }, > + }; > + > + io_pgtable.cfg.arm_lpae_s1_cfg.ttbr = 0xdaedbeefdeadbeefULL; > + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.ips = 1; > + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.tg = 2; > + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.sh = 3; > + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.orgn = 1; > + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.irgn = 2; > + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.tsz = 4; > + io_pgtable.cfg.arm_lpae_s1_cfg.mair = 0xabcdef012345678ULL; > + > + arm_smmu_make_s1_cd(cd, &master, &smmu_domain); > +} > + > +static void arm_smmu_v3_write_cd_test_s1_clear(struct kunit *test) > +{ > + struct arm_smmu_cd cd = {}; > + struct arm_smmu_cd cd_2; > + > + arm_smmu_test_make_s1_cd(&cd_2, 1997); > + arm_smmu_v3_test_cd_expect_non_hitless_transition( > + test, &cd, &cd_2, NUM_EXPECTED_SYNCS(2)); > + arm_smmu_v3_test_cd_expect_non_hitless_transition( > + test, &cd_2, &cd, NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_cd_test_s1_change_asid(struct kunit *test) > +{ > + struct arm_smmu_cd cd = {}; > + struct arm_smmu_cd cd_2; > + > + arm_smmu_test_make_s1_cd(&cd, 778); > + arm_smmu_test_make_s1_cd(&cd_2, 1997); > + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd, &cd_2, > + NUM_EXPECTED_SYNCS(1)); > + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd_2, &cd, > + NUM_EXPECTED_SYNCS(1)); > +} > + > +static void arm_smmu_test_make_sva_cd(struct arm_smmu_cd *cd, unsigned int asid) > +{ > + struct arm_smmu_master master = { > + .smmu = &smmu, > + }; > + struct mm_struct mm = { > + .pgd = (void *)0xdaedbeefdeadbeefULL, > + }; > + > + arm_smmu_make_sva_cd(cd, &master, &mm, asid); > +} > + > +static void arm_smmu_test_make_sva_release_cd(struct arm_smmu_cd *cd, > + unsigned int asid) > +{ > + struct arm_smmu_master master = { > + .smmu = &smmu, > + }; > + > + arm_smmu_make_sva_cd(cd, &master, NULL, asid); > +} > + The test doesn’t build with SVA disabled, it fails with: aarch64-linux-gnu-ld: Unexpected GOT/PLT entries detected! aarch64-linux-gnu-ld: Unexpected run-time procedure linkages detected! aarch64-linux-gnu-ld: drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o: in function `arm_smmu_test_make_sva_release_cd': .../linux/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c:409:(.text+0x17c): undefined reference to `arm_smmu_make_sva_cd' aarch64-linux-gnu-ld: drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o: in function `arm_smmu_test_make_sva_cd': .../linux/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c:399:(.text+0x230): undefined reference to `arm_smmu_make_sva_cd' I belive this check should be guarded under SVA. > +static void arm_smmu_v3_write_cd_test_sva_clear(struct kunit *test) > +{ > + struct arm_smmu_cd cd = {}; > + struct arm_smmu_cd cd_2; > + > + arm_smmu_test_make_sva_cd(&cd_2, 1997); > + arm_smmu_v3_test_cd_expect_non_hitless_transition( > + test, &cd, &cd_2, NUM_EXPECTED_SYNCS(2)); > + arm_smmu_v3_test_cd_expect_non_hitless_transition( > + test, &cd_2, &cd, NUM_EXPECTED_SYNCS(2)); > +} > + > +static void arm_smmu_v3_write_cd_test_sva_release(struct kunit *test) > +{ > + struct arm_smmu_cd cd; > + struct arm_smmu_cd cd_2; > + > + arm_smmu_test_make_sva_cd(&cd, 1997); > + arm_smmu_test_make_sva_release_cd(&cd_2, 1997); > + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd, &cd_2, > + NUM_EXPECTED_SYNCS(2)); > + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd_2, &cd, > + NUM_EXPECTED_SYNCS(2)); > +} > + > +static struct kunit_case arm_smmu_v3_test_cases[] = { > + KUNIT_CASE(arm_smmu_v3_write_ste_test_bypass_to_abort), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_abort_to_bypass), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_cdtable_to_abort), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_abort_to_cdtable), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_cdtable_to_bypass), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_bypass_to_cdtable), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_s2_to_abort), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_abort_to_s2), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_s2_to_bypass), > + KUNIT_CASE(arm_smmu_v3_write_ste_test_bypass_to_s2), > + KUNIT_CASE(arm_smmu_v3_write_cd_test_s1_clear), > + KUNIT_CASE(arm_smmu_v3_write_cd_test_s1_change_asid), > + KUNIT_CASE(arm_smmu_v3_write_cd_test_sva_clear), > + KUNIT_CASE(arm_smmu_v3_write_cd_test_sva_release), > + {}, > +}; > + > +static int arm_smmu_v3_test_suite_init(struct kunit_suite *test) > +{ > + arm_smmu_make_bypass_ste(&smmu, &bypass_ste); > + arm_smmu_make_abort_ste(&abort_ste); > + return 0; > +} > + > +static struct kunit_suite arm_smmu_v3_test_module = { > + .name = "arm-smmu-v3-kunit-test", > + .suite_init = arm_smmu_v3_test_suite_init, > + .test_cases = arm_smmu_v3_test_cases, > +}; > +kunit_test_suites(&arm_smmu_v3_test_module); > diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c > index 72402f6a7ed4e0..3ffaa3b34b44bf 100644 > --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c > +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c > @@ -42,18 +42,6 @@ enum arm_smmu_msi_index { > ARM_SMMU_MAX_MSIS, > }; > > -struct arm_smmu_entry_writer_ops; > -struct arm_smmu_entry_writer { > - const struct arm_smmu_entry_writer_ops *ops; > - struct arm_smmu_master *master; > -}; > - > -struct arm_smmu_entry_writer_ops { > - __le64 v_bit; > - void (*get_used)(const __le64 *entry, __le64 *used); > - void (*sync)(struct arm_smmu_entry_writer *writer); > -}; > - > #define NUM_ENTRY_QWORDS 8 > static_assert(sizeof(struct arm_smmu_ste) == NUM_ENTRY_QWORDS * sizeof(u64)); > static_assert(sizeof(struct arm_smmu_cd) == NUM_ENTRY_QWORDS * sizeof(u64)); > @@ -980,7 +968,7 @@ void arm_smmu_tlb_inv_asid(struct arm_smmu_device *smmu, u16 asid) > * would be nice if this was complete according to the spec, but minimally it > * has to capture the bits this driver uses. > */ > -static void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) > +void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) IMO we should not export all these low level functions unconditionally. KUNIT already defines “VISIBLE_IF_KUNIT” which sets symbols to be static if CONFIG_KUNIT is not enabled. Or maybe even guard it for this test like what btrfs does with “EXPORT_FOR_TESTS” Thanks, Mostafa > { > unsigned int cfg = FIELD_GET(STRTAB_STE_0_CFG, le64_to_cpu(ent[0])); > > @@ -1102,8 +1090,8 @@ static bool entry_set(struct arm_smmu_entry_writer *writer, __le64 *entry, > * V=0 process. This relies on the IGNORED behavior described in the > * specification. > */ > -static void arm_smmu_write_entry(struct arm_smmu_entry_writer *writer, > - __le64 *entry, const __le64 *target) > +void arm_smmu_write_entry(struct arm_smmu_entry_writer *writer, __le64 *entry, > + const __le64 *target) > { > __le64 unused_update[NUM_ENTRY_QWORDS]; > u8 used_qword_diff; > @@ -1257,7 +1245,7 @@ struct arm_smmu_cd_writer { > unsigned int ssid; > }; > > -static void arm_smmu_get_cd_used(const __le64 *ent, __le64 *used_bits) > +void arm_smmu_get_cd_used(const __le64 *ent, __le64 *used_bits) > { > used_bits[0] = cpu_to_le64(CTXDESC_CD_0_V); > if (!(ent[0] & cpu_to_le64(CTXDESC_CD_0_V))) > @@ -1514,7 +1502,7 @@ static void arm_smmu_write_ste(struct arm_smmu_master *master, u32 sid, > } > } > > -static void arm_smmu_make_abort_ste(struct arm_smmu_ste *target) > +void arm_smmu_make_abort_ste(struct arm_smmu_ste *target) > { > memset(target, 0, sizeof(*target)); > target->data[0] = cpu_to_le64( > @@ -1522,8 +1510,8 @@ static void arm_smmu_make_abort_ste(struct arm_smmu_ste *target) > FIELD_PREP(STRTAB_STE_0_CFG, STRTAB_STE_0_CFG_ABORT)); > } > > -static void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, > - struct arm_smmu_ste *target) > +void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, > + struct arm_smmu_ste *target) > { > memset(target, 0, sizeof(*target)); > target->data[0] = cpu_to_le64( > @@ -1535,8 +1523,8 @@ static void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, > STRTAB_STE_1_SHCFG_INCOMING)); > } > > -static void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, > - struct arm_smmu_master *master) > +void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, > + struct arm_smmu_master *master) > { > struct arm_smmu_ctx_desc_cfg *cd_table = &master->cd_table; > struct arm_smmu_device *smmu = master->smmu; > @@ -1585,9 +1573,9 @@ static void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, > } > } > > -static void arm_smmu_make_s2_domain_ste(struct arm_smmu_ste *target, > - struct arm_smmu_master *master, > - struct arm_smmu_domain *smmu_domain) > +void arm_smmu_make_s2_domain_ste(struct arm_smmu_ste *target, > + struct arm_smmu_master *master, > + struct arm_smmu_domain *smmu_domain) > { > struct arm_smmu_s2_cfg *s2_cfg = &smmu_domain->s2_cfg; > const struct io_pgtable_cfg *pgtbl_cfg = > diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h > index 8f791f67f9f7f4..0455498d24c730 100644 > --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h > +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h > @@ -737,6 +737,36 @@ struct arm_smmu_domain { > struct list_head mmu_notifiers; > }; > > +/* The following are exposed for testing purposes. */ > +struct arm_smmu_entry_writer_ops; > +struct arm_smmu_entry_writer { > + const struct arm_smmu_entry_writer_ops *ops; > + struct arm_smmu_master *master; > +}; > + > +struct arm_smmu_entry_writer_ops { > + __le64 v_bit; > + void (*get_used)(const __le64 *entry, __le64 *used); > + void (*sync)(struct arm_smmu_entry_writer *writer); > +}; > + > +void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits); > +void arm_smmu_get_cd_used(const __le64 *ent, __le64 *used_bits); > +void arm_smmu_write_entry(struct arm_smmu_entry_writer *writer, __le64 *cur, > + const __le64 *target); > + > +void arm_smmu_make_abort_ste(struct arm_smmu_ste *target); > +void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, > + struct arm_smmu_ste *target); > +void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, > + struct arm_smmu_master *master); > +void arm_smmu_make_s2_domain_ste(struct arm_smmu_ste *target, > + struct arm_smmu_master *master, > + struct arm_smmu_domain *smmu_domain); > +void arm_smmu_make_sva_cd(struct arm_smmu_cd *target, > + struct arm_smmu_master *master, struct mm_struct *mm, > + u16 asid); > + > static inline struct arm_smmu_domain *to_smmu_domain(struct iommu_domain *dom) > { > return container_of(dom, struct arm_smmu_domain, domain); > -- > 2.43.2 >
On Fri, Apr 19, 2024 at 09:24:52PM +0000, Mostafa Saleh wrote: > > +static void arm_smmu_test_make_sva_release_cd(struct arm_smmu_cd *cd, > > + unsigned int asid) > > +{ > > + struct arm_smmu_master master = { > > + .smmu = &smmu, > > + }; > > + > > + arm_smmu_make_sva_cd(cd, &master, NULL, asid); > > +} > > + > > The test doesn’t build with SVA disabled, it fails with: > aarch64-linux-gnu-ld: Unexpected GOT/PLT entries detected! > aarch64-linux-gnu-ld: Unexpected run-time procedure linkages detected! > aarch64-linux-gnu-ld: drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o: in function `arm_smmu_test_make_sva_release_cd': > .../linux/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c:409:(.text+0x17c): undefined reference to `arm_smmu_make_sva_cd' > aarch64-linux-gnu-ld: drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o: in function `arm_smmu_test_make_sva_cd': > .../linux/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c:399:(.text+0x230): undefined reference to `arm_smmu_make_sva_cd' > > I belive this check should be guarded under SVA. Ugh yes, 0-day just hit this too. I'm just going to do this: config ARM_SMMU_V3_KUNIT_TEST bool "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS depends on KUNIT + depends on ARM_SMMU_V3_SVA default KUNIT_ALL_TESTS help Enable this option to unit-test arm-smmu-v3 driver functions. Instead of adding #ifdefs. No reason not to test the whole driver? > > @@ -980,7 +968,7 @@ void arm_smmu_tlb_inv_asid(struct arm_smmu_device *smmu, u16 asid) > > * would be nice if this was complete according to the spec, but minimally it > > * has to capture the bits this driver uses. > > */ > > -static void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) > > +void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) > > IMO we should not export all these low level functions unconditionally. > KUNIT already defines “VISIBLE_IF_KUNIT” which sets symbols to be static > if CONFIG_KUNIT is not enabled. Or maybe even guard it for this test > like what btrfs does with “EXPORT_FOR_TESTS” Sure, that doesn't look like too much trouble long term. Thanks, Jason
On Mon, Apr 22, 2024 at 11:24:29AM -0300, Jason Gunthorpe wrote: > On Fri, Apr 19, 2024 at 09:24:52PM +0000, Mostafa Saleh wrote: > > > +static void arm_smmu_test_make_sva_release_cd(struct arm_smmu_cd *cd, > > > + unsigned int asid) > > > +{ > > > + struct arm_smmu_master master = { > > > + .smmu = &smmu, > > > + }; > > > + > > > + arm_smmu_make_sva_cd(cd, &master, NULL, asid); > > > +} > > > + > > > > The test doesn’t build with SVA disabled, it fails with: > > aarch64-linux-gnu-ld: Unexpected GOT/PLT entries detected! > > aarch64-linux-gnu-ld: Unexpected run-time procedure linkages detected! > > aarch64-linux-gnu-ld: drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o: in function `arm_smmu_test_make_sva_release_cd': > > .../linux/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c:409:(.text+0x17c): undefined reference to `arm_smmu_make_sva_cd' > > aarch64-linux-gnu-ld: drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.o: in function `arm_smmu_test_make_sva_cd': > > .../linux/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c:399:(.text+0x230): undefined reference to `arm_smmu_make_sva_cd' > > > > I belive this check should be guarded under SVA. > > Ugh yes, 0-day just hit this too. > > I'm just going to do this: > > config ARM_SMMU_V3_KUNIT_TEST > bool "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS > depends on KUNIT > + depends on ARM_SMMU_V3_SVA > default KUNIT_ALL_TESTS > help > Enable this option to unit-test arm-smmu-v3 driver functions. > > > Instead of adding #ifdefs. No reason not to test the whole driver? > Sounds good, I guess that option will be only used for development. Thanks, Mostafa > > > @@ -980,7 +968,7 @@ void arm_smmu_tlb_inv_asid(struct arm_smmu_device *smmu, u16 asid) > > > * would be nice if this was complete according to the spec, but minimally it > > > * has to capture the bits this driver uses. > > > */ > > > -static void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) > > > +void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) > > > > IMO we should not export all these low level functions unconditionally. > > KUNIT already defines “VISIBLE_IF_KUNIT” which sets symbols to be static > > if CONFIG_KUNIT is not enabled. Or maybe even guard it for this test > > like what btrfs does with “EXPORT_FOR_TESTS” > > Sure, that doesn't look like too much trouble long term. > > Thanks, > Jason
diff --git a/drivers/iommu/Kconfig b/drivers/iommu/Kconfig index 0af39bbbe3a30e..2e597102baf6e5 100644 --- a/drivers/iommu/Kconfig +++ b/drivers/iommu/Kconfig @@ -397,9 +397,9 @@ config ARM_SMMU_V3 Say Y here if your system includes an IOMMU device implementing the ARM SMMUv3 architecture. +if ARM_SMMU_V3 config ARM_SMMU_V3_SVA bool "Shared Virtual Addressing support for the ARM SMMUv3" - depends on ARM_SMMU_V3 select IOMMU_SVA select IOMMU_IOPF select MMU_NOTIFIER @@ -410,6 +410,16 @@ config ARM_SMMU_V3_SVA Say Y here if your system supports SVA extensions such as PCIe PASID and PRI. +config ARM_SMMU_V3_KUNIT_TEST + tristate "KUnit tests for arm-smmu-v3 driver" if !KUNIT_ALL_TESTS + depends on KUNIT + default KUNIT_ALL_TESTS + help + Enable this option to unit-test arm-smmu-v3 driver functions. + + If unsure, say N. +endif + config S390_IOMMU def_bool y if S390 && PCI depends on S390 && PCI diff --git a/drivers/iommu/arm/arm-smmu-v3/Makefile b/drivers/iommu/arm/arm-smmu-v3/Makefile index 54feb1ecccad89..014a997753a8a2 100644 --- a/drivers/iommu/arm/arm-smmu-v3/Makefile +++ b/drivers/iommu/arm/arm-smmu-v3/Makefile @@ -3,3 +3,5 @@ obj-$(CONFIG_ARM_SMMU_V3) += arm_smmu_v3.o arm_smmu_v3-objs-y += arm-smmu-v3.o arm_smmu_v3-objs-$(CONFIG_ARM_SMMU_V3_SVA) += arm-smmu-v3-sva.o arm_smmu_v3-objs := $(arm_smmu_v3-objs-y) + +obj-$(CONFIG_ARM_SMMU_V3_KUNIT_TEST) += arm-smmu-v3-test.o diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c index 80a7d559ef2d3f..f56a2d38012b5c 100644 --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c @@ -120,9 +120,9 @@ static u64 page_size_to_cd(void) return ARM_LPAE_TCR_TG0_4K; } -static void arm_smmu_make_sva_cd(struct arm_smmu_cd *target, - struct arm_smmu_master *master, - struct mm_struct *mm, u16 asid) +void arm_smmu_make_sva_cd(struct arm_smmu_cd *target, + struct arm_smmu_master *master, struct mm_struct *mm, + u16 asid) { u64 par; diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c new file mode 100644 index 00000000000000..14c8e40712a70e --- /dev/null +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3-test.c @@ -0,0 +1,467 @@ +// SPDX-License-Identifier: GPL-2.0 +/* + * Copyright 2024 Google LLC. + */ +#include <kunit/test.h> +#include <linux/io-pgtable.h> + +#include "arm-smmu-v3.h" + +struct arm_smmu_test_writer { + struct arm_smmu_entry_writer writer; + struct kunit *test; + const __le64 *init_entry; + const __le64 *target_entry; + __le64 *entry; + + bool invalid_entry_written; + unsigned int num_syncs; +}; + +#define NUM_ENTRY_QWORDS 8 +#define NUM_EXPECTED_SYNCS(x) x + +static struct arm_smmu_ste bypass_ste; +static struct arm_smmu_ste abort_ste; +static struct arm_smmu_device smmu = { + .features = ARM_SMMU_FEAT_STALLS | ARM_SMMU_FEAT_ATTR_TYPES_OVR +}; + +static bool arm_smmu_entry_differs_in_used_bits(const __le64 *entry, + const __le64 *used_bits, + const __le64 *target, + unsigned int length) +{ + bool differs = false; + unsigned int i; + + for (i = 0; i < length; i++) { + if ((entry[i] & used_bits[i]) != target[i]) + differs = true; + } + return differs; +} + +static void +arm_smmu_test_writer_record_syncs(struct arm_smmu_entry_writer *writer) +{ + struct arm_smmu_test_writer *test_writer = + container_of(writer, struct arm_smmu_test_writer, writer); + __le64 *entry_used_bits; + + entry_used_bits = kunit_kzalloc( + test_writer->test, sizeof(*entry_used_bits) * NUM_ENTRY_QWORDS, + GFP_KERNEL); + KUNIT_ASSERT_NOT_NULL(test_writer->test, entry_used_bits); + + pr_debug("STE value is now set to: "); + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, + test_writer->entry, + NUM_ENTRY_QWORDS * sizeof(*test_writer->entry), + false); + + test_writer->num_syncs += 1; + if (!(test_writer->entry[0] & writer->ops->v_bit)) { + test_writer->invalid_entry_written = true; + } else { + /* + * At any stage in a hitless transition, the entry must be + * equivalent to either the initial entry or the target entry + * when only considering the bits used by the current + * configuration. + */ + writer->ops->get_used(test_writer->entry, entry_used_bits); + KUNIT_EXPECT_FALSE( + test_writer->test, + arm_smmu_entry_differs_in_used_bits( + test_writer->entry, entry_used_bits, + test_writer->init_entry, NUM_ENTRY_QWORDS) && + arm_smmu_entry_differs_in_used_bits( + test_writer->entry, entry_used_bits, + test_writer->target_entry, + NUM_ENTRY_QWORDS)); + } +} + +static void +arm_smmu_v3_test_debug_print_used_bits(struct arm_smmu_entry_writer *writer, + const __le64 *ste) +{ + __le64 used_bits[NUM_ENTRY_QWORDS] = {}; + + arm_smmu_get_ste_used(ste, used_bits); + pr_debug("STE used bits: "); + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, used_bits, + sizeof(used_bits), false); +} + +static const struct arm_smmu_entry_writer_ops test_ste_ops = { + .v_bit = cpu_to_le64(STRTAB_STE_0_V), + .sync = arm_smmu_test_writer_record_syncs, + .get_used = arm_smmu_get_ste_used, +}; + +static const struct arm_smmu_entry_writer_ops test_cd_ops = { + .v_bit = cpu_to_le64(CTXDESC_CD_0_V), + .sync = arm_smmu_test_writer_record_syncs, + .get_used = arm_smmu_get_cd_used, +}; + +static void arm_smmu_v3_test_ste_expect_transition( + struct kunit *test, const struct arm_smmu_ste *cur, + const struct arm_smmu_ste *target, unsigned int num_syncs_expected, + bool hitless) +{ + struct arm_smmu_ste cur_copy = *cur; + struct arm_smmu_test_writer test_writer = { + .writer = { + .ops = &test_ste_ops, + }, + .test = test, + .init_entry = cur->data, + .target_entry = target->data, + .entry = cur_copy.data, + .num_syncs = 0, + .invalid_entry_written = false, + + }; + + pr_debug("STE initial value: "); + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, cur_copy.data, + sizeof(cur_copy), false); + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, cur->data); + pr_debug("STE target value: "); + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, target->data, + sizeof(cur_copy), false); + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, + target->data); + + arm_smmu_write_entry(&test_writer.writer, cur_copy.data, target->data); + + KUNIT_EXPECT_EQ(test, test_writer.invalid_entry_written, !hitless); + KUNIT_EXPECT_EQ(test, test_writer.num_syncs, num_syncs_expected); + KUNIT_EXPECT_MEMEQ(test, target->data, cur_copy.data, sizeof(cur_copy)); +} + +static void arm_smmu_v3_test_ste_expect_hitless_transition( + struct kunit *test, const struct arm_smmu_ste *cur, + const struct arm_smmu_ste *target, unsigned int num_syncs_expected) +{ + arm_smmu_v3_test_ste_expect_transition(test, cur, target, + num_syncs_expected, true); +} + +static const dma_addr_t fake_cdtab_dma_addr = 0xF0F0F0F0F0F0; + +static void arm_smmu_test_make_cdtable_ste(struct arm_smmu_ste *ste, + const dma_addr_t dma_addr) +{ + struct arm_smmu_master master = { + .cd_table.cdtab_dma = dma_addr, + .cd_table.s1cdmax = 0xFF, + .cd_table.s1fmt = STRTAB_STE_0_S1FMT_64K_L2, + .smmu = &smmu, + }; + + arm_smmu_make_cdtable_ste(ste, &master); +} + +static void arm_smmu_v3_write_ste_test_bypass_to_abort(struct kunit *test) +{ + /* + * Bypass STEs has used bits in the first two Qwords, while abort STEs + * only have used bits in the first QWord. Transitioning from bypass to + * abort requires two syncs: the first to set the first qword and make + * the STE into an abort, the second to clean up the second qword. + */ + arm_smmu_v3_test_ste_expect_hitless_transition( + test, &bypass_ste, &abort_ste, NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_ste_test_abort_to_bypass(struct kunit *test) +{ + /* + * Transitioning from abort to bypass also requires two syncs: the first + * to set the second qword data required by the bypass STE, and the + * second to set the first qword and switch to bypass. + */ + arm_smmu_v3_test_ste_expect_hitless_transition( + test, &abort_ste, &bypass_ste, NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_ste_test_cdtable_to_abort(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &abort_ste, + NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_ste_test_abort_to_cdtable(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &abort_ste, &ste, + NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_ste_test_cdtable_to_bypass(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &bypass_ste, + NUM_EXPECTED_SYNCS(3)); +} + +static void arm_smmu_v3_write_ste_test_bypass_to_cdtable(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_cdtable_ste(&ste, fake_cdtab_dma_addr); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &bypass_ste, &ste, + NUM_EXPECTED_SYNCS(3)); +} + +static void arm_smmu_test_make_s2_ste(struct arm_smmu_ste *ste, + bool ats_enabled) +{ + struct arm_smmu_master master = { + .smmu = &smmu, + .ats_enabled = ats_enabled, + }; + struct io_pgtable io_pgtable = {}; + struct arm_smmu_domain smmu_domain = { + .pgtbl_ops = &io_pgtable.ops, + }; + + io_pgtable.cfg.arm_lpae_s2_cfg.vttbr = 0xdaedbeefdeadbeefULL; + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.ps = 1; + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.tg = 2; + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.sh = 3; + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.orgn = 1; + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.irgn = 2; + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.sl = 3; + io_pgtable.cfg.arm_lpae_s2_cfg.vtcr.tsz = 4; + + arm_smmu_make_s2_domain_ste(ste, &master, &smmu_domain); +} + +static void arm_smmu_v3_write_ste_test_s2_to_abort(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_s2_ste(&ste, true); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &abort_ste, + NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_ste_test_abort_to_s2(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_s2_ste(&ste, true); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &abort_ste, &ste, + NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_ste_test_s2_to_bypass(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_s2_ste(&ste, true); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &ste, &bypass_ste, + NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_ste_test_bypass_to_s2(struct kunit *test) +{ + struct arm_smmu_ste ste; + + arm_smmu_test_make_s2_ste(&ste, true); + arm_smmu_v3_test_ste_expect_hitless_transition(test, &bypass_ste, &ste, + NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_test_cd_expect_transition( + struct kunit *test, const struct arm_smmu_cd *cur, + const struct arm_smmu_cd *target, unsigned int num_syncs_expected, + bool hitless) +{ + struct arm_smmu_cd cur_copy = *cur; + struct arm_smmu_test_writer test_writer = { + .writer = { + .ops = &test_cd_ops, + }, + .test = test, + .init_entry = cur->data, + .target_entry = target->data, + .entry = cur_copy.data, + .num_syncs = 0, + .invalid_entry_written = false, + + }; + + pr_debug("CD initial value: "); + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, cur_copy.data, + sizeof(cur_copy), false); + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, cur->data); + pr_debug("CD target value: "); + print_hex_dump_debug(" ", DUMP_PREFIX_NONE, 16, 8, target->data, + sizeof(cur_copy), false); + arm_smmu_v3_test_debug_print_used_bits(&test_writer.writer, + target->data); + + arm_smmu_write_entry(&test_writer.writer, cur_copy.data, target->data); + + KUNIT_EXPECT_EQ(test, test_writer.invalid_entry_written, !hitless); + KUNIT_EXPECT_EQ(test, test_writer.num_syncs, num_syncs_expected); + KUNIT_EXPECT_MEMEQ(test, target->data, cur_copy.data, sizeof(cur_copy)); +} + +static void arm_smmu_v3_test_cd_expect_non_hitless_transition( + struct kunit *test, const struct arm_smmu_cd *cur, + const struct arm_smmu_cd *target, unsigned int num_syncs_expected) +{ + arm_smmu_v3_test_cd_expect_transition(test, cur, target, + num_syncs_expected, false); +} + +static void arm_smmu_v3_test_cd_expect_hitless_transition( + struct kunit *test, const struct arm_smmu_cd *cur, + const struct arm_smmu_cd *target, unsigned int num_syncs_expected) +{ + arm_smmu_v3_test_cd_expect_transition(test, cur, target, + num_syncs_expected, true); +} + +static void arm_smmu_test_make_s1_cd(struct arm_smmu_cd *cd, unsigned int asid) +{ + struct arm_smmu_master master = { + .smmu = &smmu, + }; + struct io_pgtable io_pgtable = {}; + struct arm_smmu_domain smmu_domain = { + .pgtbl_ops = &io_pgtable.ops, + .cd = { + .asid = asid, + }, + }; + + io_pgtable.cfg.arm_lpae_s1_cfg.ttbr = 0xdaedbeefdeadbeefULL; + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.ips = 1; + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.tg = 2; + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.sh = 3; + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.orgn = 1; + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.irgn = 2; + io_pgtable.cfg.arm_lpae_s1_cfg.tcr.tsz = 4; + io_pgtable.cfg.arm_lpae_s1_cfg.mair = 0xabcdef012345678ULL; + + arm_smmu_make_s1_cd(cd, &master, &smmu_domain); +} + +static void arm_smmu_v3_write_cd_test_s1_clear(struct kunit *test) +{ + struct arm_smmu_cd cd = {}; + struct arm_smmu_cd cd_2; + + arm_smmu_test_make_s1_cd(&cd_2, 1997); + arm_smmu_v3_test_cd_expect_non_hitless_transition( + test, &cd, &cd_2, NUM_EXPECTED_SYNCS(2)); + arm_smmu_v3_test_cd_expect_non_hitless_transition( + test, &cd_2, &cd, NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_cd_test_s1_change_asid(struct kunit *test) +{ + struct arm_smmu_cd cd = {}; + struct arm_smmu_cd cd_2; + + arm_smmu_test_make_s1_cd(&cd, 778); + arm_smmu_test_make_s1_cd(&cd_2, 1997); + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd, &cd_2, + NUM_EXPECTED_SYNCS(1)); + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd_2, &cd, + NUM_EXPECTED_SYNCS(1)); +} + +static void arm_smmu_test_make_sva_cd(struct arm_smmu_cd *cd, unsigned int asid) +{ + struct arm_smmu_master master = { + .smmu = &smmu, + }; + struct mm_struct mm = { + .pgd = (void *)0xdaedbeefdeadbeefULL, + }; + + arm_smmu_make_sva_cd(cd, &master, &mm, asid); +} + +static void arm_smmu_test_make_sva_release_cd(struct arm_smmu_cd *cd, + unsigned int asid) +{ + struct arm_smmu_master master = { + .smmu = &smmu, + }; + + arm_smmu_make_sva_cd(cd, &master, NULL, asid); +} + +static void arm_smmu_v3_write_cd_test_sva_clear(struct kunit *test) +{ + struct arm_smmu_cd cd = {}; + struct arm_smmu_cd cd_2; + + arm_smmu_test_make_sva_cd(&cd_2, 1997); + arm_smmu_v3_test_cd_expect_non_hitless_transition( + test, &cd, &cd_2, NUM_EXPECTED_SYNCS(2)); + arm_smmu_v3_test_cd_expect_non_hitless_transition( + test, &cd_2, &cd, NUM_EXPECTED_SYNCS(2)); +} + +static void arm_smmu_v3_write_cd_test_sva_release(struct kunit *test) +{ + struct arm_smmu_cd cd; + struct arm_smmu_cd cd_2; + + arm_smmu_test_make_sva_cd(&cd, 1997); + arm_smmu_test_make_sva_release_cd(&cd_2, 1997); + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd, &cd_2, + NUM_EXPECTED_SYNCS(2)); + arm_smmu_v3_test_cd_expect_hitless_transition(test, &cd_2, &cd, + NUM_EXPECTED_SYNCS(2)); +} + +static struct kunit_case arm_smmu_v3_test_cases[] = { + KUNIT_CASE(arm_smmu_v3_write_ste_test_bypass_to_abort), + KUNIT_CASE(arm_smmu_v3_write_ste_test_abort_to_bypass), + KUNIT_CASE(arm_smmu_v3_write_ste_test_cdtable_to_abort), + KUNIT_CASE(arm_smmu_v3_write_ste_test_abort_to_cdtable), + KUNIT_CASE(arm_smmu_v3_write_ste_test_cdtable_to_bypass), + KUNIT_CASE(arm_smmu_v3_write_ste_test_bypass_to_cdtable), + KUNIT_CASE(arm_smmu_v3_write_ste_test_s2_to_abort), + KUNIT_CASE(arm_smmu_v3_write_ste_test_abort_to_s2), + KUNIT_CASE(arm_smmu_v3_write_ste_test_s2_to_bypass), + KUNIT_CASE(arm_smmu_v3_write_ste_test_bypass_to_s2), + KUNIT_CASE(arm_smmu_v3_write_cd_test_s1_clear), + KUNIT_CASE(arm_smmu_v3_write_cd_test_s1_change_asid), + KUNIT_CASE(arm_smmu_v3_write_cd_test_sva_clear), + KUNIT_CASE(arm_smmu_v3_write_cd_test_sva_release), + {}, +}; + +static int arm_smmu_v3_test_suite_init(struct kunit_suite *test) +{ + arm_smmu_make_bypass_ste(&smmu, &bypass_ste); + arm_smmu_make_abort_ste(&abort_ste); + return 0; +} + +static struct kunit_suite arm_smmu_v3_test_module = { + .name = "arm-smmu-v3-kunit-test", + .suite_init = arm_smmu_v3_test_suite_init, + .test_cases = arm_smmu_v3_test_cases, +}; +kunit_test_suites(&arm_smmu_v3_test_module); diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c index 72402f6a7ed4e0..3ffaa3b34b44bf 100644 --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c @@ -42,18 +42,6 @@ enum arm_smmu_msi_index { ARM_SMMU_MAX_MSIS, }; -struct arm_smmu_entry_writer_ops; -struct arm_smmu_entry_writer { - const struct arm_smmu_entry_writer_ops *ops; - struct arm_smmu_master *master; -}; - -struct arm_smmu_entry_writer_ops { - __le64 v_bit; - void (*get_used)(const __le64 *entry, __le64 *used); - void (*sync)(struct arm_smmu_entry_writer *writer); -}; - #define NUM_ENTRY_QWORDS 8 static_assert(sizeof(struct arm_smmu_ste) == NUM_ENTRY_QWORDS * sizeof(u64)); static_assert(sizeof(struct arm_smmu_cd) == NUM_ENTRY_QWORDS * sizeof(u64)); @@ -980,7 +968,7 @@ void arm_smmu_tlb_inv_asid(struct arm_smmu_device *smmu, u16 asid) * would be nice if this was complete according to the spec, but minimally it * has to capture the bits this driver uses. */ -static void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) +void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits) { unsigned int cfg = FIELD_GET(STRTAB_STE_0_CFG, le64_to_cpu(ent[0])); @@ -1102,8 +1090,8 @@ static bool entry_set(struct arm_smmu_entry_writer *writer, __le64 *entry, * V=0 process. This relies on the IGNORED behavior described in the * specification. */ -static void arm_smmu_write_entry(struct arm_smmu_entry_writer *writer, - __le64 *entry, const __le64 *target) +void arm_smmu_write_entry(struct arm_smmu_entry_writer *writer, __le64 *entry, + const __le64 *target) { __le64 unused_update[NUM_ENTRY_QWORDS]; u8 used_qword_diff; @@ -1257,7 +1245,7 @@ struct arm_smmu_cd_writer { unsigned int ssid; }; -static void arm_smmu_get_cd_used(const __le64 *ent, __le64 *used_bits) +void arm_smmu_get_cd_used(const __le64 *ent, __le64 *used_bits) { used_bits[0] = cpu_to_le64(CTXDESC_CD_0_V); if (!(ent[0] & cpu_to_le64(CTXDESC_CD_0_V))) @@ -1514,7 +1502,7 @@ static void arm_smmu_write_ste(struct arm_smmu_master *master, u32 sid, } } -static void arm_smmu_make_abort_ste(struct arm_smmu_ste *target) +void arm_smmu_make_abort_ste(struct arm_smmu_ste *target) { memset(target, 0, sizeof(*target)); target->data[0] = cpu_to_le64( @@ -1522,8 +1510,8 @@ static void arm_smmu_make_abort_ste(struct arm_smmu_ste *target) FIELD_PREP(STRTAB_STE_0_CFG, STRTAB_STE_0_CFG_ABORT)); } -static void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, - struct arm_smmu_ste *target) +void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, + struct arm_smmu_ste *target) { memset(target, 0, sizeof(*target)); target->data[0] = cpu_to_le64( @@ -1535,8 +1523,8 @@ static void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, STRTAB_STE_1_SHCFG_INCOMING)); } -static void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, - struct arm_smmu_master *master) +void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, + struct arm_smmu_master *master) { struct arm_smmu_ctx_desc_cfg *cd_table = &master->cd_table; struct arm_smmu_device *smmu = master->smmu; @@ -1585,9 +1573,9 @@ static void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, } } -static void arm_smmu_make_s2_domain_ste(struct arm_smmu_ste *target, - struct arm_smmu_master *master, - struct arm_smmu_domain *smmu_domain) +void arm_smmu_make_s2_domain_ste(struct arm_smmu_ste *target, + struct arm_smmu_master *master, + struct arm_smmu_domain *smmu_domain) { struct arm_smmu_s2_cfg *s2_cfg = &smmu_domain->s2_cfg; const struct io_pgtable_cfg *pgtbl_cfg = diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h index 8f791f67f9f7f4..0455498d24c730 100644 --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h @@ -737,6 +737,36 @@ struct arm_smmu_domain { struct list_head mmu_notifiers; }; +/* The following are exposed for testing purposes. */ +struct arm_smmu_entry_writer_ops; +struct arm_smmu_entry_writer { + const struct arm_smmu_entry_writer_ops *ops; + struct arm_smmu_master *master; +}; + +struct arm_smmu_entry_writer_ops { + __le64 v_bit; + void (*get_used)(const __le64 *entry, __le64 *used); + void (*sync)(struct arm_smmu_entry_writer *writer); +}; + +void arm_smmu_get_ste_used(const __le64 *ent, __le64 *used_bits); +void arm_smmu_get_cd_used(const __le64 *ent, __le64 *used_bits); +void arm_smmu_write_entry(struct arm_smmu_entry_writer *writer, __le64 *cur, + const __le64 *target); + +void arm_smmu_make_abort_ste(struct arm_smmu_ste *target); +void arm_smmu_make_bypass_ste(struct arm_smmu_device *smmu, + struct arm_smmu_ste *target); +void arm_smmu_make_cdtable_ste(struct arm_smmu_ste *target, + struct arm_smmu_master *master); +void arm_smmu_make_s2_domain_ste(struct arm_smmu_ste *target, + struct arm_smmu_master *master, + struct arm_smmu_domain *smmu_domain); +void arm_smmu_make_sva_cd(struct arm_smmu_cd *target, + struct arm_smmu_master *master, struct mm_struct *mm, + u16 asid); + static inline struct arm_smmu_domain *to_smmu_domain(struct iommu_domain *dom) { return container_of(dom, struct arm_smmu_domain, domain);