diff mbox series

[2/2] thunderbolt: Use pre-boot DMA protection on AMD systems

Message ID 20220315162455.5190-2-mario.limonciello@amd.com (mailing list archive)
State Superseded
Headers show
Series [1/2] iommu/amd: Add support to indicate whether DMA remap support is enabled | expand

Commit Message

Mario Limonciello March 15, 2022, 4:24 p.m. UTC
The information is exported from the IOMMU driver whether or not
pre-boot DMA protection has been enabled on AMD systems.  Use this
information to properly set iomma_dma_protection.

Link: https://docs.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-kernel-dma-protection
Link: https://www.amd.com/system/files/TechDocs/48882_IOMMU.pdf
Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
---
 drivers/thunderbolt/domain.c | 11 ++++++++---
 1 file changed, 8 insertions(+), 3 deletions(-)

Comments

Christoph Hellwig March 15, 2022, 4:48 p.m. UTC | #1
On Tue, Mar 15, 2022 at 11:24:55AM -0500, Mario Limonciello wrote:
> -	 * handled natively using IOMMU. It is enabled when IOMMU is
> -	 * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN set.
> +	 * handled natively using IOMMU. It is enabled when the IOMMU is
> +	 * enabled and either:
> +	 * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
> +	 * or
> +	 * ACPI IVRS table has DMA_REMAP bitset
>  	 */
>  	return sprintf(buf, "%d\n",
> -		       iommu_present(&pci_bus_type) && dmar_platform_optin());
> +		       iommu_present(&pci_bus_type) &&
> +		       (dmar_platform_optin() || amd_ivrs_remap_support()));

Yikes.  No, the thunderbot code does not have any business poking into
either dmar_platform_optin or amd_ivrs_remap_support.  This needs
a proper abstration from the IOMMU code.
Mario Limonciello March 15, 2022, 4:54 p.m. UTC | #2
[Public]


> On Tue, Mar 15, 2022 at 11:24:55AM -0500, Mario Limonciello wrote:
> > -	 * handled natively using IOMMU. It is enabled when IOMMU is
> > -	 * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN set.
> > +	 * handled natively using IOMMU. It is enabled when the IOMMU is
> > +	 * enabled and either:
> > +	 * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
> > +	 * or
> > +	 * ACPI IVRS table has DMA_REMAP bitset
> >  	 */
> >  	return sprintf(buf, "%d\n",
> > -		       iommu_present(&pci_bus_type) &&
> dmar_platform_optin());
> > +		       iommu_present(&pci_bus_type) &&
> > +		       (dmar_platform_optin() || amd_ivrs_remap_support()));
> 
> Yikes.  No, the thunderbot code does not have any business poking into
> either dmar_platform_optin or amd_ivrs_remap_support.  This needs
> a proper abstration from the IOMMU code.

To make sure I follow your ask - it's to make a new generic iommu function
That would check dmar/ivrs, and switch out thunderbolt domain.c to use the
symbol?

I'm happy to rework that if that is what you want.
Do you have a preferred proposed function name for that?
Robin Murphy March 15, 2022, 6:07 p.m. UTC | #3
On 2022-03-15 16:54, Limonciello, Mario via iommu wrote:
> [Public]
> 
> 
>> On Tue, Mar 15, 2022 at 11:24:55AM -0500, Mario Limonciello wrote:
>>> -	 * handled natively using IOMMU. It is enabled when IOMMU is
>>> -	 * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN set.
>>> +	 * handled natively using IOMMU. It is enabled when the IOMMU is
>>> +	 * enabled and either:
>>> +	 * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
>>> +	 * or
>>> +	 * ACPI IVRS table has DMA_REMAP bitset
>>>   	 */
>>>   	return sprintf(buf, "%d\n",
>>> -		       iommu_present(&pci_bus_type) &&
>> dmar_platform_optin());
>>> +		       iommu_present(&pci_bus_type) &&
>>> +		       (dmar_platform_optin() || amd_ivrs_remap_support()));
>>
>> Yikes.  No, the thunderbot code does not have any business poking into
>> either dmar_platform_optin or amd_ivrs_remap_support.  This needs
>> a proper abstration from the IOMMU code.
> 
> To make sure I follow your ask - it's to make a new generic iommu function
> That would check dmar/ivrs, and switch out thunderbolt domain.c to use the
> symbol?
> 
> I'm happy to rework that if that is what you want.
> Do you have a preferred proposed function name for that?

But why? Either IOMMU translation is enabled or it isn't, and if it is, 
what's to gain from guessing at *why* it might have been? And even if 
the IOMMU's firmware table did tell the IOMMU driver to enable the 
IOMMU, why should that be Thunderbolt's business?

Furthermore, looking at patch #1 I can only conclude that this is 
entirely meaningless anyway. AFAICS it's literally reporting whether the 
firmware flag was set or not. Not whether it's actually been honoured 
and the IOMMU is enforcing any kind of DMA protection at all. Even on 
Intel where the flag does at least have some effect on the IOMMU driver, 
that can still be overridden.

I already have a patch refactoring this to get rid of iommu_present(), 
but at the time I wasn't looking to closely at what it's trying to *do* 
with the information. If it's supposed to accurately reflect whether the 
Thunderbolt device is subject to IOMMU translation and not bypassed, I 
can fix that too (and unexport dmar_platform_optin() in the process...)

Robin.
Mario Limonciello March 15, 2022, 6:36 p.m. UTC | #4
+ Christian Kellner (Bolt userspace maintainer)

On 3/15/2022 13:07, Robin Murphy wrote:
> On 2022-03-15 16:54, Limonciello, Mario via iommu wrote:
>> [Public]
>>
>>
>>> On Tue, Mar 15, 2022 at 11:24:55AM -0500, Mario Limonciello wrote:
>>>> -     * handled natively using IOMMU. It is enabled when IOMMU is
>>>> -     * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN set.
>>>> +     * handled natively using IOMMU. It is enabled when the IOMMU is
>>>> +     * enabled and either:
>>>> +     * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
>>>> +     * or
>>>> +     * ACPI IVRS table has DMA_REMAP bitset
>>>>        */
>>>>       return sprintf(buf, "%d\n",
>>>> -               iommu_present(&pci_bus_type) &&
>>> dmar_platform_optin());
>>>> +               iommu_present(&pci_bus_type) &&
>>>> +               (dmar_platform_optin() || amd_ivrs_remap_support()));
>>>
>>> Yikes.  No, the thunderbot code does not have any business poking into
>>> either dmar_platform_optin or amd_ivrs_remap_support.  This needs
>>> a proper abstration from the IOMMU code.
>>
>> To make sure I follow your ask - it's to make a new generic iommu 
>> function
>> That would check dmar/ivrs, and switch out thunderbolt domain.c to use 
>> the
>> symbol?
>>
>> I'm happy to rework that if that is what you want.
>> Do you have a preferred proposed function name for that?
> 
> But why? Either IOMMU translation is enabled or it isn't, and if it is, 
> what's to gain from guessing at *why* it might have been? And even if 
> the IOMMU's firmware table did tell the IOMMU driver to enable the 
> IOMMU, why should that be Thunderbolt's business?
A lot of this comes from baggage from early Thunderbolt 3 implementation 
on systems with ICM (Intel's FW CM). On those systems there was a 
concept of "Security Levels".  This meant that downstream PCIe devices 
were not automatically authorized when a TBT3 device was plugged in.  In 
those cases there was no guarantee that the IOMMU was in use and so the 
security was passed on to the user to make a decision.

In Linux this was accomplished using the 'authorized' attribute in 
/sys/bus/thunderbolt/devices/$NUM/authorized.  When this was set to 1 
then the TBT3 device and PCIe topology behind it would be enumerated.

Further documentation explaining how this works is available here:
https://www.kernel.org/doc/html/latest/admin-guide/thunderbolt.html#security-levels-and-how-to-use-them

(Intel based) Platforms from 2018+ w/ TBT3 started to use the IOMMU 
consistently at runtime but had this existing implementation of security 
levels to worry about.  Furthermore tunnels could be created pre-boot, 
and so the thunderbolt driver may or may not re-create them based on policy.

So a new attribute was created "iommu_dma_protection" that userspace 
could use as part of a policy decision to automatically authorize 
devices.  Exporting this attribute is very similar to what Microsoft 
does to let the user see the security of the system.

https://docs.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-kernel-dma-protection

In Linux today some userspace software "bolt" has a policy included by
default that will automatically authorize TBT3 and USB4 (w/ PCIe) 
devices when iommu_dma_protection is set to 1.

> 
> Furthermore, looking at patch #1 I can only conclude that this is 
> entirely meaningless anyway. AFAICS it's literally reporting whether the 
> firmware flag was set or not. Not whether it's actually been honoured 
> and the IOMMU is enforcing any kind of DMA protection at all. Even on 
> Intel where the flag does at least have some effect on the IOMMU driver, 
> that can still be overridden.

Take a look at the Microsoft link I shared above.  They also make policy
decisions based on the information in these tables.

> 
> I already have a patch refactoring this to get rid of iommu_present(), 
> but at the time I wasn't looking to closely at what it's trying to *do* 
> with the information. If it's supposed to accurately reflect whether the 
> Thunderbolt device is subject to IOMMU translation and not bypassed, I 
> can fix that too (and unexport dmar_platform_optin() in the process...)
> 
> Robin.

This patch series stems from that history.  To give the best experience 
to end users you want hotplugged devices to be automatically authorized 
when software says it's safe to do so.

To summarize the flow:
* User plugs in device
* USB4 CM will query supported tunnels
* USB4 CM will create devices in /sys/bus/thunderbolt/devices for new 
plugged in TBT3/USB4 device
* "authorized" attribute will default to "0" and PCIe tunnels are not 
created
* Userspace gets a uevent that the device was added
* Userspace (bolt) reacts by reading 
/sys/bus/thunderbolt/devices/domainX/iommu_dma_protection
* If that is set to "1", bolt will write "1" to "authorized"  and USB4 
CM will create PCIe tunnels
* If that is set to "0", bolt will send an event to GUI to show a popup 
asking to authorize the device
* After user acks the authorization then it will write "1" to 
"authorized" and USB4 CM will create PCIe tunnels


Mika,

I wonder if maybe what we really want is to only use that flow for the 
authorized attribute when using TBT3 + ICM (or IOMMU disabled at 
runtime).  If we're using a USB4 host, check IOMMU translation layer 
active like Robin suggested and then automatically authorize from the CM.

Thoughts?
Mario Limonciello March 15, 2022, 9:34 p.m. UTC | #5
[Public]



> -----Original Message-----
> From: Limonciello, Mario
> Sent: Tuesday, March 15, 2022 13:36
> To: Robin Murphy <robin.murphy@arm.com>; Christoph Hellwig
> <hch@infradead.org>; christian@kellner.me; Mika Westerberg
> <mika.westerberg@linux.intel.com>
> Cc: Michael Jamet <michael.jamet@intel.com>; open list:THUNDERBOLT
> DRIVER <linux-usb@vger.kernel.org>; open list <linux-
> kernel@vger.kernel.org>; Yehezkel Bernat <YehezkelShB@gmail.com>;
> open list:AMD IOMMU (AMD-VI) <iommu@lists.linux-foundation.org>;
> Andreas Noever <andreas.noever@gmail.com>; Will Deacon
> <will@kernel.org>
> Subject: Re: [PATCH 2/2] thunderbolt: Use pre-boot DMA protection on AMD
> systems
> 
> + Christian Kellner (Bolt userspace maintainer)
> 
> On 3/15/2022 13:07, Robin Murphy wrote:
> > On 2022-03-15 16:54, Limonciello, Mario via iommu wrote:
> >> [Public]
> >>
> >>
> >>> On Tue, Mar 15, 2022 at 11:24:55AM -0500, Mario Limonciello wrote:
> >>>> -     * handled natively using IOMMU. It is enabled when IOMMU is
> >>>> -     * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN
> set.
> >>>> +     * handled natively using IOMMU. It is enabled when the IOMMU is
> >>>> +     * enabled and either:
> >>>> +     * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
> >>>> +     * or
> >>>> +     * ACPI IVRS table has DMA_REMAP bitset
> >>>>        */
> >>>>       return sprintf(buf, "%d\n",
> >>>> -               iommu_present(&pci_bus_type) &&
> >>> dmar_platform_optin());
> >>>> +               iommu_present(&pci_bus_type) &&
> >>>> +               (dmar_platform_optin() || amd_ivrs_remap_support()));
> >>>
> >>> Yikes.  No, the thunderbot code does not have any business poking into
> >>> either dmar_platform_optin or amd_ivrs_remap_support.  This needs
> >>> a proper abstration from the IOMMU code.
> >>
> >> To make sure I follow your ask - it's to make a new generic iommu
> >> function
> >> That would check dmar/ivrs, and switch out thunderbolt domain.c to use
> >> the
> >> symbol?
> >>
> >> I'm happy to rework that if that is what you want.
> >> Do you have a preferred proposed function name for that?
> >
> > But why? Either IOMMU translation is enabled or it isn't, and if it is,
> > what's to gain from guessing at *why* it might have been? And even if
> > the IOMMU's firmware table did tell the IOMMU driver to enable the
> > IOMMU, why should that be Thunderbolt's business?
> A lot of this comes from baggage from early Thunderbolt 3 implementation
> on systems with ICM (Intel's FW CM). On those systems there was a
> concept of "Security Levels".  This meant that downstream PCIe devices
> were not automatically authorized when a TBT3 device was plugged in.  In
> those cases there was no guarantee that the IOMMU was in use and so the
> security was passed on to the user to make a decision.
> 
> In Linux this was accomplished using the 'authorized' attribute in
> /sys/bus/thunderbolt/devices/$NUM/authorized.  When this was set to 1
> then the TBT3 device and PCIe topology behind it would be enumerated.
> 
> Further documentation explaining how this works is available here:
> https://www.kernel.org/doc/html/latest/admin-
> guide/thunderbolt.html#security-levels-and-how-to-use-them
> 
> (Intel based) Platforms from 2018+ w/ TBT3 started to use the IOMMU
> consistently at runtime but had this existing implementation of security
> levels to worry about.  Furthermore tunnels could be created pre-boot,
> and so the thunderbolt driver may or may not re-create them based on
> policy.
> 
> So a new attribute was created "iommu_dma_protection" that userspace
> could use as part of a policy decision to automatically authorize
> devices.  Exporting this attribute is very similar to what Microsoft
> does to let the user see the security of the system.
> 
> https://docs.microsoft.com/en-us/windows-hardware/design/device-
> experiences/oem-kernel-dma-protection
> 
> In Linux today some userspace software "bolt" has a policy included by
> default that will automatically authorize TBT3 and USB4 (w/ PCIe)
> devices when iommu_dma_protection is set to 1.
> 
> >
> > Furthermore, looking at patch #1 I can only conclude that this is
> > entirely meaningless anyway. AFAICS it's literally reporting whether the
> > firmware flag was set or not. Not whether it's actually been honoured
> > and the IOMMU is enforcing any kind of DMA protection at all. Even on
> > Intel where the flag does at least have some effect on the IOMMU driver,
> > that can still be overridden.
> 
> Take a look at the Microsoft link I shared above.  They also make policy
> decisions based on the information in these tables.
> 
> >
> > I already have a patch refactoring this to get rid of iommu_present(),
> > but at the time I wasn't looking to closely at what it's trying to *do*
> > with the information. If it's supposed to accurately reflect whether the
> > Thunderbolt device is subject to IOMMU translation and not bypassed, I
> > can fix that too (and unexport dmar_platform_optin() in the process...)
> >
> > Robin.
> 
> This patch series stems from that history.  To give the best experience
> to end users you want hotplugged devices to be automatically authorized
> when software says it's safe to do so.
> 
> To summarize the flow:
> * User plugs in device
> * USB4 CM will query supported tunnels
> * USB4 CM will create devices in /sys/bus/thunderbolt/devices for new
> plugged in TBT3/USB4 device
> * "authorized" attribute will default to "0" and PCIe tunnels are not
> created
> * Userspace gets a uevent that the device was added
> * Userspace (bolt) reacts by reading
> /sys/bus/thunderbolt/devices/domainX/iommu_dma_protection
> * If that is set to "1", bolt will write "1" to "authorized"  and USB4
> CM will create PCIe tunnels
> * If that is set to "0", bolt will send an event to GUI to show a popup
> asking to authorize the device
> * After user acks the authorization then it will write "1" to
> "authorized" and USB4 CM will create PCIe tunnels
> 
> 
> Mika,
> 
> I wonder if maybe what we really want is to only use that flow for the
> authorized attribute when using TBT3 + ICM (or IOMMU disabled at
> runtime).  If we're using a USB4 host, check IOMMU translation layer
> active like Robin suggested and then automatically authorize from the CM.
> 
> Thoughts?
> 
>
 
I put an RFC together with what this idea looks like, comments welcome.
https://lore.kernel.org/linux-usb/20220315213008.5357-1-mario.limonciello@amd.com/T/#u
Robin Murphy March 15, 2022, 10:04 p.m. UTC | #6
On 2022-03-15 18:36, Limonciello, Mario wrote:
> + Christian Kellner (Bolt userspace maintainer)
> 
> On 3/15/2022 13:07, Robin Murphy wrote:
>> On 2022-03-15 16:54, Limonciello, Mario via iommu wrote:
>>> [Public]
>>>
>>>
>>>> On Tue, Mar 15, 2022 at 11:24:55AM -0500, Mario Limonciello wrote:
>>>>> -     * handled natively using IOMMU. It is enabled when IOMMU is
>>>>> -     * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN set.
>>>>> +     * handled natively using IOMMU. It is enabled when the IOMMU is
>>>>> +     * enabled and either:
>>>>> +     * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
>>>>> +     * or
>>>>> +     * ACPI IVRS table has DMA_REMAP bitset
>>>>>        */
>>>>>       return sprintf(buf, "%d\n",
>>>>> -               iommu_present(&pci_bus_type) &&
>>>> dmar_platform_optin());
>>>>> +               iommu_present(&pci_bus_type) &&
>>>>> +               (dmar_platform_optin() || amd_ivrs_remap_support()));
>>>>
>>>> Yikes.  No, the thunderbot code does not have any business poking into
>>>> either dmar_platform_optin or amd_ivrs_remap_support.  This needs
>>>> a proper abstration from the IOMMU code.
>>>
>>> To make sure I follow your ask - it's to make a new generic iommu 
>>> function
>>> That would check dmar/ivrs, and switch out thunderbolt domain.c to 
>>> use the
>>> symbol?
>>>
>>> I'm happy to rework that if that is what you want.
>>> Do you have a preferred proposed function name for that?
>>
>> But why? Either IOMMU translation is enabled or it isn't, and if it 
>> is, what's to gain from guessing at *why* it might have been? And even 
>> if the IOMMU's firmware table did tell the IOMMU driver to enable the 
>> IOMMU, why should that be Thunderbolt's business?
> A lot of this comes from baggage from early Thunderbolt 3 implementation 
> on systems with ICM (Intel's FW CM). On those systems there was a 
> concept of "Security Levels".  This meant that downstream PCIe devices 
> were not automatically authorized when a TBT3 device was plugged in.  In 
> those cases there was no guarantee that the IOMMU was in use and so the 
> security was passed on to the user to make a decision.
> 
> In Linux this was accomplished using the 'authorized' attribute in 
> /sys/bus/thunderbolt/devices/$NUM/authorized.  When this was set to 1 
> then the TBT3 device and PCIe topology behind it would be enumerated.
> 
> Further documentation explaining how this works is available here:
> https://www.kernel.org/doc/html/latest/admin-guide/thunderbolt.html#security-levels-and-how-to-use-them 
> 
> 
> (Intel based) Platforms from 2018+ w/ TBT3 started to use the IOMMU 
> consistently at runtime but had this existing implementation of security 
> levels to worry about.  Furthermore tunnels could be created pre-boot, 
> and so the thunderbolt driver may or may not re-create them based on 
> policy.
> 
> So a new attribute was created "iommu_dma_protection" that userspace 
> could use as part of a policy decision to automatically authorize 
> devices.  Exporting this attribute is very similar to what Microsoft 
> does to let the user see the security of the system.
> 
> https://docs.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-kernel-dma-protection 
> 
> 
> In Linux today some userspace software "bolt" has a policy included by
> default that will automatically authorize TBT3 and USB4 (w/ PCIe) 
> devices when iommu_dma_protection is set to 1.
> 
>>
>> Furthermore, looking at patch #1 I can only conclude that this is 
>> entirely meaningless anyway. AFAICS it's literally reporting whether 
>> the firmware flag was set or not. Not whether it's actually been 
>> honoured and the IOMMU is enforcing any kind of DMA protection at all. 
>> Even on Intel where the flag does at least have some effect on the 
>> IOMMU driver, that can still be overridden.
> 
> Take a look at the Microsoft link I shared above.  They also make policy
> decisions based on the information in these tables.
> 
>>
>> I already have a patch refactoring this to get rid of iommu_present(), 
>> but at the time I wasn't looking to closely at what it's trying to 
>> *do* with the information. If it's supposed to accurately reflect 
>> whether the Thunderbolt device is subject to IOMMU translation and not 
>> bypassed, I can fix that too (and unexport dmar_platform_optin() in 
>> the process...)
>>
>> Robin.
> 
> This patch series stems from that history.  To give the best experience 
> to end users you want hotplugged devices to be automatically authorized 
> when software says it's safe to do so.
> 
> To summarize the flow:
> * User plugs in device
> * USB4 CM will query supported tunnels
> * USB4 CM will create devices in /sys/bus/thunderbolt/devices for new 
> plugged in TBT3/USB4 device
> * "authorized" attribute will default to "0" and PCIe tunnels are not 
> created
> * Userspace gets a uevent that the device was added
> * Userspace (bolt) reacts by reading 
> /sys/bus/thunderbolt/devices/domainX/iommu_dma_protection
> * If that is set to "1", bolt will write "1" to "authorized"  and USB4 
> CM will create PCIe tunnels
> * If that is set to "0", bolt will send an event to GUI to show a popup 
> asking to authorize the device
> * After user acks the authorization then it will write "1" to 
> "authorized" and USB4 CM will create PCIe tunnels
> 
> 
> Mika,
> 
> I wonder if maybe what we really want is to only use that flow for the 
> authorized attribute when using TBT3 + ICM (or IOMMU disabled at 
> runtime).  If we're using a USB4 host, check IOMMU translation layer 
> active like Robin suggested and then automatically authorize from the CM.

Thanks for the explanation. I don't think there's anything wrong with 
that flow per se - fundamentally, whether it's relayed through userspace 
or done automagically inside the kernel doesn't change the end result - 
but it does seem to confirm my suspicion that even now it's not actually 
working as intended and may end up letting devices be authorised in 
circumstances that they probably shouldn't be.

It's absolutely fine for Thunderbolt to care about whether a device 
currently has IOMMU translation enabled (and to expose that to userspace 
in its own way if it wants to), but that's generic IOMMU API stuff, no 
firmware-poking required :)

Tomorrow I'll rework the patch out of my iommu_present() cleanup stack 
to do the right thing, and share it.

Cheers,
Robin.
kernel test robot March 16, 2022, 1:32 a.m. UTC | #7
Hi Mario,

Thank you for the patch! Yet something to improve:

[auto build test ERROR on joro-iommu/next]
[also build test ERROR on arm-perf/for-next/perf linus/master v5.17-rc8 next-20220315]
[If your patch is applied to the wrong git tree, kindly drop us a note.
And when submitting patch, we suggest to use '--base' as documented in
https://git-scm.com/docs/git-format-patch]

url:    https://github.com/0day-ci/linux/commits/Mario-Limonciello/iommu-amd-Add-support-to-indicate-whether-DMA-remap-support-is-enabled/20220316-002821
base:   https://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu.git next
config: x86_64-randconfig-a012 (https://download.01.org/0day-ci/archive/20220316/202203160904.VB4alCdg-lkp@intel.com/config)
compiler: clang version 15.0.0 (https://github.com/llvm/llvm-project a6b2f50fb47da3baeee10b1906da6e30ac5d26ec)
reproduce (this is a W=1 build):
        wget https://raw.githubusercontent.com/intel/lkp-tests/master/sbin/make.cross -O ~/bin/make.cross
        chmod +x ~/bin/make.cross
        # https://github.com/0day-ci/linux/commit/9b0b7079d348c607cba7af4c87eaae1a79e52d91
        git remote add linux-review https://github.com/0day-ci/linux
        git fetch --no-tags linux-review Mario-Limonciello/iommu-amd-Add-support-to-indicate-whether-DMA-remap-support-is-enabled/20220316-002821
        git checkout 9b0b7079d348c607cba7af4c87eaae1a79e52d91
        # save the config file to linux build tree
        mkdir build_dir
        COMPILER_INSTALL_PATH=$HOME/0day COMPILER=clang make.cross W=1 O=build_dir ARCH=x86_64 SHELL=/bin/bash drivers/

If you fix the issue, kindly add following tag as appropriate
Reported-by: kernel test robot <lkp@intel.com>

All errors (new ones prefixed by >>):

   In file included from drivers/thunderbolt/domain.c:9:
>> include/linux/amd-iommu.h:159:52: error: use of undeclared identifier 'ENODEV'
   static inline int amd_iommu_detect(void) { return -ENODEV; }
                                                      ^
   1 error generated.


vim +/ENODEV +159 include/linux/amd-iommu.h

6a9401a7ac13e6 arch/x86/include/asm/amd_iommu.h Joerg Roedel          2009-11-20  158  
480125ba49ba62 arch/x86/include/asm/amd_iommu.h Konrad Rzeszutek Wilk 2010-08-26 @159  static inline int amd_iommu_detect(void) { return -ENODEV; }
6a9401a7ac13e6 arch/x86/include/asm/amd_iommu.h Joerg Roedel          2009-11-20  160  

---
0-DAY CI Kernel Test Service
https://lists.01.org/hyperkitty/list/kbuild-all@lists.01.org
Mika Westerberg March 16, 2022, 9:50 a.m. UTC | #8
Hi,

On Tue, Mar 15, 2022 at 01:36:11PM -0500, Limonciello, Mario wrote:
> + Christian Kellner (Bolt userspace maintainer)
> 
> On 3/15/2022 13:07, Robin Murphy wrote:
> > On 2022-03-15 16:54, Limonciello, Mario via iommu wrote:
> > > [Public]
> > > 
> > > 
> > > > On Tue, Mar 15, 2022 at 11:24:55AM -0500, Mario Limonciello wrote:
> > > > > -     * handled natively using IOMMU. It is enabled when IOMMU is
> > > > > -     * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN set.
> > > > > +     * handled natively using IOMMU. It is enabled when the IOMMU is
> > > > > +     * enabled and either:
> > > > > +     * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
> > > > > +     * or
> > > > > +     * ACPI IVRS table has DMA_REMAP bitset
> > > > >        */
> > > > >       return sprintf(buf, "%d\n",
> > > > > -               iommu_present(&pci_bus_type) &&
> > > > dmar_platform_optin());
> > > > > +               iommu_present(&pci_bus_type) &&
> > > > > +               (dmar_platform_optin() || amd_ivrs_remap_support()));
> > > > 
> > > > Yikes.  No, the thunderbot code does not have any business poking into
> > > > either dmar_platform_optin or amd_ivrs_remap_support.  This needs
> > > > a proper abstration from the IOMMU code.

I agree. When it was originally added it was only the DMAR (Intel) based
platforms that provided this hint so adding an abstraction for that did
not make much sense. Now, since we are seeing more and more USB4 host
controllers and many of them support PCIe tunneling (and IOMMU) adding
an API makes more sense.

> > > 
> > > To make sure I follow your ask - it's to make a new generic iommu
> > > function
> > > That would check dmar/ivrs, and switch out thunderbolt domain.c to
> > > use the
> > > symbol?
> > > 
> > > I'm happy to rework that if that is what you want.
> > > Do you have a preferred proposed function name for that?
> > 
> > But why? Either IOMMU translation is enabled or it isn't, and if it is,
> > what's to gain from guessing at *why* it might have been? And even if
> > the IOMMU's firmware table did tell the IOMMU driver to enable the
> > IOMMU, why should that be Thunderbolt's business?
> A lot of this comes from baggage from early Thunderbolt 3 implementation on
> systems with ICM (Intel's FW CM). On those systems there was a concept of
> "Security Levels".  This meant that downstream PCIe devices were not
> automatically authorized when a TBT3 device was plugged in.  In those cases
> there was no guarantee that the IOMMU was in use and so the security was
> passed on to the user to make a decision.
> 
> In Linux this was accomplished using the 'authorized' attribute in
> /sys/bus/thunderbolt/devices/$NUM/authorized.  When this was set to 1 then
> the TBT3 device and PCIe topology behind it would be enumerated.
> 
> Further documentation explaining how this works is available here:
> https://www.kernel.org/doc/html/latest/admin-guide/thunderbolt.html#security-levels-and-how-to-use-them
> 
> (Intel based) Platforms from 2018+ w/ TBT3 started to use the IOMMU
> consistently at runtime but had this existing implementation of security
> levels to worry about.  Furthermore tunnels could be created pre-boot, and
> so the thunderbolt driver may or may not re-create them based on policy.
> 
> So a new attribute was created "iommu_dma_protection" that userspace could
> use as part of a policy decision to automatically authorize devices.
> Exporting this attribute is very similar to what Microsoft does to let the
> user see the security of the system.
> 
> https://docs.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-kernel-dma-protection
> 
> In Linux today some userspace software "bolt" has a policy included by
> default that will automatically authorize TBT3 and USB4 (w/ PCIe) devices
> when iommu_dma_protection is set to 1.
> 
> > 
> > Furthermore, looking at patch #1 I can only conclude that this is
> > entirely meaningless anyway. AFAICS it's literally reporting whether the
> > firmware flag was set or not. Not whether it's actually been honoured
> > and the IOMMU is enforcing any kind of DMA protection at all. Even on
> > Intel where the flag does at least have some effect on the IOMMU driver,
> > that can still be overridden.
> 
> Take a look at the Microsoft link I shared above.  They also make policy
> decisions based on the information in these tables.
> 
> > 
> > I already have a patch refactoring this to get rid of iommu_present(),
> > but at the time I wasn't looking to closely at what it's trying to *do*
> > with the information. If it's supposed to accurately reflect whether the
> > Thunderbolt device is subject to IOMMU translation and not bypassed, I
> > can fix that too (and unexport dmar_platform_optin() in the process...)
> > 
> > Robin.
> 
> This patch series stems from that history.  To give the best experience to
> end users you want hotplugged devices to be automatically authorized when
> software says it's safe to do so.
> 
> To summarize the flow:
> * User plugs in device
> * USB4 CM will query supported tunnels
> * USB4 CM will create devices in /sys/bus/thunderbolt/devices for new
> plugged in TBT3/USB4 device
> * "authorized" attribute will default to "0" and PCIe tunnels are not
> created
> * Userspace gets a uevent that the device was added
> * Userspace (bolt) reacts by reading
> /sys/bus/thunderbolt/devices/domainX/iommu_dma_protection
> * If that is set to "1", bolt will write "1" to "authorized"  and USB4 CM
> will create PCIe tunnels
> * If that is set to "0", bolt will send an event to GUI to show a popup
> asking to authorize the device
> * After user acks the authorization then it will write "1" to "authorized"
> and USB4 CM will create PCIe tunnels
> 
> 
> Mika,
> 
> I wonder if maybe what we really want is to only use that flow for the
> authorized attribute when using TBT3 + ICM (or IOMMU disabled at runtime).
> If we're using a USB4 host, check IOMMU translation layer active like Robin
> suggested and then automatically authorize from the CM.

I would still leave that policy to userspace to decide.
diff mbox series

Patch

diff --git a/drivers/thunderbolt/domain.c b/drivers/thunderbolt/domain.c
index 7018d959f775..e03790735c12 100644
--- a/drivers/thunderbolt/domain.c
+++ b/drivers/thunderbolt/domain.c
@@ -6,6 +6,7 @@ 
  * Author: Mika Westerberg <mika.westerberg@linux.intel.com>
  */
 
+#include <linux/amd-iommu.h>
 #include <linux/device.h>
 #include <linux/dmar.h>
 #include <linux/idr.h>
@@ -259,11 +260,15 @@  static ssize_t iommu_dma_protection_show(struct device *dev,
 {
 	/*
 	 * Kernel DMA protection is a feature where Thunderbolt security is
-	 * handled natively using IOMMU. It is enabled when IOMMU is
-	 * enabled and ACPI DMAR table has DMAR_PLATFORM_OPT_IN set.
+	 * handled natively using IOMMU. It is enabled when the IOMMU is
+	 * enabled and either:
+	 * ACPI DMAR table has DMAR_PLATFORM_OPT_IN set
+	 * or
+	 * ACPI IVRS table has DMA_REMAP bitset
 	 */
 	return sprintf(buf, "%d\n",
-		       iommu_present(&pci_bus_type) && dmar_platform_optin());
+		       iommu_present(&pci_bus_type) &&
+		       (dmar_platform_optin() || amd_ivrs_remap_support()));
 }
 static DEVICE_ATTR_RO(iommu_dma_protection);