[RFC,v2,06/18] cxl/port: Add Dynamic Capacity size support to endpoint decoders

To support Dynamic Capacity Devices (DCD) endpoint decoders will need to
map DC Regions (partitions).  Part of this is assigning the size of the
DC Region DPA to the decoder in addition to any skip value from the
previous decoder which exists.  This must be done within a continuous
DPA space.  Two complications arise with Dynamic Capacity regions which
did not exist with Ram and PMEM partitions.  First, gaps in the DPA
space can exist between and around the DC Regions.  Second, the Linux
resource tree does not allow a resource to be marked across existing
nodes within a tree.

For clarity, below is an example of an 60GB device with 10GB of RAM,
10GB of PMEM and 10GB for each of 2 DC Regions.  The desired CXL mapping
is 5GB of RAM, 5GB of PMEM, and all 10GB of DC1.

     DPA RANGE
     (dpa_res)
0GB        10GB       20GB       30GB       40GB       50GB       60GB
|----------|----------|----------|----------|----------|----------|

RAM         PMEM                  DC0                   DC1
 (ram_res)  (pmem_res)            (dc_res[0])           (dc_res[1])
|----------|----------|   <gap>  |----------|   <gap>  |----------|

 RAM        PMEM                                        DC1
|XXXXX|----|XXXXX|----|----------|----------|----------|XXXXXXXXXX|
0GB   5GB  10GB  15GB 20GB       30GB       40GB       50GB       60GB

The previous skip resource between RAM and PMEM was always a child of
the RAM resource and fit nicely (see X below).  Because of this
simplicity this skip resource reference was not stored in any CXL state.
On release the skip range could be calculated based on the endpoint
decoders stored values.

Now when DC1 is being mapped 4 skip resources must be created as
children.  One of the PMEM resource (A), two of the parent DPA resource
(B,D), and one more child of the DC0 resource (C).

0GB        10GB       20GB       30GB       40GB       50GB       60GB
|----------|----------|----------|----------|----------|----------|
                           |                     |
|----------|----------|    |     |----------|    |     |----------|
        |          |       |          |          |
       (X)        (A)     (B)        (C)        (D)
	v          v       v          v          v
|XXXXX|----|XXXXX|----|----------|----------|----------|XXXXXXXXXX|
       skip       skip  skip        skip      skip

Expand the calculation of DPA freespace and enhance the logic to support
mapping/unmapping DC DPA space.  To track the potential of multiple skip
resources an xarray is attached to the endpoint decoder.  The existing
algorithm is consolidated with the new one to store a single skip
resource in the same way as multiple skip resources.

Co-developed-by: Navneet Singh <navneet.singh@intel.com>
Signed-off-by: Navneet Singh <navneet.singh@intel.com>
Signed-off-by: Ira Weiny <ira.weiny@intel.com>

---
An alternative of using reserve_region_with_split() was considered.
The advantage of that would be keeping all the resource information
stored solely in the resource tree rather than having separate
references to them.  However, it would best be implemented with a call
such as release_split_region() [name TBD?] which could find all the leaf
resources in the range and release them.  Furthermore, it is not clear
if reserve_region_with_split() is really intended for anything outside
of init code.  In the end this algorithm seems straight forward enough.

Changes for v2:
[iweiny: write commit message]
[iweiny: remove unneeded changes]
[iweiny: split from region creation patch]
[iweiny: Alter skip algorithm to use 'anonymous regions']
[iweiny: enhance debug messages]
[iweiny: consolidate skip resource creation]
[iweiny: ensure xa_destroy() is called]
[iweiny: consolidate region requests further]
[iweiny: ensure resource is released on xa_insert]
---
 drivers/cxl/core/hdm.c  | 188 +++++++++++++++++++++++++++++++++++++++++++-----
 drivers/cxl/core/port.c |   2 +
 drivers/cxl/cxl.h       |   2 +
 3 files changed, 176 insertions(+), 16 deletions(-)

Message ID	20230604-dcd-type2-upstream-v2-6-f740c47e7916@intel.com
State	New, archived
Headers	show Return-Path: <linux-cxl-owner@vger.kernel.org> From: Ira Weiny <ira.weiny@intel.com> Date: Mon, 28 Aug 2023 22:20:57 -0700 Subject: [PATCH RFC v2 06/18] cxl/port: Add Dynamic Capacity size support to endpoint decoders MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit Message-Id: <20230604-dcd-type2-upstream-v2-6-f740c47e7916@intel.com> References: <20230604-dcd-type2-upstream-v2-0-f740c47e7916@intel.com> In-Reply-To: <20230604-dcd-type2-upstream-v2-0-f740c47e7916@intel.com> To: Dan Williams <dan.j.williams@intel.com> Cc: Navneet Singh <navneet.singh@intel.com>, Fan Ni <fan.ni@samsung.com>, Jonathan Cameron <Jonathan.Cameron@huawei.com>, Davidlohr Bueso <dave@stgolabs.net>, Dave Jiang <dave.jiang@intel.com>, Alison Schofield <alison.schofield@intel.com>, Vishal Verma <vishal.l.verma@intel.com>, Ira Weiny <ira.weiny@intel.com>, linux-cxl@vger.kernel.org, linux-kernel@vger.kernel.org Precedence: bulk
Series	DCD: Add support for Dynamic Capacity Devices (DCD) \| expand [RFC,v2,00/18] DCD: Add support for Dynamic Capacity Devices (DCD) [RFC,v2,01/18] cxl/hdm: Debug, use decoder name function [RFC,v2,02/18] cxl/mbox: Flag support for Dynamic Capacity Devices (DCD) [RFC,v2,03/18] cxl/mem: Read Dynamic capacity configuration from the device [RFC,v2,04/18] cxl/region: Add Dynamic Capacity decoder and region modes [RFC,v2,05/18] cxl/port: Add Dynamic Capacity mode support to endpoint decoders [RFC,v2,06/18] cxl/port: Add Dynamic Capacity size support to endpoint decoders [RFC,v2,07/18] cxl/mem: Expose device dynamic capacity configuration [RFC,v2,08/18] cxl/region: Add Dynamic Capacity CXL region support [RFC,v2,09/18] cxl/mem: Read extents on memory device discovery [RFC,v2,10/18] cxl/mem: Handle DCD add and release capacity events. [RFC,v2,11/18] cxl/region: Expose DC extents on region driver load [RFC,v2,12/18] cxl/region: Notify regions of DC changes [RFC,v2,13/18] dax/bus: Factor out dev dax resize logic [RFC,v2,14/18] dax/region: Support DAX device creation on dynamic DAX regions [RFC,v2,15/18] cxl/mem: Trace Dynamic capacity Event Record [RFC,v2,16/18] tools/testing/cxl: Make event logs dynamic [RFC,v2,17/18] tools/testing/cxl: Add DC Regions to mock mem data [RFC,v2,18/18] tools/testing/cxl: Add Dynamic Capacity events

[RFC,v2,06/18] cxl/port: Add Dynamic Capacity size support to endpoint decoders

Commit Message

Comments

Patch