Message ID | 168357886214.2756219.978696616095404572.stgit@djiang5-mobl3 |
---|---|
State | Superseded |
Headers | show |
Series | cxl: Add support for QTG ID retrieval for CXL subsystem | expand |
On Mon, 08 May 2023 13:47:42 -0700 Dave Jiang <dave.jiang@intel.com> wrote: > CXL Memory Device SW Guide rev1.0 2.11.2 provides instruction on how to > caluclate latency and bandwidth for CXL memory device. Calculate minimum Spell check. Also, a link for SW guide probably good to add. > bandwidth and total latency for the path from the CXL device to the root > port. The QTG id is retrieved by providing the performance data as input > and calling the root port callback ->get_qos_class(). The retrieved id is > stored with the cxl_port of the CXL device. > > For example for a device that is directly attached to a host bus: > Total Latency = Device Latency (from CDAT) + Dev to Host Bus (HB) Link > Latency + Generic Port Latency > Min Bandwidth = Min bandwidth for link bandwidth between HB > and CXL device, device CDAT bandwidth, and Generic Port > Bandwidth > > For a device that has a switch in between host bus and CXL device: > Total Latency = Device (CDAT) Latency + Dev to Switch Link Latency + > Switch (CDAT) Latency + Switch to HB Link Latency + > Generic Port Latency > Min Bandwidth = Min bandwidth for link bandwidth between CXL device > to CXL switch, CXL device CDAT bandwidth, CXL switch CDAT > bandwidth, CXL switch to HB bandwidth, and Generic Port > Bandwidth. > > Signed-off-by: Dave Jiang <dave.jiang@intel.com> LGTM Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com> > > --- > v5: > - Use new API call cxl_endpoint_get_perf_coordinates(). > - Use root_port->get_qos_class() (Dan) > - Add endieness handling to DSM input. > --- > drivers/cxl/cxl.h | 1 + > drivers/cxl/port.c | 56 +++++++++++++++++++++++++++++++++++++++++++++++++--- > 2 files changed, 54 insertions(+), 3 deletions(-) > > diff --git a/drivers/cxl/cxl.h b/drivers/cxl/cxl.h > index 0c8952e568cc..7c94f07771c1 100644 > --- a/drivers/cxl/cxl.h > +++ b/drivers/cxl/cxl.h > @@ -833,6 +833,7 @@ struct dsmas_entry { > struct range dpa_range; > u8 handle; > struct access_coordinate coord; > + int qos_class; > }; > > #ifdef CONFIG_ACPI > diff --git a/drivers/cxl/port.c b/drivers/cxl/port.c > index c5a24b75bf03..b474997cc7ee 100644 > --- a/drivers/cxl/port.c > +++ b/drivers/cxl/port.c > @@ -67,6 +67,52 @@ static void dsmas_list_destroy(struct list_head *dsmas_list) > } > } > > +static int cxl_port_perf_data_calculate(struct cxl_port *port, > + struct list_head *dsmas_list) > +{ > + struct access_coordinate c; > + struct qtg_dsm_input input; > + struct cxl_port *root_port; > + struct cxl_root *cxl_root; > + struct dsmas_entry *dent; > + int rc, qos_class; > + > + rc = cxl_endpoint_get_perf_coordinates(port, &c); > + if (rc) { > + dev_dbg(&port->dev, "Failed to retrieve perf coordinates.\n"); > + return rc; > + } > + > + root_port = find_cxl_root(port); > + cxl_root = to_cxl_root(root_port); > + if (!cxl_root->ops && !cxl_root->ops->get_qos_class) > + return -EOPNOTSUPP; > + > + list_for_each_entry(dent, dsmas_list, list) { > + dent->coord.read_latency = dent->coord.read_latency + > + c.read_latency; > + dent->coord.write_latency = dent->coord.write_latency + > + c.write_latency; > + dent->coord.read_bandwidth = min_t(int, c.read_bandwidth, > + dent->coord.read_bandwidth); > + dent->coord.write_bandwidth = min_t(int, c.write_bandwidth, > + dent->coord.write_bandwidth); > + > + input.rd_lat = cpu_to_le32(dent->coord.read_latency); > + input.wr_lat = cpu_to_le32(dent->coord.write_latency); > + input.rd_bw = cpu_to_le32(dent->coord.read_bandwidth); > + input.wr_bw = cpu_to_le32(dent->coord.write_bandwidth); > + > + qos_class = cxl_root->ops->get_qos_class(root_port, &input); > + if (qos_class < 0) > + continue; > + > + dent->qos_class = qos_class; > + } > + > + return 0; > +} > + > static int cxl_switch_port_probe(struct cxl_port *port) > { > struct cxl_hdm *cxlhdm; > @@ -155,10 +201,14 @@ static int cxl_endpoint_port_probe(struct cxl_port *port) > LIST_HEAD(dsmas_list); > > rc = cxl_cdat_endpoint_process(port, &dsmas_list); > - if (rc < 0) > + if (rc < 0) { > dev_dbg(&port->dev, "Failed to parse CDAT: %d\n", rc); > - > - /* Performance data processing */ > + } else { > + rc = cxl_port_perf_data_calculate(port, &dsmas_list); > + if (rc) > + dev_dbg(&port->dev, > + "Failed to do perf coord calculations.\n"); > + } > > dsmas_list_destroy(&dsmas_list); > } > >
diff --git a/drivers/cxl/cxl.h b/drivers/cxl/cxl.h index 0c8952e568cc..7c94f07771c1 100644 --- a/drivers/cxl/cxl.h +++ b/drivers/cxl/cxl.h @@ -833,6 +833,7 @@ struct dsmas_entry { struct range dpa_range; u8 handle; struct access_coordinate coord; + int qos_class; }; #ifdef CONFIG_ACPI diff --git a/drivers/cxl/port.c b/drivers/cxl/port.c index c5a24b75bf03..b474997cc7ee 100644 --- a/drivers/cxl/port.c +++ b/drivers/cxl/port.c @@ -67,6 +67,52 @@ static void dsmas_list_destroy(struct list_head *dsmas_list) } } +static int cxl_port_perf_data_calculate(struct cxl_port *port, + struct list_head *dsmas_list) +{ + struct access_coordinate c; + struct qtg_dsm_input input; + struct cxl_port *root_port; + struct cxl_root *cxl_root; + struct dsmas_entry *dent; + int rc, qos_class; + + rc = cxl_endpoint_get_perf_coordinates(port, &c); + if (rc) { + dev_dbg(&port->dev, "Failed to retrieve perf coordinates.\n"); + return rc; + } + + root_port = find_cxl_root(port); + cxl_root = to_cxl_root(root_port); + if (!cxl_root->ops && !cxl_root->ops->get_qos_class) + return -EOPNOTSUPP; + + list_for_each_entry(dent, dsmas_list, list) { + dent->coord.read_latency = dent->coord.read_latency + + c.read_latency; + dent->coord.write_latency = dent->coord.write_latency + + c.write_latency; + dent->coord.read_bandwidth = min_t(int, c.read_bandwidth, + dent->coord.read_bandwidth); + dent->coord.write_bandwidth = min_t(int, c.write_bandwidth, + dent->coord.write_bandwidth); + + input.rd_lat = cpu_to_le32(dent->coord.read_latency); + input.wr_lat = cpu_to_le32(dent->coord.write_latency); + input.rd_bw = cpu_to_le32(dent->coord.read_bandwidth); + input.wr_bw = cpu_to_le32(dent->coord.write_bandwidth); + + qos_class = cxl_root->ops->get_qos_class(root_port, &input); + if (qos_class < 0) + continue; + + dent->qos_class = qos_class; + } + + return 0; +} + static int cxl_switch_port_probe(struct cxl_port *port) { struct cxl_hdm *cxlhdm; @@ -155,10 +201,14 @@ static int cxl_endpoint_port_probe(struct cxl_port *port) LIST_HEAD(dsmas_list); rc = cxl_cdat_endpoint_process(port, &dsmas_list); - if (rc < 0) + if (rc < 0) { dev_dbg(&port->dev, "Failed to parse CDAT: %d\n", rc); - - /* Performance data processing */ + } else { + rc = cxl_port_perf_data_calculate(port, &dsmas_list); + if (rc) + dev_dbg(&port->dev, + "Failed to do perf coord calculations.\n"); + } dsmas_list_destroy(&dsmas_list); }
CXL Memory Device SW Guide rev1.0 2.11.2 provides instruction on how to caluclate latency and bandwidth for CXL memory device. Calculate minimum bandwidth and total latency for the path from the CXL device to the root port. The QTG id is retrieved by providing the performance data as input and calling the root port callback ->get_qos_class(). The retrieved id is stored with the cxl_port of the CXL device. For example for a device that is directly attached to a host bus: Total Latency = Device Latency (from CDAT) + Dev to Host Bus (HB) Link Latency + Generic Port Latency Min Bandwidth = Min bandwidth for link bandwidth between HB and CXL device, device CDAT bandwidth, and Generic Port Bandwidth For a device that has a switch in between host bus and CXL device: Total Latency = Device (CDAT) Latency + Dev to Switch Link Latency + Switch (CDAT) Latency + Switch to HB Link Latency + Generic Port Latency Min Bandwidth = Min bandwidth for link bandwidth between CXL device to CXL switch, CXL device CDAT bandwidth, CXL switch CDAT bandwidth, CXL switch to HB bandwidth, and Generic Port Bandwidth. Signed-off-by: Dave Jiang <dave.jiang@intel.com> --- v5: - Use new API call cxl_endpoint_get_perf_coordinates(). - Use root_port->get_qos_class() (Dan) - Add endieness handling to DSM input. --- drivers/cxl/cxl.h | 1 + drivers/cxl/port.c | 56 +++++++++++++++++++++++++++++++++++++++++++++++++--- 2 files changed, 54 insertions(+), 3 deletions(-)