Message ID | 20210617173655.430424-4-ben.widawsky@intel.com |
---|---|
State | New, archived |
Headers | show |
Series | Region creation | expand |
On Thu, 17 Jun 2021 10:36:52 -0700 Ben Widawsky <ben.widawsky@intel.com> wrote: > The region creation APIs leave a region unconfigured. Configuring the > region will work in the same way as similar subsystems such as devdax. > Sysfs attrs will be provided to allow userspace to configure the region. > Finally once all configuration is complete, userspace may "commit" the > config. What the kernel decides to do after a config is committed is out > of scope at this point. > > Introduced here are the most basic attributes needed to configure a > region. > > A x1 interleave example is provided below: > > decoder1.0 > ├── create_region > ├── delete_region > ├── devtype > ├── locked > ├── region1.0:0 > │ ├── offset > │ ├── size > │ ├── subsystem -> ../../../../../../../bus/cxl > │ ├── target0 > │ ├── uevent > │ ├── uuid > │ └── verify > ├── size > ├── start > ├── subsystem -> ../../../../../../bus/cxl > ├── target_list > ├── target_type > └── uevent > > Signed-off-by: Ben Widawsky <ben.widawsky@intel.com> Hi Ben, I think some of the sysfs interface needs more detailed docs and consideration about consistency. If an interface isn't fairly obvious without reading the docs (but understanding what it is trying to do) then it's normally a bad sign... Jonathan > --- > Documentation/ABI/testing/sysfs-bus-cxl | 32 ++++ > drivers/cxl/region.c | 223 ++++++++++++++++++++++++ > 2 files changed, 255 insertions(+) > > diff --git a/Documentation/ABI/testing/sysfs-bus-cxl b/Documentation/ABI/testing/sysfs-bus-cxl > index 115a25d2899d..70f9d09385a4 100644 > --- a/Documentation/ABI/testing/sysfs-bus-cxl > +++ b/Documentation/ABI/testing/sysfs-bus-cxl > @@ -148,3 +148,35 @@ Description: > Deletes the named region. A region must be unbound from the > region driver before being deleted. The attributes expects a > region in the form "regionX.Y:Z". > + > +What: /sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/offset > +Date: June, 2021 > +KernelVersion: v5.14 > +Contact: linux-cxl@vger.kernel.org > +Description: > + (RO) A region resides within an address space that is claimed by > + a decoder. Region space allocation is handled by the driver, but > + the offset may be read by userspace tooling in order to > + determine fragmentation, and available size for new regions. Side note rather than abotu this patch. Feels to me like the format definition of these docs should include something on permissions. The (RO) bit is fine but makes the docs less machine readable. > + > +What: > +/sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/{size,uuid,target[0-15]} > +Date: June, 2021 > +KernelVersion: v5.14 > +Contact: linux-cxl@vger.kernel.org > +Description: > + (RW) Configuring regions requires a minimal set of parameters in > + order for the subsequent bind operation to succeed. The > + following parameters are defined: > + > + == ======================================================== > + size Manadatory. Phsyical address space the region will Spell check. > + consume. > + uuid Optional. A unique identifier for the region. If none is > + selected, the kernel will create one. > + target Mandatory. Memory devices are the backing storage for a > + region. There will be N targets based on the number of > + interleave ways that the top level decoder is configured > + for. Each target must be set with a memdev device ie. > + 'mem1'. Document what happens if not set. 'nil' is not obvious. > + == ======================================================== The automation looking at these files is increasing, so I wonder if that will cause some issues with having these all documented in one block. I don't see a huge disadvantage in breaking this into 3 entries. Alternatively wait for someone to scream and refactor it then. > diff --git a/drivers/cxl/region.c b/drivers/cxl/region.c > index 391467e864a2..cf7fd3027419 100644 > --- a/drivers/cxl/region.c > +++ b/drivers/cxl/region.c > @@ -3,7 +3,9 @@ > #include <linux/io-64-nonatomic-lo-hi.h> > #include <linux/device.h> > #include <linux/module.h> > +#include <linux/sizes.h> > #include <linux/slab.h> > +#include <linux/uuid.h> > #include <linux/idr.h> > #include "region.h" > #include "cxl.h" > @@ -21,16 +23,237 @@ > * relationship between decoder and region when the region is interleaved. > */ > > +static struct cxl_region *to_cxl_region(struct device *dev); > + > +static bool is_region_active(struct cxl_region *region) > +{ > + /* TODO: Regions can't be activated yet. */ > + return false; > +} > + > +static ssize_t offset_show(struct device *dev, struct device_attribute *attr, > + char *buf) > +{ > + struct cxl_region *region = to_cxl_region(dev); > + struct cxl_decoder *cxld = to_cxl_decoder(dev->parent); > + > + if (!region->res) > + return sysfs_emit(buf, "\n"); > + > + return sysfs_emit(buf, "%#llx\n", cxld->range.start - region->res->start); > +} > +static DEVICE_ATTR_RO(offset); > + > +static ssize_t size_show(struct device *dev, struct device_attribute *attr, > + char *buf) > +{ > + struct cxl_region *region = to_cxl_region(dev); > + > + if (!region->res) > + return sysfs_emit(buf, "*%llu\n", region->requested_size); > + > + return sysfs_emit(buf, "%llu\n", resource_size(region->res)); > +} > + > +static ssize_t size_store(struct device *dev, struct device_attribute *attr, > + const char *buf, size_t len) > +{ > + struct cxl_region *region = to_cxl_region(dev); > + unsigned long long val; > + ssize_t rc; > + > + rc = kstrtoull(buf, 0, &val); > + if (rc) > + return rc; > + > + device_lock(®ion->dev); > + if (is_region_active(region)) { > + rc = -EBUSY; > + } else { > + region->requested_size = val; > + } > + device_unlock(®ion->dev); > + > + return len; > +} > +static DEVICE_ATTR_RW(size); > + > +static ssize_t uuid_show(struct device *dev, struct device_attribute *attr, > + char *buf) > +{ > + struct cxl_region *region = to_cxl_region(dev); > + > + return sysfs_emit(buf, "%pUb\n", ®ion->uuid); > +} > +static ssize_t uuid_store(struct device *dev, struct device_attribute *attr, > + const char *buf, size_t len) > +{ > + struct cxl_region *region = to_cxl_region(dev); > + ssize_t rc; > + > + if (len != UUID_STRING_LEN + 1) > + return -EINVAL; > + > + device_lock(®ion->dev); > + if (is_region_active(region)) { > + rc = -EBUSY; > + } else { > + rc = uuid_parse(buf, ®ion->uuid); > + } > + device_unlock(®ion->dev); > + > + return rc ? rc : len; > +} > +static DEVICE_ATTR_RW(uuid); > + > +static struct attribute *region_attrs[] = { > + &dev_attr_offset.attr, > + &dev_attr_size.attr, > + &dev_attr_uuid.attr, > + NULL, > +}; > + > +static const struct attribute_group region_group = { > + .attrs = region_attrs, > +}; > + > +static size_t show_targetN(struct cxl_region *region, char *buf, int n) > +{ > + int ret; > + > + device_lock(®ion->dev); > + if (!region->targets[n]) > + ret = sysfs_emit(buf, "%s\n", dev_name(®ion->targets[n]->dev)); > + else > + ret = sysfs_emit(buf, "nil\n"); > + device_unlock(®ion->dev); > + > + return ret; > +} > + > +static size_t set_targetN(struct cxl_region *region, const char *buf, int n, size_t len) > +{ > + struct device *memdev_dev; > + struct cxl_memdev *cxlmd; > + ssize_t rc; > + int val; > + > + device_lock(®ion->dev); > + > + rc = kstrtoint(buf, 0, &val); > + /* Special 'remote' target operation */ remote? > + if (!rc && val == 0) { I'm not sure on logic here. This seems to be if the value is valid and 0 we do something different? Why is 0 special? It's not documented as such and the thing returns nil, not 0. > + cxlmd = region->targets[n]; > + if (cxlmd) > + put_device(&cxlmd->dev); > + region->targets[n] = NULL; > + device_unlock(®ion->dev); > + return len; > + } > + > + memdev_dev = bus_find_device_by_name(&cxl_bus_type, NULL, buf); > + if (!memdev_dev) > + return -ENOENT; > + > + cxlmd = to_cxl_memdev(memdev_dev); > + get_device(&cxlmd->dev); > + region->targets[n] = cxlmd; > + > + device_unlock(®ion->dev); > + > + return len; > +} > + > +#define TARGET_ATTR_RW(n) \ > + static ssize_t target##n##_show( \ > + struct device *dev, struct device_attribute *attr, char *buf) \ > + { \ > + return show_targetN(to_cxl_region(dev), buf, n); \ > + } \ > + static ssize_t target##n##_store(struct device *dev, \ > + struct device_attribute *attr, \ > + const char *buf, size_t len) \ > + { \ > + return set_targetN(to_cxl_region(dev), buf, n, len); \ > + } \ > + static DEVICE_ATTR_RW(target##n) > + > +TARGET_ATTR_RW(0); > +TARGET_ATTR_RW(1); > +TARGET_ATTR_RW(2); > +TARGET_ATTR_RW(3); > +TARGET_ATTR_RW(4); > +TARGET_ATTR_RW(5); > +TARGET_ATTR_RW(6); > +TARGET_ATTR_RW(7); > +TARGET_ATTR_RW(8); > +TARGET_ATTR_RW(9); > +TARGET_ATTR_RW(10); > +TARGET_ATTR_RW(11); > +TARGET_ATTR_RW(12); > +TARGET_ATTR_RW(13); > +TARGET_ATTR_RW(14); > +TARGET_ATTR_RW(15); > + > +static struct attribute *interleave_attrs[] = { > + &dev_attr_target0.attr, > + &dev_attr_target1.attr, > + &dev_attr_target2.attr, > + &dev_attr_target3.attr, > + &dev_attr_target4.attr, > + &dev_attr_target5.attr, > + &dev_attr_target6.attr, > + &dev_attr_target7.attr, > + &dev_attr_target8.attr, > + &dev_attr_target9.attr, > + &dev_attr_target10.attr, > + &dev_attr_target11.attr, > + &dev_attr_target12.attr, > + &dev_attr_target13.attr, > + &dev_attr_target14.attr, > + &dev_attr_target15.attr, > + NULL, > +}; > + > +static umode_t visible_targets(struct kobject *kobj, struct attribute *a, int n) > +{ > + struct device *dev = container_of(kobj, struct device, kobj); > + struct cxl_region *region = to_cxl_region(dev); > + > + if (n < region->eniw) > + return a->mode; > + return 0; > +} > + > +static const struct attribute_group region_interleave_group = { > + .attrs = interleave_attrs, > + .is_visible = visible_targets, > +}; > + > +static const struct attribute_group *region_groups[] = { > + ®ion_group, > + ®ion_interleave_group, > + NULL, > +}; > + > static void cxl_region_release(struct device *dev); > > const struct device_type cxl_region_type = { > .name = "cxl_region", > .release = cxl_region_release, > + .groups = region_groups > }; > > void cxl_free_region(struct cxl_decoder *cxld, struct cxl_region *region) > { > + int i; > + > ida_free(&cxld->region_ida, region->id); > + for (i = 0; i < region->eniw; i++) { > + if (region->targets[i]) > + put_device(®ion->targets[i]->dev); > + } > + > kfree(region); > } >
On 21-06-18 12:22:37, Jonathan Cameron wrote: > On Thu, 17 Jun 2021 10:36:52 -0700 > Ben Widawsky <ben.widawsky@intel.com> wrote: > > > The region creation APIs leave a region unconfigured. Configuring the > > region will work in the same way as similar subsystems such as devdax. > > Sysfs attrs will be provided to allow userspace to configure the region. > > Finally once all configuration is complete, userspace may "commit" the > > config. What the kernel decides to do after a config is committed is out > > of scope at this point. > > > > Introduced here are the most basic attributes needed to configure a > > region. > > > > A x1 interleave example is provided below: > > > > decoder1.0 > > ├── create_region > > ├── delete_region > > ├── devtype > > ├── locked > > ├── region1.0:0 > > │ ├── offset > > │ ├── size > > │ ├── subsystem -> ../../../../../../../bus/cxl > > │ ├── target0 > > │ ├── uevent > > │ ├── uuid > > │ └── verify > > ├── size > > ├── start > > ├── subsystem -> ../../../../../../bus/cxl > > ├── target_list > > ├── target_type > > └── uevent > > > > Signed-off-by: Ben Widawsky <ben.widawsky@intel.com> > > Hi Ben, > > I think some of the sysfs interface needs more detailed docs > and consideration about consistency. > > If an interface isn't fairly obvious without reading the docs > (but understanding what it is trying to do) then it's normally > a bad sign... > > Jonathan > My measuring stick has been: the interface should be fairly obvious if you're fairly familiar with the spec. I think we've satisfied that here, but often times the creator of something can be blind to effects of the creation. I can certainly add more descriptive text to the ABI doc, but if you have suggestions on how to make the sysfs attrs more descriptive, that'd be great. > > --- > > Documentation/ABI/testing/sysfs-bus-cxl | 32 ++++ > > drivers/cxl/region.c | 223 ++++++++++++++++++++++++ > > 2 files changed, 255 insertions(+) > > > > diff --git a/Documentation/ABI/testing/sysfs-bus-cxl b/Documentation/ABI/testing/sysfs-bus-cxl > > index 115a25d2899d..70f9d09385a4 100644 > > --- a/Documentation/ABI/testing/sysfs-bus-cxl > > +++ b/Documentation/ABI/testing/sysfs-bus-cxl > > @@ -148,3 +148,35 @@ Description: > > Deletes the named region. A region must be unbound from the > > region driver before being deleted. The attributes expects a > > region in the form "regionX.Y:Z". > > + > > +What: /sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/offset > > +Date: June, 2021 > > +KernelVersion: v5.14 > > +Contact: linux-cxl@vger.kernel.org > > +Description: > > + (RO) A region resides within an address space that is claimed by > > + a decoder. Region space allocation is handled by the driver, but > > + the offset may be read by userspace tooling in order to > > + determine fragmentation, and available size for new regions. > > Side note rather than abotu this patch. Feels to me like the format definition > of these docs should include something on permissions. The (RO) bit is fine > but makes the docs less machine readable. Any examples of something you consider ideal? > > > + > > +What: > > +/sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/{size,uuid,target[0-15]} > > +Date: June, 2021 > > +KernelVersion: v5.14 > > +Contact: linux-cxl@vger.kernel.org > > +Description: > > + (RW) Configuring regions requires a minimal set of parameters in > > + order for the subsequent bind operation to succeed. The > > + following parameters are defined: > > + > > + == ======================================================== > > + size Manadatory. Phsyical address space the region will > > Spell check. > > > + consume. > > + uuid Optional. A unique identifier for the region. If none is > > + selected, the kernel will create one. > > + target Mandatory. Memory devices are the backing storage for a > > + region. There will be N targets based on the number of > > + interleave ways that the top level decoder is configured > > + for. Each target must be set with a memdev device ie. > > + 'mem1'. > > Document what happens if not set. 'nil' is not obvious. > > > + == ======================================================== > > The automation looking at these files is increasing, so I wonder if that will > cause some issues with having these all documented in one block. > > I don't see a huge disadvantage in breaking this into 3 entries. Alternatively > wait for someone to scream and refactor it then. Okay. > > > diff --git a/drivers/cxl/region.c b/drivers/cxl/region.c > > index 391467e864a2..cf7fd3027419 100644 > > --- a/drivers/cxl/region.c > > +++ b/drivers/cxl/region.c > > @@ -3,7 +3,9 @@ > > #include <linux/io-64-nonatomic-lo-hi.h> > > #include <linux/device.h> > > #include <linux/module.h> > > +#include <linux/sizes.h> > > #include <linux/slab.h> > > +#include <linux/uuid.h> > > #include <linux/idr.h> > > #include "region.h" > > #include "cxl.h" > > @@ -21,16 +23,237 @@ > > * relationship between decoder and region when the region is interleaved. > > */ > > > > +static struct cxl_region *to_cxl_region(struct device *dev); > > + > > +static bool is_region_active(struct cxl_region *region) > > +{ > > + /* TODO: Regions can't be activated yet. */ > > + return false; > > +} > > + > > +static ssize_t offset_show(struct device *dev, struct device_attribute *attr, > > + char *buf) > > +{ > > + struct cxl_region *region = to_cxl_region(dev); > > + struct cxl_decoder *cxld = to_cxl_decoder(dev->parent); > > + > > + if (!region->res) > > + return sysfs_emit(buf, "\n"); > > + > > + return sysfs_emit(buf, "%#llx\n", cxld->range.start - region->res->start); > > +} > > +static DEVICE_ATTR_RO(offset); > > + > > +static ssize_t size_show(struct device *dev, struct device_attribute *attr, > > + char *buf) > > +{ > > + struct cxl_region *region = to_cxl_region(dev); > > + > > + if (!region->res) > > + return sysfs_emit(buf, "*%llu\n", region->requested_size); > > + > > + return sysfs_emit(buf, "%llu\n", resource_size(region->res)); > > +} > > + > > +static ssize_t size_store(struct device *dev, struct device_attribute *attr, > > + const char *buf, size_t len) > > +{ > > + struct cxl_region *region = to_cxl_region(dev); > > + unsigned long long val; > > + ssize_t rc; > > + > > + rc = kstrtoull(buf, 0, &val); > > + if (rc) > > + return rc; > > + > > + device_lock(®ion->dev); > > + if (is_region_active(region)) { > > + rc = -EBUSY; > > + } else { > > + region->requested_size = val; > > + } > > + device_unlock(®ion->dev); > > + > > + return len; > > +} > > +static DEVICE_ATTR_RW(size); > > + > > +static ssize_t uuid_show(struct device *dev, struct device_attribute *attr, > > + char *buf) > > +{ > > + struct cxl_region *region = to_cxl_region(dev); > > + > > + return sysfs_emit(buf, "%pUb\n", ®ion->uuid); > > +} > > +static ssize_t uuid_store(struct device *dev, struct device_attribute *attr, > > + const char *buf, size_t len) > > +{ > > + struct cxl_region *region = to_cxl_region(dev); > > + ssize_t rc; > > + > > + if (len != UUID_STRING_LEN + 1) > > + return -EINVAL; > > + > > + device_lock(®ion->dev); > > + if (is_region_active(region)) { > > + rc = -EBUSY; > > + } else { > > + rc = uuid_parse(buf, ®ion->uuid); > > + } > > + device_unlock(®ion->dev); > > + > > + return rc ? rc : len; > > +} > > +static DEVICE_ATTR_RW(uuid); > > + > > +static struct attribute *region_attrs[] = { > > + &dev_attr_offset.attr, > > + &dev_attr_size.attr, > > + &dev_attr_uuid.attr, > > + NULL, > > +}; > > + > > +static const struct attribute_group region_group = { > > + .attrs = region_attrs, > > +}; > > + > > +static size_t show_targetN(struct cxl_region *region, char *buf, int n) > > +{ > > + int ret; > > + > > + device_lock(®ion->dev); > > + if (!region->targets[n]) > > + ret = sysfs_emit(buf, "%s\n", dev_name(®ion->targets[n]->dev)); > > + else > > + ret = sysfs_emit(buf, "nil\n"); > > + device_unlock(®ion->dev); > > + > > + return ret; > > +} > > + > > +static size_t set_targetN(struct cxl_region *region, const char *buf, int n, size_t len) > > +{ > > + struct device *memdev_dev; > > + struct cxl_memdev *cxlmd; > > + ssize_t rc; > > + int val; > > + > > + device_lock(®ion->dev); > > + > > + rc = kstrtoint(buf, 0, &val); > > + /* Special 'remote' target operation */ > > remote? > remove > > + if (!rc && val == 0) { > > I'm not sure on logic here. This seems to be if the value is valid and 0 > we do something different? Why is 0 special? It's not documented as such > and the thing returns nil, not 0. I changed it from the RFC to return empty string (or I meant to at least). I should make it take the empty string as well. > > > + cxlmd = region->targets[n]; > > + if (cxlmd) > > + put_device(&cxlmd->dev); > > + region->targets[n] = NULL; > > + device_unlock(®ion->dev); > > + return len; > > + } > > + > > + memdev_dev = bus_find_device_by_name(&cxl_bus_type, NULL, buf); > > + if (!memdev_dev) > > + return -ENOENT; > > + > > + cxlmd = to_cxl_memdev(memdev_dev); > > + get_device(&cxlmd->dev); > > + region->targets[n] = cxlmd; > > + > > + device_unlock(®ion->dev); > > + > > + return len; > > +} > > + > > +#define TARGET_ATTR_RW(n) \ > > + static ssize_t target##n##_show( \ > > + struct device *dev, struct device_attribute *attr, char *buf) \ > > + { \ > > + return show_targetN(to_cxl_region(dev), buf, n); \ > > + } \ > > + static ssize_t target##n##_store(struct device *dev, \ > > + struct device_attribute *attr, \ > > + const char *buf, size_t len) \ > > + { \ > > + return set_targetN(to_cxl_region(dev), buf, n, len); \ > > + } \ > > + static DEVICE_ATTR_RW(target##n) > > + > > +TARGET_ATTR_RW(0); > > +TARGET_ATTR_RW(1); > > +TARGET_ATTR_RW(2); > > +TARGET_ATTR_RW(3); > > +TARGET_ATTR_RW(4); > > +TARGET_ATTR_RW(5); > > +TARGET_ATTR_RW(6); > > +TARGET_ATTR_RW(7); > > +TARGET_ATTR_RW(8); > > +TARGET_ATTR_RW(9); > > +TARGET_ATTR_RW(10); > > +TARGET_ATTR_RW(11); > > +TARGET_ATTR_RW(12); > > +TARGET_ATTR_RW(13); > > +TARGET_ATTR_RW(14); > > +TARGET_ATTR_RW(15); > > + > > +static struct attribute *interleave_attrs[] = { > > + &dev_attr_target0.attr, > > + &dev_attr_target1.attr, > > + &dev_attr_target2.attr, > > + &dev_attr_target3.attr, > > + &dev_attr_target4.attr, > > + &dev_attr_target5.attr, > > + &dev_attr_target6.attr, > > + &dev_attr_target7.attr, > > + &dev_attr_target8.attr, > > + &dev_attr_target9.attr, > > + &dev_attr_target10.attr, > > + &dev_attr_target11.attr, > > + &dev_attr_target12.attr, > > + &dev_attr_target13.attr, > > + &dev_attr_target14.attr, > > + &dev_attr_target15.attr, > > + NULL, > > +}; > > + > > +static umode_t visible_targets(struct kobject *kobj, struct attribute *a, int n) > > +{ > > + struct device *dev = container_of(kobj, struct device, kobj); > > + struct cxl_region *region = to_cxl_region(dev); > > + > > + if (n < region->eniw) > > + return a->mode; > > + return 0; > > +} > > + > > +static const struct attribute_group region_interleave_group = { > > + .attrs = interleave_attrs, > > + .is_visible = visible_targets, > > +}; > > + > > +static const struct attribute_group *region_groups[] = { > > + ®ion_group, > > + ®ion_interleave_group, > > + NULL, > > +}; > > + > > static void cxl_region_release(struct device *dev); > > > > const struct device_type cxl_region_type = { > > .name = "cxl_region", > > .release = cxl_region_release, > > + .groups = region_groups > > }; > > > > void cxl_free_region(struct cxl_decoder *cxld, struct cxl_region *region) > > { > > + int i; > > + > > ida_free(&cxld->region_ida, region->id); > > + for (i = 0; i < region->eniw; i++) { > > + if (region->targets[i]) > > + put_device(®ion->targets[i]->dev); > > + } > > + > > kfree(region); > > } > > >
On Fri, 18 Jun 2021 08:25:51 -0700 Ben Widawsky <ben.widawsky@intel.com> wrote: > On 21-06-18 12:22:37, Jonathan Cameron wrote: > > On Thu, 17 Jun 2021 10:36:52 -0700 > > Ben Widawsky <ben.widawsky@intel.com> wrote: > > > > > The region creation APIs leave a region unconfigured. Configuring the > > > region will work in the same way as similar subsystems such as devdax. > > > Sysfs attrs will be provided to allow userspace to configure the region. > > > Finally once all configuration is complete, userspace may "commit" the > > > config. What the kernel decides to do after a config is committed is out > > > of scope at this point. > > > > > > Introduced here are the most basic attributes needed to configure a > > > region. > > > > > > A x1 interleave example is provided below: > > > > > > decoder1.0 > > > ├── create_region > > > ├── delete_region > > > ├── devtype > > > ├── locked > > > ├── region1.0:0 > > > │ ├── offset > > > │ ├── size > > > │ ├── subsystem -> ../../../../../../../bus/cxl > > > │ ├── target0 > > > │ ├── uevent > > > │ ├── uuid > > > │ └── verify Does that exist any more? > > > ├── size > > > ├── start > > > ├── subsystem -> ../../../../../../bus/cxl > > > ├── target_list > > > ├── target_type > > > └── uevent > > > > > > Signed-off-by: Ben Widawsky <ben.widawsky@intel.com> > > > > Hi Ben, > > > > I think some of the sysfs interface needs more detailed docs > > and consideration about consistency. > > > > If an interface isn't fairly obvious without reading the docs > > (but understanding what it is trying to do) then it's normally > > a bad sign... > > > > Jonathan > > > > My measuring stick has been: the interface should be fairly obvious if you're > fairly familiar with the spec. I think we've satisfied that here, but often > times the creator of something can be blind to effects of the creation. > > I can certainly add more descriptive text to the ABI doc, but if you have > suggestions on how to make the sysfs attrs more descriptive, that'd be great. In this patch, mainly the whole 0 / nil etc. Empty string make sense to me when clearing as that reflects what is returned when it wasn't set in the first place. Attr names are fine for this one. > > > > --- > > > Documentation/ABI/testing/sysfs-bus-cxl | 32 ++++ > > > drivers/cxl/region.c | 223 ++++++++++++++++++++++++ > > > 2 files changed, 255 insertions(+) > > > > > > diff --git a/Documentation/ABI/testing/sysfs-bus-cxl b/Documentation/ABI/testing/sysfs-bus-cxl > > > index 115a25d2899d..70f9d09385a4 100644 > > > --- a/Documentation/ABI/testing/sysfs-bus-cxl > > > +++ b/Documentation/ABI/testing/sysfs-bus-cxl > > > @@ -148,3 +148,35 @@ Description: > > > Deletes the named region. A region must be unbound from the > > > region driver before being deleted. The attributes expects a > > > region in the form "regionX.Y:Z". > > > + > > > +What: /sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/offset > > > +Date: June, 2021 > > > +KernelVersion: v5.14 > > > +Contact: linux-cxl@vger.kernel.org > > > +Description: > > > + (RO) A region resides within an address space that is claimed by > > > + a decoder. Region space allocation is handled by the driver, but > > > + the offset may be read by userspace tooling in order to > > > + determine fragmentation, and available size for new regions. > > > > Side note rather than abotu this patch. Feels to me like the format definition > > of these docs should include something on permissions. The (RO) bit is fine > > but makes the docs less machine readable. > > Any examples of something you consider ideal? Nope :) Mauro is doing some work on automated check of this documentation. If that works gets anywhere near RO vs RW we can figure out how to improve the docs then to support it! > > > > > > + > > > +What: > > > +/sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/{size,uuid,target[0-15]} > > > +Date: June, 2021 > > > +KernelVersion: v5.14 > > > +Contact: linux-cxl@vger.kernel.org > > > +Description: > > > + (RW) Configuring regions requires a minimal set of parameters in > > > + order for the subsequent bind operation to succeed. The > > > + following parameters are defined: > > > + > > > + == ======================================================== > > > + size Manadatory. Phsyical address space the region will > > > > Spell check. > > > > > + consume. > > > + uuid Optional. A unique identifier for the region. If none is > > > + selected, the kernel will create one. > > > + target Mandatory. Memory devices are the backing storage for a > > > + region. There will be N targets based on the number of > > > + interleave ways that the top level decoder is configured > > > + for. Each target must be set with a memdev device ie. > > > + 'mem1'. > > > > Document what happens if not set. 'nil' is not obvious. > > > > > + == ======================================================== > > > > The automation looking at these files is increasing, so I wonder if that will > > cause some issues with having these all documented in one block. > > > > I don't see a huge disadvantage in breaking this into 3 entries. Alternatively > > wait for someone to scream and refactor it then. > > Okay. > > > > > > diff --git a/drivers/cxl/region.c b/drivers/cxl/region.c > > > index 391467e864a2..cf7fd3027419 100644 > > > --- a/drivers/cxl/region.c > > > +++ b/drivers/cxl/region.c > > > @@ -3,7 +3,9 @@ > > > #include <linux/io-64-nonatomic-lo-hi.h> > > > #include <linux/device.h> > > > #include <linux/module.h> > > > +#include <linux/sizes.h> > > > #include <linux/slab.h> > > > +#include <linux/uuid.h> > > > #include <linux/idr.h> > > > #include "region.h" > > > #include "cxl.h" > > > @@ -21,16 +23,237 @@ > > > * relationship between decoder and region when the region is interleaved. > > > */ > > > > > > +static struct cxl_region *to_cxl_region(struct device *dev); > > > + > > > +static bool is_region_active(struct cxl_region *region) > > > +{ > > > + /* TODO: Regions can't be activated yet. */ > > > + return false; > > > +} > > > + > > > +static ssize_t offset_show(struct device *dev, struct device_attribute *attr, > > > + char *buf) > > > +{ > > > + struct cxl_region *region = to_cxl_region(dev); > > > + struct cxl_decoder *cxld = to_cxl_decoder(dev->parent); > > > + > > > + if (!region->res) > > > + return sysfs_emit(buf, "\n"); > > > + > > > + return sysfs_emit(buf, "%#llx\n", cxld->range.start - region->res->start); > > > +} > > > +static DEVICE_ATTR_RO(offset); > > > + > > > +static ssize_t size_show(struct device *dev, struct device_attribute *attr, > > > + char *buf) > > > +{ > > > + struct cxl_region *region = to_cxl_region(dev); > > > + > > > + if (!region->res) > > > + return sysfs_emit(buf, "*%llu\n", region->requested_size); > > > + > > > + return sysfs_emit(buf, "%llu\n", resource_size(region->res)); > > > +} > > > + > > > +static ssize_t size_store(struct device *dev, struct device_attribute *attr, > > > + const char *buf, size_t len) > > > +{ > > > + struct cxl_region *region = to_cxl_region(dev); > > > + unsigned long long val; > > > + ssize_t rc; > > > + > > > + rc = kstrtoull(buf, 0, &val); > > > + if (rc) > > > + return rc; > > > + > > > + device_lock(®ion->dev); > > > + if (is_region_active(region)) { > > > + rc = -EBUSY; > > > + } else { > > > + region->requested_size = val; > > > + } > > > + device_unlock(®ion->dev); > > > + > > > + return len; > > > +} > > > +static DEVICE_ATTR_RW(size); > > > + > > > +static ssize_t uuid_show(struct device *dev, struct device_attribute *attr, > > > + char *buf) > > > +{ > > > + struct cxl_region *region = to_cxl_region(dev); > > > + > > > + return sysfs_emit(buf, "%pUb\n", ®ion->uuid); > > > +} > > > +static ssize_t uuid_store(struct device *dev, struct device_attribute *attr, > > > + const char *buf, size_t len) > > > +{ > > > + struct cxl_region *region = to_cxl_region(dev); > > > + ssize_t rc; > > > + > > > + if (len != UUID_STRING_LEN + 1) > > > + return -EINVAL; > > > + > > > + device_lock(®ion->dev); > > > + if (is_region_active(region)) { > > > + rc = -EBUSY; > > > + } else { > > > + rc = uuid_parse(buf, ®ion->uuid); > > > + } > > > + device_unlock(®ion->dev); > > > + > > > + return rc ? rc : len; > > > +} > > > +static DEVICE_ATTR_RW(uuid); > > > + > > > +static struct attribute *region_attrs[] = { > > > + &dev_attr_offset.attr, > > > + &dev_attr_size.attr, > > > + &dev_attr_uuid.attr, > > > + NULL, > > > +}; > > > + > > > +static const struct attribute_group region_group = { > > > + .attrs = region_attrs, > > > +}; > > > + > > > +static size_t show_targetN(struct cxl_region *region, char *buf, int n) > > > +{ > > > + int ret; > > > + > > > + device_lock(®ion->dev); > > > + if (!region->targets[n]) > > > + ret = sysfs_emit(buf, "%s\n", dev_name(®ion->targets[n]->dev)); > > > + else > > > + ret = sysfs_emit(buf, "nil\n"); > > > + device_unlock(®ion->dev); > > > + > > > + return ret; > > > +} > > > + > > > +static size_t set_targetN(struct cxl_region *region, const char *buf, int n, size_t len) > > > +{ > > > + struct device *memdev_dev; > > > + struct cxl_memdev *cxlmd; > > > + ssize_t rc; > > > + int val; > > > + > > > + device_lock(®ion->dev); > > > + > > > + rc = kstrtoint(buf, 0, &val); > > > + /* Special 'remote' target operation */ > > > > remote? > > > > remove > > > > + if (!rc && val == 0) { > > > > I'm not sure on logic here. This seems to be if the value is valid and 0 > > we do something different? Why is 0 special? It's not documented as such > > and the thing returns nil, not 0. > > I changed it from the RFC to return empty string (or I meant to at least). I > should make it take the empty string as well. That works better.
diff --git a/Documentation/ABI/testing/sysfs-bus-cxl b/Documentation/ABI/testing/sysfs-bus-cxl index 115a25d2899d..70f9d09385a4 100644 --- a/Documentation/ABI/testing/sysfs-bus-cxl +++ b/Documentation/ABI/testing/sysfs-bus-cxl @@ -148,3 +148,35 @@ Description: Deletes the named region. A region must be unbound from the region driver before being deleted. The attributes expects a region in the form "regionX.Y:Z". + +What: /sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/offset +Date: June, 2021 +KernelVersion: v5.14 +Contact: linux-cxl@vger.kernel.org +Description: + (RO) A region resides within an address space that is claimed by + a decoder. Region space allocation is handled by the driver, but + the offset may be read by userspace tooling in order to + determine fragmentation, and available size for new regions. + +What: +/sys/bus/cxl/devices/decoderX.Y/regionX.Y:Z/{size,uuid,target[0-15]} +Date: June, 2021 +KernelVersion: v5.14 +Contact: linux-cxl@vger.kernel.org +Description: + (RW) Configuring regions requires a minimal set of parameters in + order for the subsequent bind operation to succeed. The + following parameters are defined: + + == ======================================================== + size Manadatory. Phsyical address space the region will + consume. + uuid Optional. A unique identifier for the region. If none is + selected, the kernel will create one. + target Mandatory. Memory devices are the backing storage for a + region. There will be N targets based on the number of + interleave ways that the top level decoder is configured + for. Each target must be set with a memdev device ie. + 'mem1'. + == ======================================================== diff --git a/drivers/cxl/region.c b/drivers/cxl/region.c index 391467e864a2..cf7fd3027419 100644 --- a/drivers/cxl/region.c +++ b/drivers/cxl/region.c @@ -3,7 +3,9 @@ #include <linux/io-64-nonatomic-lo-hi.h> #include <linux/device.h> #include <linux/module.h> +#include <linux/sizes.h> #include <linux/slab.h> +#include <linux/uuid.h> #include <linux/idr.h> #include "region.h" #include "cxl.h" @@ -21,16 +23,237 @@ * relationship between decoder and region when the region is interleaved. */ +static struct cxl_region *to_cxl_region(struct device *dev); + +static bool is_region_active(struct cxl_region *region) +{ + /* TODO: Regions can't be activated yet. */ + return false; +} + +static ssize_t offset_show(struct device *dev, struct device_attribute *attr, + char *buf) +{ + struct cxl_region *region = to_cxl_region(dev); + struct cxl_decoder *cxld = to_cxl_decoder(dev->parent); + + if (!region->res) + return sysfs_emit(buf, "\n"); + + return sysfs_emit(buf, "%#llx\n", cxld->range.start - region->res->start); +} +static DEVICE_ATTR_RO(offset); + +static ssize_t size_show(struct device *dev, struct device_attribute *attr, + char *buf) +{ + struct cxl_region *region = to_cxl_region(dev); + + if (!region->res) + return sysfs_emit(buf, "*%llu\n", region->requested_size); + + return sysfs_emit(buf, "%llu\n", resource_size(region->res)); +} + +static ssize_t size_store(struct device *dev, struct device_attribute *attr, + const char *buf, size_t len) +{ + struct cxl_region *region = to_cxl_region(dev); + unsigned long long val; + ssize_t rc; + + rc = kstrtoull(buf, 0, &val); + if (rc) + return rc; + + device_lock(®ion->dev); + if (is_region_active(region)) { + rc = -EBUSY; + } else { + region->requested_size = val; + } + device_unlock(®ion->dev); + + return len; +} +static DEVICE_ATTR_RW(size); + +static ssize_t uuid_show(struct device *dev, struct device_attribute *attr, + char *buf) +{ + struct cxl_region *region = to_cxl_region(dev); + + return sysfs_emit(buf, "%pUb\n", ®ion->uuid); +} +static ssize_t uuid_store(struct device *dev, struct device_attribute *attr, + const char *buf, size_t len) +{ + struct cxl_region *region = to_cxl_region(dev); + ssize_t rc; + + if (len != UUID_STRING_LEN + 1) + return -EINVAL; + + device_lock(®ion->dev); + if (is_region_active(region)) { + rc = -EBUSY; + } else { + rc = uuid_parse(buf, ®ion->uuid); + } + device_unlock(®ion->dev); + + return rc ? rc : len; +} +static DEVICE_ATTR_RW(uuid); + +static struct attribute *region_attrs[] = { + &dev_attr_offset.attr, + &dev_attr_size.attr, + &dev_attr_uuid.attr, + NULL, +}; + +static const struct attribute_group region_group = { + .attrs = region_attrs, +}; + +static size_t show_targetN(struct cxl_region *region, char *buf, int n) +{ + int ret; + + device_lock(®ion->dev); + if (!region->targets[n]) + ret = sysfs_emit(buf, "%s\n", dev_name(®ion->targets[n]->dev)); + else + ret = sysfs_emit(buf, "nil\n"); + device_unlock(®ion->dev); + + return ret; +} + +static size_t set_targetN(struct cxl_region *region, const char *buf, int n, size_t len) +{ + struct device *memdev_dev; + struct cxl_memdev *cxlmd; + ssize_t rc; + int val; + + device_lock(®ion->dev); + + rc = kstrtoint(buf, 0, &val); + /* Special 'remote' target operation */ + if (!rc && val == 0) { + cxlmd = region->targets[n]; + if (cxlmd) + put_device(&cxlmd->dev); + region->targets[n] = NULL; + device_unlock(®ion->dev); + return len; + } + + memdev_dev = bus_find_device_by_name(&cxl_bus_type, NULL, buf); + if (!memdev_dev) + return -ENOENT; + + cxlmd = to_cxl_memdev(memdev_dev); + get_device(&cxlmd->dev); + region->targets[n] = cxlmd; + + device_unlock(®ion->dev); + + return len; +} + +#define TARGET_ATTR_RW(n) \ + static ssize_t target##n##_show( \ + struct device *dev, struct device_attribute *attr, char *buf) \ + { \ + return show_targetN(to_cxl_region(dev), buf, n); \ + } \ + static ssize_t target##n##_store(struct device *dev, \ + struct device_attribute *attr, \ + const char *buf, size_t len) \ + { \ + return set_targetN(to_cxl_region(dev), buf, n, len); \ + } \ + static DEVICE_ATTR_RW(target##n) + +TARGET_ATTR_RW(0); +TARGET_ATTR_RW(1); +TARGET_ATTR_RW(2); +TARGET_ATTR_RW(3); +TARGET_ATTR_RW(4); +TARGET_ATTR_RW(5); +TARGET_ATTR_RW(6); +TARGET_ATTR_RW(7); +TARGET_ATTR_RW(8); +TARGET_ATTR_RW(9); +TARGET_ATTR_RW(10); +TARGET_ATTR_RW(11); +TARGET_ATTR_RW(12); +TARGET_ATTR_RW(13); +TARGET_ATTR_RW(14); +TARGET_ATTR_RW(15); + +static struct attribute *interleave_attrs[] = { + &dev_attr_target0.attr, + &dev_attr_target1.attr, + &dev_attr_target2.attr, + &dev_attr_target3.attr, + &dev_attr_target4.attr, + &dev_attr_target5.attr, + &dev_attr_target6.attr, + &dev_attr_target7.attr, + &dev_attr_target8.attr, + &dev_attr_target9.attr, + &dev_attr_target10.attr, + &dev_attr_target11.attr, + &dev_attr_target12.attr, + &dev_attr_target13.attr, + &dev_attr_target14.attr, + &dev_attr_target15.attr, + NULL, +}; + +static umode_t visible_targets(struct kobject *kobj, struct attribute *a, int n) +{ + struct device *dev = container_of(kobj, struct device, kobj); + struct cxl_region *region = to_cxl_region(dev); + + if (n < region->eniw) + return a->mode; + return 0; +} + +static const struct attribute_group region_interleave_group = { + .attrs = interleave_attrs, + .is_visible = visible_targets, +}; + +static const struct attribute_group *region_groups[] = { + ®ion_group, + ®ion_interleave_group, + NULL, +}; + static void cxl_region_release(struct device *dev); const struct device_type cxl_region_type = { .name = "cxl_region", .release = cxl_region_release, + .groups = region_groups }; void cxl_free_region(struct cxl_decoder *cxld, struct cxl_region *region) { + int i; + ida_free(&cxld->region_ida, region->id); + for (i = 0; i < region->eniw; i++) { + if (region->targets[i]) + put_device(®ion->targets[i]->dev); + } + kfree(region); }
The region creation APIs leave a region unconfigured. Configuring the region will work in the same way as similar subsystems such as devdax. Sysfs attrs will be provided to allow userspace to configure the region. Finally once all configuration is complete, userspace may "commit" the config. What the kernel decides to do after a config is committed is out of scope at this point. Introduced here are the most basic attributes needed to configure a region. A x1 interleave example is provided below: decoder1.0 ├── create_region ├── delete_region ├── devtype ├── locked ├── region1.0:0 │ ├── offset │ ├── size │ ├── subsystem -> ../../../../../../../bus/cxl │ ├── target0 │ ├── uevent │ ├── uuid │ └── verify ├── size ├── start ├── subsystem -> ../../../../../../bus/cxl ├── target_list ├── target_type └── uevent Signed-off-by: Ben Widawsky <ben.widawsky@intel.com> --- Documentation/ABI/testing/sysfs-bus-cxl | 32 ++++ drivers/cxl/region.c | 223 ++++++++++++++++++++++++ 2 files changed, 255 insertions(+)