Message ID | 20220503150017.16148-1-mheyne@amazon.de (mailing list archive) |
---|---|
State | Awaiting Upstream |
Delegated to: | Netdev Maintainers |
Headers | show |
Series | [net-next,v2] drivers, ixgbe: show VF statistics | expand |
On Tue, 3 May 2022 15:00:17 +0000 Maximilian Heyne wrote: > + for (i = 0; i < adapter->num_vfs; i++) { > + ethtool_sprintf(&p, "VF %u Rx Packets", i); > + ethtool_sprintf(&p, "VF %u Rx Bytes", i); > + ethtool_sprintf(&p, "VF %u Tx Packets", i); > + ethtool_sprintf(&p, "VF %u Tx Bytes", i); > + ethtool_sprintf(&p, "VF %u MC Packets", i); > + } Please drop the ethtool stats. We've been trying to avoid duplicating the same stats in ethtool and iproute2 for a while now.
On 2022-05-04T20:16:32-07:00 Jakub Kicinski <kuba@kernel.org> wrote: > On Tue, 3 May 2022 15:00:17 +0000 Maximilian Heyne wrote: > > + for (i = 0; i < adapter->num_vfs; i++) { > > + ethtool_sprintf(&p, "VF %u Rx Packets", i); > > + ethtool_sprintf(&p, "VF %u Rx Bytes", i); > > + ethtool_sprintf(&p, "VF %u Tx Packets", i); > > + ethtool_sprintf(&p, "VF %u Tx Bytes", i); > > + ethtool_sprintf(&p, "VF %u MC Packets", i); > > + } > > Please drop the ethtool stats. We've been trying to avoid duplicating > the same stats in ethtool and iproute2 for a while now. > I can see the point that standard metrics should only be reported via the iproute2 interface. However, in this special case this patch was intended to converge the out-of-tree driver with the in-tree version. These missing stats were breaking our userspace. If we now switch solely to iproute2 based statistics both driver versions would diverge even more. So depending on where a user gets the ixgbe driver from, they have to work-around. I can change the patch as requested, but it will contradict the inital intention. At least Intel should then port this change to the external driver as well. Let's get a statement from them. Amazon Development Center Germany GmbH Krausenstr. 38 10117 Berlin Geschaeftsfuehrung: Christian Schlaeger, Jonathan Weiss Eingetragen am Amtsgericht Charlottenburg unter HRB 149173 B Sitz: Berlin Ust-ID: DE 289 237 879
On Fri, 6 May 2022 06:44:40 +0000 Maximilian Heyne wrote: > On 2022-05-04T20:16:32-07:00 Jakub Kicinski <kuba@kernel.org> wrote: > > > On Tue, 3 May 2022 15:00:17 +0000 Maximilian Heyne wrote: > > > + for (i = 0; i < adapter->num_vfs; i++) { > > > + ethtool_sprintf(&p, "VF %u Rx Packets", i); > > > + ethtool_sprintf(&p, "VF %u Rx Bytes", i); > > > + ethtool_sprintf(&p, "VF %u Tx Packets", i); > > > + ethtool_sprintf(&p, "VF %u Tx Bytes", i); > > > + ethtool_sprintf(&p, "VF %u MC Packets", i); > > > + } > > > > Please drop the ethtool stats. We've been trying to avoid duplicating > > the same stats in ethtool and iproute2 for a while now. > > I can see the point that standard metrics should only be reported via the > iproute2 interface. However, in this special case this patch was > intended to converge the out-of-tree driver with the in-tree version. > These missing stats were breaking our userspace. If we now switch > solely to iproute2 based statistics both driver versions would > diverge even more. So depending on where a user gets the ixgbe driver > from, they have to work-around. > > I can change the patch as requested, but it will contradict the inital > intention. At least Intel should then port this change to the > external driver as well. Let's get a statement from them. Ack, but we really want people to move towards using standard stats. Can you change the user space to first try reading the stats via iproute2/rtnetlink? And fallback to random ethtool strings if not available? That way it will work with any driver implementing the standard API. Long term that'll make everyone's life easier. Out-of-tree code cannot be an argument upstream, otherwise we'd completely lose control over our APIs. Vendors could ship whatever in their out of tree repo and then force us to accept it upstream. It's disappointing to see the vendor letting the uAPI of the out of tree driver diverge from upstream, especially a driver this mature.
On 5/6/2022 9:13 AM, Jakub Kicinski wrote: > On Fri, 6 May 2022 06:44:40 +0000 Maximilian Heyne wrote: >> On 2022-05-04T20:16:32-07:00 Jakub Kicinski <kuba@kernel.org> wrote: >> >>> On Tue, 3 May 2022 15:00:17 +0000 Maximilian Heyne wrote: >>>> + for (i = 0; i < adapter->num_vfs; i++) { >>>> + ethtool_sprintf(&p, "VF %u Rx Packets", i); >>>> + ethtool_sprintf(&p, "VF %u Rx Bytes", i); >>>> + ethtool_sprintf(&p, "VF %u Tx Packets", i); >>>> + ethtool_sprintf(&p, "VF %u Tx Bytes", i); >>>> + ethtool_sprintf(&p, "VF %u MC Packets", i); >>>> + } >>> >>> Please drop the ethtool stats. We've been trying to avoid duplicating >>> the same stats in ethtool and iproute2 for a while now. >> >> I can see the point that standard metrics should only be reported via the >> iproute2 interface. However, in this special case this patch was >> intended to converge the out-of-tree driver with the in-tree version. >> These missing stats were breaking our userspace. If we now switch >> solely to iproute2 based statistics both driver versions would >> diverge even more. So depending on where a user gets the ixgbe driver >> from, they have to work-around. >> >> I can change the patch as requested, but it will contradict the inital >> intention. At least Intel should then port this change to the >> external driver as well. Let's get a statement from them. We discussed this patch internally and concluded the correct approach would be to not have the ethtool stats and use the VF info. Jakub beat us to the comment. We would plan to port the iproute implementation to OOT as well. > Ack, but we really want people to move towards using standard stats. > > Can you change the user space to first try reading the stats via > iproute2/rtnetlink? And fallback to random ethtool strings if not > available? That way it will work with any driver implementing the > standard API. Long term that'll make everyone's life easier. > > Out-of-tree code cannot be an argument upstream, otherwise we'd > completely lose control over our APIs. Vendors could ship whatever > in their out of tree repo and then force us to accept it upstream. > > It's disappointing to see the vendor letting the uAPI of the out of > tree driver diverge from upstream, especially a driver this mature.
On 2022-05-06T09:20:14-07:00 Tony Nguyen <anthony.l.nguyen@intel.com> wrote: > On 5/6/2022 9:13 AM, Jakub Kicinski wrote: > > On Fri, 6 May 2022 06:44:40 +0000 Maximilian Heyne wrote: > >> On 2022-05-04T20:16:32-07:00 Jakub Kicinski <kuba@kernel.org> wrote: > >> > >>> On Tue, 3 May 2022 15:00:17 +0000 Maximilian Heyne wrote: > >>>> + for (i = 0; i < adapter->num_vfs; i++) { > >>>> + ethtool_sprintf(&p, "VF %u Rx Packets", i); > >>>> + ethtool_sprintf(&p, "VF %u Rx Bytes", i); > >>>> + ethtool_sprintf(&p, "VF %u Tx Packets", i); > >>>> + ethtool_sprintf(&p, "VF %u Tx Bytes", i); > >>>> + ethtool_sprintf(&p, "VF %u MC Packets", i); > >>>> + } > >>> > >>> Please drop the ethtool stats. We've been trying to avoid duplicating > >>> the same stats in ethtool and iproute2 for a while now. > >> > >> I can see the point that standard metrics should only be reported via the > >> iproute2 interface. However, in this special case this patch was > >> intended to converge the out-of-tree driver with the in-tree version. > >> These missing stats were breaking our userspace. If we now switch > >> solely to iproute2 based statistics both driver versions would > >> diverge even more. So depending on where a user gets the ixgbe driver > >> from, they have to work-around. > >> > >> I can change the patch as requested, but it will contradict the inital > >> intention. At least Intel should then port this change to the > >> external driver as well. Let's get a statement from them. > > We discussed this patch internally and concluded the correct approach > would be to not have the ethtool stats and use the VF info. Jakub beat > us to the comment. We would plan to port the iproute implementation to > OOT as well. Ok, then I'll send a follow-up patch without the ethtool changes. I'm happy to get some kind of convergence between the out-of-tree driver and upstream. While at it, I wonder whether other drivers need similar changes as well and what other features are missing in the upstream driver. There should be no surprises when switching between these drivers. > > > Ack, but we really want people to move towards using standard stats. > > > > Can you change the user space to first try reading the stats via > > iproute2/rtnetlink? And fallback to random ethtool strings if not > > available? That way it will work with any driver implementing the > > standard API. Long term that'll make everyone's life easier. Yes, in this case we are in control of user space and can work around. > > > > Out-of-tree code cannot be an argument upstream, otherwise we'd > > completely lose control over our APIs. Vendors could ship whatever > > in their out of tree repo and then force us to accept it upstream. > > > > It's disappointing to see the vendor letting the uAPI of the out of > > tree driver diverge from upstream, especially a driver this mature. > Amazon Development Center Germany GmbH Krausenstr. 38 10117 Berlin Geschaeftsfuehrung: Christian Schlaeger, Jonathan Weiss Eingetragen am Amtsgericht Charlottenburg unter HRB 149173 B Sitz: Berlin Ust-ID: DE 289 237 879
diff --git a/drivers/net/ethernet/intel/ixgbe/ixgbe.h b/drivers/net/ethernet/intel/ixgbe/ixgbe.h index 921a4d977d65..48444ab9e0b1 100644 --- a/drivers/net/ethernet/intel/ixgbe/ixgbe.h +++ b/drivers/net/ethernet/intel/ixgbe/ixgbe.h @@ -167,12 +167,46 @@ enum ixgbe_tx_flags { #define IXGBE_82599_VF_DEVICE_ID 0x10ED #define IXGBE_X540_VF_DEVICE_ID 0x1515 +#define UPDATE_VF_COUNTER_32bit(reg, last_counter, counter) \ + { \ + u32 current_counter = IXGBE_READ_REG(hw, reg); \ + if (current_counter < last_counter) \ + counter += 0x100000000LL; \ + last_counter = current_counter; \ + counter &= 0xFFFFFFFF00000000LL; \ + counter |= current_counter; \ + } + +#define UPDATE_VF_COUNTER_36bit(reg_lsb, reg_msb, last_counter, counter) \ + { \ + u64 current_counter_lsb = IXGBE_READ_REG(hw, reg_lsb); \ + u64 current_counter_msb = IXGBE_READ_REG(hw, reg_msb); \ + u64 current_counter = (current_counter_msb << 32) | \ + current_counter_lsb; \ + if (current_counter < last_counter) \ + counter += 0x1000000000LL; \ + last_counter = current_counter; \ + counter &= 0xFFFFFFF000000000LL; \ + counter |= current_counter; \ + } + +struct vf_stats { + u64 gprc; + u64 gorc; + u64 gptc; + u64 gotc; + u64 mprc; +}; + struct vf_data_storage { struct pci_dev *vfdev; unsigned char vf_mac_addresses[ETH_ALEN]; u16 vf_mc_hashes[IXGBE_MAX_VF_MC_ENTRIES]; u16 num_vf_mc_hashes; bool clear_to_send; + struct vf_stats vfstats; + struct vf_stats last_vfstats; + struct vf_stats saved_rst_vfstats; bool pf_set_mac; u16 pf_vlan; /* When set, guest VLAN config not allowed. */ u16 pf_qos; diff --git a/drivers/net/ethernet/intel/ixgbe/ixgbe_ethtool.c b/drivers/net/ethernet/intel/ixgbe/ixgbe_ethtool.c index 628d0eb0599f..ae94d927bb46 100644 --- a/drivers/net/ethernet/intel/ixgbe/ixgbe_ethtool.c +++ b/drivers/net/ethernet/intel/ixgbe/ixgbe_ethtool.c @@ -122,9 +122,13 @@ static const struct ixgbe_stats ixgbe_gstrings_stats[] = { sizeof(((struct ixgbe_adapter *)0)->stats.pxoffrxc) + \ sizeof(((struct ixgbe_adapter *)0)->stats.pxofftxc)) \ / sizeof(u64)) +#define IXGBE_VF_STATS_LEN \ + ((((struct ixgbe_adapter *)netdev_priv(netdev))->num_vfs) * \ + (sizeof(struct vf_stats) / sizeof(u64))) #define IXGBE_STATS_LEN (IXGBE_GLOBAL_STATS_LEN + \ IXGBE_PB_STATS_LEN + \ - IXGBE_QUEUE_STATS_LEN) + IXGBE_QUEUE_STATS_LEN + \ + IXGBE_VF_STATS_LEN) static const char ixgbe_gstrings_test[][ETH_GSTRING_LEN] = { "Register test (offline)", "Eeprom test (offline)", @@ -1302,6 +1306,8 @@ static void ixgbe_get_ethtool_stats(struct net_device *netdev, struct ixgbe_adapter *adapter = netdev_priv(netdev); struct rtnl_link_stats64 temp; const struct rtnl_link_stats64 *net_stats; + u64 *queue_stat; + int stat_count, k; unsigned int start; struct ixgbe_ring *ring; int i, j; @@ -1368,11 +1374,22 @@ static void ixgbe_get_ethtool_stats(struct net_device *netdev, data[i++] = adapter->stats.pxonrxc[j]; data[i++] = adapter->stats.pxoffrxc[j]; } + stat_count = sizeof(struct vf_stats) / sizeof(u64); + for (j = 0; j < adapter->num_vfs; j++) { + queue_stat = (u64 *)&adapter->vfinfo[j].vfstats; + for (k = 0; k < stat_count; k++) + data[i + k] = queue_stat[k]; + queue_stat = (u64 *)&adapter->vfinfo[j].saved_rst_vfstats; + for (k = 0; k < stat_count; k++) + data[i + k] += queue_stat[k]; + i += k; + } } static void ixgbe_get_strings(struct net_device *netdev, u32 stringset, u8 *data) { + struct ixgbe_adapter *adapter = netdev_priv(netdev); unsigned int i; u8 *p = data; @@ -1401,6 +1418,13 @@ static void ixgbe_get_strings(struct net_device *netdev, u32 stringset, ethtool_sprintf(&p, "rx_pb_%u_pxon", i); ethtool_sprintf(&p, "rx_pb_%u_pxoff", i); } + for (i = 0; i < adapter->num_vfs; i++) { + ethtool_sprintf(&p, "VF %u Rx Packets", i); + ethtool_sprintf(&p, "VF %u Rx Bytes", i); + ethtool_sprintf(&p, "VF %u Tx Packets", i); + ethtool_sprintf(&p, "VF %u Tx Bytes", i); + ethtool_sprintf(&p, "VF %u MC Packets", i); + } /* BUG_ON(p - data != IXGBE_STATS_LEN * ETH_GSTRING_LEN); */ break; case ETH_SS_PRIV_FLAGS: diff --git a/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c b/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c index c4a4954aa317..f60d8b425f61 100644 --- a/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c +++ b/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c @@ -5548,6 +5548,47 @@ static int ixgbe_non_sfp_link_config(struct ixgbe_hw *hw) return ret; } +/** + * ixgbe_clear_vf_stats_counters - Clear out VF stats after reset + * @adapter: board private structure + * + * On a reset we need to clear out the VF stats or accounting gets + * messed up because they're not clear on read. + **/ +static void ixgbe_clear_vf_stats_counters(struct ixgbe_adapter *adapter) +{ + struct ixgbe_hw *hw = &adapter->hw; + int i; + + for (i = 0; i < adapter->num_vfs; i++) { + adapter->vfinfo[i].last_vfstats.gprc = + IXGBE_READ_REG(hw, IXGBE_PVFGPRC(i)); + adapter->vfinfo[i].saved_rst_vfstats.gprc += + adapter->vfinfo[i].vfstats.gprc; + adapter->vfinfo[i].vfstats.gprc = 0; + adapter->vfinfo[i].last_vfstats.gptc = + IXGBE_READ_REG(hw, IXGBE_PVFGPTC(i)); + adapter->vfinfo[i].saved_rst_vfstats.gptc += + adapter->vfinfo[i].vfstats.gptc; + adapter->vfinfo[i].vfstats.gptc = 0; + adapter->vfinfo[i].last_vfstats.gorc = + IXGBE_READ_REG(hw, IXGBE_PVFGORC_LSB(i)); + adapter->vfinfo[i].saved_rst_vfstats.gorc += + adapter->vfinfo[i].vfstats.gorc; + adapter->vfinfo[i].vfstats.gorc = 0; + adapter->vfinfo[i].last_vfstats.gotc = + IXGBE_READ_REG(hw, IXGBE_PVFGOTC_LSB(i)); + adapter->vfinfo[i].saved_rst_vfstats.gotc += + adapter->vfinfo[i].vfstats.gotc; + adapter->vfinfo[i].vfstats.gotc = 0; + adapter->vfinfo[i].last_vfstats.mprc = + IXGBE_READ_REG(hw, IXGBE_PVFMPRC(i)); + adapter->vfinfo[i].saved_rst_vfstats.mprc += + adapter->vfinfo[i].vfstats.mprc; + adapter->vfinfo[i].vfstats.mprc = 0; + } +} + static void ixgbe_setup_gpie(struct ixgbe_adapter *adapter) { struct ixgbe_hw *hw = &adapter->hw; @@ -5683,6 +5724,7 @@ static void ixgbe_up_complete(struct ixgbe_adapter *adapter) adapter->link_check_timeout = jiffies; mod_timer(&adapter->service_timer, jiffies); + ixgbe_clear_vf_stats_counters(adapter); /* Set PF Reset Done bit so PF/VF Mail Ops can work */ ctrl_ext = IXGBE_READ_REG(hw, IXGBE_CTRL_EXT); ctrl_ext |= IXGBE_CTRL_EXT_PFRSTD; @@ -7270,6 +7312,32 @@ void ixgbe_update_stats(struct ixgbe_adapter *adapter) netdev->stats.rx_length_errors = hwstats->rlec; netdev->stats.rx_crc_errors = hwstats->crcerrs; netdev->stats.rx_missed_errors = total_mpc; + + /* VF Stats Collection - skip while resetting because these + * are not clear on read and otherwise you'll sometimes get + * crazy values. + */ + if (!test_bit(__IXGBE_RESETTING, &adapter->state)) { + for (i = 0; i < adapter->num_vfs; i++) { + UPDATE_VF_COUNTER_32bit(IXGBE_PVFGPRC(i), + adapter->vfinfo[i].last_vfstats.gprc, + adapter->vfinfo[i].vfstats.gprc); + UPDATE_VF_COUNTER_32bit(IXGBE_PVFGPTC(i), + adapter->vfinfo[i].last_vfstats.gptc, + adapter->vfinfo[i].vfstats.gptc); + UPDATE_VF_COUNTER_36bit(IXGBE_PVFGORC_LSB(i), + IXGBE_PVFGORC_MSB(i), + adapter->vfinfo[i].last_vfstats.gorc, + adapter->vfinfo[i].vfstats.gorc); + UPDATE_VF_COUNTER_36bit(IXGBE_PVFGOTC_LSB(i), + IXGBE_PVFGOTC_MSB(i), + adapter->vfinfo[i].last_vfstats.gotc, + adapter->vfinfo[i].vfstats.gotc); + UPDATE_VF_COUNTER_32bit(IXGBE_PVFMPRC(i), + adapter->vfinfo[i].last_vfstats.mprc, + adapter->vfinfo[i].vfstats.mprc); + } + } } /** @@ -8995,6 +9063,23 @@ static void ixgbe_get_stats64(struct net_device *netdev, stats->rx_missed_errors = netdev->stats.rx_missed_errors; } +static int ixgbe_ndo_get_vf_stats(struct net_device *netdev, int vf, + struct ifla_vf_stats *vf_stats) +{ + struct ixgbe_adapter *adapter = netdev_priv(netdev); + + if (vf < 0 || vf >= adapter->num_vfs) + return -EINVAL; + + vf_stats->rx_packets = adapter->vfinfo[vf].vfstats.gprc; + vf_stats->rx_bytes = adapter->vfinfo[vf].vfstats.gorc; + vf_stats->tx_packets = adapter->vfinfo[vf].vfstats.gptc; + vf_stats->tx_bytes = adapter->vfinfo[vf].vfstats.gotc; + vf_stats->multicast = adapter->vfinfo[vf].vfstats.mprc; + + return 0; +} + #ifdef CONFIG_IXGBE_DCB /** * ixgbe_validate_rtr - verify 802.1Qp to Rx packet buffer mapping is valid. @@ -10311,6 +10396,7 @@ static const struct net_device_ops ixgbe_netdev_ops = { .ndo_set_vf_rss_query_en = ixgbe_ndo_set_vf_rss_query_en, .ndo_set_vf_trust = ixgbe_ndo_set_vf_trust, .ndo_get_vf_config = ixgbe_ndo_get_vf_config, + .ndo_get_vf_stats = ixgbe_ndo_get_vf_stats, .ndo_get_stats64 = ixgbe_get_stats64, .ndo_setup_tc = __ixgbe_setup_tc, #ifdef IXGBE_FCOE diff --git a/drivers/net/ethernet/intel/ixgbe/ixgbe_type.h b/drivers/net/ethernet/intel/ixgbe/ixgbe_type.h index 6da9880d766a..7f7ea468ffa9 100644 --- a/drivers/net/ethernet/intel/ixgbe/ixgbe_type.h +++ b/drivers/net/ethernet/intel/ixgbe/ixgbe_type.h @@ -2533,6 +2533,13 @@ enum { #define IXGBE_PVFTXDCTL(P) (0x06028 + (0x40 * (P))) #define IXGBE_PVFTDWBAL(P) (0x06038 + (0x40 * (P))) #define IXGBE_PVFTDWBAH(P) (0x0603C + (0x40 * (P))) +#define IXGBE_PVFGPRC(x) (0x0101C + (0x40 * (x))) +#define IXGBE_PVFGPTC(x) (0x08300 + (0x04 * (x))) +#define IXGBE_PVFGORC_LSB(x) (0x01020 + (0x40 * (x))) +#define IXGBE_PVFGORC_MSB(x) (0x0D020 + (0x40 * (x))) +#define IXGBE_PVFGOTC_LSB(x) (0x08400 + (0x08 * (x))) +#define IXGBE_PVFGOTC_MSB(x) (0x08404 + (0x08 * (x))) +#define IXGBE_PVFMPRC(x) (0x0D01C + (0x40 * (x))) #define IXGBE_PVFTDWBALn(q_per_pool, vf_number, vf_q_index) \ (IXGBE_PVFTDWBAL((q_per_pool)*(vf_number) + (vf_q_index)))
This change exports network statistics for individual virtual functions via ethtool and iproute2. The code for retrieving the statistics from the device and printing via ethtool is taken from the out-of-tree driver. The feature was introduced with version 2.0.75.7, so the diff between this version and the previous version 2.0.72.4 was used to identify required changes. I took commit c8d4725e985d ("intel: Update drivers to use ethtool_sprintf") into account and fixed a few style issues. Apart from that, ndo_get_vf_stats is implemented so that iproute2 also prints the statistics. Signed-off-by: Maximilian Heyne <mheyne@amazon.de> --- v2: implemented the ndo_get_vf_stats callback drivers/net/ethernet/intel/ixgbe/ixgbe.h | 34 ++++++++ .../net/ethernet/intel/ixgbe/ixgbe_ethtool.c | 26 +++++- drivers/net/ethernet/intel/ixgbe/ixgbe_main.c | 86 +++++++++++++++++++ drivers/net/ethernet/intel/ixgbe/ixgbe_type.h | 7 ++ 4 files changed, 152 insertions(+), 1 deletion(-)