Message ID | 20250110192616.2075055-5-sean.anderson@linux.dev (mailing list archive) |
---|---|
State | New |
Delegated to: | Netdev Maintainers |
Headers | show |
Series | net: xilinx: axienet: Enable adaptive IRQ coalescing with DIM | expand |
Context | Check | Description |
---|---|---|
netdev/tree_selection | success | Clearly marked for net-next, async |
netdev/apply | fail | Patch does not apply to net-next-1 |
On 1/10/2025 11:26 AM, Sean Anderson wrote: > > In preparation for adaptive IRQ coalescing, we first need to support > adjusting the settings at runtime. The existing code doesn't require any > locking because > > - dma_start is the only function that modifies rx/tx_dma_cr. It is > always called with IRQs and NAPI disabled, so nothing else is touching > the hardware. > - The IRQs don't race with poll, since the latter is a softirq. > - The IRQs don't race with dma_stop since they both just clear the > control registers. > - dma_stop doesn't race with poll since the former is called with NAPI > disabled. > > However, once we introduce another function that modifies rx/tx_dma_cr, > we need to have some locking to prevent races. Introduce two locks to > protect these variables and their registers. > > The control register values are now generated where the coalescing > settings are set. Converting coalescing settings to control register > values may require sleeping because of clk_get_rate. However, the > read/modify/write of the control registers themselves can't sleep > because it needs to happen in IRQ context. By pre-calculating the > control register values, we avoid introducing an additional mutex. > > Since axienet_dma_start writes the control settings when it runs, we > don't bother updating the CR registers when rx/tx_dma_started is false. > This prevents any issues from writing to the control registers in the > middle of a reset sequence. > > Signed-off-by: Sean Anderson <sean.anderson@linux.dev> Reviewed-by: Shannon Nelson <shannon.nelson@amd.com> > --- > > Changes in v3: > - Move spin (un)locking in IRQs inside the if condition of > napi_schedule_prep. This lets us hold the lock just for the rmw. > - Fix function name in doc comments for axienet_update_coalesce_rx/tx > > Changes in v2: > - Don't use spin_lock_irqsave when we know the context > - Split the CR calculation refactor from runtime coalesce settings > adjustment support for easier review. > - Have axienet_update_coalesce_rx/tx take the cr value/mask instead of > calculating it with axienet_calc_cr. This will make it easier to add > partial updates in the next few commits. > - Split off CR calculation merging into another patch > > drivers/net/ethernet/xilinx/xilinx_axienet.h | 8 ++ > .../net/ethernet/xilinx/xilinx_axienet_main.c | 134 +++++++++++++++--- > 2 files changed, 119 insertions(+), 23 deletions(-) > > diff --git a/drivers/net/ethernet/xilinx/xilinx_axienet.h b/drivers/net/ethernet/xilinx/xilinx_axienet.h > index 8fd3b45ef6aa..6b8e550c2155 100644 > --- a/drivers/net/ethernet/xilinx/xilinx_axienet.h > +++ b/drivers/net/ethernet/xilinx/xilinx_axienet.h > @@ -484,7 +484,9 @@ struct skbuf_dma_descriptor { > * @regs: Base address for the axienet_local device address space > * @dma_regs: Base address for the axidma device address space > * @napi_rx: NAPI RX control structure > + * @rx_cr_lock: Lock protecting @rx_dma_cr, its register, and @rx_dma_started > * @rx_dma_cr: Nominal content of RX DMA control register > + * @rx_dma_started: Set when RX DMA is started > * @rx_bd_v: Virtual address of the RX buffer descriptor ring > * @rx_bd_p: Physical address(start address) of the RX buffer descr. ring > * @rx_bd_num: Size of RX buffer descriptor ring > @@ -494,7 +496,9 @@ struct skbuf_dma_descriptor { > * @rx_bytes: RX byte count for statistics > * @rx_stat_sync: Synchronization object for RX stats > * @napi_tx: NAPI TX control structure > + * @tx_cr_lock: Lock protecting @tx_dma_cr, its register, and @tx_dma_started > * @tx_dma_cr: Nominal content of TX DMA control register > + * @tx_dma_started: Set when TX DMA is started > * @tx_bd_v: Virtual address of the TX buffer descriptor ring > * @tx_bd_p: Physical address(start address) of the TX buffer descr. ring > * @tx_bd_num: Size of TX buffer descriptor ring > @@ -566,7 +570,9 @@ struct axienet_local { > void __iomem *dma_regs; > > struct napi_struct napi_rx; > + spinlock_t rx_cr_lock; > u32 rx_dma_cr; > + bool rx_dma_started; > struct axidma_bd *rx_bd_v; > dma_addr_t rx_bd_p; > u32 rx_bd_num; > @@ -576,7 +582,9 @@ struct axienet_local { > struct u64_stats_sync rx_stat_sync; > > struct napi_struct napi_tx; > + spinlock_t tx_cr_lock; > u32 tx_dma_cr; > + bool tx_dma_started; > struct axidma_bd *tx_bd_v; > dma_addr_t tx_bd_p; > u32 tx_bd_num; > diff --git a/drivers/net/ethernet/xilinx/xilinx_axienet_main.c b/drivers/net/ethernet/xilinx/xilinx_axienet_main.c > index 961c9c9e5e18..e00759012894 100644 > --- a/drivers/net/ethernet/xilinx/xilinx_axienet_main.c > +++ b/drivers/net/ethernet/xilinx/xilinx_axienet_main.c > @@ -266,16 +266,12 @@ static u32 axienet_calc_cr(struct axienet_local *lp, u32 count, u32 usec) > */ > static void axienet_dma_start(struct axienet_local *lp) > { > + spin_lock_irq(&lp->rx_cr_lock); > + > /* Start updating the Rx channel control register */ > - lp->rx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_rx, > - lp->coalesce_usec_rx); > + lp->rx_dma_cr &= ~XAXIDMA_CR_RUNSTOP_MASK; > axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, lp->rx_dma_cr); > > - /* Start updating the Tx channel control register */ > - lp->tx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_tx, > - lp->coalesce_usec_tx); > - axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); > - > /* Populate the tail pointer and bring the Rx Axi DMA engine out of > * halted state. This will make the Rx side ready for reception. > */ > @@ -284,6 +280,14 @@ static void axienet_dma_start(struct axienet_local *lp) > axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, lp->rx_dma_cr); > axienet_dma_out_addr(lp, XAXIDMA_RX_TDESC_OFFSET, lp->rx_bd_p + > (sizeof(*lp->rx_bd_v) * (lp->rx_bd_num - 1))); > + lp->rx_dma_started = true; > + > + spin_unlock_irq(&lp->rx_cr_lock); > + spin_lock_irq(&lp->tx_cr_lock); > + > + /* Start updating the Tx channel control register */ > + lp->tx_dma_cr &= ~XAXIDMA_CR_RUNSTOP_MASK; > + axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); > > /* Write to the RS (Run-stop) bit in the Tx channel control register. > * Tx channel is now ready to run. But only after we write to the > @@ -292,6 +296,9 @@ static void axienet_dma_start(struct axienet_local *lp) > axienet_dma_out_addr(lp, XAXIDMA_TX_CDESC_OFFSET, lp->tx_bd_p); > lp->tx_dma_cr |= XAXIDMA_CR_RUNSTOP_MASK; > axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); > + lp->tx_dma_started = true; > + > + spin_unlock_irq(&lp->tx_cr_lock); > } > > /** > @@ -627,14 +634,22 @@ static void axienet_dma_stop(struct axienet_local *lp) > int count; > u32 cr, sr; > > - cr = axienet_dma_in32(lp, XAXIDMA_RX_CR_OFFSET); > - cr &= ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); > + spin_lock_irq(&lp->rx_cr_lock); > + > + cr = lp->rx_dma_cr & ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); > axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, cr); > + lp->rx_dma_started = false; > + > + spin_unlock_irq(&lp->rx_cr_lock); > synchronize_irq(lp->rx_irq); > > - cr = axienet_dma_in32(lp, XAXIDMA_TX_CR_OFFSET); > - cr &= ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); > + spin_lock_irq(&lp->tx_cr_lock); > + > + cr = lp->tx_dma_cr & ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); > axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, cr); > + lp->tx_dma_started = false; > + > + spin_unlock_irq(&lp->tx_cr_lock); > synchronize_irq(lp->tx_irq); > > /* Give DMAs a chance to halt gracefully */ > @@ -983,7 +998,9 @@ static int axienet_tx_poll(struct napi_struct *napi, int budget) > * cause an immediate interrupt if any TX packets are > * already pending. > */ > + spin_lock_irq(&lp->tx_cr_lock); > axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); > + spin_unlock_irq(&lp->tx_cr_lock); > } > return packets; > } > @@ -1249,7 +1266,9 @@ static int axienet_rx_poll(struct napi_struct *napi, int budget) > * cause an immediate interrupt if any RX packets are > * already pending. > */ > + spin_lock_irq(&lp->rx_cr_lock); > axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, lp->rx_dma_cr); > + spin_unlock_irq(&lp->rx_cr_lock); > } > return packets; > } > @@ -1287,11 +1306,14 @@ static irqreturn_t axienet_tx_irq(int irq, void *_ndev) > /* Disable further TX completion interrupts and schedule > * NAPI to handle the completions. > */ > - u32 cr = lp->tx_dma_cr; > - > - cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); > if (napi_schedule_prep(&lp->napi_tx)) { > + u32 cr; > + > + spin_lock(&lp->tx_cr_lock); > + cr = lp->tx_dma_cr; > + cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); > axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, cr); > + spin_unlock(&lp->tx_cr_lock); > __napi_schedule(&lp->napi_tx); > } > } > @@ -1332,11 +1354,15 @@ static irqreturn_t axienet_rx_irq(int irq, void *_ndev) > /* Disable further RX completion interrupts and schedule > * NAPI receive. > */ > - u32 cr = lp->rx_dma_cr; > - > - cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); > if (napi_schedule_prep(&lp->napi_rx)) { > + u32 cr; > + > + spin_lock(&lp->rx_cr_lock); > + cr = lp->rx_dma_cr; > + cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); > axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, cr); > + spin_unlock(&lp->rx_cr_lock); > + > __napi_schedule(&lp->napi_rx); > } > } > @@ -2002,6 +2028,62 @@ axienet_ethtools_set_pauseparam(struct net_device *ndev, > return phylink_ethtool_set_pauseparam(lp->phylink, epauseparm); > } > > +/** > + * axienet_update_coalesce_rx() - Set RX CR > + * @lp: Device private data > + * @cr: Value to write to the RX CR > + * @mask: Bits to set from @cr > + */ > +static void axienet_update_coalesce_rx(struct axienet_local *lp, u32 cr, > + u32 mask) > +{ > + spin_lock_irq(&lp->rx_cr_lock); > + lp->rx_dma_cr &= ~mask; > + lp->rx_dma_cr |= cr; > + /* If DMA isn't started, then the settings will be applied the next > + * time dma_start() is called. > + */ > + if (lp->rx_dma_started) { > + u32 reg = axienet_dma_in32(lp, XAXIDMA_RX_CR_OFFSET); > + > + /* Don't enable IRQs if they are disabled by NAPI */ > + if (reg & XAXIDMA_IRQ_ALL_MASK) > + cr = lp->rx_dma_cr; > + else > + cr = lp->rx_dma_cr & ~XAXIDMA_IRQ_ALL_MASK; > + axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, cr); > + } > + spin_unlock_irq(&lp->rx_cr_lock); > +} > + > +/** > + * axienet_update_coalesce_tx() - Set TX CR > + * @lp: Device private data > + * @cr: Value to write to the TX CR > + * @mask: Bits to set from @cr > + */ > +static void axienet_update_coalesce_tx(struct axienet_local *lp, u32 cr, > + u32 mask) > +{ > + spin_lock_irq(&lp->tx_cr_lock); > + lp->tx_dma_cr &= ~mask; > + lp->tx_dma_cr |= cr; > + /* If DMA isn't started, then the settings will be applied the next > + * time dma_start() is called. > + */ > + if (lp->tx_dma_started) { > + u32 reg = axienet_dma_in32(lp, XAXIDMA_TX_CR_OFFSET); > + > + /* Don't enable IRQs if they are disabled by NAPI */ > + if (reg & XAXIDMA_IRQ_ALL_MASK) > + cr = lp->tx_dma_cr; > + else > + cr = lp->tx_dma_cr & ~XAXIDMA_IRQ_ALL_MASK; > + axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, cr); > + } > + spin_unlock_irq(&lp->tx_cr_lock); > +} > + > /** > * axienet_ethtools_get_coalesce - Get DMA interrupt coalescing count. > * @ndev: Pointer to net_device structure > @@ -2050,12 +2132,7 @@ axienet_ethtools_set_coalesce(struct net_device *ndev, > struct netlink_ext_ack *extack) > { > struct axienet_local *lp = netdev_priv(ndev); > - > - if (netif_running(ndev)) { > - NL_SET_ERR_MSG(extack, > - "Please stop netif before applying configuration"); > - return -EBUSY; > - } > + u32 cr; > > if (ecoalesce->rx_max_coalesced_frames > 255 || > ecoalesce->tx_max_coalesced_frames > 255) { > @@ -2083,6 +2160,11 @@ axienet_ethtools_set_coalesce(struct net_device *ndev, > lp->coalesce_count_tx = ecoalesce->tx_max_coalesced_frames; > lp->coalesce_usec_tx = ecoalesce->tx_coalesce_usecs; > > + cr = axienet_calc_cr(lp, lp->coalesce_count_rx, lp->coalesce_usec_rx); > + axienet_update_coalesce_rx(lp, cr, ~XAXIDMA_CR_RUNSTOP_MASK); > + > + cr = axienet_calc_cr(lp, lp->coalesce_count_tx, lp->coalesce_usec_tx); > + axienet_update_coalesce_tx(lp, cr, ~XAXIDMA_CR_RUNSTOP_MASK); > return 0; > } > > @@ -2861,10 +2943,16 @@ static int axienet_probe(struct platform_device *pdev) > axienet_set_mac_address(ndev, NULL); > } > > + spin_lock_init(&lp->rx_cr_lock); > + spin_lock_init(&lp->tx_cr_lock); > lp->coalesce_count_rx = XAXIDMA_DFT_RX_THRESHOLD; > lp->coalesce_count_tx = XAXIDMA_DFT_TX_THRESHOLD; > lp->coalesce_usec_rx = XAXIDMA_DFT_RX_USEC; > lp->coalesce_usec_tx = XAXIDMA_DFT_TX_USEC; > + lp->rx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_rx, > + lp->coalesce_usec_rx); > + lp->tx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_tx, > + lp->coalesce_usec_tx); > > ret = axienet_mdio_setup(lp); > if (ret) > -- > 2.35.1.1320.gc452695387.dirty >
diff --git a/drivers/net/ethernet/xilinx/xilinx_axienet.h b/drivers/net/ethernet/xilinx/xilinx_axienet.h index 8fd3b45ef6aa..6b8e550c2155 100644 --- a/drivers/net/ethernet/xilinx/xilinx_axienet.h +++ b/drivers/net/ethernet/xilinx/xilinx_axienet.h @@ -484,7 +484,9 @@ struct skbuf_dma_descriptor { * @regs: Base address for the axienet_local device address space * @dma_regs: Base address for the axidma device address space * @napi_rx: NAPI RX control structure + * @rx_cr_lock: Lock protecting @rx_dma_cr, its register, and @rx_dma_started * @rx_dma_cr: Nominal content of RX DMA control register + * @rx_dma_started: Set when RX DMA is started * @rx_bd_v: Virtual address of the RX buffer descriptor ring * @rx_bd_p: Physical address(start address) of the RX buffer descr. ring * @rx_bd_num: Size of RX buffer descriptor ring @@ -494,7 +496,9 @@ struct skbuf_dma_descriptor { * @rx_bytes: RX byte count for statistics * @rx_stat_sync: Synchronization object for RX stats * @napi_tx: NAPI TX control structure + * @tx_cr_lock: Lock protecting @tx_dma_cr, its register, and @tx_dma_started * @tx_dma_cr: Nominal content of TX DMA control register + * @tx_dma_started: Set when TX DMA is started * @tx_bd_v: Virtual address of the TX buffer descriptor ring * @tx_bd_p: Physical address(start address) of the TX buffer descr. ring * @tx_bd_num: Size of TX buffer descriptor ring @@ -566,7 +570,9 @@ struct axienet_local { void __iomem *dma_regs; struct napi_struct napi_rx; + spinlock_t rx_cr_lock; u32 rx_dma_cr; + bool rx_dma_started; struct axidma_bd *rx_bd_v; dma_addr_t rx_bd_p; u32 rx_bd_num; @@ -576,7 +582,9 @@ struct axienet_local { struct u64_stats_sync rx_stat_sync; struct napi_struct napi_tx; + spinlock_t tx_cr_lock; u32 tx_dma_cr; + bool tx_dma_started; struct axidma_bd *tx_bd_v; dma_addr_t tx_bd_p; u32 tx_bd_num; diff --git a/drivers/net/ethernet/xilinx/xilinx_axienet_main.c b/drivers/net/ethernet/xilinx/xilinx_axienet_main.c index 961c9c9e5e18..e00759012894 100644 --- a/drivers/net/ethernet/xilinx/xilinx_axienet_main.c +++ b/drivers/net/ethernet/xilinx/xilinx_axienet_main.c @@ -266,16 +266,12 @@ static u32 axienet_calc_cr(struct axienet_local *lp, u32 count, u32 usec) */ static void axienet_dma_start(struct axienet_local *lp) { + spin_lock_irq(&lp->rx_cr_lock); + /* Start updating the Rx channel control register */ - lp->rx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_rx, - lp->coalesce_usec_rx); + lp->rx_dma_cr &= ~XAXIDMA_CR_RUNSTOP_MASK; axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, lp->rx_dma_cr); - /* Start updating the Tx channel control register */ - lp->tx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_tx, - lp->coalesce_usec_tx); - axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); - /* Populate the tail pointer and bring the Rx Axi DMA engine out of * halted state. This will make the Rx side ready for reception. */ @@ -284,6 +280,14 @@ static void axienet_dma_start(struct axienet_local *lp) axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, lp->rx_dma_cr); axienet_dma_out_addr(lp, XAXIDMA_RX_TDESC_OFFSET, lp->rx_bd_p + (sizeof(*lp->rx_bd_v) * (lp->rx_bd_num - 1))); + lp->rx_dma_started = true; + + spin_unlock_irq(&lp->rx_cr_lock); + spin_lock_irq(&lp->tx_cr_lock); + + /* Start updating the Tx channel control register */ + lp->tx_dma_cr &= ~XAXIDMA_CR_RUNSTOP_MASK; + axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); /* Write to the RS (Run-stop) bit in the Tx channel control register. * Tx channel is now ready to run. But only after we write to the @@ -292,6 +296,9 @@ static void axienet_dma_start(struct axienet_local *lp) axienet_dma_out_addr(lp, XAXIDMA_TX_CDESC_OFFSET, lp->tx_bd_p); lp->tx_dma_cr |= XAXIDMA_CR_RUNSTOP_MASK; axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); + lp->tx_dma_started = true; + + spin_unlock_irq(&lp->tx_cr_lock); } /** @@ -627,14 +634,22 @@ static void axienet_dma_stop(struct axienet_local *lp) int count; u32 cr, sr; - cr = axienet_dma_in32(lp, XAXIDMA_RX_CR_OFFSET); - cr &= ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); + spin_lock_irq(&lp->rx_cr_lock); + + cr = lp->rx_dma_cr & ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, cr); + lp->rx_dma_started = false; + + spin_unlock_irq(&lp->rx_cr_lock); synchronize_irq(lp->rx_irq); - cr = axienet_dma_in32(lp, XAXIDMA_TX_CR_OFFSET); - cr &= ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); + spin_lock_irq(&lp->tx_cr_lock); + + cr = lp->tx_dma_cr & ~(XAXIDMA_CR_RUNSTOP_MASK | XAXIDMA_IRQ_ALL_MASK); axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, cr); + lp->tx_dma_started = false; + + spin_unlock_irq(&lp->tx_cr_lock); synchronize_irq(lp->tx_irq); /* Give DMAs a chance to halt gracefully */ @@ -983,7 +998,9 @@ static int axienet_tx_poll(struct napi_struct *napi, int budget) * cause an immediate interrupt if any TX packets are * already pending. */ + spin_lock_irq(&lp->tx_cr_lock); axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, lp->tx_dma_cr); + spin_unlock_irq(&lp->tx_cr_lock); } return packets; } @@ -1249,7 +1266,9 @@ static int axienet_rx_poll(struct napi_struct *napi, int budget) * cause an immediate interrupt if any RX packets are * already pending. */ + spin_lock_irq(&lp->rx_cr_lock); axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, lp->rx_dma_cr); + spin_unlock_irq(&lp->rx_cr_lock); } return packets; } @@ -1287,11 +1306,14 @@ static irqreturn_t axienet_tx_irq(int irq, void *_ndev) /* Disable further TX completion interrupts and schedule * NAPI to handle the completions. */ - u32 cr = lp->tx_dma_cr; - - cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); if (napi_schedule_prep(&lp->napi_tx)) { + u32 cr; + + spin_lock(&lp->tx_cr_lock); + cr = lp->tx_dma_cr; + cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, cr); + spin_unlock(&lp->tx_cr_lock); __napi_schedule(&lp->napi_tx); } } @@ -1332,11 +1354,15 @@ static irqreturn_t axienet_rx_irq(int irq, void *_ndev) /* Disable further RX completion interrupts and schedule * NAPI receive. */ - u32 cr = lp->rx_dma_cr; - - cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); if (napi_schedule_prep(&lp->napi_rx)) { + u32 cr; + + spin_lock(&lp->rx_cr_lock); + cr = lp->rx_dma_cr; + cr &= ~(XAXIDMA_IRQ_IOC_MASK | XAXIDMA_IRQ_DELAY_MASK); axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, cr); + spin_unlock(&lp->rx_cr_lock); + __napi_schedule(&lp->napi_rx); } } @@ -2002,6 +2028,62 @@ axienet_ethtools_set_pauseparam(struct net_device *ndev, return phylink_ethtool_set_pauseparam(lp->phylink, epauseparm); } +/** + * axienet_update_coalesce_rx() - Set RX CR + * @lp: Device private data + * @cr: Value to write to the RX CR + * @mask: Bits to set from @cr + */ +static void axienet_update_coalesce_rx(struct axienet_local *lp, u32 cr, + u32 mask) +{ + spin_lock_irq(&lp->rx_cr_lock); + lp->rx_dma_cr &= ~mask; + lp->rx_dma_cr |= cr; + /* If DMA isn't started, then the settings will be applied the next + * time dma_start() is called. + */ + if (lp->rx_dma_started) { + u32 reg = axienet_dma_in32(lp, XAXIDMA_RX_CR_OFFSET); + + /* Don't enable IRQs if they are disabled by NAPI */ + if (reg & XAXIDMA_IRQ_ALL_MASK) + cr = lp->rx_dma_cr; + else + cr = lp->rx_dma_cr & ~XAXIDMA_IRQ_ALL_MASK; + axienet_dma_out32(lp, XAXIDMA_RX_CR_OFFSET, cr); + } + spin_unlock_irq(&lp->rx_cr_lock); +} + +/** + * axienet_update_coalesce_tx() - Set TX CR + * @lp: Device private data + * @cr: Value to write to the TX CR + * @mask: Bits to set from @cr + */ +static void axienet_update_coalesce_tx(struct axienet_local *lp, u32 cr, + u32 mask) +{ + spin_lock_irq(&lp->tx_cr_lock); + lp->tx_dma_cr &= ~mask; + lp->tx_dma_cr |= cr; + /* If DMA isn't started, then the settings will be applied the next + * time dma_start() is called. + */ + if (lp->tx_dma_started) { + u32 reg = axienet_dma_in32(lp, XAXIDMA_TX_CR_OFFSET); + + /* Don't enable IRQs if they are disabled by NAPI */ + if (reg & XAXIDMA_IRQ_ALL_MASK) + cr = lp->tx_dma_cr; + else + cr = lp->tx_dma_cr & ~XAXIDMA_IRQ_ALL_MASK; + axienet_dma_out32(lp, XAXIDMA_TX_CR_OFFSET, cr); + } + spin_unlock_irq(&lp->tx_cr_lock); +} + /** * axienet_ethtools_get_coalesce - Get DMA interrupt coalescing count. * @ndev: Pointer to net_device structure @@ -2050,12 +2132,7 @@ axienet_ethtools_set_coalesce(struct net_device *ndev, struct netlink_ext_ack *extack) { struct axienet_local *lp = netdev_priv(ndev); - - if (netif_running(ndev)) { - NL_SET_ERR_MSG(extack, - "Please stop netif before applying configuration"); - return -EBUSY; - } + u32 cr; if (ecoalesce->rx_max_coalesced_frames > 255 || ecoalesce->tx_max_coalesced_frames > 255) { @@ -2083,6 +2160,11 @@ axienet_ethtools_set_coalesce(struct net_device *ndev, lp->coalesce_count_tx = ecoalesce->tx_max_coalesced_frames; lp->coalesce_usec_tx = ecoalesce->tx_coalesce_usecs; + cr = axienet_calc_cr(lp, lp->coalesce_count_rx, lp->coalesce_usec_rx); + axienet_update_coalesce_rx(lp, cr, ~XAXIDMA_CR_RUNSTOP_MASK); + + cr = axienet_calc_cr(lp, lp->coalesce_count_tx, lp->coalesce_usec_tx); + axienet_update_coalesce_tx(lp, cr, ~XAXIDMA_CR_RUNSTOP_MASK); return 0; } @@ -2861,10 +2943,16 @@ static int axienet_probe(struct platform_device *pdev) axienet_set_mac_address(ndev, NULL); } + spin_lock_init(&lp->rx_cr_lock); + spin_lock_init(&lp->tx_cr_lock); lp->coalesce_count_rx = XAXIDMA_DFT_RX_THRESHOLD; lp->coalesce_count_tx = XAXIDMA_DFT_TX_THRESHOLD; lp->coalesce_usec_rx = XAXIDMA_DFT_RX_USEC; lp->coalesce_usec_tx = XAXIDMA_DFT_TX_USEC; + lp->rx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_rx, + lp->coalesce_usec_rx); + lp->tx_dma_cr = axienet_calc_cr(lp, lp->coalesce_count_tx, + lp->coalesce_usec_tx); ret = axienet_mdio_setup(lp); if (ret)
In preparation for adaptive IRQ coalescing, we first need to support adjusting the settings at runtime. The existing code doesn't require any locking because - dma_start is the only function that modifies rx/tx_dma_cr. It is always called with IRQs and NAPI disabled, so nothing else is touching the hardware. - The IRQs don't race with poll, since the latter is a softirq. - The IRQs don't race with dma_stop since they both just clear the control registers. - dma_stop doesn't race with poll since the former is called with NAPI disabled. However, once we introduce another function that modifies rx/tx_dma_cr, we need to have some locking to prevent races. Introduce two locks to protect these variables and their registers. The control register values are now generated where the coalescing settings are set. Converting coalescing settings to control register values may require sleeping because of clk_get_rate. However, the read/modify/write of the control registers themselves can't sleep because it needs to happen in IRQ context. By pre-calculating the control register values, we avoid introducing an additional mutex. Since axienet_dma_start writes the control settings when it runs, we don't bother updating the CR registers when rx/tx_dma_started is false. This prevents any issues from writing to the control registers in the middle of a reset sequence. Signed-off-by: Sean Anderson <sean.anderson@linux.dev> --- Changes in v3: - Move spin (un)locking in IRQs inside the if condition of napi_schedule_prep. This lets us hold the lock just for the rmw. - Fix function name in doc comments for axienet_update_coalesce_rx/tx Changes in v2: - Don't use spin_lock_irqsave when we know the context - Split the CR calculation refactor from runtime coalesce settings adjustment support for easier review. - Have axienet_update_coalesce_rx/tx take the cr value/mask instead of calculating it with axienet_calc_cr. This will make it easier to add partial updates in the next few commits. - Split off CR calculation merging into another patch drivers/net/ethernet/xilinx/xilinx_axienet.h | 8 ++ .../net/ethernet/xilinx/xilinx_axienet_main.c | 134 +++++++++++++++--- 2 files changed, 119 insertions(+), 23 deletions(-)