[OpenWrt-Devel] [PATCH] ar71xx: check for stuck DMA on AR724x & fix sirq storm after recovery
Conn O'Griofa
connogriofa at gmail.com
Fri Jan 8 00:26:17 EST 2016
Hi,
I'm proposing the following patch to resolve ticket #18922 fully.
With the current master revision, the interface recovers successfully after a tx timeout, but a soft IRQ storm follows: ksoftirqd pegs the CPU because this goto is taken without end: https://github.com/openwrt-mirror/openwrt/blob/master/target/linux/ar71xx/files/drivers/net/ethernet/atheros/ag71xx/ag71xx_main.c#L1073 . Forcing the tx and rx rings to be cleaned up and re-initialized in ag71xx_restart_work_func seems to avoid the sirq storm, but I'd appreciate feedback on whether there's a more effective workaround.
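To make the failure mode concrete, here is a minimal user-space sketch of a poll routine that keeps asking to be rescheduled while a "pending" status bit stays set, even though the stale ring state means no descriptor can ever complete. It is a toy model and not the ag71xx code: toy_ring, toy_poll, toy_ring_reinit, toy_count_repolls and all of their fields are hypothetical names chosen for illustration, and the real driver's ring layout and status handling may well differ.

```c
#include <stdbool.h>

/* Toy model only: none of these names exist in the ag71xx driver. */
struct toy_ring {
    unsigned int curr;   /* next slot the driver would fill */
    unsigned int dirty;  /* next slot the driver would reclaim */
    bool hw_pending;     /* stands in for a "work pending" status bit */
    bool desc_done;      /* can the descriptor at 'dirty' ever complete? */
};

/*
 * Returns the amount of work reported to the scheduler.
 * Returning the full budget means "poll me again".
 */
static int toy_poll(struct toy_ring *r, int budget)
{
    int done = 0;

    /* Reclaim completed descriptors, if any can complete at all. */
    while (done < budget && r->dirty != r->curr && r->desc_done) {
        r->dirty++;
        done++;
    }

    /* Nothing outstanding any more, so the pending flag can clear. */
    if (r->dirty == r->curr)
        r->hw_pending = false;

    /* Stands in for the goto path described above: ask for another poll. */
    if (r->hw_pending)
        return budget;

    return done;
}

/* Models cleaning up and re-initializing the ring after a restart. */
static void toy_ring_reinit(struct toy_ring *r)
{
    r->curr = 0;
    r->dirty = 0;
    r->hw_pending = false;
    r->desc_done = true;
}

/* Counts how many consecutive polls ask to be rescheduled (capped). */
static int toy_count_repolls(struct toy_ring *r, int budget, int max_rounds)
{
    int rounds = 0;

    while (rounds < max_rounds && toy_poll(r, budget) == budget)
        rounds++;

    return rounds;
}
```

In this model, a ring left with stale indices and a stuck pending flag hits the cap in toy_count_repolls every time, while the same ring after toy_ring_reinit stops asking for another poll immediately. That is the behaviour the rings cleanup/init in the patch is meant to restore, under the assumption that stale ring state is what keeps the poll loop alive.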
Additionally, ag71xx_check_dma_stuck *does* successfully detect the stuck DMA condition on AR7241 (TL-WR842ND v1), so enabling the check for this chipset series ensures a link adjust occurs *before* an actual tx timeout is detected. This avoids the brief network interruption that normally occurs during the DMA stuck -> tx timeout -> link adjust sequence.
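The ordering argument above can be sketched as a small decision function: if a stuck-DMA check with a short threshold is enabled, the restart is scheduled long before the much longer watchdog timeout would fire. This is an illustrative model only. The thresholds TOY_STUCK_AFTER_MS and TOY_WATCHDOG_MS, the toy_action enum and toy_check are all hypothetical, and the hw_looks_stuck flag stands in for whatever hardware state ag71xx_check_dma_stuck actually inspects.

```c
#include <stdbool.h>

/* Hypothetical thresholds, chosen only to show the ordering. */
#define TOY_STUCK_AFTER_MS  100UL   /* early stuck-DMA check */
#define TOY_WATCHDOG_MS     5000UL  /* much later tx timeout */

enum toy_action {
    TOY_WAIT,           /* descriptor still within normal latency */
    TOY_RESTART_EARLY,  /* stuck DMA noticed, restart scheduled early */
    TOY_TX_TIMEOUT      /* watchdog fired, visible interruption */
};

/* Wrap-safe "a is after b", in the style of the kernel's time_after(). */
static bool toy_time_after(unsigned long a, unsigned long b)
{
    return (long)(b - a) < 0;
}

/*
 * Decide what happens to a descriptor queued at 'queued_ms' when it is
 * still pending at 'now_ms'.
 */
static enum toy_action toy_check(unsigned long now_ms,
                                 unsigned long queued_ms,
                                 bool check_enabled,
                                 bool hw_looks_stuck)
{
    /* Without an earlier restart, the watchdog eventually fires. */
    if (toy_time_after(now_ms, queued_ms + TOY_WATCHDOG_MS))
        return TOY_TX_TIMEOUT;

    /* With the check enabled, a stuck descriptor is caught much sooner. */
    if (check_enabled && hw_looks_stuck &&
        toy_time_after(now_ms, queued_ms + TOY_STUCK_AFTER_MS))
        return TOY_RESTART_EARLY;

    return TOY_WAIT;
}
```

With the check disabled, a stuck descriptor in this model simply waits until the watchdog threshold is crossed. With it enabled, the same descriptor triggers an early restart shortly after the short threshold, which corresponds to changing the is_ar7240 test to is_ar724x in ag71xx_tx_packets.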
Conn
P.S. The sirq storm also occurs when ag71xx_check_dma_stuck is used on this chipset to avoid the tx timeout condition, so it appears that both changes are necessary (or at least that a better way to solve the sirq storm needs to be found).
diff --git a/target/linux/ar71xx/files/drivers/net/ethernet/atheros/ag71xx/ag71xx_main.c b/target/linux/ar71xx/files/drivers/net/ethernet/atheros/ag71xx/ag71xx_main.c
index 31b38d7..8959701 100644
--- a/target/linux/ar71xx/files/drivers/net/ethernet/atheros/ag71xx/ag71xx_main.c
+++ b/target/linux/ar71xx/files/drivers/net/ethernet/atheros/ag71xx/ag71xx_main.c
@@ -183,6 +183,8 @@ static void ag71xx_ring_tx_init(struct ag71xx *ag)
ring->curr = 0;
ring->dirty = 0;
netdev_reset_queue(ag->dev);
+
+ ag71xx_wr(ag, AG71XX_REG_TX_DESC, ag->tx_ring.descs_dma);
}
static void ag71xx_ring_rx_clean(struct ag71xx *ag)
@@ -272,6 +274,8 @@ static int ag71xx_ring_rx_init(struct ag71xx *ag)
ring->curr = 0;
ring->dirty = 0;
+ ag71xx_wr(ag, AG71XX_REG_RX_DESC, ag->rx_ring.descs_dma);
+
return ret;
}
@@ -652,9 +656,6 @@ static int ag71xx_open(struct net_device *dev)
netif_carrier_off(dev);
ag71xx_phy_start(ag);
- ag71xx_wr(ag, AG71XX_REG_TX_DESC, ag->tx_ring.descs_dma);
- ag71xx_wr(ag, AG71XX_REG_RX_DESC, ag->rx_ring.descs_dma);
-
ag71xx_hw_set_macaddr(ag, dev->dev_addr);
netif_start_queue(dev);
@@ -873,6 +874,8 @@ static void ag71xx_restart_work_func(struct work_struct *work)
if (ag71xx_get_pdata(ag)->is_ar724x) {
ag->link = 0;
ag71xx_link_adjust(ag);
+ ag71xx_rings_cleanup(ag);
+ ag71xx_rings_init(ag);
return;
}
@@ -919,7 +922,7 @@ static int ag71xx_tx_packets(struct ag71xx *ag, bool flush)
struct sk_buff *skb = ring->buf[i].skb;
if (!flush && !ag71xx_desc_empty(desc)) {
- if (pdata->is_ar7240 &&
+ if (pdata->is_ar724x &&
ag71xx_check_dma_stuck(ag, ring->buf[i].timestamp))
schedule_work(&ag->restart_work);
break;