* [PATCH 6.6.y] xfrm: hold dev ref until after transport_finish NF_HOOK
@ 2026-06-08 8:24 Simon Liebold
2026-06-09 0:51 ` Sasha Levin
0 siblings, 1 reply; 3+ messages in thread
From: Simon Liebold @ 2026-06-08 8:24 UTC (permalink / raw)
To: Steffen Klassert, Herbert Xu, David S . Miller, David Ahern,
Eric Dumazet, Jakub Kicinski, Paolo Abeni, Simon Horman, netdev,
linux-kernel, stable, Simon Liebold
Cc: Qi Tang, Florian Westphal, Simon Liebold
From: Qi Tang <tpluszz77@gmail•com>
[ Upstream commit 1c428b03840094410c5fb6a5db30640486bbbfcb ]
After async crypto completes, xfrm_input_resume() calls dev_put()
immediately on re-entry before the skb reaches transport_finish.
The skb->dev pointer is then used inside NF_HOOK and its okfn,
which can race with device teardown.
Remove the dev_put from the async resumption entry and instead
drop the reference after the NF_HOOK call in transport_finish,
using a saved device pointer since NF_HOOK may consume the skb.
This covers NF_DROP, NF_QUEUE and NF_STOLEN paths that skip
the okfn.
For non-transport exits (decaps, gro, drop) and secondary
async return points, release the reference inline when
async is set.
Suggested-by: Florian Westphal <fw@strlen•de>
Fixes: acf568ee859f ("xfrm: Reinject transport-mode packets through tasklet")
Cc: stable@vger•kernel.org
Signed-off-by: Qi Tang <tpluszz77@gmail•com>
Signed-off-by: Steffen Klassert <steffen.klassert@secunet•com>
[ net/xfrm/xfrm_input.c: dev_hold/dev_put are unconditional here rather
than inside !crypto_done as in mainline, and the dev_put in the
encap_type == -1 async-resumption block does not exist; adapted by
gating dev_put at resume: with if (!async), adding if (async) dev_put
at -EINPROGRESS return, gro_cells_receive paths, and drop label. ]
Signed-off-by: Simon Liebold <simonlie@amazon•de>
---
net/ipv4/xfrm4_input.c | 5 ++++-
net/ipv6/xfrm6_input.c | 5 ++++-
net/xfrm/xfrm_input.c | 14 ++++++++++++--
3 files changed, 20 insertions(+), 4 deletions(-)
diff --git a/net/ipv4/xfrm4_input.c b/net/ipv4/xfrm4_input.c
index f6e90ba50b63..d05e8ff9c37b 100644
--- a/net/ipv4/xfrm4_input.c
+++ b/net/ipv4/xfrm4_input.c
@@ -48,6 +48,7 @@ int xfrm4_transport_finish(struct sk_buff *skb, int async)
{
struct xfrm_offload *xo = xfrm_offload(skb);
struct iphdr *iph = ip_hdr(skb);
+ struct net_device *dev = skb->dev;
iph->protocol = XFRM_MODE_SKB_CB(skb)->protocol;
@@ -71,8 +72,10 @@ int xfrm4_transport_finish(struct sk_buff *skb, int async)
}
NF_HOOK(NFPROTO_IPV4, NF_INET_PRE_ROUTING,
- dev_net(skb->dev), NULL, skb, skb->dev, NULL,
+ dev_net(dev), NULL, skb, dev, NULL,
xfrm4_rcv_encap_finish);
+ if (async)
+ dev_put(dev);
return 0;
}
diff --git a/net/ipv6/xfrm6_input.c b/net/ipv6/xfrm6_input.c
index 8432b50d9ce4..b2e328de620b 100644
--- a/net/ipv6/xfrm6_input.c
+++ b/net/ipv6/xfrm6_input.c
@@ -41,6 +41,7 @@ static int xfrm6_transport_finish2(struct net *net, struct sock *sk,
int xfrm6_transport_finish(struct sk_buff *skb, int async)
{
struct xfrm_offload *xo = xfrm_offload(skb);
+ struct net_device *dev = skb->dev;
int nhlen = skb->data - skb_network_header(skb);
skb_network_header(skb)[IP6CB(skb)->nhoff] =
@@ -66,8 +67,10 @@ int xfrm6_transport_finish(struct sk_buff *skb, int async)
}
NF_HOOK(NFPROTO_IPV6, NF_INET_PRE_ROUTING,
- dev_net(skb->dev), NULL, skb, skb->dev, NULL,
+ dev_net(dev), NULL, skb, dev, NULL,
xfrm6_transport_finish2);
+ if (async)
+ dev_put(dev);
return 0;
}
diff --git a/net/xfrm/xfrm_input.c b/net/xfrm/xfrm_input.c
index 0c08bac3ed26..f79fc5d13e60 100644
--- a/net/xfrm/xfrm_input.c
+++ b/net/xfrm/xfrm_input.c
@@ -636,10 +636,14 @@ int xfrm_input(struct sk_buff *skb, int nexthdr, __be32 spi, int encap_type)
else
nexthdr = x->type->input(x, skb);
- if (nexthdr == -EINPROGRESS)
+ if (nexthdr == -EINPROGRESS) {
+ if (async)
+ dev_put(skb->dev);
return 0;
+ }
resume:
- dev_put(skb->dev);
+ if (!async)
+ dev_put(skb->dev);
spin_lock(&x->lock);
if (nexthdr < 0) {
@@ -707,6 +711,8 @@ int xfrm_input(struct sk_buff *skb, int nexthdr, __be32 spi, int encap_type)
sp->olen = 0;
if (skb_valid_dst(skb))
skb_dst_drop(skb);
+ if (async)
+ dev_put(skb->dev);
gro_cells_receive(&gro_cells, skb);
return 0;
} else {
@@ -726,6 +732,8 @@ int xfrm_input(struct sk_buff *skb, int nexthdr, __be32 spi, int encap_type)
sp->olen = 0;
if (skb_valid_dst(skb))
skb_dst_drop(skb);
+ if (async)
+ dev_put(skb->dev);
gro_cells_receive(&gro_cells, skb);
return err;
}
@@ -736,6 +744,8 @@ int xfrm_input(struct sk_buff *skb, int nexthdr, __be32 spi, int encap_type)
drop_unlock:
spin_unlock(&x->lock);
drop:
+ if (async)
+ dev_put(skb->dev);
xfrm_rcv_cb(skb, family, x && x->type ? x->type->proto : nexthdr, -1);
kfree_skb(skb);
return 0;
base-commit: 0a40c6fbd105802fbbcaadca249e0948fbf8095a
--
2.50.1
Amazon Web Services Development Center Germany GmbH
Tamara-Danz-Str. 13
10243 Berlin
Geschaeftsfuehrung: Christof Hellmis, Andreas Stieger
Eingetragen am Amtsgericht Charlottenburg unter HRB 257764 B
Sitz: Berlin
Ust-ID: DE 365 538 597
^ permalink raw reply related [flat|nested] 3+ messages in thread* Re: [PATCH 6.6.y] xfrm: hold dev ref until after transport_finish NF_HOOK
2026-06-08 8:24 [PATCH 6.6.y] xfrm: hold dev ref until after transport_finish NF_HOOK Simon Liebold
@ 2026-06-09 0:51 ` Sasha Levin
2026-06-09 11:33 ` Simon Liebold
0 siblings, 1 reply; 3+ messages in thread
From: Sasha Levin @ 2026-06-09 0:51 UTC (permalink / raw)
To: Steffen Klassert, Herbert Xu, David S . Miller, David Ahern,
Eric Dumazet, Jakub Kicinski, Paolo Abeni, Simon Horman, netdev,
linux-kernel, stable, Simon Liebold
Cc: Sasha Levin, Qi Tang, Florian Westphal, Simon Liebold
> [PATCH 6.6.y] xfrm: hold dev ref until after transport_finish NF_HOOK
I'm holding all four of these (6.6, 6.1, 5.15 and 5.10) for now.
As adapted, the backport leaks a netdev reference on the nested transport-mode
path where both an async and a sync decapsulation happen: the inner dev_hold is
balanced by a dev_put that the older trees don't have, so the saved reference
is never released. Mainline avoids this because it has b05d42eefac7 ("xfrm:
hold device only for the asynchronous decryption") as a prerequisite.
--
Thanks,
Sasha
^ permalink raw reply [flat|nested] 3+ messages in thread* Re: [PATCH 6.6.y] xfrm: hold dev ref until after transport_finish NF_HOOK
2026-06-09 0:51 ` Sasha Levin
@ 2026-06-09 11:33 ` Simon Liebold
0 siblings, 0 replies; 3+ messages in thread
From: Simon Liebold @ 2026-06-09 11:33 UTC (permalink / raw)
To: Sasha Levin, Steffen Klassert, Herbert Xu, David S . Miller,
David Ahern, Eric Dumazet, Jakub Kicinski, Paolo Abeni,
Simon Horman, netdev, linux-kernel, stable, Simon Liebold
Cc: Sasha Levin, Qi Tang, Florian Westphal
Sasha Levin <sashal@kernel•org> writes:
>> [PATCH 6.6.y] xfrm: hold dev ref until after transport_finish NF_HOOK
> I'm holding all four of these (6.6, 6.1, 5.15 and 5.10) for now.
You probably need to hold the backport I sent for 6.12 too:
https://lore.kernel.org/all/20260605141254.1177152-1-simonlie@amazon.de/
> As adapted, the backport leaks a netdev reference on the nested transport-mode
> path where both an async and a sync decapsulation happen: the inner dev_hold is
> balanced by a dev_put that the older trees don't have, so the saved reference
> is never released. Mainline avoids this because it has b05d42eefac7 ("xfrm:
> hold device only for the asynchronous decryption") as a prerequisite.
Sorry I missed this and thanks for catching. I'll send v2 for this.
- Simon
Amazon Web Services Development Center Germany GmbH
Tamara-Danz-Str. 13
10243 Berlin
Geschaeftsfuehrung: Christof Hellmis, Andreas Stieger
Eingetragen am Amtsgericht Charlottenburg unter HRB 257764 B
Sitz: Berlin
Ust-ID: DE 365 538 597
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2026-06-09 11:33 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-06-08 8:24 [PATCH 6.6.y] xfrm: hold dev ref until after transport_finish NF_HOOK Simon Liebold
2026-06-09 0:51 ` Sasha Levin
2026-06-09 11:33 ` Simon Liebold
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox