Merge tag 'net-next-5.10' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net-next
Pull networking updates from Jakub Kicinski: - Add redirect_neigh() BPF packet redirect helper, allowing to limit stack traversal in common container configs and improving TCP back-pressure. Daniel reports ~10Gbps => ~15Gbps single stream TCP performance gain. - Expand netlink policy support and improve policy export to user space. (Ge)netlink core performs request validation according to declared policies. Expand the expressiveness of those policies (min/max length and bitmasks). Allow dumping policies for particular commands. This is used for feature discovery by user space (instead of kernel version parsing or trial and error). - Support IGMPv3/MLDv2 multicast listener discovery protocols in bridge. - Allow more than 255 IPv4 multicast interfaces. - Add support for Type of Service (ToS) reflection in SYN/SYN-ACK packets of TCPv6. - In Multi-patch TCP (MPTCP) support concurrent transmission of data on multiple subflows in a load balancing scenario. Enhance advertising addresses via the RM_ADDR/ADD_ADDR options. - Support SMC-Dv2 version of SMC, which enables multi-subnet deployments. - Allow more calls to same peer in RxRPC. - Support two new Controller Area Network (CAN) protocols - CAN-FD and ISO 15765-2:2016. - Add xfrm/IPsec compat layer, solving the 32bit user space on 64bit kernel problem. - Add TC actions for implementing MPLS L2 VPNs. - Improve nexthop code - e.g. handle various corner cases when nexthop objects are removed from groups better, skip unnecessary notifications and make it easier to offload nexthops into HW by converting to a blocking notifier. - Support adding and consuming TCP header options by BPF programs, opening the doors for easy experimental and deployment-specific TCP option use. - Reorganize TCP congestion control (CC) initialization to simplify life of TCP CC implemented in BPF. - Add support for shipping BPF programs with the kernel and loading them early on boot via the User Mode Driver mechanism, hence reusing all the user space infra we have. - Support sleepable BPF programs, initially targeting LSM and tracing. - Add bpf_d_path() helper for returning full path for given 'struct path'. - Make bpf_tail_call compatible with bpf-to-bpf calls. - Allow BPF programs to call map_update_elem on sockmaps. - Add BPF Type Format (BTF) support for type and enum discovery, as well as support for using BTF within the kernel itself (current use is for pretty printing structures). - Support listing and getting information about bpf_links via the bpf syscall. - Enhance kernel interfaces around NIC firmware update. Allow specifying overwrite mask to control if settings etc. are reset during update; report expected max time operation may take to users; support firmware activation without machine reboot incl. limits of how much impact reset may have (e.g. dropping link or not). - Extend ethtool configuration interface to report IEEE-standard counters, to limit the need for per-vendor logic in user space. - Adopt or extend devlink use for debug, monitoring, fw update in many drivers (dsa loop, ice, ionic, sja1105, qed, mlxsw, mv88e6xxx, dpaa2-eth). - In mlxsw expose critical and emergency SFP module temperature alarms. Refactor port buffer handling to make the defaults more suitable and support setting these values explicitly via the DCBNL interface. - Add XDP support for Intel's igb driver. - Support offloading TC flower classification and filtering rules to mscc_ocelot switches. - Add PTP support for Marvell Octeontx2 and PP2.2 hardware, as well as fixed interval period pulse generator and one-step timestamping in dpaa-eth. - Add support for various auth offloads in WiFi APs, e.g. SAE (WPA3) offload. - Add Lynx PHY/PCS MDIO module, and convert various drivers which have this HW to use it. Convert mvpp2 to split PCS. - Support Marvell Prestera 98DX3255 24-port switch ASICs, as well as 7-port Mediatek MT7531 IP. - Add initial support for QCA6390 and IPQ6018 in ath11k WiFi driver, and wcn3680 support in wcn36xx. - Improve performance for packets which don't require much offloads on recent Mellanox NICs by 20% by making multiple packets share a descriptor entry. - Move chelsio inline crypto drivers (for TLS and IPsec) from the crypto subtree to drivers/net. Move MDIO drivers out of the phy directory. - Clean up a lot of W=1 warnings, reportedly the actively developed subsections of networking drivers should now build W=1 warning free. - Make sure drivers don't use in_interrupt() to dynamically adapt their code. Convert tasklets to use new tasklet_setup API (sadly this conversion is not yet complete). * tag 'net-next-5.10' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net-next: (2583 commits) Revert "bpfilter: Fix build error with CONFIG_BPFILTER_UMH" net, sockmap: Don't call bpf_prog_put() on NULL pointer bpf, selftest: Fix flaky tcp_hdr_options test when adding addr to lo bpf, sockmap: Add locking annotations to iterator netfilter: nftables: allow re-computing sctp CRC-32C in 'payload' statements net: fix pos incrementment in ipv6_route_seq_next net/smc: fix invalid return code in smcd_new_buf_create() net/smc: fix valid DMBE buffer sizes net/smc: fix use-after-free of delayed events bpfilter: Fix build error with CONFIG_BPFILTER_UMH cxgb4/ch_ipsec: Replace the module name to ch_ipsec from chcr net: sched: Fix suspicious RCU usage while accessing tcf_tunnel_info bpf: Fix register equivalence tracking. rxrpc: Fix loss of final ack on shutdown rxrpc: Fix bundle counting for exclusive connections netfilter: restore NF_INET_NUMHOOKS ibmveth: Identify ingress large send packets. ibmveth: Switch order of ibmveth_helper calls. cxgb4: handle 4-tuple PEDIT to NAT mode translation selftests: Add VRF route leaking tests ...
This commit is contained in:
@@ -712,11 +712,10 @@ EXPORT_SYMBOL(kfree_skb_list);
|
||||
*
|
||||
* Must only be called from net_ratelimit()-ed paths.
|
||||
*
|
||||
* Dumps up to can_dump_full whole packets if full_pkt, headers otherwise.
|
||||
* Dumps whole packets if full_pkt, only headers otherwise.
|
||||
*/
|
||||
void skb_dump(const char *level, const struct sk_buff *skb, bool full_pkt)
|
||||
{
|
||||
static atomic_t can_dump_full = ATOMIC_INIT(5);
|
||||
struct skb_shared_info *sh = skb_shinfo(skb);
|
||||
struct net_device *dev = skb->dev;
|
||||
struct sock *sk = skb->sk;
|
||||
@@ -725,9 +724,6 @@ void skb_dump(const char *level, const struct sk_buff *skb, bool full_pkt)
|
||||
int headroom, tailroom;
|
||||
int i, len, seg_len;
|
||||
|
||||
if (full_pkt)
|
||||
full_pkt = atomic_dec_if_positive(&can_dump_full) >= 0;
|
||||
|
||||
if (full_pkt)
|
||||
len = skb->len;
|
||||
else
|
||||
@@ -895,9 +891,6 @@ void __kfree_skb_defer(struct sk_buff *skb)
|
||||
|
||||
void napi_consume_skb(struct sk_buff *skb, int budget)
|
||||
{
|
||||
if (unlikely(!skb))
|
||||
return;
|
||||
|
||||
/* Zero budget indicate non-NAPI context called us, like netpoll */
|
||||
if (unlikely(!budget)) {
|
||||
dev_consume_skb_any(skb);
|
||||
@@ -5562,6 +5555,73 @@ int skb_vlan_push(struct sk_buff *skb, __be16 vlan_proto, u16 vlan_tci)
|
||||
}
|
||||
EXPORT_SYMBOL(skb_vlan_push);
|
||||
|
||||
/**
|
||||
* skb_eth_pop() - Drop the Ethernet header at the head of a packet
|
||||
*
|
||||
* @skb: Socket buffer to modify
|
||||
*
|
||||
* Drop the Ethernet header of @skb.
|
||||
*
|
||||
* Expects that skb->data points to the mac header and that no VLAN tags are
|
||||
* present.
|
||||
*
|
||||
* Returns 0 on success, -errno otherwise.
|
||||
*/
|
||||
int skb_eth_pop(struct sk_buff *skb)
|
||||
{
|
||||
if (!pskb_may_pull(skb, ETH_HLEN) || skb_vlan_tagged(skb) ||
|
||||
skb_network_offset(skb) < ETH_HLEN)
|
||||
return -EPROTO;
|
||||
|
||||
skb_pull_rcsum(skb, ETH_HLEN);
|
||||
skb_reset_mac_header(skb);
|
||||
skb_reset_mac_len(skb);
|
||||
|
||||
return 0;
|
||||
}
|
||||
EXPORT_SYMBOL(skb_eth_pop);
|
||||
|
||||
/**
|
||||
* skb_eth_push() - Add a new Ethernet header at the head of a packet
|
||||
*
|
||||
* @skb: Socket buffer to modify
|
||||
* @dst: Destination MAC address of the new header
|
||||
* @src: Source MAC address of the new header
|
||||
*
|
||||
* Prepend @skb with a new Ethernet header.
|
||||
*
|
||||
* Expects that skb->data points to the mac header, which must be empty.
|
||||
*
|
||||
* Returns 0 on success, -errno otherwise.
|
||||
*/
|
||||
int skb_eth_push(struct sk_buff *skb, const unsigned char *dst,
|
||||
const unsigned char *src)
|
||||
{
|
||||
struct ethhdr *eth;
|
||||
int err;
|
||||
|
||||
if (skb_network_offset(skb) || skb_vlan_tag_present(skb))
|
||||
return -EPROTO;
|
||||
|
||||
err = skb_cow_head(skb, sizeof(*eth));
|
||||
if (err < 0)
|
||||
return err;
|
||||
|
||||
skb_push(skb, sizeof(*eth));
|
||||
skb_reset_mac_header(skb);
|
||||
skb_reset_mac_len(skb);
|
||||
|
||||
eth = eth_hdr(skb);
|
||||
ether_addr_copy(eth->h_dest, dst);
|
||||
ether_addr_copy(eth->h_source, src);
|
||||
eth->h_proto = skb->protocol;
|
||||
|
||||
skb_postpush_rcsum(skb, eth, sizeof(*eth));
|
||||
|
||||
return 0;
|
||||
}
|
||||
EXPORT_SYMBOL(skb_eth_push);
|
||||
|
||||
/* Update the ethertype of hdr and the skb csum value if required. */
|
||||
static void skb_mod_eth_type(struct sk_buff *skb, struct ethhdr *hdr,
|
||||
__be16 ethertype)
|
||||
@@ -5956,8 +6016,7 @@ static int pskb_carve_inside_nonlinear(struct sk_buff *skb, const u32 off,
|
||||
size = SKB_WITH_OVERHEAD(ksize(data));
|
||||
|
||||
memcpy((struct skb_shared_info *)(data + size),
|
||||
skb_shinfo(skb), offsetof(struct skb_shared_info,
|
||||
frags[skb_shinfo(skb)->nr_frags]));
|
||||
skb_shinfo(skb), offsetof(struct skb_shared_info, frags[0]));
|
||||
if (skb_orphan_frags(skb, gfp_mask)) {
|
||||
kfree(data);
|
||||
return -ENOMEM;
|
||||
|
Reference in New Issue
Block a user