mirror of
https://github.com/tbsdtv/linux_media.git
synced 2025-07-23 12:43:29 +02:00
xsk: Documentation for XDP_SHARED_UMEM between queues and netdevs
Add documentation for the XDP_SHARED_UMEM feature when a UMEM is shared between different queues and/or netdevs. Signed-off-by: Magnus Karlsson <magnus.karlsson@intel.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Björn Töpel <bjorn.topel@intel.com> Link: https://lore.kernel.org/bpf/1598603189-32145-16-git-send-email-magnus.karlsson@intel.com
This commit is contained in:
committed by
Daniel Borkmann
parent
35149b2c04
commit
acabf32805
@@ -258,14 +258,21 @@ socket into zero-copy mode or fail.
|
|||||||
XDP_SHARED_UMEM bind flag
|
XDP_SHARED_UMEM bind flag
|
||||||
-------------------------
|
-------------------------
|
||||||
|
|
||||||
This flag enables you to bind multiple sockets to the same UMEM, but
|
This flag enables you to bind multiple sockets to the same UMEM. It
|
||||||
only if they share the same queue id. In this mode, each socket has
|
works on the same queue id, between queue ids and between
|
||||||
their own RX and TX rings, but the UMEM (tied to the fist socket
|
netdevs/devices. In this mode, each socket has their own RX and TX
|
||||||
created) only has a single FILL ring and a single COMPLETION
|
rings as usual, but you are going to have one or more FILL and
|
||||||
ring. To use this mode, create the first socket and bind it in the normal
|
COMPLETION ring pairs. You have to create one of these pairs per
|
||||||
way. Create a second socket and create an RX and a TX ring, or at
|
unique netdev and queue id tuple that you bind to.
|
||||||
least one of them, but no FILL or COMPLETION rings as the ones from
|
|
||||||
the first socket will be used. In the bind call, set he
|
Starting with the case were we would like to share a UMEM between
|
||||||
|
sockets bound to the same netdev and queue id. The UMEM (tied to the
|
||||||
|
fist socket created) will only have a single FILL ring and a single
|
||||||
|
COMPLETION ring as there is only on unique netdev,queue_id tuple that
|
||||||
|
we have bound to. To use this mode, create the first socket and bind
|
||||||
|
it in the normal way. Create a second socket and create an RX and a TX
|
||||||
|
ring, or at least one of them, but no FILL or COMPLETION rings as the
|
||||||
|
ones from the first socket will be used. In the bind call, set he
|
||||||
XDP_SHARED_UMEM option and provide the initial socket's fd in the
|
XDP_SHARED_UMEM option and provide the initial socket's fd in the
|
||||||
sxdp_shared_umem_fd field. You can attach an arbitrary number of extra
|
sxdp_shared_umem_fd field. You can attach an arbitrary number of extra
|
||||||
sockets this way.
|
sockets this way.
|
||||||
@@ -305,11 +312,41 @@ concurrently. There are no synchronization primitives in the
|
|||||||
libbpf code that protects multiple users at this point in time.
|
libbpf code that protects multiple users at this point in time.
|
||||||
|
|
||||||
Libbpf uses this mode if you create more than one socket tied to the
|
Libbpf uses this mode if you create more than one socket tied to the
|
||||||
same umem. However, note that you need to supply the
|
same UMEM. However, note that you need to supply the
|
||||||
XSK_LIBBPF_FLAGS__INHIBIT_PROG_LOAD libbpf_flag with the
|
XSK_LIBBPF_FLAGS__INHIBIT_PROG_LOAD libbpf_flag with the
|
||||||
xsk_socket__create calls and load your own XDP program as there is no
|
xsk_socket__create calls and load your own XDP program as there is no
|
||||||
built in one in libbpf that will route the traffic for you.
|
built in one in libbpf that will route the traffic for you.
|
||||||
|
|
||||||
|
The second case is when you share a UMEM between sockets that are
|
||||||
|
bound to different queue ids and/or netdevs. In this case you have to
|
||||||
|
create one FILL ring and one COMPLETION ring for each unique
|
||||||
|
netdev,queue_id pair. Let us say you want to create two sockets bound
|
||||||
|
to two different queue ids on the same netdev. Create the first socket
|
||||||
|
and bind it in the normal way. Create a second socket and create an RX
|
||||||
|
and a TX ring, or at least one of them, and then one FILL and
|
||||||
|
COMPLETION ring for this socket. Then in the bind call, set he
|
||||||
|
XDP_SHARED_UMEM option and provide the initial socket's fd in the
|
||||||
|
sxdp_shared_umem_fd field as you registered the UMEM on that
|
||||||
|
socket. These two sockets will now share one and the same UMEM.
|
||||||
|
|
||||||
|
There is no need to supply an XDP program like the one in the previous
|
||||||
|
case where sockets were bound to the same queue id and
|
||||||
|
device. Instead, use the NIC's packet steering capabilities to steer
|
||||||
|
the packets to the right queue. In the previous example, there is only
|
||||||
|
one queue shared among sockets, so the NIC cannot do this steering. It
|
||||||
|
can only steer between queues.
|
||||||
|
|
||||||
|
In libbpf, you need to use the xsk_socket__create_shared() API as it
|
||||||
|
takes a reference to a FILL ring and a COMPLETION ring that will be
|
||||||
|
created for you and bound to the shared UMEM. You can use this
|
||||||
|
function for all the sockets you create, or you can use it for the
|
||||||
|
second and following ones and use xsk_socket__create() for the first
|
||||||
|
one. Both methods yield the same result.
|
||||||
|
|
||||||
|
Note that a UMEM can be shared between sockets on the same queue id
|
||||||
|
and device, as well as between queues on the same device and between
|
||||||
|
devices at the same time.
|
||||||
|
|
||||||
XDP_USE_NEED_WAKEUP bind flag
|
XDP_USE_NEED_WAKEUP bind flag
|
||||||
-----------------------------
|
-----------------------------
|
||||||
|
|
||||||
@@ -364,7 +401,7 @@ resources by only setting up one of them. Both the FILL ring and the
|
|||||||
COMPLETION ring are mandatory as you need to have a UMEM tied to your
|
COMPLETION ring are mandatory as you need to have a UMEM tied to your
|
||||||
socket. But if the XDP_SHARED_UMEM flag is used, any socket after the
|
socket. But if the XDP_SHARED_UMEM flag is used, any socket after the
|
||||||
first one does not have a UMEM and should in that case not have any
|
first one does not have a UMEM and should in that case not have any
|
||||||
FILL or COMPLETION rings created as the ones from the shared umem will
|
FILL or COMPLETION rings created as the ones from the shared UMEM will
|
||||||
be used. Note, that the rings are single-producer single-consumer, so
|
be used. Note, that the rings are single-producer single-consumer, so
|
||||||
do not try to access them from multiple processes at the same
|
do not try to access them from multiple processes at the same
|
||||||
time. See the XDP_SHARED_UMEM section.
|
time. See the XDP_SHARED_UMEM section.
|
||||||
@@ -567,6 +604,17 @@ A: The short answer is no, that is not supported at the moment. The
|
|||||||
switch, or other distribution mechanism, in your NIC to direct
|
switch, or other distribution mechanism, in your NIC to direct
|
||||||
traffic to the correct queue id and socket.
|
traffic to the correct queue id and socket.
|
||||||
|
|
||||||
|
Q: My packets are sometimes corrupted. What is wrong?
|
||||||
|
|
||||||
|
A: Care has to be taken not to feed the same buffer in the UMEM into
|
||||||
|
more than one ring at the same time. If you for example feed the
|
||||||
|
same buffer into the FILL ring and the TX ring at the same time, the
|
||||||
|
NIC might receive data into the buffer at the same time it is
|
||||||
|
sending it. This will cause some packets to become corrupted. Same
|
||||||
|
thing goes for feeding the same buffer into the FILL rings
|
||||||
|
belonging to different queue ids or netdevs bound with the
|
||||||
|
XDP_SHARED_UMEM flag.
|
||||||
|
|
||||||
Credits
|
Credits
|
||||||
=======
|
=======
|
||||||
|
|
||||||
|
Reference in New Issue
Block a user