FreeBSD manual
download PDF document: unix.4.pdf
UNIX(4) FreeBSD Kernel Interfaces Manual UNIX(4)
NAME
unix - UNIX-domain protocol family
SYNOPSIS
#include <sys/types.h>
#include <sys/un.h>
DESCRIPTION
The UNIX-domain protocol family is a collection of protocols that
provides local (on-machine) interprocess communication through the normal
socket(2) mechanisms. The UNIX-domain family supports the SOCK_STREAM,
SOCK_SEQPACKET, and SOCK_DGRAM socket types and uses file system
pathnames for addressing.
ADDRESSING
UNIX-domain addresses are variable-length file system pathnames of at
most 104 characters. The include file <sys/un.h> defines this address:
struct sockaddr_un {
u_char sun_len;
u_char sun_family;
char sun_path[104];
};
Binding a name to a UNIX-domain socket with bind(2) causes a socket file
to be created in the file system. This file is not removed when the
socket is closed -- unlink(2) must be used to remove the file.
The length of UNIX-domain address, required by bind(2) and connect(2),
can be calculated by the macro SUN_LEN() defined in <sys/un.h>. The
sun_path field must be terminated by a NUL character to be used with
SUN_LEN(), but the terminating NUL is not part of the address.
The UNIX-domain protocol family does not support broadcast addressing or
any form of "wildcard" matching on incoming messages. All addresses are
absolute- or relative-pathnames of other UNIX-domain sockets. Normal
file system access-control mechanisms are also applied when referencing
pathnames; e.g., the destination of a connect(2) or sendto(2) must be
writable.
CONTROL MESSAGES
The UNIX-domain sockets support the communication of UNIX file
descriptors and process credentials through the use of the msg_control
field in the msg argument to sendmsg(2) and recvmsg(2). The items to be
passed are described using a struct cmsghdr that is defined in the
include file <sys/socket.h>.
To send file descriptors, the type of the message is SCM_RIGHTS, and the
data portion of the messages is an array of integers representing the
file descriptors to be passed. The number of descriptors being passed is
defined by the length field of the message; the length field is the sum
of the size of the header plus the size of the array of file descriptors.
The received descriptor is a duplicate of the sender's descriptor, as if
it were created via dup(fd) or fcntl(fd, F_DUPFD_CLOEXEC, 0) depending on
whether MSG_CMSG_CLOEXEC is passed in the recvmsg(2) call. Descriptors
that are awaiting delivery, or that are purposely not received, are
pid_t cmcred_pid; /* PID of sending process */
uid_t cmcred_uid; /* real UID of sending process */
uid_t cmcred_euid; /* effective UID of sending process */
gid_t cmcred_gid; /* real GID of sending process */
short cmcred_ngroups; /* number of groups */
gid_t cmcred_groups[CMGROUP_MAX]; /* groups */
};
The sender should pass a zeroed buffer which will be filled in by the
system.
The group list is truncated to at most CMGROUP_MAX GIDs.
The process ID cmcred_pid should not be looked up (such as via the
KERN_PROC_PID sysctl) for making security decisions. The sending process
could have exited and its process ID already been reused for a new
process.
SOCKET OPTIONS
UNIX domain sockets support a number of socket options for the options
level SOL_LOCAL, which can be set with setsockopt(2) and tested with
getsockopt(2):
LOCAL_CREDS This option may be enabled on SOCK_DGRAM,
SOCK_SEQPACKET, or a SOCK_STREAM socket. This
option provides a mechanism for the receiver to
receive the credentials of the process calling
write(2), send(2), sendto(2) or sendmsg(2) as a
recvmsg(2) control message. The msg_control
field in the msghdr structure points to a buffer
that contains a cmsghdr structure followed by a
variable length sockcred structure, defined in
<sys/socket.h> as follows:
struct sockcred {
uid_t sc_uid; /* real user id */
uid_t sc_euid; /* effective user id */
gid_t sc_gid; /* real group id */
gid_t sc_egid; /* effective group id */
int sc_ngroups; /* number of supplemental groups */
gid_t sc_groups[1]; /* variable length */
};
The current implementation truncates the group
list to at most CMGROUP_MAX groups.
The SOCKCREDSIZE() macro computes the size of the
sockcred structure for a specified number of
groups. The cmsghdr fields have the following
values:
cmsg_len = CMSG_LEN(SOCKCREDSIZE(ngroups))
cmsg_level = SOL_SOCKET
cmsg_type = SCM_CREDS
On SOCK_STREAM and SOCK_SEQPACKET sockets
credentials are passed only on the first read
from a socket, then the system clears the option
on the socket.
included.
Many setuid programs will write(2) data at least
partially controlled by the invoker, such as
error messages. Therefore, a message accompanied
by a particular sc_euid value should not be
trusted as being from that user.
LOCAL_CREDS_PERSISTENT This option is similar to LOCAL_CREDS, except
that socket credentials are passed on every read
from a SOCK_STREAM or SOCK_SEQPACKET socket,
instead of just the first read. Additionally,
the msg_control field in the msghdr structure
points to a buffer that contains a cmsghdr
structure followed by a variable length sockcred2
structure, defined in <sys/socket.h> as follows:
struct sockcred2 {
int sc_version; /* version of this structure */
pid_t sc_pid; /* PID of sending process */
uid_t sc_uid; /* real user id */
uid_t sc_euid; /* effective user id */
gid_t sc_gid; /* real group id */
gid_t sc_egid; /* effective group id */
int sc_ngroups; /* number of supplemental groups */
gid_t sc_groups[1]; /* variable length */
};
The current version is zero.
The cmsghdr fields have the following values:
cmsg_len = CMSG_LEN(SOCKCRED2SIZE(ngroups))
cmsg_level = SOL_SOCKET
cmsg_type = SCM_CREDS2
The LOCAL_CREDS and LOCAL_CREDS_PERSISTENT
options are mutually exclusive.
LOCAL_CONNWAIT Used with SOCK_STREAM sockets, this option causes
the connect(2) function to block until accept(2)
has been called on the listening socket.
LOCAL_PEERCRED Requested via getsockopt(2) on a SOCK_STREAM or
SOCK_SEQPACKET socket returns credentials of the
remote side. These will arrive in the form of a
filled in xucred structure, defined in
<sys/ucred.h> as follows:
struct xucred {
u_int cr_version; /* structure layout version */
uid_t cr_uid; /* effective user id */
short cr_ngroups; /* number of groups */
gid_t cr_groups[XU_NGROUPS]; /* groups */
pid_t cr_pid; /* process id of the sending process */
};
The cr_version fields should be checked against
XUCRED_VERSION define.
its peer except by calling the appropriate system
call (e.g., connect(2) or listen(2)) under
different effective credentials.
To reliably obtain peer credentials on a
SOCK_DGRAM socket refer to the LOCAL_CREDS socket
option.
BUFFERING
Due to the local nature of the UNIX-domain sockets, they do not implement
send buffers. The send(2) and write(2) families of system calls attempt
to write data to the receive buffer of the destination socket.
The default buffer sizes for SOCK_STREAM and SOCK_SEQPACKET UNIX-domain
sockets can be configured with net.local.stream and net.local.seqpacket
branches of sysctl(3) MIB respectively. Note that setting the send
buffer size (sendspace) affects only the maximum write size.
The UNIX-domain sockets of type SOCK_DGRAM are unreliable and always non-
blocking for write operations. The default receive buffer can be
configured with net.local.dgram.recvspace. The maximum allowed datagram
size is limited by net.local.dgram.maxdgram. A SOCK_DGRAM socket that
has been bound with bind(2) can have multiple peers connected at the same
time. The modern FreeBSD implementation will allocate
net.local.dgram.recvspace sized private buffers in the receive buffer of
the bound socket for every connected socket, preventing a situation when
a single writer can exhaust all of buffer space. Messages coming from
unconnected sends using sendto(2) land on the shared buffer of the
receiving socket, which has the same size limit. A side effect of the
implementation is that it doesn't guarantee that writes from different
senders will arrive at the receiver in the same chronological order they
were sent. The order is preserved for writes coming through a particular
connection.
SEE ALSO
connect(2), dup(2), fcntl(2), getsockopt(2), listen(2), recvmsg(2),
sendto(2), setsockopt(2), socket(2), CMSG_DATA(3), intro(4), sysctl(8)
"An Introductory 4.3 BSD Interprocess Communication Tutorial", PS1, 7.
"An Advanced 4.3 BSD Interprocess Communication Tutorial", PS1, 8.
FreeBSD 14.0-RELEASE-p11 June 24, 2022 FreeBSD 14.0-RELEASE-p11