.\" Automatically generated by Pod::Man 2.28 (Pod::Simple 3.30)
.\"
.\" Standard preamble:
.\" ========================================================================
.de Sp \" Vertical space (when we can't use .PP)
.if t .sp .5v
.if n .sp
..
.de Vb \" Begin verbatim text
.ft CW
.nf
.ne \\$1
..
.de Ve \" End verbatim text
.ft R
.fi
..
.\" Set up some character translations and predefined strings.  \*(-- will
.\" give an unbreakable dash, \*(PI will give pi, \*(L" will give a left
.\" double quote, and \*(R" will give a right double quote.  \*(C+ will
.\" give a nicer C++.  Capital omega is used to do unbreakable dashes and
.\" therefore won't be available.  \*(C` and \*(C' expand to `' in nroff,
.\" nothing in troff, for use with C<>.
.tr \(*W-
.ds C+ C\v'-.1v'\h'-1p'\s-2+\h'-1p'+\s0\v'.1v'\h'-1p'
.ie n \{\
.    ds -- \(*W-
.    ds PI pi
.    if (\n(.H=4u)&(1m=24u) .ds -- \(*W\h'-12u'\(*W\h'-12u'-\" diablo 10 pitch
.    if (\n(.H=4u)&(1m=20u) .ds -- \(*W\h'-12u'\(*W\h'-8u'-\"  diablo 12 pitch
.    ds L" ""
.    ds R" ""
.    ds C` 
.    ds C' 
'br\}
.el\{\
.    ds -- \|\(em\|
.    ds PI \(*p
.    ds L" ``
.    ds R" ''
.    ds C`
.    ds C'
'br\}
.\"
.\" Escape single quotes in literal strings from groff's Unicode transform.
.ie \n(.g .ds Aq \(aq
.el       .ds Aq '
.\"
.\" If the F register is turned on, we'll generate index entries on stderr for
.\" titles (.TH), headers (.SH), subsections (.SS), items (.Ip), and index
.\" entries marked with X<> in POD.  Of course, you'll have to process the
.\" output yourself in some meaningful fashion.
.\"
.\" Avoid warning from groff about undefined register 'F'.
.de IX
..
.nr rF 0
.if \n(.g .if rF .nr rF 1
.if (\n(rF:(\n(.g==0)) \{
.    if \nF \{
.        de IX
.        tm Index:\\$1\t\\n%\t"\\$2"
..
.        if !\nF==2 \{
.            nr % 0
.            nr F 2
.        \}
.    \}
.\}
.rr rF
.\"
.\" Accent mark definitions (@(#)ms.acc 1.5 88/02/08 SMI; from UCB 4.2).
.\" Fear.  Run.  Save yourself.  No user-serviceable parts.
.    \" fudge factors for nroff and troff
.if n \{\
.    ds #H 0
.    ds #V .8m
.    ds #F .3m
.    ds #[ \f1
.    ds #] \fP
.\}
.if t \{\
.    ds #H ((1u-(\\\\n(.fu%2u))*.13m)
.    ds #V .6m
.    ds #F 0
.    ds #[ \&
.    ds #] \&
.\}
.    \" simple accents for nroff and troff
.if n \{\
.    ds ' \&
.    ds ` \&
.    ds ^ \&
.    ds , \&
.    ds ~ ~
.    ds /
.\}
.if t \{\
.    ds ' \\k:\h'-(\\n(.wu*8/10-\*(#H)'\'\h"|\\n:u"
.    ds ` \\k:\h'-(\\n(.wu*8/10-\*(#H)'\`\h'|\\n:u'
.    ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'^\h'|\\n:u'
.    ds , \\k:\h'-(\\n(.wu*8/10)',\h'|\\n:u'
.    ds ~ \\k:\h'-(\\n(.wu-\*(#H-.1m)'~\h'|\\n:u'
.    ds / \\k:\h'-(\\n(.wu*8/10-\*(#H)'\z\(sl\h'|\\n:u'
.\}
.    \" troff and (daisy-wheel) nroff accents
.ds : \\k:\h'-(\\n(.wu*8/10-\*(#H+.1m+\*(#F)'\v'-\*(#V'\z.\h'.2m+\*(#F'.\h'|\\n:u'\v'\*(#V'
.ds 8 \h'\*(#H'\(*b\h'-\*(#H'
.ds o \\k:\h'-(\\n(.wu+\w'\(de'u-\*(#H)/2u'\v'-.3n'\*(#[\z\(de\v'.3n'\h'|\\n:u'\*(#]
.ds d- \h'\*(#H'\(pd\h'-\w'~'u'\v'-.25m'\f2\(hy\fP\v'.25m'\h'-\*(#H'
.ds D- D\\k:\h'-\w'D'u'\v'-.11m'\z\(hy\v'.11m'\h'|\\n:u'
.ds th \*(#[\v'.3m'\s+1I\s-1\v'-.3m'\h'-(\w'I'u*2/3)'\s-1o\s+1\*(#]
.ds Th \*(#[\s+2I\s-2\h'-\w'I'u*3/5'\v'-.3m'o\v'.3m'\*(#]
.ds ae a\h'-(\w'a'u*4/10)'e
.ds Ae A\h'-(\w'A'u*4/10)'E
.    \" corrections for vroff
.if v .ds ~ \\k:\h'-(\\n(.wu*9/10-\*(#H)'\s-2\u~\d\s+2\h'|\\n:u'
.if v .ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'\v'-.4m'^\v'.4m'\h'|\\n:u'
.    \" for low resolution devices (crt and lpr)
.if \n(.H>23 .if \n(.V>19 \
\{\
.    ds : e
.    ds 8 ss
.    ds o a
.    ds d- d\h'-1'\(ga
.    ds D- D\h'-1'\(hy
.    ds th \o'bp'
.    ds Th \o'LP'
.    ds ae ae
.    ds Ae AE
.\}
.rm #[ #] #H #V #F C
.\" ========================================================================
.\"
.IX Title "GVPE.PROTOCOL 7"
.TH GVPE.PROTOCOL 7 "2015-10-31" "2.25" "GNU Virtual Private Ethernet"
.\" For nroff, turn off justification.  Always turn off hyphenation; it makes
.\" way too many mistakes in technical documents.
.if n .ad l
.nh
.SH "The GNU-VPE Protocols"
.IX Header "The GNU-VPE Protocols"
.SH "Overview"
.IX Header "Overview"
\&\s-1GVPE\s0 can make use of a number of protocols. One of them is the \s-1GNU VPE\s0
protocol which is used to authenticate tunnels and send encrypted data
packets. This protocol is described in more detail in the second part of this
document.
.PP
The first part of this document describes the transport protocols which
are used by \s-1GVPE\s0 to send its data packets over the network.
.SH "PART 1: Transport protocols"
.IX Header "PART 1: Transport protocols"
\&\s-1GVPE\s0 offers a wide range of transport protocols that can be used to
interchange data between nodes. Protocols differ in their overhead, speed,
reliability, and robustness.
.PP
The following sections describe each transport protocol in more
detail. They are sorted by overhead, with the most efficient
transport listed first:
.SS "\s-1RAW IP\s0"
.IX Subsection "RAW IP"
This protocol is the best choice, performance-wise, as the minimum
overhead per packet is only 38 bytes.
.PP
It works by sending the \s-1VPN\s0 payload using raw \s-1IP\s0 frames (using the
protocol set by \f(CW\*(C`ip\-proto\*(C'\fR).
.PP
Using raw \s-1IP\s0 frames has the drawback that many firewalls block \*(L"unknown\*(R"
protocols, so this transport only works if you have full \s-1IP\s0 connectivity
between nodes.
.SS "\s-1ICMP\s0"
.IX Subsection "ICMP"
This protocol offers very low overhead (minimum 42 bytes), and can
sometimes tunnel through firewalls when other protocols cannot.
.PP
It works by prepending an \s-1ICMP\s0 header with type \f(CW\*(C`icmp\-type\*(C'\fR and a code
of \f(CW255\fR. The default \f(CW\*(C`icmp\-type\*(C'\fR is \f(CW\*(C`echo\-reply\*(C'\fR, so the resulting
packets look like echo replies, which look rather strange to network
administrators.
.PP
This transport should only be used if other transports (i.e. raw \s-1IP\s0) are
unavailable or undesirable (due to their overhead).
.SS "\s-1UDP\s0"
.IX Subsection "UDP"
This is a good general choice for the transport protocol as \s-1UDP\s0 packets
tunnel well through most firewalls and routers, and the overhead per
packet is moderate (minimum 58 bytes).
.PP
It should be used if \s-1RAW IP\s0 is not available.
.SS "\s-1TCP\s0"
.IX Subsection "TCP"
This protocol is a very bad choice, as it not only has high overhead (more
than 60 bytes), but the transport also retries on its own, which leads
to congestion when the link has moderate packet loss (as both the \s-1TCP\s0
transport and the tunneled traffic will retry, increasing congestion more
and more). It also has high latency and is quite inefficient.
.PP
It's only useful when tunneling through firewalls that block better
protocols. If a node doesn't have direct internet access but can reach an \s-1HTTP\s0
proxy that supports the \s-1CONNECT\s0 method, this transport can tunnel through
that proxy. For this to work, the \f(CW\*(C`tcp\-port\*(C'\fR should be \f(CW443\fR (\f(CW\*(C`https\*(C'\fR), as
most proxies do not allow connections to other ports.
.PP
It is an abuse of the usage a proxy was designed for, so make sure you are
allowed to use it for \s-1GVPE.\s0
.PP
This protocol also has server and client sides. If the \f(CW\*(C`tcp\-port\*(C'\fR is
set to zero, other nodes cannot connect to this node directly. If the
\&\f(CW\*(C`tcp\-port\*(C'\fR is non-zero, the node can act both as a client and as a
server.
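.PP
As a rough illustration of the minimum overhead figures quoted in the
sections above, the following Python sketch computes which fraction of the
bytes on the wire is payload. The dictionary keys and the function name
\f(CWefficiency\fR are hypothetical, and the \s-1TCP\s0 value is only the documented
lower bound (\*(L"more than 60 bytes\*(R").
.PP
.Vb 12
```python
# Minimum per-packet overhead in bytes, as quoted in the text above.
# TCP is documented only as "more than 60", so 60 is a lower bound.
MIN_OVERHEAD = {"rawip": 38, "icmp": 42, "udp": 58, "tcp": 60}


def efficiency(payload_len, transport):
    """Return the fraction of bytes on the wire that are payload."""
    overhead = MIN_OVERHEAD[transport]
    return payload_len / (payload_len + overhead)
```
.Ve
.PP
For small payloads the difference between transports is large, while for
payloads near the \s-1MTU\s0 it shrinks to a few percent.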
.SS "\s-1DNS\s0"
.IX Subsection "DNS"
\&\fB\s-1WARNING:\s0\fR Parsing and generating \s-1DNS\s0 packets is rather tricky. The code
almost certainly contains buffer overflows and other, likely exploitable,
bugs. You have been warned.
.PP
This is the worst choice of transport protocol with respect to overhead
(overhead can be 2\-3 times higher than the transferred data), and latency
(which can be many seconds). Some \s-1DNS\s0 servers might not be prepared to
handle the traffic and drop or corrupt packets. The client also has to
constantly poll the server for data, so the client will constantly create
traffic even if it doesn't need to transport packets.
.PP
In addition, the same problems that affect the \s-1TCP\s0 transport also plague
this protocol.
.PP
Its only use is to tunnel through firewalls that do not allow direct
internet access. Similar to using an \s-1HTTP\s0 proxy (as the \s-1TCP\s0 transport
does), it uses a local \s-1DNS\s0 server/forwarder (given by the \f(CW\*(C`dns\-forw\-host\*(C'\fR
configuration value) as a proxy to send and receive data as a client,
and an \f(CW\*(C`NS\*(C'\fR record pointing to the \s-1GVPE\s0 server (as given by the
\&\f(CW\*(C`dns\-hostname\*(C'\fR directive).
.PP
The only good side of this protocol is that it can tunnel through most
firewalls mostly undetected, iff the local \s-1DNS\s0 server/forwarder is sane
(which is true for most routers, wireless \s-1LAN\s0 gateways and nameservers).
.PP
Fine-tuning needs to be done by editing \f(CW\*(C`src/vpn_dns.C\*(C'\fR directly.
.SH "PART 2: The GNU VPE protocol"
.IX Header "PART 2: The GNU VPE protocol"
This section, unfortunately, is not yet finished, although the protocol
is stable (until bugs in the cryptography are found, which will likely
completely change the following description). Nevertheless, it should give
you an overview of the protocol.
.SS "Anatomy of a \s-1VPN\s0 packet"
.IX Subsection "Anatomy of a VPN packet"
The exact layout and field lengths of a \s-1VPN\s0 packet are determined at
compile time and don't change. The same structure is used for all
transport protocols, be it \s-1RAW IP\s0 or \s-1TCP.\s0
.PP
.Vb 3
\& +\-\-\-\-\-\-+\-\-\-\-\-\-+\-\-\-\-\-\-\-\-+\-\-\-\-\-\-+
\& | HMAC | TYPE | SRCDST | DATA |
\& +\-\-\-\-\-\-+\-\-\-\-\-\-+\-\-\-\-\-\-\-\-+\-\-\-\-\-\-+
.Ve
.PP
The \s-1HMAC\s0 field is present in all packets, even if not used (e.g. in auth
request packets), in which case it is set to all zeroes. The \s-1MAC\s0 itself is
calculated over the \s-1TYPE, SRCDST\s0 and \s-1DATA\s0 fields in all cases.
.PP
The \s-1TYPE\s0 field is a single byte and determines the purpose of the packet
(e.g. \s-1RESET, COMPRESSED/UNCOMPRESSED DATA, PING, AUTH REQUEST/RESPONSE,
CONNECT REQUEST/INFO\s0 etc.).
.PP
\&\s-1SRCDST\s0 is a three byte field which contains the source and destination
node IDs (12 bits each).
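.PP
The text above fixes only the sizes (two 12\-bit IDs in three bytes), not
the bit order. The following Python sketch shows one possible packing, with
the source \s-1ID\s0 in the upper 12 bits and big-endian byte order. Both choices
are assumptions made for illustration, and the function names are
hypothetical.
.PP
.Vb 14
```python
def pack_srcdst(src, dst):
    """Pack two 12-bit node IDs into three bytes (assumed layout)."""
    if not (0 <= src < 4096 and 0 <= dst < 4096):
        raise ValueError("node IDs must fit in 12 bits")
    return ((src << 12) | dst).to_bytes(3, "big")


def unpack_srcdst(field):
    """Inverse of pack_srcdst: return the tuple (src, dst)."""
    value = int.from_bytes(field, "big")
    return value >> 12, value & 0xFFF
```
.Ve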
.PP
The \s-1DATA\s0 portion differs between each packet type, naturally, and is the
only part that can be encrypted. Data packets contain more fields, as
shown:
.PP
.Vb 3
\& +\-\-\-\-\-\-+\-\-\-\-\-\-+\-\-\-\-\-\-\-\-+\-\-\-\-\-\-\-+\-\-\-\-\-\-+
\& | HMAC | TYPE | SRCDST | SEQNO | DATA |
\& +\-\-\-\-\-\-+\-\-\-\-\-\-+\-\-\-\-\-\-\-\-+\-\-\-\-\-\-\-+\-\-\-\-\-\-+
.Ve
.PP
\&\s-1SEQNO\s0 is a 32\-bit sequence number. It is negotiated at every connection
initialization and starts at some random 31\-bit value. \s-1GVPE\s0 currently uses
a sliding window of 512 packets/sequence numbers to detect reordering,
duplication and replay attacks.
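.PP
A sliding window of this kind can be sketched as follows. This is a generic
anti-replay window written for illustration, not code taken from the \s-1GVPE\s0
sources. The class and method names are hypothetical, and 32\-bit wrap-around
of the sequence number is ignored for brevity.
.PP
.Vb 26
```python
WINDOW = 512  # window size quoted in the text


class ReplayWindow:
    """Generic anti-replay window; not taken from the GVPE sources."""

    def __init__(self, next_seqno):
        self.highest = next_seqno - 1  # highest number accepted so far
        self.seen = set()              # accepted numbers inside the window

    def accept(self, seqno):
        """Return True if seqno is new and still inside the window."""
        if seqno <= self.highest - WINDOW:
            return False               # too old to be tracked
        if seqno in self.seen:
            return False               # duplicate or replayed packet
        self.seen.add(seqno)
        if seqno > self.highest:
            self.highest = seqno
            # forget numbers that have fallen out of the window
            limit = self.highest - WINDOW
            self.seen = {s for s in self.seen if s > limit}
        return True
```
.Ve
.PP
Reordered packets are accepted as long as they arrive within 512 sequence
numbers of the newest packet seen. Anything older, or anything already
seen, is rejected.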
.PP
The encryption is done on \s-1SEQNO+DATA\s0 in \s-1CTR\s0 mode with \s-1IV\s0 generated from
the seqno (for \s-1AES:\s0 seqno || seqno || seqno || (u32)0), which ensures
uniqueness for a given key.
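.PP
For \s-1AES,\s0 the construction above yields a 16\-byte block. A minimal Python
sketch follows. The byte order of each 32\-bit word is not stated in this
document, so network byte order is an assumption here, and the function name
is hypothetical.
.PP
.Vb 9
```python
import struct


def aes_ctr_iv(seqno):
    """Build the 16-byte block seqno || seqno || seqno || (u32)0."""
    # Assumption: each 32-bit word is stored in network byte order.
    return struct.pack(">IIII", seqno, seqno, seqno, 0)
```
.Ve
.PP
Because the sequence number never repeats under a given key, neither does
the resulting block.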
.SS "The authentication/key exchange protocol"
.IX Subsection "The authentication/key exchange protocol"
Before nodes can exchange packets, they need to establish authenticity of
the other side and a key. Every node has a private \s-1RSA\s0 key and the public
\&\s-1RSA\s0 keys of all other nodes.
.PP
When a node wants to establish a connection to another node, it sends an
RSA-OAEP-encrypted challenge and an \s-1ECDH \s0(curve25519) key. The other node
replies with its own \s-1ECDH\s0 key and an \s-1HKDF\s0 of the challenge and both \s-1ECDH\s0
keys to prove its identity.
.PP
The remote node engages in exactly the same protocol. When both nodes
have exchanged their challenges and verified the responses, they calculate a
cipher key and an \s-1HMAC\s0 key and start exchanging data packets.
.PP
In detail, the challenge consists of:
.PP
.Vb 1
\&  RSA\-OAEP (SEQNO MAC CIPHER SALT EXTRA\-AUTH) ECDH1
.Ve
.PP
That is, it encrypts (with the public key of the remote node) an initial
sequence number for data packets, key material for the \s-1HMAC\s0 key, key
material for the cipher key, a salt used by the \s-1HKDF \s0(as shown later) and
some extra random bytes that are unused except for authentication. It also
sends the public key of a curve25519 exchange.
.PP
The remote node decrypts the \s-1RSA\s0 data, generates its own \s-1ECDH\s0 key (\s-1ECDH2\s0),
and replies with:
.PP
.Vb 1
\&  HKDF\-Expand (HKDF\-Extract (ECDH2, RSA), ECDH1, AUTH_DIGEST_SIZE) ECDH2
.Ve
.PP
That is, it extracts from the decrypted \s-1RSA\s0 challenge, using its \s-1ECDH\s0
key as salt, and then expands using the requesting node's \s-1ECDH1\s0 key. The
resulting hash is returned as a proof that the node could decrypt the \s-1RSA\s0
challenge data, together with the \s-1ECDH\s0 key.
.PP
After both nodes have done this to each other, they calculate the shared
\&\s-1ECDH\s0 secret, cipher and \s-1HMAC\s0 keys for the session (each node generates two
cipher and \s-1HMAC\s0 keys, one for sending and one for receiving).
.PP
The \s-1HMAC\s0 key for sending is generated as follows:
.PP
.Vb 1
\&   HMAC_KEY = HKDF\-Expand (HKDF\-Extract (REMOTE_SALT, MAC ECDH_SECRET), info, HMAC_MD_SIZE)
.Ve
.PP
It extracts from \s-1MAC\s0 and \s-1ECDH_SECRET\s0 using the \fIremote\fR \s-1SALT,\s0 then
expands using a static info string.
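.PP
The derivation can be sketched in Python with the extract and expand steps
of \s-1HKDF\s0 as defined in \s-1RFC 5869.\s0 Several details are assumptions: the hash
function (\s-1SHA\-256\s0 here) is not named in this document, \*(L"\s-1MAC ECDH_SECRET\s0\*(R" is
read as a plain concatenation, and all function names are hypothetical.
.PP
.Vb 28
```python
import hashlib
import hmac

HASH = hashlib.sha256  # assumption: the text does not name the hash


def hkdf_extract(salt, ikm):
    """HKDF-Extract (RFC 5869): PRK = HMAC(salt, IKM)."""
    return hmac.new(salt, ikm, HASH).digest()


def hkdf_expand(prk, info, length):
    """HKDF-Expand (RFC 5869): derive length bytes from PRK."""
    okm, block, counter = b"", b"", 1
    while len(okm) < length:
        data = block + info + bytes([counter])
        block = hmac.new(prk, data, HASH).digest()
        okm += block
        counter += 1
    return okm[:length]


def derive_hmac_key(remote_salt, mac, ecdh_secret, info, size):
    """HMAC_KEY per the formula above, reading MAC ECDH_SECRET as a join."""
    prk = hkdf_extract(remote_salt, mac + ecdh_secret)
    return hkdf_expand(prk, info, size)
```
.Ve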
.PP
The cipher key is generated in the same way, except using the \s-1CIPHER\s0 part
of the original challenge.
.PP
The result of this process is to authenticate each node to the other
node, while exchanging keys using both \s-1RSA\s0 and \s-1ECDH,\s0 the latter providing
perfect forward secrecy.
.PP
The protocol has been overdesigned where this was possible without
increasing implementation complexity, in an attempt to protect against
implementation or protocol failures. For example, if the \s-1ECDH\s0 challenge
was found to be flawed, perfect forward secrecy would be lost, but the
data would likely still be protected. Likewise, standard algorithms and
implementations are used where possible.
.SS "Retrying"
.IX Subsection "Retrying"
When there is no response to an auth request, the node will send auth
requests in bursts with an exponential back-off. After some time it will
resort to \s-1PING\s0 packets, which are very small (8 bytes + protocol header)
and lightweight (no \s-1RSA\s0 operations required). A node that receives ping
requests from an unconnected peer will respond by trying to create a
connection.
.PP
In addition to the exponential back-off, there is a global rate-limit on
a per-IP basis. It allows long bursts but will limit total packet rate to
something like one control packet every ten seconds, to avoid accidental
floods due to protocol problems (like an \s-1RSA\s0 key file mismatch between two
nodes).
.PP
The intervals between retries are limited by the \f(CW\*(C`max\-retry\*(C'\fR
configuration value. A node with \f(CW\*(C`connect\*(C'\fR = \f(CW\*(C`always\*(C'\fR will always retry,
while a node with \f(CW\*(C`connect\*(C'\fR = \f(CW\*(C`ondemand\*(C'\fR will only try (and retry) to connect
as long as there are packets in the queue; this usually limits the retry
period to \f(CW\*(C`max\-ttl\*(C'\fR seconds.
.PP
Sending packets over the \s-1VPN\s0 will reset the retry intervals as well, which
means as long as somebody is trying to send packets to a given node, \s-1GVPE\s0
will try to connect every few seconds.
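.PP
The capped exponential back-off described in this section can be
illustrated with a short Python sketch. The initial interval and the
doubling factor are assumptions, since this document only states that the
back-off is exponential and bounded by \f(CWmax\-retry\fR. The function name is
hypothetical.
.PP
.Vb 11
```python
def retry_intervals(initial, max_retry, count):
    """Return count retry delays that double until capped at max_retry."""
    delays = []
    delay = initial
    for _ in range(count):
        delays.append(min(delay, max_retry))
        delay *= 2
    return delays
```
.Ve
.PP
Resetting the retry interval, as happens when packets are sent over the
\&\s-1VPN,\s0 corresponds to starting again from the initial value.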
.SS "Routing and Protocol translation"
.IX Subsection "Routing and Protocol translation"
The \s-1GVPE\s0 routing algorithm is simple, as there isn't much routing to speak
of. When routing packets to another node, \s-1GVPE\s0 tries the following
options, in order:
.IP "If the two nodes should be able to reach each other directly (common protocol, port known), then \s-1GVPE\s0 will send the packet directly to the other node." 4
.IX Item "If the two nodes should be able to reach each other directly (common protocol, port known), then GVPE will send the packet directly to the other node."
.PD 0
.ie n .IP "If this isn't possible (e.g. because the node doesn't have a \*(C`hostname\*(C' or known port), but the nodes speak a common protocol and a router is available, then \s-1GVPE\s0 will ask a router to ""mediate"" between both nodes (see below)." 4
.el .IP "If this isn't possible (e.g. because the node doesn't have a \f(CW\*(C`hostname\*(C'\fR or known port), but the nodes speak a common protocol and a router is available, then \s-1GVPE\s0 will ask a router to ``mediate'' between both nodes (see below)." 4
.IX Item "If this isn't possible (e.g. because the node doesn't have a hostname or known port), but the nodes speak a common protocol and a router is available, then GVPE will ask a router to mediate between both nodes (see below)."
.ie n .IP "If a direct connection isn't possible (no common protocols) or forbidden (\*(C`deny\-direct\*(C') and there are any routers, then \s-1GVPE\s0 will try to send packets to the router with the highest priority that is connected already \fIand\fR is able (as specified by the config file) to connect directly to the target node." 4
.el .IP "If a direct connection isn't possible (no common protocols) or forbidden (\f(CW\*(C`deny\-direct\*(C'\fR) and there are any routers, then \s-1GVPE\s0 will try to send packets to the router with the highest priority that is connected already \fIand\fR is able (as specified by the config file) to connect directly to the target node." 4
.IX Item "If a direct connection isn't possible (no common protocols) or forbidden (deny-direct) and there are any routers, then GVPE will try to send packets to the router with the highest priority that is connected already and is able (as specified by the config file) to connect directly to the target node."
.IP "If no such router exists, then \s-1GVPE\s0 will simply send the packet to the node with the highest priority available." 4
.IX Item "If no such router exists, then GVPE will simply send the packet to the node with the highest priority available."
.IP "Failing all that, the packet will be dropped." 4
.IX Item "Failing all that, the packet will be dropped."
.PD
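.PP
The order of the options listed above can be summarised as a decision
cascade. The Python sketch below is illustrative only: the dictionary keys,
the function name and the choice of the first configured router for
mediation are all hypothetical stand-ins for internal \s-1GVPE\s0 state, and it
assumes that \f(CWdeny\-direct\fR also rules out mediation.
.PP
.Vb 22
```python
def choose_next_hop(target, routers, nodes):
    """Return (action, node) following the five options listed above.

    Every dict key used here is a hypothetical stand-in for GVPE state.
    """
    if target["common_protocol"] and not target["deny_direct"]:
        if target["address_known"]:
            return "direct", target
        if routers:
            return "mediate", routers[0]
    usable = [r for r in routers
              if r["connected"] and target["name"] in r["can_reach"]]
    if usable:
        return "forward", max(usable, key=lambda r: r["priority"])
    if nodes:
        return "forward", max(nodes, key=lambda n: n["priority"])
    return "drop", None
```
.Ve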
.PP
A host can usually declare itself unreachable directly by setting its
port number(s) to zero. It can declare other hosts as unreachable by using
a config-file that disables all protocols for these other hosts. Another
option is to disable all protocols on that host in the other config files.
.PP
If two hosts cannot connect to each other because their \s-1IP\s0 address(es)
are not known (such as dial-up hosts), one side will send a \fImediated\fR
connection request to a router (routers must be configured to act as
routers!), which will send both the originating and the destination host
a connection info request with protocol information and \s-1IP\s0 address of the
other host (if known). Both hosts will then try to establish a direct
connection to the other peer, which is usually possible even when both
hosts are behind a \s-1NAT\s0 gateway.
.PP
Routing via other nodes works because the \s-1SRCDST\s0 field is not encrypted,
so the router can just forward the packet to the destination host. Since
each host uses its own private key, the router will not be able to
decrypt or encrypt packets; it will just act as a simple router and
protocol translator.