.\" Automatically generated by Pandoc 2.1.1 .\" .TH "IRTT" "1" "February 11, 2018" "v0.9.0" "IRTT Manual" .hy .SH NAME .PP irtt \- Isochronous Round\-Trip Time .SH SYNOPSIS .PP irtt \f[I]command\f[] [\f[I]args\f[]] .PP irtt help \f[I]command\f[] .SH DESCRIPTION .PP IRTT measures round\-trip time and other latency related metrics using UDP packets sent on a fixed period, and produces both text and JSON output. .SH COMMANDS .TP .B \f[I]client\f[] runs the client .RS .RE .TP .B \f[I]server\f[] runs the server .RS .RE .TP .B \f[I]bench\f[] runs HMAC and fill benchmarks .RS .RE .TP .B \f[I]clock\f[] runs wall vs monotonic clock test .RS .RE .TP .B \f[I]sleep\f[] runs sleep accuracy test .RS .RE .TP .B \f[I]version\f[] shows the version .RS .RE .SH EXAMPLES .PP After installing IRTT, start a server: .IP .nf \f[C] $\ irtt\ server IRTT\ server\ starting... [ListenerStart]\ starting\ IPv6\ listener\ on\ [::]:2112 [ListenerStart]\ starting\ IPv4\ listener\ on\ 0.0.0.0:2112 \f[] .fi .PP While that's running, run a client. If no options are supplied, it will send a request once per second, like ping. Here we simulate a one minute G.711 VoIP conversation by using an interval of 20ms and randomly filled payloads of 172 bytes: .IP .nf \f[C] $\ irtt\ client\ \-i\ 20ms\ \-l\ 172\ \-d\ 1m\ \-\-fill=rand\ \-\-sfill=rand\ \-q\ 192.168.100.10 [Connecting]\ connecting\ to\ 192.168.100.10 [Connected]\ connected\ to\ 192.168.100.10:2112 \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ Min\ \ \ \ \ Mean\ \ \ Median\ \ \ \ \ \ Max\ \ Stddev \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \-\-\-\ \ \ \ \ \-\-\-\-\ \ \ \-\-\-\-\-\-\ \ \ \ \ \ \-\-\-\ \ \-\-\-\-\-\- \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ RTT\ \ 11.93ms\ \ 20.88ms\ \ \ 19.2ms\ \ 80.49ms\ \ 7.02ms \ \ \ \ \ \ \ \ \ send\ delay\ \ \ 4.99ms\ \ 12.21ms\ \ 10.83ms\ \ 50.45ms\ \ 5.73ms \ \ \ \ \ \ receive\ delay\ \ \ 6.38ms\ \ \ 8.66ms\ \ \ 7.86ms\ \ 69.11ms\ \ 2.89ms \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ IPDV\ (jitter)\ \ \ \ 782ns\ \ \ 4.53ms\ \ \ 3.39ms\ \ 64.66ms\ \ \ 4.2ms \ \ \ \ \ \ \ \ \ \ send\ IPDV\ \ \ \ 256ns\ \ \ 3.99ms\ \ \ 2.98ms\ \ 35.28ms\ \ 3.69ms \ \ \ \ \ \ \ receive\ IPDV\ \ \ \ 896ns\ \ \ 1.78ms\ \ \ \ 966µs\ \ 62.28ms\ \ 2.86ms \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ send\ call\ time\ \ \ 56.5µs\ \ \ 82.8µs\ \ \ \ \ \ \ \ \ \ \ 18.99ms\ \ \ 348µs \ \ \ \ \ \ \ \ timer\ error\ \ \ \ \ \ \ 0s\ \ \ 21.7µs\ \ \ \ \ \ \ \ \ \ \ 19.05ms\ \ \ 356µs \ \ server\ proc.\ time\ \ \ 23.9µs\ \ \ 26.9µs\ \ \ \ \ \ \ \ \ \ \ \ \ 141µs\ \ 11.2µs \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ duration:\ 1m0s\ (wait\ 241.5ms) \ \ \ packets\ sent/received:\ 2996/2979\ (0.57%\ loss) \ server\ packets\ received:\ 2980/2996\ (0.53%/0.03%\ loss\ up/down) \ \ \ \ \ bytes\ sent/received:\ 515312/512388 \ \ \ \ \ \ \ send/receive\ rate:\ 68.7\ Kbps\ /\ 68.4\ Kbps \ \ \ \ \ \ \ \ \ \ \ packet\ length:\ 172\ bytes \ \ \ \ \ \ \ \ \ \ \ \ \ timer\ stats:\ 4/3000\ (0.13%)\ missed,\ 0.11%\ error \f[] .fi .PP In the results above, the client and server are located at two different sites, around 50km from one another, each of which connects to the Internet via point\-to\-point WiFi. The client is 3km NLOS through trees located near its transmitter, which is likely the reason for the higher upstream packet loss, mean send delay and IPDV. 
.SH BUGS
.IP \[bu] 2
Windows is unable to set DSCP values for IPv6.
.IP \[bu] 2
Windows is unable to set the source IP address, so
\f[C]\-\-set\-src\-ip\f[] may not be used on the server.
.IP \[bu] 2
The server doesn't run well on 32\-bit Windows platforms.
When connecting with a client, you may see
\f[C]Terminated\ due\ to\ receive\ error\f[].
To work around this, disable dual timestamps from the client by
including \f[C]\-\-tstamp=midpoint\f[].
.SH LIMITATIONS
.RS
.PP
\[lq]It is the limitations of software that give it life.\[rq]
.IP
.nf
\f[C]
\-Me,\ justifying\ my\ limitations
\f[]
.fi
.RE
.SS Isochronous (fixed period) send schedule
.PP
Currently, IRTT only sends packets on a fixed period, forgoing the
ability to simulate arbitrary traffic.
Accepting this limitation offers some benefits:
.IP \[bu] 2
It's easy to implement
.IP \[bu] 2
It's easy to calculate how many packets and how much data will be sent
in a given time
.IP \[bu] 2
It simplifies timer error compensation
.PP
Also, isochronous packets are commonly seen in VoIP, games and some
streaming media, so it already simulates an array of common types of
traffic.
.SS Fixed packet lengths for a given test
.PP
Packet lengths are fixed for the duration of the test.
While this may not accurately simulate some types of traffic, it means
that IPDV measurements are accurate, where they otherwise would not be.
.SS Stateful protocol
.PP
There are numerous benefits to stateless protocols, particularly for
developers and data centers, including simplified server design,
horizontal scalability, and easily implemented zero\-downtime restarts.
However, in this case, a stateful protocol provides important benefits
to the user, including:
.IP \[bu] 2
Smaller packet sizes (a design goal), as context does not need to be
included in every request
.IP \[bu] 2
More accurate measurement of upstream vs downstream packet loss (this
gets worse in a stateless protocol as RTT approaches the test duration,
complicating interplanetary tests!)
.IP \[bu] 2
More accurate rate and test duration limiting on the server
.SS In\-memory results storage
.PP
Results for each round\-trip are stored in memory as the test is being
run.
Each result takes 72 bytes in memory (8 64\-bit timestamps and a
64\-bit server received packet window), so this limits the effective
duration of the test, especially at very small send intervals.
However, the advantages are:
.IP \[bu] 2
It's easier to perform statistical analysis (like calculation of the
median) on fixed arrays than on running data values
.IP \[bu] 2
We don't need to either send client timestamps to the server or
maintain a local running window of sent packet info, because results
are all in memory, no matter when server replies come back
.IP \[bu] 2
Not accessing the disk during the test to write test output prevents
inadvertently affecting the results
.IP \[bu] 2
It simplifies the API
.PP
As a consequence of storing results in memory, packet sequence numbers
are fixed at 32 bits.
If all 2^32 sequence numbers were used, the results would require over
300 GB of virtual memory (2^32 x 72 bytes is about 309 GB) to record
while the test is running, which is why 64\-bit sequence numbers are
currently unnecessary.
.SS 64\-bit received window
.PP
In order to determine per\-packet differentiation between upstream and
downstream loss, a 64\-bit \[lq]received window\[rq], containing the
receipt status of the previous 64 packets, may be returned with each
packet.
This can be enabled using \f[C]\-\-stats=window/both\f[] with the irtt
client.
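.PP
As an illustration of the mechanism, here is a minimal sketch of such a
window in Go, with assumed names (\f[C]recvWindow\f[],
\f[C]record\f[]); it is not IRTT's actual implementation or wire
format:
.IP
.nf
\f[C]
package\ main

import\ "fmt"

//\ recvWindow\ is\ a\ sketch\ of\ a\ 64\-packet\ received\ window
//\ (illustration\ only,\ not\ actual\ IRTT\ code).\ Bit\ 0\ of\ bits
//\ refers\ to\ the\ latest\ seqno,\ bit\ n\ to\ latest\-n.
type\ recvWindow\ struct\ {
\ \ \ \ latest\ uint32\ //\ highest\ sequence\ number\ seen
\ \ \ \ bits\ \ \ uint64\ //\ receipt\ bitmap\ for\ latest..latest\-63
}

//\ record\ marks\ seq\ received.\ A\ seqno\ more\ than\ 64\ behind
//\ latest\ has\ slid\ out\ of\ the\ window\ and\ cannot\ be\ marked.
func\ (w\ *recvWindow)\ record(seq\ uint32)\ {
\ \ \ \ if\ seq\ >\ w.latest\ {
\ \ \ \ \ \ \ \ //\ slide\ forward;\ a\ shift\ of\ 64+\ zeroes\ the\ window
\ \ \ \ \ \ \ \ w.bits\ =\ w.bits<<(seq\-w.latest)\ |\ 1
\ \ \ \ \ \ \ \ w.latest\ =\ seq
\ \ \ \ }\ else\ if\ d\ :=\ w.latest\ \-\ seq;\ d\ <\ 64\ {
\ \ \ \ \ \ \ \ w.bits\ |=\ 1\ <<\ d\ //\ late\ packet,\ still\ in\ window
\ \ \ \ }
}

func\ main()\ {
\ \ \ \ var\ w\ recvWindow
\ \ \ \ for\ _,\ seq\ :=\ range\ []uint32{1,\ 2,\ 4,\ 5,\ 3}\ {\ //\ 3\ is\ late
\ \ \ \ \ \ \ \ w.record(seq)
\ \ \ \ }
\ \ \ \ fmt.Printf("latest=%d\ bits=%b\\n",\ w.latest,\ w.bits)
\ \ \ \ //\ latest=5\ bits=11111
}
\f[]
.fi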
.PP
The window's limited width and simple bitmap format lead to some
caveats:
.IP \[bu] 2
Per\-packet differentiation is not available (for any intervening
packets) if more than 64 packets are lost in succession.
Such packets are marked with the generic \f[C]Lost\f[].
.IP \[bu] 2
While any packet marked \f[C]LostDown\f[] is guaranteed to be marked
properly, there is no confirmation of receipt of the received window
from the client to the server, so packets may sometimes be erroneously
marked \f[C]LostUp\f[], for example, if they arrive late to the server
and slide out of the received window before they can be confirmed to
the client, or if the received window is lost on its way to the client
and not amended by a later packet's received window.
.PP
There are many ways that this simple approach could be improved, such
as by:
.IP \[bu] 2
Allowing a wider window
.IP \[bu] 2
Encoding receipt seqnos in a more intelligent way to allow a wider
seqno range
.IP \[bu] 2
Sending confirmation of window receipt from the client to the server
and re\-sending unreceived windows
.PP
However, the current strategy means that a good approximation of
per\-packet loss results can be obtained with only 8 additional bytes
in each packet.
It also requires very little computational time on the server, and
almost all computation on the client occurs during results generation,
after the test is complete.
It isn't as accurate with late (out\-of\-order) upstream packets or
with long sequences of lost packets, but high loss or high numbers of
late packets typically indicate more severe network conditions that
should be corrected first anyway, perhaps before per\-packet results
matter.
Note that in case of very high packet loss, the \f[B]total\f[] number
of packets received by the server but not returned to the client
(which can be obtained using \f[C]\-\-stats=count\f[]) will still be
correct, which still provides an accurate \f[B]average\f[] loss
percentage in each direction over the course of the test.
.SS Use of Go
.PP
IRTT is written in Go.
That carries with it:
.IP \[bu] 2
Non\-negligible system call overhead
.IP \[bu] 2
A larger executable size than with C
.IP \[bu] 2
Somewhat slower execution speed than C (although not that much
slower (https://benchmarksgame.alioth.debian.org/u64q/compare.php?lang=go&lang2=gcc))
.PP
However, Go also has characteristics that make it a good fit for this
application:
.IP \[bu] 2
Go's target is network and server applications, with a focus on
simplicity, reliability and efficiency, which is appropriate for IRTT
.IP \[bu] 2
Memory footprint tends to be significantly lower than with some
interpreted languages
.IP \[bu] 2
It's easy to support a broad array of hardware and OS combinations
.SH SEE ALSO
.PP
irtt\-client(1) (irtt-client.html), irtt\-server(1) (irtt-server.html)
.PP
IRTT GitHub repository (https://github.com/peteheist/irtt/)
.SH AUTHOR
.PP
Pete Heist
.PP
Many thanks to both Toke Høiland\-Jørgensen and Dave Täht from the
Bufferbloat project (https://www.bufferbloat.net/) for their valuable
advice.
Any problems in design or implementation are entirely my own.
.SH HISTORY
.PP
IRTT was originally written to improve the latency and packet loss
measurements for the excellent Flent (https://flent.org) tool.
Flent was developed by and for the
Bufferbloat (https://www.bufferbloat.net/projects/) project, which
aims to reduce \[lq]chaotic and laggy network performance,\[rq] making
this project valuable to anyone who values their time and sanity while
using the Internet.