<!DOCTYPE html>
<html>
<head>
<title>
The Codex »
Observations on Buffering
</title>
<link
rel="stylesheet"
type="text/css"
href="http://fonts.googleapis.com/css?family=Buenard:400,700&amp;subset=latin,latin-ext">
<link
rel="stylesheet"
type="text/css"
href="../media/css/reset.css">
<link
rel="stylesheet"
type="text/css"
href="../media/css/grimoire.css">
</head>
<body>
<div id="shell">
<ol id="breadcrumbs">
<li class="crumb-0 not-last">
<a href="../">index</a>
</li>
<li class="crumb-1 not-last">
<a href="./">dev</a>
</li>
<li class="crumb-2 last">
buffers
</li>
</ol>
<div id="article">
<h1 id="observations-on-buffering">Observations on Buffering</h1>
<p>None of the following is particularly novel, but the reminder has been useful:</p>
<ul>
<li>
<p>All buffers exist in one of two states: full (writes outpace reads), or empty
(reads outpace writes). There are no other stable configurations.</p>
</li>
<li>
<p>Throughput on an empty buffer is dominated by the write rate. Throughput on a
full buffer is dominated by the read rate.</p>
</li>
<li>
<p>A full buffer imposes a latency penalty equal to its size in bits, divided by
the read rate in bits per second. An empty buffer imposes (approximately) no
latency penalty.</p>
</li>
</ul>
<p>The previous three points suggest that <strong>traffic buffers should be measured in
seconds, not in bytes</strong>, and managed accordingly. Less obviously, buffer
management needs to be considerably more sophisticated than the usual "grow
buffer when full, up to some predefined maximum size."</p>
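<p>That seconds-first sizing rule can be sketched in a few lines. The figures
here are illustrative, not drawn from any particular system:</p>
<pre><code># Latency of a full buffer is its size divided by its drain (read) rate,
# so sizing in seconds means deriving bytes *from* a latency budget,
# not the other way around. All figures below are made-up examples.

def full_buffer_latency(size_bytes: int, read_rate_bps: float) -> float:
    """Seconds a byte waits to traverse a full buffer."""
    return (size_bytes * 8) / read_rate_bps

def size_for_latency(budget_s: float, read_rate_bps: float) -> int:
    """Largest buffer (in bytes) that stays within a latency budget."""
    return int(budget_s * read_rate_bps / 8)

# A 1 MiB buffer draining at 8 Mbit/s holds over a second of data:
print(full_buffer_latency(1024 * 1024, 8_000_000))   # ~1.05 s

# To cap latency at 50 ms on the same link, the buffer must shrink:
print(size_for_latency(0.05, 8_000_000))             # 50000 bytes
</code></pre>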
<p>Point one also implies a rule that I see honoured more in ignorance than in
awareness: <strong>you can't make a full buffer less full by making it bigger</strong>. Size
is not a factor in buffer fullness, only in buffer latency, so adjusting the
size in response to capacity pressure is worse than useless.</p>
<p>There are only three ways to make a full buffer less full:</p>
<ol>
<li>
<p>Increase the rate at which data exits the buffer.</p>
</li>
<li>
<p>Slow the rate at which data enters the buffer.</p>
</li>
<li>
<p>Evict some data from the buffer.</p>
</li>
</ol>
<p>In actual practice, most full buffers are upstream of some process that's
already going as fast as it can, either because of other design limits or
because of physics. A buffer ahead of disk writing can't drain faster than the
disk can accept data, for example. That leaves options two and three.</p>
<p>Slowing the rate of arrival usually implies some variety of <em>back-pressure</em> on
the source of the data, to allow upstream processes to match rates with
downstream processes. Over-large buffers delay this process by hiding
back-pressure, and buffer growth will make this problem worse. Often,
back-pressure can happen automatically: failing to read from a socket, for
example, will cause the underlying TCP stack to apply back-pressure to the peer
writing to the socket by delaying TCP-level message acknowledgement. Too often,
I've seen code attempt to suppress these natural forms of back-pressure without
replacing them with anything, leading to systems that fail by surprise when
some other resource – usually memory – runs out.</p>
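<p>The bounded-queue form of this back-pressure is easy to demonstrate. This is
a minimal sketch using Python's standard library; the capacity of four is an
arbitrary example:</p>
<pre><code>import queue
import threading

# A bounded queue is the simplest explicit form of back-pressure: when
# the buffer is full, put() blocks, slowing the producer to the
# consumer's rate instead of letting memory grow without bound.

buf = queue.Queue(maxsize=4)

def producer(items):
    for item in items:
        buf.put(item)      # blocks while the buffer is full
    buf.put(None)          # sentinel: no more data

def consumer(out):
    while True:
        item = buf.get()   # blocks while the buffer is empty
        if item is None:
            break
        out.append(item)

out = []
t = threading.Thread(target=consumer, args=(out,))
t.start()
producer(range(100))       # never holds more than four items at once
t.join()
print(out == list(range(100)))   # True
</code></pre>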
<p>Eviction relies on the surrounding environment, and must be part of the
protocol design. Surprisingly, most modern application protocols get very
unhappy when you throw their data away: the network age has not, sadly, brought
about protocols and formats particularly well-designed for distribution.</p>
<p>If neither back-pressure nor eviction is available, the remaining option is to
fail: either to start dropping data unpredictably, or to cease processing data
entirely as a result of some resource or another running out, or to induce so
much latency that the data is useless by the time it arrives.</p>
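<p>Eviction, where the protocol tolerates it, can be as simple as a drop-oldest
ring buffer. A minimal sketch; the capacity of three is an arbitrary example:</p>
<pre><code>from collections import deque

# deque(maxlen=n) silently discards the oldest entry on overflow.
# This is only acceptable when the protocol tolerates loss -- stale
# sensor readings or late video frames, say -- which is exactly the
# protocol-design constraint described above.

buf = deque(maxlen=3)
for reading in [1, 2, 3, 4, 5]:
    buf.append(reading)    # readings 1 and 2 are evicted on overflow

print(list(buf))           # [3, 4, 5] -- only the freshest survive
</code></pre>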
<hr>
<p>Some uncategorized thoughts:</p>
<ul>
<li>
<p>Some buffers exist to trade latency against the overhead of coordination. A
small buffer in this role will impose more coordination overhead; a large
buffer will impose more latency.</p>
<ul>
<li>
<p>These buffers appear where data transits between heterogeneous systems: for
example, buffering reads from the network for writes to disk.</p>
</li>
<li>
<p>Mismanaged buffers in this role will tend to cause the system to spend
an inordinate proportion of latency and throughput negotiating buffer
sizes and message readiness.</p>
</li>
<li>
<p>A coordination buffer is most useful when <em>empty</em>; in the ideal case, the
buffer is large enough to absorb one message's worth of data from the
source, then pass it along to the sink as quickly as possible.</p>
</li>
</ul>
</li>
<li>
<p>Some buffers exist to trade latency against jitter. A small buffer in this
role will expose more jitter to the upstream process. A large buffer in this
role will impose more latency.</p>
<ul>
<li>
<p>These tend to appear in <em>homogeneous</em> systems with differing throughputs,
or as a consequence of some other design choice. Store-and-forward
switching in networks, for example, implies that switches must buffer at
least one full frame of network data.</p>
</li>
<li>
<p>Mismanaged buffers in this role will <em>amplify</em> jitter rather than smooth
it out. Apparent throughput will be high until the buffer fills, then
change abruptly when full. Upstream processes are likely to throttle
down, causing them to under-deliver if the buffer drains, pushing the
system back to a high-throughput mode. <a href="http://www.bufferbloat.net">This problem gets worse the
more buffers are present in a system</a>.</p>
</li>
<li>
<p>An anti-jitter buffer is most useful when <em>full</em>; in exchange for a
latency penalty, sudden changes in throughput will be absorbed by data
in the buffer rather than propagating through to the source or sink.</p>
</li>
</ul>
</li>
<li>
<p>Multimedia people understand this stuff at a deep level. Listen to them when
designing buffers for other applications.</p>
</li>
</ul>
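<p>The anti-jitter behaviour described above can be sketched as a toy playout
buffer that refuses to drain until it has primed. The class name and depth are
illustrative, not taken from any real library:</p>
<pre><code>from collections import deque

# A toy anti-jitter (playout) buffer: it reports "not ready" until it
# holds `depth` items, so variation in arrival times is absorbed by
# buffered data instead of reaching the consumer as an underrun.

class JitterBuffer:
    def __init__(self, depth):
        self.depth = depth
        self.items = deque()
        self.primed = False

    def push(self, item):
        self.items.append(item)
        if len(self.items) >= self.depth:
            self.primed = True   # full enough: safe to start draining

    def pop(self):
        # Until primed (or when drained), signal "not ready" rather
        # than propagating the gap downstream.
        if not self.primed or not self.items:
            return None
        return self.items.popleft()

jb = JitterBuffer(depth=3)
print(jb.pop())        # None -- still priming
for pkt in "abc":
    jb.push(pkt)
print(jb.pop())        # 'a' -- primed, drains in arrival order
</code></pre>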
</div>
<div id="comments">
<div id="disqus_thread"></div>
<script type="text/javascript">
/* * * CONFIGURATION VARIABLES: EDIT BEFORE PASTING INTO YOUR WEBPAGE * * */
var disqus_shortname = 'grimoire'; // required: replace example with your forum shortname
/* * * DON'T EDIT BELOW THIS LINE * * */
(function() {
var dsq = document.createElement('script'); dsq.type = 'text/javascript'; dsq.async = true;
dsq.src = 'http://' + disqus_shortname + '.disqus.com/embed.js';
(document.getElementsByTagName('head')[0] || document.getElementsByTagName('body')[0]).appendChild(dsq);
})();
</script>
<noscript>Please enable JavaScript to view the <a href="http://disqus.com/?ref_noscript">comments powered by Disqus.</a></noscript>
<a href="http://disqus.com" class="dsq-brlink">comments powered by <span class="logo-disqus">Disqus</span></a>
</div>
<div id="footer">
<p>
The Codex —
Powered by <a href="http://markdoc.org/">Markdoc</a>.
<a href="https://bitbucket.org/ojacobson/grimoire.ca/src/master/wiki/dev/buffers.md">See this page on Bitbucket</a> (<a href="https://bitbucket.org/ojacobson/grimoire.ca/history-node/master/wiki/dev/buffers.md">history</a>).
</p>
</div>
</div>
</body>
</html>