Update: I feel like a jackass now, I thought I was running this against the stable haproxy build, but in reality this was against haproxy-1.4dev6. DOH! Well on the bright-side, I am helping out the author fix a potentially critical bug. Here is the truss and tcp dump if anyone cares.

Well yet another Solaris specific bug/issue to report. HAProxy resets long running connections. Meaning users on slow bandwidth connections are affected by this. I have sent tcpdumps and logs to the author of HAProxy, hopefully this bug/issue would be resolved. I am writing this as a precautionary warning to other Solaris admins out there.

Here the way to trigger this, see if your service is affected by this.


wget --limit-rate=2k http://somesite.com/onebigfile.txt

Result:


syris:~ victori$ wget --limit-rate=20k http://somesite.com/onebigfile.txt
--2010-01-20 11:19:29-- http://somesite.com/onebigfile.txt
Resolving somesite.com (somesite.com)... 72.11.142.91
Connecting to somesite.com (somesite.com)|72.11.142.91|:84... connected.
HTTP request sent, awaiting response... 200 OK
Length: 3806025 (3.6M)
Saving to: “onebigfile.txt”

7% [====> ] 269,008 20.1K/s in 13s

2010-01-20 11:19:42 (20.1 KB/s) - Read error at byte 269008/3806025 (Connection reset by peer). Retrying.

--2010-01-20 11:19:43-- (try: 2) http://somesite.com/onebigfile.txt
Connecting to somesite.com (somesite.com)|72.11.142.91|:84... connected.
HTTP request sent, awaiting response... 200 OK
Length: 3806025 (3.6M)
Saving to: “onebigfile.txt”

4% [==> ] 186,016 20.0K/s eta

/Raging, why are there so many Solaris TCP issues? First Varnish? now HAProxy? ARGHHHHH!@#!@