Running a High-Performance Web Server on HPUX
Date: Wed, 05 Nov 1997 16:59:34 -0800 From: Rick Jones <raj@cup.hp.com> Reply-To: raj@cup.hp.com Organization: Network Performance Subject: HP-UX tuning tips
Here are some tuning tips for HP-UX to add to the tuning page.
For HP-UX 9.X: Upgrade to 10.20
    For HP-UX 10.[00|01|10]: Upgrade to 10.20
For HP-UX 10.20:
Install the latest cumulative ARPA Transport Patch. This
    will allow you to configure the size of the TCP connection
    lookup hash table. The default is 256 buckets and must be set
    to a power of two. This is accomplished with adb against the
    *disc* image of the kernel. The variable name is tcp_hash_size.
    Notice that it's critically important that you use "W"
    to write a 32 bit quantity, not "w" to write a 16 bit
    value when patching the disc image because the tcp_hash_size
    variable is a 32 bit quantity.
How to pick the value? Examine the output of ftp://ftp.cup.hp.com/dist/networking/tools/connhist
    and see how many total TCP connections exist on the system. You
    probably want that number divided by the hash table size to be
    reasonably small, say less than 10. Folks can look at HP's
    SPECweb96 disclosures for some common settings. These can be
    found at Http://www.specbench.org/.
    If an HP-UX system was performing at 1000 SPECweb96 connections
    per second, the TIME_WAIT time of 60 seconds would mean
    60,000 TCP "connections" being tracked.
Folks can check their listen queue depths with ftp://ftp.cup.hp.com/dist/networking/misc/listenq.
If folks are running apache on a PA-8000 based system, they
    should consider "chatr'ing" the Apache executable to have a
    large page size. This would be "chatr +pi L <BINARY>".
    The GID of the running executable must have MLOCK privileges.
    Setprivgrp(1m) should be consulted for assigning
    MLOCK. The change can be validated by running Glance
    and examining the memory regions of the server(s) to make sure that
    they show a non-trivial fraction of the text segment being locked.
If folks are running Apache on MP systems, they might
    consider writing a small program that uses mpctl()
    to bind processes to processors. A simple pid % numcpu
    algorithm is probably sufficient. This might even go into the
    source code.
If folks are concerned about the number of FIN_WAIT_2
    connections, they can use nettune to shrink the value of
    tcp_keepstart. However, they should be careful there -
    certainly do not make it less than oh two to four minutes. If
    tcp_hash_size has been set well, it is probably OK to
    let the FIN_WAIT_2's take longer to timeout (perhaps
    even the default two hours) - they will not on average have a big
    impact on performance.
There are other things that could go into the code base, but that might be left for another email. Feel free to drop me a message if you or others are interested.
sincerely,
rick jones
http://www.cup.hp.com/netperf/NetperfPage.HTML