These are the Release Notes for Revision 2.7.0 of netperf: *) Add bits/s (-f b) and Bytes/s (-f B) as selectable throughput units. This may make life easier for folks doing post-processing of things like interim results. *) Miscelaneous fixes *) Much of the now-seeming kruft for things peripheral to netperf's core mission - have been removed though the code itself remains in the repository. So, things like looking-up egress interface, driver/slot information etc have been disabled in a manner requiring more than just a ./configure to bring-back. It wasn't clear those features were being used. This is a test of that hypothesis. These are the Release Notes for Revision 2.6.0 of netperf: *) Initial pass at support for --enable-intervals (WANT_INTERVALS) for Windows, courtesy of Jonathan Cook. *) When in demo mode (./configure --enable-demo and a global -D <interval> option netperf will make sure it emits "one last interval result" when the test is terminated. This should assist when post-processing results through the likes of rrdtool when there is a slow-down in the performance just at the end that would have stretched the interval to beyond the test termination. *) A fix to have the AF_UNIX tests realize that the value for "take the system default" socket buffer size became -1 years ago. Bug found by Eric Dumazet. *) Include a patch from Dave Taht to enable symbolic manipulation of IP_TOS values. *) Include a patch from Sachar Raindel to enable the omni tests to get ENOBUFS under Linux when the socket buffer is larger than the tx queue of the egress interface. This will help preclude netperf's reporting a larger than link-rate send-side figure. *) Fix a problem with late checking of the return from select() in src/netserver.c. Reported by Waqar Sheikh. *) A new global -Z option has been added to netperf and netserver. This takes as an argument a passphrase. In the case of netserver it will expect a control message with the passphrase as the first thing it receives on the control connection. If netserver does not receive a control message with the passphrase it will close the control connection and move-on. If the netserver receives a control message with a passprhase when it is not lookign for one, it will be ignored. There is at present a 20 second timeout on the attempted receipt of the request message. In the case of netperf, the passphrase will be the first thing sent on the control connection. There is no response to a passphrase control message. *) Demo mode output format will now track the omni output format. So, if the omni ouput format is CSV then the interim results will be emitted in csv. Likewise for keyval. If the mode is human (default and test-specific -O) then the output remains unchanged. Keyval output includes the count of interval, with a mind towards being able to source it in shells and whatnot. Subject to change without notice. *) A patch to correctly handle IPv6 addresses in the control messages, courtesy of Bjoern Zeeb. *) The global -F option can now be used specify a local and/or remote fill file. *) It is now possible to set/get the TCP congestion control algorithm being used by either end of the test connection when using the omni code. The output selectors are LOCAL_CONG_CONTROL and REMOTE_CONG_CONTROL and setting is via the test-specific -K option. *) Stop leaking file descriptors when looking-up probable egress interface names and I/O slot numbers. *) The global -Y option can be used to set IP_TOS on those platforms which support it. Since this is specific to IP (v4 or v6) it may move to a test-specific otion in the future. It is presently global for foolish consistency with the -y option to set SO_PRIORITY. *) The global -y option can be used to set SO_PRIORITY on those platforms which support it. Based on patches from Amir Vidai. *) The control message size has been increased from 256 bytes to 512 bytes. THIS WILL BREAK COMPATABILITY WITH PREVIOUS VERSIONS OF NETPERF. However, we need more room on the pinhead on which the angels dance. *) Make the "sum" field of the histogram structure a 64 bit int to avoid having it wrap-around on tests where the sum of all the measured latencies was larger than 31 bits. This was causing statistics like stddev to go negative in some cases. *) If the time delta between two events is negative, do not bother doing any math with it in the histogram/statistics code, just increment the ridiculous count and move-on. *) Fixed a bug which caused local transport retransmissions to be reported as -1 even though the getsockopt() call was successful. (Linux). Later included remote transport retransmissions. *) The src/nettest_omni.c and re-written src/netserver.c code are now known to have compiled under Windows 7 x64 with the Microsoft WDK. There remains a timing issue with confidence intervals which is yet to be addressed, and may have been there for ages. Netserver has been run as a non-spawning (-f) server, netperf has been run, both have run "classic" and "omni" tests. These are the Release Notes for Revision 2.5.0 of netperf: *) Add a new -N option to netserver which will suppress all creation of debug files and so debugging output. While this would put a serious crimp in debugging a problem in netserver, it will enable folks using small embedded systems to avoid soaking-up their /tmp filesystem with clutter. *) A refactoring and partial re-write of the src/netserver.c code to untangle years of accumulated spaghetti code. Included is the ability to not daemonize netserver when launched from the command-line (-D) and also to not fork/spawn child processes upon the acceptance of a control connection (-f). Combined, the two options will cause netserver to remain in the forground and not spawn children - in effect netserver will handle only one test at a time. *) As it has been two years since the defect in Solaris getaddrinfo() was submitted, it should be the case that a fix is available, so it should no longer be necessary to "hide" the "Hey your platform's getaddrinfo() call is buggy" warning. Consequently, it is no longer being suppressed. *) A new global command line option - -S - has been added to enable setting of SO_KEEPALIVE on the data socket. This will affect the netperf side of the "classic" netperf tests, and will also affect the netserver side of an "omni" or migrated classic test as only the control message for the omni tests has the requisite flags field to communicate the desire to set SO_KEEPALIVE. Ostensibly, this may help when netperf is (ab)used in functional testing situations and netservers end-up orphaned and out in the cold because their corresponding netperfs went away and the notification was lost amid the roar of traffic over-saturating the interconnect(s). The default is to behave as before - SO_KEEPALIVE not set. *) Base on the frequency at which the author has used the functionality, the default for --enable-burst is now "yes." To disable support for burst mode one must now include a --enable-burst=no when performing the ./configure prior to compiling the bits. *) The output of the -D global command line option (./configure --enable-demo) has been enhanced to include seconds and milliseconds since the epoch as returned by a gettimeofday() call with a null pointer for the timezone. This is in support of being able to easily shove interim results into an rrdtool Round-Robin Database (RRD). *) The "omni" tests will be compiled-in by default, and WANT_MIGRATION is the default. One must ./configure with --enable-omni=no to disable this. *) When ./configured with --enable-intervals and intervals are actually used, the round-trip latency reported by an omni (or migrated classic) request/response test should better reflect reality rather than the length of the pacing interval. It and the MEAN_LATENCY from the histogram and -j output will still differ slightly and probably always will. *) The histogram code has been enhanced to track more than one latency at a time and so --enable-histogram and --enable-burst are now compatible - for the omni tests or migrated "classic" tests only however. This change was inspired/instigated by Jim Gettys and his work on overly-large queues of buffers *) WANT_MIGRATION is enabled when one specifies --enable-omni on the configure command line. *) Massage and encorporate a patch from Google that enables randomization of the IP addresses used in a test. An optional mask length in the standard '/' notation can be added to the end of the IP/name in the test-specific -H or -L options of an Omni test. *) Massage and include a DEBUG_LOG_FILE patch for Android from Josselin Costanzi *) Add intial attempt to report Slot ID on HP-UX 11.31. *) Add global -s option to cause omni tests to pause between setting-up the test and actually starting it. pause is in seconds. Poor man's way to (attempt to) avoid issues when starting many, Many, MANY concurrent netperf tests. Based on patches from Google. *) Additional timing statistics will be kept by the omni tests when the global "-j" option is specified. The additional statistics are min, max, mean, stddev and the 50th, 90th and 99th percentiles on the timings measured by histograms. Based on patches from Google. *) Add a workaround to get Linux to report TX queue drops in a UDP_STREAM test when the socket buffer size is larger than the TX queue. Provided by Andrew Gallatin. *) Fix the configure script to know it does not have to look for an SCTP library on FreeBSD 8.X *) The BSD and "omni" tests now have a test-specific -R option which is a boolean controlling whether or not SO_DONTROUTE will be set on the data socket. By default, any unidirectional UDP test will have SO_DONTROUTE set unless a -R 1 option is given. All other tests (including UDP request/response tests) will not have SO_DONTROUTE set unless a -R 0 option is given. This is put into place to make it take longer for blithering idiots to shoot themselves in the foot by running tests on setups they shouldn't. *) At least the beginnings of support for RDS, based on a circa 2007 patch against 2.4.2 by Vladimir Sokolovsky. Rather than create the "RDS_STREAM" test of his patch, the intention this time around is to enable RDS for the "omni" tests by using an omni test-specific -T rds specifyer for the "transport" to use. *) Missing fprintf format statements provided by Bruno Cornec *) Numerous cleanups from Jose Pedro Oliveira *) Fixes to allow netperf -H ::1 to work without having to add -6 or an AF_INET6 -L option These are the Release Notes for Revision 2.4.5 of netperf: Things changed in this release: *) Fixes for Linux procstat-based CPU utilization on newer kernels from Andrew Gallatin. *) Fix for a TCP_RR hang from Michael Shuldman *) Compilation cleanups for MingW cnd MSDOS (djgpp) ourtesy of Gisle Vanem. *) Changes to enable compilation and building of netperf for VMware. Kudos to the person who did the first port, I will be happy to name that person when told it is OK :) *) Fixes from Adam Bidema for launching netserver children when the path to netserver.exe is very long. *) For the first time, netperf2 has a dependency, albeit optional, on another non-base-os bit of code - libsmbios under Linux. It will attept to detect this at compile time and use it to report the system model name in an omni test. If libsmbios is there we will try to use it, otherwise we will not. If the associated include file is also there (eg the -dev package in apt-get-speak), we will use it to get the prototype for SMBIOSGetSystemName, otherwise we make a guess as to the prototype for SMBIOSGetSystemName(), which is the only call we make to libsmbios. *) Fixes for BSD CPU utilization to deal with different BSD variants using different types. Courtesy of Simon Burge <simonb@NetBSD.org> *) The "omni" suite has been added on an experimental basis. If it works-out then many of the tests in src/nettest_bsd.c, src/nettest_sdp.c, and src/nettest_sctp.c will be "migrated" to use the "omni infrastructure" (aka two routines to measure them all...). Apart from reduced socket code, the omni suite has user-configurable output in either "human readable," CSV or keyword=value format. By default, a VERY large quantity of data is output when asking for csv format (test-specific -o option) or keyword format (test-specific -k option). The omni suite is not yet documented (there are some as-yet undiagnosed problems with doc/netperf.texi in emacs texinfo mode and updating nodes and links and such - any help there would be appreciated) but there is a small text file in doc/ describing the names (most) of the available output's. For the most up-to-date list consult src/nettest_omni.c and the enum netperf_output_name. Or, you can pass-in a "filename" of '?' to either of the -O, -o or -k options and netperf will emit a list of the known available outputs. *) Coming along for the ride are some new platform specific files to determine the probable egress interface for each end of a test, as well as driver information for that interface. There is also reporting of "uname" like information for both local and remote system, and eventually perhaps something about the vendor's model name for the systems as well as the processor types. The end goal is to make it easy to get most if not all what one would want in a database of netperf results. *) The UDP_RR test now understands the global -f option to change output units. It also understands the -B option to tag results. Courtesy of Alexander Duyck. *) A fix has been added for hanging UDP_RR tests under Windows. Courtesy of Alexander Duyck. *) Use vfork() on those platforms without fork(), courtesy of Matt Waddel *) Track the bouncing interfaces that are linux processor affinity *) Fixes for Solaris sendfilev usage. *) A TCP_MSS test has been added which will report the MSS for a data connection setup as if the test were a TCP_STREAM test. While the remote (netserver) is tricked into thinking it is to accept a TCP_STREAM test, no actual data will flow over the connection. This means that if the MSS is one which might change over the life of the connection, it will not be reflected in the test output. Should this prove to be a problem a single send() can be arranged along with the return of the shutdown();recv() handshake. The idea is that this might be useful for netperf scripts wanting to parameterize things based on the MSS - for example the packet_byte_script. *) The width of the confidence interval can be specified in fractions of a percent for the confidence of a clean, close, comfortable calculation. :) *) Honor the global -B option in a TCP_SENDFILE test. *) Correct the sense of Send/Recv in the banner of a TCP_MAERTS test. These are the Release Notes for Revision 2.4.4 of netperf: Things changed in this release: *) The LOC_CPU and REM_CPU tests will report their respective beliefs as to the number of CPUs present when the verbosity is set to more than one. This can be used when trying to diagnose issues with CPU utilization. *) A kind soul who wishes to remain anonymous provided a patch to enable use of sendfile() on OSX. *) Fix a misplaced \n in a format string of send_tcp_maerts, courtesy of Alexander Duyck. *) There is an experimental global -r option which will allow one to include CPU utilization measurements, but make the decision about hitting confidence based on the result only. The test banner will reflects this when -r is used. *) It is no longer necessary to specify a file with the global -F option when running a _SENDFILE test. Netperf will create a temporary file and populate it with random data and use that. If running aggregate tests it is strongly suggested one use a -F option. Otherwise, the overhead spent creating and populating the temporary file will be included in the CPU utilization calculation. *) The configure script recognizes Solaris 11 and selects the correct CPU utilization mechanism - or rather it selects the same mechanism as is used in Solaris 10. Fix courtesy of Andrew Gallatin. *) Convert a number of struct sockaddr_in's to struct sockaddr_storage's and add requisite casts to deal with some abort problems on Windows and perhaps other platforms as well. Kudos to Alexander Duyck. *) One can now pass a value of 'x' to the global -f option to specify the units as transactions per second. This is the default for any request/response test, which is determined by there being a "double `r'" in the name - eg "RR," "rr," "Rr," or "rR." At present only the TCP_RR test actually looks for this to be set. *) One can request bits/bytes per second as the primary output of a TCP_RR test by setting the global -f option to [kmgKMG] as with any of the "STREAM" tests. This converts the primary throughput metric to a bitrate (byterate) following the verbosity rules for a STREAM test. Service demand remains usec/Transaction regardless of the setting of the global -f option. A verbosity level of 2 or more will cause the TCP_RR test to report calculated average RTT latency, transaction rate, and inbound and outbound transfer rates regardless of the primary units selected with the global -f paramter. If the primary output is transactions per second, the reported inbound and outbound transfer rates will be 10^6 bits per second, otherwise, they honor the setting of the global -f option. All of this is EXPERIMENTAL and subject to change without prior notice in future versions of netperf. *) Replace "break" with "break 2" in acinclude.m4 for a socklen macro *) The default for the requested socket buffer size is changed from 0 to -1 to enable passing a value of 0 under Windows, which tells that stack one wishes to enable copy-avoidance. *) Call fflush() on each interim result displayed in demo mode to make things happier for folks redirecting same to a file. From Dan Yost. *) In theory each distinct netserver child will have a debug log with its pid appended to the name, somewhat like what appears to happen under Windows. *) A new global, command-line option to netperf and netserver has been added. The -V option will cause netperf/netserver to display its version and exit. *) Setting -I without setting -i will now implicitly set the iteration minimum and maximums as if a -i 10,3 were set. Also, some further sanity checking on the bounds for each is made. *) Fixed a typo in the manual (found by Emir Halepovic) so the description for the -s and -S options properly specifies they affect the data connection. These are the Release Notes for Revision 2.4.3 of netperf: Things changed in this release: *) The UDP_STREAM test includes --enable-demo support, courtesy of patches from Scott Weitzenkamp. *) The nettest_dns.* files have been removed from the release and the repository. Those wishing to perform DNS server tests should migrate to netperf4 which has better support for DNS test. *) Fixes for compiling under Windows with Mingw/gcc courtesy of Gisle Vanem. *) A new global option - -N - has been added. When specified, this option will tell netperf to not bother to try to establish a control connection with a remote netserver. Instead, netperf will only attempt to make a data connection to the remote system. By default, this will be to the "discard" service for a "STREAM" or "SENDFILE" test, the "echo" service for a "RR" test and the "chargen" service for a "MAERTS" test. Any "remote" settings are changed to reflect their being unused in the test, and a "no control" tag is added to the test banner when -N is specified. This still needs to be propagated to other test files - at least for those for which it may make sense. *) The tests in nettest_bsd.c have been altered to not actually take timestamps and deltas in --enable-histogram unless the verbosity level has been set to actually display a histogram. This reduces the overhead measurably, even on systems with "fast" time calls, which _may_ mean that a future release of netperf may have histogram support enabled by default. This still needs to be propagated to other test files. Patches from the community would be most welcome :) *) Eliminate a bogus fprintf from the signal catching routine which was being executed when both intervals and demo mode were active at the same time. *) The nettest_ipv6.* files are no longer included in the source tar/zip file. IPv6 functionality has been subsumed into the nettest_bsd.* files for some time now. *) Use a higher resolution "time" source for HISTOGRAM support under Windows, courtesy of Spencer Frink. Prior to this it had no better than 10ms granularity which could lead to some rather strange looking results :) *) A bug fix reporting recv_size rather than send_size in TCP_MAERTS when CPU utilization was requested. *) A bug fix for buffer filling from a file to properly advance the buffer pointer when the file is smaller than the send buffer. *) Enable certain UDP tests which previously used unconnected sockets to use connected sockets. Courtesy of Shilpi Agarwal. *) The OSX CPU utilization code actually gets put into the tarball in a make dist now :) *) The check to make sure that getaddrinfo returned ai_protocol and/or ai_socktype's matching that which we requested is done for all socket and/or protocol types and a warning is emitted if it returns any which do not match. *) The linux CPU affinity code has been made capable of binding to CPU's >=32 on a 32-bit compilation and >=64 on a 64-bit compilation. *) More complete closing/redirecting of stdin/stdout/stderr/where in netserver to make it easier to launch netserver at the far-end of a remote shell. Courtesy of Hans Blom. *) Sendfile changes for Solaris courtesy of Andrew Gallatin. *) "spec" file support to generate RPMs courtesy of Martin Brown These are the Release Notes for Revision 2.4.2 of netperf: Things changed in this release: *) Fixes for floating point format differences, courtesy of George Davis. *) Additions for CPU util support on MacOS X, courtesy of Anonymous. *) Processor affinity is now supported on AIX 5.3 (perhaps earlier) via the bindprocessor system call. *) Fixes for test lockups with TCP_CRR and TCP_CC under Windows courtesy of Dikon Reed. *) Fixes to netcpu_looper.c to get it to actually compile :) *) Have netcpu_looper use the bind_to_specific_processor() call provided by netlib since that knows about more platforms than the code in netcpu_looper did. The looper CPU binding will use a mapping to handle cases where the CPU id's on the system may not be a contiguous space starting from zero. At present, the code that setups the mapping only knows about retrieving actual CPU ids under HP-UX. *) The netcpu_sysctl method becomes calibration-free, courtesy of Andrew Gallatin These are the Release Notes for Revision 2.4.1 of netperf: Things changed in this release: *) There is now a -B global command-line argument that will append its parameter as a string to the end of result lines when test banners have been suppressed. this is to make it easier to distinguish one result from another when aggregate restults are being run in parallel, without having to resort to having the individual results shell redirected to a file. This has been done for some of the tests in nettest_bsd.c, but not all of them, nor for the tests in the other nettest_mumble.c files. *) There is now an --enable-spin configure option that will enable intervals if not already enabled and will have the sender sit and spin in a tight loop until time for the next interval rather than wait for an interval timer to expire. This means it should be possible to have a much finer granularity on the interval, at the expense of an EXTREME increase in CPU utilization. (To the extent I'm considering disabling measurement of local CPU utilization when that mode is enabled, and bursts have been requested - your feedback on that topic would be most appreciated) If only --enable-intervals is used with configure, the old set the interval timer and wait method is still used. If --enable-spin is configured, the test banner will include "spin intervals" rather than the "intervals" from a plain --enable-intervals. The sit and spin will either use gettimeofday(), or gethrtime() if gethrtime() is available. This has been implemented in the tests of nettest_bsd.c but none of the others. Volunteers would be most welcome. I would entertain the notion of making the implementation a series of inline functions in netlib. This holds true for the demo mode - why will become clear when you look at nettest_bsd.c. While things are considerably cleaner than they were before, with reuse within nettest_bsd.c, there is no resuse with the rest of the nettest_mumble.c files. *) the -w option for the interval time now takes three optional suffixes. if the suffix is 'm' (eg 10m) it will assume the user has specified time in units of milliseconds. if the suffix is 'u' it will assume microseconds, and if 's' seconds. no suffix remains milliseconds for backwards compatability with previous netperf versions. *) It should be possible to successfully compile with --enable-intervals. These are the Release Notes for Revision 2.4.1 of netperf: Things changed in this release: *) netcpu_pstatnew.c has been altered to workaround a bug in the interrupt cycle accounting in HP-UX 11.23 that is not expected to be resolved until a later release. basically, some interrupt time is not counted, which means the sum of idle, user, kernel and interrupt is less than the cycles per second multiplied by the elapsed time. the workaround preserves the "no calibration required" nature of the pstatnew CPU utilization mechanism. you can see more in netcpu_pstatnew.c and/or in debug output. *) in netlib.c recv_response has been renamed recv_response_timed(addl_time) which is now used in calibrate_remote_cpu in place of the "sleep(40);recv_response()" sequence. This then allows the REM_CPU test to complete in less than 40 seconds when the remote's CPU utilization mechanism does not require calibration. The value of "addl_time" is added to the tc_sec field of the select() timeout. A "new" recv_response has been added that simply calls recv_response_timed(0) - this is to minimize the number of changes needed elsewhere in the code. *) hopefully, this release fixes problems people have been having with the configure script failing when picking a type for socklen_t. now, instead of generating an error, it emits a warning and simply tries socklen_t *) the configure script no longer looks for the size of an in_port_t *) netlib.c now has code to perform processor binding for Tru64, but the configure script may or may not detect it correctly. This means that one may have to edit the config.h file by hand to get the functionality. *) it is known that netperf will compile under Windows XP and 2003 using the DDK it is possible that netperf 2.4.1 will compile on a Windows system under VC++/Visual Studio. It might even work!-) See the README.window file for additional details. Things _NOT_ changed in this release: *) The automagic determination of the number and type of parameters to sched_setaffinity under Linux remains brittle at best. These are the Release Notes for Revision 2.4.0 of netperf: Things changed in this release: *) Netperf has been converted to use a configure script. Yes boys and girls, after 12 years of distributing netperf with just a makefile I have finally bitten the bullet and cast my fate to autoconf, automake, etc. To get the most basic netperf built all you should need to do is: cd to the netperf directory ./configure make and perhaps make install (Note, I've not done much with make install - I'm hemming and hawing over what the default installation location should be) Please keep in mind that this is the first time I've tried to use autoconf et al. I am sure there are things that should be done differently and would welcome any and all constructive criticisms. I suspect there are several places where I've not fully demonstrated being of the autoconf body - particulary as pertains to include files being in "#if mumble #endif" blocks. Fixes would be most welcome. *) Speaking of becomming one with various GNU tools, work on a new netperf manual has begun, with the source being a texinfo document that is converted to "all" the other formats. This resides in doc/ . *) The platform-specific parts of CPU utilization measurement have been broken-out into separate .c files and selected at configure time a la the pcap_mumble files of tcpdump. This makes src/netlib.c _much_ easier to read and the addition of new CPU utilization mechanisms much easier. *) New HP-UX 11.23 and Solaris 10 CPU utilization measurement mechanisms (called pstatnew and kstat10 respectively) need no calibration step. Both have variations on microstate accounting. HP-UX 11.23 still identifies the method in the headers as 'P' for pstat. The kstat10 method is identified as 'M' for Microstate. Scripts which make calibration runs with LOC_CPU and REM_CPU may continue to do so, they will just run forty to eighty seconds faster on platforms with the calibration-free CPU util mechanisms. *) Automatic detection of CPU utilization mechanism for HP-UX, Linux, AIX, *BSD and Solaris. If you do not like what the configure script selects, you can use --enable-cpuutil=<foo> . *) The "times" (aka 'T') CPU utilization mechanism has been removed. It was never very accurate at all, only showing CPU time charged to the process, and with interrupts and other network processing it is rarely chaged to a or the correct process. It and other methods may remain in the format_cpu_method() routine of src/netlib.c for historical purposes only. *) CAVEAT - the "kstat" mechanism is KNOWN TO BE BOGUS for Solaris. It does not include time spent processing interrupts, and networking benchmarks will generate at least a few of those... This affects _ALL_ versions of Solaris with kstat. So, do NOT trust any CPU util figures where netperf says the method was 'K' for kstat - unless perhaps it reports 100% CPU util. Solaris 10 takes a step in the right direction adding microstate accounting similar to what netperf uses on HP-UX 11.23. HOWEVER, Solaris 10's accounting for user/kernel/idle is done in _parallel_ with interrupt, which means they overlap. Doubleplusungood. Netperf attempts to compensate for that with some handwaving (src/netcpu_kstat10.c) *) Initial support for SCTP has been added with the SCTP_STREAM and SCTP_RR tests. These tests use the libsctp mechanisms for increased portability. It has been explained that libsctp should not impart all that much overhead and it does make things rather simpler. *) Netperf now uses getaddrinfo() to resolve hostnames and IP addresses. A replacement getaddrinfo() is provided for those platforms where the configure script cannot tell that getaddrinfo is present. There are cases where a host's getaddrinfo call may return results that ignore the hints for protocol. Netperf catches these and reports a warning so you can pester your OS source for fixes. Solaris getaddrinfo() seems to return results with SCTP procotol cleared. Mac OS X getaddrinfo botches when the service/port is specified as "0" so one must specify a port number on the netperf command line. AIX 5.something getaddrinfo has a different but similar problem with "0" as a port/service name as well. Linux 2.6 and HP-UX 11i getaddrinfo seem to be fine - at least as far as netperf goes :) *) A "Demo Mode" has been added to the main BSD Sockets/TCP/UDP tests: TCP_STREAM, TCP_MAERTS, TCP_SENDFILE, TCP_RR, TCP_CC, TCP_CRR and UDP_RR. It has not been added to UDP_STREAM. This mode is enabled with --enable-demo when configuring netperf, which activates a global "-D" option. By default, -D will cause interim results (throughput or transactions/s only, not CPU util) from the netperf's perspective to be emitted no sooner than once per second. An optional parameter can specify another interval in units (floating point) of seconds: -D 1.5 will make the reporting interval at least 1.5 seconds. This mode makes no use of explicit interval timers since that can be so, well fun on different platforms. Instead, an initial guess of how many units of work must be done to consume the desired reporting interval is made, and that guess is refined throughout the entire test. If something happens to dramatically slow-down the test, the reproting interval may become must larger for a few intervals. When things speed-up it is detected very quickly. As with the --enable-historgram support, if gethrtime() is available on the platform, it will be used in lieu of gettimeofday(). In any case, the number of calls to gettimeofday()/gethrtime() is much, Much, MUCH smaller than for --enable-histogram so while there may be a measurable effect on the results, it should be rather small. *) The global -H option has been enhanced to take an optional address family specification for the control connection: -H <remote>,<family> Unlike other comma-separated options, where specifying only one thing will set both, here specifying only one thing will be ass-u-me-d to be the <remote> and will leave <family> defaulted (AF_UNSPEC). Family can be specified as "4" or "inet" for AF_INET, "6" or "inet6" for AF_INET6. *) A new global -L option has been added to specify the local name/IP and/or address family for the control connection: -L <local>,<family> Unlike other comma-separated options, where specifying only one thing will set both, here specifying only one thing will be ass-u-me-d to be the <local> and will leave <family> defaulted (AF_UNSPEC). Family can be specified as "4" or "inet" for AF_INET, "6" or "inet6" for AF_INET6. *) Test-specific -H and -L options are present for the TCP, UDP and SCTP tests, which are now (intended to be) IP protocol version agnostic. *) Global -4 and -6 options will set the both the local and remote address family to either AF_INET or AF_INET6 respectively. *) Test-specific -4 and -6 options have been added for TCP, UDP and SCTP tests. *) Since the basic TCP UDP and SCTP tests are no longer IPv4-only, the nettest_ipv6.[ch] files are only included in the source distribution for historical interest. *) The main test banners for the TCP, UDP and SCTP tests have been enhanced to give both local and remote addressing information for the data connection. *) Compilation under Windows is likely FUBAR at this point. I _hope_ to start trying to do builds under the DDK soon, but am not sure when I'll be able to start. Any and all assistance you can give there would be most welcome. *) Various and sundry fixes. TCP_RR should no longer go into an infinite loop when you abort netperf. I'm sure there are others. *) Unix domain socket tests are compiled-in with --enable-unix=yes at configure time. *) DLPI tests are compiled-in with --enable-dlpi=yes at configure time. *) XTI tests are compiled-in with --enable-xti=yes at configure time. Things not changed in this release: *) Seems like everything has changed :) These are the Release Notes for Revision 2.3pl2 of netperf: Things changed in this release *) One can bind netperf or netserver to specific CPUs with the -T option. This is a generalization of some HP-UX and netserver specific work from 2.3pl1. *) Extend the kludge to workaround the Linux setsockopt/getsockopt bizzarreness to the socket buffer sizes for the remote side in addition to the local side. *) Fix the lack of initialization of times_up in recv_tcp_maerts() that caused confidence intervals to fail miserably. *) Other misc fixes - than you to all of you who sent them. These are the Release Notes for revision 2.3pl1 of netperf: Things changed in this release *) The bind() call in create_data_socket() in the file nettest_bsd.c is no longer conditional on the user's specifying an IP address or port number to which the data socket should be bound. This fixes the "connection refused" errors in the UDP tests. *) Some experimental code to allow one to specify a CPU to which the remote netserver should be bound. This is intended to allow one to get greater certainty (as in confidence intervals) on SMP systems. At present the functionality is HP-UX specific. Submittals of changes for a more general approach are welcomed. These are the Release Notes for revision 2.3 of netperf: Things changed in this release *) The user can now specify local and/or remote port numbers for the data connection using the -P test-specific option. This is to support those folks who want to run netperf through those evil, end-to-end-breaking things known as firewalls... :) This changes the format of some of the control messages, hence the bump in the update number in the VUF. While it may be possible to mix 2.3 and pre-2.3 netperf and netserver, it is not supported. *) The user can now specify local and/or remote IP addresses for the data connection using the -I test-specific option. This is to support those folks who want to run netperf through those evil, end-to-end-breaking things known as firewalls... :) This changes the format of some of the control messages, hence the bump in the update number in the VUF. While it may be possible to mix 2.3 and pre-2.3 netperf and netserver, it is not supported. *) Set DL_mumble message priorities in the DLPI tests *) Fix error return check for getaddrinfo() *) Those systems with gethrtime() can define -DHAVE_GETHRTIME to use gethrtime() instead of gettimeofday() and reduce the measurement overhead when enabling the -DHISTOGRAM functionality. *) The default for -DHISTOGRAM compilation now adds a UNIT_USEC and TEN_USEC row and renames TENTH_MSEC to HUNDRED_USEC. If you want the old behaviour add -DOLD_HISTOGRAM to CFLAGS. *) Add missing '!' in the recv_udp*_stream so we recognize the end of a timed test correctly. *) Replace "||" with "&&" to fix an infinite loop in recv_tcp_conn_rr() most likely introduced in 2.2pl5. *) Code has been added to kludge around the bug in Linux getsockopt() where it almost always returns twice the value for which one asks unlike virtually every other stack on the face of the planet. This was doing some unpleasant things to tests in which confidence intervals were requested. Things not changed in this release *) Lots :) These are the Release Notes for revision 2.2pl5 of netperf: Things changed in this release *) Improved (perhaps even usable :) support for Windows, including compilation and run on Win64. *) Fixes for MacOS X and FreeBSD Things not changed in this release *) Specifying the port number(s) for the data connection These are the Release Notes for Revision 2.2pl4 of netperf: Things changed in this release *) USE_SYSCTL available on suitable FreeBSD releases to measure CPU utilization without having to resort to -DUSE_LOOPER. *) Include Solaris 9 with the Linux sendfile path under -DHAVE_SENDFILE This still outstanding in this release *) Knowing why signals are not interrupting socket calls under OpenVMS. A quick try to use threads for timing a la Win32 worked, but also cut performance in half. Any and all assistance in this area would be most welcome. These are the Release Notes for revisoin 2.2pl3 of netperf: Things changed in this release *) I started practicing what I preach and will set SO_REUSEADDR before netserver tries to bind to its well-known port. *) Initial port to OpenVMS. This includes support for the OVMS Auxilliary server (inetd replacement). See README.ovms for more details on what is involved in compiling and running netperf under OpenVMS. *) Testname comparisons are now case insensitive. This is a side effect of OpenVMS downshifting commandlines to lowercase. I made the change and decided it was OK to keep it that way, even though for OpenVMS one _has_ to set the right defines to disable that downshifting or the command-line options will not work. For example "-H" will become "-h" which isn't quite the same thing... *) Misc fixes for nettest_ipv6.c. *) Support for sendfile() under Linux Thins I would like to have changed but did not know how or didn't have time: *) Allow netserver to run as a standalone daemon under OpenVMS *) Allow netserver to run as a standalone daemon under Windows *) Rediscover an inetd-like facility for Windows *) Figure-out how to get low-overhead, accurate, per-CPU utilization figures under OpenVMS *) Get the UDP_RR and UDP_STREAM tests to work under OpenVMS, and get the TCP_RR test to work based on time rather than transaction count. There is some bug (possibly in OpenVMS?) where the SIGALRM fires, but a socket call will not return an EINTR. Things that changed prior to this release: *) Addition of the TCP_MAERTS test - this is a TCP_STREAM test where the data flows from the netserver to the netperf rather than from the netperf to the netserver. This can be useful in those situations where netperf (netserver) is installed on a remote system, but the tester has no shell access and wishes to get performance data for the path from netserver to netperf. These are the Release Notes for the 2.2 revision of netperf: Things changed in this release *) Various and sundry bugs fixed (in theory) for platforms such as FreeBSD and Linux. If I left-out your bug fix, it was purely accidental - my mind has a very small cache, and sometimes I will "lose" email in the shuffle. *) Initial support for sendfile() on HP-UX. This test will use the sendfile() call instead of send() to send data to the remote. Netperf "lies" to netserver and calls it a TCP_STREAM test since what netserver needs to do is exactly the same. A future patch may change that and simply have netserver call the same routine for both test types. Kudos to Charles Harris for the initial prototype. *) The Fore ATM API and HiPPI tests have been dropped from the distribution. Things I would have liked to have changed, but did not have time for: *) Conversion of the source and makefile to use the GNU configure/autoconf utility to make it easier for folks to build by not having to edit makefiles... You will notice that I have started to switch from "DO_MUMBLE" to "HAVE_MUMBLE" as always - happy benchmarking, rick jones <raj@cup.hp.com> --------------------------------------------------------------------- These are the Release Notes for the 2.1pl3 revision of netperf: *) An OBOB (Off By One Bug) in netlib.c that was causing a core dump on Irix should be fixed. *) Irix systems should now be able to determine the number of CPU's present automagically (code from outside, not tested yet because I have no MP Irix systems at my disposal) *) An alpha version of a TCP_CC test has been added - this is a TCP_CRR test with out the "RR." *) The -Ae has been removed from the default makefile. If someone has a nice way to automagically generate the correct makefile for different platforms I would like to learn how. happy benchmarking, rick jones <raj@cup.hp.com> ---------------------------------------------------------------------- These are the Release Notes for the 2.1 revision of netperf: Things Changed in this release: *) The XTI (Version 2 of the spec) tests are now documented in the manual. *) The TCP_CRR (Connect Request/Response) test is now documented in the manual, including a description of how it mimics the behaviour of http (the protocol underlying the WWW). *) Support for for Windows NT 3.51 OS in the BSD Sockets tests (ok, so they are really Winsock in that case :). Other test suites may be ported as required/desired/appropriate. *) Tests for TCP and UDP, using the IPv6 extensions to BSD sockets are included in this release. They are included by adding -DUSE_IPv6 to the makefile and recompiling. *) Support for a "long long" datatype should only be required for -DUSE_PSTAT compilation which is an HP-UX only thing. The *unbundled* HP compilers from at least "HP92453-01 A.09.61 HP C Compiler" and later should have the required support. The bundled compiler may not. GCC should work - check the archives listed in the comp.sys.hp.hpux FAQ for copies. The FAQ is archived on rtfm.mit.edu under the path pub/usenet/comp.sys.hp.hpux. *) A "proper" fix for double data type alignment has been included. *) A new script is included with this release which can be used to measure aggregate TCP_RR performance (multiple, concurrent instances of the TCP_RR test). A related use of this script would be measuring MP scaling. A single-byte TCP_RR test is good for this purpose for two reasons: 1) it excercises the control/protocol paths heavily without using much in the way of data copies which may be easier to scale. 2) most systems can easily saturate cards with bandwidth, but not so easily with request/response Of course, feedback on this is most welcome. *) When measuring CPU utilization, the units for service demand have been changed from milliseconds (designated ms) of CPU per unit (KB or Transaction) to microseconds (desginated us). *) For accurate reporting of service demand, netperf needs to know the number of CPU's present on a system. On some systems (HP-UX), this is automatic. For others (All), it is necessary to add a global "-n <numcpu>" option to both netperf and netserver. !! IF THIS IS LEFT-OUT CPU UTILIZATION AND SERVICE DEMAND FOR !! !! MULTI-PROCESSOR SYSTEMS WILL BE WRONG. !! If you know of ways to programatically determine the number of active CPUs on a system, please let the author Rick Jones <raj@cup.hp.com> know. *) other things I've probably forgotten :) Things Not Changed in this release: *) The ancillary test suites are essentially unchanged - DLPI, HiPPI/LLA, Unix Domain, and Fore ATM API. Unless there is much interest expressed in these tests, 2.1 may be the last release in which they are included. The order of retirement would likely be Unix Domain, HiPPI/LLA, Fore ATM API, and then DLPI. Miscelaneous Comments: *) The -DUSE_LOOPER CPU utilization _seems_ to be nice and low-impact on HP-UX, Digital Unix, and IRIX. It does not yet seem to be low-impact on Solaris (I need an example of priocntl usage), AIX (setpri only works if you are root), and NT (not sure of the reason). Help with those problems would be most appreciated.