# -*- mode: org -*- # Copyright (C) 2012 International Business Machines Corporation and Others. All Rights Reserved. How Expensive Is It? * Introduction ** The purpose of this test is to answer the question, "How expensive, relatively speaking, is ICU operation X?" ** ICU tests are compared with general purpose CPU operations to attempt to factor out differences between systems and load conditions ** Different ICU operations will have different levels of dependence on CPU, memory, disk, etc. So nothing can perfectly factor these conditions out. * Running the Test ** Simply run "make check" in this directory, icu/source/test/perf/howExpensiveIs/ ** Try to minimize other CPU loading throughout the test ** The test will take some time to run! ** Test runs outside of a margin of error will be thrown out. So, this will tend to produce more accurate results. ** After some time, the file howexpensive.xml will be created (an example is attached as Appendix I) ** The results may be read directly or processed such as with xslt. ** XML file contents: *** Element <tests>: the outermost element. *** Attribute 'icu': gives the basic ICU version number. **** Element <test>: a particularized test. ***** Attribute 'name': names the particular test, see howExpensiveIs.cpp for details ***** Attribute 'standardizedTime': The SieveTest by definition has a standardized test of 1, it runs a prime number sieve as a benchmark. All other standardizedTimes are normalized against this value. ***** Attribute 'realDuration': the actual duration, in seconds, of the test. ***** Attribute 'marginOfError': the amount +/- error for the real duration. Gives an idea of how much variability was in the test. ***** Attribute 'iterations': gives the total number of iterations run. **** Element <icuSystemParams>: This element gives the full details of the target platform, in the XML format produced by the 'icuinfo' tool. The contents are informative only and not documented here. * Analysis ** The data shows that, for example, parsing a number and opening the GB18030 converter are about the same cost. It also shows that opening a number formatter is about 60 times as expensive as formatting a number. ** Appendix II shows a .CSV (spreadsheet) file which shows analysis of a sample run between different systems. ** The Variation column for each Target system was calculated with the formula: "(Control-Target)/Control" where Control and Target are the standardized times for the Control and Target systems, respectively. * Appendices ** Appendix I: Sample File <?xml version="1.0" encoding="UTF-8" ?> <tests icu="49.0.2"> <!-- Copyright (C) 2011, International Business Machines Corporation and others. All Rights Reserved. --> <test name="SieveTest" standardizedTime="1.000000" realDuration="0.022804" marginOfError="0.000073" iterations="1000000" /> <test name="NullTest" standardizedTime="0.000017" realDuration="0.000000" marginOfError="0.000000" iterations="1000000" /> <test name="NumParseTest" standardizedTime="77.869922" realDuration="1.775721" marginOfError="0.011742" iterations="1000000" /> <test name="Test_unum_opendefault" standardizedTime="4855.974258" realDuration="110.734117" marginOfError="0.057131" iterations="1000000" /> <test name="Test_ucnv_opengb18030" standardizedTime="70.488403" realDuration="1.607395" marginOfError="0.009261" iterations="1000000" /> <icuSystemParams type="icu4c"> <param name="copyright"> Copyright (C) 2011, International Business Machines Corporation and others. All Rights Reserved. </param> <param name="product">icu4c</param> <param name="product.full">International Components for Unicode for C/C++</param> <param name="version">49.0.2</param> <param name="version.unicode">6.1</param> <param name="platform.number">4000</param> <param name="platform.type">Other (POSIX-like)</param> <param name="locale.default">en_US</param> <param name="locale.default.bcp47">en-US</param> <param name="converter.default">UTF-8</param> <param name="icudata.name">icudt49l</param> <param name="icudata.path">../../data/out/build/icudt49l</param> <param name="cldr.version">21.0</param> <param name="tz.version">2011n</param> <param name="tz.default">America/Los_Angeles</param> <param name="cpu.bits">64</param> <param name="cpu.big_endian">0</param> <param name="os.wchar_width">4</param> <param name="os.charset_family">0</param> <param name="os.host">x86_64-unknown-linux-gnu</param> <param name="build.build">x86_64-unknown-linux-gnu</param> <param name="build.cc">gcc</param> <param name="build.cxx">g++</param> </icuSystemParams> </tests> ** Appendix II: Analysis.csv http://bugs.icu-project.org/trac/ticket/8653,"""Control"", linux i7 Intel(R) Core(TM) i7-2720QM CPU @ 2.20GHz",MacBook 2.4ghz (Core2D),MacBook 2GhzCore2,AIX Power,MB 2.4 Variance,MB 2 variance,AIX Variance SieveTest (=1.0),1,1,1,1,0.00%,0.00%,0.00% NullTest (=0.0),0,0,0,0.08,#DIV/0!,#DIV/0!,#DIV/0! NumParseTest,74.10642,21.220191493,56.912133,85.423525612,71.37%,23.20%,-15.27% Test_unum_opendefault,4801.617798,1912.860018319,4522.900036,2580.805294162,60.16%,5.80%,46.25% Test_ucnv_opengb18030,65.268309,30.547740077,84.075584,51.587649619,53.20%,-28.82%,20.96% Test_unum_openpattern,4394.214773,1735.453339382,4472.298154,2263.471671239,60.51%,-1.78%,48.49% Test_ures_openroot,75.302253,23.773982586,71.248439,57.471889114,68.43%,5.38%,23.68% ** Appendix III: Revision History *** Feb 2012, ICU 49, srl: First revision