Re: commstime not scaling

Subject: Re: commstime not scaling

From: Alan Grover <alan.grover@xxxxxxxxx>

Date: Mon, 28 Nov 2005 13:58:56 -0500

Delivery-date: Mon, 28 Nov 2005 19:00:11 +0000

Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=CIWTX02XpjgHjylN+IRzNsdtCLk1pj1q5gojp2IJRYeB7/qIGqslhoztbx4QPYdr+XTIRt4V92VzQdNXn0qSA5opmhRFraMtqmpEdFrQQABThEgdxKBhE5USdyWYadbkBNPWxpg7GKVNu1w4i5++8oEdG397oiFDXHweko1QyKY=

Envelope-to: phw@xxxxxxxxxxxxxxxx

In-reply-to: <002601c5f3aa$726fb300$0e00a8c0@xxxxxxxxxx>

References: <161915060511250700n5e387227n1f2d085b0ea24e9d@xxxxxxxxxxxxxx> <002601c5f3aa$726fb300$0e00a8c0@xxxxxxxxxx>

Sender: owner-occam-com@xxxxxxxxxx

On 11/27/05, Ruth Ivimey-Cook <Ruth.Ivimey-Cook@xxxxxxxxxx> wrote:

Alan,

My first though on this problem is that your Dell could easily be a Pentium-III, and the Compaq is probably a Pentium 4. The P4 is widely noted for having a very long pipeline (some 24 stages if memory serves) while the P3 is under half that. Commstime is a benchmark with an extraordinarily large number of jumps in it; almost none of the code is inline. Therefore, it makes sense that the P4 will have a harder time of it than the P3.

You're right to worry about the memory bandwidth; the pipeline misses will cost much more if the code is missing L1 cache much.

I too have found that xterm is consistently the fastest terminal around, beating even a plain console in some cases.

Hope this helps,

Ruth

I was recently comparing commstime values for my python implementation of CSP-style primitives and kroc and came away with some surprising (to me anyway :o) results. I tried the commstime metrics on two different machines, a 2.4 GHz Compaq laptop and a 1.0 GHz Dell.

for the python implementation (using threading.Thread):

Compaq: 935 millseconds for the commstime loop
Dell: 895 milliseconds

for kroc (1.4.0-pre2):

Compaq: 440 nanoseconds
Dell: 385 nanoseconds

It wasn't terribly suprising to me that the results for the python implementation would be similar (no chance it would fit in cache, using OS scheduled threads, etc), but it was suprising to me that results for kroc were similar on both machines and that in both cases the slower machine had better results.

References:

commstime not scaling
- From: Alan Grover
RE: commstime not scaling
- From: Ruth Ivimey-Cook