On the other hand, check out: http://www.geocities.com/SiliconValley/Way/6422/optimizations.htm
It says a shift is 1 clock. I'm assuming a multiply is more. I'm also assuming you had optimizations off, but you might want to see what code the compiler made for you, and that it didn't substitute an ADD for the multiply. (Try multiplying by 64 and << by 6)..I'd be interested to see what you get from doing that It's also possible that the multiplication was better suited for the dual pipeline, but probably not if it used a MUL.
-ns
[This message has been edited by NightShade (edited December 22, 1999).]