Physical to video memory blit/HW accel
It's always best to use as much DirectX in your DirectX programs as possible as new cards add new features all the time. By doing so, you get at least a limited degree of support without recompiling.
Also, you may want to check to see if your memcpy() copies DWORDs, WORDs, or BYTEs at a time. Most compilers use DWORDs for 32 bit apps, but I think some don't. The only other thing I can think of is if you're copying unaligned data, DirectX MAY do some dynamic ordering to at least TRY to setup aligned copying, which if it did would beat out your unaligned memcpy. Always try to keep mem movement aligned if possible.
Rock
As Rock suggested I checked out the DX Device Viewer tool, which tells me that Sys->Vid memory Blt's are accelerated. I was also wondering (technically) how this would be accelerated, since the bus is limited to a certain speed anyway. If anyone has any thoughts there I'd be interested to hear about it as I don't know a lot about video hardware acceleration/capabilities.
Also thanks for the suggestion on using MMX Queasy, I'm going to look into that.
Phoenix
Phoenix
"The IDirectDraw4::GetCaps method fills in the capabilities of the device driver for the hardware and the hardware-emulation layer (HEL)."
From the filled DDCAPS structure check if dwCaps has DDCAPS_CANBLTSYSMEM 'flag checked' or something
"DDCAPS_CANBLTSYSMEM
Indicates that display hardware is capable of blitting to or from system memory."
You could check that and then use preferred blitting method, am I right?
------------------
"Tank, I need an exit. Fast!"