
FPS drop

Started by Spark, March 21, 2000 12:57 PM
8 comments, last by Spark 24 years, 5 months ago
Hi, I'm trying to add the values of two large arrays like this:

    for (i = 0; i < XRES*YRES; i++)
    {
        fb1[i].r = fb1[i].r + fb2[i].r;
        fb1[i].g = fb1[i].g + fb2[i].g;
        fb1[i].b = fb1[i].b + fb2[i].b;
    }

This is done once per frame and causes the FPS to drop from 42 to 31 FPS. It feels like a really bad way of doing it, so do any of you know of anything that may speed it up?

// Spark
hmm, the code got mangled when I posted it: every line is supposed to index both arrays with [i]

// Spark

Edited by - Spark on 3/21/00 1:02:09 PM
Spark, that's just because the [i] got interpreted as an italics tag.

I'm not positive this will give you better results (depends on how good your compiler is), but here's a different way of doing it. I don't know what types fb1 and fb2 are, so I'm going to call them FBType.

    FBType *pfb1 = fb1;
    const FBType *pfb2 = fb2;
    const FBType * const pend = fb1 + (XRES*YRES); // points to end of array
    while (pfb1 != pend)
    {
        pfb1->r += pfb2->r;
        pfb1->g += pfb2->g;
        pfb1->b += pfb2->b;
        pfb1++;
        pfb2++;
    }

Not sure you'll get an improvement, but that's at least another way to do it. I'm kinda gambling on the fact that it's faster to move pointers along than to index into the array, which compilers SHOULD do, but maybe it's confused because there are multiple fields?

You could also try using the register keyword for the pointers, but I've never used it so I can't say if it would get you anything.

Hope this helps. Let me know. And you're using a profiler, right?
Thanks for the reply Stoffel, but it only gives a minimal speed increase. =/
It eats up about 25% of the time spent calculating each frame, so it must be optimized or removed in some way.

// Spark
Are you positive this is where the bottleneck is? Are you using a profiler tool to tell you this? Only reason I ask is that I can't see any more efficient way to do the task you've presented.
I'm using a profiler and it spends about 23.2% of the time in that function.

// Spark
Consider what you're doing. At 640x480, you're doing multiple memory accesses on almost 310,000 elements. This is a cache-coherency nightmare.

In general you want to avoid doing anything on a per-pixel basis in a realtime situation.
Volition, Inc.
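
As a rough illustration of keeping the pass as memory-friendly as possible: a minimal sketch, assuming (the thread never shows it) that FBType is a plain struct of three unsigned char components with no padding, so both buffers can be walked as one contiguous run of bytes. If the components are wider or the struct is padded, this doesn't apply.

    struct FBType { unsigned char r, g, b; };   // assumed layout, not shown in the thread

    // Add src into dst component by component, touching memory strictly linearly.
    // The addition wraps on overflow, exactly like the original per-field adds would.
    void AddBuffers(FBType *dst, const FBType *src, unsigned pixels)
    {
        unsigned char *d = reinterpret_cast<unsigned char *>(dst);
        const unsigned char *s = reinterpret_cast<const unsigned char *>(src);
        const unsigned total = pixels * 3;      // r, g, b per pixel, assuming sizeof(FBType) == 3
        for (unsigned i = 0; i < total; i++)
            d[i] = static_cast<unsigned char>(d[i] + s[i]);
    }

Even then the cost is mostly memory bandwidth, so as the post above says, the bigger win comes from touching fewer pixels per frame rather than from rewriting the loop.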
Also, if one (or both) of the arrays is in video memory you will get crappy performance no matter what you do. In general reads from video memory are to be avoided like the plague and if you are going to do reads on a surface it should NOT be in video ram.

If that doesn't help, try to think of another way to achieve the same effect, or post it up here so we can try and optimize the algorithm itself.

PreManDrake
That's a definite for the video ram reply. If you are accessing video ram (it looks like you are), try just making the whole array (or surface, if you're using DX) in system memory. It'll be slower when you go to blit, but definitely faster than manipulating pixels one at a time in video memory the way you're doing now.


ColdfireV
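
For what it's worth, with DirectDraw (the "if you're using DX" aside above suggests that's what's in play; treat it as an assumption) the surface's memory placement is chosen at creation time. A sketch of requesting an off-screen surface explicitly in system memory, where pDD, width and height are placeholders:

    #include <windows.h>
    #include <ddraw.h>   // link with ddraw.lib

    // pDD is assumed to be an already-initialised LPDIRECTDRAW7 interface.
    LPDIRECTDRAWSURFACE7 CreateSysMemSurface(LPDIRECTDRAW7 pDD, DWORD width, DWORD height)
    {
        DDSURFACEDESC2 ddsd;
        ZeroMemory(&ddsd, sizeof(ddsd));
        ddsd.dwSize = sizeof(ddsd);
        ddsd.dwFlags = DDSD_CAPS | DDSD_WIDTH | DDSD_HEIGHT;
        // DDSCAPS_SYSTEMMEMORY keeps the surface out of video ram,
        // so CPU reads during the mixing pass stay cheap.
        ddsd.ddsCaps.dwCaps = DDSCAPS_OFFSCREENPLAIN | DDSCAPS_SYSTEMMEMORY;
        ddsd.dwWidth = width;
        ddsd.dwHeight = height;

        LPDIRECTDRAWSURFACE7 pSurface = NULL;
        if (FAILED(pDD->CreateSurface(&ddsd, &pSurface, NULL)))
            return NULL;
        return pSurface;
    }

The tradeoff mentioned above still holds: the final blit from a system-memory surface to the screen is slower than a video-to-video blit, but it only happens once per frame.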
I'm trying to use lightmaps in software by first rendering the original textures and then the lightmaps into different arrays, and at the end mixing the colors of the lightmap array with the texture array. Both surfaces are in system memory and the resolution is 320x240. I can't figure out any way to do this without adding the colors at the end. I could of course mix the textures directly in the texture-mapping function, but that's the same thing, I guess.

// Spark
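
Since both buffers are already in system memory and the add itself is the hot spot, one era-appropriate option (not raised in the thread, so purely an aside) is MMX: PADDUSB adds eight unsigned bytes at a time with saturation, which also keeps the colors from wrapping when texture plus lightmap overflows 255. A rough sketch, assuming both buffers are tightly packed 8-bit components and the byte count is a multiple of 8:

    #include <mmintrin.h>   // MMX intrinsics

    // dst += src with unsigned saturation, eight bytes per iteration.
    void AddSaturate(unsigned char *dst, const unsigned char *src, int bytes)
    {
        __m64 *d = (__m64 *)dst;
        const __m64 *s = (const __m64 *)src;
        for (int i = 0; i < bytes / 8; i++)
            d[i] = _mm_adds_pu8(d[i], s[i]);   // PADDUSB: clamps at 255 instead of wrapping
        _mm_empty();                           // EMMS: restore the FPU state before any float math
    }

Whether a straight add is the right way to mix the lightmaps is a separate question; the point is only that packing the work cuts the loop overhead and the number of memory operations per pixel.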

