The coordinates have to be in 3D, since there are height elements. The buildings are all texture mapped, and the way they scale shows that they are textured the same way any object in 3D is texture mapped.
You could do this more simply with a non-3D engine, but it wouldn't give the same feeling: buildings and anything else that sticks up off the ground would get bigger, but their perspective wouldn't change.
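To show what I mean by the perspective changing, here is a rough sketch in Python. It is not how any particular game does it; it assumes a camera looking straight down at the ground, and the `project` function, the `focal` parameter, and the sample coordinates are all made up for illustration.

```python
def project(point, camera, focal=1.0):
    """Project a 3D point onto the screen of a camera looking straight down.

    Assumption: z is height above the ground and the camera sits at
    height camera[2], so a taller point is closer to the camera.
    """
    x, y, z = point
    cx, cy, cz = camera
    depth = cz - z  # vertical distance from the camera down to the point
    if depth <= 0:
        raise ValueError("point is at or above the camera")
    scale = focal / depth  # closer points get a larger scale factor
    return ((x - cx) * scale, (y - cy) * scale)


# One corner of a hypothetical building: same x/y, different heights.
base = (4.0, 0.0, 0.0)
roof = (4.0, 0.0, 5.0)

for camera_height in (20.0, 10.0):
    camera = (0.0, 0.0, camera_height)
    base_x, _ = project(base, camera)
    roof_x, _ = project(roof, camera)
    # In a flat 2D scaler this ratio would stay constant while zooming.
    print(camera_height, base_x, roof_x, roof_x / base_x)
```

As the camera drops from 20 to 10, the roof-to-base ratio goes from about 1.33 to 2.0, so the roof spreads away from the base as well as growing. A plain 2D zoom would scale both by the same factor and the building would look flat.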
-Geoff