DS: ARM Optimised blitters/scalers
|Reported by:||SF/robinwatts||Owned by:||agent-q|
The attached file contains ARM versions of the following routines from platform/ds/arm9/source/blitters.cpp
I should say up front that I don't personally have a DS, so I can't compile and test these - they have therefore been written blind.
All the files with the exception of the last one (which requires it's parameters reordering) should pretty much drop in as replacements for the ones that are there.
asmDrawStripToScreen: I believe this should be slightly faster - 2 cycles faster in the short loop case, 6 cycles faster in the long loop case. (Assuming that GCC is doing a perfect job - in the case where the compiler was fumbling stuff before we'll be doing better now).
asmCopy8Col: Should be at least 8 cycles a loop faster.
The rescale functions should be significantly faster too; the core calculations live entirely in registers, and I use a trick to calculate R G and B all at the same time.
If you have questions, or comments on this code, or I need to change (fix!) it please don't hesitate to contact me.
Ticket imported from: #1729975. Ticket imported from: patches/793.