Use CPUDetect.h and bSSSE3 to fix «unknown machine instructions» on DX9 plugin.
~93% speed improvement (over plain C++) in RGBA -> BGRA conversion for processors that support SSSE3 (Core or newer).
About 40% faster than the SSE2 implementation.
May not be required in future but good for reference.
45% speed improvement in RGBA -> BGRA conversion over reference C code. This might turn out to be unnecessary once I refactor the CreateTexture2D/ReplaceTexture2D to call the texture decoder instead of vice-versa and have the SSE2 code convert directly to BGRA and write directly to the D3D allocated texture.
Need to investigate source texture alignment and where to introduce that, if it is possible.
Add anvideo config option to automatically resize the render window to the size of the game’s output resolution. This avoids artifacts that appear when the render target is scaled to fit a window of a different size.