Reland "gpu, cmaa: reduce one-copy", "gpu, cmaa: copy RGBA8 texture via glCopyTexSubImage2D() instead of imageStore()."

Original CL:
https://codereview.chromium.org/2298613010 : use glCopyTexSubImage2D, and fix Pri1 bug
https://codereview.chromium.org/2405893002 : optimize further

Revert CL:
https://codereview.chromium.org/2456913002 : glCopyTexSubImage2D regressed performance; the optimization CL was also reverted due to a conflict.

Changes:
- reland the Pri1 bug fix as-is
- reland the further optimization as-is
- don't introduce glCopyTexSubImage2D; use the existing copy shader instead.

BUG=535198, 642290, 659438
CQ_INCLUDE_TRYBOTS=master.tryserver.chromium.linux:linux_optional_gpu_tests_rel;master.tryserver.chromium.mac:mac_optional_gpu_tests_rel;master.tryserver.chromium.win:win_optional_gpu_tests_rel

Review-Url: https://codereview.chromium.org/2465963002
Cr-Commit-Position: refs/heads/master@{#428826}
2 files changed