Symbiotic Scheduling of Concurrent GPU Kernels for Performance and Energy Optimizations