Exploring Handwritten PTX Code for GPU Optimization in CUDA