This project evaluates an auto-tuning tool that explores compile-time and runtime parameters to identify the best-performing configurations on GPUs. Experiments so far have been conducted on NVIDIA GPUs; to achieve a more comprehensive performance comparison, I plan to extend the evaluation to AMD GPUs. I therefore request access to the AMD GPUs on the Dardel supercomputer to run these experiments and analyze cross-vendor performance characteristics.