The human brain effortlessly converts spoken words into meaning, requiring minimal energy while achieving remarkable accuracy. Replicating this feat in artificial systems has proved challenging. Although speech recognition technology has advanced significantly, the underlying computing architectures remain fundamentally inefficient, demanding considerable power and computational resources.