Job Description
As an ML Performance Engineer, you will optimize inference performance across the Akamai Inference Cloud. Your focus will be at the intersection of speed and accuracy—applying techniques like quantization, speculative decoding, and hardware-aware scheduling to maximize throughput and minimize latency. You will collaborate closely with hardware performance engineers to deliver end-to-end optimization.
