Strait: Perceiving Priority and Interference in ML Inference Serving — ThinkLLM