The Decode Phase of LLM Inference Is Memory-Bound