Published: Jul 05, 2024

Unlocking real-time chat with a 1M-context Llama3-70B model on AMD's MI300X