Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s
📅 2024-10-25 ⚓ Hacker News 🌐 Source 🖼️ Load Image