Discussion about this post

User's avatar
Luke's avatar

What about avx-512 performance? (Yeah, also for LLM inference!)

Expand full comment
ABC8's avatar

Could you benchmark llama.cpp/olllama?

Expand full comment
7 more comments...

No posts