Discussion about this post

Luke

What about AVX-512 performance? (Yeah, also for LLM inference!)

ABC8

Could you benchmark llama.cpp/Ollama?
