Llama cpp parallel inference. 1 vLLM We Meta Llama 3 8B Instruct (GGUF, Q4_K_M) Production-ready G...