LLM Inference Benchmark Leaderboard
Community-sourced benchmarks for running large language models on consumer NVIDIA GPU hardware. Standardised methodology. Reproducible results.
Submit your results:
pip install cgpubench && cgpubench run --model mixtral-8x7b --submit