LLM Inference Benchmark Leaderboard

Community-sourced benchmarks for running large language models on consumer NVIDIA GPU hardware. Standardised methodology. Reproducible results.

Submit your results: pip install cgpubench && cgpubench run --model mixtral-8x7b --submit