5 models · 1 goal · 1 judge.
Pick any subset of 7 cross-vendor LLMs across 3 providers. They race on the same prompt in parallel. A judge model scores them, picks a winner, and writes a merged answer.
JUDGE
[0] openai/gpt oss 120b
[1] meta llama/llama 4 scout 17b 16e
[2] openai/gpt oss 20b
[3] mistral large
[4] gemini 2.5 flash