Single-shot · temp 0.7 · real votes · identical prompts·How we test →
Single-shot · temp 0.7 · real votes · identical prompts·How we test →
Tests an AI's ability to make educated estimates based on technical knowledge
Estimate the total number of FLOPs required to train GPT-3 without searching online. Explain your reasoning step by step.
Tests an AI's ability to make educated estimates based on technical knowledge
Estimate the total number of FLOPs required to train GPT-3 without searching online. Explain your reasoning step by step.