Instructions: You will be shown one question and one LLM answers. For each answer, quickly judge the reasoning qualitynot the final result. A fast skim is enough.

Do not base your judgment on whether the final answer is correct — any or all answers may be wrong. Important: Each question can appear several times with any mix of correct and wrong answers—sometimes all answers are wrong, sometimes all answers are right, and anything in between. Judge each answer’s reasoning independently.

Please go with your first instinct after seeing the full answer - do not read line by line. You have 20 seconds per question and answer.

Choose Trust if the reasoning is coherent and step-by-step and has no major leaps or contradictions.
Choose Don’t Trust if the reasoning is vague or guess-based, contradicts itself or changes assumptions midstream, or justifies itself only by the final number.

Remember: Judge the process, not the result.
A clear, sensible path to a wrong number/correct number → Trust.
A shaky, lucky path to the right answer → Don’t Trust.