If the answer is so ambiguous that humans and AI get it wrong, is it really that great of a question?
If the answer is so ambiguous that humans and AI get it wrong, is it really that great of a question?