AI language models like GPT-3 can achieve up to 97% accuracy on some Winograd schemas, but understanding language doesn't equate to understanding the