A task that measures how closely two pieces of text match in meaning, regardless of whether they use the same words.
Quality of non-English language understanding and generation