What is HellaSwag Reasoning Benchmark?
The HellaSwag Reasoning Benchmark assesses a language model's ability to continue sentences in a contextually appropriate manner, reflecting an understanding of everyday scenarios and common sense. This benchmark provides sentence beginnings and requires the model to select the most plausible continuation from several choices.
