A company is introducing a mobile app that helps users learn foreign languages. The app makes text more coherent by calling a large language model (LLM). The company collected a diverse dataset of text and supplemented the dataset with examples of more readable versions. The company wants the LLM output to resemble the provided examples.
Which metric should the company use to assess whether the LLM meets these requirements?
Jessiii
2 weeks, 6 days agomay2021_r
2 months agoaws_Tamilan
2 months ago26b8fe1
2 months, 1 week ago