Challenge: Compare the Performance of Two Different LLMs
Evaluate text generation by using multiple LLMs, and determine the best performer.
Challenge
In this challenge, we’ll explore the capabilities of two LLMs, google/flan-t5-small and bigscience/mt0-small. The task is to use these models for a specific text generation task and evaluate their performance using ROUGE metrics.
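ROUGE scores a generated text by its n-gram overlap with a reference text. As a rough illustration of what the metric measures, here is a minimal pure-Python sketch of ROUGE-1 (unigram overlap). The function name `rouge1`, the regex tokenizer, the reference translation, and both candidate strings are assumptions made for illustration. The candidates are not real model outputs, and a maintained ROUGE implementation may tokenize or stem differently.

```python
import re
from collections import Counter


def rouge1(candidate: str, reference: str) -> dict:
    """Unigram-overlap precision, recall, and F1 (a simplified ROUGE-1)."""
    # Assumption: lowercase word tokens, punctuation dropped, no stemming.
    cand = re.findall(r"\w+", candidate.lower())
    ref = re.findall(r"\w+", reference.lower())
    # Counter intersection clips each word's count to the smaller side.
    overlap = sum((Counter(cand) & Counter(ref)).values())
    precision = overlap / len(cand) if cand else 0.0
    recall = overlap / len(ref) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Assumed reference translation (not given in the lesson text).
reference = "Starting is easy, persistence is an art"
# Hypothetical candidates standing in for the two models' outputs.
candidates = {
    "model_a": "Starting is easy, persistence is art",
    "model_b": "Beginning is easy",
}

scores = {name: rouge1(text, reference) for name, text in candidates.items()}
# Pick the candidate with the highest ROUGE-1 F1.
best = max(scores, key=lambda name: scores[name]["f1"])
for name, s in scores.items():
    print(f"{name}: P={s['precision']:.2f} R={s['recall']:.2f} F1={s['f1']:.2f}")
print("best:", best)
```

Comparing the two models then comes down to scoring each output against the same reference and ranking by the chosen ROUGE variant.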
Task
Translate the German proverb “Anfangen ist leicht, beharren eine Kunst” into English using both LLMs with the transformers pipeline
...
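One way the translation step might look is sketched below. It assumes an installed transformers version in which the `text2text-generation` pipeline task is available (both checkpoints are sequence-to-sequence models). The helper names `build_prompt`, `translate`, and `translate_with_all`, the instruction prefix, and the `max_new_tokens` value are assumptions for illustration, not the lesson's own solution.

```python
MODEL_NAMES = ["google/flan-t5-small", "bigscience/mt0-small"]
PROVERB = "Anfangen ist leicht, beharren eine Kunst"


def build_prompt(text: str) -> str:
    # Assumption: a plain instruction prefix; the lesson's exact prompt is not shown.
    return f"Translate to English: {text}"


def translate(model_name: str, text: str, max_new_tokens: int = 40) -> str:
    # Imported here so the heavy library and the model download are only
    # triggered when a translation is actually requested.
    from transformers import pipeline

    generator = pipeline("text2text-generation", model=model_name)
    result = generator(build_prompt(text), max_new_tokens=max_new_tokens)
    # The pipeline returns a list with one dict per input.
    return result[0]["generated_text"]


def translate_with_all(text: str) -> dict:
    # Map each model name to its generated translation.
    return {name: translate(name, text) for name in MODEL_NAMES}
```

Calling `translate_with_all(PROVERB)` downloads both checkpoints on first use and returns a dict keyed by model name. Each string can then be scored against a reference translation with ROUGE.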