Challenge: Compare the Performance of Two Different LLMs

Generate text with two LLMs, evaluate the outputs, and determine which model performs better.

Challenge

In this challenge, we’ll explore the capabilities of two LLMs, google/flan-t5-small and bigscience/mt0-small. We’ll use both models for the same text generation task and evaluate their outputs using ROUGE metrics.
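To make the evaluation step concrete, here is a minimal sketch of what a ROUGE-1 score measures: the unigram overlap between a model's output and a reference text, reported as precision, recall, and F1. This is a hand-rolled illustration, not the scoring package the challenge itself may use. The function name `rouge1`, the simple lowercase word tokenizer, and both example strings are assumptions made for this sketch.

```python
import re
from collections import Counter


def rouge1(candidate: str, reference: str) -> dict:
    """Unigram-overlap (ROUGE-1 style) precision, recall, and F1.

    Assumption: tokens are lowercase runs of word characters. Real ROUGE
    packages may tokenize or stem differently, so scores can differ.
    """
    cand_counts = Counter(re.findall(r"\w+", candidate.lower()))
    ref_counts = Counter(re.findall(r"\w+", reference.lower()))

    # Clipped overlap: each token counts at most as often as it
    # appears in both texts.
    overlap = sum((cand_counts & ref_counts).values())

    precision = overlap / max(sum(cand_counts.values()), 1)
    recall = overlap / max(sum(ref_counts.values()), 1)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


# Made-up strings, not model output: the candidate is correct but incomplete.
scores = rouge1("the cat sat", "the cat sat on the mat")
print(scores)
```

In this example every candidate word appears in the reference, so precision is 1.0, but the candidate covers only half of the reference, so recall is 0.5. Comparing two models on the same reference therefore means comparing both numbers, or their F1.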

Task

Translate the German proverb “Anfangen ist leicht, beharren eine Kunst” into English using both LLMs with the transformers pipeline ...
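The generation step might be sketched as follows. This is one possible setup under stated assumptions, not the challenge's reference solution: the helper names `build_prompt`, `translate`, and `run_challenge`, the prompt wording, and the `max_new_tokens` value are all hypothetical. We also assume that the installed transformers release accepts the `"text2text-generation"` pipeline task for these encoder-decoder models, which may not hold for every version.

```python
MODELS = ["google/flan-t5-small", "bigscience/mt0-small"]
PROVERB = "Anfangen ist leicht, beharren eine Kunst"


def build_prompt(german_text: str) -> str:
    # Hypothetical instruction wording; the challenge may phrase it differently.
    return f"Translate German to English: {german_text}"


def translate(model_name: str, german_text: str, max_new_tokens: int = 40) -> str:
    # Imported lazily so the helpers above can be used without transformers.
    from transformers import pipeline

    # Assumption: this task name is available in the installed release.
    generator = pipeline("text2text-generation", model=model_name)
    result = generator(build_prompt(german_text), max_new_tokens=max_new_tokens)
    return result[0]["generated_text"]


def run_challenge() -> dict:
    # Downloads both models on first use, so it is not called automatically.
    return {name: translate(name, PROVERB) for name in MODELS}
```

Calling `run_challenge()` returns one translation per model name. Each translation can then be scored against a reference translation of our choosing with ROUGE.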
