FLAN-T5-XL

by Google

 

 
The FLAN-T5-XL model, available on Hugging Face, is a large language model capable of a variety of language generation tasks. Since our experiments with the FLAN-T5-XXL model left a good impression, we wanted to evaluate the other available versions.


Main use cases: Text generation model that can be used for translation, text summarization, sentiment analysis, or intent recognition. The quality of its generated text lags behind larger, more modern models, while on tasks such as intent recognition it performs similarly well.
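The use cases above are all driven by plain-text instructions. The following is a minimal sketch of how such prompts could be built and sent to the model through the Hugging Face `transformers` library. The prompt wordings and the helper names `build_prompt` and `generate` are illustrative assumptions, not an official prompt format; `google/flan-t5-xl` is the model identifier on Hugging Face.

```python
# Hypothetical prompt templates for the use cases listed above.
# The exact wording is an assumption; FLAN-T5 accepts free-form instructions.
PROMPT_TEMPLATES = {
    "translation": "Translate English to German: {text}",
    "summary": "Summarize: {text}",
    "sentiment": "Is the sentiment of this review positive or negative? Review: {text}",
    "intent": "What is the intent of this message? Options: {options}. Message: {text}",
}


def build_prompt(task, text, options=None):
    """Fill the template for `task`; `options` is only used by the intent template."""
    template = PROMPT_TEMPLATES[task]
    return template.format(text=text, options=", ".join(options or []))


def generate(prompt, model_name="google/flan-t5-xl", max_new_tokens=64):
    """Run one prompt through the model and return the decoded answer."""
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    # Truncate to the 512-token basic input length.
    inputs = tokenizer(prompt, return_tensors="pt", truncation=True, max_length=512)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call such as `generate(build_prompt("intent", "Please cancel my order", ["cancel", "refund", "other"]))` would download the model weights on first use, so it is best tried on a machine with enough memory and disk space.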

 
Input length: 512 tokens (approx. 384 words) by default; trained on inputs of up to 2048 tokens (approx. 1536 words)
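The word counts above follow from a rule of thumb of roughly 0.75 words per token (512 × 0.75 = 384, 2048 × 0.75 = 1536). A small sketch of that arithmetic is shown here; the function names are hypothetical, and the ratio is only an approximation that varies with language and vocabulary, so the model's tokenizer should be used when an exact count matters.

```python
# Rule of thumb implied by the figures above: about 3 words per 4 tokens.
def approx_words(tokens):
    """Estimate how many words fit into a given number of tokens."""
    return tokens * 3 // 4


def approx_tokens(words):
    """Estimate the tokens needed for a number of words (rounded up)."""
    return (words * 4 + 2) // 3


def probably_fits(text, token_limit=512):
    """Rough check based on a whitespace word count, not a real tokenizer."""
    return approx_tokens(len(text.split())) <= token_limit


print(approx_words(512))   # 384
print(approx_words(2048))  # 1536
```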

 
Languages: English, French, Romanian, German  

 
Model size: ~3 billion parameters
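As a rough guide to what this parameter count means in practice, the memory needed just to hold the weights can be estimated as parameters times bytes per parameter. The sketch below is back-of-the-envelope arithmetic under that assumption; the function name is hypothetical, the figures exclude activations and framework overhead, and they are not measured values.

```python
def weight_memory_gb(num_parameters, bytes_per_parameter):
    """Approximate memory for the weights alone, in decimal gigabytes."""
    return num_parameters * bytes_per_parameter / 1e9


PARAMETERS = 3_000_000_000  # ~3 billion, as stated above

# Common numeric precisions and their size per parameter in bytes.
for name, size in [("float32", 4), ("float16", 2), ("int8", 1)]:
    print(f"{name}: ~{weight_memory_gb(PARAMETERS, size):.0f} GB")
```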
