FLAN-T5-XXL

by Google

 

 
The flan-t5-xxl model, available on Hugging Face, is a large language model capable of a range of language generation tasks. It has good potential for our intent-detection task as a model that runs locally.
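As a rough illustration of how intent detection with this model could look, the sketch below builds a classification prompt and maps the generated text back to a known intent label. The prompt wording, the intent names, and the helpers `build_intent_prompt`, `match_intent`, and `detect_intent` are assumptions made for this example, not an established setup; only the `transformers` calls and the `google/flan-t5-xxl` model id refer to existing components.

```python
from functools import lru_cache
from typing import Sequence

MODEL_NAME = "google/flan-t5-xxl"  # model id on the Hugging Face Hub


def build_intent_prompt(utterance: str, intents: Sequence[str]) -> str:
    """Build an instruction-style prompt (wording is an assumption)."""
    options = ", ".join(intents)
    return (
        "Classify the intent of the following message. "
        f"Answer with exactly one of: {options}.\n\n"
        f"Message: {utterance}\nIntent:"
    )


def match_intent(generated: str, intents: Sequence[str], fallback: str = "unknown") -> str:
    """Map free-form model output to one of the known intent labels."""
    text = generated.strip().lower()
    # Prefer an exact match, then fall back to a substring match.
    for intent in intents:
        if intent.lower() == text:
            return intent
    for intent in intents:
        if intent.lower() in text:
            return intent
    return fallback


@lru_cache(maxsize=1)
def load_model():
    """Load tokenizer and model once; this downloads the full checkpoint."""
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_NAME)
    return tokenizer, model


def detect_intent(utterance: str, intents: Sequence[str], max_new_tokens: int = 10) -> str:
    """Run the model locally and return the matched intent label."""
    tokenizer, model = load_model()
    prompt = build_intent_prompt(utterance, intents)
    inputs = tokenizer(prompt, return_tensors="pt", truncation=True, max_length=512)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    generated = tokenizer.decode(output_ids[0], skip_special_tokens=True)
    return match_intent(generated, intents)
```

A call such as `detect_intent("I want to fly to Paris", ["book_flight", "cancel_booking", "check_status"])` would return one of the listed labels or `"unknown"`. Restricting the answer to a fixed label set and matching it afterwards keeps the result usable even when the model adds extra words.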

Main use cases: Text generation model that can be used for translation, summarization, sentiment analysis, or intent recognition. The quality of its generated text lags behind larger, more recent models, but on narrower tasks such as intent recognition it performs comparably.  

 
Input length: 512 tokens (approx. 384 words) by default; trained on sequences of up to 2048 tokens (approx. 1536 words)  

 
Languages: English, French, Romanian, German  

 
Model size: ~11 billion parameters
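
The figures above translate into rough sizing numbers for local use. The sketch below reproduces the word estimates from the token limits (a ratio of 0.75 words per token, as implied by 512 tokens ≈ 384 words) and estimates the memory needed for the weights alone. The helper names are hypothetical, whitespace word counting is only a coarse stand-in for the real tokenizer, and the memory estimate ignores activations and other runtime overhead.

```python
WORDS_PER_TOKEN = 0.75          # ratio implied by 512 tokens ~ 384 words
DEFAULT_INPUT_TOKENS = 512      # default input length
PARAMETERS = 11_000_000_000     # ~11 billion parameters


def approx_words(tokens: int) -> int:
    """Convert a token budget into an approximate word count."""
    return int(tokens * WORDS_PER_TOKEN)


def fits_default_input(text: str) -> bool:
    """Coarse check: estimate tokens from whitespace-separated words."""
    estimated_tokens = len(text.split()) / WORDS_PER_TOKEN
    return estimated_tokens <= DEFAULT_INPUT_TOKENS


def weight_memory_gb(bytes_per_parameter: int) -> float:
    """Memory for the weights only, in GB (10^9 bytes)."""
    return PARAMETERS * bytes_per_parameter / 1e9


print(approx_words(512), approx_words(2048))      # word estimates for both limits
print(weight_memory_gb(4), weight_memory_gb(2))   # 32-bit vs. 16-bit weights
```

With 16-bit weights (2 bytes per parameter) the model needs about 22 GB for the weights, and about 44 GB with 32-bit weights, which is the main constraint to check before planning a local deployment.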
