o %Ý«iÕã@sddZddlZddlZddlmmZddlmZddl m Z e eƒZdej d<Gdd„deƒZdS) züThis lobe enables the integration of huggingface pretrained LaBSE models. Reference: https://arxiv.org/abs/2007.01852 Transformer from HuggingFace needs to be installed: https://huggingface.co/transformers/installation.html Authors * Ha Nguyen 2023 éN)ÚHFTransformersInterface)Ú get_loggerÚfalseÚTOKENIZERS_PARALLELISMcs.eZdZdZ d‡fdd„ Zdd„Z‡ZS)ÚLaBSEa,This lobe enables the integration of HuggingFace and SpeechBrain pretrained LaBSE models. Source paper LaBSE: https://arxiv.org/abs/2007.01852 Transformer from HuggingFace needs to be installed: https://huggingface.co/transformers/installation.html The model can be used as a fixed text-based sentence-level embeddings generator or can be finetuned. It will download automatically the model from HuggingFace or use a local path. Arguments --------- source : str HuggingFace hub name: e.g "setu4993/LaBSE" save_path : str Path (dir) of the downloaded model. freeze : bool (default: True) If True, the model is frozen. If False, the model will be trained alongside with the rest of the pipeline. output_norm : bool (default: True) If True, normalize the output. Example ------- >>> inputs = ["La vie est belle"] >>> model_hub = "setu4993/smaller-LaBSE" >>> save_path = "savedir" >>> model = LaBSE(model_hub, save_path) >>> outputs = model(inputs) Tcs(tƒj|||d|j|d||_dS)N)ÚsourceÚ save_pathÚfreeze)r)ÚsuperÚ__init__Úload_tokenizerÚoutput_norm)Úselfrrr r ©Ú __class__©úk/home/ubuntu/.local/lib/python3.10/site-packages/speechbrain/lobes/models/huggingface_transformers/labse.pyr9s zLaBSE.__init__cCsø|jrLt ¡=|j|ddd}| ¡D]}||j|jjd||<d||_q|jd i|¤Žj }|j r;tj|dd}|WdƒS1sGwY|j|ddd}| ¡D]}||j|jjd||<qX|jd i|¤Žj }|j rztj|dd}|S) zøThis method implements a forward of the labse model, which generates sentence-level embeddings from input text. Arguments ---------- input_texts (translation): list The list of texts (required). ÚptT)Úreturn_tensorsÚpadding)ÚdeviceFé)ÚpNr) r ÚtorchÚno_gradÚ tokenizerÚkeysÚtoÚmodelrÚ requires_gradÚ pooler_outputr ÚFÚ normalize)rÚinput_textsÚkeyÚ embeddingsrrrÚforwardFs0 ÿ ÿ îÿz LaBSE.forward)TT)Ú__name__Ú __module__Ú__qualname__Ú__doc__rr&Ú __classcell__rrrrrs"û r)r*ÚosrÚtorch.nn.functionalÚnnÚ functionalr!Ú=speechbrain.lobes.models.huggingface_transformers.huggingfacerÚspeechbrain.utils.loggerrr'ÚloggerÚenvironrrrrrÚs