o -ºiã@szUdZddlmZddlmZddlmZe e¡Z ddddd œZ eee eeeeffed <dZdZGd d„deƒZdS)z Radio vision model configurationé)ÚAny)ÚPretrainedConfig)Úlogging)i€ééi)irri)iééi)ié ri)Úvit_small_patch16_224Úvit_base_patch16_224Úvit_large_patch16_224Úvit_huge_patch16_224ÚVIT_TIMM_DIM_BY_NAME)g3<Í4'ÐÞ?gwgí¶MÝ?gy{Îå Ú?)g‡Bô91Ñ?g•wÝt.¹Ð?gÝ U¦Ñ?c s°eZdZdZdZddddddd d deedddfd ededede de dede de dededee e e feBdee e e feBdedBdee eefdBde f‡fdd„ Z‡ZS)ÚRadioConfiga> This is the configuration class to store the configuration of a Radio vision model. It is used to instantiate a Radio model according to the specified arguments, defining the model architecture. Args: model_name: Name of the vision transformer model (e.g., "vit_base_patch16_224"). Used to determine architecture dimensions from `VIT_TIMM_DIM_BY_NAME`. image_size: The size (resolution) of each image. patch_size: The size (resolution) of each patch. qkv_bias: Whether to add a bias to the queries, keys and values. qk_normalization: Whether to apply normalization to queries and keys. norm_type: The normalization type to use. layer_norm_eps: The epsilon used by the layer normalization layers. initializer_factor: A factor for initializing all weight matrices. hidden_act: The non-linear activation function in the encoder. cpe_max_size: Maximum image size for position embeddings. norm_mean: Mean values for image normalization (RGB channels). Defaults to (0.48145466, 0.4578275, 0.40821073)). norm_std: Standard deviation values for image normalization (RGB channels). Defaults to (0.26862954, 0.26130258, 0.27577711)). register_multiple: Number of register tokens to use. teachers: A list of teacher model configurations. Each teacher configuration is a dict with keys like "name" and some may have "use_summary". cls_token_per_teacher: Whether to use a separate CLS token for each teacher. ÚradioéàrTFÚ layer_normgíµ ÷Æ°>gð?ÚgeluiNÚ model_nameÚ image_sizeÚ patch_sizeÚqkv_biasÚqk_normalizationÚ norm_typeÚlayer_norm_epsÚinitializer_factorÚ hidden_actÚcpe_max_sizeÚ norm_meanÚnorm_stdÚregister_multipleÚteachersÚcls_token_per_teachercsÀ||_t|\|_|_|_|_||_||_||_||_ ||_ ||_||_| |_ | |_t|ttfƒr5t|ƒn||_t|ttfƒrCt|ƒn||_| |_|durO|ng|_||_tƒjdi|¤ŽdS)N©)rrÚhidden_sizeÚnum_hidden_layersÚnum_attention_headsÚintermediate_sizerrrrrrrrrÚ isinstanceÚtupleÚlistrrr r!r"ÚsuperÚ__init__)Úselfrrrrrrrrrrrrr r!r"Úkwargs©Ú __class__r#úb/home/ubuntu/veenaModal/venv/lib/python3.10/site-packages/vllm/transformers_utils/configs/radio.pyr,6s0ûÿÿzRadioConfig.__init__)Ú__name__Ú __module__Ú__qualname__Ú__doc__Ú model_typeÚOPENAI_CLIP_MEANÚOPENAI_CLIP_STDÚstrÚintÚboolÚfloatr)r*Údictrr,Ú __classcell__r#r#r/r1rs`ðþýüûúùø ÷ öõô óòñðrN)r5ÚtypingrÚ transformers.configuration_utilsrÚtransformers.utilsrÚ get_loggerr2Úloggerrr=r9r)r:Ú__annotations__r7r8rr#r#r#r1Ús "ü