o y“©i<ã@s UddlmZddlmZmZmZmZddlmZddl Z ddl m Z ddlmZddl mZiZeed<d e d ee je jfdee e ffdd „Zd e d ee je jfdee e ffdd„Z dde de ded eddedee e ffdd„Zde de de fdd„ZdS)é)Úpermutations)ÚAnyÚCallableÚTupleÚUnion)ÚwarnN)ÚTensor)ÚLiteral)Ú_SCIPY_AVAILABLEÚ_ps_dictÚ metric_mtxÚ eval_funcÚreturncspddlm‰| ¡ ¡}t ‡‡fdd„|Dƒ¡}| |j¡}t |d|dd…dd…df¡ ddg¡}||fS) a•Solves the linear sum assignment problem using scipy, and returns the best metric values and the corresponding permutations. Args: metric_mtx: the metric matrix, shape [batch_size, spk_num, spk_num] eval_func: the function to reduce the metric values of different the permutations Returns: best_metric: shape ``[batch]`` best_perm: shape ``[batch, spk]`` r)Úlinear_sum_assignmentcs g|]}ˆ|ˆtjkƒd‘qS)é)ÚtorchÚmax)Ú.0Úpwm©r r©úU/home/ubuntu/.local/lib/python3.10/site-packages/torchmetrics/functional/audio/pit.pyÚ /s z<_find_best_perm_by_linear_sum_assignment..éNéÿÿÿÿéþÿÿÿ) Úscipy.optimizerÚdetachÚcpurÚtensorÚtoÚdeviceÚgatherÚmean)rr ÚmmtxÚ best_permÚbest_metricrrrÚ(_find_best_perm_by_linear_sum_assignments*r'c CsÄ|jdd…\}}t|ƒt|jƒ}|tvr*tjttt|ƒƒƒ|jdj }|t|<nt|}|jd}|d |||¡}t |d|¡}|jdd} || dd\} }| ¡}|j |dd…f}| |fS)aòSolves the linear sum assignment problem using exhaustive method, i.e. exhaustively calculates the metric values of all possible permutations, and returns the best metric values and the corresponding permutations. Args: metric_mtx: the metric matrix, shape ``[batch_size, spk_num, spk_num]`` eval_func: the function to reduce the metric values of different the permutations Returns: best_metric: shape ``[batch]`` best_perm: shape ``[batch, spk]`` Nr)r!r)N.r)Údim)ÚshapeÚstrr!rrrÚlistrÚrangeÚTÚexpandr"r#r) rr Ú batch_sizeÚspk_numÚkeyÚpsÚperm_numÚbpsÚmetric_of_ps_detailsÚmetric_of_psr&Úbest_indexesr%rrrÚ$_find_best_perm_by_exhaustive_method5s r8rÚpredsÚtargetÚmetric_func©rÚminÚkwargscKs˜|jdd…|jdd…krtdƒ‚|dvrtd|›ƒ‚|jdkr/td|j›d|j›dƒ‚|jdd…\}}d }t|ƒD]Y}t|ƒD]R} |d uri||d d …| d f|d d …|d ffi|¤Ž|d d …|| f<qD||d d …| d f|d d …|d ffi|¤Ž} tj|||f| j| jd}| |d d …|| f<qDq>|dkrŸtj ntj }|d ks¨tsÁ|d kr¶ts¶td|›dƒt ||ƒ\}} || fSt||ƒ\}} || fS)aCalculates `Permutation invariant training`_ (PIT) that can evaluate models for speaker independent multi- talker speech separation in a permutation invariant way. Args: preds: float tensor with shape ``(batch_size,num_speakers,...)`` target: float tensor with shape ``(batch_size,num_speakers,...)`` metric_func: a metric function accept a batch of target and estimate, i.e. ``metric_func(preds[:, i, ...], target[:, j, ...])``, and returns a batch of metric tensors ``(batch,)`` eval_func: the function to find the best permutation, can be ``'min'`` or ``'max'``, i.e. the smaller the better or the larger the better. kwargs: Additional args for metric_func Returns: Tuple of two float tensors. First tensor with shape ``(batch,)`` contains the best metric value for each sample and second tensor with shape ``(batch,)`` contains the best permutation. Example: >>> from torchmetrics.functional.audio import scale_invariant_signal_distortion_ratio >>> # [batch, spk, time] >>> preds = torch.tensor([[[-0.0579, 0.3560, -0.9604], [-0.1719, 0.3205, 0.2951]]]) >>> target = torch.tensor([[[ 1.0958, -0.1648, 0.5228], [-0.4100, 1.1942, -0.5103]]]) >>> best_metric, best_perm = permutation_invariant_training( ... preds, target, scale_invariant_signal_distortion_ratio, 'max') >>> best_metric tensor([-5.1091]) >>> best_perm tensor([[0, 1]]) >>> pit_permutate(preds, best_perm) tensor([[[-0.0579, 0.3560, -0.9604], [-0.1719, 0.3205, 0.2951]]]) rrz_Predictions and targets are expected to have the same shape at the batch and speaker dimensionsr<z-eval_func can only be "max" or "min" but got z/Inputs must be of shape [batch, spk, ...], got z and z insteadN.)Údtyper!rézIn pit metric for speaker-num z8>3, we recommend installing scipy for better performance)r)ÚRuntimeErrorÚ ValueErrorÚndimr,rÚemptyr?r!rr=r rr8r')r9r:r;r r>r/r0rÚ target_idxÚ preds_idxÚ first_eleÚopr&r%rrrÚpermutation_invariant_training`s<#ÿ ÿÿ.øþrIÚpermcCst dd„t||ƒDƒ¡}|S)a!Permutate estimate according to perm. Args: preds: the estimates you want to permutate, shape [batch, spk, ...] perm: the permutation returned from permutation_invariant_training, shape [batch, spk] Returns: Tensor: the permutated version of estimate cSsg|]\}}t |d|¡‘qS)r)rÚindex_select)rÚpredÚprrrr±sz!pit_permutate..)rÚstackÚzip)r9rJÚpreds_pmtedrrrÚ pit_permutate§s rQ)r)Ú itertoolsrÚtypingrrrrÚwarningsrrrÚtyping_extensionsr Útorchmetrics.utilities.importsr rÚdictÚ__annotations__r=rr'r8rIrQrrrrÚsJ ÿþ ýÿþ ý,ÿÿÿÿÿÿ þG