o }o™iÆã@sLddlZddlmZddlZddlmZGdd„deƒZGdd„deƒZ dS)éN)ÚCounter)Úcorpus_bleuc@s<eZdZedd„ƒZedd„ƒZedd„ƒZedd„ƒZd S) ÚDialogueGenerationMetricscCs†g}tt|ƒƒD]}| ||||||dœ¡qt|ddd}|D]}| t |¡d¡q$WdƒdS1s?sÿÿz4DialogueGenerationMetrics.get_f1..r)Úaxis)r&r'rrÚmean)r3r4Útotal_p_r_f1Ú avg_p_r_f1rr2rÚget_f1<s þÿz DialogueGenerationMetrics.get_f1csV‡‡fdd„ttˆƒƒDƒ}‡fdd„|Dƒ‰‡fdd„|Dƒ‰tˆˆgdd}|jS)zç Referenced from NMT evaluation Note 13a is the default tokenizer for English for WMT Known issue that it doesn't hand edge case of None or '' https://github.com/mjpost/sacrebleu/issues/161 cs g|]}ˆ|rˆ|r|‘qSrrr0©rrrrr5Os z6DialogueGenerationMetrics.get_bleu..cóg|]}ˆ|‘qSrrr0)rrrr5Pócr<rrr0)rrrr5Qr=Ú13a)Útokenize)rrrÚscore)rrÚ valid_indicesÚ sacre_bleurr;rÚget_bleuGs z"DialogueGenerationMetrics.get_bleuN)Ú__name__Ú __module__Ú__qualname__Ústaticmethodr r/r:rCrrrrrs rc@s2eZdZedd„ƒZed dd„ƒZedd„ƒZdS) ÚDialogueClassificationMetricscCsžg}tt|ƒƒD]} | || || || || || || || dœ¡qt|ddd} |D]}| t |¡d¡q0WdƒdS1sHwYdS)r)rrÚground_truth_slotsÚground_truth_labelsrÚgenerated_slotsÚgenerated_labelsr r rr Nr)rrLrKrJrIrrrrrrrrrrr Ws"ùÿÿ"ÿz.DialogueClassificationMetrics.save_predictionsFcCs¶g}g}|D]P}|rHdd„| dd¡Dƒ}d}t|ƒdkr"|\}}nt|ƒdkr.|d}d}t|tƒrDd|vr>| d¡d}| d ¡}nd g}n|}g}| |¡| |¡q||fS)am Split target into label and slots when doing joint label (i.e. intent) classificaiton and slot filling For instance, split "reserve_restaurant slots: time_of_day(7pm), number_of_people(3)" into label = "reserve_restaurant" and slots = ["time_of_day(7pm)", "number_of_people(3)"] Args: fields: list of strings cSsg|]}| ¡‘qSr)Ústripr0rrrr5‡r=zGDialogueClassificationMetrics.split_label_and_slots..zslots:éÚnoner!rzpossible intents:z, ÚNone)r#rÚ isinstanceÚstrr)ÚfieldsÚ with_slotsÚlabelsÚ slots_listÚfieldÚcomboÚlabelÚslotsrrrÚsplit_label_and_slotsys* z3DialogueClassificationMetrics.split_label_and_slotscsg}g}g}tt|ƒƒD]T}ttt||ƒƒƒ‰ttt||ƒƒƒ}‡fdd„|Dƒ}tˆƒdkr9t|ƒtˆƒnd}t|ƒdkrIt|ƒt|ƒnd} tˆ|kƒ} | |¡| | ¡| | ¡qt |¡d}t |¡d}t |¡d} d| || |d}|| ||fS)zÚ Args: generated_slots: list of list of strings. Each string is slot-name and slot-value pair e.g. location(Seattle) ground_truth_slots: list of list of strings csg|]}|ˆvr|‘qSrr)r1r©rrrr5szJDialogueClassificationMetrics.get_slot_filling_metrics..rr"r!g#B’¡œÇ;) rrÚsortedÚlistÚsetÚintrr&r7)rKrIÚ all_recallÚ all_precisionÚall_joint_goal_accuracyrÚ predictedÚcorrectr-r,Újoint_goal_accuracyÚavg_joint_goal_accuracyÚ avg_precisionÚ avg_recallÚavg_f1rr\rÚget_slot_filling_metricss$ z6DialogueClassificationMetrics.get_slot_filling_metricsN)F)rDrErFrGr r[rkrrrrrHVs !#rH) rÚcollectionsrÚnumpyr&Ú sacrebleurÚobjectrrHrrrrÚs?