o `Û·ib*ã@sØdZddlZddlmZmZddlZddlZddl Z ddl mZddlm Z mZddlmZddlmZer9sÿz1check_multiagent_environments..z&get_action_space(agent_id=..).sample()z6In particular, the `step()` method seems to be faulty.zstep, next_obszstep, rewardz step, donezstep, truncatedz step, infoT)Úallow_commonÚdummy_env_id)Úbase_envÚ agent_ids)Ú ray.rllib.envr Ú isinstanceÚ ValueErrorÚhasattrrÚloggerÚwarningÚresetÚ ExceptionrÚformatÚ"_check_if_element_multi_agent_dictÚkeysÚstepÚ _check_rewardÚagentsÚ_check_done_and_truncatedÚ_check_info) rr Ú obs_and_infosÚeÚ reset_obsÚreset_infosÚsampled_actionÚresultsÚnext_obsÚrewardÚdoneÚ truncatedÚinforrrÚcheck_multiagent_environmentssx ÿþý ÿÿÿü€ÿ ÿÿÿÿü€ÿür;FcCsð|rO| ¡D]F\}}| ¡D]=\}}t |¡r,t|tƒs,t |¡s:t|tjƒr,|jdks:d|›dt|ƒ›}t |ƒ‚||vsK|dksKd|›}t |ƒ‚qqdSt |¡rit|tƒsit |¡stt|tjƒri|jdksvd t|ƒ¡}t |ƒ‚dSdS)NrzJYour step function must return rewards that are integer or float. reward: z. Instead it was a Ú__all__zrYour reward dictionary must have agent ids that belong to the environment. AgentIDs received from env.agents are: zUYour step function must return a reward that is integer or float. Instead it was a {})ÚitemsÚnpÚisrealr!ÚboolÚisscalarÚndarrayÚshapeÚtyper"r()r7rrÚ_Úmulti_agent_dictÚagent_idÚrewÚerrorrrrr,YsVÿþü û ÿþÿþÿ€ëÿÿþü û þôr,c CsÄdD]]}|dkr |n|}|rI| ¡D]5\}}| ¡D],\}} t| ttjfƒs2td|›dt|ƒ›ƒ‚||vsF|dksFd|›d|›} t| ƒ‚qqqt|ttjfƒs_d|›d t|ƒ›} t| ƒ‚qdS) N)r8r9r8z Your step function must return `z's` that are boolean. But instead was a r<zYour `zis` dictionary must have agent ids that belong to the environment. AgentIDs received from env.agents are: z"Your step function must return a `z'` that is a boolean. But instead was a )r=r!r@r>Úbool_r"rD)r8r9rrÚwhatÚdatarErFrGÚdone_rIrrrr.s8ÿÿþÿ€ôÿÿÿûïr.cCs¢|r<| ¡D]3\}}| ¡D]*\}}t|tƒs#tdt|ƒ›d|›ƒ‚||vs8|dks8|dks8d|›}t|ƒ‚qqdSt|tƒsOdt|ƒ›d|›}t|ƒ‚dS)NzDYour step function must return infos that are a dict. instead was a z: element: r<Ú __common__zqYour dones dictionary must have agent ids that belong to the environment. AgentIDs received from env.agents are: zDYour step function must return a info that is a dict. element type: z. element: )r=r!Údictr"rD)r:rrrErFrGÚinfrIrrrr/›s> ÿÿÿþÿ€ðÿ ÿÿÿûr/c Cs.d|›d|›d|›d|›d|›d|›d }|S)NzThe z collected from z% was not contained within your env's z@ space. Its possible that there was a typemismatch (for example z)s of np.float32 and a space ofnp.float64 zs), or that one of the sub-zs wasout of boundsr)Ú func_nameÚ_typeÚ_errorrrrÚ_not_contained_error·sÿþýýÿrTcsØt|tƒs#|rd|›dt|ƒ›}t|ƒ‚d|›dt|ƒ›}t|ƒ‚t|jƒ‰ˆ d¡|r4ˆ d¡t‡fdd„|Dƒƒsj|rUd|›dt| ¡ƒ›d |j›}t|ƒ‚d|›d t| ¡ƒ›d |j›d}t|ƒ‚dS)NzThe element returned by zJ contains values that are not MultiAgentDicts. Instead, they are of type: z3 is not a MultiAgentDict. Instead, it is of type: r<rNc3s|]}|ˆvVqdS©Nr)rÚk©rrrÚ Üs€z5_check_if_element_multi_agent_dict..z_ has agent_ids that are not the names of the agents in the env.agent_ids in this MultiEnvDict: z AgentIDs in this env: zb has agent_ids that are not the names of the agents in the env. AgentIDs in this MultiAgentDict: zó. You likely need to add the attribute `agents` to your env, which is a list containing the IDs of agents currently in your env/episode, as well as, `possible_agents`, which is a list of all possible agents that could ever show up in your env.) r!rOrDr"Úsetr-ÚaddÚallÚlistr*)rÚelementÚfunction_stringrrrIrrWrr)ÂsB þÿüþÿ ýüÿ÷ ýüÿ ìr)c Cs¶t|tjjtjjfƒsd||j|t|ƒfSt|ƒ}dd„}z t |||¡WdSt yZ}z'|jdd|jdd}}d |jdd¡||j|t|ƒfWYd}~Sd}~ww) aCReturns error, value, and space when offending `space.contains(value)` fails. Returns only the offending sub-value/sub-space in case `space` is a complex Tuple or Dict space. Args: space: The gym.Space to check. value: The actual (numpy) value to check for matching `space`. Returns: Tuple consisting of 1) key-sequence of the offending sub-space or the empty string if `space` is not complex (Tuple or Dict), 2) the offending sub-space, 3) the offending sub-space's dtype, 4) the offending sub-value, 5) the offending sub-value's dtype. .. testcode:: :skipif: True path, space, space_dtype, value, value_dtype = _find_offending_sub_space( gym.spaces.Dict({ -2.0, 1.5, (2, ), np.int8), np.array([-1.5, 3.0]) ) NcSs| |¡st|||fƒ‚dSrU)Úcontainsr)ÚpÚsÚvrrrÚmap_fns ÿz)_find_offending_sub_space..map_fnrééz->)NNNNN) r!ÚgymÚspacesÚDictÚTupleÚdtypeÚ _get_typerÚtreeÚmap_structure_with_pathrÚargsÚjoin)ÚspaceÚvalueÚstructured_spacercr1rrrÚ_find_offending_sub_spaceósû0€þrscCst|dƒr|jSt|ƒS)Nrj)r#rjrD)Úvarrrrrksrk)rr rN)FN)FF) Ú__doc__ÚloggingÚtypingrrÚ gymnasiumrfÚnumpyr>rlÚray.rllib.utils.annotationsrÚray.rllib.utils.errorrrÚ"ray.rllib.utils.spaces.space_utilsrÚray.utilrr r Ú getLoggerÚ__name__r$r;r,r.r/rTr)rsrkrrrrÚs0 D ( û1,