The experimental research project was created to investigate how "autonomous agents operate in a simulated enterprise environment using high-level abstraction of computer networks and cybersecurity concepts," in order to advance artificial intelligence and machine learning. CyberBattleSim supports the training of automated agents through a Python-based OpenAI Gym interface. Defenders can apply reinforcement learning algorithms and set up different cybersecurity problems in the simulated environment.
According to Microsoft, reinforcement learning is a type of machine learning that teaches autonomous agents to make decisions based on their interaction with the environment: agents improve their strategies through repeated practice, much as a player gets better at a video game by playing it over and over. Applied to software security, reinforcement learning involves agents that act as attackers and defenders, and the study of their behavior in a simulated environment. The attacker's goal is to steal information, while the defender's goal is to block or mitigate the attacker's actions.
CyberBattleSim is a simulation environment built with OpenAI Gym that focuses on the lateral movement stage of a cyberattack. It simulates a fixed network with predefined vulnerabilities that an attacker agent can exploit to move laterally, while a defender agent tries to detect and contain the intrusion. According to Microsoft, the network topology, the list of supported vulnerabilities, and the nodes where they are placed are all parameters of the simulated Gym environment, and because the simulation does not support machine code execution, no actual security exploit takes place.
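To make the Gym-style interaction described above more concrete, here is a minimal sketch of an attacker agent moving laterally through a tiny network. It is a self-contained toy and does not use CyberBattleSim's actual API: the class `ToyLateralMovementEnv`, the `TOY_NETWORK` topology, the reward values, and the `run_random_episode` helper are all hypothetical names and numbers chosen for illustration. The only thing borrowed from Gym is the general shape of the interface, with `reset()` returning an observation and `step()` returning the classic `(observation, reward, done, info)` tuple.

```python
import random

# Hypothetical toy topology: node -> neighbours reachable through a
# predefined vulnerability. Not CyberBattleSim's real network or API.
TOY_NETWORK = {
    "client": ["web"],
    "web": ["db", "files"],
    "files": [],
    "db": [],
}
TARGET = "db"  # node holding the data the attacker is after


class ToyLateralMovementEnv:
    """Minimal environment exposing Gym-style reset() and step()."""

    def __init__(self, network=TOY_NETWORK, target=TARGET, max_steps=20):
        self.network = network
        self.target = target
        self.max_steps = max_steps
        self.owned = set()
        self.steps = 0

    def reset(self):
        # The attacker starts with a foothold on a single node.
        self.owned = {"client"}
        self.steps = 0
        return frozenset(self.owned)

    def valid_actions(self):
        # (source, destination): move from an owned node to an unowned neighbour.
        return [
            (src, dst)
            for src in sorted(self.owned)
            for dst in self.network[src]
            if dst not in self.owned
        ]

    def step(self, action):
        src, dst = action
        self.steps += 1
        reward = -1.0  # assumed small cost for a wasted or invalid action
        if src in self.owned and dst in self.network[src] and dst not in self.owned:
            self.owned.add(dst)
            reward = 10.0 if dst == self.target else 1.0
        done = self.target in self.owned or self.steps >= self.max_steps
        return frozenset(self.owned), reward, done, {}


def run_random_episode(env, seed=0):
    """Play one episode with an agent that picks valid moves at random."""
    rng = random.Random(seed)
    env.reset()
    total, done = 0.0, False
    while not done:
        action = rng.choice(env.valid_actions())
        _, reward, done, _ = env.step(action)
        total += reward
    return total


if __name__ == "__main__":
    env = ToyLateralMovementEnv()
    print("episode reward:", run_random_episode(env, seed=1))
```

A reinforcement learning agent would replace the random choice in `run_random_episode` with a policy that is updated from the rewards it observes; a defender could be modeled in a similar way as a second agent that removes nodes from the attacker's owned set.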
The simulated computer network, which includes systems running on a variety of platforms, is meant to demonstrate how using the latest operating systems and keeping them up to date improves security. Defenders can create automated agents and monitor their progress in the environment through the Gym interface. To perform well, the tech giant notes, agents must learn from observations that are not specific to the instance they are interacting with: they cannot simply memorize node indices or other values tied to network size, and may instead look at temporal features or machine properties. CyberBattleSim, according to Microsoft, is highly abstract and cannot be applied to real-world systems, which protects against the nefarious use of trained automated agents.
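The point about not memorizing node indices can be illustrated with a small sketch of a size-independent observation encoding. This is an assumption-laden illustration rather than how CyberBattleSim represents observations: the function `encode_node`, the property names (`os`, `patched`, `open_ports`), and the `steps_since_discovery` temporal feature are all hypothetical.

```python
def encode_node(props, steps_since_discovery):
    """Map a node's properties to a fixed-length feature vector.

    The vector deliberately contains no node index, so the same encoding
    applies to networks of any size. All property names are hypothetical.
    """
    return (
        1.0 if props.get("os") == "Windows" else 0.0,  # machine property
        1.0 if props.get("patched") else 0.0,          # machine property
        float(len(props.get("open_ports", []))),       # machine property
        float(steps_since_discovery),                  # temporal feature
    )


# Two hypothetical networks of different sizes.
small_net = [
    {"os": "Windows", "patched": False, "open_ports": [445, 3389]},
    {"os": "Linux", "patched": True, "open_ports": [22]},
]
large_net = small_net + [
    {"os": "Linux", "patched": False, "open_ports": [22, 80, 443]},
    {"os": "Windows", "patched": False, "open_ports": [445, 3389]},
]

# The same kind of machine gets the same features wherever it sits.
first_in_small = encode_node(small_net[0], steps_since_discovery=3)
last_in_large = encode_node(large_net[3], steps_since_discovery=3)
```

Here `first_in_small` and `last_in_large` are identical even though the two machines sit at different positions in networks of different sizes, so a policy trained on such features has at least a chance of transferring between network instances.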