MIT algorithm helps robots take calculated risks in dangerous environments
We know far less about the Earthβs oceans than we do about the surface of the moon or Mars. The sea floor is carved with expansive canyons, towering seamounts, deep trenches, and sheer cliffs, most of which are considered too dangerous or inaccessible for autonomous underwater vehicles to navigate.
But what if the reward for traversing such places was worth the risk?
MIT engineers have now developed an algorithm that lets autonomous underwater vehicles’ weigh the risks and potential rewards of exploring an unknown region. For instance, if a vehicle tasked with identifying underwater oil seeps approached a steep, rocky trench, the algorithm could assess the reward level (the probability that an oil seep exists near this trench), and the risk level (the probability of colliding with an obstacle), if it were to take a path through the trench.
βIf we were very conservative with our expensive vehicle, saying its survivability was paramount above all, then we wouldnβt find anything of interest,β says Benjamin Ayton, a graduate student in MITβs Department of Aeronautics and Astronautics. βBut if we understand thereβs a trade-off between the reward of what you gather, and the risk or threat of going toward these dangerous geographies, we can take certain risks when itβs worthwhile.β
Ayton says the new algorithm can compute trade-offs of risk versus reward in real time, as a vehicle decides where to explore next. He and his colleagues in the lab of Brian Williams, professor of aeronautics and astronautics, are implementing this algorithm and others on autonomous underwater vehicles’, with the vision of deploying fleets of bold, intelligent robotic explorers for a number of missions, including looking for offshore oil deposits, investigating the impact of climate change on coral reefs, and exploring extreme environments analogous to Europa, an ice-covered moon of Jupiter that the team hopes vehicles will one day traverse.
βIf we went to Europa and had a very strong reason to believe that there might be a billion-dollar observation in a cave or crevasse, which would justify sending a spacecraft to Europa, then we would absolutely want to risk going in that cave,β Ayton says. βBut algorithms that donβt consider risk are never going to find that potentially history-changing observation.β
Ayton and Williams, along with Richard Camilli of the Woods Hole Oceanographic Institution, will present their new algorithm at the Association for the Advancement of Artificial Intelligence conference this week in Honolulu.
A bold path
The teamβs new algorithm is the first to enable βrisk-bounded adaptive sampling.β An adaptive sampling mission is designed, for instance, to automatically adapt an autonomous underwater vehicles’ path, based on new measurements that the vehicle takes as it explores a given region. Most adaptive sampling missions that consider risk typically do so by finding paths with a concrete, acceptable level of risk. For instance, autonomous underwater vehicles’ may be programmed to only chart paths with a chance of collision that doesnβt exceed 5 percent.
But the researchers found that accounting for risk alone could severely limit a missionβs potential rewards.
βBefore we go into a mission, we want to specify the risk weβre willing to take for a certain level of reward,β Ayton says. βFor instance, if a path were to take us to more hydrothermal vents, we would be willing to take this amount of risk, but if weβre not going to see anything, we would be willing to take less risk.β
The teamβs algorithm takes in bathymetric data, or information about the ocean topography, including any surrounding obstacles, along with the vehicleβs dynamics and inertial measurements, to compute the level of risk for a certain proposed path. The algorithm also takes in all previous measurements that the autonomous underwater vehicles has taken, to compute the probability that such high-reward measurements may exist along the proposed path.
If the risk-to-reward ratio meets a certain value, determined by scientists beforehand, then the autonomous underwater vehicles goes ahead with the proposed path, taking more measurements that feed back into the algorithm to help it evaluate the risk and reward of other paths as the vehicle moves forward.
The researchers tested their algorithm in a simulation of an autonomous underwater vehicles mission east of Boston Harbour. They used bathymetric data collected from the region during a previous NOAA survey, and simulated the vehicle exploring at a depth of 15 meters through regions at relatively high temperatures. They looked at how the algorithm planned out the vehicleβs route under three different scenarios of acceptable risk.
In the scenario with the lowest acceptable risk, meaning that the vehicle should avoid any regions that would have a very high chance of collision, the algorithm mapped out a conservative path, keeping the vehicle in a safe region that also did not have any high rewards β in this case, high temperatures. For scenarios of higher acceptable risk, the algorithm charted bolder paths that took a vehicle through a narrow chasm, and ultimately to a high-reward region.
The team also ran the algorithm through 10,000 numerical simulations, generating random environments in each simulation through which to plan a path, and found that the algorithm βtrades off risk against reward intuitively, taking dangerous actions only when justified by the reward.β
A risky slope
Last December, Ayton, Williams, and others spent two weeks on a cruise off the coast of Costa Rica, deploying autonomous underwater gliders, on which they tested several algorithms, including this newest one. For the most part, the algorithmβs path planning agreed with those proposed by several onboard geologists who were looking for the best routes to find oil seeps.
Ayton says there was a particular moment when the risk-bounded algorithm proved especially handy. An autonomous underwater vehicles was making its way up a precarious slump, or landslide, where the vehicle couldnβt take too many risks.
βThe algorithm found a method to get us up the slump quickly, while being the most worthwhile,β Ayton says. βIt took us up a path that, while it didnβt help us discover oil seeps, it did help us refine our understanding of the environment.β
βWhat was really interesting was to watch how the machine algorithms began to βlearnβ after the findings of several dives, and began to choose sites that we geologists might not have chosen initially,β says Lori Summa, a geologist and guest investigator at the Woods Hole Oceanographic Institution, who took part in the cruise. βThis part of the process is still evolving, but it was exciting to watch the algorithms begin to identify the new patterns from large amounts of data, and couple that information to an efficient, βsafeβ search strategy.β
In their long-term vision, the researchers hope to use such algorithms to help autonomous vehicles explore environments beyond Earth.
βIf we went to Europa and werenβt willing to take any risks in order to preserve a probe, then the probability of finding life would be very, very low,β Ayton says. βYou have to risk a little to get more reward, which is generally true in life as well.β
This research was supported, in part, by Exxon Mobile, as part of the MIT Energy Initiative, and by NASA.
















