2015 NASA JPL Space Design Competition

Link to Github repo
Link to competition wiki

The purpose of this project is to find safe landing sites for a lander on Mars. For more detail on the rules, constraints, and data files, please visit the above link to the competition's wiki.

Data Files

The team running this competition provided us with 4 sets of training data, with 4 files in each dataset. Our solution only uses the Digital Elevation Map (DEM) from each of these datasets, which is given in .pgm format. This is a 500 by 500 grid of heights in meters, with a resolution of 0.2 m/pixel.

As output, our code is expected to output a 1000 by 1000 grid of integers designating whether each particular position is a safe landing site, supposing the center of the lander is at that particular point. The solution should be stored in a pgm format. In the solution, 0 means a location is not a safe landing site and 255 means a location is a safe landing site.

Constraints

In order to understand what a safe landing site is, we first need to understand what the lander looks like. The lander is circular. It has a 3.4 m diameter base plate, with 4 footpads located at 90° intervals around the edge of the base plate. Each footpad is circular and 0.5 m across. The ground clearance of the belly is 0.39 m.

There are two main constraints defining what constitutes a safe landing site, slope and roughness. The lander cannot safely land on a location of extreme slope, as it will be unstable and may slide or flip over. The threshold for slope is predefined as 10°. The lander also cannot land on a location deemed too rough, meaning there must not be any rock or land formation under the belly of the lander that is tall enough to penetrate the base plate. This means there cannot be any rocks taller than 0.39 m.

Algorithm Outline

Initialize (1000,1000) numpy array solution with zeros.
Set solution[20:979, 20:979] to 255 as points that may be allowable. Edges are disallowed per the competition rules.
Block out points of extreme slope.
Block out points of roughness.
Output solution to solutions/datasetname.pgm.

Slope: Pseudocode

a = angle/slope bound d = distance (rover width) in pixels Parallelized outer loop using Pool.apply_async() from multiprocessing module: For i in 10...490: For j in 10...490: points := list of 16 points spaced evenly around a circle of radius d around points (i, j) For p in 0...16: p1 := points[p] p2 := points[(p+4)%16] // 90 degrees from p1 p3 := points[(p+8)%16] // 180 degrees from p1 n := crossproduct(vector from p2 to p1, vector from p2 to p3) angle := arccos(dotproduct(n, (0, 0, -1)) / norm(n)) if angle > a: solution[2*i, 2*j] := 0 solution[2*i+1, 2*j] := 0 solution[2*i, 2*j+1] := 0 solution[2*i+1, 2*j+1] := 0 break

Roughness: Pseudocode

r = roughness bound d = distance (rover width) in pixels For i in 1...499: For j in 1...499: // Initialize multiplier array for calculating height at point x // between two other points m :=[1,...,d] / (d + 1) // Calculate plane height at each lander position in vertical and // horizontal ranges around points (i, j) // * and + here are element-wise operations iheights := m * DEM[i-d:i, j] + reverse(m) * DEM[i:i+d, j] jheights := m * DEM[i, j-d:j] + reverse(m) * DEM[i, j:j+d] // If elevation at (i, j) is more than r higher than an height in // iheights or jheights, then (i, j) is a point of roughness. Block // out the circle of points that would be affected. if DEM[i, j] - r > min(iheights) or DEM[i, j] - r > min(jheights): For (x, y) in getCircleIndices(center=(i*2, j*2), radius=d): solution[x, y] := 0

Performance

Summary

Let N = side length of the DEM (assuming a square shape)
Let X = number of points of roughness in the DEM
Slope: O(N²)
Roughness: O(N² + X)
Total computation complexity: O(N² + X)
Memory Usage: (5N²) * 32 bits

Slope Optimizations

The naive solution would assume we need to check every possible set of 3 landing points for the footpads of the rover, which would be at least the the circumference of the rover divided by average length in meters of a pixel in the DEM. Realistically, it would probably need to check more points to account for all possible positions where the tangent to the circle is not vertical or horizontal. This means checking at least 54 points. Our solution only checks 16 points, so we cut out at least 70% of the runtime that way.

We also chose to calculate the slopes using the plane normals, taking advantage of (and also optimizing for) the fact that the rover has 4 discreet footpads, rather than calculating the gradient of the plane or calculating the gradient at the central point. Calculating the gradient of any plane is slower than finding its normal, so avoiding that in general is preferable. Further, calculating gradient at the central point would be inaccurate due to the nature of the problem - the rover is landing on footpads around its edges, not a single leg in the middle.

Roughness Optimizations

We define the naive solution to be iterating over each point in the DEM, calling getCircleIndices with c = the current point, and checking if any of those points are taller than any of the planes formed by 3 of the border points. Assuming this checks a constant number of angles A to find the border points, getCircleIndices returns C points, and the DEM is size NxN, the naive solution would run in time O(ACN²), which simplifies to O(N²).

Our runtime in practice is much lower because we check if each point is the point of roughness, rather than if each possible circle contains a patch of roughness. We only call getCircleIndices if a point is rough, so the runtime is dependent on X, the number of points of that are raised/rough. We know X < N² and A is a large integer, so O(N² + CX) < O(ACN²). However, because A and C are constants, our worse case runtime simplified is O(N² + X).

Worst Case vs Likely Case

Real runtime is inversely proportional to the amount of slope in the given terrain. This is not reflected in O(N^2 + X) because the worst case remains the same no matter how much slope there is. Our code breaks out of the loop checking for slope at a given point once it knows the point is sloped, which only affects big-Omega and Theta. Further, because the getCircleIndices function used in our roughness calculations runs faster when more of the points being checked in the circle have been set to 0, our roughness calculation also runs faster when more points have been blocked out due to slope.

Accuracy on the Training Data

We have the solution outputed by our algorithm on the left. On the right, we have the difference between our solution and the real solution. We are using the default colors from matplotlib, so on the left, red = 255 and blue = 0. On the right, green = correct, red = false negative, and blue = false positive. We especially want to avoid false positives, or incorrectly labeling an unsafe zone, because that could lead to catastrophic failure, whereas a false negative would only mean artificially limitted options.

Data Set 1

False positives: 3934 = 2.02674854717 %
False negatives: 9213 = 1.14319961881 %
Total error: 13147 = 1.3147 %

Data Set 2

False positives: 12066 = 1.54332594031 %
False negatives: 21732 = 9.96049169959 %
Total error: 33798 = 3.3798 %

Data Set 3

False positives: 10286 = 1.17467292793 %
False negatives: 21310 = 17.1368373649 %
Total error: 31596 = 3.1596 %

Data Set 4

False positives: 9149 = 1.00149419562 %
False negatives: 22395 = 25.9006534436 %
Total error: 31544 = 3.1544 %

Choosing Parameters

The choose our threshold values, we ran grid search over a range of possible values for constants a and r to minimize average total error.

Optimal values:
a = 0.0364, r = 0.36

Parameter Selection References

Bergstra, J., Bardenet, R., Bengio, Y., & Kegl, B. (2011). Algorithms for Hyper-Parameter Optimization.
Retrieved from http://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization.pdf

Hsu, C. W., Chang, C. C., & Lin, C. J. (2010, April 15). A Practical Guide to Support Vector Classification.
Retrieved from http://www.csie.ntu.edu.tw/~cjlin/papers/guide/guide.pdf

FAQ

1. You're using the height given in the DEM for roughness, but that doesn't take the slope of the ground into consideration.
We are filtering out all locations with a slope greater than 10 degrees, so in the worst case scenario, the roughness code needs to handle a 10 degree slope. cos(10°) = 0.9848. So, in theory, the surface normal should be taken into account, but with a difference of only 1.52%, this calculation is unneccesary with a suitably conservative roughness constant.

2. I tried downloading the zip files and running your code on it, but it isn't working!
There is a typo in the naming consistency of the zips. That typo is fixed in the unzipped data files on github, so try again with those. If you need to use the zips for whatever reason, just look at the directory and file names. The error should jump out at you.

3. I still don't understand the roughness part.
Email me.

Competition day was April 25, 2015, and this submission took 3rd place. Thanks to NASA and the Jet Propulsion Lab for the data and this great opportunity!