An Example of Augmenting a Latin Hypercube

Rob Carnell

22 November 2007

Suppose that a computer simulation study is being designed that requires expensive runs. A Latin hypercube design is desired for this simulation so that the expectation of the simulation output can be estimated efficiently given the distributions of the input variables. Latin hypercubes are most often used in highly dimensional problems, but the example shown is of small dimension. Suppose further that the total extent of funding is uncertain. Enough money is available for 5 runs, and there is a chance that there will be enough for 5 more. However, if the money for the additional 5 runs does not materialize, then the first 5 runs must be a Latin hypercube alone. A design for this situation can be created using the lhs package.

First create a random Latin hypercube using the randomLHS(n, k) command:

A <- randomLHS(5, 2)

An example of this hypercube is shown in Figure 1. Note that the Latin property of the hypercube requires that each of the 5 equal probability intervals be filled (i.e. each row and each column is filled with one point). Also notice that the exact location of the design point is randomly sampled from within that cell using a uniform distribution for each marginal variable.

Figure 1. A randomly produced Latin Hypercube with uniform marginal distributions for 2 parameters with 5 simulations

plot of chunk original5

Next, in order to augment the design with more points use augmentLHS(lhs, m). The following will add 5 more points to the design:

B <- augmentLHS(A, 5)

The augmentLHS function works by re-dividing the original design into n+m intervals (e.g. 5+5=10) keeping the original design points exactly in the same position. It then randomly fills the empty row-column sets. The results are shown in Figure 2.

Figure 2. A randomly produced Latin Hypercube of 5 points (red) with 5 augmented points (blue). Each parameter has a uniform marginal distribution.

plot of chunk augmented10

The augmentLHS function uses the following algorithm (see the documentation for augmentLHS):

Create a new (n+m) by k matrix to hold the candidate points after the design has been re-partitioned into (n+m)^2 cells, where n is number of points in the original lhs matrix.
Then randomly sweep through each column (1…k) in the repartitioned design to find the missing cells.
For each column (variable), randomly search for an empty row, generate a random value that fits in that row, record the value in the new matrix. The new matrix can contain more than m points unless m = 2n, in which case the new matrix will contain exactly m filled rows.
Finally, keep only the first m rows of the new matrix. It is guaranteed that there will be m full rows (points) in the new matrix. The deleted rows are partially full. The additional candidate points are selected randomly because of the random search used to find empty cells.

Also notice that because the original points are randomly placed within the cells, depending on how you bin the marginal distributions, a histogram (of x1 for example) will not necessarily be exactly uniform.

Now, the augmenting points do not necessarily form a Latin Hypercube themselves. The original design and augmenting points may form a Latin Hypercube, or there may be more than one point per row in the augmented design. If the augmented points are equal to the number of original points, then a strictly uniform Latin hypercube is guaranteed. An example of an augmented design which is not uniform in the marginal distributions is given in Figure 3 and Figure 4. The commands were:

A <- randomLHS(7, 2)
B <- augmentLHS(A, 3)

Figure 3. Original design with 7 points

plot of chunk Z

Figure 4. Augmented design with 3 additional points. Note that row 9 has 2 points and row 3 has none.

plot of chunk W