plot_sparsity (generic function with 1 method)
Block-Coordinate Frank-Wolfe and Block-Vectors
In this example, we demonstrate the usage of the FrankWolfe.block_coordinate_frank_wolfe
and FrankWolfe.BlockVector
. We consider the problem of minimizing the squared Euclidean distance between two sets. We compare different update orders and different update steps.
Import and setup
We first import the necessary packages and include the code for plotting the results.
using FrankWolfe
using LinearAlgebra
include("plot_utils.jl")
plot_sparsity (generic function with 1 method)
Next, we define the objective function and its gradient. The iterates x
are instances of the FrankWolfe.BlockVector
type. The different blocks of the vector can be accessed via the blocks
field.
f(x) = dot(x.blocks[1] - x.blocks[2], x.blocks[1] - x.blocks[2])
function grad!(storage, x)
@. storage.blocks = [x.blocks[1] - x.blocks[2], x.blocks[2] - x.blocks[1]]
end
grad! (generic function with 1 method)
In our example we consider the probability simplex and an L-infinity norm ball as the feasible sets.
n = 100
lmo1 = FrankWolfe.BoxLMO(-ones(n), zeros(n))
lmo2 = FrankWolfe.ProbabilitySimplexLMO(1.0)
prod_lmo = FrankWolfe.ProductLMO((lmo1, lmo2))
FrankWolfe.ProductLMO{2, Tuple{FrankWolfe.BoxLMO{Float64, 1, Vector{Float64}, Vector{Float64}}, FrankWolfe.ProbabilitySimplexLMO{Float64}}}((FrankWolfe.BoxLMO{Float64, 1, Vector{Float64}, Vector{Float64}}([-1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0 … -1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0, -1.0], [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 … 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]), FrankWolfe.ProbabilitySimplexLMO{Float64}(1.0)))
We initialize the starting point x0
as a FrankWolfe.BlockVector
with two blocks. The two other arguments are the block sizes and the overall number of entries.
x0 = FrankWolfe.BlockVector([-ones(n), [i == 1 ? 1 : 0 for i in 1:n]], [(n,), (n,)], 2 * n);
Running block-coordinate Frank-Wolfe with different update-orders
In a first step, we compare different update orders. There are three different update orders implemented, FrankWolfe.FullUpdate
, CyclicUpdate
and Stochasticupdate
. For creating a custome FrankWolfe.BlockCoordinateUpdateOrder
, one needs to implement the function select_update_indices
.
struct CustomOrder <: FrankWolfe.BlockCoordinateUpdateOrder end
function FrankWolfe.select_update_indices(::CustomOrder, state::FrankWolfe.CallbackState, dual_gaps)
return [rand() < 1 / n ? 1 : 2 for _ in 1:length(state.lmo.lmos)]
end
We run the block-coordinate Frank-Wolfe method with the different update orders and store the trajectories.
trajectories = []
for order in [
FrankWolfe.FullUpdate(),
FrankWolfe.CyclicUpdate(),
FrankWolfe.StochasticUpdate(),
CustomOrder(),
]
res = FrankWolfe.block_coordinate_frank_wolfe(
f,
grad!,
prod_lmo,
x0;
verbose=true,
trajectory=true,
update_order=order,
)
push!(trajectories, res.traj_data)
end
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: FrankWolfe.FullUpdate() UPDATE_STEP: DataType[FrankWolfe.FrankWolfeStep, FrankWolfe.FrankWolfeStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 5.000000e-01 -1.015000e+02 1.020000e+02 0.000000e+00 Inf
Last 100 1.000000e-02 1.000000e-02 1.734723e-17 4.900970e-01 2.040412e+02
-------------------------------------------------------------------------------------------------
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: FrankWolfe.CyclicUpdate(-1) UPDATE_STEP: DataType[FrankWolfe.FrankWolfeStep, FrankWolfe.FrankWolfeStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 5.000000e-01 -5.000000e-01 1.000000e+00 0.000000e+00 Inf
Last 100 1.000000e-02 1.000000e-02 1.734723e-17 9.941546e-02 1.005880e+03
-------------------------------------------------------------------------------------------------
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: FrankWolfe.StochasticUpdate(-1) UPDATE_STEP: DataType[FrankWolfe.FrankWolfeStep, FrankWolfe.FrankWolfeStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 5.000000e-01 -5.000000e-01 1.000000e+00 0.000000e+00 Inf
Last 92 1.000000e-02 1.000000e-02 1.734723e-17 3.714977e-02 2.476462e+03
-------------------------------------------------------------------------------------------------
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: Main.CustomOrder() UPDATE_STEP: DataType[FrankWolfe.FrankWolfeStep, FrankWolfe.FrankWolfeStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 2.333333e+00 -Inf Inf 0.000000e+00 Inf
Last 93 1.000000e-02 1.000000e-02 1.734723e-17 3.929418e-02 2.366763e+03
-------------------------------------------------------------------------------------------------
Plotting the results
labels = ["Full update", "Cyclic order", "Stochstic order", "Custom order"]
plot_trajectories(trajectories, labels, xscalelog=true)
Running BCFW with different update methods
As a second step, we compare different update steps. We consider the FrankWolfe.BPCGStep
and the FrankWolfe.FrankWolfeStep
. One can either pass a tuple of FrankWolfe.UpdateStep
to define for each block the update procedure or pass a single update step so that each block uses the same procedure.
trajectories = []
for us in [
(FrankWolfe.BPCGStep(), FrankWolfe.FrankWolfeStep()),
(FrankWolfe.FrankWolfeStep(), FrankWolfe.BPCGStep()),
FrankWolfe.BPCGStep(),
FrankWolfe.FrankWolfeStep(),
]
res = FrankWolfe.block_coordinate_frank_wolfe(
f,
grad!,
prod_lmo,
x0;
verbose=true,
trajectory=true,
update_step=us,
)
push!(trajectories, res.traj_data)
end
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: FrankWolfe.CyclicUpdate(-1) UPDATE_STEP: DataType[FrankWolfe.BPCGStep, FrankWolfe.FrankWolfeStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 5.000000e-01 -5.000000e-01 1.000000e+00 0.000000e+00 Inf
Last 100 1.000000e-02 1.000000e-02 1.734723e-17 5.380297e-01 1.858633e+02
-------------------------------------------------------------------------------------------------
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: FrankWolfe.CyclicUpdate(-1) UPDATE_STEP: DataType[FrankWolfe.FrankWolfeStep, FrankWolfe.BPCGStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 5.000000e-01 -5.000000e-01 1.000000e+00 0.000000e+00 Inf
Last 100 1.000000e-02 1.000000e-02 1.040834e-17 2.569455e-01 3.891876e+02
-------------------------------------------------------------------------------------------------
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: FrankWolfe.CyclicUpdate(-1) UPDATE_STEP: DataType[FrankWolfe.BPCGStep, FrankWolfe.BPCGStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 5.000000e-01 -5.000000e-01 1.000000e+00 0.000000e+00 Inf
Last 100 1.000000e-02 1.000000e-02 1.734723e-17 8.365461e-03 1.195391e+04
-------------------------------------------------------------------------------------------------
Block coordinate Frank-Wolfe (BCFW).
MEMORY_MODE: FrankWolfe.InplaceEmphasis() STEPSIZE: DataType[FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}, FrankWolfe.Secant{FrankWolfe.var"#24#26", FrankWolfe.Adaptive{Float64, Int64, FrankWolfe.var"#45#47"}}] EPSILON: 1.0e-7 MAXITERATION: 10000 TYPE: Float64
MOMENTUM: nothing GRADIENT_TYPE: FrankWolfe.BlockVector{Float64, Vector{Float64}, Tuple{Int64}} UPDATE_ORDER: FrankWolfe.CyclicUpdate(-1) UPDATE_STEP: DataType[FrankWolfe.FrankWolfeStep, FrankWolfe.FrankWolfeStep]
-------------------------------------------------------------------------------------------------
Type Iteration Primal Dual Dual Gap Time It/sec
-------------------------------------------------------------------------------------------------
I 1 5.000000e-01 -5.000000e-01 1.000000e+00 0.000000e+00 Inf
Last 100 1.000000e-02 1.000000e-02 1.734723e-17 3.340726e-03 2.993361e+04
-------------------------------------------------------------------------------------------------
Plotting the results
labels = ["BPCG FW", "FW BPCG", "BPCG", "FW"]
plot_trajectories(trajectories, labels, xscalelog=true)
This page was generated using Literate.jl.