The default method for initializing the loadings \(\ell_{\cdot k}\) and factor values \(f_{\cdot k}\) of a new ("greedy") flash factor. It is essentially an implementation of the power method, but unlike many existing implementations, it can handle missing data and sign constraints. For details, see Chapter 2.2.3 in the reference below.
flash_greedy_init_default(
flash,
sign_constraints = NULL,
tol = NULL,
maxiter = 100,
seed = 666
)
A flash_fit
object.
This parameter can be used to constrain the sign of
the initial factor and loadings. It should be a vector of length two with
entries equal to -1, 0, or 1. The first entry constrains the sign of the
loadings \(\ell_{\cdot k}\), with -1 yielding nonpositive loadings, +1
yielding nonnegative loadings, and 0 indicating that loadings should not be
constrained. The second entry of sign_constraints
similarly
constrains the sign of factor values \(f_{\cdot k}\). If
sign_constraints = NULL
, then no constraints will be applied.
Convergence tolerance parameter. When the maximum (absolute)
change over all values \(\ell_{ik}\) and \(f_{jk}\) is less than or
equal to tol
, initialization terminates. At each iteration, the
factor and loadings are \(L^2\)-normalized. The default tolerance
parameter is \(\min(1 / n, 1 / p)\), where \(n\) is
the number of rows in the data matrix and \(p\) is the number of columns.
Maximum number of power iterations.
Since initialization is random, a default seed is set for reproducibility.
A list of length two consisting of, respectively, the vector of initial values for loadings \(\ell_{\cdot k}\) and the vector of initial factor values \(f_{\cdot k}\).
Jason Willwerscheid (2021), Empirical Bayes Matrix Factorization: Methods and Applications. Ph.D. thesis, University of Chicago.