![]() |
StarNEig Library
version v0.1-beta.1
A task-based library for solving nonsymmetric eigenvalue problems
|
StarNEig library aims to provide a full suite of algorithms for solving non-symmetric (generalized) eigenvalue problems. The library is built on top of the StarPU runtime system and targets both shared memory and distributed memory machines. Some components of the library support GPUs.
The four main components of the library are:
The library has been developed as a part of the NLAFET project. The project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 671633. Support has also been received from eSSENCE, a collaborative e-Science programme funded by the Swedish Government via the Swedish Research Council (VR).
The library is currently in a beta state and only real arithmetic is supported. In addition, some interface functions are implemented as LAPACK and ScaLAPACK wrappers.
Current status with standard eigenvalue problems:
Component | Shared memory | Distributed memory | Accelerators (GPUs) |
---|---|---|---|
Hessenberg reduction | Complete | ScaLAPACK wrapper | Single GPU |
Schur reduction | Complete | Experimental | Experimental |
Eigenvalue reordering | Complete | Complete | Experimental |
Eigenvectors | Complete | Waiting integration | Not planned |
Current status with generalized eigenvalue problems:
Component | Shared memory | Distributed memory | Accelerators (GPUs) |
---|---|---|---|
Hessenberg-triangular reduction | LAPACK wrapper / Planned | ScaLAPACK wrapper | Not planned |
Generalized Schur reduction | Complete | Experimental | Experimental |
Generalized eigenvalue reordering | Complete | Complete | Experimental |
Generalized eigenvectors | Complete | Waiting integration | Not planned |
--disable-cuda-memcpy-peer
. It is possible that newer version are also effected by this problem.STARPU_LIMIT_CUDA_MEM
environmental variable may fix some GPU related memory allocation problems. For example, if the GPU has 6 GB of memory, then setting STARPU_LIMIT_CUDA_MEM=5500
might help.Given a square matrix of size
, the standard eigenvalue problem (SEP) consists of finding eigenvalues
and associated eigenvectors
such that
The eigenvalues are the (potentially complex) roots of the polynomial
of degree
. There is often a full set of
linearly independent eigenvectors, but if there are multiple eigenvalues (i.e., if
for some
) then there might not be a full set of independent eigenvectors.
The dense matrix is condensed to Hessenberg form by computing a Hessenberg decomposition
where is unitary and
is upper Hessenberg. This is done in order to greatly accelerate the subsequent computation of a Schur decomposition since when working on
of size
, the amount of work in each iteration of the QR algorithm is reduced from
to
flops.
Starting from the Hessenberg matrix we compute a Schur decomposition
where is unitary and
is upper triangular. The eigenvalues of
can now be determined as they appear on the diagonal of
, i.e.,
. For real matrices there is a similar decomposition known as the real Schur decomposition
where is orthogonal and
is upper quasi-triangular with
and
blocks on the diagonal. The
blocks correspond to the real eigenvalues and each
block corresponds to a pair of complex conjugate eigenvalues.
Given a subset consisting of of the eigenvalues, we can reorder the eigenvalues on the diagonal of the Schur form by constructing a unitary matrix
such that
and the eigenvalues of the block
are the selected eigenvalues. The first
columns of
span an invariant subspace associated with the selected eigenvalues.
Given a subset consisting of of the eigenvalues
for
and a Schur decomposition
, we can compute for each
an eigenvector
such that
by first computing an eigenvector
of
and then transform it back to the original basis by pre-multiplication with
.
Given a square matrix pencil , where
, the generalized eigenvalue problem (GEP) consists of finding generalized eigenvalues
and associated generalized eigenvectors
such that
The eigenvalues are the (potentially complex) roots of the polynomial
of degree
. There is often a full set of
linearly independent generalized eigenvectors, but if there are multiple eigenvalues (i.e., if
for some
) then there might not be a full set of independent eigenvectors.
At least in principle, a GEP can be transformed into a SEP provided that is invertible, since
However, in finite precision arithmetic this practice is not recommended.
The dense matrix pair is condensed to Hessenberg-triangular form by computing a Hessenberg-triangular decomposition
where are unitary,
is upper Hessenberg, and
is upper triangular. This is done in order to greatly accelerate the subsequent computation of a generalized Schur decomposition.
Starting from the Hessenberg-triangular pencil we compute a generalized Schur decomposition
where are unitary and
are upper triangular. The eigenvalues of
can now be determined from the diagonal element pairs
, i.e.,
(if
). If
and
, then
is an infinite eigenvalue of the matrix pair
. (If both
and
for some
, then the pencil is singular and the eigenvalues are undetermined; all complex numbers are eigenvalues). For real matrix pairs there is a similar decomposition known as the real generalized Schur decomposition
where are orthogonal,
is upper quasi-triangular with
and
blocks on the diagonal, and
is upper triangular. The
blocks on the diagonal of
correspond to the real generalized eigenvalues and each
block corresponds to a pair of complex conjugate generalized eigenvalues.
Given a subset consisting of of the generalized eigenvalues, we can reorder the generalized eigenvalues on the diagonal of the generalized Schur form by constructing unitary matrices
and
such that
and the eigenvalues of the block pencil
are the selected generalized eigenvalues. The first
columns of
spans a right deflating subspace associated with the selected generalized eigenvalues.
Given a subset consisting of of the eigenvalues
for
and a generalized Schur decomposition
, we can compute for each
a generalized eigenvector
such that
by first computing a generalized eigenvector
of
and then transform it back to the original basis by pre-multiplication with
.