xbtorch.decomposition.svd
Singular Value Decomposition (SVD) based gradient decomposition module.
Provides two classes:
FullSVD: Computes the exact full-rank SVD of the gradient, serving as a baseline reference.
TruncatedSVD: Computes a low-rank approximation that retains only the leading singular values, reducing memory and computation at the cost of approximation error.
These methods allow benchmarking and comparison against other low-rank decomposition strategies.
Classes
- FullSVD: Full Singular Value Decomposition (SVD) gradient decomposition.
- TruncatedSVD: Truncated Singular Value Decomposition (SVD) gradient decomposition.
- class xbtorch.decomposition.svd.FullSVD[source]
Bases: GenericDecomposition
Full Singular Value Decomposition (SVD) gradient decomposition.
This method computes the full-rank singular value decomposition of the gradient matrix, yielding an exact reconstruction. It serves as a reference baseline for evaluating truncated or streaming methods.
Notes
- Uses torch.linalg.svd for efficient GPU/CPU support (see the sketch after the references below).
- While accurate, the full SVD has high computational cost and does not reduce gradient dimensionality.
References
Eckart and Young, “The approximation of one matrix by another of lower rank”, Psychometrika, 1936.
Huang et al., “Low-rank gradient descent for memory-efficient training of deep in-memory arrays”, ACM JETC, 2023.
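As a point of reference, here is a minimal sketch of the exact reconstruction this class describes, written directly against torch.linalg.svd; the tensor shapes and variable names are illustrative and are not part of the xbtorch API.

    import torch

    # Toy gradient matrix of shape (output_dim, input_dim).
    gradient = torch.randn(64, 128)

    # Reduced full SVD: U is (64, 64), S is (64,), Vh is (64, 128).
    U, S, Vh = torch.linalg.svd(gradient, full_matrices=False)

    # Exact reconstruction (up to floating-point error): gradient ≈ U @ diag(S) @ Vh.
    reconstructed = U @ torch.diag(S) @ Vh
    print(torch.allclose(reconstructed, gradient, atol=1e-4))  # expected: True

Because every singular value is kept, no memory or dimensionality reduction occurs; the result serves only as an exactness baseline.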
- decompose(input, delta, gradient, group_param_idx)[source]
Compute the full singular value decomposition (SVD) of the gradient.
- Parameters:
input (torch.Tensor) – Input activations for the current layer (unused in this method).
delta (torch.Tensor) – Backpropagated errors for the current layer (unused in this method).
gradient (torch.Tensor) – Gradient tensor to be decomposed, of shape (output_dim, input_dim).
group_param_idx (int or Any) – Parameter group index (unused in this method, kept for API consistency).
- Returns:
Exact reconstruction of the gradient via:
gradient ≈ U @ diag(S) @ Vh
where U, S, and Vh are obtained from full SVD.
- Return type:
torch.Tensor
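A hedged usage sketch of this method, assuming FullSVD takes no constructor arguments (as its signature above suggests); the dummy tensors are placeholders, and input and delta are ignored per the parameter descriptions.

    import torch
    from xbtorch.decomposition.svd import FullSVD

    gradient = torch.randn(64, 128)    # (output_dim, input_dim)
    input_acts = torch.randn(32, 128)  # unused by this method
    delta = torch.randn(32, 64)        # unused by this method

    # decompose returns an exact reconstruction of the gradient via its full SVD.
    recon = FullSVD().decompose(input_acts, delta, gradient, 0)
    print(torch.allclose(recon, gradient, atol=1e-4))  # expected: True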
- class xbtorch.decomposition.svd.TruncatedSVD(rank=1)[source]
Bases: GenericDecomposition
Truncated Singular Value Decomposition (SVD) gradient decomposition.
This method approximates the gradient matrix using a rank-limited singular value decomposition, providing a compressed low-rank representation. It is especially useful for reducing memory and communication costs in training memristive neural networks.
- Parameters:
rank (int, optional (default=1)) – The number of singular values and vectors to retain. Must be less than the number of features (columns of the gradient).
Notes
- Relies on scikit-learn’s TruncatedSVD for dimensionality reduction (see the sketch after the references below).
- If the gradient’s rank is equal to or smaller than the requested rank, consider using FullSVD instead.
- Output reconstruction follows the relationship:
gradient ≈ U @ diag(Sigma) @ VT
where U, Sigma, and VT are derived from the truncated factorization.
References
Halko et al., “Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions”, SIAM Review, 2011.
Huang et al., “Low-rank gradient descent for memory-efficient training of deep in-memory arrays”, ACM JETC, 2023.
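The reconstruction relationship from the notes can be reproduced directly with scikit-learn’s TruncatedSVD; the sketch below illustrates the math only and makes no claim about the internal implementation of this class.

    import numpy as np
    from sklearn.decomposition import TruncatedSVD

    rank = 4
    gradient = np.random.randn(64, 128)  # (output_dim, input_dim)

    svd = TruncatedSVD(n_components=rank)
    # fit_transform returns U @ diag(Sigma), shape (output_dim, rank);
    # svd.components_ holds VT, shape (rank, input_dim).
    u_sigma = svd.fit_transform(gradient)
    vt = svd.components_

    # Low-rank reconstruction: gradient ≈ U @ diag(Sigma) @ VT.
    approx = u_sigma @ vt
    print(approx.shape)  # (64, 128)
    print(np.linalg.norm(gradient - approx) / np.linalg.norm(gradient))  # relative error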
- decompose(input, delta, gradient, group_param_idx)[source]
Perform truncated SVD-based gradient decomposition.
- Parameters:
input (torch.Tensor) – Input activations for the current layer (unused in this method).
delta (torch.Tensor) – Backpropagated errors for the current layer (unused in this method).
gradient (torch.Tensor) – Gradient tensor to be decomposed, of shape (output_dim, input_dim).
group_param_idx (int or Any) – Parameter group index (unused in this method, kept for API consistency).
- Returns:
Low-rank approximation of the gradient with shape (output_dim, input_dim), reconstructed from the truncated singular value decomposition.
- Return type:
torch.Tensor
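For illustration, a hedged usage sketch mirroring the FullSVD example above; the constructor call and tensor shapes are assumptions based on the documented signatures.

    import torch
    from xbtorch.decomposition.svd import TruncatedSVD

    gradient = torch.randn(64, 128)    # (output_dim, input_dim)
    input_acts = torch.randn(32, 128)  # unused by this method
    delta = torch.randn(32, 64)        # unused by this method

    # Keep only the 4 leading singular values and vectors.
    low_rank = TruncatedSVD(rank=4).decompose(input_acts, delta, gradient, 0)

    print(low_rank.shape)  # same shape as the gradient: (64, 128)
    print(torch.norm(gradient - low_rank) / torch.norm(gradient))  # approximation error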