Assorted optimizations for sparse dense matrix multiplication #666
Merged
dkarrasch merged 10 commits into JuliaSparse:main on Apr 20, 2026
Conversation
Better performance for mutable types like BigFloat
…throwing function
Codecov Report
❌ Patch coverage is
Additional details and impacted files:

@@            Coverage Diff            @@
##             main     #666     +/-  ##
=========================================
+ Coverage   84.36%   84.42%   +0.05%
=========================================
  Files          13       13
  Lines        9346     9400      +54
=========================================
+ Hits         7885     7936      +51
- Misses       1461     1464       +3
Member
Is it possible to update tests to increase the coverage?
Contributor
Author
I've added more direct tests for both the multiplications and the error checking.
ViralBShah
reviewed
Jan 6, 2026
Member
Let's give this a couple more days and merge.
dkarrasch
reviewed
Jan 6, 2026
dkarrasch pushed a commit that referenced this pull request on Apr 20, 2026
dkarrasch added a commit that referenced this pull request on Apr 20, 2026
The change is mainly based on testing of the `_spmul!(C::StridedMatrix, X::DenseMatrixUnion, A::SparseMatrixCSCUnion2, α::Number, β::Number)` function, which is also the function used for my performance numbers below. I've then applied the same improvements to other similar functions, though the gain for some of them may not be as big (since not all of the functions touched are amenable to vectorization).

The main improvement is to hoist the matrix size and pointer accesses out of the loop to work around JuliaLang/julia#60409. This change has as much as a 2x performance impact for complex numbers (it can be even bigger on armv8.3-a and above, i.e. including all Apple processors, by better triggering LLVM's complex-number multiplication pattern matching with llvm/llvm-project#173818).
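A minimal sketch of the hoisting idea (a hypothetical simplified kernel, not the actual `_spmul!`): the `rowvals`/`nonzeros` arrays of `A` and the row count of `X` are read once before the loops, so the compiler does not have to reload them on every iteration.

```julia
using SparseArrays

# Hypothetical, simplified kernel computing C .+= X * A over the CSC
# structure of A. The field accesses (rowvals, nonzeros) and size(X, 1)
# are hoisted out of the loops instead of being re-read per iteration.
function spmul_sketch!(C::Matrix, X::Matrix, A::SparseMatrixCSC)
    size(X, 2) == size(A, 1) || throw(DimensionMismatch())
    rv = rowvals(A)    # hoisted: row indices of stored entries
    nz = nonzeros(A)   # hoisted: stored values
    m = size(X, 1)     # hoisted: inner-loop trip count
    for j in 1:size(A, 2)
        for k in nzrange(A, j)
            a = nz[k]
            col = rv[k]
            @inbounds @simd for i in 1:m
                C[i, j] += X[i, col] * a
            end
        end
    end
    return C
end
```

With the loads hoisted, the innermost loop only touches `C`, `X`, and locals, which is what lets LLVM vectorize it.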
Adding `muladd` is the second most important change; it mostly affects complex numbers and BigFloat, since the cost of the operations saved is more significant relative to the bare memory accesses.

There are also other minor tweaks that are mostly useful for small matrices (~20% impact for a ~10x10 matrix). These were included mainly because I was working on another optimization that may not work well for small cases; I applied these optimizations to the small-matrix paths so that I can make a fair comparison of the effect of the other change. I'm not done testing the other change yet, but these small fixes are ready, so I've included them here.
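To illustrate the `muladd` point (a generic accumulation kernel as a sketch, not the PR's actual code): replacing `s += x[i] * y[i]` with `muladd(x[i], y[i], s)` permits a fused multiply-add, which saves comparatively more for element types like Complex and BigFloat where each multiply and add is expensive.

```julia
# Hypothetical accumulation kernel: muladd(a, b, c) computes a*b + c but
# allows the compiler (or the type's muladd method) to fuse the multiply
# and add instead of materializing the intermediate product.
function dot_muladd(x::AbstractVector, y::AbstractVector)
    s = zero(eltype(x)) * zero(eltype(y))
    @inbounds for i in eachindex(x, y)
        s = muladd(x[i], y[i], s)
    end
    return s
end
```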