Rewrite `tensor.eye` implementation and kernel by ndgrigorian · Pull Request #2937 · IntelPython/dpnp

ndgrigorian · 2026-05-30T09:28:26Z

This PR proposes an update to the tensor.eye constructor, shifting to an approach using a 2D nd_range kernel which instead uses the condition that col - row == k along the diagonal, which is correct whether the array is C- or F-contiguous

This eliminates any branching in the kernel, in hopes of improving performance even marginally.

Have you provided a meaningful PR description?
Have you added a test, reproducer or referred to an issue with a reproducer?
Have you tested your changes locally for CPU and GPU devices?
Have you made sure that new changes do not introduce compiler warnings?
Have you checked performance impact of proposed changes?
Have you added documentation for your changes, if necessary?
Have you added your changes to the changelog?

github-actions · 2026-05-30T10:10:58Z

View rendered docs @ https://intelpython.github.io/dpnp/pull/2937/index.html

coveralls · 2026-05-30T10:41:13Z

coverage: 78.248% (-0.003%) from 78.251% — update-eye-ctor into master

github-actions · 2026-05-30T10:46:12Z

Array API standard conformance tests for dpnp=0.21.0dev0=py313h509198e_50 ran successfully.
Passed: 1357
Failed: 3
Skipped: 16

ndgrigorian · 2026-05-31T00:14:06Z

@antonwolfy @vlad-perevezentsev used this as a chance to experiment with vtune and found that performance difference is very insignificant between implementations. Same for GPU utilization, about the same across the board.

This implementation is a bit cleaner in my opinion though, so I think still worth adding

vlad-perevezentsev · 2026-06-03T13:35:13Z

@@ -345,9 +345,11 @@ sycl::event full_strided_impl(sycl::queue &q,

 typedef sycl::event (*eye_fn_ptr_t)(sycl::queue &,
                                    std::size_t nelems, // num_elements


nelems is not used in the new logic and can be removed

vlad-perevezentsev · 2026-06-03T13:35:44Z

@@ -400,9 +406,11 @@ class EyeFunctor
 template <typename Ty>
 sycl::event eye_impl(sycl::queue &exec_q,
                     std::size_t nelems,


nelems is not used in the new logic and can be removed

vlad-perevezentsev · 2026-06-03T13:38:47Z

-                     const ssize_t start,
-                     const ssize_t end,
-                     const ssize_t step,
+                     const ssize_t rows,


could you also update a changelog?

vlad-perevezentsev · 2026-06-03T13:42:56Z

    int dst_typeid = array_types.typenum_to_lookup_id(dst_typenum);

-    const py::ssize_t nelem = dst.get_size();
+    const py::ssize_t nelems = dst.get_size();


nelems is passed to fn which does not use it

vlad-perevezentsev · 2026-06-03T13:43:05Z

-
    auto fn = eye_dispatch_vector[dst_typeid];
+    sycl::event eye_event =
+        fn(exec_q, static_cast<std::size_t>(nelems), rows, cols, k, stride0,


Rewrite eye implementation

d951f51

ndgrigorian marked this pull request as ready for review May 31, 2026 00:13

ndgrigorian requested review from antonwolfy and vlad-perevezentsev as code owners May 31, 2026 00:13

Merge branch 'master' into update-eye-ctor

189b4e4

vlad-perevezentsev reviewed Jun 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rewrite `tensor.eye` implementation and kernel#2937

Rewrite `tensor.eye` implementation and kernel#2937
ndgrigorian wants to merge 2 commits into
masterfrom
update-eye-ctor

ndgrigorian commented May 30, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 30, 2026

Uh oh!

coveralls commented May 30, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 30, 2026

Uh oh!

ndgrigorian commented May 31, 2026

Uh oh!

vlad-perevezentsev Jun 3, 2026

Uh oh!

vlad-perevezentsev Jun 3, 2026

Uh oh!

vlad-perevezentsev Jun 3, 2026

Uh oh!

vlad-perevezentsev Jun 3, 2026

Uh oh!

vlad-perevezentsev Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -345,9 +345,11 @@ sycl::event full_strided_impl(sycl::queue &q,

		typedef sycl::event (*eye_fn_ptr_t)(sycl::queue &,
		std::size_t nelems, // num_elements

Conversation

ndgrigorian commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented May 30, 2026

Uh oh!

coveralls commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented May 30, 2026

Uh oh!

ndgrigorian commented May 31, 2026

Uh oh!

vlad-perevezentsev Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

vlad-perevezentsev Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

vlad-perevezentsev Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

vlad-perevezentsev Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

vlad-perevezentsev Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ndgrigorian commented May 30, 2026 •

edited

Loading

coveralls commented May 30, 2026 •

edited

Loading