the__alchemist 2 hours ago

Love it. I've been using cudarc lately; would love to try this since it looks like it can share data structures between host and device (?). I infer that this is a higher-level abstraction.

adastra22 41 minutes ago

Where is the Metal love…

  • syl20bnr 26 minutes ago

    It also compiles directly to MSL, it is just missing from the post title.

zekrioca 4 hours ago

Very interesting project! I am wondering how it compare against OpenCL, which I think adopts the same fundamental idea (write once, run everywhere)? Is it about CUbeCL's internal optimization for Rust that happens at compile time?

  • nathanielsimard 2 hours ago

    A lot of things happen at compile time, but you can execute arbitrary code in your kernel that executes at compile time, similar to generics, but with more flexibility. It's very natural to branch on a comptime config to select an algorithm.