Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
cuda
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
124x Slower: What PyTorch DataLoader Actually Does at the Kernel Level
Ingero
Ingero
Ingero
Follow
Apr 1
124x Slower: What PyTorch DataLoader Actually Does at the Kernel Level
#
pytorch
#
gpu
#
python
#
cuda
Comments
Add Comment
5 min read
Tracing a 13x PyTorch Slowdown to a Hidden NumPy Synchronization
Ingero
Ingero
Ingero
Follow
Mar 31
Tracing a 13x PyTorch Slowdown to a Hidden NumPy Synchronization
#
pytorch
#
cuda
#
python
#
gpu
Comments
Add Comment
4 min read
GPU Flight - Cut GPU Profiling Data Transfer by With a Schema Migration
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Mar 23
GPU Flight - Cut GPU Profiling Data Transfer by With a Schema Migration
#
backend
#
performance
#
cuda
#
springboot
1
 reaction
Comments
Add Comment
8 min read
Installing NVIDIA Drivers Without CUDA
Asher-ish
Asher-ish
Asher-ish
Follow
Mar 19
Installing NVIDIA Drivers Without CUDA
#
nvidia
#
cuda
#
ubuntu
#
linux
2
 reactions
Comments
Add Comment
7 min read
AMD ROCm on Consumer GPUs: The Open-Source CUDA Alternative That Actually Works Now [2026 Guide]
Kunal
Kunal
Kunal
Follow
Mar 18
AMD ROCm on Consumer GPUs: The Open-Source CUDA Alternative That Actually Works Now [2026 Guide]
#
rocm
#
amd
#
cuda
#
opensource
2
 reactions
Comments
Add Comment
7 min read
GPU Flight — System Architecture
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Mar 16
GPU Flight — System Architecture
#
cuda
#
gpu
#
performance
#
monitoring
4
 reactions
Comments
Add Comment
5 min read
GPU Flight — System Architecture
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Mar 16
GPU Flight — System Architecture
#
cuda
#
gpu
#
performance
#
monitoring
2
 reactions
Comments
Add Comment
5 min read
I built the first open-source FP8 linear solver in Python — 2-3x faster than cuBLAS
SHARVESWAR .M
SHARVESWAR .M
SHARVESWAR .M
Follow
Mar 15
I built the first open-source FP8 linear solver in Python — 2-3x faster than cuBLAS
#
python
#
cuda
#
gpu
#
opensource
2
 reactions
Comments
Add Comment
3 min read
Implementing Pollard's Kangaroo Algorithm on CUDA
Raphael Bernardo
Raphael Bernardo
Raphael Bernardo
Follow
Mar 12
Implementing Pollard's Kangaroo Algorithm on CUDA
#
cuda
#
gpu
#
cryptography
#
algorithms
1
 reaction
Comments
Add Comment
5 min read
Nvidia Open-Weight Models: Why the $26B Bet Matters
Simon Paxton
Simon Paxton
Simon Paxton
Follow
Mar 12
Nvidia Open-Weight Models: Why the $26B Bet Matters
#
nvidia
#
nemotron3
#
dgxcloud
#
cuda
2
 reactions
Comments
Add Comment
7 min read
AI Builds AI: How Anthropic’s Claude Codes Its Future
Simon Paxton
Simon Paxton
Simon Paxton
Follow
Mar 11
AI Builds AI: How Anthropic’s Claude Codes Its Future
#
anthropic
#
claude
#
time
#
cuda
2
 reactions
Comments
Add Comment
10 min read
From 2-Adic Geometry to Cunningham Chains: Visualization-Driven GPU Search
Nenad Mićić
Nenad Mićić
Nenad Mićić
Follow
Mar 11
From 2-Adic Geometry to Cunningham Chains: Visualization-Driven GPU Search
#
showdev
#
cuda
#
hpc
#
datavis
3
 reactions
Comments
Add Comment
4 min read
Detecting Thread Divergence with SASS Metrics and GPU Flight
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Mar 10
Detecting Thread Divergence with SASS Metrics and GPU Flight
#
gpu
#
cpp
#
cuda
#
performance
2
 reactions
Comments
Add Comment
6 min read
Profiling GPU (CUDA) — Getting Started with GPU Flight's Python Package
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Mar 9
Profiling GPU (CUDA) — Getting Started with GPU Flight's Python Package
#
cuda
#
cpp
#
gpu
#
python
3
 reactions
Comments
Add Comment
6 min read
Profiling GPU (CUDA) — What Is Actually Limiting Your Kernel?
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Mar 2
Profiling GPU (CUDA) — What Is Actually Limiting Your Kernel?
#
performance
#
cuda
#
gpu
#
cpp
3
 reactions
Comments
Add Comment
4 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account