Migrating from Nvidia CUDA to Huawei CANN

February 2026 · 10 min read

Moving away from a CUDA-dependent infrastructure can seem daunting, but Huawei's Compute Architecture for Neural Networks (CANN) provides a workable alternative for teams targeting Ascend NPUs.

Understanding CANN Structure

CANN functions similarly to CUDA by bridging the gap between deep learning frameworks and the underlying NPU hardware.

Nvidia CUDA (PyTorch/GPU)
# PyTorch on CUDA
import torch

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model = DeepSeekModel().to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
loss_fn = torch.nn.CrossEntropyLoss()

for inputs, targets in dataloader:
    # Every batch must be moved to the GPU explicitly
    inputs, targets = inputs.to(device), targets.to(device)
    optimizer.zero_grad()
    loss = loss_fn(model(inputs), targets)
    loss.backward()
    optimizer.step()
Huawei CANN (MindSpore/Ascend NPU)
# MindSpore on Ascend NPU
import mindspore as ms

ms.set_context(device_target="Ascend", device_id=0)
model = DeepSeekModel()
loss_fn = ms.nn.CrossEntropyLoss()
optimizer = ms.nn.Adam(model.trainable_params(), learning_rate=1e-4)

def forward_fn(data, label):
    return loss_fn(model(data), label)

# Functional autodiff: one call returns both loss and gradients
grad_fn = ms.value_and_grad(forward_fn, None, optimizer.parameters)

# Data pipeline natively optimized for NPU ingestion; no manual device transfer
for data, label in dataset.create_tuple_iterator():
    loss, grads = grad_fn(data, label)
    optimizer(grads)

Migration Steps

  • Convert PyTorch checkpoints
  • Map Tensor operations to CANN equivalents
  • Optimize computation graphs in MindSpore
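The first step mostly comes down to renaming tensors, since PyTorch and MindSpore use different parameter-name conventions for the same layers. A minimal sketch of that mapping pass follows; the `map_param_name` helper is illustrative (BatchNorm is the classic mismatch), and a real converter must enumerate every layer type present in the checkpoint.

```python
# Sketch: translate PyTorch parameter names to MindSpore conventions.
# Mappings shown are the well-known BatchNorm renames; other layers
# (e.g. embeddings, LayerNorm) need their own entries.
TORCH_TO_MS = {
    "weight": "gamma",                 # norm-layer scale
    "bias": "beta",                    # norm-layer shift
    "running_mean": "moving_mean",
    "running_var": "moving_variance",
}

def map_param_name(name: str, is_norm_layer: bool) -> str:
    """Rename the last path component of a parameter for norm layers."""
    prefix, _, leaf = name.rpartition(".")
    if is_norm_layer and leaf in TORCH_TO_MS:
        leaf = TORCH_TO_MS[leaf]
    return f"{prefix}.{leaf}" if prefix else leaf

# Example: a BatchNorm statistic from a PyTorch state_dict
print(map_param_name("layer1.bn.running_var", is_norm_layer=True))
# layer1.bn.moving_variance
```

In practice, the renamed arrays are then wrapped as `mindspore.Tensor` objects and written out with `mindspore.save_checkpoint`, after which the remaining steps (operator mapping, graph optimization) happen inside MindSpore itself.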
"Supply chain independence is worth the initial engineering overhead."

Performance Equivalency

After migration, most models perform comparably, and some benefit from Ascend-specific strengths such as whole-graph compilation in MindSpore's graph mode and the Da Vinci core's dense matrix (cube) units.

Conclusion

The transition demands real upfront engineering work, but it is manageable and buys long-term platform resilience.
