General Questions
What is Huawei Ascend?
Huawei Ascend is a family of AI accelerators (NPUs) and servers designed for training and
inference of large language models (LLMs) and other AI workloads. Ascend is a direct alternative
to Nvidia GPUs, offering comparable performance at significantly lower cost.
What is the difference between Ascend 910 and 310?
| Parameter | Ascend 910 | Ascend 310 |
|---|---|---|
| Target | LLM training | Inference |
| Performance | 256–376 TFLOPS FP16 | 16–22 TOPS INT8 |
| Memory | 32–64 GB HBM | 8–16 GB LPDDR4 |
| Power | 310–400W | 8–12W |
Why choose Ascend over Nvidia?
- Independence from US supply chains – no export restrictions
- Lower TCO – 20–40% cost savings over 3 years
- Local support – Czech technical support
- Integration – native support for Chinese LLMs (DeepSeek, Qwen)
Is Ascend compatible with existing models?
Yes. Models trained on Nvidia can be converted to Ascend using ONNX format, CANN toolkit, or
MindSpore framework. Most modern LLMs (Llama, DeepSeek, Qwen) are already optimized for Ascend.
Technical Questions
What is CANN toolkit?
CANN (Compute Architecture for Neural Networks) is Huawei's software stack for Ascend,
including:
- ACL (Ascend Computing Language) – runtime API for device, memory, and model management
- ATC (Ascend Tensor Compiler) – model conversion (Caffe/ONNX/TensorFlow → Ascend offline model)
- Profiling tools – performance analysis and optimization
How do I migrate from CUDA to CANN?
- Export model from PyTorch/TensorFlow to ONNX
- Convert using ATC compiler
- Optimize for Ascend NPU
- Test and benchmark
Read our detailed guide: Migrating from CUDA to CANN
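As a rough sketch of the conversion step, an ATC invocation for an ONNX model can be assembled like this. The flag values are illustrative (`--framework=5` selects ONNX input); check `atc --help` for your CANN version before relying on them:

```python
def build_atc_command(onnx_model: str, output: str, soc: str = "Ascend310") -> str:
    """Assemble an ATC command line for converting an ONNX model.

    --framework=5 selects ONNX as the input format; --soc_version must
    match the target NPU (e.g. Ascend310, Ascend910).
    """
    args = [
        "atc",
        f"--model={onnx_model}",
        "--framework=5",
        f"--output={output}",
        f"--soc_version={soc}",
    ]
    return " ".join(args)

# Hypothetical file names, for illustration only.
cmd = build_atc_command("llama.onnx", "llama_ascend", soc="Ascend910")
print(cmd)
```

The resulting `.om` offline model is what the Ascend runtime loads for inference.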
Which frameworks are supported?
- MindSpore – native, best performance
- PyTorch – via CANN backend
- TensorFlow – via CANN backend
- ONNX Runtime – universal inference
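As an illustration of the PyTorch path, a minimal device-selection sketch, assuming the `torch_npu` adapter is installed on the Ascend host (it falls back to CPU elsewhere):

```python
# Hedged sketch: pick an Ascend NPU device in PyTorch when the torch_npu
# adapter is present, otherwise fall back to CPU.
try:
    import torch
    import torch_npu  # noqa: F401  (Huawei's Ascend adapter for PyTorch)

    device = "npu:0" if torch.npu.is_available() else "cpu"
except ImportError:
    device = "cpu"  # Ascend stack not installed on this machine

print(device)  # tensors and models can then be moved with .to(device)
```

Existing PyTorch code typically needs little more than this device change plus a recompile of any custom CUDA kernels.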
What is the performance compared to Nvidia A100?
| Metric | Ascend 910B | Nvidia A100 |
|---|---|---|
| FP16 | 376 TFLOPS | 312 TFLOPS |
| INT8 | 640 TOPS | 624 TOPS |
| Memory BW | 1.6 TB/s | 2.0 TB/s |
| LLM inference | ~95% of A100 | 100% (baseline) |
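The raw ratios follow directly from the table (numbers taken from the rows above):

```python
# Spec ratios from the 910B vs. A100 comparison table above.
fp16 = 376 / 312        # peak FP16 throughput ratio
int8 = 640 / 624        # peak INT8 throughput ratio
mem_bw = 1.6 / 2.0      # memory bandwidth ratio

print(f"FP16: {fp16:.2f}x, INT8: {int8:.2f}x, Mem BW: {mem_bw:.2f}x")
```

LLM inference is often memory-bandwidth-bound rather than compute-bound, which is one plausible reason the end-to-end inference figure (~95%) sits below the raw FLOPS advantage.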
Pricing & Purchase
How much does an Ascend server cost?
Atlas 800 Training Server (8× 910B):
- Price: ~2,500,000–3,000,000 CZK
- Includes 3-year support
- Delivery: 4–6 weeks
Atlas 300I Pro Inference Card:
- Price: ~80,000–120,000 CZK
- PCI-Express card for inference deployment
Is rental/cloud available?
Yes, we offer:
- Managed hosting – our server in your data center
- Cloud instances – API access
- Lease – 3-year financing options
Is there a trial/POC available?
Yes:
- Remote demo – access to our server
- POC – 30-day testing period
- Benchmark – comparison with your current solution
Deployment & Operations
How long does deployment take?
| Phase | Duration |
|---|---|
| Hardware delivery | 4–6 weeks |
| Installation | 1–2 days |
| Configuration | 2–3 days |
| Model migration | 1–2 weeks |
| Testing | 1 week |
| Total | 6–10 weeks |
What are the data center requirements?
- Rack space: 2U–4U depending on configuration
- Power: 2× 3000W (for training server)
- Cooling: front-to-back airflow, 35–45 dB
- Network: 2× 10G/25G/100G Ethernet
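A quick sanity check on the power and cooling budget, using illustrative arithmetic based on the figures above:

```python
# Rough power/thermal budget for one training server (2x 3000W PSUs).
psu_watts = 3000
psus = 2
total_w = psu_watts * psus            # worst-case draw: 6000 W

# 1 W of dissipation is ~3.412 BTU/hr of heat the cooling must remove.
btu_per_hr = total_w * 3.412
print(f"{total_w} W -> ~{btu_per_hr:.0f} BTU/hr of cooling load")
```

Typical sustained draw will sit below this worst case, but racks and PDUs should be sized for it.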
Is clustering supported?
Yes, we support:
- Scale-up – up to 8× 910B in a single server
- Scale-out – multiple servers via RoCE/InfiniBand
- Kubernetes – container orchestration
- Slurm – HPC workload management
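For the Kubernetes path, a pod can request NPUs through Huawei's Ascend device plugin. A minimal sketch follows; the resource name `huawei.com/Ascend910` is what the device plugin typically advertises, and the image name is purely illustrative — verify both against your cluster's plugin version:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: ascend-training
spec:
  containers:
    - name: trainer
      image: mindspore-training:latest   # illustrative image name
      resources:
        limits:
          huawei.com/Ascend910: 8        # request all 8 NPUs on the node
```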
Security & Compliance
Is my data secure?
On-premise deployment means:
- Data never leaves your infrastructure
- No cloud dependency
- Full control over access
- Audit logs for compliance
Is it GDPR compliant?
Yes:
- Data sovereignty – data stays in the EU
- No third-party access
- Right to deletion – full control
- Audit trail – access logging
What certifications does Ascend have?
- ISO 27001 – information security
- ISO 9001 – quality management
- CE – European conformity
- RoHS – environmental standard
Still have questions?
Contact us directly. We're happy to help with any specific requirements or technical questions.
📞 +420 739 414 475 | 📧 [email protected]