Get in Touch

Course Outline

Introduction to Cambricon and MLU Architecture

  • Overview of Cambricon’s AI chip portfolio.
  • MLU architecture and instruction pipeline.
  • Supported model types and use cases.

Installing the Development Toolchain

  • Installing BANGPy and the Neuware SDK.
  • Environment setup for Python and C++.
  • Model compatibility and preprocessing.

Model Development with BANGPy

  • Tensor structure and shape management.
  • Construction of computation graphs.
  • Support for custom operations in BANGPy.

Deploying with the Neuware Runtime

  • Converting and loading models.
  • Execution and inference control.
  • Best practices for edge and data center deployment.

Performance Optimization

  • Memory mapping and layer tuning.
  • Execution tracing and profiling.
  • Identifying and resolving common bottlenecks.

Integrating MLU into Applications

  • Using Neuware APIs for application integration.
  • Support for streaming and multi-model scenarios.
  • Hybrid CPU-MLU inference configurations.

End-to-End Project and Use Case

  • Lab: Deploying a vision or NLP model.
  • Edge inference with BANGPy integration.
  • Testing for accuracy and throughput.

Summary and Next Steps

Requirements

  • Familiarity with the structure of machine learning models.
  • Experience with Python and/or C++.
  • Understanding of model deployment and acceleration concepts.

Audience

  • Embedded AI developers.
  • ML engineers deploying to edge or data center environments.
  • Developers working with Chinese AI infrastructure.
 21 Hours

Number of participants


Price per participant

Testimonials (1)

Upcoming Courses

Related Categories