Get in Touch

Course Outline

Introduction to Open-Source LLMs

  • Understanding open-weight models and their significance
  • Overview of LLaMA, Mistral, Qwen, and other community-driven models
  • Use cases for private, on-premise, or secure deployments

Environment Setup and Tools

  • Installing and configuring Transformers, Datasets, and PEFT libraries
  • Selecting appropriate hardware for fine-tuning tasks
  • Loading pre-trained models from Hugging Face or other repositories

Data Preparation and Preprocessing

  • Understanding dataset formats (instruction tuning, chat data, text-only)
  • Managing tokenization and sequences
  • Creating custom datasets and data loaders

Fine-Tuning Techniques

  • Comparing standard full fine-tuning with parameter-efficient methods
  • Applying LoRA and QLoRA for efficient fine-tuning
  • Utilizing the Trainer API for rapid experimentation

Model Evaluation and Optimization

  • Assessing fine-tuned models using generation and accuracy metrics
  • Addressing overfitting, generalization, and validation sets
  • Tips for performance tuning and logging

Deployment and Private Use

  • Saving and loading models for inference purposes
  • Deploying fine-tuned models in secure enterprise environments
  • Comparing on-premise versus cloud deployment strategies

Case Studies and Use Cases

  • Examples of enterprise adoption of LLaMA, Mistral, and Qwen
  • Handling multilingual and domain-specific fine-tuning
  • Discussion: Evaluating the trade-offs between open and closed models

Summary and Next Steps

Requirements

  • A solid understanding of Large Language Models (LLMs) and their underlying architecture
  • Practical experience with Python and PyTorch
  • Basic familiarity with the Hugging Face ecosystem

Target Audience

  • Machine Learning practitioners
  • AI developers
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories