
Online Class LLM on Production
Temen-temen kalau dah tertarik di LLM tapi nggak belajar gimana cara kita shipping model LLM kita ke production, gass join!!
bonus:
- video based supplementary material (untuk membantu pembelajaran Online Class)
- tugas besar dan dibimbing lansung saat praktik
https://www.rubythalib.com/ml-on-production.html
Start : 10 Agustus , 2024
Setiap Sabtu: 20.00-22.00 WIB
What You Get in AI Online Class
- Interactive Class
- Live Question and Answer with Mentor
- e-Certificate
- Portfolio
- Interactive Class
Material
*Session 1: Introduction to LLM on Production*
- Overview of LLM (Large Language Model) applications in production
- Importance of LLM in various industries
- Challenges and considerations in deploying LLM models
*Session 2: LLM Managed Service (Claude-3, OpenAI)*
- Introduction to Claude-3 and other managed LLM services
- Comparison of features and pricing
- Use cases and benefits of using managed LLM services
*Session 3: Building RAG using Vector Database*
- Understanding RAG (Retrieval-Augmented Generation) models
- Introduction to Vector Databases PgVector
- Practical implementation of RAG using a Vector Database
*Session 4: Handling with Open Source LLM (Loading, Fine-tuning, etc.)*
- Overview of popular open-source LLM frameworks (e.g., Hugging Face Transformers, LLAMA)
- Loading pre-trained models and datasets
- Techniques for fine-tuning LLMs for specific tasks
*Session 5: Optimizing Your Open Source LLM Deployment*
- Best practices for deploying LLMs in production environments
- Performance optimization tips (e.g., batch size, hardware acceleration, quantization)
*Session 6: Online vs Offline Metrics for LLM Applications*
- Understanding metrics for evaluating LLM performance in online vs. offline scenarios
- Key metrics: accuracy, latency, throughput, cost-efficiency
*Session 7: Building RAG using Open Source LLM*
- Step-by-step guide to building Retrieval-Augmented Generation models using open-source LLM frameworks
- Integrating with external data sources for retrieval
- Advanced techniques and extensions for RAG models