About the Project
Deploy a Scalable and Low-Latency Agentic Chatbot in python using cutting edge techniques like Cache Augmented Retrieval. Implement a professional MLOps Pipeline to ensure visibility, scalability, low latency, logging and continuous feature integration and deployment.
Resources
Week 0
- Understand Regression and Grad Descent
- Neural Networks
- Coding an NN in PyTorch, and a micrograd framework from scratch
Week 1
- NLP Playlist by Tensorflow
- Side by side, refer to this github repo
- Text Pre processing
- Text Normalization
- Bag of words representation
- Term Frequency-Inverse Document Frequency
- Continuous Bag of Words
- One Hot Encodings
Week 2
- Recurrent Neural Networks
- Mathematics of RNNs
- Long Short Term Memory
- overall idea of lstms and rnns with a little maths - video link
- Introduction to Transformers
- Attention in Transformers
Week 2 Additional Resources
- Attention Is All You Need
- QLoRA: Efficient Finetuning of Quantized LLMs
- LoRA: Low-Rank Adaptation of Large Language Models
- Fine Tuning Repository