ScyllaAgent: Scalable and Low Latency Agentic Chatbot

About the Project

Deploy a scalable, low-latency agentic chatbot in Python using cutting-edge techniques such as Cache Augmented Retrieval. Implement a professional MLOps pipeline that provides visibility, scalability, low latency, logging, and continuous feature integration and deployment.

Resources

Week 0

  1. Understand Regression and Gradient Descent
  2. Neural Networks
  3. Coding an NN in PyTorch, and a micrograd framework from scratch
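
As a warm-up for the Week 0 topics, here is a minimal sketch of gradient descent fitting a straight line. It is not taken from the linked resources; the function name fit_line, the learning rate, the step count, and the toy data are all assumptions chosen for illustration.

```python
# Fit y = w*x + b by batch gradient descent on the mean squared error.
# fit_line, lr, and steps are illustrative names and defaults, not from the resources.
def fit_line(xs, ys, lr=0.05, steps=2000):
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Gradients of MSE = (1/n) * sum((w*x + b - y)^2) with respect to w and b.
        grad_w = (2.0 / n) * sum((w * x + b - y) * x for x, y in zip(xs, ys))
        grad_b = (2.0 / n) * sum((w * x + b - y) for x, y in zip(xs, ys))
        # Step against the gradient.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Toy data generated from y = 2x + 1, so the fit should approach w = 2, b = 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = fit_line(xs, ys)
print(round(w, 3), round(b, 3))
```

A learning rate that is too large makes the updates diverge on this data, which is worth trying once to see why the step size matters.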

Week 1

  1. NLP Playlist by TensorFlow
  2. Alongside the playlist, refer to this GitHub repo
  3. Text Preprocessing
  4. Text Normalization
  5. Bag of words representation
  6. Term Frequency-Inverse Document Frequency
  7. Continuous Bag of Words
  8. One Hot Encodings
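
To make the bag-of-words and TF-IDF items above concrete, the following is a small sketch in plain Python. It assumes whitespace tokenization and the unsmoothed weighting tf * log(N / df); libraries such as scikit-learn use smoothed variants, so their numbers will differ. The helper names (tokenize, bag_of_words, tf_idf) and the sample documents are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Simplest possible normalization: lowercase and split on whitespace.
    return text.lower().split()


def bag_of_words(docs):
    # Vocabulary is the sorted set of all tokens; each document becomes a count vector.
    vocab = sorted({tok for doc in docs for tok in tokenize(doc)})
    counts = [Counter(tokenize(doc)) for doc in docs]
    vectors = [[c[tok] for tok in vocab] for c in counts]
    return vocab, vectors


def tf_idf(docs):
    vocab, vectors = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: number of documents containing each vocabulary term.
    df = [sum(1 for vec in vectors if vec[i] > 0) for i in range(len(vocab))]
    weighted = []
    for vec in vectors:
        total = sum(vec) or 1  # avoid division by zero for an empty document
        weighted.append(
            [(count / total) * math.log(n_docs / df[i]) for i, count in enumerate(vec)]
        )
    return vocab, weighted


docs = ["the cat sat", "the dog sat", "the cat ran"]
vocab, vectors = bag_of_words(docs)
_, weights = tf_idf(docs)
print(vocab)
print(vectors[0])
print(weights[1])
```

A word that appears in every document, like "the" here, gets an IDF of log(1) = 0, so it contributes nothing to the TF-IDF vector, while a word unique to one document gets the largest weight.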

Week 2

  1. Recurrent Neural Networks
  2. Mathematics of RNNs
  3. Long Short-Term Memory
  4. Overall idea of LSTMs and RNNs with a little maths (video link)
  5. Introduction to Transformers
  6. Attention in Transformers

Week 2 Additional Resources

  1. Attention Is All You Need
  2. QLoRA: Efficient Finetuning of Quantized LLMs
  3. LoRA: Low-Rank Adaptation of Large Language Models
  4. Fine Tuning Repository

Introduction to Python

  1. Intro to Python
  2. Exception Handling
  3. Anaconda
  4. Path and environment variables for Python and Anaconda in Windows
  5. Interactive Python Notebooks
  6. Venv
  7. Managing Packages with venv
  8. Python virtualenv
  9. PyEnv for Python Version Management
  10. Git