About the Project

Deploy a Scalable and Low-Latency Agentic Chatbot in python using cutting edge techniques like Cache Augmented Retrieval. Implement a professional MLOps Pipeline to ensure visibility, scalability, low latency, logging and continuous feature integration and deployment.

Resources

Week 0

Understand Regression and Grad Descent
Neural Networks
Coding an NN in PyTorch, and a micrograd framework from scratch

Week 1 | Introduction to NLP and Sequence Models

Basic NLP

NLP Playlist by Tensorflow
Side by side, refer to this github repo
Text Pre processing
Text Normalization
Bag of words representation
Term Frequency-Inverse Document Frequency
Continuous Bag of Words
One Hot Encodings

Sequence Models

Recurrent Neural Networks
Mathematics of RNNs
Long Short Term Memory
overall idea of lstms and rnns with a little maths - video link
Introduction to Transformers
Attention in Transformers

Week 1 Additional Resources

Introduction to Python

Intro to Python
Exception Handling
Anaconda
Path and environment variables for Python and Anaconda in Windows
Interactive Python Notebooks
Venv
Managing Packages with venv
Python virtualenv
PyEnv for Python Version Management
Git

Week 2 | Retrieval Augmented Generation

APIs, LLMs and HuggingFace

Requests and APIs:
APIs
REST API
Requests
Accessing LLMs via APIs:
Explore models on Huggingface, resources are as follows:

RAGs

LangChain Ecosystem
1. LangChain Crash Course
2. LangSmith
  1. Introduction
  2. Docs and Getting Started
3. LangGraph Crash Course
Prompting
1. 12 Prompting Techniques
2. Prompt engineering by HugginFace
RAGs
1. RAG for Knowledge-Intensive NLP Tasks Paper
2. LangChain Implementation

Week 3 | Agentic Systems

Data Validation and Typing
Concurrency
1. Concurrency, Parallelism, and asyncio
2. Repo for codes in (2.1)
3. Async Programming in python
4. A nice video tutorial (alternative to 2.3)
5. Exception Handling in python asyncio
6. Nice clarificaton from stackoverflow
7. Parallelism v/s Concurrency
Design Patterns
1. Abstract Factory and Abstract Base Classes
2. Factory Method, Composite Patterns, Decorators, State, Iterators etc from Refactoring Guru Design Patterns
3. Grokking OOPs
LlamaIndex
[Tools, Agents, Agentic Orchestration]
[Llamaindex Workflows]

Week 4 | Agentic and Advanced RAGs

Mini Advanced RAGs Roadmap

Ingestion
1. Data Preprocessing/Cleaning
2. Chunking
  1. Fixed Size Chunking
  2. Content-aware Chunking
    1. Simple Sentence and Paragraph splitting
    2. Recursive Character Level Chunking
  3. Document structure-based chunking
  4. Semantic Chunking
  5. Contextual Retrieval: Provides scalability for larger documents
3. Embedding:
  1. Semantic Embeddings
  2. Lexical Embeddings
    1. BM-25 (Best Matching 25): Lexical Matching which builds upon TF-IDF (Term Frequency-Inverse Document Frequency)
Retrieval
1. Search
  1. Semantic Search (dense vectors)
  2. Lexical Search (sparse vectors)
  3. Hybrid Search
    1. Querying Hybrid Index
    2. Querying Sparse and Dense Index and reranking
2. Reranking: Increases quality of retrieved documents
  1. BGE Reranker
  2. Passage Reranking with BERT
Augmentation
Generation
Evaluation
1. Offline Metrics
  1. Binary Relevance Metrics
    1. Order-unaware:
      1. Precision@k: TP/(TP+FP) how many items in the result set are relevant
      2. Recall@k: TP/(TP+FN) how many relevant results your retrieval step returns from all existing relevant results for the query
      3. F1@k: (2 * Precision@k * Recall@k)/(Precision@k + Recall@k)
    2. Order-aware:
      1. Mean Reciprocal Rank (MRR)
      2. Mean Average Precision@K (MAP@K)
  2. Graded Relevance Metrics
    1. Discounted Cumulative Gain (DCG@k)
    2. Normalized Discounted Cumulative Gain (DCG@k)
2. Online Metrics: Based on user data, RL-based
3. Frameworks and Tooling
  1. Arize
  2. ARES
  3. RAGAS
  4. TraceLoop
  5. TruLens
  6. Galileo
Benchmarking AI Assistants

Anthropic Cookbook

Additional Resources

Week 5 and 6| Creating a Python Module for Advanced RAGs

System Design and Patterns Review

Project Management

Repos of similar modules for reference

FlashRAG: https://github.com/RUC-NLPIR/FlashRAG
RAGligh: https://github.com/Bessouat40/RAGLight

Building and structuring modules

Software testing:

Week 7 | Reading and Implementing Research Papers

Week 8 and 9| Refining and Publishing Python Module

Documentation Website

Refactoring and CI/CD

Packaging and Deploying

QubitFlow: Quantum Computing and Machine Learning

SpectroMorph: Advanced Audio Machine Learning

ScyllaAgent: Scalable and Low Latency Agentic Chatbot

About the Project

Resources

Week 0

Week 1 | Introduction to NLP and Sequence Models

Basic NLP

Sequence Models

Week 1 Additional Resources

Introduction to Python

Week 2 | Retrieval Augmented Generation

APIs, LLMs and HuggingFace

RAGs

Week 3 | Agentic Systems

Week 4 | Agentic and Advanced RAGs

Mini Advanced RAGs Roadmap

Additional Resources

Week 5 and 6| Creating a Python Module for Advanced RAGs

System Design and Patterns Review

Project Management

Repos of similar modules for reference

Building and structuring modules

Software testing:

Week 7 | Reading and Implementing Research Papers

Week 8 and 9| Refining and Publishing Python Module

Documentation Website

Refactoring and CI/CD

Packaging and Deploying

QubitFlow: Quantum Computing and Machine Learning

SpectroMorph: Advanced Audio Machine Learning

SpectroMorph: Advanced Audio Machine Learning

QubitFlow: Quantum Computing and Machine Learning

Lord of the Chains: Exploring Web3 and Blockchain

Latest Posts

Graphics Roadmap

Merkle Trees & its Application in VCS

ScyllaAgent: Scalable and Low Latency Agentic Chatbot

About the Project

Resources

Week 0

Week 1 | Introduction to NLP and Sequence Models

Basic NLP

Sequence Models

Week 1 Additional Resources

Introduction to Python

Week 2 | Retrieval Augmented Generation

APIs, LLMs and HuggingFace

RAGs

Week 3 | Agentic Systems

Week 4 | Agentic and Advanced RAGs

Mini Advanced RAGs Roadmap

Additional Resources

Week 5 and 6| Creating a Python Module for Advanced RAGs

System Design and Patterns Review

Project Management

Repos of similar modules for reference

Building and structuring modules

Software testing:

Week 7 | Reading and Implementing Research Papers

Week 8 and 9| Refining and Publishing Python Module

Documentation Website

Refactoring and CI/CD

Packaging and Deploying

QubitFlow: Quantum Computing and Machine Learning

SpectroMorph: Advanced Audio Machine Learning

You may also like

SpectroMorph: Advanced Audio Machine Learning

QubitFlow: Quantum Computing and Machine Learning

Lord of the Chains: Exploring Web3 and Blockchain

Latest Posts

Graphics Roadmap

Merkle Trees & its Application in VCS

Explore Tags