NotebookLM Topic Organization Index

Derived from embeddings visualization — 35 top-level domains, 80+ skills, ~2,000+ document chunks across the MangaAssist knowledge base.

How to Use This Index

Each numbered section is a standalone NotebookLM notebook. Upload the files listed under each notebook into a single NotebookLM project. Notebooks are sized to stay within NotebookLM's 50-source limit.

Notebook 1 — AWS AIP-C01 Domain 1: FM Integration, Data & Compliance

Theme: Everything about integrating Foundation Models into a production system — architecture design, model selection, data pipelines, vector stores, retrieval, and prompt governance.

Folders / Files to Upload: - Domain1-FM-Integration-Data-Compliance/README.md - Domain1-FM-Integration-Data-Compliance/01-task-1.1-solution-design-and-requirements.md - Domain1-FM-Integration-Data-Compliance/02-task-1.2-foundation-model-selection-and-configuration.md - Domain1-FM-Integration-Data-Compliance/03-task-1.3-data-validation-and-processing-pipelines.md - Domain1-FM-Integration-Data-Compliance/04-task-1.4-vector-store-solutions.md - Domain1-FM-Integration-Data-Compliance/05-task-1.5-retrieval-mechanisms-for-fm-augmentation.md - Domain1-FM-Integration-Data-Compliance/06-task-1.6-prompt-engineering-and-governance.md - Domain1-FM-Integration-Data-Compliance/07-domain-1-scenario-map.md - All Skill-1.x.x-*/ scenario and interview Q&A files

Key Skill Groups: | Task | Skills | Focus | |------|--------|-------| | 1.1 — Solution Design | 1.1.1, 1.1.2, 1.1.3 | Architecture, PoC, Standardized Components | | 1.2 — FM Selection | 1.2.1, 1.2.2, 1.2.3, 1.2.4 | Assessment, Dynamic Selection, Resilience, Customization | | 1.3 — Data Pipelines | 1.3.1, 1.3.2, 1.3.3, 1.3.4 | Validation, Multimodal, Formatting, Quality Enhancement | | 1.4 — Vector Stores | 1.4.1, 1.4.2, 1.4.3, 1.4.4, 1.4.5 | Architecture, Metadata, High-Perf Search, Integration, Maintenance | | 1.5 — Retrieval | 1.5.1–1.5.6 | Segmentation, Embeddings, Search, Advanced Architectures, Query Handling | | 1.6 — Prompt & QA | 1.6.1–1.6.5 | Instruction Frameworks, Interactive AI, Governance, QA, Performance |

Notebook 2 — AWS AIP-C01 Domain 2: AI Implementation & Integration

Theme: Building and deploying intelligent, agentic AI systems — autonomous agents, tool use, deployment patterns, secure access, real-time interaction, and advanced GenAI apps.

Folders / Files to Upload: - All files under Implementation-Integration-Domain2/

Key Skill Groups: | Group | Skills | Focus | |-------|--------|-------| | 2.1 — Agentic AI | 2.1.1–2.1.7 | Autonomous Systems, Problem Solving, Safeguards, Model Coordination, Collaborative AI, Tool Integrations, Extensions | | 2.2 — FM Deployment | 2.2.1, 2.2.3 | Deployment Patterns, Optimized Deployment | | 2.3 — Enterprise Connectivity | 2.3.1, 2.3.2, 2.3.3 | Connectivity, Integrated AI, Secure Access / RBAC | | 2.4 — Model Interaction | 2.4.1–2.4.4 | Flexible Interaction, Real-Time/Streaming, Resilient Systems, Intelligent Routing | | 2.5 — Interfaces & Apps | 2.5.1–2.5.6 | FM APIs, Accessible Interfaces, Business Systems, Developer Productivity, Advanced Apps, Troubleshooting |

Notebook 3 — AWS AIP-C01 Domain 4: Operations, Monitoring & Optimization

Theme: Running GenAI in production — token efficiency, cost-effective model selection, caching, throughput, responsive AI, retrieval performance, and full-stack observability.

Folders / Files to Upload: - All files under Operational-Efficiency-Optimization/ - All files under Monitoring-GenAI-Systems/

Key Skill Groups: | Group | Skills | Focus | |-------|--------|-------| | 4.1 — Cost & Tokens | 4.1.1–4.1.4 | Token Efficiency, Cost-Effective Model Selection, High-Perf FM Systems, Intelligent Caching | | 4.2 — Performance | 4.2.1–4.2.6 | Responsive AI, Retrieval Performance, Throughput, FM Enhancement, Resource Allocation, System Performance | | 4.3 — Observability | 4.3.1–4.3.6 | Holistic Observability, GenAI Monitoring, Integrated Observability, Tool Performance, Vector Store Ops, FM Troubleshooting |

Notebook 4 — Security, Safety & AI Governance

Theme: Defending GenAI applications — prompt injection, PII protection, guardrails pipelines, content moderation, incident response, ML-specific threats, supply chain risk, encryption, and responsible AI principles.

Folders / Files to Upload: - All files under Security-Privacy-Guardrails/ - All files under AI-Safety-Security-Governance/

Key Topic Groups: | Area | Content | |------|---------| | Input/Output Safety | Prompt injection defense, guardrails pipeline deep dive, multi-turn context poisoning | | PII & Privacy | PII protection, data privacy controls (Skill 02) | | Content Moderation | Abuse prevention, system prompt extraction (Skill 04) | | Incident Response | Forensics, runbooks (Skill 05) | | ML-Specific Threats | Adversarial AI, model inversion, membership inference (Skill 06) | | Supply Chain & Encryption | Third-party risk, key management (Skills 07, 08) | | Governance & Responsible AI | Compliance frameworks, responsible AI principles (Skills 03, 04 of AI-Safety-Security-Governance) | | Storytelling & Interview | Security interview scenarios, STAR-D narratives (Skills 09, 10) |

Notebook 5 — Evaluation Systems & Testing

Theme: How to evaluate, test, and continuously improve a GenAI system — FM output quality, model comparison, user-centered feedback, offline testing strategies, edge cases, and prompt optimization.

Folders / Files to Upload: - All files under Evaluation-Systems-GenAI/ - All files under Offline-Testing-Quality-Strategies/ - All files under Cost-Optimization-Offline-Testing/

Key Topic Groups: | Area | Content | |------|---------| | FM Output Quality | RAGAS metrics, LLM-as-judge, faithfulness scoring (Skill 01) | | Model Evaluation | Sonnet vs Haiku benchmarking, optimal config (Skill 02) | | User-Centered Evaluation | Thumbs feedback interface, annotation workflows, feedback-to-action pipeline (Skill 03) | | Offline Testing | E2E integration testing, edge case playbook, prompt optimization workflow | | Cost-Efficient Testing | Change-type matrix, cheap testing strategy, regression prevention |

Notebook 6 — Statistical Foundations & Inference

Theme: The statistical underpinning of decisions made in GenAI systems — confidence intervals, t-tests, A/B testing, distribution shift detection, and monitoring statistics.

Folders / Files to Upload: - All files under Statistical-Inference/ - All files under Statistics-Foundations/ - model_evaluation_framework_deep_dive.md

Key Topics: - Confidence intervals for model metrics - T-tests for A/B and canary deployments - Pearson/Spearman correlations in feedback analysis - Golden dataset evaluation layers - Distribution shift detection (chi-squared, MMD, Mahalanobis) - Tools and libraries for statistical monitoring

Notebook 7 — Fine-Tuning, Model Customization & LLMOps

Theme: Adapting foundation models for a specific domain — LoRA/QLoRA, continual learning, RLHF, RAFT, prefix tuning, data curation, synthetic generation, and managing the full model lifecycle.

Folders / Files to Upload: - All files under Fine-Tuning-Foundational-Models/ - All files under LLMOps/ - All files under Model-Inference/

Key Topic Groups: | Area | Content | |------|---------| | PEFT Methods | LoRA, QLoRA, prefix tuning, prompt tuning | | Training Strategies | Continual learning, catastrophic forgetting, RLHF | | Advanced Fine-Tuning | RAFT, distillation, multi-task fine-tuning, data curation | | Synthetic Data | Self-instruct, active learning, distribution shift detection | | Model Inference | GPU architecture, KV cache, batching, quantization | | LLMOps Lifecycle | Model versioning, rollout, A/B testing, lineage, deprecation |

Notebook 8 — Prompt Engineering

Theme: Designing, optimizing, and governing prompts at scale — chain-of-thought, few-shot, role prompting, failure scenarios, and system prompt management.

Folders / Files to Upload: - All files under Prompt-Engineering/

Key Topics: - Foundational prompt design techniques - Chain-of-thought and step-back prompting - Foundational model optimization techniques - Failure scenarios and workarounds (hallucinations, price errors) - Interview-prep drills for prompt engineering

Notebook 9 — RAG Pipeline & Vector Search

Theme: The full retrieval-augmented generation pipeline — chunking, embedding, hybrid search, MCP integrations, query handling, and architecture patterns for production RAG.

Folders / Files to Upload: - All files under RAG-MCP-Integration/ - Relevant Domain1-FM-Integration-Data-Compliance/Skill-1.4.x and Skill-1.5.x files (cross-reference with Notebook 1)

Key Topics: | File | Topic | |------|-------| | 00-overview-rag-mcp-architecture.md | End-to-end RAG + MCP architecture overview | | 01-catalog-search-mcp.md | Catalog search via MCP tool | | 02-user-preferences-recommendation-mcp.md | Personalized recommendations | | 03-order-inventory-mcp.md | Order and inventory lookups | | 04-review-sentiment-mcp.md | Review sentiment MCP pipeline | | 05-support-policy-mcp.md | Support and policy RAG | | 06-trending-discovery-mcp.md | Trending content discovery | | 07-cross-title-link-mcp.md | Cross-title/related item linking | | 08-mcp-orchestration-router.md | MCP orchestration and routing | | 09-rag-retrieval-pipeline-deep-dive.md | Deep dive — chunking, reranking, hybrid search | | 10-mcp-basics-and-amazon-backend-integration.md | MCP basics and backend integration |

Notebook 10 — AWS Infrastructure & Data Services

Theme: The AWS services that power MangaAssist — DynamoDB data modelling, ECS/Fargate/Lambda compute, Docker containers for ML, and the overall tech stack including vLLM.

Folders / Files to Upload: - All files under DynamoDB/ - All files under ECS-Fargate-Lambda/ - All files under Docker-Interview-Pack/ - All files under Tech-Stack/ - All files under MLflow/

Key Topic Groups: | Service / Tool | Area | |----------------|------| | DynamoDB | Data modelling, access patterns, streams, pipelines, security, observability | | ECS Fargate | Container orchestration, task definitions, auto-scaling | | Lambda | Event-driven patterns, cold starts, triggers | | Docker | Containerizing ML workloads, vLLM serving containers | | vLLM | Continuous batching, paged attention, throughput optimization | | MLflow | LLM observability, experiment tracking, tracing |

Notebook 11 — System Design: HLD & LLD

Theme: Designing and low-level implementing the MangaAssist system — high-level architecture questions, low-level design walkthroughs, and API design strategy.

Folders / Files to Upload: - All files under HLD-Questions/ - All files under LLD-Questions/ - All files under API-Design-and-Testing/

Key Topic Groups: | Area | Content | |------|---------| | HLD | RAG pipeline, chatbot scalability, recommendation engine, data ingestion, caching | | LLD | Component-level design — session management, embedding pipeline, guardrails | | API Design | REST vs WebSocket, API testing strategy, scale testing, offline quality | | Grilling Sessions | Hard follow-up drills, system design under pressure |

Notebook 12 — CI/CD Pipelines & DevOps

Theme: Shipping ML systems reliably — deployment pipelines, model promotion pipelines, database migrations, canary deployments, and behavioural tests.

Folders / Files to Upload: - All files under CI-CD-Pipeline-User-Stories/

Key Topics: - CD-01: Application code deployment pipeline - CD-02–06: Model pipeline, data pipeline, infrastructure, rollback - CD-07: Database migration pipeline - Behavioural tests for GenAI deployments

Notebook 13 — Cost & Performance Optimization User Stories

Theme: Product-level tradeoffs and optimization decisions — LLM model tiering, caching policies, token budgets, cost dashboards, and unified optimization decisions.

Folders / Files to Upload: - All files under Cost-Optimization-User-Stories/ - All files under Performance-Optimization-User-Stories/ - All files under Optimization-Tradeoffs-User-Stories/

Key Topics: - LLM model tiering (quality vs cost vs latency tradeoffs) - Caching strategy user stories - Token budget management - Performance SLA user stories - Unified optimization decision dashboard

Notebook 14 — Troubleshooting & Debugging GenAI Applications

Theme: Diagnosing and resolving failures in production GenAI systems — application logging, distributed tracing, POC failure patterns, and systematic debugging runbooks.

Folders / Files to Upload: - All files under Troubleshoot-GenAI-Applications/ - All files under Debugging/

Key Topics: - Application logging across a multi-service chatbot - End-to-end turn debugging - POC implementation pitfalls and fixes - Prompt observability pipeline - Prompt A/B testing framework debugging

Notebook 15 — Math & Modeling Foundations

Theme: The mathematical foundations underlying the ML models in MangaAssist — linear algebra, regression, neural network architectures, and embedding/retrieval math.

Folders / Files to Upload: - All files under Modeling-Math-Foundations/

Key Topics: | File | Content | |------|---------| | 01-matrix-theory-and-linear-algebra.md | Matrices, SVD, PCA for embeddings | | 02-linear-regression-and-generalized-models.md | Regression, logistic, GLMs | | 03-neural-network-architectures.md | Transformers, attention, MLP | | 04-embedding-and-retrieval-architectures.md | Bi-encoders, cross-encoders, HNSW | | 05-tools-and-libraries.md | NumPy, PyTorch, scikit-learn usage |

Notebook 16 — Interview Preparation & War Stories

Theme: Structured interview practice — architect-level questions, real-world war stories, behavioural questions, and the complete MangaAssist narrative.

Folders / Files to Upload: - All files under MangaAssist-Interview-Pack/ - All files under mangaassist_workflow_interview_pack/ - All files under Real-World-Interview-Questions/ - All files under Challenges/ - All files under POC-to-Production-War-Story/ - Root architecture files: 01-problem-statement.md through 16-final-summary.md

Key Topic Groups: | Area | Content | |------|---------| | MangaAssist Architecture Story | Problem statement → HLD → LLD → Metrics → Tradeoffs | | Architect-Level Questions | System design, scaling, reliability, failure modes | | Real-World Challenges | 20+ production challenges at scale | | POC-to-Production | The PoC that fooled us — lessons from production failure | | Workflow Pack | End-to-end chatbot workflow interview scenarios |

Quick Reference: Domain → Notebook Mapping

Folder / Domain	Notebook
`Domain1-FM-Integration-Data-Compliance/`	#1 — FM Integration & Data
`Implementation-Integration-Domain2/`	#2 — AI Implementation
`Operational-Efficiency-Optimization/`	#3 — Operations & Optimization
`Monitoring-GenAI-Systems/`	#3 — Operations & Optimization
`Security-Privacy-Guardrails/`	#4 — Security & Safety
`AI-Safety-Security-Governance/`	#4 — Security & Safety
`Evaluation-Systems-GenAI/`	#5 — Evaluation & Testing
`Offline-Testing-Quality-Strategies/`	#5 — Evaluation & Testing
`Cost-Optimization-Offline-Testing/`	#5 — Evaluation & Testing
`Statistical-Inference/`	#6 — Statistical Foundations
`Statistics-Foundations/`	#6 — Statistical Foundations
`model_evaluation_framework_deep_dive.md`	#6 — Statistical Foundations
`Fine-Tuning-Foundational-Models/`	#7 — Fine-Tuning & LLMOps
`LLMOps/`	#7 — Fine-Tuning & LLMOps
`Model-Inference/`	#7 — Fine-Tuning & LLMOps
`Prompt-Engineering/`	#8 — Prompt Engineering
`RAG-MCP-Integration/`	#9 — RAG & Vector Search
`DynamoDB/`	#10 — AWS Infrastructure
`ECS-Fargate-Lambda/`	#10 — AWS Infrastructure
`Docker-Interview-Pack/`	#10 — AWS Infrastructure
`Tech-Stack/`	#10 — AWS Infrastructure
`MLflow/`	#10 — AWS Infrastructure
`HLD-Questions/`	#11 — System Design HLD/LLD
`LLD-Questions/`	#11 — System Design HLD/LLD
`API-Design-and-Testing/`	#11 — System Design HLD/LLD
`CI-CD-Pipeline-User-Stories/`	#12 — CI/CD & DevOps
`Cost-Optimization-User-Stories/`	#13 — Cost & Performance
`Performance-Optimization-User-Stories/`	#13 — Cost & Performance
`Optimization-Tradeoffs-User-Stories/`	#13 — Cost & Performance
`Troubleshoot-GenAI-Applications/`	#14 — Troubleshooting & Debugging
`Debugging/`	#14 — Troubleshooting & Debugging
`Modeling-Math-Foundations/`	#15 — Math & Modeling Foundations
`MangaAssist-Interview-Pack/`	#16 — Interview Prep
`mangaassist_workflow_interview_pack/`	#16 — Interview Prep
`Real-World-Interview-Questions/`	#16 — Interview Prep
`Challenges/`	#16 — Interview Prep
`POC-to-Production-War-Story/`	#16 — Interview Prep
`01-problem-statement.md` → `16-final-summary.md`	#16 — Interview Prep