25 questions · Free · No signup

AI Interview Question Bank

Curated questions on system design, prompt engineering, RAG, LLM evaluation, and AI agents — with walkthroughs, follow-ups, and the kind of detail that actually helps you prep.

Try: "rag", "prompt", "agent memory"

Browse by Category

All Questions(25 of 25)

AI AgentsBeginner

Explain the ReAct Pattern and When You Would Use It

Understand the ReAct pattern — how Reasoning + Acting enables LLMs to solve multi-step problems with tools, and when to choose it over alternatives.

Read question
LLM Eval & OpsBeginner

Explain the Tradeoffs Between Latency, Cost, and Quality in LLM Selection

Navigate the three-way tradeoff between LLM latency, cost, and quality — and learn how to make the right selection for different use cases.

Read question
LLM Eval & OpsBeginner

What Metrics Would You Track for an LLM in Production?

A comprehensive framework for monitoring LLMs in production — from latency and cost to output quality and user satisfaction signals.

Read question
Prompt EngineeringBeginner

Explain Chain-of-Thought Prompting and When to Use It

Understand chain-of-thought prompting — how it works, when it helps, and when simpler prompts are actually better.

Read question
Prompt EngineeringBeginner

How Do You Evaluate Whether a Prompt Is Working Well?

Walk through a systematic approach to measuring prompt quality — from building eval datasets to automated metrics and human evaluation.

Read question
RAG & RetrievalBeginner

When Would You Choose RAG Over Fine-Tuning?

Understand the tradeoffs between RAG and fine-tuning — and learn a decision framework for choosing the right approach for your use case.

Read question
AI AgentsIntermediate

How Would You Implement Memory for a Long-Running AI Agent?

Design a memory system for a long-running AI agent — covering in-context working memory, episodic recall, semantic knowledge, and retrieval strategies.

Read question
AI AgentsIntermediate

How Do You Decide What Tools to Give an AI Agent?

A framework for deciding which tools to give an AI agent — covering granularity, safety boundaries, observability, and the principle of minimal tool sets.

Read question
AI System DesignIntermediate

Design a Conversational AI Customer Support System

Design an AI-powered customer support system that handles common queries automatically while escalating complex issues to human agents.

Read question
AI System DesignIntermediate

Design a Document Q&A System for a Large Corpus

Design an AI system that answers natural language questions over a large collection of documents, with accurate citations and low hallucination rates.

Read question
LLM Eval & OpsIntermediate

How Do You Build an Eval Suite for an LLM-Powered Feature?

Walk through building a systematic evaluation suite for an LLM feature — from test case design to automated metrics and regression tracking.

Read question
Prompt EngineeringIntermediate

What Strategies Do You Use to Reduce Hallucinations?

Walk through a layered approach to reducing LLM hallucinations — from prompt-level techniques to retrieval grounding and output validation.

Read question
Prompt EngineeringIntermediate

How Would You Design a Prompt for Structured Data Extraction?

Design a prompt that reliably extracts structured data (JSON, tables) from unstructured text — handling missing fields, ambiguity, and format errors.

Read question
RAG & RetrievalIntermediate

How Do You Handle Chunking Strategies for Different Document Types?

Compare chunking strategies for different document types — PDFs, code, HTML, and tables — and learn when each approach works best.

Read question
RAG & RetrievalIntermediate

Design a RAG Pipeline from Scratch

Walk through designing a production-ready RAG system covering document ingestion, chunking strategies, embedding models, vector search, and LLM generation.

Read question
RAG & RetrievalIntermediate

How Would You Evaluate Retrieval Quality in a RAG System?

Walk through metrics and methods for evaluating retrieval quality in a RAG pipeline — from offline metrics to end-to-end answer quality.

Read question
AI AgentsAdvanced

Design an AI Agent That Can Book Travel End-to-End

Design a multi-step AI agent that books flights, hotels, and transportation — covering tool design, planning loops, error recovery, and user confirmation.

Read question
AI AgentsAdvanced

Design a Multi-Agent System for Software Development

Design a multi-agent system where specialized agents collaborate on software development — covering orchestration, communication, coordination, and failure modes.

Read question
AI System DesignAdvanced

Design an AI-Powered Code Review System

Design a system that uses LLMs to automatically review pull requests — identifying bugs, style issues, and suggesting improvements at scale.

Read question
AI System DesignAdvanced

Design a Real-Time Content Moderation Pipeline Using LLMs

Design a scalable content moderation system that uses LLMs to detect harmful content in real time while minimizing false positives and latency.

Read question
AI System DesignAdvanced

How Would You Architect a Multi-Model AI Gateway?

Design a unified gateway that routes requests across multiple LLM providers, handles fallbacks, enforces rate limits, and tracks costs per team.

Read question
LLM Eval & OpsAdvanced

How Would You Detect and Handle LLM Output Regressions?

Build a system to detect when LLM output quality degrades — covering statistical monitoring, automated quality checks, and incident response.

Read question
LLM Eval & OpsAdvanced

How Do You Handle Model Version Upgrades Without Breaking Production?

A safe, systematic approach to upgrading LLM model versions in production — from pre-upgrade evaluation to canary deployment and rollback.

Read question
Prompt EngineeringAdvanced

Compare Few-Shot Prompting vs. Fine-Tuning for a Classification Task

Understand when to use few-shot prompting versus fine-tuning for classification — covering cost, data requirements, latency, and when each approach wins.

Read question
RAG & RetrievalAdvanced

Design a Hybrid Search System Combining Semantic and Keyword Search

Design a search system that combines dense vector search with sparse keyword search — outperforming either approach alone through intelligent score fusion.

Read question

Prep the coding round too

AI knowledge is only half the picture. Rubduck helps you nail DSA and coding interviews with an AI interviewer that gives real-time feedback.

Daily tips, confessions & AI news. Unsubscribe anytime. Questions? [email protected]