GenAI Interview Question
GenAI Interview Question
What are hallucinations in LLMs, why do they occur, and how do you detect and mitigate them in production systems?
In a high-throughput production system, when would you choose a list over a dictionary (or vice versa), and why?
In a performance-critical Python service, how do best- and worst-case list operations impact latency?
Which similarity measures are used to compare embedding vectors, and how do you choose the right one for production RAG systems?
How are embedding vectors stored and indexed inside a vector database, and what trade-offs affect speed, memory, and recall?
What is re-ranking in a RAG pipeline, when should it be introduced, and what trade-offs does it create?
How would you design a secure enterprise chatbot with role / pay-grade based access control?
How is vector retrieval fundamentally different from SQL queries?
What is LangGraph, how does it differ from LangChain, and how does agentic RAG change system design?
In a production RAG system, do you perform similarity search immediately, or apply validation before retrieval?
How do you guarantee valid JSON output from an LLM in production?
How do you improve reasoning quality without exposing chain-of-thought?
When would you choose FAISS over a managed vector DB like Pinecone or Weaviate?
Explain inverted indexes and product quantization in FAISS and justify your index choice.
With ~800K embeddings, what compute-efficient retrieval strategies would you use?
How does a vector database retrieve relevant vectors for a query embedding?
How is indexing done in FAISS?
What are the different categories of vector stores and when should each be used?
When should you use RAG vs fine-tuning in production?
Why is FastAPI preferred for ML / LLM inference services?
What are the limitations of few-shot prompting?
What is the difference between zero-shot and few-shot prompting, and how do you decide which is better?
What is GPT-4’s token limit and why does it matter?
Which embedding model did you use and how did you evaluate the choice?
What chunking strategies are used in RAG systems and how do you choose the right one?
Why is chunking still required for ~500K rows even if LLMs can handle small datasets?
How would you describe the size and structure of your dataset and why did it matter?
Hallucinations
RAG chatbot returns confident but incorrect answers — how do you debug without increasing latency?
In healthcare/finance, how do you prevent hallucinations from reaching users?
Python Lists / Dicts
RAG pipeline latency spikes due to list usage — which operations cause worst-case behavior?
Memory pressure from dicts at scale — when would you switch to a different structure?
Similarity Measures
Switching embedding models drops retrieval quality — cosine still correct?
Multimodal embeddings have different vector norms — how do you compare safely?
Vector Storage
Vector DB grows from 1M → 100M vectors — how do you control memory cost?
Exact search violates latency SLOs — redesign strategy?
Re-Ranking
Re-ranking doubles latency — how do you rebalance accuracy vs speed?
Re-ranking is expensive — when do you skip it dynamically?
Access Control RAG
Junior employee sees restricted data — where did the system fail?
Roles change monthly — how do you update access without re-embedding?
SQL vs Vector Search
Replace vector DB with Postgres + pgvector — where does it break?
SQL filters reduce recall — how do you redesign retrieval order?
LangGraph / Agentic RAG
Why LangChain agents fail for loops and approvals?
Agent over-retrieves and increases cost — how do you control it?
Retrieval Order
System retrieves for “hi” messages — how do you avoid waste?
Pre-retrieval classifier skips needed retrieval — how do you fix it?
JSON Output
JSON is valid but semantically wrong — how do you detect?
Retries increase latency — how do you enforce structure in one pass?
Reasoning Quality
Accuracy drops without chain-of-thought — how do you redesign safely?
Agent makes wrong decisions — how do you debug without inspecting thoughts?
FAISS vs Managed DB
FAISS can’t handle traffic growth — migration strategy?
Managed DB bill spikes — optimize before switching back?
FAISS Indexing
IVF+PQ drops recall — which parameters do you tune?
Frequent inserts/deletes — why is IVF+PQ problematic?
Scaling Retrieval
P95 latency exceeds SLO — which ANN index and knobs first?
ANN helps speed but raises cost — how do you reduce compute?
Vector Retrieval
Fast results but missing relevant docs — where do you debug?
Unauthorized docs must never appear — where to enforce access?
Vector Stores
When to migrate from FAISS to managed DB?
Managed DB too expensive — how do you self-host safely?
RAG vs Fine-Tuning
Fine-tuned policies become outdated — what went wrong?
RAG breaks latency SLOs — what do you replace with fine-tuning?
FastAPI
High concurrency + slow inference — how do you avoid blocking?
API schema changes — how do you handle backward compatibility?
Prompting
Few-shot works in demo but fails in prod — what’s your decision framework?
Few-shot increases cost 35% — alternatives?
Token Limits
Context overflow in RAG — architectural fix?
Larger context improves accuracy but doubles cost — redesign?
Embeddings
Switching embedding model breaks retrieval — debugging steps?
Better embeddings double latency — how do you decide?
Chunking
Over-chunking loses context — how do you fix it?
PDFs with tables + images — modality-aware chunking strategy?
Dataset Scale
Same row count, worse retrieval — why?
Schema evolves — how do you update without full re-index?
Ingestion / Azure Flow
How does the system detect and ingest PDF attachments from Outlook using Microsoft Graph API?
Why is Azure Blob Storage used in the document ingestion pipeline? What problem does it solve?
What is Microsoft Graph API and how is it used to access Outlook emails programmatically?
What is the role of Azure Functions or Azure App Service in this architecture, and how do you decide between them?
OCR / Document Processing
How does OCR work in production systems, and which LLM-based OCR solutions have you used?
After extracting data from PDFs, what downstream processing steps does the system perform before making it searchable?
What is document parsing, and how do you handle parsing across heterogeneous document types and data sources?
RAG / Multimodal / Search
How would you design a multimodal RAG system (text, images, tables), and what challenges arise compared to text-only RAG?
What is Azure AI Search, and how does it fit into a RAG architecture?
What is Azure AI Studio, and how is it used in building or managing GenAI applications?
What is RAG architecture, and how does data flow through retrieval, generation, and validation stages?
How does RAG architecture work end-to-end in a production system?
Vector DB / Embeddings / Retrieval
Which vector database did you use, and what factors influenced that choice?
How do you design and manage metadata in a vector database for filtering, security, and relevance?
What is a vector database, and why is it required for semantic search or RAG systems?
What are embeddings, and which embedding model did you use? Why was it suitable for your use case?
How is the retrieval operation performed in a vector-based RAG system?
Databases / State
Why do we use Azure Cosmos DB in GenAI systems, and what type of data is best suited for it?
How do you design state management and caching in an AI system?
How do you handle memory management in conversational AI systems?
LLMs / Models / Comparison
What is the difference between BERT and large language models (LLMs) in terms of architecture and use cases?
How is an LLM fundamentally different from a BERT-based model?
How is an LLM-based chatbot different from a traditional rule-based or intent-based chatbot?
Which LLMs have you worked with, and why did you choose them for specific projects?
Have you worked with Google Gemini models, and in what scenarios did you evaluate or use them?
How is Gemini different from GPT-4 in terms of architecture, capabilities, and use cases?
What was the typical token size of inputs you sent to the LLM, and how did it influence system design?
Deployment / Cloud
How would you deploy a Gemini-based RAG application on GCP and Azure? What services would you use?
How do you manage concurrency when many users interact with an AI system simultaneously?
Agents / LangChain / LangGraph
What is LangGraph, and how does it differ from LangChain?
What is the difference between RAG and agent-based systems, and when do you combine them?
How do you design an agentic flow for complex decision-making tasks?
What applications have you built using RAG, LangChain, and LangGraph?
Safety / Governance
How do you define and enforce guardrails for AI responses in production?
How do you implement explainable AI and responsible AI practices in real-world AI systems?
Evaluation / Accuracy
How do you evaluate model accuracy and system-level performance for LLM-based applications?
Conversational / Voice AI
Have you worked on conversational chatbots or voice bots, and what architectural differences did you encounter?
System Design / Experience
How would you design a full-fledged conversational AI system end-to-end?
Can you explain the key AI use cases you’ve worked on, including the business problem and technical solution?
Backend & API (Very Common in Fidelity)
Why did you choose FastAPI for building your backend services? What advantages did it give you in production?
How is a REST API designed for scalability and versioning in enterprise systems?
How did you use SQLAlchemy in your project, and why not raw SQL everywhere?
Explain SQL concepts from basic to intermediate level that you used in your project (joins, indexes, transactions).
๐น Retrieval / RAG (Core Enterprise GenAI Topic)
Explain the retrieval process in your system end-to-end.
How does your retrieval pipeline decide what data to fetch before sending it to the LLM?
How would you explain the retrieval process to a non-technical stakeholder?
✅ (Yes bhai — ye question REAL hai, business communication test karta hai)
๐น Use Case Discussion (Lilly / Fidelity Style)
Explain the business problem you solved in the Lilly/Fidelity project. Why was AI required?
What was your technical approach to solving this business problem?
What alternatives did you consider, and why did you choose this approach?
๐น Validation & Quality Checks (Very Important)
How did you validate that the questions or user queries were correct before processing them?
How did you ensure data quality and correctness in the retrieval results?
๐น Agents / Agentic Flow (Advanced but Valid)
What agents were involved in your system, and what responsibility did each agent have?
How do agents communicate and coordinate in your architecture?
๐น System Explanation Skills (Enterprise MUST)
Explain the entire system flow to a non-technical audience (product manager / business stakeholder).
Introduction & Project Context
Can you briefly introduce yourself and walk us through the AI/ML project you are currently working on?
What business problem does your current Generative AI project solve, and what is your role in it?
How have you used LangChain and LangGraph in a real production workflow?
๐น Traditional Machine Learning Experience
Have you worked on traditional machine learning models in industry-grade projects? If not, how have you applied them in POCs or controlled environments?
How comfortable are you transitioning between traditional ML approaches and GenAI-based solutions?
๐น Linear Regression & Statistics
What are the core assumptions of linear regression, and why are they important?
If residuals show a fan-shaped pattern, what assumption is being violated and how would you statistically verify it?
Which statistical tests are used to detect heteroscedasticity in regression models?
๐น Logistic Regression & Metrics
What evaluation metrics are commonly used for logistic regression, and why?
In what scenarios would you prioritize precision over recall, or recall over precision?
Does precision remain constant across different probability thresholds? Why or why not?
๐น GenAI Project Deep Dive
Can you walk us through your GenAI project end-to-end, including data ingestion, retrieval, and response generation?
What specific components did you personally design or implement in this project?
How did you collaborate with other teams or roles while owning multiple parts of the system?
๐น Frameworks & Tools
Which AI frameworks and tools did you use in your project, and what were the reasons behind those choices?
Why did you choose LangGraph over a purely chain-based or prompt-based approach?
๐น RAG-Based Chatbot Design
How would you design a chatbot using Retrieval-Augmented Generation (RAG) from scratch?
What factors influence your chunking, embedding, and retrieval strategy in a RAG system?
Which document parsing libraries have you used, and how do you decide between them?
How do you design prompt templates to ensure consistent and structured outputs?
How do you enforce structured outputs such as JSON in LLM responses?
๐น Validation & Quality Control
How do you validate user queries before sending them through the retrieval and generation pipeline?
How do you verify the correctness and relevance of retrieved documents?
๐น Agents & Agentic Workflows
What agents were involved in your system, and what responsibilities did each agent have?
How do agents coordinate and share state in an agentic architecture?
๐น Retrieval Process (Core Focus)
Can you explain the retrieval process step by step in your system?
How would you explain the retrieval process to a non-technical stakeholder or business user?
What happens if the retrieval layer fails or returns irrelevant data?
๐น Backend & API Design
Why did you choose FastAPI for building backend services?
How do you design REST APIs for scalability and maintainability?
How have you used SQLAlchemy in your project, and what advantages does it provide?
Which SQL concepts (joins, indexes, transactions) did you actively use in your implementation?
๐น System Design & Architecture
How would you design a full-fledged conversational AI system end-to-end?
How do you manage state, caching, and session memory in AI-driven applications?
How do you handle concurrent users accessing the system simultaneously?
๐น Evaluation & Governance
How do you evaluate the accuracy and performance of AI/ML models in production?
What mechanisms do you use to define guardrails and prevent unsafe AI responses?
How do you approach explainable AI and responsible AI in real-world applications?
๐น Behavioral & Motivation
Why are you interested in working with our organization?
Are you open to working on optimization or non-GenAI problem statements if required?
What kind of AI/ML roles and problems excite you the most?
Project & Ownership
Can you explain your project end-to-end, including the business problem it was solving and your role in the solution?
Which parts of the project did you personally own, and where did you collaborate with other teams?
๐น Agents & Agentic Architecture (Very Important)
What are AI agents, and why did you choose an agent-based architecture for your workflow?
How many agents were involved in your system, and what responsibility did each agent have?
How did these agents interact and coordinate with each other during execution?
What was the role of the supervisor agent, and how did it orchestrate or control the other agents?
At a code level, how were these agents implemented (classes, functions, services)?
๐น RAG & Architecture
Can you explain the RAG architecture you implemented in your project?
How did data flow from ingestion to retrieval to response generation in your system?
๐น Scalability, UI & Concurrency
How did your system handle multiple concurrent users accessing it at the same time?
What did the user interface look like, and how did users interact with the system?
๐น Challenges & Trade-offs
What were the major technical or architectural challenges you faced during the project?
How did you resolve those challenges, and what trade-offs did you make?
๐น Document Generation Decision (Excellent KPMG Question)
Why did you choose to generate Word documents programmatically instead of using the LLM’s document-generation capability directly?
What limitations or risks did you see in relying entirely on LLM-generated Word documents?
๐น Cloud, Model & Benchmarking Decisions
Why was the LLM hosted on Azure while other parts of the stack were deployed on AWS?
Why did you choose OpenAI’s LLMs for this project?
What benchmarking or evaluation did you perform to conclude that GPT-4o was the right model?
๐น Career & Skill Profile
Which areas of your career have you worked in—Generative AI, backend development, data analysis, or others?
How would you rate your proficiency in Python, and what kinds of problems have you solved using it?
Have you worked on the data side of Python, such as ETL pipelines, data frames, or data processing workflows?
๐น Conceptual AI Understanding (Business-Friendly)
How does an LLM summarize a piece of text? What is happening conceptually under the hood?
Why do you think the Generative AI revolution happened in the last 2–3 years and not a decade ago?
What technical breakthroughs or limitations were responsible for enabling this shift?
๐น Vectors & Embeddings
What are vectors and embeddings, and what do they represent in an AI system?
Why are embeddings critical for semantic search and RAG-based systems?
๐น Domain & Business Fit
Do you have experience or familiarity with the tax domain? Would you be open to working on tax-focused AI projects?
๐น Logic / Thinking Test (Classic Consulting Signal)
You have two doors—one leads to freedom and the other to death. Each door has a guard; one always tells the truth and the other always lies. You can ask one question to one guard. What would you ask to determine the correct door?
Project & Architecture (Deep Technical)
Can you explain your project from a purely technical perspective, focusing on architecture and design decisions?
Can you walk me through the RAG architecture you implemented end-to-end?
๐น Document Parsing & Ingestion (Cost-Aware Thinking)
How did you parse the documents in your system, and what challenges did you face with real-world document structures?
If a table spans across multiple pages in a document, how would you parse it accurately in a cost-effective way without overusing LLMs?
(Yes bhai — this is a classic product-company cost-awareness trap)
๐น Embeddings & Vector Databases
Which embedding model and vector database did you use, and why were they suitable for your use case?
What was the dimensionality of the embedding vectors you used, and how does vector length impact retrieval accuracy, latency, and storage cost?
๐น Determinism & Reliability (Very Important)
How do you ensure deterministic or near-deterministic responses from LLMs in systems with strict business rules?
What techniques can be used to reduce randomness and enforce consistency in LLM outputs?
๐น RAG Failure Handling & Evaluation (Senior-Level)
What kinds of failures can occur in a RAG pipeline, and how do you detect them?
How do you evaluate the performance of a RAG system, and where does the evaluation component sit in the overall architecture?
What strategies would you apply to continuously improve RAG accuracy and robustness?
๐น Similarity Search & Distance Metrics
What types of similarity search can be used in a RAG pipeline, and how do you decide which one to use?
Why is cosine similarity commonly used for embeddings, and what would change if you used Euclidean or Manhattan distance instead?
๐น Traditional ML Fundamentals
Can you explain a traditional machine learning project you’ve worked on from problem formulation to evaluation?
What are the key evaluation metrics for classification problems, and when would you prioritize one over another?
If you observe a drop in model accuracy in production, how would you investigate and respond?
How do you retrain a model, and how do you split data for training, validation, and testing when tuning hyperparameters?
๐น Coding (Hands-On Signal)
Write a Python function to generate all possible subsets of a given list of strings.
GSK – VERIFIED & REPHRASED QUESTION SET
(Clean, professional, global-enterprise wording)
๐น Ingestion & Storage
Outlook PDF Detection
How does your solution detect and extract PDF attachments from Outlook emails?
Azure Blob Storage
What role does Azure Blob Storage play in the ingestion pipeline, and why is it preferred over alternatives?
Microsoft Graph API
How do you use Microsoft Graph API to programmatically access and monitor Outlook mailboxes?
Azure Functions / App Service
Why did you choose Azure Functions or Azure App Service, and what responsibilities do they handle in the workflow?
๐น Document Understanding
OCR & LLM-Powered OCR
Can you explain how OCR works conceptually, and name some OCR solutions enhanced by LLMs?
Post-Extraction Processing
After extracting content from PDFs, what downstream processing steps does your system perform before retrieval or analytics?
๐น Retrieval-Augmented Generation (RAG)
Multi-Modal RAG
How would you design a multi-modal RAG system that handles text, tables, and images?
๐น Azure AI Services
Azure AI Search
What is Azure AI Search, and where does it fit in your overall architecture?
Azure AI Studio
What capabilities does Azure AI Studio provide, and how would you use them in an enterprise GenAI project?
๐น Vector Data & Persistence
Vector Database Choice
Which vector database did you choose, and what criteria influenced that decision?
Metadata Management
How do you store and leverage metadata alongside vectors to improve retrieval accuracy and governance?
Azure Cosmos DB
Why is Azure Cosmos DB part of the architecture, and what type of data is best suited for it?
๐น OCR Models & Trade-offs (High-Depth)
Donut vs TrOCR
In which scenarios would an OCR-free model like Donut outperform an OCR-first pipeline such as TrOCR? What trade-offs should be considered (accuracy, latency, language coverage, model size)?
Decoder Choice in TrOCR
Why does TrOCR initialize its text decoder from RoBERTa instead of GPT-2, and how does that affect training stability and fine-tuning?
Handwriting Recognition
Which architectural components make TrOCR strong at handwriting recognition, and how would you benchmark it against Tesseract + Seq2Seq?
๐น Layout-Aware Models
Layout Awareness
How does LayoutLMv3 encode spatial relationships between tokens, and why can’t Donut or vanilla BERT achieve this without bounding-box embeddings?
External OCR Dependency
LayoutLM relies on externally generated text and bounding boxes. Describe two scenarios where this dependency is a limitation and two where it is beneficial.
๐น Key-Value Extraction
Donut supports end-to-end key-value extraction. What training objective enables this, and how would you adapt it for invoice line-item extraction?
๐น Fine-Tuning Strategy
Given 10K manually labeled receipts, which model (Donut, TrOCR, or LayoutLMv3) would you fine-tune and why? Explain the preprocessing pipeline, loss functions, and expected challenges.
๐น Compute & Cost
Compare the GPU/CPU requirements of Donut and LayoutLMv3 for large-scale inference (e.g., 100K documents/hour). How would you optimize each?
๐น Hybrid Pipelines
Design a hybrid pipeline combining Donut for coarse key-value extraction and LayoutLMv3 for fine-grained validation. How does data flow between the models?
๐น Transfer Learning
How would you perform cross-language transfer for Donut on low-resource scripts such as Devanagari? Discuss synthetic data generation, tokenization, and evaluation.
๐น Model Evaluation
Beyond character-level accuracy, what composite metrics would you use to evaluate the real-world effectiveness of document understanding models?
BCN Associate – Org COE L1
๐น Motivation & Career Decisions (High Pressure Area)
Why Bain? What specifically attracts you to Bain Capability Network?
Why are you leaving PwC, and what gaps are you hoping to address at Bain?
What aspects of your role at PwC did you enjoy, and what aspects did you find limiting?
(Expect multiple uncomfortable follow-ups — yes, this is intentional)
๐น Guesstimates (Classic Consulting Signal)
Estimate the monthly revenue of a salon in Bangalore. Walk us through your assumptions.
๐น Case Interview
A US-based manufacturing firm wants to shift its operations to India. What factors should it consider before making this decision?
How would your recommendation change based on cost, talent availability, regulatory environment, and supply chain risks?
๐น Experience & Domain Understanding
Can you explain your most recent project in detail?
Walk us through your career trajectory at PwC and how it prepared you for this role.
Do you have an understanding of the organization or operating model domain? How have you applied it in your work?
BCN Associate – Org COE L2
๐น Project Depth (More Ownership Expected)
Can you explain the project you worked on, focusing on problem definition, approach, and outcomes?
๐น GenAI / Analytics Maturity
How did you identify and handle hallucinations in your AI or analytics solution?
Which models, tools, or frameworks did you use, and why were they appropriate for the problem?
๐น Responsible AI & Governance (Very Bain-Specific)
How did you implement human-in-the-loop mechanisms and ensure accountability in your solution?
How did you incorporate responsible AI principles, especially in scientifically sensitive or high-impact contexts?
๐น Advanced Guesstimates
Estimate the annual revenue of Netflix. Explain your approach and assumptions.
๐น Bain Motivation (Deep Grilling)
Why Bain? Why not continue at PwC?
How did you first learn about Bain, and what aspects of the firm resonate with you?
What differentiates Bain from other consulting firms in your view?
๐น Self-Awareness & Feedback (Critical Bain Signal)
If we spoke to your manager today, what three strengths would they highlight about you?
What three areas would they suggest you improve on, and why?
๐น Team & Collaboration
What was the structure of your project team, and where did you fit within that structure?
How did you interact with stakeholders across different levels of the organization?
๐น Project & Architecture
Can you explain the overall architecture of the project you are currently working on?
๐น RAG + LangGraph System Design (Core Question)
Design a RAG-based chatbot using LangGraph where documents are hosted in an external data service and are continuously added, updated, or deleted. How would you handle ingestion, retrieval, and hallucination control?
How would your LangGraph workflow handle query analysis, query rewriting, document relevance checks, and guardrails?
๐น Framework Choices
Have you worked with CrewAI? Why did you choose LangGraph for your project instead of other agent frameworks?
What alternative frameworks exist for building agentic systems, and how would you approach a project if a client asked you to use a framework you haven’t worked with before?
๐น LLM Selection
Which LLM did you use in your project, and what factors influenced that choice?
๐น Traditional Machine Learning (Deep Dive)
Can you explain one of your ML projects in detail, focusing on the model you used and why?
You mentioned using Random Forest. Why was it suitable for your problem, and how did you tune it?
How do you handle class imbalance in classification problems?
Can you explain precision and recall, and how would you communicate these concepts to a non-technical business stakeholder?
๐น Deep Learning Fundamentals
Which optimizers have you worked with in deep learning, and how do you decide which one to use?
Can you explain how the Adam optimizer works conceptually?
(Yes bhai — very common even at L1)
๐น Python Fundamentals & Performance
Are you familiar with dynamic programming? Where would you apply it in real problems?
When would you use tuples instead of lists in Python?
Explain the difference between the in operator and the == operator. How do their performance characteristics differ?
๐น Python Coding (Hands-On)
Given a dictionary where values are tuples, perform operations such as filtering or aggregation on the data. Please share your screen and code the solution.
Design a simple banking system in Python that supports user onboarding, deposits, withdrawals, lending, and balance display. Focus on clean design rather than UI.
(Yes — this is a classic LLD + Python test)
๐น NLP vs GenAI Decision-Making
Have you worked with NLP tasks such as NER or text classification?
Can you describe a scenario where you would recommend a traditional NLP or deep learning solution instead of a generative AI approach? Why?
KPMG GCC – L1
VERIFIED & REPHRASED INTERVIEW QUESTIONS
๐น Cloud Stack & Azure Expertise (Entry Gate)
Which cloud stack are you currently working on, and what Azure services have you used in GenAI projects?
How familiar are you with Azure services such as Azure AI Search, Document Intelligence, Azure AI Studio, and Azure OpenAI?
๐น RAG Chatbot – Full System Design
You are asked to design a chatbot that answers questions from hundreds of proprietary documents. Walk me through the complete technical design end-to-end.
How would you approach document analysis and ingestion, including parsing, chunking, indexing, and metadata design?
How would you design the retrieval pipeline—vector database choice, indexing strategy, filtering, and search approach?
How would you orchestrate the overall workflow using LangChain or LangGraph, including query analysis and guardrails?
๐น Memory Management & Conversational Context
How would you manage conversational memory in your chatbot to support follow-up questions within a session?
What storage options would you consider for conversation memory, and what trade-offs exist between Redis, Postgres, or other databases?
(Yes bhai — interviewer intentionally challenged your Redis choice)
๐น Performance & Latency Troubleshooting
If users report performance issues with the chatbot, how would you diagnose the problem?
Assuming latency is the issue, what techniques would you use to reduce response time at scale?
๐น Python & SQL Fundamentals
How would you rate your proficiency in Python and SQL, and why did you choose that rating?
Can you explain different types of SQL joins and when to use them?
How would you identify duplicate records in SQL?
Can you explain a self-join and a real-world use case for it?
๐น BI & Learning Ability
You’ve mentioned Power BI on your resume. How did you decide to use it, and how did you learn it effectively?
How do you generally approach learning a new tool or technology in a project environment?
๐น ML Project Architecture
You mentioned an ML project—can you explain its architecture from data sources to model deployment?
How did you orchestrate data cleaning, transformation, and feature engineering?
Where did you store intermediate datasets, and why?
Which models did you use, and how did you evaluate them?
๐น APIs & Ownership
Which modules of the project did you personally develop?
Did you work on backend APIs, and what responsibilities did those APIs handle?
๐น Vector Databases
Which vector databases have you worked with?
Have you used ChromaDB or Pinecone, and in what scenarios would you prefer one over the other?
๐น Cross-Cloud Design Decisions (Very KPMG-Specific)
You mentioned using Azure OpenAI while the rest of the stack was on AWS. Why was this decision made?
Why didn’t you use LLMs hosted directly on AWS?
How did you integrate Azure services into an AWS-based application, and what challenges did you face?
๐น Candidate-Led Discussion (Strong Move)
Do you have any questions for us?
(Using this to discuss interviewer’s project and brainstorm solutions = very strong signal, by the way)
Comments
Post a Comment