Reference
Tables, rules, architectures, and services — all in one place.
12 tables covering the key "when X vs Y" exam decisions
Vector Store Comparison
The exam's favorite question pattern. Match the exam trigger phrase to the correct vector store.
Vector Store Comparison
The exam's favorite question pattern. Match the exam trigger phrase to the correct vector store.
| Vector Store | Best For | Hybrid Search? | Managed? | Key Exam TriggerExam |
|---|---|---|---|---|
| Bedrock KB (managed store) | Fastest path, zero infra | Via Bedrock | Fully managed | 'simplest' or 'least operational overhead' |
| OpenSearch Serverless | Scale + analytics + hybrid | Yes (BM25 + k-NN) | Serverless | 'hybrid search' or 'analytics on retrieval data' |
| Aurora PostgreSQL (pgvector) | Existing RDS + SQL queries | Manual (SQL + vector) | Managed RDS | 'existing relational database' or 'SQL queries alongside vector' |
| Neptune (graph) | Knowledge graphs + relationships | No | Managed | 'entity relationships' or 'graph-based retrieval' |
| Kendra GenAI Index | Enterprise retrieval with connectors + ACL | Yes (native) | Fully managed | 'enterprise search' or 'high-accuracy retrieval' or 'respect source permissions' |
| DocumentDB | MongoDB-compatible, HNSW/IVFFlat, up to 2000-dim | No | Managed | 'existing MongoDB' or 'document-oriented database' |
| DynamoDB (co-index) | Metadata layer alongside vector stores | N/A (metadata only) | Fully managed | 'metadata filtering' alongside semantic search |
Chunking Decision Tree
Select the right chunking strategy based on content type.
Chunking Decision Tree
Select the right chunking strategy based on content type.
| Content Type | Best Chunking | Why |
|---|---|---|
| Short FAQ answers | Sentence-level or small fixed-size | Each answer is self-contained |
| Technical manuals | Hierarchical (section -> subsection) | Preserves document structure |
| Legal contracts | Semantic (by clause/paragraph) | Clauses must stay intact |
| Chat transcripts | Fixed-size with overlap | Even content, overlap preserves context |
| Code documentation | Hierarchical (by class -> method) | Code structure matters |
Embedding Model Comparison
Choose the right embedding model based on language and modality needs.
Embedding Model Comparison
Choose the right embedding model based on language and modality needs.
| Model | Dimensions | Multilingual? | Best For |
|---|---|---|---|
| Amazon Titan Text Embeddings v2 | 256/512/1024 | Limited | Cost-effective, AWS-native |
| Cohere Embed v3 | 1024 | Yes (100+ languages) | Multilingual, high accuracy |
| Amazon Nova Multimodal Embeddings | Variable | Yes | Text + image + video + audio crossmodal search |
Bedrock Agents vs AgentCore
Critical distinction: managed agent service vs deployment platform for any agent framework.
Bedrock Agents vs AgentCore
Critical distinction: managed agent service vs deployment platform for any agent framework.
| AspectExam | Bedrock Agents | Bedrock AgentCore |
|---|---|---|
| What | Managed agent service | Agent deployment platform |
| Framework | AWS-native only | Any (Strands, CrewAI, LangGraph, custom) |
| Model | Bedrock models only | Any FM (Bedrock, OpenAI, self-hosted) |
| Deployment | Fully managed | You deploy to AgentCore Runtime |
| Use when | Simple agent, fast setup, AWS-native | Custom framework, multi-model, complex orchestration |
AgentCore 9 Services
Know all 9 AgentCore services and the exam trigger phrases that point to each.
AgentCore 9 Services
Know all 9 AgentCore services and the exam trigger phrases that point to each.
| Service | Function | Exam TriggerExam |
|---|---|---|
| Runtime | Serverless deployment, session isolation, microVMs | 'deploy agent' + 'scale' + 'isolate sessions' |
| Gateway | Transform APIs/Lambda/MCP into agent-ready tools | 'connect agent to existing APIs' + 'tool integration' |
| Policy | Cedar-based action boundaries, natural language authoring | 'control what agents can do' + 'governance' |
| Identity | Agent auth for AWS + third-party services | 'agent needs to access external APIs securely' |
| Memory | Session + long-term + episodic memory | 'maintain context across sessions' + 'learn from past' |
| Observability | CloudWatch dashboards, OpenTelemetry, quality metrics | 'monitor agent behavior' + 'debug agent decisions' |
| Evaluations | Correctness, helpfulness, safety scoring | 'measure agent quality' + 'evaluate before production' |
| Code Interpreter | Secure sandbox for code execution | 'agent needs to run code' + 'generate visualizations' |
| Browser | Web-based workflow execution | 'agent needs to interact with web pages' |
MCP Server Implementation Patterns
The exam explicitly tests MCP server implementation. Know the stateless vs stateful distinction.
MCP Server Implementation Patterns
The exam explicitly tests MCP server implementation. Know the stateless vs stateful distinction.
| MCP Server Type | Implementation | When to Use | Exam TriggerExam |
|---|---|---|---|
| Stateless (lightweight) | Lambda function | Read-only queries, simple tools, no persistent state | 'lightweight tool access' |
| Stateful (complex) | ECS container | DB connection pooling, transactions, persistent state | 'complex tools' + 'persistent connections' |
Q Business vs Bedrock KB + Custom App
Know when to use the managed enterprise search solution vs building custom.
Q Business vs Bedrock KB + Custom App
Know when to use the managed enterprise search solution vs building custom.
| Feature | Q Business | Bedrock KB + Custom App |
|---|---|---|
| Setup | Plug-and-play, 40+ connectors (SharePoint, Confluence, Salesforce, Google Drive, Jira) | Custom build required |
| ACL | Built-in -- respects source permissions | Must implement manually |
| UI | Provided out of the box | Build with Amplify or custom |
| Customization | Limited | Full control |
| Identity | IAM Identity Center for SSO | Cognito or custom auth |
| Use when | Enterprise internal search, quick deployment, existing permissions matter | Customer-facing, custom UX, complex retrieval logic |
Model Cascading Pattern
Route queries to different models based on complexity. Frequently tested.
Model Cascading Pattern
Route queries to different models based on complexity. Frequently tested.
| Query Complexity | Route To | Characteristics |
|---|---|---|
| Simple (70% of traffic) | Nova Micro | Cheapest, fastest |
| Medium (20% of traffic) | Nova Pro | Balanced cost/quality |
| Complex (10% of traffic) | Claude Sonnet | Highest quality |
Service-to-Data-Type Mapping
Memorize which AWS service handles which data type in GenAI pipelines.
Service-to-Data-Type Mapping
Memorize which AWS service handles which data type in GenAI pipelines.
| Data Type | AWS Service | Use |
|---|---|---|
| Text | Comprehend | Entity extraction, sentiment, intent |
| Audio | Transcribe | Speech-to-text |
| Documents (PDF, images) | Textract | OCR, text extraction, table extraction |
| Images/Video | Rekognition / Bedrock multimodal | Object detection / FM analysis |
| Tabular data | Glue | ETL, data quality validation |
| Mixed/multimodal | Bedrock Data Automation | Automated processing pipeline |
Guardrails Filter Types
Know all 8 filter types, their configuration options, and what action they take.
Guardrails Filter Types
Know all 8 filter types, their configuration options, and what action they take.
| Filter | Config | Action |
|---|---|---|
| Content filters | 6 categories x 4 strengths (NONE/LOW/MED/HIGH) | Block harmful input/output |
| Prompt Attack | Separate from content filters | Detect jailbreaks, prompt injection |
| Denied topics | Custom topic + example phrases | Block specific subjects |
| Word filters | Custom word list + profanity toggle | Exact-match blocking |
| PII (sensitive info) | Per-entity type: BLOCK or ANONYMIZE | Mask or reject PII |
| Custom regex | Pattern-based detection | Org-specific identifiers |
| Contextual grounding | Grounding threshold + relevance threshold | Detect hallucinations |
| Automated Reasoning | Formal logic verification rules | Mathematically verify facts |
Caching Comparison
Critical exam distinction -- know all three caching layers and when to use each.
Caching Comparison
Critical exam distinction -- know all three caching layers and when to use each.
| Method | How It Works | When to Use | Cost Impact |
|---|---|---|---|
| Bedrock Prompt Caching | Caches system prompt at Bedrock API level. 5-min TTL (1.25x write, 0.1x read) or 1-hour TTL (2.0x write, 0.1x read). Min 1024 tokens. | Same long system prompt across many requests | Up to 90% reduction on cached tokens |
| Semantic Caching (ElastiCache/DynamoDB) | App-level: embed incoming query, compare against cached query embeddings via cosine similarity | Repeated similar user queries | Eliminates FM call entirely (86% cost reduction with ElastiCache) |
| Exact-match Caching (ElastiCache) | App-level: hash identical queries, return cached response | Identical queries | Eliminates FM call entirely |
| Intelligent Prompt Routing | Auto-analyzes each prompt and routes to most appropriate FM based on complexity | Mixed-complexity workloads | Automated model cascading without manual routing logic |
Evaluation Approaches
Match what you are evaluating to the correct method and AWS service.
Evaluation Approaches
Match what you are evaluating to the correct method and AWS service.
| What You're Evaluating | Method | Service |
|---|---|---|
| FM response quality | LLM-as-a-judge + human eval | Bedrock Evaluations |
| RAG retrieval relevance | Contextual grounding score | Bedrock Guardrails |
| RAG answer faithfulness | Grounding check against source | Bedrock Guardrails |
| Agent correctness | Automated eval on test cases | AgentCore Evaluations |
| Agent safety | Harmfulness scoring | AgentCore Evaluations |
| Production latency | End-to-end trace | X-Ray + CloudWatch |
| Prompt regression | Compare before/after on test set | Lambda + CloudWatch |