AIP-C01 Study Hub

Reference

Tables, rules, architectures, and services — all in one place.

12 tables covering the key "when X vs Y" exam decisions

FM Integration

Vector Store Comparison

The exam's favorite question pattern. Match the exam trigger phrase to the correct vector store.

D1: FM Integration

Vector Store Comparison

The exam's favorite question pattern. Match the exam trigger phrase to the correct vector store.

Vector StoreBest ForHybrid Search?Managed?Key Exam TriggerExam
Bedrock KB (managed store)Fastest path, zero infraVia BedrockFully managed'simplest' or 'least operational overhead'
OpenSearch ServerlessScale + analytics + hybridYes (BM25 + k-NN)Serverless'hybrid search' or 'analytics on retrieval data'
Aurora PostgreSQL (pgvector)Existing RDS + SQL queriesManual (SQL + vector)Managed RDS'existing relational database' or 'SQL queries alongside vector'
Neptune (graph)Knowledge graphs + relationshipsNoManaged'entity relationships' or 'graph-based retrieval'
Kendra GenAI IndexEnterprise retrieval with connectors + ACLYes (native)Fully managed'enterprise search' or 'high-accuracy retrieval' or 'respect source permissions'
DocumentDBMongoDB-compatible, HNSW/IVFFlat, up to 2000-dimNoManaged'existing MongoDB' or 'document-oriented database'
DynamoDB (co-index)Metadata layer alongside vector storesN/A (metadata only)Fully managed'metadata filtering' alongside semantic search
FM Integration

Chunking Decision Tree

Select the right chunking strategy based on content type.

D1: FM Integration

Chunking Decision Tree

Select the right chunking strategy based on content type.

Content TypeBest ChunkingWhy
Short FAQ answersSentence-level or small fixed-sizeEach answer is self-contained
Technical manualsHierarchical (section -> subsection)Preserves document structure
Legal contractsSemantic (by clause/paragraph)Clauses must stay intact
Chat transcriptsFixed-size with overlapEven content, overlap preserves context
Code documentationHierarchical (by class -> method)Code structure matters
FM Integration

Embedding Model Comparison

Choose the right embedding model based on language and modality needs.

D1: FM Integration

Embedding Model Comparison

Choose the right embedding model based on language and modality needs.

ModelDimensionsMultilingual?Best For
Amazon Titan Text Embeddings v2256/512/1024LimitedCost-effective, AWS-native
Cohere Embed v31024Yes (100+ languages)Multilingual, high accuracy
Amazon Nova Multimodal EmbeddingsVariableYesText + image + video + audio crossmodal search
Implementation

Bedrock Agents vs AgentCore

Critical distinction: managed agent service vs deployment platform for any agent framework.

D2: Implementation

Bedrock Agents vs AgentCore

Critical distinction: managed agent service vs deployment platform for any agent framework.

AspectExamBedrock AgentsBedrock AgentCore
WhatManaged agent serviceAgent deployment platform
FrameworkAWS-native onlyAny (Strands, CrewAI, LangGraph, custom)
ModelBedrock models onlyAny FM (Bedrock, OpenAI, self-hosted)
DeploymentFully managedYou deploy to AgentCore Runtime
Use whenSimple agent, fast setup, AWS-nativeCustom framework, multi-model, complex orchestration
Implementation

AgentCore 9 Services

Know all 9 AgentCore services and the exam trigger phrases that point to each.

D2: Implementation

AgentCore 9 Services

Know all 9 AgentCore services and the exam trigger phrases that point to each.

ServiceFunctionExam TriggerExam
RuntimeServerless deployment, session isolation, microVMs'deploy agent' + 'scale' + 'isolate sessions'
GatewayTransform APIs/Lambda/MCP into agent-ready tools'connect agent to existing APIs' + 'tool integration'
PolicyCedar-based action boundaries, natural language authoring'control what agents can do' + 'governance'
IdentityAgent auth for AWS + third-party services'agent needs to access external APIs securely'
MemorySession + long-term + episodic memory'maintain context across sessions' + 'learn from past'
ObservabilityCloudWatch dashboards, OpenTelemetry, quality metrics'monitor agent behavior' + 'debug agent decisions'
EvaluationsCorrectness, helpfulness, safety scoring'measure agent quality' + 'evaluate before production'
Code InterpreterSecure sandbox for code execution'agent needs to run code' + 'generate visualizations'
BrowserWeb-based workflow execution'agent needs to interact with web pages'
Implementation

MCP Server Implementation Patterns

The exam explicitly tests MCP server implementation. Know the stateless vs stateful distinction.

D2: Implementation

MCP Server Implementation Patterns

The exam explicitly tests MCP server implementation. Know the stateless vs stateful distinction.

MCP Server TypeImplementationWhen to UseExam TriggerExam
Stateless (lightweight)Lambda functionRead-only queries, simple tools, no persistent state'lightweight tool access'
Stateful (complex)ECS containerDB connection pooling, transactions, persistent state'complex tools' + 'persistent connections'
Implementation

Q Business vs Bedrock KB + Custom App

Know when to use the managed enterprise search solution vs building custom.

D2: Implementation

Q Business vs Bedrock KB + Custom App

Know when to use the managed enterprise search solution vs building custom.

FeatureQ BusinessBedrock KB + Custom App
SetupPlug-and-play, 40+ connectors (SharePoint, Confluence, Salesforce, Google Drive, Jira)Custom build required
ACLBuilt-in -- respects source permissionsMust implement manually
UIProvided out of the boxBuild with Amplify or custom
CustomizationLimitedFull control
IdentityIAM Identity Center for SSOCognito or custom auth
Use whenEnterprise internal search, quick deployment, existing permissions matterCustomer-facing, custom UX, complex retrieval logic
Implementation

Model Cascading Pattern

Route queries to different models based on complexity. Frequently tested.

D2: Implementation

Model Cascading Pattern

Route queries to different models based on complexity. Frequently tested.

Query ComplexityRoute ToCharacteristics
Simple (70% of traffic)Nova MicroCheapest, fastest
Medium (20% of traffic)Nova ProBalanced cost/quality
Complex (10% of traffic)Claude SonnetHighest quality
FM Integration

Service-to-Data-Type Mapping

Memorize which AWS service handles which data type in GenAI pipelines.

D1: FM Integration

Service-to-Data-Type Mapping

Memorize which AWS service handles which data type in GenAI pipelines.

Data TypeAWS ServiceUse
TextComprehendEntity extraction, sentiment, intent
AudioTranscribeSpeech-to-text
Documents (PDF, images)TextractOCR, text extraction, table extraction
Images/VideoRekognition / Bedrock multimodalObject detection / FM analysis
Tabular dataGlueETL, data quality validation
Mixed/multimodalBedrock Data AutomationAutomated processing pipeline
Safety & Security

Guardrails Filter Types

Know all 8 filter types, their configuration options, and what action they take.

D3: Safety & Security

Guardrails Filter Types

Know all 8 filter types, their configuration options, and what action they take.

FilterConfigAction
Content filters6 categories x 4 strengths (NONE/LOW/MED/HIGH)Block harmful input/output
Prompt AttackSeparate from content filtersDetect jailbreaks, prompt injection
Denied topicsCustom topic + example phrasesBlock specific subjects
Word filtersCustom word list + profanity toggleExact-match blocking
PII (sensitive info)Per-entity type: BLOCK or ANONYMIZEMask or reject PII
Custom regexPattern-based detectionOrg-specific identifiers
Contextual groundingGrounding threshold + relevance thresholdDetect hallucinations
Automated ReasoningFormal logic verification rulesMathematically verify facts
Optimization

Caching Comparison

Critical exam distinction -- know all three caching layers and when to use each.

D4: Optimization

Caching Comparison

Critical exam distinction -- know all three caching layers and when to use each.

MethodHow It WorksWhen to UseCost Impact
Bedrock Prompt CachingCaches system prompt at Bedrock API level. 5-min TTL (1.25x write, 0.1x read) or 1-hour TTL (2.0x write, 0.1x read). Min 1024 tokens.Same long system prompt across many requestsUp to 90% reduction on cached tokens
Semantic Caching (ElastiCache/DynamoDB)App-level: embed incoming query, compare against cached query embeddings via cosine similarityRepeated similar user queriesEliminates FM call entirely (86% cost reduction with ElastiCache)
Exact-match Caching (ElastiCache)App-level: hash identical queries, return cached responseIdentical queriesEliminates FM call entirely
Intelligent Prompt RoutingAuto-analyzes each prompt and routes to most appropriate FM based on complexityMixed-complexity workloadsAutomated model cascading without manual routing logic
Testing

Evaluation Approaches

Match what you are evaluating to the correct method and AWS service.

D5: Testing

Evaluation Approaches

Match what you are evaluating to the correct method and AWS service.

What You're EvaluatingMethodService
FM response qualityLLM-as-a-judge + human evalBedrock Evaluations
RAG retrieval relevanceContextual grounding scoreBedrock Guardrails
RAG answer faithfulnessGrounding check against sourceBedrock Guardrails
Agent correctnessAutomated eval on test casesAgentCore Evaluations
Agent safetyHarmfulness scoringAgentCore Evaluations
Production latencyEnd-to-end traceX-Ray + CloudWatch
Prompt regressionCompare before/after on test setLambda + CloudWatch