Thiết kế AI-Native App

Mở đầu

Sao 1 số AI product khiến ngạc nhiên, còn 1 số chỉ là "vỏ ChatGPT"? Khác biệt không ở model dùng mạnh cỡ nào, mà product có design từ underlying xoay quanh đặc tính AI hay không. AI-native app không phải "thêm chat box" lên app traditional, mà rethink toàn bộ user interaction, system architecture, product logic.

Bạn sẽ học:

Paradigm: hiểu khác biệt bản chất AI-native vs traditional
Design principles: core principle cho AI-native product
Prompt engineering: design prompt chất lượng cao drive AI capability
Interaction: new pattern UX của AI era
Architecture: full lifecycle request của AI app

Chương	Nội dung
1	Architecture comparison
2	Design principles
3	Prompt engineering
4	Interaction patterns
5	Request flow

0. Toàn cảnh: từ "thêm AI" → "AI-native"

Mấy năm trước, path AI hoá nhiều product: có app sẵn, thêm 1 nút "AI assistant" ở góc nào đó. Như lắp engine lên xe ngựa — chạy được, nhưng không bằng design 1 chiếc xe hơi từ đầu.

AI-native app = product thinking mới: từ dòng code đầu, AI là capability core, không phải feature add sau.

Traditional vs AI-native

Traditional: user action → deterministic logic → deterministic result. Mỗi lần click "submit order", flow y nhau.
AI-native: user intent → AI hiểu → probabilistic result. Cùng question, mỗi lần answer hơi khác.
Core shift: từ "viết rule" → "mô tả intent", từ "deterministic" → "probabilistic", từ "operation UI" → "dialogue UI".

1. Architecture comparison

Traditional architecture = "request-response": user click, backend execute deterministic logic, return deterministic result. Cả quá trình predictable, testable, reproducible.

AI-native introduce role mới — LLM. Như "smart middleware", nhận natural language input, output natural language. Mang thay đổi architecture căn bản.

传统应用架构

🖥️
前端 UI
用户界面与交互

⚙️

业务逻辑层

硬编码的规则引擎

🗄️

数据存储

结构化数据管理

🔌

API 接口

固定的请求/响应

🖥️ 前端 UI

基于确定性的表单、按钮、页面路由。用户操作触发固定的业务流程，所有交互路径在开发时已经确定。

典型技术

ReactVueHTML/CSS

💡 核心区别：传统应用的逻辑由开发者用 if/else 硬编码，行为完全确定。

Dim	Traditional	AI-native
Input	Form, button, dropdown	Natural language, image, voice
Logic	if-else, rule engine	LLM reasoning, prompt-driven
Output	Deterministic, reproducible	Probabilistic, mỗi lần khác
Latency	ms	s (cần streaming)
Error handling	Error code rõ	Hallucination, refusal, off-topic
Cost	Compute cố định	Pay per token, fluctuate

3 stage architecture evolution

AI-augmented: nhúng AI vào app sẵn (autocomplete, smart recommend)
AI-collaborative: AI là interaction core, nhưng vẫn traditional UI làm fallback (Notion AI, GitHub Copilot)
AI-native: cả product xoay quanh AI, bỏ AI = product không tồn tại (ChatGPT, Cursor, Midjourney)

2. Design principles: "hiến pháp" AI-native

Không copy thinking của traditional software. Probabilistic + latency + unpredictable của AI yêu cầu principle mới.

🛡️

优雅降级

AI 失败时，系统仍然可用

🤝

人机协作

关键决策由人类确认

🔍

透明可解释

让用户理解 AI 的推理过程

🔄

反馈闭环

用户反馈驱动持续改进

🛡️ 优雅降级

AI 模型可能超时、返回错误、产生幻觉。优雅降级意味着：当 AI 不可用时，系统应该有兜底方案，而不是直接崩溃。这是 AI 原生应用与玩具项目的分水岭。

实践对比

❌ 反面示例

模型 API 超时后，页面显示空白错误页，用户只能刷新重试。

✅ 正确做法

模型超时后，显示缓存的上一次回答或推荐相关文档，同时后台自动重试。

检查清单

☐设置合理的 API 超时时间（通常 30-60s）

☐准备降级方案：缓存、规则引擎、人工转接

☐向用户透明地展示当前状态

☐记录失败日志用于后续优化

5 core principles

Embrace uncertainty: AI output không 100% reliable, design phải cân nhắc "AI có thể sai". Provide edit, retry, feedback. User luôn control.
Progressive trust: đừng cho AI decision high-risk từ đầu. Build trust ở low-risk trước, expand quyền tự chủ AI.
Transparent + explainable: cho user biết AI làm gì, sao làm vậy. Show reasoning, citation, confidence.
Human-AI collaboration: AI không thay người, mà augment người. Best design: AI làm draft, người final review.
Graceful degradation: AI down hoặc kết quả tệ, product vẫn dùng được. Luôn có Plan B.

3. Prompt engineering: "programming language" của AI app

Traditional: viết code bảo máy làm gì. AI-native: viết Prompt bảo model làm gì. Prompt = programming language của AI era — viết tốt AI ấn tượng; viết tệ AI bịa.

System Prompt（系统指令）

User Prompt（用户输入）

模拟输出

点击"模拟生成"查看效果

💡 Prompt 技巧：没有 System Prompt，没有上下文，问题过于模糊 —— AI 只能猜测你的意图。

4 layer Prompt

System Prompt: define role, capability boundary, behavior. Instruction cấp "hiến pháp", user không thấy nhưng luôn effective.
Context injection: doc retrieved qua RAG, user history → background AI cần.
User Message: question/instruction thực của user.
Format constraint: chỉ định output format (JSON, Markdown, template), đảm bảo parse được.

Technique	Note	Effect
Role setting	"Bạn là senior FE engineer"	Tăng quality domain answer
Few-shot	Cho 2-3 input-output example	Model hiểu format + style
Chain of Thought	"Suy nghĩ từng bước"	Tăng accuracy reasoning phức tạp
Output constraint	"Trả lời JSON"	Output parse được
Negative instruction	"Đừng bịa info không chắc"	Giảm hallucination

4. Interaction: UX của AI era

AI-native sinh nhiều pattern mới. Traditional UX = "click-wait-view", AI app = "dialogue-observe-adjust".

💬

流式输出

逐字生成，即时反馈

⏳

智能加载态

分阶段展示处理进度

📊

置信度指示

展示 AI 的确定程度

🛡️

优雅降级

不确定时的兜底策略

4 core interaction pattern

Streaming: AI gen content hiện từng chữ, không chờ gen hết. Giảm perceived waiting time, user judge direction sớm.
Multi-turn: dialogue liên tục qua context memory, user refine progressively. Challenge: context window management + history compression.
Multimodal: support text, image, voice, file. AI cũng output image, code, table.
Agentic: AI không chỉ answer, mà tự plan + execute multi-step task. User cho goal, AI tự breakdown + complete.

5. Request flow: 1 AI call lifecycle

User gửi message trong AI app, background xảy ra gì? Hiểu full flow = foundation build reliable AI app.

👤

用户输入

User Input

→

🔧

预处理

Preprocessing

→

🧠

模型推理

Model Inference

→

🛡️

后处理

Post-processing

→

💬

响应输出

Response

💡 关键洞察： AI 应用的请求链路比传统应用更长，模型推理通常占总耗时的 60-80%。优化重点在于：Prompt 缓存、流式输出、异步处理。

6 stage processing

Input preprocess: validate user input, content safety, sensitive info redact
Context assembly: ghép system prompt + retrieve relevant doc (RAG) + load history
Model call: send assembled prompt tới LLM API, open streaming
Output postprocess: format, content safety filter, extract structured data
Cache: cache common question result, giảm cost + latency
Monitor: log token usage, response time, user feedback → continuous optimize

Stage	Key	Common issue
Input preprocess	Injection protection, length limit	Prompt injection, jailbreak
Context assembly	Token budget, info priority	Context overflow, key info truncated
Model call	Timeout, retry, streaming	API rate limit, network timeout
Output postprocess	Format check, hallucination detect	Output format không match
Cache	Semantic vs exact cache	Hit rate thấp
Monitoring	Cost monitor, quality eval	Token cost out of control

Tổng kết

AI-native design không phải chỉ đắp AI lên traditional, mà refactor toàn diện về architecture, interaction, engineering.

Key:

Architecture shift: từ deterministic logic → probabilistic reasoning
Design principle: embrace uncertainty, progressive trust, transparent, human-AI collab, graceful degradation
Prompt là core: "programming language" của AI app, quyết product quality
Interaction revolution: streaming, multi-turn, multimodal, Agent — redefine UX
Full-chain thinking: từ input preprocess đến monitoring, mỗi mắt xích design riêng cho AI

2026 cho VN dev

Streaming là must: dùng Server-Sent Events (SSE) hoặc WebSocket cho UX tốt
Generative UI: Vercel AI SDK, Vercel v0 — AI gen UI component dynamic
Cost monitor: LangSmith, Helicone, Langfuse track cost + quality
Eval framework: Promptfoo, Braintrust auto eval prompt
Safety: Llama Guard, OpenAI Moderation API check input/output
VN case: build CS bot e-commerce → dùng RAG với product DB + structured output cho recommendations
Bài tập: clone Cursor mini — IDE-like editor + chat sidebar + streaming

Thiết kế AI-Native App ​

0. Toàn cảnh: từ "thêm AI" → "AI-native" ​

1. Architecture comparison ​

2. Design principles: "hiến pháp" AI-native ​

3. Prompt engineering: "programming language" của AI app ​

4. Interaction: UX của AI era ​

5. Request flow: 1 AI call lifecycle ​

Tổng kết ​

Tài liệu ​