
jailbreak_llms

Grade B · 22/40 · UI / Chat · Insight confidence: medium

[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

★ 3,622 · Jupyter Notebook · Created 2023-08-01

chatgpt · jailbreak · jailbreaking · large-language-model · llm · llm-security · prompt

Executive Insight

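jailbreak_llms falls under the "UI / Chat" category, with a composite score of 22/40 (grade B). Its strongest dimensions are memory system, human-AI collaboration, and LLM integration; its gaps are concentrated in tool use and knowledge retrieval (RAG).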

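Core Strengths

- Memory system scores 4/5 (Level 4), indicating the project is relatively mature in this capability.
- Human-AI collaboration scores 4/5 (Level 4), indicating the project is relatively mature in this capability.
- LLM integration scores 3/5 (Level 3), indicating the project is relatively mature in this capability.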

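Capability Gaps

- Tool use scores only 2/5; it is currently closer to a "usable baseline" and still needs engineering hardening.
- Knowledge retrieval (RAG) scores only 2/5; it is currently closer to a "usable baseline" and still needs engineering hardening.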

Use Cases

- AI products aimed at end users
- Multi-model chat front ends

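Deployment Risks and Recommendations

- This entry is a heuristic analysis; do a repository-level manual review before any core decision.
- Evaluation and validation are weak; add automated tests and a regression strategy before going live.
- Sort out the tool-calling protocol: standardize inputs and outputs first, then add dynamic routing.
- Prioritize adding a retrieval layer (chunking + vector recall + reranking) to improve factual accuracy.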

Intelligence Profile

Dimensions

LLM Integration

Level 3

Level 3: Context management + streaming

Heuristic from category + topics — verify manually

Agent Autonomy

Level 3

Level 3: ReAct loop (autonomous tool calls)

Heuristic from category + topics — verify manually

Memory System

Level 4

Level 4: Tiered memory (short-term / long-term)

Heuristic from category + topics — verify manually

Tool Use

Level 2

Level 2: Multiple tools + routing

Heuristic from category + topics — verify manually

Knowledge Retrieval (RAG)

Level 2

Level 2: Embedding + vector retrieval

Heuristic from category + topics — verify manually

Multimodal

Level 2

Level 2: Image input + text output

Heuristic from category + topics — verify manually

Evaluation and Validation

Level 2

Level 2: Rule-based checks

Heuristic from category + topics — verify manually

Human-AI Collaboration

Level 4

Level 4: Adaptive (knows when to ask a human)

Heuristic from category + topics — verify manually
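
The composite 22/40 reported at the top of this profile matches the eight dimension levels listed here if the composite is taken as their unweighted sum, with each dimension capped at 5. The page does not state how the score is computed, so that aggregation rule, the `composite` helper, and the English dimension labels are assumptions made for illustration. A minimal sketch:

```python
# Dimension levels as listed in the profile (labels are translations).
LEVELS = {
    "LLM integration": 3,
    "Agent autonomy": 3,
    "Memory system": 4,
    "Tool use": 2,
    "Knowledge retrieval (RAG)": 2,
    "Multimodal": 2,
    "Evaluation and validation": 2,
    "Human-AI collaboration": 4,
}

# Assumed per-dimension ceiling, inferred from the "4/5" notation on the page.
MAX_LEVEL = 5


def composite(levels, max_level=MAX_LEVEL):
    """Return (total, maximum), assuming an unweighted sum of levels."""
    return sum(levels.values()), max_level * len(levels)


total, maximum = composite(LEVELS)
print(f"{total}/{maximum}")  # 22/40
```

Under this reading, four dimensions sit at Level 2 (tool use, knowledge retrieval, multimodal, evaluation and validation), so the listed gaps are a subset of the lowest-scoring dimensions.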

Architecture

ui-chat ecosystem (GitHub)

