arXiv Paper Digest

SecureCAI: Injection-Resilient LLM Assistants for Cybersecurity Operations

Authors: Mohammed Himayath Ali, Mohammed Aqib Abdullah, Mohammed Mudassir Uddin, Shahnawaz Alam

First: 2026-01-12T18:59:45+00:00 · Latest: 2026-01-12T18:59:45+00:00

Abstract

Large Language Models have emerged as transformative tools for Security Operations Centers, enabling automated log analysis, phishing triage, and malware explanation; however, deployment in adversarial cybersecurity environments exposes critical vulnerabilities to prompt injection attacks where malicious instructions embedded in security artifacts manipulate model behavior. This paper introduces SecureCAI, a novel defense framework extending Constitutional AI principles with security-aware guardrails, adaptive constitution evolution, and Direct Preference Optimization for unlearning unsafe response patterns, addressing the unique challenges of high-stakes security contexts where traditional safety mechanisms prove insufficient against sophisticated adversarial manipulation. Experimental evaluation demonstrates that SecureCAI reduces attack success rates by 94.7% compared to baseline models while maintaining 95.1% accuracy on benign security analysis tasks, with the framework incorporating continuous red-teaming feedback loops enabling dynamic adaptation to emerging attack strategies and achieving constitution adherence scores exceeding 0.92 under sustained adversarial pressure, thereby establishing a foundation for trustworthy integration of language model capabilities into operational cybersecurity workflows and addressing a critical gap in current approaches to AI safety within adversarial domains.

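Chinese Title / Abstract (English translation)

Title: SecureCAI: Injection-Resilient LLM Assistants for Cybersecurity Operations

Large language models have become transformative tools for security operations centers, enabling automated log analysis, phishing triage, and malware explanation. When deployed in adversarial cybersecurity environments, however, they are exposed to prompt injection attacks, in which malicious instructions embedded in security data manipulate model behavior. This paper introduces SecureCAI, a novel defense framework that combines security-aware guardrails, adaptive constitution evolution, and Direct Preference Optimization to eliminate unsafe response patterns, addressing the inadequacy of traditional safety mechanisms against sophisticated adversarial manipulation in high-stakes security settings. Experimental evaluation shows that, compared to baseline models, SecureCAI reduces attack success rates by 94.7% while maintaining 95.1% accuracy on benign security analysis tasks. The framework also integrates continuous red-teaming feedback loops to adapt dynamically to emerging attack strategies, and achieves constitution adherence scores above 0.92 under sustained adversarial pressure. This lays a foundation for safely integrating language model capabilities into operational cybersecurity workflows and fills a key gap in current approaches to AI safety in adversarial domains.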

Summary

SecureCAI is a defense framework designed to protect large language models from prompt injection attacks in cybersecurity operations. It extends Constitutional AI principles with security-aware guardrails and adaptive constitution evolution, and uses Direct Preference Optimization to unlearn unsafe response patterns. Compared to baseline models, SecureCAI reduces attack success rates by 94.7% while maintaining 95.1% accuracy on benign security analysis tasks. It adapts to new attack strategies through continuous red-teaming feedback loops and achieves constitution adherence scores above 0.92 under sustained adversarial pressure.
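The abstract names Direct Preference Optimization (DPO) but does not state the training objective SecureCAI uses. As a rough illustration only, the sketch below computes the standard DPO loss for a single preference pair. The function name, the argument names, and the default `beta` are assumptions made for this example and are not taken from the paper.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) response pair.

    Each argument is a sequence-level log-probability of a response given
    the same prompt, under either the trainable policy or the frozen
    reference model. All names here are hypothetical, for illustration.
    """
    # Log-ratio of policy to reference for each response.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin; larger means the policy favors "chosen" more
    # strongly than the reference does.
    margin = beta * (chosen_logratio - rejected_logratio)

    # Loss is -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # computed in a numerically stable way for either sign of margin.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Assumed reading for this setting: "chosen" is a response that ignores an
# injected instruction, "rejected" is one that follows it.
loss_good = dpo_loss(-1.0, -5.0, -3.0, -3.0)  # policy prefers the chosen response
loss_bad = dpo_loss(-5.0, -1.0, -3.0, -3.0)   # policy prefers the rejected response
print(loss_good < loss_bad)
```

The loss shrinks as the policy raises the chosen response's likelihood relative to the reference model and lowers the rejected one's, which is the sense in which DPO can "unlearn" a response pattern. How SecureCAI builds its preference pairs is not described in the abstract.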

Tuning-free Visual Effect Transfer across Videos

Authors: Maxwell Jones, Rameen Abdal, Or Patashnik, Ruslan Salakhutdinov, Sergey Tulyakov, Jun-Yan Zhu, Kuan-Chieh Jackson Wang

First: 2026-01-12T18:59:32+00:00 · Latest: 2026-01-12T18:59:32+00:00

Comments: Project Page: https://tuningfreevisualeffects-maker.github.io/Tuning-free-Visual-Effect-Transfer-across-Videos-Project-Page/

Abstract

We present RefVFX, a new framework that transfers complex temporal effects from a reference video onto a target video or image in a feed-forward manner. While existing methods excel at prompt-based or keyframe-conditioned editing, they struggle with dynamic temporal effects such as dynamic lighting changes or character transformations, which are difficult to describe via text or static conditions. Transferring a video effect is challenging, as the model must integrate the new temporal dynamics with the input video's existing motion and appearance. To address this, we introduce a large-scale dataset of triplets, where each triplet consists of a reference effect video, an input image or video, and a corresponding output video depicting the transferred effect. Creating this data is non-trivial, especially the video-to-video effect triplets, which do not exist naturally. To generate these, we propose a scalable automated pipeline that creates high-quality paired videos designed to preserve the input's motion and structure while transforming it based on some fixed, repeatable effect. We then augment this data with image-to-video effects derived from LoRA adapters and code-based temporal effects generated through programmatic composition. Building on our new dataset, we train our reference-conditioned model using recent text-to-video backbones. Experimental results demonstrate that RefVFX produces visually consistent and temporally coherent edits, generalizes across unseen effect categories, and outperforms prompt-only baselines in both quantitative metrics and human preference. See our website at https://tuningfreevisualeffects-maker.github.io/Tuning-free-Visual-Effect-Transfer-across-Videos-Project-Page/

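Chinese Title / Abstract (English translation)

Title: Tuning-free Visual Effect Transfer across Videos

We present RefVFX, a new framework that transfers complex temporal effects from a reference video onto a target video or image in a feed-forward manner. Existing methods excel at prompt-based or keyframe-conditioned editing but struggle with dynamic temporal effects, such as changing lighting or character transformations, that are hard to describe with text or static conditions. Transferring a video effect is challenging because the model must integrate the new temporal dynamics with the input video's existing motion and appearance. To address this, we introduce a large-scale dataset of triplets, each consisting of a reference effect video, an input image or video, and a corresponding output video showing the transferred effect. Creating this data is non-trivial, especially the video-to-video effect triplets, which do not exist naturally. We therefore propose a scalable automated pipeline that generates high-quality paired videos designed to preserve the input's motion and structure while transforming it according to a fixed, repeatable effect. We then augment the dataset with image-to-video effects derived from LoRA adapters and with code-based temporal effects generated through programmatic composition. Building on the new dataset, we train a reference-conditioned model on recent text-to-video backbones. Experiments show that RefVFX produces visually consistent and temporally coherent edits, generalizes to unseen effect categories, and outperforms prompt-only baselines in both quantitative metrics and human preference. Project website: https://tuningfreevisualeffects-maker.github.io/Tuning-free-Visual-Effect-Transfer-across-Videos-Project-Page/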

Optimal Learning Rate Schedule for Balancing Effort and Performance

Authors: Valentina Njaradi, Rodrigo Carrasco-Davis, Peter E. Latham, Andrew Saxe

First: 2026-01-12T18:59:07+00:00 · Latest: 2026-01-12T18:59:07+00:00