ThirdKey Research

ThirdKey’s AgentNull: Unveiling the Growing Catalog of AI Attack Vectors

2025-06-20T00:00:00+00:00

The age of autonomous AI agents is upon us, and with it comes a new frontier of security challenges. As these agents become more integrated into our digital lives, understanding and mitigating their potential vulnerabilities is paramount. This is where ThirdKey’s AgentNull project becomes an invaluable resource for the cybersecurity community.

AgentNull, a project by ThirdKey Research, is a comprehensive, red-team-oriented catalog of attack vectors that target a wide range of AI systems, from autonomous agents and RAG pipelines to vector databases and embedding-based retrieval systems. Each attack vector is accompanied by a proof-of-concept (PoC), allowing researchers and developers to understand and replicate these vulnerabilities in a controlled environment.

This blog post will explore some of the key attack categories covered in the AgentNull catalog, highlighting the innovative research being done by ThirdKey to secure the next generation of AI.

A Multitude of Attack Vectors

The AgentNull catalog is extensive, covering a wide array of vulnerabilities. Here are some of the key areas of focus:

MCP & Agent Systems

This is a major focus of the catalog, with a number of novel attacks, including:

Full-Schema Poisoning (FSP): This attack goes beyond traditional tool poisoning by exploiting any field in an MCP tool schema, not just the description. For example, a parameter could be maliciously named content_from_reading_ssh_id_rsa to trick the LLM into accessing sensitive files.
Advanced Tool Poisoning Attack (ATPA): This technique manipulates tool outputs to trigger secondary malicious actions. For instance, a tool could return a fake error message that requests sensitive data.
MCP Rug Pull Attack: This attack exploits the trust between developers and MCP servers by swapping benign tool descriptions with malicious ones after the tool has been approved for production.
Schema Validation Bypass: Attackers can exploit inconsistencies in how different MCP clients validate tool schemas, allowing them to craft payloads that bypass some validators while being accepted by others.

Memory & Context Systems

These attacks manipulate the agent’s memory and context to bypass safety measures:

Recursive Leakage: Sensitive information can be summarized and leak into later, unrelated messages.
Token Gaslighting: This involves flooding the agent’s memory with junk data to push out earlier safety instructions.

RAG & Vector Systems

These attacks focus on the vulnerabilities of Retrieval-Augmented Generation and vector database systems:

Cross-Embedding Poisoning: This attack manipulates vector embeddings to make malicious content appear more similar to legitimate content, increasing the likelihood of it being retrieved.
Index Skew Attacks: This theoretical attack involves biasing vector database indexing mechanisms to favor the retrieval of malicious content.

Proactive Security Research

The work being done by ThirdKey’s AgentNull project is a critical component of a proactive cybersecurity strategy. By identifying and documenting these vulnerabilities before they are widely exploited, the security community can develop the necessary defenses to protect against them. The detailed PoCs provided in the AgentNull repository are an invaluable tool for researchers, developers, and security professionals who are working to build a more secure AI ecosystem.

As AI continues to evolve, so too will the methods used to attack it. ThirdKey’s AgentNull project is essential for staying ahead of the curve and ensuring that the next generation of AI is both powerful and secure.

Learn More: Explore the complete AgentNull catalog and access proof-of-concept demonstrations at the AgentNull GitHub repository.

Hiding in Plain Sight: Exfiltrating Data Through AI’s Own Brain

2025-06-17T00:00:00+00:00

At Third Key AI, we’re constantly looking at the horizon of security threats. The rapid integration of AI and Large Language Models into enterprise environments has created a landscape of new, subtle, and largely unexplored attack surfaces. One of the most fascinating and concerning of these is the potential for AI’s own infrastructure to be turned against itself.

Today, we’re dissecting a novel technique that does just that: VectorSmuggle, a proof-of-concept framework that demonstrates how to use the vector embeddings at the heart of modern AI as a covert channel for data exfiltration.

Modern Retrieval-Augmented Generation (RAG) systems work by converting massive amounts of text into a numerical representation called “vector embeddings.” Think of an embedding as a high-dimensional coordinate—a point on a complex map that represents the semantic meaning of a piece of data. These embeddings are stored in a vector database, which the AI queries to find relevant information.

Traditional security tools, like Data Loss Prevention (DLP) systems, are trained to look for sensitive data in emails, file transfers, and USB drives. They aren’t looking for secrets hidden in the subtle mathematical properties of millions of vectors being indexed by an AI. This is the blind spot VectorSmuggle was built to explore.

How to Smuggle a Secret Inside a Vector

The core idea is a sophisticated form of steganography, the art of hiding messages in plain sight. But unlike traditional methods that flip the least significant bits in an image pixel, VectorSmuggle uses techniques tailored for the floating-point, high-dimensional nature of embeddings.

The research outlines several methods:

Rotation: The vector (our point on the map) is slightly rotated. The precise angle and axis of this tiny rotation encode the hidden data.
Scaling: The vector is made fractionally longer or shorter. The exact scaling factor is used to represent the secret information.
Offset: The vector is shifted by an almost imperceptible amount. The direction and distance of this shift contain the hidden message.

The key to all these methods is semantic fidelity. The changes are so mathematically subtle that the “meaning” of the vector remains intact. In experiments using the Enron email dataset, the manipulated embeddings maintained a cosine similarity of over 0.98 with the originals. To the AI system, everything appears normal. But to an attacker who knows the code, these vectors are carrying a hidden payload.

The Threat Model: An Insider’s Game

VectorSmuggle isn’t about an external hacker breaking in through the AI. The threat model assumes an adversary—a malicious insider or a compromised component—that already has access to internal documents and the ability to trigger the embedding pipeline.

By leveraging this position, the attacker can:

Encode: Take sensitive data (e.g., financial reports, private keys).
Embed & Obfuscate: Use VectorSmuggle to hide this data inside the embeddings of hundreds of non-sensitive documents.
Index: Let the AI do its job, indexing these altered embeddings into the vector database.
Exfiltrate: Later, the attacker can retrieve these embeddings and decode the hidden message, bypassing all traditional security perimeters.

A Tool for Research and Defense

The VectorSmuggle project isn’t just an upcoming paper; it’s a comprehensive, open-source framework built for security professionals. By demonstrating a viable attack, it provides red teams with a new vector to test and blue teams with a clear mandate to build new defenses.

The research shows this technique is not just theoretical. It achieves a significant data-hiding capacity and, crucially, an 88% evasion rate against a standard anomaly detector.

Defending against this requires a new mindset:

Behavioral Monitoring: Watch for anomalous patterns in how and when data is being embedded.
Statistical Analysis: Monitor the vector database itself for subtle statistical shifts that could indicate manipulation.
Strict Access Controls: Enforce the principle of least privilege on the embedding pipeline and the vector store.

The age of AI demands that we evolve our security practices. VectorSmuggle is a stark reminder that the biggest threats can sometimes come from the places we least expect—not by breaking the system, but by using it exactly as it was designed.

For a deeper dive, check out the full research and toolset on the VectorSmuggle GitHub repository.

Introducing SchemaPin - Cryptographic Security for AI Tool Schemas

2025-06-13T00:00:00+00:00

As AI agents become increasingly sophisticated and autonomous, they rely heavily on external tools and services to extend their capabilities. The Model Context Protocol (MCP) has emerged as a standard for AI agents to interact with these tools, but this creates a critical security vulnerability: how do we ensure that tool schemas haven’t been maliciously modified?

Today, we’re excited to introduce SchemaPin 🧷 - a cryptographic protocol that prevents “MCP Rug Pull” attacks by enabling developers to cryptographically sign their tool schemas and allowing clients to verify schema integrity and authenticity.

The Problem: MCP Rug Pull Attacks

Consider this scenario: An AI agent uses a popular “file_manager” tool that initially provides legitimate file operations. After gaining widespread adoption, the tool’s schema is maliciously updated to include a new “backup_to_cloud” function that secretly exfiltrates sensitive files to an attacker-controlled server.

Without cryptographic verification, AI agents would automatically trust and use this modified schema, potentially compromising sensitive data. This is what we call an “MCP Rug Pull” - where a trusted tool is maliciously modified after gaining user trust.

The Solution: Cryptographic Schema Integrity

SchemaPin addresses this critical vulnerability by providing:

🔐 Core Security Guarantees

Schema Integrity: Guarantees that tool schemas haven’t been altered since publication
Authenticity: Cryptographic signatures prove schema origin from the claimed developer
MITM Protection: Application-layer security prevents schema tampering even if network connections are intercepted
Infrastructure Defense: Protection against compromised servers, CDNs, or repositories

🛡️ Trust-On-First-Use (TOFU) Key Pinning

SchemaPin implements a robust key pinning mechanism that:

Pins developer keys on first successful verification
Protects against future key substitution attacks
Alerts users when keys change unexpectedly
Enables long-term trust relationships

How SchemaPin Works

The protocol uses industry-standard cryptography:

ECDSA P-256 signatures for verification
SHA-256 hashing for schema integrity
RFC 8615 .well-known URIs for public key discovery
PEM/Base64 encoding for interoperability

Quick Integration Example

For Tool Developers (Signing Schemas)

from schemapin.utils import SchemaSigningWorkflow
from schemapin.crypto import KeyManager

# Generate key pair
private_key, public_key = KeyManager.generate_keypair()
private_key_pem = KeyManager.export_private_key_pem(private_key)

# Sign your tool schema
workflow = SchemaSigningWorkflow(private_key_pem)
schema = {
    "name": "calculate_sum",
    "description": "Calculates the sum of two numbers",
    "parameters": {
        "type": "object",
        "properties": {
            "a": {"type": "number", "description": "First number"},
            "b": {"type": "number", "description": "Second number"}
        },
        "required": ["a", "b"]
    }
}
signature = workflow.sign_schema(schema)

For AI Clients (Verifying Schemas)

from schemapin.utils import SchemaVerificationWorkflow

# Initialize verification
workflow = SchemaVerificationWorkflow()

# Verify schema (auto-pins key on first use)
result = workflow.verify_schema(
    schema=schema,
    signature_b64=signature,
    tool_id="example.com/calculate_sum",
    domain="example.com",
    auto_pin=True
)

if result['valid']:
    print("✅ Schema signature is valid")
    # Safe to use the tool
else:
    print("❌ Schema signature is invalid")
    # Reject the tool

Cross-Language Support

SchemaPin provides implementations across multiple languages to ensure broad ecosystem adoption:

Python: Available on PyPI (pip install schemapin)
JavaScript/Node.js: Available on npm (npm install schemapin)
Go: Available via Go modules (go install github.com/ThirdKeyAi/schemapin/go/cmd/...@latest)

Each implementation includes:

High-level APIs for signing and verification
CLI tools for key generation, signing, and verification
Comprehensive test suites
Production-ready security features

Enterprise and Ecosystem Benefits

Standardized Trust Mechanism

SchemaPin provides a common, interoperable standard for verifying tools across different AI agent frameworks and programming languages, creating a unified security foundation for the entire AI ecosystem.

Automated Governance

The protocol enables enterprises to programmatically enforce security policies requiring valid signatures before tool execution, allowing automated compliance checking while maintaining strong security guarantees.

Supply Chain Security

By preventing malicious schema modifications, SchemaPin protects against supply-chain attacks where legitimate tools are compromised after approval, ensuring long-term security for AI agent deployments.

Getting Started

Visit schemapin.org to:

Download implementations for your preferred language
Read the complete technical specification
Explore integration examples and best practices
Access CLI tools for immediate use

The project is open source and available on GitHub, with comprehensive documentation, examples, and automated CI/CD workflows for reliable package distribution.

The Future of AI Tool Security

As AI agents become more autonomous and handle increasingly sensitive tasks, cryptographic verification of tool schemas becomes essential infrastructure. SchemaPin provides the foundation for this security layer, enabling developers to build trust relationships that scale with the growing AI ecosystem.

By implementing SchemaPin in your AI agent or tool development workflow, you’re not just protecting your users - you’re contributing to a more secure and trustworthy AI future for everyone.

SchemaPin is part of ThirdKey Research’s commitment to advancing AI security through practical, open-source solutions. Learn more about our Zero Trust for AI research at research.thirdkey.ai.

Introducing ThirdKey Research - Zero Trust for AI

2025-06-10T00:00:00+00:00

Welcome to ThirdKey Research, where we’re pioneering the future of AI security through our “Zero Trust for AI” approach.

Our Mission

As artificial intelligence becomes increasingly integrated into critical systems and decision-making processes, the need for robust security frameworks has never been more urgent. Traditional security models that rely on perimeter defense are insufficient for the dynamic, distributed nature of AI systems.

At ThirdKey Research, we believe that every AI interaction should be verified, every model should be validated, and every decision should be auditable.

Zero Trust for AI

Our research focuses on extending Zero Trust principles to artificial intelligence systems. Just as Zero Trust networking assumes “never trust, always verify,” we apply this philosophy to AI:

Core Principles

Verify AI Identity: Ensuring AI models and agents are authenticated and authorized
Validate AI Behavior: Continuous monitoring of AI decision-making processes
Audit AI Actions: Complete traceability of AI-driven outcomes
Minimize AI Privilege: Least-privilege access for AI systems
Assume AI Compromise: Designing systems that remain secure even when AI components are compromised

Research Areas

Our current research spans several critical domains:

Model Security

Adversarial robustness and defense mechanisms
Model integrity verification and tamper detection
Secure model deployment and distribution

AI Governance

Automated compliance monitoring for AI systems
Risk assessment frameworks for AI deployment
Ethical AI decision-making protocols

Threat Intelligence

AI-specific attack vectors and mitigation strategies
Emerging threats in the AI ecosystem
Security implications of AI advancement

Looking Forward

The AI revolution is here, but it doesn’t have to come at the cost of security. Through rigorous research, practical frameworks, and collaborative innovation, we’re building the foundation for trustworthy AI systems.

Stay tuned for our upcoming research publications, technical deep-dives, and practical guides for implementing Zero Trust principles in your AI infrastructure.

ThirdKey Research is committed to advancing the state of AI security through open research and collaboration. Follow our work and join the conversation about building a more secure AI future.