XIP-67: Inline Actions

saul · June 5, 2025, 5:30pm

XIP-67: Inline Actions

Abstract

This XIP introduces a standardized protocol for inline action buttons within XMTP conversations, enabling agents to present clear, actionable options to users through button interfaces. The protocol defines two primary content types: Actions for declaring available actions and Intent, which captures a user’s selection. This specification provides a foundation for both group-appropriate interactions (such as polls and shared decisions) and individual workflows that transition to direct messages to avoid cluttering group chats (such as multi-step configurations, payments, and personalized responses).

Motivation

Today, interacting with agents usually means either typing arcane slash‑commands or composing free‑form natural‑language prompts. Both approaches introduce ambiguity, increase cognitive load, and clutter conversation histories with verbose exchanges. Users struggle to discover available actions, understand how to invoke them, and parse agent responses buried in text.

Inline action buttons address these challenges by surfacing clear, discrete options that users can select with a single tap. This approach dramatically reduces friction in agent interactions while maintaining a clean, scannable conversation history. By providing structured, machine-readable messages for both action declaration and intent expression, the protocol enables clients to render appropriate UI elements while gracefully degrading for older implementations.

The deliberately minimal design allows the protocol to work effectively in both 1:1 and group contexts within XMTP’s current constraints, while remaining compatible with future enhancements to group messaging.

Specification

Core Concepts

The protocol standardizes how users initiate interactions with agents:

Actions — An agent presents available options to users through button interfaces
Intents — A user expresses their selection by tapping a button
Handling — The agent processes the intent through appropriate workflows (simple text responses, multi-step DM conversations, transaction requests, etc.)

This minimal approach provides clear user intent to agents while allowing flexible response patterns through standard XMTP messaging.

Content Type: `Actions`

The Actions content type enables agents to present interactive button options to users. Each message contains a descriptive text field and an array of action definitions.

Schema

{
  // Unique identifier for these actions
  "id": "string",
  // Descriptive text explaining the actions
  "description": "string",
  // Array of action definitions
  "actions": [
    {
      // Unique identifier for this action
      "id": "string",
      // Display text for the button
      "label": "string",
      // Optional image URL to associate with this action
      "imageUrl": "string",
      // Optional visual style for this action: e.g. primary|secondary|danger
      "style": "string",
      // Optional ISO-8601 expiration timestamp for this action
      "expiresAt": "string"
    }
  ],
  // Optional ISO-8601 expiration timestamp for all actions
  "expiresAt": "string"
}

Requirements

Actions messages SHOULD only be sent by agents; end users SHOULD NOT emit them.
When the optional expiresAt timestamp is passed, clients SHOULD disable or hide the associated buttons.
Clients SHOULD validate all fields against the schema before rendering.

Content Type: `Intent`

Intent represents a user’s selection of an action. It contains only the essential information needed to identify which action was selected.

Schema

{
  // References id from the Actions message
  "id": "string",
  // References action.id from the Actions message
  "actionId": "string",
  // Optional context data
  "metadata": {}
}

Requirements

Clients SHOULD emit Intent immediately upon user selection and disable all related buttons
The action_id SHOULD match an action ID from the referenced Actions message
The metadata field is OPTIONAL and MAY contain action-specific data (e.g., custom amounts, user input)
Agents SHOULD handle intents idempotently - duplicate (id, actionId, sender) combinations should produce the same outcome
Clients SHOULD NOT emit multiple different Intent messages for the same Actions message

Group Chat Behavior

In group conversations, the protocol operates with full transparency:

All Actions messages are visible to all group members
All Intent messages are visible to all group members
Agents MUST assume public visibility when designing group actions
For individual workflows, agents MUST transition to direct messages

This transparent model works well for:

Polls and surveys
Game moves where everyone needs to see selections
Shared decisions
Simple queries with brief responses

Agents should transition to DMs for:

Individual workflows - Multi-step processes, configurations, or extended interactions
Privacy-sensitive actions - Payments, personal data, credentials
Personalized responses - Results that only matter to the requesting user
Complex interactions - Anything requiring back-and-forth that would clutter the group

The goal is maintaining clean, readable group transcripts where every message provides value to all participants.

Integration with XIP-59 `WalletSendCalls`

For actions requiring onchain transactions, agents respond to Intent messages with XIP-59 WalletSendCalls:

User sends Intent expressing intent to execute onchain action
Agent responds with a WalletSendCalls message containing transaction parameters
Client presents native wallet interface for user confirmation
Transaction executes through standard wallet flow
Agent monitors blockchain for confirmation and sends text updates

This integration works identically in both 1:1 and group contexts, though privacy-sensitive transactions should be handled in direct messages.

Representative Flows

Simple Group Poll

A straightforward public voting scenario in a group chat:

Agent → Group: “Should we meet at 3pm or 4pm?”
Agent → Group: Sends Actions with buttons: “3pm ”, “4pm ”
Alice → Group: Sends Intent selecting “3pm” (references message #2)
Bob → Group: Sends Intent selecting “3pm” (references message #2)
Carol → Group: Sends Intent selecting “4pm” (references message #2)
Agent → Group: “Current votes: 3pm (2), 4pm (1)”

Payment with Privacy Transition

A payment flow that starts in a group but handles sensitive data privately:

In Group Chat:

User: “I want to pay for premium”
Agent → Group: “Premium is $10/month. Ready to upgrade?”
Agent → Group: Sends Actions with button: “Subscribe ”
User → Group: Sends Intent selecting “Subscribe”
Agent → Group: “Thanks! Check your DMs to complete payment”

In Direct Message:

Agent → User: “Let’s set up your premium subscription”
Agent → User: Sends WalletSendCalls for $10 USDC payment
User: Confirms transaction in wallet
Agent → User: “Payment confirmed! Your premium access is active”

Back in Group Chat:

Agent → Group: “ User is now a premium member!”

Multi-Stage Game with DM Coordination

A game where private actions in DM affect public group state:

In Group Chat:

Agent → Group: “Starting a new game of Mafia! 5 players joined”
Agent → Group: Sends Actions with button: “Get Your Role ”
Alice → Group: Sends Intent selecting “Get Your Role”
Bob → Group: Sends Intent selecting “Get Your Role”
Carol → Group: Sends Intent selecting “Get Your Role”

In Direct Messages (sent to each player):

Agent → Alice: “You are the Mafia! Kill someone each night”
Agent → Bob: “You are a Villager. Find the Mafia!”
Agent → Carol: “You are the Detective. Investigate one player each night”

Night Phase in DMs:

Agent → Alice: “Who do you want to eliminate?”
Agent → Alice: Sends Actions with buttons for each player
Alice → Agent: Sends Intent selecting “Bob”

Day Phase in Group:

Agent → Group: “ Morning! Bob was eliminated last night”
Agent → Group: “Time to vote! Who is the Mafia?”
Agent → Group: Sends Actions with remaining players as buttons

Configuration Flow with DM Transition

A personalization flow that transitions to DM to avoid cluttering the group:

In Group Chat:

User: “I want to set up my preferences”
Agent → Group: “I’ll help you configure your settings”
Agent → Group: Sends Actions with button: “Configure Settings ”
User → Group: Sends Intent selecting “Configure Settings”
Agent → Group: “I’ll walk you through setup in DMs”

In Direct Message:

Agent → User: “Let’s configure your preferences. First, choose your timezone:”
Agent → User: Sends Actions with buttons: “PST”, “EST”, “CST”, “UTC”
User → Agent: Sends Intent selecting “PST”
Agent → User: “Great! Now select your notification preferences:”
Agent → User: Sends Actions with buttons: “All”, “Important Only”, “None”
User → Agent: Sends Intent selecting “Important Only”
Agent → User: “All set! Your preferences have been saved”

Back in Group Chat:

Agent → Group: “ User’s preferences have been configured”

Sports Betting with Group Discussion and Private Wagers

A comprehensive flow showing how a betting agent facilitates public discussion while handling individual wagers privately:

Section 1-2.png

In Group Chat (Pre-Game Discussion):

Agent → Group: “ Tonight’s Games: Lakers vs Warriors (8pm ET), Celtics vs Heat (7:30pm ET)”
Agent → Group: Sends Actions with buttons: “Lakers/Warriors ”, “Celtics/Heat ”, “My Bets ”
Alice → Group: Sends Intent selecting “Lakers/Warriors”
Agent → Group: “Lakers (-5.5) vs Warriors • O/U 220.5 • Lakers -250 ML”
Bob → Group: “Lakers covering easily tonight”
Agent → Group: “Want to place a bet on Lakers/Warriors?”
Agent → Group: Sends Actions with buttons: “Bet on This Game ”, “Just Watching ”
Alice → Group: Sends Intent selecting “Bet on This Game”
Bob → Group: Sends Intent selecting “Bet on This Game”
Agent → Group: “Alice, Bob - check your DMs to place your bets privately”

In Alice’s DM:

Agent → Alice: “Let’s set up your Lakers/Warriors bet. Choose your bet type:”
Agent → Alice: Sends Actions with buttons: “Lakers -5.5”, “Warriors +5.5”, “Over 220.5”, “Under 220.5”, “Lakers ML”
Alice → Agent: Sends Intent selecting “Lakers -5.5”
Agent → Alice: “How much would you like to wager?”
Agent → Alice: Sends Actions with buttons: “$10”, “$25”, “$50”, “$100”, “Custom”
Alice → Agent: Sends Intent selecting “$25”
Agent → Alice: “Confirm: $25 on Lakers -5.5 (-110) • To win: $22.73”
Agent → Alice: Sends WalletSendCalls for $25 USDC payment
Alice → Agent: Completes transaction
Agent → Alice: “ Bet confirmed! Lakers -5.5 for $25 • Ticket #8934”

In Bob’s DM (simultaneously):

Agent → Bob: “Let’s set up your Lakers/Warriors bet. Choose your bet type:”
Agent → Bob: Sends Actions with buttons: “Lakers -5.5”, “Warriors +5.5”, “Over 220.5”, “Under 220.5”, “Lakers ML”
Bob → Agent: Sends Intent selecting “Over 220.5”
Agent → Bob: “How much would you like to wager?”
Agent → Bob: Sends Actions with buttons: “$10”, “$25”, “$50”, “$100”, “Custom”
Bob → Agent: Sends Intent selecting “$50”
Agent → Bob: “Confirm: $50 on Over 220.5 (-110) • To win: $45.45”
Agent → Bob: Sends WalletSendCalls for $50 USDC payment
Bob → Agent: Completes transaction
Agent → Bob: “ Bet confirmed! Over 220.5 for $50 • Ticket #8935”

Back in Group Chat (Pre-Game):

Agent → Group: “ Active bets placed for tonight’s Lakers/Warriors game!”
Carol → Group: “What’s everyone betting on?”
Agent → Group: “2 bets placed so far. Popular picks: Spread (1), Total (1)”
Carol → Group: “I’ll take Warriors +5.5”
Agent → Group: Sends Actions with button: “Place This Bet ”
Carol → Group: Sends Intent selecting “Place This Bet”
Agent → Group: “Carol - check your DMs to complete your Warriors +5.5 bet”

During the Game (Group Chat):

Agent → Group: “ Q3 Update: Lakers 78, Warriors 71 • Total: 149”
Alice → Group: “Lakers covering! ”
Bob → Group: “Need 72 points in Q4 for the over ”

Post-Game Settlement (Group Chat):

Agent → Group: “ Final: Lakers 115, Warriors 108 • Total: 223”
Agent → Group: “Tonight’s results: Lakers covered -5.5 • Over 220.5 hit ”
Agent → Group: “Congrats to tonight’s winners! Check your DMs for payouts”

In Alice’s DM:

Agent → Alice: “Congrats! Your bet won: Lakers -5.5 ”
Agent → Alice: “Payout: $47.73 USDC (bet: $25 + winnings: $22.73)”
Agent → Alice: Sends WalletSendCalls to transfer winnings
Agent → Alice: “Want to see tomorrow’s games?”

In Carol’s DM:

Agent → Carol: “Your bet lost: Warriors +5.5 (Lakers won by 7)”
Agent → Carol: “Better luck next time! Want to see tomorrow’s games?”

This pattern demonstrates:

Public discussion and analysis in the group
Private handling of all financial transactions
Group-visible aggregate information without exposing individual bet amounts
Clear transitions between group and DM contexts
Post-game settlement maintaining privacy while celebrating publicly

User Experience Considerations

Current Limitations

The requirement to transition between group chats and DMs for individual workflows introduces UX friction:

Users must context-switch between conversations
Message flow is fragmented across multiple threads
Users may miss DM notifications while focused on group chat
Mobile clients must handle navigation between chat contexts

These transitions are necessary not only for privacy but also to prevent group chats from becoming cluttered with individual interactions that provide no value to other members. A group chat filled with one user’s multi-step configuration process or personalized results would degrade the experience for everyone.

Future Compatibility

This protocol is designed to remain fully compatible with future enhancements to XMTP that may address these limitations:

Directed Group Messages: If XMTP adds support for private messages within groups, Intent could target specific recipients without protocol changes
Client-Level Solutions: Clients could implement UI patterns that embed DM contexts within group chat interfaces
Inline Frames: Future clients might render agent DMs as inline frames or modals within the group chat context
Protocol Evolution: The minimal design ensures compatibility with future privacy features without breaking changes

The simplicity of this design means the protocol can adapt to new messaging patterns as they emerge.

Security Considerations

Replay Protection

The protocol employs nonces to prevent replay attacks:

Agents MUST generate unique nonces for each Actions message
Clients MUST reject duplicate (nonce, sender) combinations
Agents SHOULD maintain nonce validity windows (e.g., 24 hours)

Intent Verification

To prevent spoofing:

Agents MUST verify that Intent references a valid, unexpired Actions message they sent
Agents MUST verify the sender is an intended recipient of the original Actions
Clients MUST NOT allow users to craft arbitrary Intent messages

Privacy Preservation

Agents MUST clearly indicate when actions will be public vs. private
Payment amounts, personal data, and credentials MUST only be requested in DMs
Agents SHOULD transition to DMs for any extended interaction to avoid group clutter
Agents SHOULD warn users before collecting sensitive information

Backwards Compatibility

Clients that do not support the Inline Actions protocol will cannot parse Actions or Intent messages. To mitigate this, agents SHOULD:

Include fallback text with available commands in Actions messages
Recognize both button selections and text commands
Provide clear instructions for non-supporting clients

Example fallback text:

Choose an option:
[A] Option One
[B] Option Two

Reply with A or B to select

Implementation Considerations

Button State Management

Clients MUST:

Disable all buttons immediately upon selection
Persist selection state across app restarts
Show visual feedback for selected buttons
Handle expired actions appropriately

Agent Response Patterns

Agents SHOULD:

Acknowledge every Intent with a response
Use text messages for simple confirmations
Transition to DM for:
- Multi-step workflows
- Personalized configurations
- Extended interactions
- Privacy-sensitive data
Clearly communicate when/why DM is required
Keep group responses concise and relevant to all members

Visual Consistency

While clients have flexibility in rendering, they SHOULD:

Respect the style hint for visual hierarchy
Display icons when provided
Show expiration status clearly
Maintain consistent disabled states

Future Considerations

This specification provides a minimal foundation that can be extended:

Rich metadata schemas for common use cases
Conditional actions based on user state
Multi-step wizards with backward navigation
Integration with other content types
Standardized DM transition patterns

Extensions should maintain backward compatibility with the core intent model.

References

Copyright

Copyright and related rights waived via CC0.

XIP-67: Inline Actions

XIP-67: Inline Actions

Abstract

Motivation

Specification

Core Concepts

Content Type: Actions

Schema

Requirements

Content Type: Intent

Schema

Requirements

Group Chat Behavior

Integration with XIP-59 WalletSendCalls

Representative Flows

Simple Group Poll

Payment with Privacy Transition

Multi-Stage Game with DM Coordination

Configuration Flow with DM Transition

Sports Betting with Group Discussion and Private Wagers

User Experience Considerations

Current Limitations

Future Compatibility

Security Considerations

Replay Protection

Intent Verification

Privacy Preservation

Backwards Compatibility

Implementation Considerations

Button State Management

Agent Response Patterns

Visual Consistency

Future Considerations

References

Copyright

Content Type: `Actions`

Content Type: `Intent`

Integration with XIP-59 `WalletSendCalls`