Submit

Forge - GPU Kernel Optimization

@RightNow-AI

Turn slow PyTorch into fast CUDA/Triton kernels. 32 parallel swarm agents optimize your code on real datacenter GPUs (B200, H200, H100, A100) with up to 14x speedup over torch.compile.
Overview

Forge MCP Server

Swarm agents that turn slow PyTorch into fast CUDA/Triton kernels, from any AI coding agent.

What it does

  • Optimize existing kernels - Submit PyTorch code, get back an optimized Triton/CUDA kernel
  • Generate new kernels - Describe an operation, get a production-ready optimized kernel
  • 32 parallel swarm agents - Coder+Judge pairs compete to find the fastest kernel
  • Real GPU benchmarking - Every kernel is tested on datacenter hardware (B200, H200, H100, A100, L40S, T4)
  • Up to 14x faster than torch.compile(mode='max-autotune')
  • One-click auth - Browser-based OAuth, no API keys needed

Quick Start

claude mcp add forge-mcp -- npx -y @rightnow/forge-mcp-server

Tools
┌────────────────┬───────────────────────────────────────────────┐
│      Tool      │                  Description                  │
├────────────────┼───────────────────────────────────────────────┤
│ forge_auth     │ Authenticate with Forge via browser           │
├────────────────┼───────────────────────────────────────────────┤
│ forge_optimize │ Optimize PyTorch code into fast GPU kernels   │
├────────────────┼───────────────────────────────────────────────┤
│ forge_generate │ Generate optimized kernels from a description │
├────────────────┼───────────────────────────────────────────────┤
│ forge_credits  │ Check credit balance                          │
├────────────────┼───────────────────────────────────────────────┤
│ forge_status   │ Check job status                              │
├────────────────┼───────────────────────────────────────────────┤
│ forge_cancel   │ Cancel a running job                          │
├────────────────┼───────────────────────────────────────────────┤
│ forge_sessions │ List past optimization sessions               │
└────────────────┴───────────────────────────────────────────────┘


Pricing

Pay-as-you-go. $15/credit, 25% off at 10+. Free trial included - optimize 1 kernel, no credit card required.

https://rightnowai.co - https://github.com/RightNow-AI/forge-mcp-server - https://www.npmjs.com/package/@rightnow/forge-mcp-server

Server Config

{
  "mcpServers": {
    "forge": {
      "command": "npx",
      "args": [
        "-y",
        "@rightnow/forge-mcp-server"
      ]
    }
  }
}
© 2025 MCP.so. All rights reserved.

Build with ShipAny.