API Reference: createArmor

The main entry point for ai-armor. Creates an ArmorInstance with all configured protections.

Import

import { createArmor } from 'ai-armor'

Signature

function createArmor(config: ArmorConfig): ArmorInstance

Parameters

config: ArmorConfig

Every field on ArmorConfig is optional. Only configure the features you need.

interface ArmorConfig {
  rateLimit?: RateLimitConfig
  budget?: BudgetConfig
  fallback?: FallbackConfig
  cache?: CacheConfig
  routing?: RoutingConfig
  safety?: SafetyConfig
  logging?: LoggingConfig
}

rateLimit: RateLimitConfig

Field	Type	Required	Description
`strategy`	`'sliding-window'`	Yes	Rate limiting algorithm
`rules`	`RateLimitRule[]`	Yes	Array of rate limit rules
`store`	`'memory' | 'redis' | StorageAdapter`	No	Storage backend. Default: in-memory
`keyResolver`	`(ctx: ArmorContext, ruleKey: string) => string`	No	Custom key resolution
`onLimited`	`(ctx: ArmorContext) => void`	No	Callback when rate limited

RateLimitRule:

Field	Type	Description
`key`	`'user' | 'ip' | 'apiKey' | string`	Dimension to rate limit by
`limit`	`number`	Maximum requests in the window
`window`	`string`	Time window (e.g., `'1m'`, `'1h'`, `'30s'`, `'1d'`)

See Rate Limiting guide for details.
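
The `window` strings above follow a number-plus-unit pattern. As a rough sketch of how such a value might be converted to milliseconds, the hypothetical `parseWindow` helper below handles the four units that appear in the examples (`s`, `m`, `h`, `d`). It is not part of ai-armor, and the library's own parser may accept other formats.

```typescript
// Hypothetical helper (not an ai-armor export): convert '30s', '1m', '1h', '1d' to milliseconds.
// The unit set is inferred from the examples in the RateLimitRule table.
const UNIT_MS: Record<string, number> = {
  s: 1000,
  m: 60 * 1000,
  h: 60 * 60 * 1000,
  d: 24 * 60 * 60 * 1000,
}

function parseWindow(window: string): number {
  const match = /^(\d+)([smhd])$/.exec(window.trim())
  if (match === null) {
    throw new Error(`Unsupported window format: ${window}`)
  }
  const [, amount = '', unit = ''] = match
  const unitMs = UNIT_MS[unit]
  if (unitMs === undefined) {
    throw new Error(`Unsupported window unit: ${unit}`)
  }
  return Number(amount) * unitMs
}
```

Rejecting unknown formats loudly, rather than defaulting to some window, keeps a typo such as `'1 min'` from silently changing the limit.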

budget: BudgetConfig

Field	Type	Required	Description
`daily`	`number`	No	Maximum daily spend in USD
`monthly`	`number`	No	Maximum monthly spend in USD
`perUser`	`number`	No	Maximum daily spend per user in USD
`perUserMonthly`	`number`	No	Maximum monthly spend per user in USD
`onExceeded`	`'block' | 'warn' | 'downgrade-model'`	Yes	Action when budget exceeded
`downgradeMap`	`Record<string, string>`	No	Model downgrade mapping
`store`	`'memory' | 'redis' | StorageAdapter`	No	Storage backend. Default: in-memory
`onWarned`	`(ctx: ArmorContext, budget: { daily: number, monthly: number, perUserDaily?: number, perUserMonthly?: number }) => void`	No	Callback when budget warning fires
`onUnknownModel`	`(model: string) => void`	No	Callback when a model not in the pricing database is encountered

See Cost Tracking guide for details.

fallback: FallbackConfig

Field	Type	Required	Description
`chains`	`Record<string, string[]>`	Yes	Fallback model chains per primary model
`timeout`	`number`	No	Request timeout in ms before triggering fallback
`retries`	`number`	No	Number of retries before moving to next in chain
`backoff`	`'exponential' | 'linear'`	No	Backoff strategy between retries
`healthCheck`	`boolean`	No	Enable passive health checking

cache: CacheConfig

Field	Type	Required	Description
`enabled`	`boolean`	Yes	Enable/disable caching
`strategy`	`'exact' | 'semantic'`	Yes	Cache matching strategy
`ttl`	`number`	Yes	Time-to-live in seconds
`maxSize`	`number`	No	Maximum entries (LRU eviction)
`keyFn`	`(request: ArmorRequest) => string`	No	Custom cache key generator

Note: The semantic strategy requires `embeddingFn` and optionally accepts `similarityThreshold`.

Semantic cache additional fields:

Field	Type	Required	Description
`embeddingFn`	`(text: string) => Promise<number[]>`	Yes (semantic)	User-provided embedding function
`similarityThreshold`	`number`	No	Minimum cosine similarity for cache hit (default: 0.92)

See Caching guide for details.
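
To make the threshold concrete, the sketch below shows one way a semantic lookup could work: compare the query embedding against stored embeddings and accept the best match at or above `similarityThreshold`. The `cosine` and `findSemanticHit` helpers and the `Entry` shape are hypothetical and written locally for illustration; the package's own `cosineSimilarity` and `createSemanticCache` exports may behave differently.

```typescript
// Local cosine similarity for illustration (hypothetical, not the library export).
function cosine(a: number[], b: number[]): number {
  if (a.length === 0 || a.length !== b.length) {
    throw new Error('Vectors must be non-empty and of equal length')
  }
  let dot = 0
  let normA = 0
  let normB = 0
  for (let i = 0; i < a.length; i++) {
    const x = a[i] ?? 0
    const y = b[i] ?? 0
    dot += x * y
    normA += x * x
    normB += y * y
  }
  if (normA === 0 || normB === 0) {
    return 0 // a zero vector has no direction, so treat it as dissimilar
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB))
}

// Hypothetical cache entry: a stored embedding plus the response it maps to.
interface Entry {
  embedding: number[]
  response: unknown
}

// Return the response of the most similar entry at or above the threshold.
function findSemanticHit(query: number[], entries: Entry[], threshold = 0.92): unknown | undefined {
  let best: Entry | undefined
  let bestScore = -Infinity
  for (const entry of entries) {
    const score = cosine(query, entry.embedding)
    if (score >= threshold && score > bestScore) {
      best = entry
      bestScore = score
    }
  }
  return best?.response
}
```

A higher threshold reduces the chance of returning an answer to a merely similar question, at the cost of fewer cache hits.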

routing: RoutingConfig

Field	Type	Required	Description
`aliases`	`Record<string, string>`	Yes	Map of alias to real model name

See Model Routing guide for details.

safety: SafetyConfig

Field	Type	Required	Description
`promptInjection`	`boolean`	No	Enable prompt injection detection
`piiDetection`	`boolean`	No	Enable PII detection
`maxTokensPerRequest`	`number`	No	Maximum tokens per request
`blockedPatterns`	`RegExp[]`	No	Patterns that block matching content
`onBlocked`	`(ctx: ArmorContext, reason: string) => void`	No	Callback when request is blocked

See Safety guide for details.

logging: LoggingConfig

Field	Type	Required	Description
`enabled`	`boolean`	Yes	Enable/disable logging
`include`	`Array<'model' | 'tokens' | 'cost' | 'latency' | 'userId' | 'cached' | 'fallback'>`	Yes	Fields to include
`onRequest`	`(log: ArmorLog) => void | Promise<void>`	No	Callback after each logged request
`maxEntries`	`number`	No	Max entries in memory. Default: `10000`

See Logging guide for details.
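
Putting the sections above together, a configuration might look like the following sketch. Field names come from the tables above; every value (limits, model names, patterns) is an illustrative assumption rather than a recommended default.

```typescript
// Illustrative values only. In application code this object would be passed to createArmor.
const config = {
  rateLimit: {
    strategy: 'sliding-window',
    rules: [
      { key: 'user', limit: 60, window: '1m' },
      { key: 'ip', limit: 1000, window: '1h' },
    ],
  },
  budget: {
    daily: 50,
    perUser: 5,
    onExceeded: 'downgrade-model',
    downgradeMap: { 'gpt-4o': 'gpt-4o-mini' },
  },
  fallback: {
    chains: { 'gpt-4o': ['gpt-4o-mini'] },
    timeout: 10000,
    retries: 2,
    backoff: 'exponential',
  },
  cache: { enabled: true, strategy: 'exact', ttl: 3600, maxSize: 1000 },
  routing: { aliases: { fast: 'gpt-4o-mini' } },
  safety: {
    promptInjection: true,
    maxTokensPerRequest: 4000,
    blockedPatterns: [/ignore previous instructions/i],
  },
  logging: { enabled: true, include: ['model', 'tokens', 'cost', 'latency'], maxEntries: 10000 },
}
```

Because every top-level field is optional, any of these sections can be dropped without affecting the others.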

Return Type: ArmorInstance

interface ArmorInstance {
  config: ArmorConfig
  checkRateLimit: (ctx: ArmorContext) => Promise<RateLimitResult>
  peekRateLimit: (ctx: ArmorContext) => Promise<{ remaining: number, resetAt: number }>
  trackCost: (model: string, inputTokens: number, outputTokens: number, userId?: string) => Promise<void>
  checkBudget: (model: string, ctx: ArmorContext) => Promise<{ allowed: boolean, action: string, suggestedModel?: string }>
  getDailyCost: (userId?: string) => Promise<number>
  getMonthlyCost: (userId?: string) => Promise<number>
  resolveModel: (model: string) => string
  getCachedResponse: (request: ArmorRequest) => Promise<unknown | undefined>
  setCachedResponse: (request: ArmorRequest, response: unknown) => Promise<void>
  log: (entry: ArmorLog) => Promise<void>
  getLogs: () => ArmorLog[]
  estimateCost: (model: string, inputTokens: number, outputTokens: number) => number
  getProvider: (model: string) => string
  checkSafety: (request: ArmorRequest, ctx: ArmorContext) => SafetyCheckResult
  executeFallback: <T>(request: ArmorRequest, handler: (model: string) => Promise<T>) => Promise<FallbackResult<T>>
}
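
When these methods are called directly rather than through a middleware integration, the order of calls is up to the caller. The sketch below shows one plausible ordering: rate limit, alias resolution, budget check, cache lookup, then the provider call. `ArmorLike`, `Ctx`, `Req`, `guardedCall`, and `callProvider` are hypothetical names defined locally; `ArmorLike` mirrors a subset of the interface above so the example runs without the package, and the context fields are assumed from the examples on this page.

```typescript
// Assumed shapes, modelled on the examples in this reference.
interface Ctx { userId?: string, ip?: string }
interface Req { model: string, messages: Array<{ role: string, content: string }> }

// Structural subset of ArmorInstance used by this sketch.
interface ArmorLike {
  resolveModel: (model: string) => string
  checkRateLimit: (ctx: Ctx) => Promise<{ allowed: boolean, remaining: number, resetAt: number }>
  checkBudget: (model: string, ctx: Ctx) => Promise<{ allowed: boolean, action: string, suggestedModel?: string }>
  getCachedResponse: (request: Req) => Promise<unknown | undefined>
  setCachedResponse: (request: Req, response: unknown) => Promise<void>
}

async function guardedCall(
  armor: ArmorLike,
  request: Req,
  ctx: Ctx,
  callProvider: (request: Req) => Promise<unknown>, // hypothetical provider call
): Promise<unknown> {
  const rate = await armor.checkRateLimit(ctx)
  if (!rate.allowed) {
    throw new Error(`Rate limited until ${rate.resetAt}`)
  }

  const resolved = armor.resolveModel(request.model)
  const budget = await armor.checkBudget(resolved, ctx)
  if (!budget.allowed) {
    throw new Error(`Budget exceeded: ${budget.action}`)
  }

  // Use the cheaper model when the budget check suggests one.
  const finalRequest: Req = { ...request, model: budget.suggestedModel ?? resolved }

  const cached = await armor.getCachedResponse(finalRequest)
  if (cached !== undefined) {
    return cached
  }

  const response = await callProvider(finalRequest)
  await armor.setCachedResponse(finalRequest, response)
  return response
}
```

Checking the rate limit and budget before the cache lookup is a design choice of this sketch; an application that wants cached answers to bypass limits could reorder the steps.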

Methods

checkRateLimit(ctx)

Check all configured rate limit rules for the given context.

const result = await armor.checkRateLimit({
  userId: 'user-123',
  ip: '192.168.1.1',
})

Parameters:

Name	Type	Description
`ctx`	`ArmorContext`	Request context with user/IP/API key info

Returns: Promise<RateLimitResult>

interface RateLimitResult {
  allowed: boolean // true if request can proceed
  remaining: number // requests remaining before limit
  resetAt: number // Unix timestamp (ms) when window resets
}

If no rate limit is configured, returns { allowed: true, remaining: Infinity, resetAt: 0 }.
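
For readers unfamiliar with the `'sliding-window'` strategy, the sketch below shows a minimal in-memory version that produces the same `RateLimitResult` shape. `createSlidingWindowLimiter` is a hypothetical helper written for illustration; it keeps a timestamp log per key, which is one common way to implement a sliding window and not necessarily how ai-armor does it.

```typescript
interface RateLimitResult {
  allowed: boolean
  remaining: number
  resetAt: number
}

// Hypothetical in-memory limiter; `now` is injectable so the behaviour can be tested.
function createSlidingWindowLimiter(
  limit: number,
  windowMs: number,
  now: () => number = () => Date.now(),
) {
  const hits = new Map<string, number[]>() // key -> timestamps of accepted requests

  return {
    check(key: string): RateLimitResult {
      const current = now()
      // Keep only the requests that still fall inside the window.
      const recent = (hits.get(key) ?? []).filter((t) => current - t < windowMs)
      const allowed = recent.length < limit
      if (allowed) {
        recent.push(current)
      }
      hits.set(key, recent)
      const oldest = recent[0]
      return {
        allowed,
        remaining: Math.max(0, limit - recent.length),
        // A slot frees up when the oldest recorded request leaves the window.
        resetAt: (oldest ?? current) + windowMs,
      }
    },
  }
}
```

In this sketch rejected requests are not recorded, so a client that keeps retrying while limited does not push its own reset time further out.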

trackCost(model, inputTokens, outputTokens, userId?)

Record token usage for budget tracking. Looks up the model in the pricing database and stores the calculated cost.

await armor.trackCost('gpt-4o', 500, 200, 'user-123')

Parameters:

Name	Type	Description
`model`	`string`	Model name (must match pricing database)
`inputTokens`	`number`	Number of input/prompt tokens
`outputTokens`	`number`	Number of output/completion tokens
`userId`	`string?`	Optional user ID for per-user tracking

Returns: Promise<void>

No-op if budget is not configured.

checkBudget(model, ctx)

Check if the current spend is within budget limits.

const result = await armor.checkBudget('gpt-4o', { userId: 'user-123' })
if (!result.allowed) {
  throw new Error(`Budget exceeded: ${result.action}`)
}
const finalModel = result.suggestedModel ?? 'gpt-4o'

Parameters:

Name	Type	Description
`model`	`string`	Model to check budget for
`ctx`	`ArmorContext`	Request context (needs `userId` for per-user checks)

Returns: Promise<{ allowed: boolean, action: string, suggestedModel?: string }>

Field	Description
`allowed`	`true` if request can proceed
`action`	`'pass'`, `'block'`, `'warn'`, or `'downgrade-model'`
`suggestedModel`	Cheaper model to use (only when action is `'downgrade-model'`)

If no budget is configured, returns { allowed: true, action: 'pass' }.
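
The relationship between `onExceeded`, `downgradeMap`, and the returned fields can be summarised as a small decision function. The sketch below is an assumption about the behaviour, not the library's implementation: `decideBudget` is a hypothetical helper, and in particular the choices to allow `'warn'` requests and to block when no downgrade mapping exists are guesses made for illustration.

```typescript
type BudgetAction = 'pass' | 'block' | 'warn' | 'downgrade-model'

interface BudgetDecision {
  allowed: boolean
  action: BudgetAction
  suggestedModel?: string
}

// Hypothetical decision helper; ai-armor's internal logic may differ.
function decideBudget(
  spentUsd: number,
  limitUsd: number,
  onExceeded: 'block' | 'warn' | 'downgrade-model',
  model: string,
  downgradeMap: Record<string, string> = {},
): BudgetDecision {
  if (spentUsd < limitUsd) {
    return { allowed: true, action: 'pass' }
  }
  if (onExceeded === 'block') {
    return { allowed: false, action: 'block' }
  }
  if (onExceeded === 'warn') {
    return { allowed: true, action: 'warn' } // assumed: warn lets the request through
  }
  // 'downgrade-model': suggest the mapped model; assumed to block when no mapping exists.
  const suggestedModel = downgradeMap[model]
  if (suggestedModel === undefined) {
    return { allowed: false, action: 'block' }
  }
  return { allowed: true, action: 'downgrade-model', suggestedModel }
}
```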

resolveModel(model)

Resolve a model alias to its real model name.

const real = armor.resolveModel('fast')
// => 'gpt-4o-mini' (if alias configured)

const passthrough = armor.resolveModel('gpt-4o')
// => 'gpt-4o' (not an alias, returned as-is)

Parameters:

Name	Type	Description
`model`	`string`	Model name or alias

Returns: string

getCachedResponse(request)

Look up a cached response for the given request.

const request = {
  model: 'gpt-4o',
  messages: [{ role: 'user', content: 'Hello' }],
}
const cached = await armor.getCachedResponse(request)
if (cached !== undefined) {
  return cached // Skip API call
}

Parameters:

Name	Type	Description
`request`	`ArmorRequest`	Request to look up

Returns: Promise<unknown | undefined> -- The cached response, or undefined on cache miss.

setCachedResponse(request, response)

Store a response in the cache.

await armor.setCachedResponse(request, response)

Parameters:

Name	Type	Description
`request`	`ArmorRequest`	The request key
`response`	`unknown`	The response to cache

Returns: Promise<void>
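
To illustrate how the `'exact'` strategy, `ttl`, and `maxSize` options could interact, the sketch below implements a small exact-match cache with expiry and least-recently-used eviction. `createExactCache` and `CacheableRequest` are hypothetical names, and deriving the key from `JSON.stringify` of the model and messages is an assumption; a custom `keyFn` would replace that step.

```typescript
// Assumed request shape, modelled on the examples in this reference.
interface CacheableRequest {
  model: string
  messages: Array<{ role: string, content: string }>
}

// Hypothetical exact-match cache; `now` is injectable for testing.
function createExactCache(ttlSeconds: number, maxSize: number, now: () => number = () => Date.now()) {
  const store = new Map<string, { value: unknown, expiresAt: number }>()
  // Assumed key: identical model and messages produce the same string.
  const keyOf = (request: CacheableRequest): string => JSON.stringify([request.model, request.messages])

  return {
    get(request: CacheableRequest): unknown | undefined {
      const key = keyOf(request)
      const entry = store.get(key)
      if (entry === undefined) {
        return undefined
      }
      if (entry.expiresAt <= now()) {
        store.delete(key) // expired: drop it and report a miss
        return undefined
      }
      // Re-insert so this key becomes the most recently used (Map keeps insertion order).
      store.delete(key)
      store.set(key, entry)
      return entry.value
    },
    set(request: CacheableRequest, value: unknown): void {
      const key = keyOf(request)
      store.delete(key)
      store.set(key, { value, expiresAt: now() + ttlSeconds * 1000 })
      if (store.size > maxSize) {
        const oldest = store.keys().next().value // first key is the least recently used
        if (oldest !== undefined) {
          store.delete(oldest)
        }
      }
    },
  }
}
```

Because the key depends on property order inside each message, requests should be built consistently for exact matching to hit.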

log(entry)

Record a log entry.

await armor.log({
  id: crypto.randomUUID(),
  timestamp: Date.now(),
  model: 'gpt-4o',
  provider: 'openai',
  inputTokens: 500,
  outputTokens: 200,
  cost: 0.00325,
  latency: 1234,
  cached: false,
  fallback: false,
  rateLimited: false,
  userId: 'user-123',
})

Parameters:

Name	Type	Description
`entry`	`ArmorLog`	Full log entry

Returns: Promise<void>

getLogs()

Retrieve all stored log entries.

const logs = armor.getLogs()
const totalCost = logs.reduce((sum, l) => sum + l.cost, 0)

Returns: ArmorLog[] -- A copy of all stored log entries.
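
The combination of `maxEntries` and "a copy of all stored log entries" suggests a bounded in-memory buffer. The sketch below shows one way that could look; `createBoundedLog` is a hypothetical helper, and dropping the oldest entries first is an assumption about the eviction order.

```typescript
// Hypothetical bounded log: keeps at most `maxEntries` items, dropping the oldest first.
function createBoundedLog<T>(maxEntries = 10000) {
  const entries: T[] = []
  return {
    add(entry: T): void {
      entries.push(entry)
      if (entries.length > maxEntries) {
        entries.splice(0, entries.length - maxEntries)
      }
    },
    // Return a shallow copy so callers cannot mutate the internal buffer.
    getLogs(): T[] {
      return entries.slice()
    },
  }
}
```

Returning a copy means code such as the `reduce` example above can sort or filter the result without affecting what is stored.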

estimateCost(model, inputTokens, outputTokens)

Calculate the estimated cost for a request without tracking it.

const cost = armor.estimateCost('gpt-4o', 1000, 500)
console.log(`Estimated: $${cost.toFixed(6)}`)

Parameters:

Name	Type	Description
`model`	`string`	Model name
`inputTokens`	`number`	Number of input tokens
`outputTokens`	`number`	Number of output tokens

Returns: number -- Cost in USD. Returns 0 if model is not in the pricing database.
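
The arithmetic behind a cost estimate is a per-token price multiplied by the token counts. The sketch below assumes prices are expressed in USD per one million tokens; the `Pricing` shape, the `assumedPricing` table, and its prices are hypothetical values chosen for illustration, and real figures come from the pricing database.

```typescript
// Hypothetical pricing entry: USD per one million tokens.
interface Pricing {
  inputPerMillion: number
  outputPerMillion: number
}

// Assumed prices for illustration only.
const assumedPricing: Record<string, Pricing> = {
  'gpt-4o': { inputPerMillion: 2.5, outputPerMillion: 10 },
}

function estimate(model: string, inputTokens: number, outputTokens: number): number {
  const pricing = assumedPricing[model]
  if (pricing === undefined) {
    return 0 // mirrors the documented behaviour for unknown models
  }
  const microDollars = inputTokens * pricing.inputPerMillion + outputTokens * pricing.outputPerMillion
  return microDollars / 1000000
}
```

With these assumed prices, 500 input tokens and 200 output tokens come to 0.00325 USD, the same figure used in the `log(entry)` example.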

peekRateLimit(ctx)

Check rate limit status without recording a request. Useful for UI indicators.

const status = await armor.peekRateLimit({ userId: 'user-123' })
if (status.remaining < 5) {
  showWarning('Approaching rate limit')
}

Parameters:

Name	Type	Description
`ctx`	`ArmorContext`	Request context

Returns: Promise<{ remaining: number, resetAt: number }>

getDailyCost(userId?)

Get current daily spend.

const daily = await armor.getDailyCost('user-123')

Parameters:

Name	Type	Description
`userId`	`string?`	Optional user ID for per-user cost

Returns: Promise<number> -- Cost in USD.

getMonthlyCost(userId?)

Get current monthly spend.

const monthly = await armor.getMonthlyCost('user-123')

Parameters:

Name	Type	Description
`userId`	`string?`	Optional user ID for per-user cost

Returns: Promise<number> -- Cost in USD.

getProvider(model)

Get the provider name for a model from the pricing database.

const provider = armor.getProvider('gpt-4o')
// => 'openai'

Parameters:

Name	Type	Description
`model`	`string`	Model name

Returns: string -- Provider name, or 'unknown' if model not found.

checkSafety(request, ctx)

Run safety checks on a request without going through the full pipeline.

const result = armor.checkSafety(
  { model: 'gpt-4o', messages: [{ role: 'user', content: userInput }] },
  { userId: 'user-123' }
)
if (result.blocked) {
  throw new Error(`Blocked: ${result.reason}`)
}

Parameters:

Name	Type	Description
`request`	`ArmorRequest`	Request to check
`ctx`	`ArmorContext`	Request context

Returns: SafetyCheckResult

interface SafetyCheckResult {
  allowed: boolean
  blocked: boolean
  reason: string | null
  details: string[]
}
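
As an illustration of how `blockedPatterns` might map onto this result shape, the sketch below tests each message against each pattern. `checkBlockedPatterns`, the `Message` shape, and the `'blocked-pattern'` reason string are hypothetical; the library's actual reasons and detail messages are not specified here.

```typescript
interface SafetyCheckResult {
  allowed: boolean
  blocked: boolean
  reason: string | null
  details: string[]
}

// Assumed message shape, modelled on the examples in this reference.
interface Message {
  role: string
  content: string
}

// Hypothetical pattern check; not the library's implementation.
function checkBlockedPatterns(messages: Message[], blockedPatterns: RegExp[]): SafetyCheckResult {
  const details: string[] = []
  for (const message of messages) {
    for (const pattern of blockedPatterns) {
      // Patterns with the g or y flag keep state in lastIndex; reset it so repeated calls agree.
      pattern.lastIndex = 0
      if (pattern.test(message.content)) {
        details.push(`Matched /${pattern.source}/ in ${message.role} message`)
      }
    }
  }
  const blocked = details.length > 0
  return {
    allowed: !blocked,
    blocked,
    reason: blocked ? 'blocked-pattern' : null,
    details,
  }
}
```

Resetting `lastIndex` matters because a shared `RegExp` with the global flag would otherwise alternate between matching and not matching the same input.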

executeFallback(request, handler)

Execute a request through the fallback chain.

const request = {
  model: 'gpt-4o',
  messages: [{ role: 'user', content: 'Hello' }],
}
const result = await armor.executeFallback(request, async (model) => {
  // callAI stands in for your own provider call
  return await callAI(model, request.messages)
})
if (result.fallbackUsed) {
  console.warn(`Used fallback model: ${result.model}`)
}

Parameters:

Name	Type	Description
`request`	`ArmorRequest`	Request with primary model
`handler`	`(model: string) => Promise<T>`	Function that calls the AI provider

Returns: Promise<FallbackResult<T>>

interface FallbackResult<T> {
  result: T
  model: string
  attempts: number
  fallbackUsed: boolean
}
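
To show how `chains` and `retries` could combine into these result fields, the sketch below walks a chain in order and counts attempts. `runWithFallback` is a hypothetical stand-in rather than the library's implementation; it assumes `retries` means extra attempts per model and omits timeouts and backoff delays for brevity.

```typescript
interface FallbackResult<T> {
  result: T
  model: string
  attempts: number
  fallbackUsed: boolean
}

// Hypothetical stand-in for executeFallback: try the primary model, then each model in its chain.
async function runWithFallback<T>(
  primary: string,
  chains: Record<string, string[]>,
  retries: number,
  handler: (model: string) => Promise<T>,
): Promise<FallbackResult<T>> {
  const candidates = [primary, ...(chains[primary] ?? [])]
  let attempts = 0
  let lastError: unknown = new Error('No models to try')

  for (const model of candidates) {
    // One initial attempt plus `retries` extra attempts per model (assumed meaning).
    for (let attempt = 0; attempt <= retries; attempt++) {
      attempts++
      try {
        const result = await handler(model)
        return { result, model, attempts, fallbackUsed: model !== primary }
      } catch (error) {
        lastError = error // remember the failure and keep going
      }
    }
  }
  throw lastError // every model in the chain failed
}
```

A production version would normally wait between retries according to the configured `backoff` and abort a call once `timeout` elapses.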

Additional Exports

The ai-armor package also exports pricing, storage, and caching utilities:

import {
  addModel,
  calculateCost,
  cosineSimilarity,
  createPricingRegistry,
  createRedisAdapter,
  createSemanticCache,
  getAllModels,
  getModelPricing,
  getProvider,
  pricingDatabase,
  registerModels,
  removeModel,
  resetPricing,
  updateModel,
} from 'ai-armor'

Function	Description
`calculateCost(model, inputTokens, outputTokens)`	Calculate cost in USD
`getModelPricing(model)`	Get pricing info for a model
`getAllModels()`	List all models in the pricing database
`getProvider(model)`	Get the provider name for a model
`addModel(pricing)`	Add a custom model to the pricing registry
`updateModel(model, updates)`	Update pricing for an existing model
`removeModel(model)`	Remove a model from the registry
`resetPricing()`	Reset all custom models, restore defaults
`registerModels(models)`	Bulk add multiple models
`createPricingRegistry(initialModels?)`	Create an isolated pricing instance
`createRedisAdapter(client, options?)`	Built-in Redis storage adapter for rate limiting and cost tracking
`createSemanticCache(config)`	Create a standalone semantic cache instance
`cosineSimilarity(a, b)`	Compute cosine similarity between two vectors
`pricingDatabase`	Built-in model pricing dataset (`ModelPricing[]`)

Type Exports

Type	Description
`RedisAdapterOptions`	Options for `createRedisAdapter` (prefix, ttl)
`RedisLike`	Interface for Redis client compatibility
`PricingRegistry`	Pricing registry type

Related

Types Reference -- All interface definitions
AI SDK Integration -- Middleware adapter
Getting Started -- Quick start guide
