/fluxloop:test

Run agent tests with scenarios from FluxLoop Web.

Overview

The /fluxloop:test command:

Pulls test scenarios from FluxLoop Web (if needed)
Runs your instrumented agent with synthetic inputs
Captures traces and results
Uploads results to FluxLoop Web
Displays summary and analysis link

Basic Usage

/fluxloop:test

This runs the full test suite for the current scenario.

Options

--scenario

Specify which scenario to test:

/fluxloop:test --scenario customer-onboarding

--skip-upload

Run tests locally without uploading results:

/fluxloop:test --skip-upload

Useful for:

Quick local validation
Testing without internet connection
Development without FluxLoop account

--smoke

Run a quick smoke test (subset of inputs):

/fluxloop:test --smoke

Typically tests 5-10 inputs for rapid feedback.

--full

Run comprehensive test with all iterations:

/fluxloop:test --full

--quiet

Minimize output for CI/CD:

/fluxloop:test --quiet

Prerequisites

Before running tests, ensure:

Authenticated: Run /fluxloop:setup first
Project selected: Current project configured
Inputs available: Scenarios pulled from Web

Check status:

/fluxloop:status

How It Works

Step 1: Pull Scenarios

If scenarios aren't already local, the command automatically pulls them:

🔄 Pulling test scenarios from FluxLoop Web...
✓ Pulled 50 test inputs for scenario: customer-support

Step 2: Run Tests

Executes your agent with each test input:

🧪 Running tests...
▓▓▓▓▓▓░░░░ 25/50 (50%) - ETA: 1m 30s

Step 3: Upload Results

Sends traces and metrics to FluxLoop Web:

📤 Uploading results to FluxLoop Web...
✓ Results uploaded successfully

Step 4: Show Summary

Displays test summary and link to detailed analysis:

📊 Results:
- Total: 50 traces
- Success: 47 (94%)
- Failed: 3 (6%)
- Avg Duration: 2.3s

🌐 View detailed analysis:
https://alpha.app.fluxloop.ai/projects/abc123/results/xyz789

Examples

Quick Development Test

# Fast feedback during development
/fluxloop:test --smoke

Pre-Commit Validation

# Full test before committing
/fluxloop:test

Local Testing Only

# Test without uploading
/fluxloop:test --skip-upload

Specific Scenario

# Test specific workflow
/fluxloop:test --scenario error-handling

Workflow Integration

Development Loop

1. Make changes to agent code
   ↓
2. Run: /fluxloop:test --smoke
   ↓
3. Review results
   ↓
4. Iterate
   ↓
5. Before commit: /fluxloop:test

Before Deployment

# Comprehensive testing
/fluxloop:test --full

# Review in Web
# Check all scenarios passed
# Verify performance metrics

Output Details

Success Summary

📊 Test Results:
✓ Passed: 47/50 (94%)
✗ Failed: 3/50 (6%)

⚡ Performance:
- Avg Duration: 2.3s
- Max Duration: 5.1s
- Total Time: 2m 15s

🎯 Coverage:
- Personas: 5/5 tested
- Scenarios: customer-support

🔗 Details: https://alpha.app.fluxloop.ai/projects/.../results/...

Failure Details

✗ Failed Traces (3):
1. [novice_user] "How do I reset my password?"
   Error: Timeout after 30s

2. [expert_user] "What's the API rate limit?"
   Error: Agent returned null

3. [frustrated_user] "This doesn't work!"
   Error: Exception in tool call

🔍 View full details: [link to Web]

Troubleshooting

"Scenario not found"

# List available scenarios
/fluxloop:status

# Pull specific scenario
/fluxloop:pull --scenario <name>

# Retry test
/fluxloop:test --scenario <name>

"No inputs found"

Scenarios haven't been pulled yet:

# Pull from Web
/fluxloop:pull

# Then test
/fluxloop:test

"Authentication error"

Re-authenticate:

# Re-authenticate
/fluxloop:setup --force-login

"Agent not instrumented"

Ensure your agent function has the @fluxloop.agent() decorator:

import fluxloop

@fluxloop.agent()
def my_agent(input: str) -> str:
    # Your logic
    return response

Slow Tests

For faster iteration:

# Use smoke test
/fluxloop test --smoke

# Or reduce inputs in FluxLoop Web

Advanced Usage

Custom Configuration

Override simulation config:

# Specify custom runner
/fluxloop:test --runner custom_module:custom_function

# Increase timeout
/fluxloop:test --timeout 60

Multiple Scenarios

Test all scenarios in a project:

# List scenarios
/fluxloop:status

# Test each one
/fluxloop:test --scenario scenario1
/fluxloop:test --scenario scenario2

Exit Codes

Code	Meaning
0	All tests passed
1	Some tests failed
2	Configuration error
3	Authentication error

/fluxloop:pull - Pull test scenarios
/fluxloop:upload - Upload results manually
/fluxloop:status - Check project status
/fluxloop:criteria - View success criteria

Overview​

Basic Usage​

Options​

--scenario​

--skip-upload​

--smoke​

--full​

--quiet​

Prerequisites​

How It Works​

Step 1: Pull Scenarios​

Step 2: Run Tests​

Step 3: Upload Results​

Step 4: Show Summary​

Examples​

Quick Development Test​

Pre-Commit Validation​

Local Testing Only​

Specific Scenario​

Workflow Integration​

Development Loop​

Before Deployment​

Output Details​

Success Summary​

Failure Details​

Troubleshooting​

"Scenario not found"​

"No inputs found"​

"Authentication error"​

"Agent not instrumented"​

Slow Tests​

Advanced Usage​

Custom Configuration​

Multiple Scenarios​

Exit Codes​

Related Commands​

See Also​