/fluxloop:test
Run agent tests with scenarios from FluxLoop Web.
Overview
The /fluxloop:test command:
- Pulls test scenarios from FluxLoop Web (if needed)
- Runs your instrumented agent with synthetic inputs
- Captures traces and results
- Uploads results to FluxLoop Web
- Displays summary and analysis link
Basic Usage
/fluxloop:test
This runs the full test suite for the current scenario.
Options
--scenario
Specify which scenario to test:
/fluxloop:test --scenario customer-onboarding
--skip-upload
Run tests locally without uploading results:
/fluxloop:test --skip-upload
Useful for:
- Quick local validation
- Testing without internet connection
- Development without FluxLoop account
--smoke
Run a quick smoke test (subset of inputs):
/fluxloop:test --smoke
Typically tests 5-10 inputs for rapid feedback.
--full
Run comprehensive test with all iterations:
/fluxloop:test --full
--quiet
Minimize output for CI/CD:
/fluxloop:test --quiet
Prerequisites
Before running tests, ensure:
- Authenticated: Run
/fluxloop:setupfirst - Project selected: Current project configured
- Inputs available: Scenarios pulled from Web
Check status:
/fluxloop:status
How It Works
Step 1: Pull Scenarios
If scenarios aren't already local, the command automatically pulls them:
🔄 Pulling test scenarios from FluxLoop Web...
✓ Pulled 50 test inputs for scenario: customer-support
Step 2: Run Tests
Executes your agent with each test input:
🧪 Running tests...
▓▓▓▓▓▓░░░░ 25/50 (50%) - ETA: 1m 30s
Step 3: Upload Results
Sends traces and metrics to FluxLoop Web:
📤 Uploading results to FluxLoop Web...
✓ Results uploaded successfully
Step 4: Show Summary
Displays test summary and link to detailed analysis:
📊 Results:
- Total: 50 traces
- Success: 47 (94%)
- Failed: 3 (6%)
- Avg Duration: 2.3s
🌐 View detailed analysis:
https://alpha.app.fluxloop.ai/projects/abc123/results/xyz789
Examples
Quick Development Test
# Fast feedback during development
/fluxloop:test --smoke
Pre-Commit Validation
# Full test before committing
/fluxloop:test
Local Testing Only
# Test without uploading
/fluxloop:test --skip-upload
Specific Scenario
# Test specific workflow
/fluxloop:test --scenario error-handling
Workflow Integration
Development Loop
1. Make changes to agent code
↓
2. Run: /fluxloop:test --smoke
↓
3. Review results
↓
4. Iterate
↓
5. Before commit: /fluxloop:test
Before Deployment
# Comprehensive testing
/fluxloop:test --full
# Review in Web
# Check all scenarios passed
# Verify performance metrics
Output Details
Success Summary
📊 Test Results:
✓ Passed: 47/50 (94%)
✗ Failed: 3/50 (6%)
⚡ Performance:
- Avg Duration: 2.3s
- Max Duration: 5.1s
- Total Time: 2m 15s
🎯 Coverage:
- Personas: 5/5 tested
- Scenarios: customer-support
🔗 Details: https://alpha.app.fluxloop.ai/projects/.../results/...
Failure Details
✗ Failed Traces (3):
1. [novice_user] "How do I reset my password?"
Error: Timeout after 30s
2. [expert_user] "What's the API rate limit?"
Error: Agent returned null
3. [frustrated_user] "This doesn't work!"
Error: Exception in tool call
🔍 View full details: [link to Web]
Troubleshooting
"Scenario not found"
# List available scenarios
/fluxloop:status
# Pull specific scenario
/fluxloop:pull --scenario <name>
# Retry test
/fluxloop:test --scenario <name>
"No inputs found"
Scenarios haven't been pulled yet:
# Pull from Web
/fluxloop:pull
# Then test
/fluxloop:test
"Authentication error"
Re-authenticate:
# Re-authenticate
/fluxloop:setup --force-login
"Agent not instrumented"
Ensure your agent function has the @fluxloop.agent() decorator:
import fluxloop
@fluxloop.agent()
def my_agent(input: str) -> str:
# Your logic
return response
Slow Tests
For faster iteration:
# Use smoke test
/fluxloop test --smoke
# Or reduce inputs in FluxLoop Web
Advanced Usage
Custom Configuration
Override simulation config:
# Specify custom runner
/fluxloop:test --runner custom_module:custom_function
# Increase timeout
/fluxloop:test --timeout 60
Multiple Scenarios
Test all scenarios in a project:
# List scenarios
/fluxloop:status
# Test each one
/fluxloop:test --scenario scenario1
/fluxloop:test --scenario scenario2
Exit Codes
| Code | Meaning |
|---|---|
| 0 | All tests passed |
| 1 | Some tests failed |
| 2 | Configuration error |
| 3 | Authentication error |
Related Commands
/fluxloop:pull- Pull test scenarios/fluxloop:upload- Upload results manually/fluxloop:status- Check project status/fluxloop:criteria- View success criteria
See Also
- Setup Guide - Initial configuration
- Integration Workflow - Full development workflow
- CLI test command - Standalone CLI equivalent