Commit History
feat: Add real job submission for HuggingFace Jobs and Modal
4db4e9d
feat: Integrate prompt template generation into synthetic data workflow
97b162d
fix: Make Response Time chart consistent with other per-test metrics
fb44126
fix: Add missing settings_screen and settings_nav_btn to app.load outputs
7a791eb
fix: Add missing settings_screen to back_to_leaderboard button outputs
3c3b06c
fix: Complete settings_screen integration in navigation outputs
6559dd0
fix: Settings screen return value - return only interface not tuple
fe72fcb
feat: Add Settings screen for API key configuration
dab7275
fix: Add missing documentation_screen to navigation outputs
7573aeb
feat: Add comprehensive documentation screen with 4-tab ecosystem guide
abb32f1
fix: Add missing outputs to app.load() for dashboard
698e93f
fix: Update auto-selection to match MCP server defaults
6b05c3d
feat: Add infrastructure-specific hardware options (Modal vs HF Jobs)
0b0eb36
feat: Add comprehensive GPU hardware options from Modal
055c400
fix: Use Markdown component for cost estimates (no markdown lib needed)
9704693
fix: Handle MCP cost estimator markdown response properly
0b78b24
fix: Improve cost estimation MCP fallback error handling
5c7c72c
refactor: Simplify New Evaluation screen for hackathon
ccceff6
feat: Improve New Evaluation screen hardware selection
0b870a2
feat: Complete New Evaluation screen with full navigation and MCP integration
14c0bae
feat: Add New Evaluation screen with MCP cost estimation (WIP)
7addd50
feat: Add Synthetic Data Generator screen with MCP server integration
49ead1e
Fix analyze_results dropdown choices to match MCP server
79e0ac2
Disable SSR mode to fix MCP client on HuggingFace Spaces
23ca4b9
Wire MCP server tools to UI screens
315aa68
feat: Add agent reasoning and tool call execution display to chat
4e4722b
docs: Add open source foundation section to README and app.py
8dccf7d
Update leaderboard accordion to mention clickable rows
e7b14e6
Fix NameError: comment out drilldown data loading on app.load
3dcbfe7
Committed by Mandark-droid
Comment out DrillDown tab (replaced by clickable By Model tab)
26a1db5
Fix HTMLPlus event handler to use camelCase keys
d87972c
Fix HTMLPlus data attributes generation to match reference implementation
dafc8f1
Implement HTMLPlus for clickable leaderboard rows in By Model tab
7f90c34
Move GPU metrics from Trace Detail to Run Detail screen
0fca968
Add Agent Chat screen with smolagents MCP integration (Track 2)
908be6c
Add info parameters to all UI components
4dc8a59
Add comparison report card feature to compare screen
a50320a
Move run report card to Overview tab to match MockTraceMind layout
a3b9254
Add Run Report Card feature and fix event handler issues
5a775ac
Make Analytics accordion dynamically update with exact MockTraceMind content
d9a086c
Add user guide accordions to all leaderboard tabs
daacf12
Add chart explanation accordion and enhance Analytics tab
bf61933
Fix compare screen to handle duplicate run_ids using composite keys
60b7b04
Fix filter functionality and align with MockTraceMind structure
920ea09
Add compare screen for side-by-side run comparison
5c51b47
Add dashboard screen with aggregate statistics and recent runs
3138502
Fix TypeError: unhashable type 'list' - resolve variable name collision
f0d23ea
Fix AttributeError: Extract span data directly instead of using gr.JSON component
5828eb5
Fix outputs mismatch: Add performance_charts to event wiring
856817d