Commit History

Replace Novita AI with ZeroGPU Chat API (RunPod)
0747201

JatsTheAIGen committed on

Fix indentation issues in prepare_context_for_llm and _truncate_to_tokens methods
bd329bc

JatsTheAIGen committed on

FINAL FIX: Correct except clause indentation - ensure proper alignment with try statement
f608c70

JatsTheAIGen committed on

Fix return statement and else clause indentation in response handling
043d8e9

JatsTheAIGen committed on

Fix else clause indentation at line 153 - align with if stream
21a18e3

JatsTheAIGen committed on

Fix else clause indentation in response handling
a67b5f4

JatsTheAIGen committed on

Fix raise statement indentation in exception handler
44b41ba

JatsTheAIGen committed on

CRITICAL FIX: Correct except clause indentation at line 58 - align with try statement
dc8667d

JatsTheAIGen committed on

Add logging for context window configuration and improve max_tokens calculation debugging
ca77f38

JatsTheAIGen committed on

Update token allocation for DeepSeek R1 128K context window
- Set NOVITA_MODEL_CONTEXT_WINDOW to 128000 (128K tokens)
- Increase USER_INPUT_MAX_TOKENS to 32000
- Increase CONTEXT_PREPARATION_BUDGET to 115000
- Increase CONTEXT_PRUNING_THRESHOLD to 115000
- Update validators to support larger token limits
5d37f3d

JatsTheAIGen committed on

Fix IndentationError at line 378: correct indentation in _truncate_to_tokens method
9d31b94

JatsTheAIGen committed on

CRITICAL FIX: Correct except clause indentation at line 57 - align with try statement
6c81140

JatsTheAIGen committed on

Fix raise statement indentation
b21bcac

JatsTheAIGen committed on

Fix IndentationError: correct else clause alignment - ensure else aligns with if stream
5599138

JatsTheAIGen committed on

Fix IndentationError: correct else clause alignment at line 152
074e6cc

JatsTheAIGen committed on

Fix critical syntax error: correct except clause indentation at line 57
82b52b6

JatsTheAIGen committed on

Clean up try-except block formatting
e96d039

JatsTheAIGen committed on

Fix encoding issue: replace special character in logger message
e4c5683

JatsTheAIGen committed on

Fix except clause indentation in llm_router.py
d14bd01

JatsTheAIGen committed on

Fix syntax errors in llm_router.py
- Fix indentation issues in try/except blocks
- Fix else clause alignment
bb2bbad

JatsTheAIGen committed on

Fix max_tokens calculation to respect model context window (5000 tokens)
- Add dynamic max_tokens calculation based on input size
- Add novita_model_context_window configuration
- Prevents 400 errors when input tokens exceed available output space
a9135e0

JatsTheAIGen committed on

Fix max_tokens calculation to respect model context window
42ed1a7

JatsTheAIGen committed on

Integrate Novita AI as exclusive inference provider
- Add Novita AI API integration with DeepSeek-R1-Distill-Qwen-7B model
- Remove all local model dependencies
- Optimize token allocation for user inputs and context
- Add Anaconda environment setup files
- Add comprehensive test scripts and documentation
927854c

JatsTheAIGen committed on

Fix: DynamicCache compatibility, dependencies, and Docker configuration
ea87e33

JatsTheAIGen committed on

Fix: GPU tensor placement and Docker deployment configuration
67c580c

JatsTheAIGen committed on

Fix: BitsAndBytes compatibility and error handling
13fa6c4

JatsTheAIGen committed on

Fix: Cache directory permissions and gated repository handling
9959ea9

JatsTheAIGen committed on

Phase 1: Remove HF API inference - Local models only
5787d0a

JatsTheAIGen committed on

Fix infinite fallback loop in local model loading
8d4bf4a

JatsTheAIGen committed on

Add initialization diagnostics and background initialization
e2eb926

JatsTheAIGen committed on

Improve orchestrator initialization error handling and diagnostics
1b4b874

JatsTheAIGen committed on

Update model IDs to use Cerebras deployment and add gated repository error handling
b3aba24

JatsTheAIGen committed on

Security Enhancements: Production WSGI, Rate Limiting, Security Headers, Secure Logging
79ea999

JatsTheAIGen committed on

Add context mode endpoints (fresh vs relevant) and update API documentation
c3a42ce

JatsTheAIGen committed on

Add Express.js integration guide with URL troubleshooting and input validation
91c95f0

JatsTheAIGen committed on

Add HF Spaces URL format guide with troubleshooting
61782c5

JatsTheAIGen committed on

Fix API documentation URLs: use correct HF Spaces .hf.space domain format and add troubleshooting section
40b87ab

JatsTheAIGen committed on

Update README with API documentation reference
3f8cddc

JatsTheAIGen committed on

Add comprehensive Flask API documentation for integration
50ec2c4

JatsTheAIGen committed on

Fix UnboundLocalError: remove redundant os import in exception handler
48f2898

JatsTheAIGen committed on

Set DB_PATH environment variable in Dockerfile for explicit database location
1d056ab

JatsTheAIGen committed on

Improve database path handling with Docker detection and fallback to /tmp
a4d66c0

JatsTheAIGen committed on

Add input validation for chat endpoint - length limits and type checking
96e6d20

JatsTheAIGen committed on

Merge branch 'main' of https://huggingface.co/spaces/JatinAutonomousLabs/Research_AI_Assistant_API
e776b5f

JatsTheAIGen committed on

Fix database path issue for Docker/HF Spaces - use /tmp for writable directory
4e3f60e

JatsTheAIGen committed on

Initial commit: Research AI Assistant API
8f4d405

JatsTheAIGen committed on