Spaces:

lemms
/

openllm

Runtime error

App Files Files Community

openllm / README.md

lemms

Update README for Gradio 4.44.1 and complete training features

d185e19 verified 5 months ago

preview code

raw

history blame

2.81 kB

metadata

title: OpenLLM Training Space
emoji: 🚀
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: false
license: gpl-3.0

OpenLLM Training Space

This space provides complete training infrastructure for OpenLLM models with real model training functionality.

Features

🎯 Real Model Training: Actual PyTorch training with Transformers
📊 Training Monitoring: Live progress tracking and loss monitoring
🔄 Model Versioning: Automatic model saving and uploading to HF Hub
📈 Performance Tracking: Training metrics and completion status
🚀 Gradio 4.44.1: Latest UI framework with enhanced compatibility

Complete Training Pipeline

What Happens When You Click "Start Training":

📥 Model Loading: Loads the 7k model from lemms/openllm-small-extended-7k
📊 Dataset Preparation: Loads and tokenizes training data from lemms/openllm-training-data
⚙️ Training Setup: Configures PyTorch Trainer with your parameters
🚀 Real Training: Executes actual model training for specified steps
💾 Save & Upload: Saves trained model and uploads to HF Hub as lemms/openllm-{size}-extended-8k

Training Configuration Options:

Model Size: small, medium, large (currently supports small)
Max Steps: 100-10,000 training iterations
Learning Rate: 0.00001-0.001 (configurable)
Batch Size: 1-16 samples per batch

Expected Results:

Training Time: 10-30 minutes for 1000 steps (depending on HF Space resources)
Output Model: lemms/openllm-small-extended-8k (or other sizes)
Model Files: Complete PyTorch model with tokenizer and configuration

Model Repositories

Technical Details

Framework: PyTorch with Transformers
UI: Gradio 4.44.1 (latest stable version)
Training: Mixed precision (FP16) for efficiency
Memory: Optimized for HF Spaces with gradient accumulation
Dependencies: Complete ML stack with all training utilities

Usage

Configure Parameters: Set model size, steps, learning rate, and batch size
Start Training: Click "Start Training" to begin the complete pipeline
Monitor Progress: Watch real-time status updates and training progress
Access Results: Find your trained model in the HF Hub repository

License

GPL-3.0 - See LICENSE for details.

Author

Louis Chua Bean Chong - OpenLLM Project Maintainer