Improve infill prompt and add fallback for non-JSON model outputs 0dfa1e4 Patryk Studzinski commited on 3 days ago
Fix: Improve infill prompt to prevent model from copying template 2de7977 Patryk Studzinski commited on 3 days ago
Fix: Remove unsupported use_xformers_attention parameter 9153886 Patryk Studzinski commited on 3 days ago
Fix: Use direct model.generate() with proper KV caching instead of pipeline eaa2e37 Patryk Studzinski commited on 3 days ago
Add KV caching and batch processing optimizations for 5-10x speedup ab2e415 Patryk Studzinski commited on 3 days ago
Improve Polish grammar in infill prompt + remove debug logs 14fc89e Patryk Studzinski commited on 11 days ago
Fix: Handle double-escaped JSON in infill parser + add debug logging 6cc98f9 Patryk Studzinski commited on 11 days ago