About AutoInfra
Specialized inference optimization for codegen
Fast Open-Source models: Deepseek v4 flash, qwen 397b, at 200+ tps
Specialized models for applying edits, code search, compaction, and model routing
Fast Open-Source models: Deepseek v4 flash, qwen 397b, at 200+ tps
Specialized models for applying edits, code search, compaction, and model routing










