BLOGS

Fine-tuning Large Language Models with Mini-Sequence Technology and Distributed Training

Extending LLAMA Training Context with Mini-Sequence Technology

Extending Mistral Training Context with Mini-Sequence Technology

Extending Qwen Training Context with Mini-Sequence Technology

Extending gemma2 Training with Mini-Sequence Technology

Revolutionizing LLM Training with Mini-Sequence Technology