Finetune Mistral 7b on Colab Free Version (Jun 7, 2024)
Clearly Mistral 7b is too big to train on a single T4 15GB GPU provided free by Google. But there are ways to squeeze it in. Let's use a…
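For a sense of scale, here is a rough back-of-the-envelope memory calculation (my own illustrative numbers, not taken from the post; the parameter count and the 4-bit figure are approximations):

```python
# Rough memory arithmetic for why Mistral 7B does not fit on a free T4 in full precision.
# The 4-bit figure assumes NF4-style quantization as used by QLoRA; values are approximate.
params = 7.24e9  # approximate Mistral 7B parameter count

bytes_per_param = {"float32": 4, "float16": 2, "int8": 1, "4-bit": 0.5}
for dtype, nbytes in bytes_per_param.items():
    gb = params * nbytes / 1024**3
    print(f"{dtype:>8}: ~{gb:.1f} GB just for the weights")

# float32 ~27 GB, float16 ~13.5 GB, 4-bit ~3.4 GB.
# Training also needs gradients and optimizer state, which is why 4-bit loading
# plus small LoRA adapters is the usual way to fit finetuning on a 15 GB T4.
```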
Finetuning Mistral 7b using Unsloth (Jun 4, 2024)
Unsloth is a project that lets you finetune Llama 3, Mistral, Gemma, and other large language models with less memory and time.
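A minimal sketch of the Unsloth loading pattern (the model name and LoRA hyperparameters below are illustrative, not the post's exact settings):

```python
from unsloth import FastLanguageModel

# Load a pre-quantized 4-bit Mistral 7B checkpoint.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/mistral-7b-bnb-4bit",  # illustrative checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small set of weights is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,              # LoRA rank
    lora_alpha=16,
    lora_dropout=0,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
```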
Transformers and its Trainer (Jun 3, 2024)
The Trainer class provides an API for feature-complete training in PyTorch, and it supports distributed training on multiple GPUs/TPUs…
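A minimal Trainer setup to show the shape of the API (the dataset, model, and hyperparameters here are placeholders):

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

# Toy example: finetune a small classifier on a slice of IMDB.
dataset = load_dataset("imdb")
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256)

tokenized = dataset.map(tokenize, batched=True)
model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2)

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=16,
    num_train_epochs=1,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"].shuffle(seed=42).select(range(2000)),
    eval_dataset=tokenized["test"].select(range(500)),
)
trainer.train()
```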
Loading an LLM in 4 bits using bitsandbytes (May 31, 2024)
bitsandbytes allows us to load large models in low-resource environments. Typically, the weights and biases in an LLM are in float32 format…
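A sketch of 4-bit loading through the transformers integration of bitsandbytes (the model name and dtype choices are assumptions for illustration):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# NF4 4-bit quantization config, as popularized by QLoRA.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # normalized float 4
    bnb_4bit_compute_dtype=torch.float16,   # T4 GPUs lack bfloat16, so compute in fp16
    bnb_4bit_use_double_quant=True,         # also quantize the quantization constants
)

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",            # illustrative model
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
```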
Quantization using bitsandbytes (May 29, 2024)
'bitsandbytes' is a tool to reduce model size using 8-bit and 4-bit quantization. This improves memory usage and can help fit large models…
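For the 8-bit side, a minimal sketch (the model name is an arbitrary small example, not from the post):

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 8-bit loading roughly halves memory versus float16.
model_8bit = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-1.3b",  # illustrative small model
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)

# get_memory_footprint() reports the loaded model's memory use in bytes.
print(f"8-bit footprint: {model_8bit.get_memory_footprint() / 1024**3:.2f} GB")
```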