Feature/hyperparameter tuning final #267

Saf9933 · 2024-10-29T11:10:07Z

Design

Implement hyperparameter tuning for the mLoRA project to optimize model parameters and improve task performance.
Files Modified
mlora_train_optuna.py
Purpose: This script introduces Optuna for automated hyperparameter tuning, focusing on parameters such as rank, alpha, learning_rate, and dropout.
Key Changes:
Objective Function: Defines an Optuna objective function that tests combinations of hyperparameters to minimize loss.
Task Configuration: Dynamically generates a task configuration using the tuned hyperparameters.
Execution: Sets up an Optuna study to run multiple trials, logging the best hyperparameters for improved model performance.
edited executor.py
Purpose: Manages task execution, model loading, and adapter application during training.
Key Changes:
Loss Tracking: Enhanced to log loss values for each task during training, integrating seamlessly with the hyperparameter tuning process.
Hooks: Added and refined hooks (init, running, ready, done, terminate) for better control over model adapter loading and task status.
Summary
These changes streamline and automate the process of finding optimal hyperparameters, improving model performance with minimal manual intervention. The integration with Optuna in mlora_train_optuna.py complements executor.py’s enhanced task management and loss tracking to support efficient tuning workflows.

yezhengmao1 · 2024-10-29T11:22:01Z

mlora_train_optuna.py

+
+    if args.base_model == "tinyllama":
+        logging.info("Using TinyLlama_v1.1 model for testing")
+        args.base_model = "/model/TinyLlama_v1.1"


do not use hard code

yezhengmao1 · 2024-10-29T11:23:15Z

mlora_train_optuna.py

+import os
+from typing import Dict
+
+import optuna


add optuna to requirements.txt

yezhengmao1 · 2024-10-29T11:24:12Z

mlora_train_optuna.py

+from mlora.config.task import TrainTaskConfig
+
+# Set up logging
+logging.basicConfig(


redunant code

Saf9933 added 4 commits October 14, 2024 10:28

Implemented auto hyperparameter search using Optuna

4601177

Add mlora_train_optuna.py to main

412987e

Temporary commit for mlora_train_optuna.py before branch switch

0f79011

Final updates: hyperparameter tuning implementation and adjustments

a8c53eb

yezhengmao1 requested changes Oct 30, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/hyperparameter tuning final #267

Feature/hyperparameter tuning final #267

Saf9933 commented Oct 29, 2024 •

edited

Loading

yezhengmao1 Oct 29, 2024

yezhengmao1 Oct 29, 2024

yezhengmao1 Oct 29, 2024

Feature/hyperparameter tuning final #267

Are you sure you want to change the base?

Feature/hyperparameter tuning final #267

Conversation

Saf9933 commented Oct 29, 2024 • edited Loading

Design

yezhengmao1 Oct 29, 2024

Choose a reason for hiding this comment

yezhengmao1 Oct 29, 2024

Choose a reason for hiding this comment

yezhengmao1 Oct 29, 2024

Choose a reason for hiding this comment

Saf9933 commented Oct 29, 2024 •

edited

Loading