Commit d2ff109

vkkhare and Varun Khare authored
Add offline dataset generation (#39)
* add offline dataset gen
  Signed-off-by: Varun Khare <varun.khare@nimbleedgehq.ai>
* remove type hints for dynamic classes
  Signed-off-by: Varun Khare <varun.khare@nimbleedgehq.ai>
* add evaluation logic for predictors
  Signed-off-by: Varun Khare <varun.khare@nimbleedgehq.ai>
* add batch processing in dataset gen
  Signed-off-by: Varun Khare <varun.khare@nimbleedgehq.ai>
* random sampling for sequence length
  Signed-off-by: Varun Khare <varun.khare@nimbleedgehq.ai>
* add dataset append for memory efficiency
  Signed-off-by: Varun Khare <varun.khare@niimbleedgehq.ai>
* add streaming support for training
  Signed-off-by: Varun Khare <varun.khare@niimbleedgehq.ai>
* increase defaults for dataset generation
  Signed-off-by: Varun Khare <varun.khare@niimbleedgehq.ai>
* rename files and enable full dataset loading
  Signed-off-by: Varun Khare <varun.khare@niimbleedgehq.ai>
* add resume checkpointing
  Signed-off-by: Varun Khare <varun.khare@niimbleedgehq.ai>

---------

Signed-off-by: Varun Khare <varun.khare@nimbleedgehq.ai>
Signed-off-by: Varun Khare <varun.khare@niimbleedgehq.ai>
Co-authored-by: Varun Khare <varun.khare@niimbleedgehq.ai>
1 parent d70d2b6 commit d2ff109

18 files changed, +1555 -1092 lines changed

.gitignore

Lines changed: 2 additions & 0 deletions
@@ -21,6 +21,7 @@ wheels/
 .installed.cfg
 *.egg
 .cursorrules
+trained_predictors/
 wandb/
 # CUDA
 *.i
@@ -87,6 +88,7 @@ coverage.xml
 # Logs and databases
 *.sqlite
 *.db
+data/
 
 # OS generated files
 .DS_Store

README.md

Lines changed: 1 addition & 1 deletion
@@ -70,7 +70,7 @@ Sparse LLaMA 3.2 3B vs Standard LLaMA 3.2 3B CUDA Results (on HuggingFace Implem
 ```bash
 # Run comprehensive benchmark
 
-python run_benchmark.py \
+python benchmark.py \
 --device cpu \ # Device: 'cpu' or 'cuda'
 --config configs/llama_skip_causal_3b.json \ # Model configuration
 --num_runs 50 \ # Number of benchmark runs
File renamed without changes.
