NexaCompute Quick Start Guide
Todayβs Mission (2025-11-04)
Generate full dataset (10k-100k samples) and attempt training on 2ΓA100.π Quick Commands
1. Validate Environment
2. Run QC Batch (1k samples, ~10 min)
3. Run Full Generation (100k samples, ~16-18 hrs)
4. Apply Filtering
5. Launch Training
6. Complete Pipeline (automated)
π Tmux Cheat Sheet
| Command | Description |
|---|---|
tmux ls | List sessions |
tmux attach -t SESSION | Attach to session |
Ctrl+B then D | Detach from session |
tmux kill-session -t SESSION | Kill session |
Ctrl+B then [ | Enter scroll mode |
q | Exit scroll mode |
π Key Files
Configurations
batches/teacher_gen_v1.yaml- Generation confignexa_train/configs/baseline_qlora.yaml- Training config
Scripts
scripts/tmux_data_gen.sh- Data generation launcherscripts/tmux_training.sh- Training launcherscripts/run_full_pipeline.sh- Complete pipelinescripts/validate_environment.sh- Environment check
Core Modules
nexa_eval/rubrics/judge_f.py- Factual accuracy judgenexa_eval/rubrics/judge_r.py- Reasoning & safety judgenexa_distill/sample_gate.py- Quality filteringscripts/run_batch_generation.py- Generation orchestrator
Documentation
docs/TODAY_EXECUTION_PLAN.md- Detailed execution planQUICK_START.md- This file
π Monitoring
Check Generation Progress
Check Training Progress
Check GPU Usage
β οΈ Troubleshooting
API Rate Limits
Editbatches/teacher_gen_v1.yaml:
OOM During Training
Editnexa_train/configs/baseline_qlora.yaml:
Tmux Session Exists
π° Cost Estimates
| Task | Cost | Duration |
|---|---|---|
| QC Batch (1k) | $0.50-0.75 | ~10 min |
| Full Gen (100k) | $50-75 | ~16-18 hrs |
| Training (2ΓA100) | $9 | ~3 hrs |
| Total | $60-85 | ~19-21 hrs |
β Success Criteria
Data Generation
- β 100k samples generated
- β Judge-F mean β₯ 75
- β Judge-R mean β₯ 75
- β Acceptance rate β₯ 70%
Training
- β Training starts without errors
- β Throughput β₯ 12k tokens/sec
- β Training loss decreases
- β Checkpoints saved
π― Todayβs Workflow
π Need Help?
See detailed execution plan:Quick Start:
./scripts/validate_environment.sh && ./scripts/run_full_pipeline.sh