Download the LLaMA model as a separate job and share it as an artifact to prevent repeated downloads from Huggingface which could lead to rate limiting or blocking. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>