Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

llama stack and vllm on ocp #24

Merged
merged 4 commits into from
Feb 27, 2025
Merged

llama stack and vllm on ocp #24

merged 4 commits into from
Feb 27, 2025

Conversation

cooktheryan
Copy link
Collaborator

@cooktheryan cooktheryan commented Feb 20, 2025

YAML and README.md to run llama stack and vLLM with meta-llama/Llama-3.1-8B-Instruct

@cooktheryan cooktheryan changed the title serve meta through vllm llama stack and vllm on ocp Feb 20, 2025
@hemajv hemajv self-requested a review February 20, 2025 23:11
Copy link
Contributor

@hemajv hemajv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@cooktheryan this looks awesome 🎉 added a few comments


```
llamastack-deployment-llama-serve.apps.ocp-beta-test.nerc.mghpcc.org
```
Copy link
Contributor

@hemajv hemajv Feb 20, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you also mention the port to be used? And can we also add a section at the end like so:

Testing the Llamastack Server

In order to test the Llamastack server, you can try some of the examples mentioned here by setting the following env vars:

INFERENCE_MODEL="meta-llama/Llama-3.1-8B-Instruct"
LLAMA_STACK_PORT= <mention the port number>

When connecting to the server using LlamaStackClient make sure to update the base_url with the URL of the Llamastack server.

@cooktheryan
Copy link
Collaborator Author

@hemajv i may have you do a follow up PR on how to use these endpoints based on your testing

Copy link
Contributor

@hemajv hemajv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@cooktheryan thanks for the changes!
/lgtm 🚢

@hemajv hemajv merged commit 8bed7a3 into main Feb 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants