Run On Multi-GPU #15

Open
mayzyo opened this issue Mar 17, 2024 · 5 comments

Comments

@mayzyo

mayzyo commented Mar 17, 2024

Is it possible to run this on multiple GPUs? CUDA reported an out-of-memory error on my setup.

@linkedlist771

You can set device_map when loading the embedding model with transformers.
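A minimal sketch of that suggestion, assuming the embedding model is loaded directly through transformers' AutoModel (the thread does not show how this project loads its model) and that the accelerate package is installed, which device_map requires. The helper names build_load_kwargs and load_embedding_model are hypothetical and exist only for illustration.

```python
from typing import Any, Dict, Optional


def build_load_kwargs(
    max_memory_per_gpu: Optional[str] = None, num_gpus: int = 0
) -> Dict[str, Any]:
    """Build from_pretrained keyword arguments that spread a model across GPUs.

    Hypothetical helper, not part of this project.
    """
    # device_map="auto" lets accelerate place layers on the available devices.
    kwargs: Dict[str, Any] = {"device_map": "auto"}
    if max_memory_per_gpu and num_gpus > 0:
        # Optional per-device cap, keyed by GPU index, e.g. {0: "10GiB", 1: "10GiB"}.
        kwargs["max_memory"] = {i: max_memory_per_gpu for i in range(num_gpus)}
    return kwargs


def load_embedding_model(model_name: str, **load_kwargs: Any):
    """Load tokenizer and model; model_name is whatever checkpoint you use."""
    # Imported lazily so build_load_kwargs works without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name, **load_kwargs)
    return tokenizer, model


# Example call (downloads the checkpoint, so it is left commented out):
# tokenizer, model = load_embedding_model(
#     "<your-model-name>", **build_load_kwargs("10GiB", num_gpus=2)
# )
```

With device_map="auto" the layers are split across the visible GPUs, so a model that does not fit on one card may still load. It does not run requests in parallel across GPUs.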

@limcheekin
Owner

@linkedlist771 Thanks for the suggestion. I'd appreciate it if you could send me a PR for the implementation.

@linkedlist771

linkedlist771 commented Jul 13, 2024

@limcheekin Thank you for your response. I'd be happy to attempt submitting a PR to address this issue. I'll start working on this as soon as possible and submit a PR for your review when it's ready. If you have any specific requirements or suggestions for the implementation, please let me know. I'll strive to ensure the PR adheres to the project's coding standards and best practices.

If I encounter any issues or need clarification during the implementation process, I'll update the progress in this issue. Thank you again for the opportunity to contribute to the project.

@riyajatar37003

Once I launch the server, how can I use it in the same way as below?

    from openai import OpenAI
    from openai import AsyncOpenAI

    client = AsyncOpenAI(api_key="fake-api-key", base_url="http://localhost:8000")
    embeddings = client.embeddings.create(
        input=["input"],
    )

@limcheekin
Owner


Thanks for your interest. I don't have experience with the AsyncOpenAI class. I think it is not supported right now, but it is a good candidate for a future enhancement. Please open a separate issue for it.
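For reference, the snippet in the question would need two changes before it could work against any OpenAI-compatible server: the async client's create call has to be awaited, and the openai package requires a model argument. The following is only a sketch of that calling pattern. Whether this server accepts it is not confirmed in this thread, the base URL may need a path prefix such as /v1, and make_client, embed_texts, and the model name are hypothetical.

```python
import asyncio
from typing import Any, List


def make_client(
    base_url: str = "http://localhost:8000", api_key: str = "fake-api-key"
):
    """Create the async client; the URL is taken from the question above."""
    # Imported lazily so embed_texts can be exercised without openai installed.
    from openai import AsyncOpenAI

    return AsyncOpenAI(api_key=api_key, base_url=base_url)


async def embed_texts(client: Any, texts: List[str], model: str) -> List[List[float]]:
    """Request embeddings and return one vector per input text."""
    # AsyncOpenAI methods are coroutines, so the call must be awaited.
    response = await client.embeddings.create(input=texts, model=model)
    return [item.embedding for item in response.data]


# Example call (needs a running server, so it is left commented out):
# vectors = asyncio.run(
#     embed_texts(make_client(), ["input"], model="<your-model-name>")
# )
```

Keeping the request in embed_texts, separate from client construction, makes it possible to try the logic with a stub client before pointing it at a real server.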


4 participants