You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I noticed that the book doesn't really have much information about clustering unlabeled data. I'm aware that there are some resources out there that address this question. However it would be nice to know what are some techniques that work best to cluster text, especially ones that don't rely on API calls that might be rate limited.
I have been pondering on these issues lately and the winning method so far is:
Generate embeddings.
MinMax scaler on features.
Use algorithm like K-means and plot number of clusters versus silhouette score.
Would appreciate to know your thoughts on this.
Best,
Nadim
The text was updated successfully, but these errors were encountered:
Information
The question or comment is about chapter:
Question or comment
Hello, I noticed that the book doesn't really have much information about clustering unlabeled data. I'm aware that there are some resources out there that address this question. However it would be nice to know what are some techniques that work best to cluster text, especially ones that don't rely on API calls that might be rate limited.
I have been pondering on these issues lately and the winning method so far is:
Would appreciate to know your thoughts on this.
Best,
Nadim
The text was updated successfully, but these errors were encountered: