-
-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pinecone Import: Multiple matches for FieldRef.Name(__filename) in id: string #70
Comments
Thanks for reporting! Will have a look soon |
I have pushed a potential fix to the export script (can't test since I'm not sure how the __filename is showing up in the vectors as well as metadata). You can install the latest version of the package: vdf-io==0.1.232 and try exporting your dataset again. Please let me know if that works. Thanks |
Well after exporting for 2 hours, it failed with the below. To get around my original problem, I modified the code to use JSON as the output since it is human readable and easy to fix. Worked great.
|
Attached is one of the parquet files generated from a Pinecone export. When I try to re-import I get these errors regarding duplicate fields.
Multiple matches for FieldRef.Name(__filename) in id: string vector: list<element: double> __filename: string __ingested_at: string content_id: string filename: string ingested_at: string text: string __fragment_index: int32 __batch_index: int32 __last_in_fragment: bool __filename: string
i2.parquet.zip
The text was updated successfully, but these errors were encountered: