Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

captus paralog filter - references added in the wrong direction #12

Open
EdBiffin opened this issue Jun 28, 2024 · 2 comments
Open

captus paralog filter - references added in the wrong direction #12

EdBiffin opened this issue Jun 28, 2024 · 2 comments

Comments

@EdBiffin
Copy link

Dear Edgardo, Ive noticed that when adding reference sequences to alignments, prior to informed paralog filtering, in some cases these are added in the reverse direction to the extracted sequences in the alignment. Im using a custom reference file that comprises the sequences that were used for probe design, mostly sourced from 1KP and Phytozome - the references were generated by clustering using CD-Hit (longest sequence per cluster at specified identity). Ive attached an example alignment and also the references for that gene. I'm using v1.01. Any advice would be greatly appreciated.
AT1G03750.fna.txt
AT1G03750.references.txt

@edgardomortiz
Copy link
Owner

Dear Ed,

Sorry for the really late reply, I was in the chaos of moving countries. I see that in your reference all sequences are in different reading frames so Captus might be having troubles translating them consistently.

Captus translates the references using the six reading frames, then selects the reading frame that produces the fewest internal stop codons, if there is a tie between two reading frames it will prefer a positive reading frame. So maybe these are not CDS?

If you don't care about obtaining the aminoacid format from the alignment step, and you are sure all are in the same direction you could provide the reference to Captus as miscellaneous DNA (-d AT1G03750.references.fasta).

If the aminoacid output is necessary then I would suggest verifying that these are translatable (preferably in reading frame 1) or at least consistently for all

Let me know if this helps!

Edgardo

@EdBiffin
Copy link
Author

EdBiffin commented Aug 5, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants