<div dir="ltr">Hi Joao,<div><br></div><div>The reasons for this are clearer if you look at the Amino_acids and Codons fields in the VEP output.</div><div><br></div><div>For your first input, the bases are inserted between the first and second bases of a codon, disrupting the original codon. The resulted translated sequence is then "PQM", which shares no sequence with the original translated codon ("L"), so inframe_insertion is not correct. Since you're replacing an "L" with a different sequence, the most specific consequence type VEP can call is protein_altering_variant.</div><div><br></div><div>Amino_acids: L/PQM   Codons: ctg/cCCCAAAtg<br></div><div><br></div><div>In your second input, the bases are inserted *between* a pair of codons, so the original codons are not disrupted and inframe_insertion is valid.</div><div><br></div><div>Amino_acids: -/PK    Codons: -/CCCAAA<br></div><div><br></div><div>In the third input, while you are inserting in the same position as the first input, the translated sequence still includes the original amino acid as by chance the inserted sequence restores a codon that translates to that amino acid:</div><div><br></div><div>Amino_acids: L/PL    Codons: ctg/cCCCtg<br></div><div><br></div><div>This also holds true for the fourth input, with the degeneracy of the amino acid code helping out:</div><div><br></div><div>Amino_acids: L/LPK   Codons: ctg/ctCCCAAAg<br></div><div><br></div><div>Hope this helps</div><div><br></div><div>Will McLaren</div><div>Ensembl Variation</div></div><div class="gmail_extra"><br><div class="gmail_quote">On 3 November 2016 at 15:51, João Eiras <span dir="ltr"><<a href="mailto:joao.eiras@gmail.com" target="_blank">joao.eiras@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi!<br>

<br>

I'm building some tests for an application, and I found something<br>

which is not entirely intelligible for me.<br>

<br>

Consider the following VCF variant, and its effect on transcript<br>

ENSMUST00000086738<br>

<br>

#CHROM POS REF ALT<br>

chr1 99772783 C CCCCAAA<br>

<br>

It is annotated with consequence_terms "protein_altering_variant".<br>

<br>

However, if I shift the variant left or right, or remove one of the<br>

AAs, I get as expected an "inframe_insertion".<br>

<br>

#CHROM POS REF ALT<br>

chr1 99772782 A ACCCAAA<br>

chr1 99772783 C CCCC<br>

chr1 99772784 T TCCCAAA<br>

<br>

Being that, I don't see any other differences in the annotation.<br>

Why this difference ?<br>

<br>

Thank you.<br>

<br>

______________________________<wbr>_________________<br>

Dev mailing list    <a href="mailto:Dev@ensembl.org">Dev@ensembl.org</a><br>

Posting guidelines and subscribe/unsubscribe info: <a href="http://lists.ensembl.org/mailman/listinfo/dev" rel="noreferrer" target="_blank">http://lists.ensembl.org/<wbr>mailman/listinfo/dev</a><br>

Ensembl Blog: <a href="http://www.ensembl.info/" rel="noreferrer" target="_blank">http://www.ensembl.info/</a><br>

</blockquote></div><br></div>