<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">Hi Julie<div class=""><br class=""></div><div class="">The information is stored in the transcript_attrib table under attribution_id 380:</div><div class=""><a href="http://www.ensembl.org/info/docs/api/core/core_schema.html#transcript_attrib" class="">http://www.ensembl.org/info/docs/api/core/core_schema.html#transcript_attrib</a></div><div class=""><br class=""></div><div class="">You can fetch it from the Perl API using $transcript->get_all_Attributes</div><div class=""><a href="http://www.ensembl.org/info/docs/Doxygen/core-api/classBio_1_1EnsEMBL_1_1Transcript.html#a59f9ff2079a28ba80bc8b62a5e636327" class="">http://www.ensembl.org/info/docs/Doxygen/core-api/classBio_1_1EnsEMBL_1_1Transcript.html#a59f9ff2079a28ba80bc8b62a5e636327</a></div><div class=""><br class=""></div><div class="">All the best</div><div class=""><br class=""></div><div class="">Emily</div><div class=""><div><br class=""><blockquote type="cite" class=""><div class="">On 10 Mar 2021, at 14:37, Julie Sullivan <<a href="mailto:julie.sullivan@gmail.com" class="">julie.sullivan@gmail.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class=""><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">Thank you! That answers my question!</div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small"><br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">I would really like to be able to access that tag (non-ATG start) programmatically. Are there plans for putting it with the other transcript flags in the GTF file?<br class=""></div></div><br class=""><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, 10 Mar 2021 at 10:09, Emily Perry <<a href="mailto:emily@ebi.ac.uk" class="">emily@ebi.ac.uk</a>> wrote:<br class=""></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div style="overflow-wrap: break-word;" class="">Hi Julie<div class=""><br class=""></div><div class="">We have some information about non-ATG start codons in our blog post from release 102:</div><div class=""><a href="https://www.ensembl.info/2020/11/30/ensembl-102-has-been-released/" target="_blank" class="">https://www.ensembl.info/2020/11/30/ensembl-102-has-been-released/</a></div><div class=""><br class=""></div><div class="">Quite simply, there is not a rule. This is a situation of exceptional biology which we are only able to annotate correctly because of our expert manual gene annotators analysing the data in detail.</div><div class=""><br class=""></div><div class="">All the best</div><div class=""><br class=""></div><div class="">Emily<br class=""><div class=""><br class=""><blockquote type="cite" class=""><div class="">On 10 Mar 2021, at 09:08, Julie Sullivan <<a href="mailto:julie.sullivan@gmail.com" target="_blank" class="">julie.sullivan@gmail.com</a>> wrote:</div><br class=""><div class=""><div dir="ltr" class=""><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small"><a href="https://www.ensembl.org/Homo_sapiens/Transcript/Sequence_cDNA?db=core;g=ENSG00000288649;r=20:33667144-33668235;t=ENST00000678634" target="_blank" class="">https://www.ensembl.org/Homo_sapiens/Transcript/Sequence_cDNA?db=core;g=ENSG00000288649;r=20:33667144-33668235;t=ENST00000678634</a><br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">The first codon is GTG. I would not have expected that to be Methionine.</div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small"><br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">I looked in the text files, and there are 123 of these transcripts where the start codon is NOT ATG but the aa is M, in Homo sapiens.</div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small"><pre style="box-sizing:inherit;margin:4px 0px;padding:8px;font-size:12px;line-height:1.50001;font-variant-ligatures:none;white-space:pre-wrap;word-break:normal;font-family:Monaco,Menlo,Consolas,"Courier New",monospace;border-radius:4px;color:rgb(29,28,29);font-style:normal;font-variant-caps:normal;font-weight:400;letter-spacing:normal;text-align:left;text-indent:0px;text-transform:none;word-spacing:0px;text-decoration-style:initial;text-decoration-color:initial" class="">{'error': 0,<br style="box-sizing:inherit" class=""> 'methionine': 91434,<br style="box-sizing:inherit" class=""> 'GTG': 22,<br style="box-sizing:inherit" class=""> 'ATA': 10,<br style="box-sizing:inherit" class=""> 'CTG': 67,<br style="box-sizing:inherit" class=""> 'ACG': 8,<br style="box-sizing:inherit" class=""> 'TTG': 9,<br style="box-sizing:inherit" class=""> 'ATT': 5,<br style="box-sizing:inherit" class=""> 'AAC': 1,<br style="box-sizing:inherit" class=""> 'AAG': 1}</pre><br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">Why is that? <br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small"><br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">Specifically I would like a rule I can use, as my HGVSp strings are different from VEP for this reason.</div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small"><br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">Thanks!<br class=""></div><div class="gmail_default" style="font-family:trebuchet ms,sans-serif;font-size:small">Julie<br class=""></div></div>
_______________________________________________<br class="">Dev mailing list <a href="mailto:Dev@ensembl.org" target="_blank" class="">Dev@ensembl.org</a><br class="">Posting guidelines and subscribe/unsubscribe info: <a href="https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org" target="_blank" class="">https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org</a><br class="">Ensembl Blog: <a href="http://www.ensembl.info/" target="_blank" class="">http://www.ensembl.info/</a><br class=""></div></blockquote></div><br class=""><div class="">
<div dir="auto" style="letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; text-decoration: none;" class=""><div class="">—</div><div class=""><br class=""></div><div class="">Dr Emily Perry (Pritchard)<br class="">Ensembl Outreach Project Leader <br class="">(she/her)<br class=""><br class="">European Bioinformatics Institute (EMBL-EBI)<br class="">European Molecular Biology Laboratory <br class="">Wellcome Genome Campus<br class="">Hinxton<br class="">Cambridge<br class="">CB10 1SD<br class="">UK </div><div class=""><br class=""></div></div><br class=""><br class="">
</div>
<br class=""></div></div>_______________________________________________<br class="">
Dev mailing list <a href="mailto:Dev@ensembl.org" target="_blank" class="">Dev@ensembl.org</a><br class="">
Posting guidelines and subscribe/unsubscribe info: <a href="https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org" rel="noreferrer" target="_blank" class="">https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org</a><br class="">
Ensembl Blog: <a href="http://www.ensembl.info/" rel="noreferrer" target="_blank" class="">http://www.ensembl.info/</a><br class="">
</blockquote></div>
_______________________________________________<br class="">Dev mailing list <a href="mailto:Dev@ensembl.org" class="">Dev@ensembl.org</a><br class="">Posting guidelines and subscribe/unsubscribe info: <a href="https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org" class="">https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org</a><br class="">Ensembl Blog: <a href="http://www.ensembl.info/" class="">http://www.ensembl.info/</a><br class=""></div></blockquote></div><br class=""><div class="">
<meta charset="UTF-8" class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div>—</div><div><br class=""></div><div>Dr Emily Perry (Pritchard)<br class="">Ensembl Outreach Project Leader <br class="">(she/her)<br class=""><br class="">European Bioinformatics Institute (EMBL-EBI)<br class="">European Molecular Biology Laboratory <br class="">Wellcome Genome Campus<br class="">Hinxton<br class="">Cambridge<br class="">CB10 1SD<br class="">UK </div><div class=""><br class=""></div></div><br class="Apple-interchange-newline"><br class="Apple-interchange-newline">
</div>
<br class=""></div></body></html>