<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
</head>
<body>
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
<div id="divtagdefaultwrapper" style="font-size:10pt;color:#000000;font-family:Arial,Helvetica,sans-serif;" dir="ltr">
<p>Hi Irina,</p>
<p><br>
</p>
<p>Thanks, also I forgot to ask, regarding the stats_html command, there are two fields in the output html "Variants Processed' and 'Variants remaining after filtering'.</p>
<p><br>
</p>
<p>What processes are normally used to process and filter these VCFs. Does VEP use the same processing approach for all VCFs?</p>
<p><br>
</p>
<p>Best Regards,<br>
</p>
<p><br>
</p>
<div id="Signature">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size: 12pt; color: rgb(0, 0, 0); font-family: Calibri, Helvetica, sans-serif, "EmojiFont", "Apple Color Emoji", "Segoe UI Emoji", NotoColorEmoji, "Segoe UI Symbol", "Android Emoji", EmojiSymbols;">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size: 12pt; color: rgb(0, 0, 0); font-family: Calibri, Helvetica, sans-serif, "EmojiFont", "Apple Color Emoji", "Segoe UI Emoji", NotoColorEmoji, "Segoe UI Symbol", "Android Emoji", EmojiSymbols;">
<font color="#1F497D"><font size="2"><span style="font-size:12pt; color:rgb(0,0,0)"><b><span style="font-family:"Arial Black",Arial,sans-serif; font-size:11pt"></span></b><span style="font-family:"Arial Black",Arial,sans-serif; font-size:11pt"><b><span style="font-family:Consolas,Courier,monospace"></span><span style="font-family:Calibri,Helvetica,sans-serif; font-size:12pt"></span></b><span style="font-family:Calibri,Helvetica,sans-serif; font-size:14pt"><i><b></b></i><b><span style="font-family:Arial,Helvetica,sans-serif; font-size:10pt">Margaret
 Linan, MPH MS</span></b><i><b></b></i></span><b><span style="font-family:Calibri,Helvetica,sans-serif; font-size:12pt"></span><span style="font-family:Consolas,Courier,monospace"></span></b></span><b><span style="font-family:"Arial Black",Arial,sans-serif; font-size:11pt"></span></b></span></font></font></div>
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif; font-size:12pt"></span><span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif; font-size:12pt"></span><font color="#1F497D"><font size="2"><span style="font-size:10pt; color:rgb(0,0,0); font-family:Arial,Helvetica,sans-serif">Independent Consultant</span></font></font></div>
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif; font-size:12pt"></span><span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif; font-size:12pt"></span><font color="#1F497D"><font size="2"><span style="font-size:10pt; color:rgb(0,0,0); font-family:Arial,Helvetica,sans-serif">Serving the CBIPM @ Icahn
 School of Medicine at Mount Sinai</span></font></font></div>
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif; font-size:12pt"></span><span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif; font-size:12pt"></span><font color="#1F497D"><font size="2"><span style="font-size:10pt; color:rgb(0,0,0); font-family:Arial,Helvetica,sans-serif">Margaret.Linan@mssm.edu</span></font></font></div>
<span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<font color="#1F497D"><font size="2"><span style="font-size:11pt; color:rgb(0,0,0); font-family:"Arial Black",Arial,sans-serif"></span><br>
</font></font></div>
</div>
</div>
</div>
<hr style="display:inline-block;width:98%" tabindex="-1">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" style="font-size:11pt" color="#000000"><b>From:</b> Irina Armean <iarmean@ebi.ac.uk><br>
<b>Sent:</b> Tuesday, September 17, 2019 10:01:26 AM<br>
<b>To:</b> Linan, Margaret; Ensembl developers list<br>
<b>Subject:</b> Re: [ensembl-dev] VEP command line</font>
<div> </div>
</div>
<div>
<p>Hi Margaret,</p>
<p><br>
</p>
<p>To get an accurate count for overlapping transcripts you would need to count the unique identifiers in the 'Feature' column for the records that have '<i>Transcript</i>' in the 'Feature_type' column.</p>
<p>A similar count would be needed for regulatory features: counting the unique identifiers in the 'Feature' column for the records that have '<i>RegulatoryFeature</i>' in the 'Feature_type' column.</p>
<p>Counting the occurrences of "<i>Transcript</i>" in the 'Feature' column will result in double counting any transcript that is affected by more than one of the input variants.</p>
<p><br>
</p>
<p>'sense_overlapping' indicates that the variant is on a long non-coding transcript that contains a protein coding gene within its intronic sequence on the same strand, with no overlap of exonic sequence.<span style="color: rgb(51, 51, 51); font-family:
        "Luxi Sans", Helvetica, Arial, Geneva, sans-serif;
        font-size: 12.8px; font-style: normal; font-variant-ligatures:
        normal; font-variant-caps: normal; font-weight: 400;
        letter-spacing: normal; orphans: 2; text-align: left;
        text-indent: 0px; text-transform: none; white-space: normal;
        widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px;
        background-color: rgb(255, 255, 255); text-decoration-style:
        initial; text-decoration-color: initial; display: inline
        !important; float: none;"><br>
</span></p>
<p><span style="color: rgb(51, 51, 51); font-family: "Luxi
        Sans", Helvetica, Arial, Geneva, sans-serif; font-size:
        12.8px; font-style: normal; font-variant-ligatures: normal;
        font-variant-caps: normal; font-weight: 400; letter-spacing:
        normal; orphans: 2; text-align: left; text-indent: 0px;
        text-transform: none; white-space: normal; widows: 2;
        word-spacing: 0px; -webkit-text-stroke-width: 0px;
        background-color: rgb(255, 255, 255); text-decoration-style:
        initial; text-decoration-color: initial; display: inline
        !important; float: none;"><br>
</span></p>
<p><span style="color: rgb(51, 51, 51); font-family: "Luxi
        Sans", Helvetica, Arial, Geneva, sans-serif; font-size:
        12.8px; font-style: normal; font-variant-ligatures: normal;
        font-variant-caps: normal; font-weight: 400; letter-spacing:
        normal; orphans: 2; text-align: left; text-indent: 0px;
        text-transform: none; white-space: normal; widows: 2;
        word-spacing: 0px; -webkit-text-stroke-width: 0px;
        background-color: rgb(255, 255, 255); text-decoration-style:
        initial; text-decoration-color: initial; display: inline
        !important; float: none;"></span><br>
Kind regards,</p>
<p>Irina<br>
</p>
<p><span style="color: rgb(51, 51, 51); font-family: "Luxi
        Sans", Helvetica, Arial, Geneva, sans-serif; font-size:
        12.8px; font-style: normal; font-variant-ligatures: normal;
        font-variant-caps: normal; font-weight: 400; letter-spacing:
        normal; orphans: 2; text-align: left; text-indent: 0px;
        text-transform: none; white-space: normal; widows: 2;
        word-spacing: 0px; -webkit-text-stroke-width: 0px;
        background-color: rgb(255, 255, 255); text-decoration-style:
        initial; text-decoration-color: initial; display: inline
        !important; float: none;"><br>
</span></p>
<p><span style="color: rgb(51, 51, 51); font-family: "Luxi
        Sans", Helvetica, Arial, Geneva, sans-serif; font-size:
        12.8px; font-style: normal; font-variant-ligatures: normal;
        font-variant-caps: normal; font-weight: 400; letter-spacing:
        normal; orphans: 2; text-align: left; text-indent: 0px;
        text-transform: none; white-space: normal; widows: 2;
        word-spacing: 0px; -webkit-text-stroke-width: 0px;
        background-color: rgb(255, 255, 255); text-decoration-style:
        initial; text-decoration-color: initial; display: inline
        !important; float: none;"><br>
</span></p>
<p><br>
</p>
<div class="moz-cite-prefix">On 16/09/2019 18:25, Linan, Margaret wrote:<br>
</div>
<blockquote type="cite" cite="mid:d406bbfce8694ea682a1d15a6bbfac01@mssm.edu"><style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
<div id="divtagdefaultwrapper" style="font-size:10pt;color:#000000;font-family:Arial,Helvetica,sans-serif;" dir="ltr">
<p>Thanks Irina,</p>
<p><br>
</p>
<p>Regarding the counting of the overlapped transcripts and regulatory features (using stats_html), should I just count how many times the string "transcript" or "regulatory features" appears in the 'Feature' column?
<br>
</p>
<p>Also, what string would I be searching for in the 'Feature_type' column? In an example VEP annotated VCF, the only relevant string was: '<span>sense_overlapping</span>'</p>
<p><br>
</p>
<p>Best regards,</p>
<p><br>
</p>
<div id="Signature">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size:
            12pt; color: rgb(0, 0, 0); font-family: Calibri, Helvetica,
            sans-serif, "EmojiFont", "Apple Color
            Emoji", "Segoe UI Emoji", NotoColorEmoji,
            "Segoe UI Symbol", "Android Emoji",
            EmojiSymbols;">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size:
              12pt; color: rgb(0, 0, 0); font-family: Calibri,
              Helvetica, sans-serif, "EmojiFont", "Apple
              Color Emoji", "Segoe UI Emoji",
              NotoColorEmoji, "Segoe UI Symbol", "Android
              Emoji", EmojiSymbols;">
<font color="#1F497D"><font size="2"><span style="font-size:12pt; color:rgb(0,0,0)"><b><span style="font-family:"Arial
                        Black",Arial,sans-serif; font-size:11pt"></span></b><span style="font-family:"Arial
                      Black",Arial,sans-serif; font-size:11pt"><b><span style="font-family:Consolas,Courier,monospace"></span><span style="font-family:Calibri,Helvetica,sans-serif; font-size:12pt"></span></b><span style="font-family:Calibri,Helvetica,sans-serif;
                        font-size:14pt"><b><span style="font-family:Arial,Helvetica,sans-serif;
                            font-size:10pt">Margaret
 Linan, MPH MS</span></b></span><b><span style="font-family:Calibri,Helvetica,sans-serif; font-size:12pt"></span><span style="font-family:Consolas,Courier,monospace"></span></b></span><b><span style="font-family:"Arial
                        Black",Arial,sans-serif; font-size:11pt"></span></b></span></font></font></div>
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif;
              font-size:12pt"></span><span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000;
              font-family:Calibri,Helvetica,sans-serif">
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif;
                font-size:12pt"></span><font color="#1F497D"><font size="2"><span style="font-size:10pt;
                    color:rgb(0,0,0);
                    font-family:Arial,Helvetica,sans-serif">Independent
 Consultant</span></font></font></div>
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif;
              font-size:12pt"></span><span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000;
              font-family:Calibri,Helvetica,sans-serif">
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif;
                font-size:12pt"></span><font color="#1F497D"><font size="2"><span style="font-size:10pt;
                    color:rgb(0,0,0);
                    font-family:Arial,Helvetica,sans-serif">Serving
 the CBIPM @ Icahn School of Medicine at Mount Sinai</span></font></font></div>
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif;
              font-size:12pt"></span><span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000;
              font-family:Calibri,Helvetica,sans-serif">
<span style="font-size:10pt"></span><span style="font-family:Arial,Helvetica,sans-serif;
                font-size:12pt"></span><font color="#1F497D"><font size="2"><span style="font-size:10pt;
                    color:rgb(0,0,0);
                    font-family:Arial,Helvetica,sans-serif"><a class="moz-txt-link-abbreviated" href="mailto:Margaret.Linan@mssm.edu">Margaret.Linan@mssm.edu</a></span></font></font></div>
<span style="font-size:10pt"></span>
<div dir="ltr" style="font-size:12pt; color:#000000;
              font-family:Calibri,Helvetica,sans-serif">
<font color="#1F497D"><font size="2"><span style="font-size:11pt; color:rgb(0,0,0);
                    font-family:"Arial
                    Black",Arial,sans-serif"></span><br>
</font></font></div>
</div>
</div>
</div>
<hr style="display:inline-block;width:98%" tabindex="-1">
<div id="divRplyFwdMsg" dir="ltr"><font style="font-size:11pt" face="Calibri, sans-serif" color="#000000"><b>From:</b> Irina Armean
<a class="moz-txt-link-rfc2396E" href="mailto:iarmean@ebi.ac.uk"><iarmean@ebi.ac.uk></a><br>
<b>Sent:</b> Monday, September 16, 2019 8:22:53 AM<br>
<b>To:</b> Ensembl developers list; Linan, Margaret<br>
<b>Subject:</b> Re: [ensembl-dev] VEP command line</font>
<div> </div>
</div>
<div>
<table width="200" cellspacing="0" border="1" bgcolor="yellow" align="center">
<tbody>
<tr>
<td nowrap="nowrap">USE CAUTION: External Message.</td>
</tr>
</tbody>
</table>
<div>
<p>Hi Margaret,</p>
<p><br>
</p>
<p>Sorry for the delay.</p>
<p>The stats written out in stats_html are collected internally simultaneously with the VEP annotation and therefore are not generated based on the VCF columns of the output file.<br>
</p>
<p><br>
</p>
<p>Depending on what VEP run options were selected, the counts could be reproduced based on the output file. For example the number of overlapped genes corresponds to the unique count of ENSG identifiers in the 'Gene' output column. The number of overlapped
 transcripts and regulatory features could be computed based on the 'Feature' and 'Feature_type' columns.<br>
</p>
<p><br>
</p>
<p><br>
</p>
<p>Kind regards,</p>
<p>Irina<br>
</p>
<p><br>
</p>
<div class="moz-cite-prefix">On 12/09/2019 19:27, Linan, Margaret wrote:<br>
</div>
<blockquote type="cite" cite="mid:6bcf55004f5247a4874c04ff54cfa93b@mssm.edu"><style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
<div id="divtagdefaultwrapper" style="font-size:10pt;color:#000000;font-family:Arial,Helvetica,sans-serif;" dir="ltr">
<p>Hi -</p>
<p><br>
</p>
<p>Does anyone know how the VEP command line program's stats_html utility calculates the following (i.e., what VCF columns and operations it uses)?</p>
<blockquote>
<p>- VCF file pre-processing</p>
<p>- Number of overlapped genes</p>
<p>- Number of overlapped transcripts</p>
<p>- Number of overlapped regulatory features<br>
</p>
</blockquote>
<p><br>
</p>
Thank you,
<p>Margaret<br>
</p>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<pre class="moz-quote-pre" wrap="">_______________________________________________
Dev mailing list    <a class="moz-txt-link-abbreviated" href="mailto:Dev@ensembl.org" moz-do-not-send="true">Dev@ensembl.org</a>
Posting guidelines and subscribe/unsubscribe info: <a class="moz-txt-link-freetext" href="https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.ensembl.org_mailman_listinfo_dev-5Fensembl.org&d=DwMC-g&c=shNJtf5dKgNcPZ6Yh64b-A&r=kRxZpbitOhDkEC3BuUN1vDtzo3iicYrRn6woDJL_jnA&m=w9gjaZF2-WgEeSoFXEwsblFfwJmVFz1CEmhpSp9zXtY&s=SpZOBETLvgtXkDPVAYD1y-NoSVS2-Gm6y5Og0WsbrqU&e=" moz-do-not-send="true">https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org</a>
Ensembl Blog: <a class="moz-txt-link-freetext" href="https://urldefense.proofpoint.com/v2/url?u=http-3A__www.ensembl.info_&d=DwMC-g&c=shNJtf5dKgNcPZ6Yh64b-A&r=kRxZpbitOhDkEC3BuUN1vDtzo3iicYrRn6woDJL_jnA&m=w9gjaZF2-WgEeSoFXEwsblFfwJmVFz1CEmhpSp9zXtY&s=5upY6Tga0npIqKtFlwp1cmQIuwbtshPzDJQJPRAHMYg&e=" moz-do-not-send="true">http://www.ensembl.info/</a>
</pre>
</blockquote>
<pre class="moz-signature" cols="72">-- 
</pre>
</div>
</div>
</blockquote>
<pre class="moz-signature" cols="72">
</pre>
</div>
</body>
</html>