<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
<div id="divtagdefaultwrapper" style="font-size:10pt;color:#000000;font-family:Arial,Helvetica,sans-serif;" dir="ltr">
<p>Thanks Irina,</p>
<p><br>
</p>
<p>Regarding the counting of the overlapped transcripts and regulatory features (using stats_html), should I just count how many times the string "transcript" or "regulatory features" appears in the 'Feature' column?
<br>
</p>
<p>Also, what string would I be searching for in the 'Feature_type' column? In an example VEP annotated VCF, the only relevant string was: '<span>sense_overlapping</span>'</p>
<p><br>
</p>
<p>Best regards,</p>
<p><br>
</p>
<div id="Signature">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size: 12pt; color: rgb(0, 0, 0); font-family: Calibri, Helvetica, sans-serif, 'EmojiFont', 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols;">
<div id="divtagdefaultwrapper" dir="ltr" style="font-size: 12pt; color: rgb(0, 0, 0); font-family: Calibri, Helvetica, sans-serif, 'EmojiFont', 'Apple Color Emoji', 'Segoe UI Emoji', NotoColorEmoji, 'Segoe UI Symbol', 'Android Emoji', EmojiSymbols;">
<font color="#1F497D"><font size="2"><b><span style="font-family:Arial,Helvetica,sans-serif; font-size:10pt; color:rgb(0,0,0)">Margaret Linan, MPH MS</span></b></font></font></div>
<div dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<font color="#1F497D"><font size="2"><span style="font-size:10pt; color:rgb(0,0,0); font-family:Arial,Helvetica,sans-serif">Independent Consultant</span></font></font></div>
<div dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<font color="#1F497D"><font size="2"><span style="font-size:10pt; color:rgb(0,0,0); font-family:Arial,Helvetica,sans-serif">Serving the CBIPM @ Icahn School of Medicine at Mount Sinai</span></font></font></div>
<div dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<font color="#1F497D"><font size="2"><span style="font-size:10pt; color:rgb(0,0,0); font-family:Arial,Helvetica,sans-serif">Margaret.Linan@mssm.edu</span></font></font></div>
</div>
</div>
</div>
<hr style="display:inline-block;width:98%" tabindex="-1">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" style="font-size:11pt" color="#000000"><b>From:</b> Irina Armean &lt;iarmean@ebi.ac.uk&gt;<br>
<b>Sent:</b> Monday, September 16, 2019 8:22:53 AM<br>
<b>To:</b> Ensembl developers list; Linan, Margaret<br>
<b>Subject:</b> Re: [ensembl-dev] VEP command line</font>
<div> </div>
</div>
<div>
<div>
<p>Hi Margaret,</p>
<p><br>
</p>
<p>Sorry for the delay.</p>
<p>The stats written to stats_html are collected internally while VEP annotates the input, so they are not derived from the columns of the output file.<br>
</p>
<p><br>
</p>
<p>Depending on which VEP run options were selected, the counts can be reproduced from the output file. For example, the number of overlapped genes corresponds to the number of unique ENSG identifiers in the 'Gene' output column. The number of overlapped transcripts and regulatory features can be computed from the 'Feature' and 'Feature_type' columns.<br>
</p>
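<p>As a rough illustration of the counting described above, here is a minimal Python sketch. It assumes the default tab-delimited VEP output (not VCF output with a CSQ field), a header line beginning with '#Uploaded_variation', and that the 'Feature_type' column holds the values 'Transcript' and 'RegulatoryFeature'. The function name and the sample identifiers are made up for the example, and the result may not match stats_html exactly if options that filter or pick consequences were used.</p>

```python
def count_overlapped_features(lines):
    """Count unique genes, transcripts and regulatory features.

    `lines` is any iterable of text lines from a default (tab-delimited)
    VEP output file. This is an illustrative helper, not part of VEP.
    """
    header = None
    genes, transcripts, regulatory = set(), set(), set()
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and '##' meta lines
        if line.startswith("#"):
            # column header, e.g. '#Uploaded_variation<TAB>Location<TAB>...'
            header = line.lstrip("#").split("\t")
            continue
        if header is None:
            raise ValueError("no header line found before the data rows")
        row = dict(zip(header, line.split("\t")))
        gene = row.get("Gene", "-")
        if gene != "-":  # VEP writes '-' for empty fields
            genes.add(gene)
        feature = row.get("Feature", "-")
        if feature == "-":
            continue
        feature_type = row.get("Feature_type", "-")
        if feature_type == "Transcript":
            transcripts.add(feature)
        elif feature_type == "RegulatoryFeature":
            regulatory.add(feature)
    return {
        "genes": len(genes),
        "transcripts": len(transcripts),
        "regulatory_features": len(regulatory),
    }


# Made-up sample rows, only to show the expected input shape.
SAMPLE = [
    "## ENSEMBL VARIANT EFFECT PREDICTOR\n",
    "#Uploaded_variation\tLocation\tAllele\tGene\tFeature\tFeature_type\tConsequence\n",
    "var1\t1:100\tA\tENSG00000000001\tENST00000000001\tTranscript\tmissense_variant\n",
    "var1\t1:100\tA\tENSG00000000001\tENST00000000002\tTranscript\tintron_variant\n",
    "var2\t1:200\tT\t-\tENSR00000000001\tRegulatoryFeature\tregulatory_region_variant\n",
    "var3\t1:300\tG\t-\t-\t-\tintergenic_variant\n",
]

# prints {'genes': 1, 'transcripts': 2, 'regulatory_features': 1}
print(count_overlapped_features(SAMPLE))
```

<p>For a real file, the same function could be called as count_overlapped_features(open(path)), where path points to the VEP output. Counting unique identifiers rather than rows matters because one variant is typically reported on several rows, one per overlapped feature.</p>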
<p><br>
</p>
<p><br>
</p>
<p>Kind regards,</p>
<p>Irina<br>
</p>
<p><br>
</p>
<div class="moz-cite-prefix">On 12/09/2019 19:27, Linan, Margaret wrote:<br>
</div>
<blockquote type="cite" cite="mid:6bcf55004f5247a4874c04ff54cfa93b@mssm.edu"><style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
<div id="divtagdefaultwrapper" style="font-size:10pt;color:#000000;font-family:Arial,Helvetica,sans-serif;" dir="ltr">
<p>Hi -</p>
<p><br>
</p>
<p>Does anyone know how the VEP command line program's stats_html utility calculates the following (i.e., what VCF columns and operations it uses)?</p>
<blockquote>
<p>- VCF file pre-processing</p>
<p>- Number of overlapped genes</p>
<p>- Number of overlapped transcripts</p>
<p>- Number of overlapped regulatory features<br>
</p>
</blockquote>
<p><br>
</p>
Thank you,
<p>Margaret<br>
</p>
</div>
<br>
<pre class="moz-quote-pre" wrap="">_______________________________________________
Dev mailing list    <a class="moz-txt-link-abbreviated" href="mailto:Dev@ensembl.org">Dev@ensembl.org</a>
Posting guidelines and subscribe/unsubscribe info: <a class="moz-txt-link-freetext" href="https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org">https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org</a>
Ensembl Blog: <a class="moz-txt-link-freetext" href="http://www.ensembl.info/">http://www.ensembl.info/</a>
</pre>
</blockquote>
</div>
</div>
</body>
</html>