<div dir="ltr">Hello,<div><br></div><div>I'm trying to figure out how to programmatically find this URL:<br><a href="ftp://ftp.ensemblgenomes.org/pub/bacteria/release-45/fasta/bacteria_13_collection/pseudomonas_aeruginosa_pao1/dna/">ftp://ftp.ensemblgenomes.org/pub/bacteria/release-45/fasta/bacteria_13_collection/pseudomonas_aeruginosa_pao1/dna/</a><br></div><div><br></div><div>I got that URL by going to <a href="https://bacteria.ensembl.org/Pseudomonas_aeruginosa_pao1/Info/Index/">https://bacteria.ensembl.org/Pseudomonas_aeruginosa_pao1/Info/Index/</a> and clicking a link that said: "Download DNA sequence (FASTA)". However I can't figure out how to get the API to tell me that and I don't want to scrape the HTML for the link.</div><div><br></div><div>Does anyone know how to find that URL for a given organism/strain?</div><div><br></div><div>Thanks,</div><div><br></div><div>- Kurt<br><br>P.S. I solved this problem for divisions other than bacteria by building the URLs with information that the API does provide: <a href="https://github.com/AlexsLemonade/refinebio/blob/dev/foreman/data_refinery_foreman/surveyor/transcriptome_index.py#L48">https://github.com/AlexsLemonade/refinebio/blob/dev/foreman/data_refinery_foreman/surveyor/transcriptome_index.py#L48</a><br><br>However in the FTP server the bacteria are broken up into collections which I'm having trouble figuring out how to determine.</div></div>