Gene EcSMS35_0606 details

Gene Information

Locus tag: EcSMS35_0606
Symbol: entF
ID: 6145499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 615268
End bp: 619149
Gene Length: 3882 bp
Protein Length: 1293 aa
Translation table: 11
GC content: 57%
IMG OID: 641615498
Product: enterobactin synthase subunit F
Protein accession: YP_001742704
Protein GI: 170682592
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
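
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(615268, 619149)
print(length)                  # 3882
print(protein_length(length))  # 1293
```

Under these assumptions both values agree with the gene length and protein length listed in the record.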


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.630178
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG 
TCGGATTTAC CCTCCGCCTG GAGCGTGGCG CATTACGTTG AGTTAACCGG AGAGGTTGAT
GCGCCACTAC TGGCCCGCGC GGTGGTTGCC GGACTAGCGC AAGCAGATAC GTTGCGGATG
CGTTTTACGG AAGATAACGG CGAAGTCTGG CAGTGGGTCG ATGATGCGCA GACGTTCGAA
CTGCCAGAAA TTATCGACCT GCGAACCAAC ATTGATCCGC ACGGTACTGC GCGGGCATTA
ATGCAGGCGG ATTTGCAACA AGATCTGCGC GTCGATAGTG GTAAACCACT GGTCTTTCAT
CAGCTGATAC AGGTGGCGGA TAACCGCTGG TACTGGTATC AGCGTTATCA CCATTTGCTG
GTCGATGGCT TCAGTTTCCC GGCCATTACA CGCCAGATCG CCAATATTTA CTGCGCATGG
CTGCGTGGCG AACCAACGCC TGCTTCGCCG TTTACGCCTT TCGCTGATGT AGTGGAAGAG
TATCAGCAGT ACCGCGAAAG TGAAGCCTGG CAGCGTGATG CGGCATTCTG GGCGGAACAG
CGTCGTCAAC TGCCACCGCC CGCGTCACTT TCTCCGGCAC CTTTACCGTG GCGCAGCGCT
TCGGCAGATA TTTTGCGCCT GAAACTGGAA TTTACCGACG GGGAATTCCG CCAGCTGGCT
ACGCAACTTT CAGGTGTGCA GCGTACCGAT TTAGCCCTTG CGTTGGCAGC TTTGTGGCTG
GGGCGATTGT GCAACCGCAT GGACTACGCT GCCGGATTTA TCTTTATGCG TCGACTGGGC
TCGGCGGCGC TGACGGCTAC CGGACCCGTG CTCAACGTTT TGCCGTTGGG TATTCACATT
GCGGCGCAAG AAACGCTGCC GCAACTGGCA ACCCGACTGG CGGCTCAACT GAAAAAAATG
CGCCGCCATC AACGCTACGA CGCCGAACAA ATTGTCCGTG ACAGTGGGCG AGCGGCAGGA
GAGGAACCGC TGTTCGGTCC GGTACTCAAT ATCAAGGTAT TTGATTACCA ACTGGATATT
CCTGGTGTTC AGGCGCAAAC CCATACCCTG GCAACCGGTC CGGTTAATGA CCTTGAGCTG
GCCCTGTTCC CGGATGAACA CGGTGATTTG AGTATTGAGA TCATCGCCAA TAAACAGCGT
TACGCTGAGC CAACGTTAAT CCAGCATGCT GAACGCCTGA AAATGCTGAT TGCGCAGTTC
GCCGCGGATC CGTCGCTGTT GTGCGGCGAT GTCGATATTA TGCTGCCAGG CGAGTATGCG
CAGCTGGCGC AGATCAACGC CACTCAGGTT GAGATTCCAG AAACCACGCT GAGCGCGCTG
GTGGCAGAAC AAGCGGCAAA AACACCGGAT GCTCCGGCGC TGGTTGATGC GCGTTACCAG
TTCAGCTATC GGGAAATGCG CGAACAGGTG GTGGCGCTGG CGAATCTGCT GCGTGAGCGC
GGCGTTAAAC CAGGCGACAG CGTGGCGGTG GCATTACCGC GCTCGGTCTT TTTGACCCTG
GCGCTACATG CGATTGTTGA AGCTGGTGCG GCCTGGCTAC CGTTGGATAC CGGCTATCCG
GACGATCGCC TGAAAATGAT GCTGGAAGAT GCGCGTCCGT CGCTGTTAAT CACCACCGAC
GATCAACTGC CGCGCTTTGC CGATGTTCCA GATTTAACCC GCCTTTGCTA TAACGCCCCG
CTTACACCAC AGGGCAGTGC GCCGCTGCAG CTTTCACAAC CGCACCACAC GGCTTATATC
ATCTTCACCT CCGGTTCCAC CGGCAGACCA AAAGGGGTAA TGGTCGGGCA GACGGCTATC
GTTAATCGCC TGCTTTGGAT GCAAAACCAT TACCCGCTTA CAGGCGAAGA TGTCGTTGCC
CAAAAAACGC CATGCAGTTT TGATGTCTCG GTGTGGGAGT TTTTCTGGCC GTTTATTGCC
GGGGCGAAAC TTGTGATGGC TGAACCGGAA GCGCACCGCG ACCCGCTCGC TATGCAGCAT
TTTTTTGCTG ATTATGGCGT AACGACCACG CACTTTGTGC CGTCGATGCT GGCGGCATTT
GTTGCATCGC TGACGCCGCA AACCGCTCGC CAGAGTTGCG CGACGTTGAA ACAGGTTTTC
TGTAGTGGTG AAGCCTTACC GGCTGATTTA TGCCGCGAAT GGCAACAGTT AACCGGCGCG
CCGTTGCATA ATCTATATGG CCCGACGGAA GCGGCGGTAG ATGTCAGTTG GTATCCGGCT
TTTGGCGAGG AACTGGCAGA GGTGCGCGGC AGCAGTGTGC CGATTGGTTA TCCGGTGTGG
AATACGGGCC TGCGCATTCT CGATGCGATG ATGCATCCGG TGCCGCCGGG TGTGGCGGGA
GATCTCTATC TCACCGGTAT TCAACTGGCG CAGGGGTATC TTGGACGACC CGATCTGACC
GCCAGCCGCT TTATTGCCGA TCCTTTTGCC CCAGGTGAAC GGATGTACCG TACCGGCGAT
GTGGCCCGCT GGCTGGATAA CGGCGCAGTG GAGTACCTCG GGCGCAGTGA CGATCAGCTA
AAAATTCGCG GGCAGCGTAT CGAACTGGGC GAAATTGATC GCGTGATGCA GGCGCTGCCG
GATGTCGAAC AAGCCGTTAC CCACGCCTGT GTGATTAACC AGGCGGCAGC CACTGGTGGT
GATGCGCGTC AGTTGGTGGG CTATCTGGTG TCGCAATCAG GTCTGCCGTT GGATACCAGC
GCATTACAGG CGCAGCTTCG TGAAACATTG CCACCACATA TGGTACCGGT GGTTCTGCTG
CAACTTCCAC AGTTACCACT TAGCGCCAAC GGCAAGCTGG ATCGCAAAGC CTTACCGTTG
CCTGAACTGA AGGCACAAGC GCCTGGGCGT GCGCCGAAAG CGGGCAGTGA AACGATTATC
GCCGCGGCAT TCTCGTCGTT GCTGGGTTGT GACGTGCAGG ATGCCGATGC TGATTTCTTT
GCGCTTGGCG GTCATTCGCT ACTGGCAATG AAACTGGCTG CGCAGTTAAG TCGGCAGTTT
GCCCGCCAGG TGACGCCGGG GCAAGTGATG GTCGCGTCAA CTGTCGCCAA ACTGGCAACG
ATTATTGATG GTGAAGAAGA CAGCTCCCGG CGCATGGGAT TCGAAACCAT TCTGCCGTTG
CGTGAAGGTA ATGGCCCGAC GCTGTTTTGT TTCCATCCGG CCTCCGGTTT TGCCTGGCAG
TTCAGCGTGC TCTCGCGTTA TCTCGATCCA CAATGGTCGA TTATCGGCAT TCAGTCGCCG
CGCCCCAATG GCCCCATGCA GACCGCGGCA AATCTGGATG AAGTCTGCGA AGCGCATCTG
GCAACGTTAC TTGAGCATCA ACCGCACGGC CCTTATTACC TGCTGGGTTA TTCGCTGGGC
GGTACGCTGG CGCAGGGTAT TGCGGCGCGG CTGCGTGCCC GTGGCGAACA GGTGGCATTT
CTTGGCTTGC TGGATACCTG GCCGCCAGAA ACGCAAAACT GGCAGGAAAA AGAGGCTAAT
GGCCTGGACC CGGAAGTGCT GGCGGAGATT AACCGCGAGC GCGAGGCCTT CCTGGCGGCA
CAGCAGGGAA GTACTTCAAC GGAGTTGTTT ACCACCATTG AAGGCAACTA CGCTGATGCT
GTGCGCCTGC TGACGACTGC TCATAGCGTA CCGTTTGAAG GTAAAGCGAC GCTGTTTGTT
GCTGAACGCA CGCTTCAGGA AGGTATGAAT CCCGAACGCG CCTGGTCGCC GTGGATTACG
GAGCTGGATA TCTATCGTCA GGATTGTGCG CATGTGGATA TTATCTCTCC AGGGGCGTTT
GAAAAAATTG GGCCGATTAT TCGCGCTACG CTAAACAGGT AA
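
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the complete gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.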
 
Protein sequence
MSQHLPLVAA QPGIWMAEKL SDLPSAWSVA HYVELTGEVD APLLARAVVA GLAQADTLRM 
RFTEDNGEVW QWVDDAQTFE LPEIIDLRTN IDPHGTARAL MQADLQQDLR VDSGKPLVFH
QLIQVADNRW YWYQRYHHLL VDGFSFPAIT RQIANIYCAW LRGEPTPASP FTPFADVVEE
YQQYRESEAW QRDAAFWAEQ RRQLPPPASL SPAPLPWRSA SADILRLKLE FTDGEFRQLA
TQLSGVQRTD LALALAALWL GRLCNRMDYA AGFIFMRRLG SAALTATGPV LNVLPLGIHI
AAQETLPQLA TRLAAQLKKM RRHQRYDAEQ IVRDSGRAAG EEPLFGPVLN IKVFDYQLDI
PGVQAQTHTL ATGPVNDLEL ALFPDEHGDL SIEIIANKQR YAEPTLIQHA ERLKMLIAQF
AADPSLLCGD VDIMLPGEYA QLAQINATQV EIPETTLSAL VAEQAAKTPD APALVDARYQ
FSYREMREQV VALANLLRER GVKPGDSVAV ALPRSVFLTL ALHAIVEAGA AWLPLDTGYP
DDRLKMMLED ARPSLLITTD DQLPRFADVP DLTRLCYNAP LTPQGSAPLQ LSQPHHTAYI
IFTSGSTGRP KGVMVGQTAI VNRLLWMQNH YPLTGEDVVA QKTPCSFDVS VWEFFWPFIA
GAKLVMAEPE AHRDPLAMQH FFADYGVTTT HFVPSMLAAF VASLTPQTAR QSCATLKQVF
CSGEALPADL CREWQQLTGA PLHNLYGPTE AAVDVSWYPA FGEELAEVRG SSVPIGYPVW
NTGLRILDAM MHPVPPGVAG DLYLTGIQLA QGYLGRPDLT ASRFIADPFA PGERMYRTGD
VARWLDNGAV EYLGRSDDQL KIRGQRIELG EIDRVMQALP DVEQAVTHAC VINQAAATGG
DARQLVGYLV SQSGLPLDTS ALQAQLRETL PPHMVPVVLL QLPQLPLSAN GKLDRKALPL
PELKAQAPGR APKAGSETII AAAFSSLLGC DVQDADADFF ALGGHSLLAM KLAAQLSRQF
ARQVTPGQVM VASTVAKLAT IIDGEEDSSR RMGFETILPL REGNGPTLFC FHPASGFAWQ
FSVLSRYLDP QWSIIGIQSP RPNGPMQTAA NLDEVCEAHL ATLLEHQPHG PYYLLGYSLG
GTLAQGIAAR LRARGEQVAF LGLLDTWPPE TQNWQEKEAN GLDPEVLAEI NREREAFLAA
QQGSTSTELF TTIEGNYADA VRLLTTAHSV PFEGKATLFV AERTLQEGMN PERAWSPWIT
ELDIYRQDCA HVDIISPGAF EKIGPIIRAT LNR
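
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below builds the codon table from the standard NCBI layout. Translation table 11 uses the same amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG. The function name and the choice of sample input are illustrative.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over the bases T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First line of the gene sequence from the record.
first_line = "ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG"
print(translate(first_line))  # MSQHLPLVAAQPGIWMAEKL
```

The output matches the first twenty residues of the protein sequence. Translating the full gene sequence should end in `*`, since the final codon TAA is a stop codon.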