Gene EcSMS35_3474 details


Gene Information

Locus tag: EcSMS35_3474
Symbol: hflB
ID: 6146516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3551757
End bp: 3553700
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 53%
IMG OID: 641618303
Product: ATP-dependent metalloprotease
Protein accession: YP_001745450
Protein GI: 170682536
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
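
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that does not contribute a residue. The record states neither assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3551757
end_bp = 3553700
gene_length_bp = 1944
protein_length_aa = 647

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A coding sequence should contain a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
residues = codons - 1

print(span == gene_length_bp, remainder == 0, residues == protein_length_aa)
```

Under those assumptions the span, the reading frame, and the protein length all agree with the listed values.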


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000879508
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGTGACA TGGCGAAAAA CCTAATACTC TGGCTGGTCA TTGCCGTTGT GCTGATGTCA 
GTATTCCAGA GCTTTGGGCC CAGCGAGTCT AATGGCCGTA AGGTGGATTA CTCTACCTTC
CTACAAGAGG TCAATAACGA CCAGGTTCGT GAAGCGCGTA TCAACGGACG TGAAATCAAC
GTTACCAAGA AAGATAGTAA CCGTTATACC ACTTACATTC CGGTTCAGGA TCCGAAATTA
CTGGATAACC TGTTGACCAA GAACGTCAAG GTTGTCGGTG AACCGCCTGA AGAACCAAGC
CTGCTGGCTT CTATCTTCAT CTCCTGGTTC CCGATGCTGT TGCTGATTGG TGTCTGGATC
TTCTTCATGC GTCAAATGCA GGGCGGCGGT GGCAAAGGTG CCATGTCGTT TGGTAAGAGC
AAAGCGCGCA TGCTGACGGA AGATCAGATC AAAACGACCT TTGCTGACGT TGCGGGCTGC
GACGAAGCAA AAGAAGAAGT TGCTGAACTG GTTGAGTATC TGCGCGAGCC GAGCCGCTTC
CAGAAACTCG GCGGTAAGAT CCCGAAAGGC GTCCTGATGG TCGGTCCTCC GGGTACCGGT
AAAACGCTGC TGGCGAAAGC GATTGCAGGC GAAGCGAAAG TTCCGTTCTT TACTATCTCC
GGTTCTGACT TCGTAGAAAT GTTCGTCGGT GTGGGTGCAT CCCGTGTTCG TGACATGTTC
GAACAGGCGA AGAAAGCGGC ACCGTGCATC ATCTTTATCG ATGAAATCGA CGCCGTAGGC
CGCCAGCGTG GCGCAGGTCT GGGCGGTGGT CACGATGAAC GTGAACAGAC TTTGAACCAG
ATGCTGGTTG AGATGGATGG CTTCGAAGGT AACGAAGGTA TCATCGTTAT CGCCGCGACT
AACCGTCCGG ACGTTCTTGA CCCGGCCCTG CTGCGTCCTG GCCGTTTCGA CCGTCAGGTT
GTGGTTGGCT TGCCAGATGT TCGTGGTCGT GAGCAGATCC TGAAAGTTCA CATGCGTCGC
GTACCATTGG CACCCGATAT CGACGCGGCA ATCATTGCCC GTGGTACTCC TGGTTTCTCC
GGTGCTGACC TGGCGAACCT GGTGAACGAA GCGGCACTGT TCGCTGCTCG TGGCAACAAA
CGCGTTGTGT CGATGGTTGA GTTCGAGAAA GCGAAAGACA AAATCATGAT GGGTGCGGAA
CGTCGCTCCA TGGTGATGAC GGAAGCGCAG AAAGAATCGA CGGCTTACCA CGAAGCGGGT
CACGCGATTA TCGGTCGCCT GGTGCCGGAA CACGATCCGG TGCACAAAGT AACGATTATC
CCGCGCGGTC GTGCGCTGGG TGTGACCTTC TTCTTGCCTG AGGGCGACGC AATCAGCGCC
AGCCGTCAGA AACTGGAAAG CCAGATTTCT ACGCTGTACG GTGGTCGTCT GGCAGAAGAG
ATCATCTACG GGCCGGAACA TGTTTCTACC GGTGCGTCCA ACGATATTAA AGTTGCGACC
AACCTGGCAC GTAACATGGT GACTCAGTGG GGTTTCTCTG AGAAACTCGG TCCGCTGCTG
TATGCGGAAG AAGAAGGTGA AGTGTTCCTC GGCCGTAGCG TAGCGAAAGC GAAACATATG
TCCGATGAAA CTGCACGTAT CATCGACCAG GAAGTGAAAG CACTGATTGA GCGTAACTAT
AATCGTGCGC GTCAGCTTCT GACCGACAAT ATGGATATTC TGCATGCGAT GAAAGATGCT
CTCATGAAAT ATGAGACTAT CGACGCACCG CAGATTGATG ACCTGATGGC ACGTCGCGAT
GTACGTCCGC CAGCGGGCTG GGAAGAACCA GTCGCTTCTA ACAATTCTGG CGACAATGGT
AGTCCAAAGG CTCCACGTCC GGTTGATGAA CCGCGTACGC CGAACCCGGG TAACACCATG
TCAGAGCAGT TAGGCGACAA GTAA
 
Protein sequence
MSDMAKNLIL WLVIAVVLMS VFQSFGPSES NGRKVDYSTF LQEVNNDQVR EARINGREIN 
VTKKDSNRYT TYIPVQDPKL LDNLLTKNVK VVGEPPEEPS LLASIFISWF PMLLLIGVWI
FFMRQMQGGG GKGAMSFGKS KARMLTEDQI KTTFADVAGC DEAKEEVAEL VEYLREPSRF
QKLGGKIPKG VLMVGPPGTG KTLLAKAIAG EAKVPFFTIS GSDFVEMFVG VGASRVRDMF
EQAKKAAPCI IFIDEIDAVG RQRGAGLGGG HDEREQTLNQ MLVEMDGFEG NEGIIVIAAT
NRPDVLDPAL LRPGRFDRQV VVGLPDVRGR EQILKVHMRR VPLAPDIDAA IIARGTPGFS
GADLANLVNE AALFAARGNK RVVSMVEFEK AKDKIMMGAE RRSMVMTEAQ KESTAYHEAG
HAIIGRLVPE HDPVHKVTII PRGRALGVTF FLPEGDAISA SRQKLESQIS TLYGGRLAEE
IIYGPEHVST GASNDIKVAT NLARNMVTQW GFSEKLGPLL YAEEEGEVFL GRSVAKAKHM
SDETARIIDQ EVKALIERNY NRARQLLTDN MDILHAMKDA LMKYETIDAP QIDDLMARRD
VRPPAGWEEP VASNNSGDNG SPKAPRPVDE PRTPNPGNTM SEQLGDK
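
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of such a translation, not the pipeline used to produce this record. The function name `translate_cds` is hypothetical. The codon table is built from the standard NCBI ordering, and the start codon set is the one listed for table 11.

```python
from itertools import product

# Standard NCBI codon ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons of translation table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine whatever it normally encodes.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_line = "TTGAGTGACA TGGCGAAAAA CCTAATACTC TGGCTGGTCA TTGCCGTTGT GCTGATGTCA"
print(translate_cds(first_line))
```

Applied to the first 60 bases, the sketch yields the first 20 residues of the listed protein sequence, MSDMAKNLILWLVIAVVLMS.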