Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3370 |
Symbol | hflB |
ID | 5593402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3375927 |
End bp | 3377870 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922490 |
Product | ATP-dependent metalloprotease |
Protein accession | YP_001459979 |
Protein GI | 157162661 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1.23805e-17 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTGACA TGGCGAAAAA CCTAATACTC TGGCTGGTCA TTGCCGTTGT GCTGATGTCA GTATTCCAGA GCTTTGGGCC CAGCGAGTCT AATGGCCGTA AGGTGGATTA CTCTACCTTC CTACAAGAGG TCAATAACGA CCAGGTTCGT GAAGCGCGTA TCAACGGACG TGAAATCAAC GTTACCAAGA AAGATAGTAA CCGTTATACC ACTTACATTC CGGTTCAGGA TCCGAAATTA CTGGATAACC TGTTGACCAA GAACGTCAAG GTTGTCGGTG AACCGCCTGA AGAACCAAGC CTGCTGGCTT CTATCTTCAT CTCCTGGTTC CCGATGCTGT TGCTGATTGG TGTCTGGATC TTCTTCATGC GTCAAATGCA GGGCGGCGGT GGCAAAGGTG CCATGTCGTT TGGTAAGAGC AAAGCGCGCA TGCTGACGGA AGATCAGATC AAAACGACCT TTGCTGACGT TGCGGGCTGC GACGAAGCAA AAGAAGAAGT TGCTGAACTG GTTGAGTATC TGCGCGAGCC GAGCCGCTTC CAGAAACTCG GCGGTAAGAT CCCGAAAGGC GTCTTGATGG TCGGTCCTCC GGGTACCGGT AAAACGCTGC TGGCGAAAGC GATTGCAGGC GAAGCGAAAG TTCCGTTCTT TACTATCTCC GGTTCTGACT TCGTAGAAAT GTTCGTCGGT GTGGGTGCAT CCCGTGTTCG TGACATGTTC GAACAGGCGA AGAAAGCGGC ACCGTGCATC ATCTTTATCG ATGAAATCGA CGCCGTAGGC CGCCAGCGTG GCGCTGGTCT GGGCGGTGGT CACGATGAAC GTGAACAGAC TCTGAACCAG ATGCTGGTTG AGATGGATGG CTTCGAAGGT AACGAAGGTA TCATCGTTAT CGCCGCGACT AACCGTCCGG ACGTTCTCGA CCCGGCCCTG CTGCGTCCTG GCCGTTTCGA CCGTCAGGTT GTGGTCGGCT TGCCAGATGT TCGCGGTCGT GAGCAGATCC TGAAAGTTCA CATGCGTCGC GTACCATTGG CACCCGATAT CGACGCGGCA ATCATTGCCC GTGGTACTCC TGGTTTCTCC GGTGCTGACC TGGCGAACCT GGTGAACGAA GCGGCACTGT TCGCTGCTCG TGGCAACAAA CGCGTTGTGT CGATGGTTGA GTTCGAGAAA GCGAAAGACA AAATCATGAT GGGTGCGGAA CGTCGCTCCA TGGTGATGAC GGAAGCGCAG AAAGAATCGA CGGCTTACCA CGAAGCGGGT CATGCGATTA TCGGTCGCCT GGTGCCGGAA CACGATCCGG TGCACAAAGT GACGATTATC CCACGCGGTC GTGCGCTGGG TGTGACTTTC TTCTTGCCTG AGGGCGACGC AATCAGCGCC AGCCGTCAGA AACTGGAAAG CCAGATTTCT ACGCTGTACG GTGGTCGTCT GGCAGAAGAG ATCATCTACG GGCCGGAACA TGTATCTACC GGTGCGTCCA ACGATATTAA AGTTGCGACC AACCTGGCAC GTAACATGGT GACTCAGTGG GGCTTCTCTG AGAAATTGGG CCCACTGCTG TACGCGGAAG AAGAAGGTGA AGTGTTCCTC GGCCGTAGCG TAGCGAAAGC GAAACATATG TCCGATGAAA CTGCACGTAT CATCGACCAG GAAGTGAAAG CACTGATTGA GCGTAACTAT AATCGTGCGC GTCAGCTTCT GACCGACAAT ATGGATATTC TGCATGCGAT GAAAGATGCT CTCATGAAAT ATGAGACTAT CGACGCACCG CAGATTGATG ACCTGATGGC ACGTCGCGAT GTACGTCCGC CAGCGGGCTG GGAAGAACCA GGCGCTTCTA ACAATTCTGG CGACAATGGT AGTCCAAAGG CTCCTCGTCC GGTTGATGAA CCGCGTACGC CGAACCCGGG TAACACCATG TCAGAGCAGT TAGGCGACAA GTAA
|
Protein sequence | MSDMAKNLIL WLVIAVVLMS VFQSFGPSES NGRKVDYSTF LQEVNNDQVR EARINGREIN VTKKDSNRYT TYIPVQDPKL LDNLLTKNVK VVGEPPEEPS LLASIFISWF PMLLLIGVWI FFMRQMQGGG GKGAMSFGKS KARMLTEDQI KTTFADVAGC DEAKEEVAEL VEYLREPSRF QKLGGKIPKG VLMVGPPGTG KTLLAKAIAG EAKVPFFTIS GSDFVEMFVG VGASRVRDMF EQAKKAAPCI IFIDEIDAVG RQRGAGLGGG HDEREQTLNQ MLVEMDGFEG NEGIIVIAAT NRPDVLDPAL LRPGRFDRQV VVGLPDVRGR EQILKVHMRR VPLAPDIDAA IIARGTPGFS GADLANLVNE AALFAARGNK RVVSMVEFEK AKDKIMMGAE RRSMVMTEAQ KESTAYHEAG HAIIGRLVPE HDPVHKVTII PRGRALGVTF FLPEGDAISA SRQKLESQIS TLYGGRLAEE IIYGPEHVST GASNDIKVAT NLARNMVTQW GFSEKLGPLL YAEEEGEVFL GRSVAKAKHM SDETARIIDQ EVKALIERNY NRARQLLTDN MDILHAMKDA LMKYETIDAP QIDDLMARRD VRPPAGWEEP GASNNSGDNG SPKAPRPVDE PRTPNPGNTM SEQLGDK
|
| |