Gene ECH74115_4500 details

Gene Information

Locus tag: ECH74115_4500
Symbol: hflB
ID: 6968365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4169266
End bp: 4171200
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 54%
IMG OID: 643388213
Product: ATP-dependent metalloprotease
Protein accession: YP_002272648
Protein GI: 209399479
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
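
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4169266
end_bp = 4171200
gene_length_bp = 1935
protein_length_aa = 644

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
expected_residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("residue count matches protein length:", expected_residues == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values (1935 bp, 645 codons, 644 residues).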


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000284191
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAAA ACCTAATACT CTGGCTGGTC ATTGCCGTTG TGCTGATGTC AGTATTCCAG 
AGCTTTGGGC CCAGCGAGTC TAATGGCCGT AAGGTGGATT ACTCTACCTT CCTACAAGAG
GTCAATAACG ACCAGGTTCG TGAAGCGCGT ATCAACGGAC GTGAAATCAA CGTTACCAAG
AAAGATAGTA ACCGTTATAC CACTTACATT CCGGTTCAGG ATCCGAAATT ACTGGATAAC
CTGTTGACCA AGAACGTCAA GGTTGTCGGT GAACCGCCTG AAGAACCCAG CCTGCTGGCT
TCTATCTTCA TCTCCTGGTT CCCGATGCTG TTGCTGATTG GTGTCTGGAT CTTCTTCATG
CGTCAAATGC AGGGCGGCGG TGGCAAAGGT GCCATGTCGT TTGGTAAGAG TAAAGCGCGC
ATGCTGACGG AAGATCAGAT CAAAACGACC TTTGCTGACG TTGCGGGCTG CGACGAAGCA
AAAGAAGAAG TTGCTGAACT GGTTGAGTAT CTGCGCGAGC CGAGCCGCTT CCAGAAACTC
GGCGGTAAGA TCCCGAAAGG CGTCCTGATG GTCGGTCCTC CGGGTACCGG TAAAACGCTG
CTGGCGAAAG CGATTGCAGG CGAAGCGAAA GTTCCGTTCT TTACTATCTC CGGTTCTGAC
TTCGTAGAAA TGTTCGTCGG TGTGGGTGCA TCCCGTGTTC GTGACATGTT CGAACAGGCG
AAGAAAGCGG CACCGTGCAT CATCTTTATC GATGAAATCG ACGCCGTAGG CCGCCAGCGT
GGCGCAGGTC TGGGCGGTGG TCACGATGAA CGTGAACAGA CTCTGAACCA GATGCTGGTT
GAGATGGATG GCTTCGAAGG TAACGAAGGT ATCATCGTTA TCGCCGCGAC TAACCGTCCG
GACGTTCTCG ACCCAGCGCT GCTGCGTCCT GGCCGTTTCG ACCGTCAGGT TGTGGTTGGC
TTGCCAGATG TTCGCGGTCG TGAGCAGATC CTGAAGGTTC ACATGCGTCG CGTACCATTG
GCACCCGATA TCGACGCGGC AATCATTGCC CGTGGTACTC CTGGTTTCTC CGGTGCTGAC
CTGGCGAACC TGGTGAACGA AGCGGCACTG TTCGCTGCTC GTGGCAACAA ACGCGTTGTG
TCGATGGTTG AGTTCGAGAA AGCGAAAGAC AAAATCATGA TGGGTGCGGA ACGTCGCTCC
ATGGTGATGA CGGAAGCGCA GAAAGAATCG ACGGCTTACC ACGAAGCGGG TCATGCGATT
ATCGGTCGCC TGGTGCCGGA GCACGATCCG GTGCACAAAG TGACGATTAT CCCACGCGGT
CGTGCGCTGG GTGTGACTTT CTTCTTGCCT GAGGGCGACG CAATCAGCGC CAGCCGTCAG
AAACTGGAAA GCCAGATTTC TACGCTGTAC GGTGGTCGTC TGGCAGAAGA GATCATCTAC
GGGCCGGAAC ATGTTTCTAC CGGTGCGTCC AACGATATTA AAGTTGCGAC CAACCTGGCA
CGTAACATGG TGACTCAGTG GGGCTTCTCT GAGAAATTGG GTCCACTGCT GTACGCGGAA
GAAGAAGGTG AAGTGTTCCT CGGCCGTAGC GTAGCGAAAG CGAAACATAT GTCCGATGAA
ACTGCACGTA TCATCGACCA GGAAGTGAAA GCACTGATTG AGCGTAACTA TAATCGTGCG
CGTCAGCTTC TGACCGACAA TATGGATATT CTGCATGCGA TGAAAGATGC TCTCATGAAA
TATGAGACTA TCGACGCACC GCAGATTGAT GACCTGATGG CACGTCGCGA TGTACGTCCG
CCAGCGGGCT GGGAAGAACC AGGCGCTTCT AACAATGCTG GCGACAATGG TAGTCCAAAG
GCTCCTCGTC CGGTTGATGA ACCGCGTACG CCGAACCCGG GTAACACCAT GTCAGAGCAG
TTAGGCGACA AGTAA
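
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation such as a GC fraction. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs on only the first two blocks of the sequence, so its result is not expected to match the 54% reported for the whole gene.

```python
def clean_sequence(text):
    """Remove all whitespace so the ten-base blocks join into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, copied from the listing.
fragment = clean_sequence("ATGGCGAAAA ACCTAATACT")
print(len(fragment), gc_fraction(fragment))
```

To obtain the gene-wide figure, the full sequence text would be passed through `clean_sequence` in the same way before calling `gc_fraction`.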
 
Protein sequence
MAKNLILWLV IAVVLMSVFQ SFGPSESNGR KVDYSTFLQE VNNDQVREAR INGREINVTK 
KDSNRYTTYI PVQDPKLLDN LLTKNVKVVG EPPEEPSLLA SIFISWFPML LLIGVWIFFM
RQMQGGGGKG AMSFGKSKAR MLTEDQIKTT FADVAGCDEA KEEVAELVEY LREPSRFQKL
GGKIPKGVLM VGPPGTGKTL LAKAIAGEAK VPFFTISGSD FVEMFVGVGA SRVRDMFEQA
KKAAPCIIFI DEIDAVGRQR GAGLGGGHDE REQTLNQMLV EMDGFEGNEG IIVIAATNRP
DVLDPALLRP GRFDRQVVVG LPDVRGREQI LKVHMRRVPL APDIDAAIIA RGTPGFSGAD
LANLVNEAAL FAARGNKRVV SMVEFEKAKD KIMMGAERRS MVMTEAQKES TAYHEAGHAI
IGRLVPEHDP VHKVTIIPRG RALGVTFFLP EGDAISASRQ KLESQISTLY GGRLAEEIIY
GPEHVSTGAS NDIKVATNLA RNMVTQWGFS EKLGPLLYAE EEGEVFLGRS VAKAKHMSDE
TARIIDQEVK ALIERNYNRA RQLLTDNMDI LHAMKDALMK YETIDAPQID DLMARRDVRP
PAGWEEPGAS NNAGDNGSPK APRPVDEPRT PNPGNTMSEQ LGDK
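
The relationship between the two sequences can be illustrated by translating the start of the gene sequence and comparing it with the start of the protein sequence. This is a minimal sketch under one stated assumption: translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the tables differ in which codons may act as start codons), so the standard codon table is used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Assumption: these are also the assignments used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and the first 20 residues of the protein.
first_line = "ATGGCGAAAA ACCTAATACT CTGGCTGGTC ATTGCCGTTG TGCTGATGTC AGTATTCCAG"
expected = "MAKNLILWLV IAVVLMSVFQ".replace(" ", "")
print(translate(first_line) == expected)
```

The first 60 bases give the first 20 residues of the listed protein, and the final codons `GACAAGTAA` give `DK` followed by a stop, which agrees with the end of the protein sequence.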