Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4500 |
Symbol | hflB |
ID | 6968365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4169266 |
End bp | 4171200 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388213 |
Product | ATP-dependent metalloprotease |
Protein accession | YP_002272648 |
Protein GI | 209399479 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000284191 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAAA ACCTAATACT CTGGCTGGTC ATTGCCGTTG TGCTGATGTC AGTATTCCAG AGCTTTGGGC CCAGCGAGTC TAATGGCCGT AAGGTGGATT ACTCTACCTT CCTACAAGAG GTCAATAACG ACCAGGTTCG TGAAGCGCGT ATCAACGGAC GTGAAATCAA CGTTACCAAG AAAGATAGTA ACCGTTATAC CACTTACATT CCGGTTCAGG ATCCGAAATT ACTGGATAAC CTGTTGACCA AGAACGTCAA GGTTGTCGGT GAACCGCCTG AAGAACCCAG CCTGCTGGCT TCTATCTTCA TCTCCTGGTT CCCGATGCTG TTGCTGATTG GTGTCTGGAT CTTCTTCATG CGTCAAATGC AGGGCGGCGG TGGCAAAGGT GCCATGTCGT TTGGTAAGAG TAAAGCGCGC ATGCTGACGG AAGATCAGAT CAAAACGACC TTTGCTGACG TTGCGGGCTG CGACGAAGCA AAAGAAGAAG TTGCTGAACT GGTTGAGTAT CTGCGCGAGC CGAGCCGCTT CCAGAAACTC GGCGGTAAGA TCCCGAAAGG CGTCCTGATG GTCGGTCCTC CGGGTACCGG TAAAACGCTG CTGGCGAAAG CGATTGCAGG CGAAGCGAAA GTTCCGTTCT TTACTATCTC CGGTTCTGAC TTCGTAGAAA TGTTCGTCGG TGTGGGTGCA TCCCGTGTTC GTGACATGTT CGAACAGGCG AAGAAAGCGG CACCGTGCAT CATCTTTATC GATGAAATCG ACGCCGTAGG CCGCCAGCGT GGCGCAGGTC TGGGCGGTGG TCACGATGAA CGTGAACAGA CTCTGAACCA GATGCTGGTT GAGATGGATG GCTTCGAAGG TAACGAAGGT ATCATCGTTA TCGCCGCGAC TAACCGTCCG GACGTTCTCG ACCCAGCGCT GCTGCGTCCT GGCCGTTTCG ACCGTCAGGT TGTGGTTGGC TTGCCAGATG TTCGCGGTCG TGAGCAGATC CTGAAGGTTC ACATGCGTCG CGTACCATTG GCACCCGATA TCGACGCGGC AATCATTGCC CGTGGTACTC CTGGTTTCTC CGGTGCTGAC CTGGCGAACC TGGTGAACGA AGCGGCACTG TTCGCTGCTC GTGGCAACAA ACGCGTTGTG TCGATGGTTG AGTTCGAGAA AGCGAAAGAC AAAATCATGA TGGGTGCGGA ACGTCGCTCC ATGGTGATGA CGGAAGCGCA GAAAGAATCG ACGGCTTACC ACGAAGCGGG TCATGCGATT ATCGGTCGCC TGGTGCCGGA GCACGATCCG GTGCACAAAG TGACGATTAT CCCACGCGGT CGTGCGCTGG GTGTGACTTT CTTCTTGCCT GAGGGCGACG CAATCAGCGC CAGCCGTCAG AAACTGGAAA GCCAGATTTC TACGCTGTAC GGTGGTCGTC TGGCAGAAGA GATCATCTAC GGGCCGGAAC ATGTTTCTAC CGGTGCGTCC AACGATATTA AAGTTGCGAC CAACCTGGCA CGTAACATGG TGACTCAGTG GGGCTTCTCT GAGAAATTGG GTCCACTGCT GTACGCGGAA GAAGAAGGTG AAGTGTTCCT CGGCCGTAGC GTAGCGAAAG CGAAACATAT GTCCGATGAA ACTGCACGTA TCATCGACCA GGAAGTGAAA GCACTGATTG AGCGTAACTA TAATCGTGCG CGTCAGCTTC TGACCGACAA TATGGATATT CTGCATGCGA TGAAAGATGC TCTCATGAAA TATGAGACTA TCGACGCACC GCAGATTGAT GACCTGATGG CACGTCGCGA TGTACGTCCG CCAGCGGGCT GGGAAGAACC AGGCGCTTCT AACAATGCTG GCGACAATGG TAGTCCAAAG GCTCCTCGTC CGGTTGATGA ACCGCGTACG CCGAACCCGG GTAACACCAT GTCAGAGCAG TTAGGCGACA AGTAA
|
Protein sequence | MAKNLILWLV IAVVLMSVFQ SFGPSESNGR KVDYSTFLQE VNNDQVREAR INGREINVTK KDSNRYTTYI PVQDPKLLDN LLTKNVKVVG EPPEEPSLLA SIFISWFPML LLIGVWIFFM RQMQGGGGKG AMSFGKSKAR MLTEDQIKTT FADVAGCDEA KEEVAELVEY LREPSRFQKL GGKIPKGVLM VGPPGTGKTL LAKAIAGEAK VPFFTISGSD FVEMFVGVGA SRVRDMFEQA KKAAPCIIFI DEIDAVGRQR GAGLGGGHDE REQTLNQMLV EMDGFEGNEG IIVIAATNRP DVLDPALLRP GRFDRQVVVG LPDVRGREQI LKVHMRRVPL APDIDAAIIA RGTPGFSGAD LANLVNEAAL FAARGNKRVV SMVEFEKAKD KIMMGAERRS MVMTEAQKES TAYHEAGHAI IGRLVPEHDP VHKVTIIPRG RALGVTFFLP EGDAISASRQ KLESQISTLY GGRLAEEIIY GPEHVSTGAS NDIKVATNLA RNMVTQWGFS EKLGPLLYAE EEGEVFLGRS VAKAKHMSDE TARIIDQEVK ALIERNYNRA RQLLTDNMDI LHAMKDALMK YETIDAPQID DLMARRDVRP PAGWEEPGAS NNAGDNGSPK APRPVDEPRT PNPGNTMSEQ LGDK
|
| |