Gene EcolC_0522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0522 
SymbolhflB 
ID6068720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp561425 
End bp563359 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content54% 
IMG OID641599927 
ProductATP-dependent metalloprotease 
Protein accessionYP_001723526 
Protein GI170018572 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000177959 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00179189 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGAAAA ACCTAATACT CTGGCTGGTC ATTGCCGTTG TGCTGATGTC AGTATTCCAG 
AGCTTTGGGC CCAGCGAGTC TAATGGCCGT AAGGTGGATT ACTCTACCTT CCTACAAGAG
GTCAATAACG ACCAGGTTCG TGAAGCGCGT ATCAACGGAC GTGAAATCAA CGTTACCAAG
AAAGATAGTA ACCGTTATAC CACTTACATT CCGGTTCAGG ATCCGAAATT ACTGGATAAC
CTGTTGACCA AGAACGTCAA GGTTGTCGGT GAACCGCCTG AAGAACCAAG CCTGCTGGCT
TCTATCTTCA TCTCCTGGTT CCCGATGCTG TTGCTGATTG GTGTCTGGAT CTTCTTCATG
CGTCAAATGC AGGGCGGCGG TGGCAAAGGT GCCATGTCGT TTGGTAAGAG CAAAGCGCGC
ATGCTGACGG AAGATCAGAT CAAAACGACC TTTGCTGACG TTGCGGGCTG CGACGAAGCA
AAAGAAGAAG TTGCTGAACT GGTTGAGTAT CTGCGCGAGC CGAGCCGCTT CCAGAAACTC
GGCGGTAAGA TCCCGAAAGG CGTCTTGATG GTCGGTCCTC CGGGTACCGG TAAAACGCTG
CTGGCGAAAG CGATTGCAGG CGAAGCGAAA GTTCCGTTCT TTACTATCTC CGGTTCTGAC
TTCGTAGAAA TGTTCGTCGG TGTGGGTGCA TCCCGTGTTC GTGACATGTT CGAACAGGCG
AAGAAAGCGG CACCGTGCAT CATCTTTATC GATGAAATCG ACGCCGTAGG CCGCCAGCGT
GGCGCTGGTC TGGGCGGTGG TCACGATGAA CGTGAACAGA CTCTGAACCA GATGCTGGTT
GAGATGGATG GCTTCGAAGG TAACGAAGGT ATCATCGTTA TCGCCGCGAC TAACCGTCCG
GACGTTCTCG ACCCGGCCCT GCTGCGTCCT GGCCGTTTCG ACCGTCAGGT TGTGGTCGGC
TTGCCAGATG TTCGCGGTCG TGAGCAGATC CTGAAAGTTC ACATGCGTCG CGTACCATTG
GCACCCGATA TCGACGCGGC AATCATTGCC CGTGGTACTC CTGGTTTCTC CGGTGCTGAC
CTGGCGAACC TGGTGAACGA AGCGGCACTG TTCGCTGCTC GTGGCAACAA ACGCGTTGTG
TCGATGGTTG AGTTCGAGAA AGCGAAAGAC AAAATCATGA TGGGTGCGGA ACGTCGCTCC
ATGGTGATGA CGGAAGCGCA GAAAGAATCG ACGGCTTACC ACGAAGCGGG TCATGCGATT
ATCGGTCGCC TGGTGCCGGA ACACGATCCG GTGCACAAAG TGACGATTAT CCCACGCGGT
CGTGCGCTGG GTGTGACTTT CTTCTTGCCT GAGGGCGACG CAATCAGCGC CAGCCGTCAG
AAACTGGAAA GCCAGATTTC TACGCTGTAC GGTGGTCGTC TGGCAGAAGA GATCATCTAC
GGGCCGGAAC ATGTATCTAC CGGTGCGTCC AACGATATTA AAGTTGCGAC CAACCTGGCA
CGTAACATGG TGACTCAGTG GGGCTTCTCT GAGAAATTGG GTCCACTGCT GTACGCGGAA
GAAGAAGGTG AAGTGTTCCT CGGCCGTAGC GTAGCGAAAG CGAAACATAT GTCCGATGAA
ACTGCACGTA TCATCGACCA GGAAGTGAAA GCACTGATTG AGCGTAACTA TAATCGTGCG
CGTCAGCTTC TGACCGACAA TATGGATATT CTGCATGCGA TGAAAGATGC TCTCATGAAA
TATGAGACTA TCGACGCACC GCAGATTGAT GACCTGATGG CACGTCGCGA TGTACGTCCG
CCAGCGGGCT GGGAAGAACC AGGCGCTTCT AACAATTCTG GCGACAATGG TAGTCCAAAG
GCTCCTCGTC CGGTTGATGA ACCGCGTACG CCGAACCCGG GTAACACCAT GTCAGAGCAG
TTAGGCGACA AGTAA
 
Protein sequence
MAKNLILWLV IAVVLMSVFQ SFGPSESNGR KVDYSTFLQE VNNDQVREAR INGREINVTK 
KDSNRYTTYI PVQDPKLLDN LLTKNVKVVG EPPEEPSLLA SIFISWFPML LLIGVWIFFM
RQMQGGGGKG AMSFGKSKAR MLTEDQIKTT FADVAGCDEA KEEVAELVEY LREPSRFQKL
GGKIPKGVLM VGPPGTGKTL LAKAIAGEAK VPFFTISGSD FVEMFVGVGA SRVRDMFEQA
KKAAPCIIFI DEIDAVGRQR GAGLGGGHDE REQTLNQMLV EMDGFEGNEG IIVIAATNRP
DVLDPALLRP GRFDRQVVVG LPDVRGREQI LKVHMRRVPL APDIDAAIIA RGTPGFSGAD
LANLVNEAAL FAARGNKRVV SMVEFEKAKD KIMMGAERRS MVMTEAQKES TAYHEAGHAI
IGRLVPEHDP VHKVTIIPRG RALGVTFFLP EGDAISASRQ KLESQISTLY GGRLAEEIIY
GPEHVSTGAS NDIKVATNLA RNMVTQWGFS EKLGPLLYAE EEGEVFLGRS VAKAKHMSDE
TARIIDQEVK ALIERNYNRA RQLLTDNMDI LHAMKDALMK YETIDAPQID DLMARRDVRP
PAGWEEPGAS NNSGDNGSPK APRPVDEPRT PNPGNTMSEQ LGDK