Gene ECH74115_0113 details


Gene Information

Locus tag: ECH74115_0113
Symbol: hofB
ID: 6968055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 120184
End bp: 121569
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 54%
IMG OID: 643384190
Product: hypothetical protein
Protein accession: YP_002268713
Protein GI: 209396061
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
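
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 120184
end_bp = 121569
gene_length_bp = 1386
protein_length_aa = 461

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1  # drop the stop codon

print("span from coordinates:", span_bp)
print("codons in gene:", codons, "remainder:", remainder)
print("expected protein length:", expected_protein_length)
print("matches record:", span_bp == gene_length_bp
      and expected_protein_length == protein_length_aa)
```

Under these assumptions the coordinate span, the gene length, and the protein length reported in the record agree with one another.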


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00764351
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTC CACAGCTCAC GGCCCTGTGC CTGCGTTATC AGGGAGTCTT GCTGGATGCC 
AGCGAAGAAG TGGTTCATGT TGCGGTGGTC GATGCCCCCT CACATGAGTT GCTGGACGCA
TTGCATTTCG CTACCACCAA ACGTATTGAG ATCACCTGCT GGACGCGCCA ACAAATGGAA
GGTCACGCCA GTCGCACACA ACAGACATTG CCCGTAGCTG TTCAGGAGAA GCATCAGCCC
AAAGCAGAGT TGCTGACTCG AACGTTACAA TCTGCGCTGG AACAACGCGC GTCTGATATT
CATATCGAAC CAGCGGACAA TGCCTACCGC ATCCGCTTGC GTATCGACGG CGTATTGCAT
CCTTTACCGG ATGTTTCACC GGATGCCGGA GTCGCATTAA CCGCCAGATT AAAAGTGCTG
GGAAACCTGG ATATTGCGGA ACATCGCCTG CCGCAGGACG GGCAATTCAC TGTCGAACTG
GCAGGAAACG CCGTCTCATT TCGTATTGCG ACCTTACCAT GTCGGGGTGG TGAAAAGGTG
GTATTAAGGT TGTTACAGCA GGTGAGCCAG GCACTGGATG TTAACACGCT GGGAATGCAG
CCGTTACAAC TGGCGGGCTT TGCTCATGCC TTGCAACAAC CACAGGGACT GGTGCTGGTA
ACTGGCCCTA CCGGCAGCGG CAAAACGGTC ACGCTTTATA GTGCCCTGCA AAAGCTGAAT
ACCGCTGACA TTAATATTTG TAGCGTCGAA GATCCGGTTG AGATCCCCAT AGCCGGACTA
AACCAGACGC AAATCCATCC GCGTGCCGGA CTCACCTTTC AGGGCGTTTT GCGTGCGTTA
TTGCGCCAGG ATCCTGACGT CATCATGATC GGAGAGATCC GCGATGGCGA AACGGCAGAA
ATTGCCATTA AAGCCGCGCA AACCGGTCAC CTGGTGTTGT CTACCCTACA CACTAATTCC
ACCTGCGAAA CGCTGGTACG TTTACAGCAA ATGGGGGTCG CCCGCTGGAT GCTATCATCG
GCGCTTACGC TGGTAATAGC CCAGCGTCTG GTACGCAAAC TTTGCCCACA TTGTCGCCGG
CAGCAAGGGG AGCCCATCCA CATTCCAGTC AATGTATGGC CGTCGCCGCT GCCCCACTGG
CAAGCACCCG GTTGTGTACA TTGCTACCAC GGTTTTTATG GTCGCACTGC CTTATTTGAA
GTTCTGCCCA TAACACCGGT CATACGTCAG CTTATTTCCG CTAATACCGA CGTTGAATCG
CTGGAAACGC ACGCCCGACA GGCGGGTATG CGAACGCTTT TTGAAAACGG CTGCCTGGCC
GTGGAGCAAG GCTTAACCAC CTTTGAAGAG TTAATCCGCG TATTGGGGAT GCCGCATGGC
GAGTAA
 
Protein sequence
MNIPQLTALC LRYQGVLLDA SEEVVHVAVV DAPSHELLDA LHFATTKRIE ITCWTRQQME 
GHASRTQQTL PVAVQEKHQP KAELLTRTLQ SALEQRASDI HIEPADNAYR IRLRIDGVLH
PLPDVSPDAG VALTARLKVL GNLDIAEHRL PQDGQFTVEL AGNAVSFRIA TLPCRGGEKV
VLRLLQQVSQ ALDVNTLGMQ PLQLAGFAHA LQQPQGLVLV TGPTGSGKTV TLYSALQKLN
TADINICSVE DPVEIPIAGL NQTQIHPRAG LTFQGVLRAL LRQDPDVIMI GEIRDGETAE
IAIKAAQTGH LVLSTLHTNS TCETLVRLQQ MGVARWMLSS ALTLVIAQRL VRKLCPHCRR
QQGEPIHIPV NVWPSPLPHW QAPGCVHCYH GFYGRTALFE VLPITPVIRQ LISANTDVES
LETHARQAGM RTLFENGCLA VEQGLTTFEE LIRVLGMPHG E
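
The relationship between the gene sequence and the protein sequence above can be illustrated by translating a few codons by hand. The following is a minimal sketch, not the pipeline that produced this record: it builds a codon table from the standard amino acid assignments (which translation table 11 shares with the standard code; the two differ in permitted start codons), and the function name `translate` is hypothetical. Only the first 30 and last 6 nucleotides of the gene sequence are used, with the spacing removed.

```python
from itertools import product

# Amino acid assignments in T, C, A, G codon order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


# First three 10-base blocks of the gene sequence in this record.
first_30 = "ATGAATATTC CACAGCTCAC GGCCCTGTGC"
# Final line of the gene sequence in this record.
last_6 = "GAGTAA"

print(translate(first_30))  # start of the protein
print(translate(last_6))    # last residue followed by the stop codon
```

The first call yields the opening ten residues of the protein sequence, and the second shows that the gene ends with a glutamate codon followed by a stop codon, which is why the protein is one residue shorter than the number of codons.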