Gene ECH74115_5098 details

Gene Information

Locus tag: ECH74115_5098
Symbol: uhpB
ID: 6967523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4738870
End bp: 4740375
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 57%
IMG OID: 643388773
Product: sensory histidine kinase UhpB
Protein accession: YP_002273199
Protein GI: 209397016
COG category: [T] Signal transduction mechanisms
COG ID: [COG3851] Signal transduction histidine kinase, glucose-6-phosphate specific
TIGRFAM ID:
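
The length fields above agree with one another under the usual coordinate conventions. A minimal sketch of that check follows, assuming the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given inclusive start and end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for a CDS, assuming exactly one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(4738870, 4740375)
print(length)                     # 1506
print(protein_length_aa(length))  # 501
```

Both values match the Gene Length and Protein Length fields listed in the record.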


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.118457
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATGAAGA CGTTGTTCTC CCGCTTAATT ACCGTTATTG CCTGCTTTTT TATCTTCTCT 
GCCGCATGGT TTTGCCTGTG GAGTATCAGC CTGCACCTGG TTGAGCGCCC TGATATGGCG
GTGCTGTTAT TTCCGTTTGG TCTGCGGCTA GGGCTAATGC TGCAATGCCC GCGCGGATAC
TGGCCGGTGC TGCTGGGAGC GGAGTGGCTG CTGATTTACT GGCTAACGCA GGCGGTCGGT
TTAACCCATT TCTCCTTATT GATGATCGGT AGTTTACTGA CGTTACTGCC CGTGGCGCTG
ATCTCGCGCT ATCGCCATCA ACGTGACTGG CGCACCTTGC TGTTACAGGG TGCGGCGTTA
ACGGCGGCGG CGTTGTTGCA GTCGCTGCCC TGGCTTTGGC ACGGCAAAGA GTCGTGGAAT
GCGCTGCTGT TAACTTTAAC TGGCGGCCTG ACGCTGGCCC CGATATGTCT GGTGTTCTGG
CATTATCTCG CCAATAACAC CTGGCTGCCG CTCGGGCCGT CGTTAGTTTC TCAACCGATC
AACTGGCGCG GGCGGCATCT GGTCTGGTAC TTGCTGCTGT TTGTTATCAG TCTCTGGCTC
CAGTTGGGCT TGCCGGACGA ACTGTCGCGC TTTACGCCAT TCTGCCTGGC GCTGCCGATT
ATCGCGCTGG CCTGGCACTA CGGCTGGCAA GGGGCGCTGA TAGCGACATT GATGAACGCC
ATCGCGCTGA TCGCCAGTCA AACCTGGCGC GATCATCCTG TGGATTTATT GCTCTCGCTG
CTGGTGCAAA GTCTGACAGG GTTGTTACTG GGCGCTGGCA TCCAGCGGTT GCGTGAACTT
AACCAGTCGC TGCAAAAGGA ACTGGCGCGC AATCAGCATC TCGCTGAACG TTTGTTAGAA
ACTGAAGAGA GCGTGCGCCG TGATGTGGCG CGTGAGCTGC ATGATGATAT CGGCCAGACC
ATCACTGCCA TTCGTACTCA AGCGGGCATT GTCCAGCGAC TGGCGGCAGA TAACGCCAGT
GTGAAGCAGA GCGGGCAGCT CATCGAACAA CTGTCGCTGG GCGTGTACGA CGCGGTGCGC
CGTTTGTTAG GGCGGTTACG TCCGCGCCAG TTGGATGATC TCACCCTAGA GCAGGCCATC
CGCTCACTGA TGCGGGAAAT GGAGCTGGAA GGGCGCGGCA TTGTCAGCCA TCTCGAATGG
CGAATCGAAG AGTCAGCGTT AAGCGAAAAC CAGCGCGTGA CGCTGTTTCG TGTCTGCCAG
GAAGGGCTGA ATAACATTGT GAAACATGCT GATGCCAGCG CGGTCACCCT GCAAGGCTGG
CTGCAGGATG AACGGTTGAT GCTGGTGATT GAAGACGATG GCAGCGGTTT ACCGCCGGGT
TCCGGGCAAC AAGGTTTTGG CCTCACCGGA ATGCGCGAGC GCGTAACGGC GCTGGGCGGC
ACCTTGACCA TTTCCTGTCT GCACGGCACG CGTGTCAGCG TTTCTCTACC TCAACGCTAT
GTCTAA
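
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before computing statistics such as GC content. A minimal sketch follows, assuming GC content is defined as the number of G and C bases divided by the total sequence length. The helper name `gc_fraction` is hypothetical, and only the first line of the sequence is used here for brevity.

```python
def gc_fraction(seq):
    """Fraction of G/C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()  # drop block separators and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First printed line of the gene sequence (60 bases in six blocks of 10).
first_line = "GTGATGAAGA CGTTGTTCTC CCGCTTAATT ACCGTTATTG CCTGCTTTTT TATCTTCTCT"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1506 bp sequence to the same function would give a value to compare with the reported GC content field.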
 
Protein sequence
MMKTLFSRLI TVIACFFIFS AAWFCLWSIS LHLVERPDMA VLLFPFGLRL GLMLQCPRGY 
WPVLLGAEWL LIYWLTQAVG LTHFSLLMIG SLLTLLPVAL ISRYRHQRDW RTLLLQGAAL
TAAALLQSLP WLWHGKESWN ALLLTLTGGL TLAPICLVFW HYLANNTWLP LGPSLVSQPI
NWRGRHLVWY LLLFVISLWL QLGLPDELSR FTPFCLALPI IALAWHYGWQ GALIATLMNA
IALIASQTWR DHPVDLLLSL LVQSLTGLLL GAGIQRLREL NQSLQKELAR NQHLAERLLE
TEESVRRDVA RELHDDIGQT ITAIRTQAGI VQRLAADNAS VKQSGQLIEQ LSLGVYDAVR
RLLGRLRPRQ LDDLTLEQAI RSLMREMELE GRGIVSHLEW RIEESALSEN QRVTLFRVCQ
EGLNNIVKHA DASAVTLQGW LQDERLMLVI EDDGSGLPPG SGQQGFGLTG MRERVTALGG
TLTISCLHGT RVSVSLPQRY V