Gene EcSMS35_4033 details

Gene Information

Locus tag: EcSMS35_4033
Symbol: uhpB
ID: 6144031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4122073
End bp: 4123575
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 57%
IMG OID: 641618858
Product: sensory histidine kinase UhpB
Protein accession: YP_001745996
Protein GI: 170679726
COG category: [T] Signal transduction mechanisms
COG ID: [COG3851] Signal transduction histidine kinase, glucose-6-phosphate specific
TIGRFAM ID:
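
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up here, and it assumes 1-based inclusive coordinates and a single terminal stop codon, neither of which the record states explicitly.

```python
# Values copied from the Gene Information fields above.
start_bp = 4122073
end_bp = 4123575

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1503
print(protein_length)  # 500
```

Under those assumptions the computed values agree with the reported Gene Length (1503 bp) and Protein Length (500 aa).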


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.406028
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACGT TTTTCTCTCG CTTAATTACC GTTATTGCCT GCTTTTTTAT CTTCTCTGCC 
GCGTGGTTTT GCTTGTGGAG TATCAGCCTG CACCTGGTTG AGCGCCCTGA TATGGCGGTG
CTGTTATTTC CGTTTGGTCT GCGTCTGGGG CTAATGCTGC AATGCCCGCG CGGCTACTGG
CCGGTGCTGC TGGGAGCGGA GTGGCTGCTG ATTTACTGGC TAACGCAGGC GGTCGGTTTA
ACCCATTTCT CCTTATTGAT GATCGGTAGT TTACTGACGT TATTGCCCGT GGCGCTTATC
TCGCGCTATC GCCATCAACG TGACTGGCGC ACCCTGCTGT TACAGGGGGC AGCACTGACG
GCGGCGGCGT TGTTGCAGTC GCTGCCCTGG CTTTGGCACG GGAAAGAGTC GTGGAATGCG
CTGTTGCTGA CTTTAACTGG CGGCCTGACG CTGGCCCCGA TATGTCTGGT GTTCTGGCAC
TATCTCGCCA ATAACATCTG GCTGCCGCTC GGGCCGTCGT TAGTTTCTCA ACCGATCAAC
TGGCGCGGGC GGCATCTGGT CTGGTACTTG CTGCTGTTTG TTATCAGTCT CTGGCTCCAG
TTGGGATTGC CGGACGAACT GTCGCGCTTT ACGCCATTCT GCCTGGCGCT GCCGATTATC
GCGCTGGCCT GGCACTACGG CTGGCAAGGG GCGCTGATTG CGACGTTGAT GAACGCCATC
GCGCTGATCG CCAGTCAAAC CTGGCGCGAT CATTCGGTGG ATTTATTGCT CTCGCTGCTG
GTGCAAAGTC TGACAGGGTT GTTACTGGGC GCAGGCATCC AGCGGTTGCG TGAACTTAAC
CAGTCGCTGC AAAAGGAACT AGCGCGCAAT CAGCATCTGG CTGAACGTTT GTTAGAAACT
GAAGAGAGCG TGCGCCGTGA TGTGGCGCGT GAGCTGCACG ACGATATCGG TCAGACCATC
ACTGCTATTC GTACTCAAGC AGGTATTGTT CAGCGGCTGG CGGCAGATAA CGCCAGCGTG
AAGCAGAGCG GGCAGCTCAT CGAACACCTG TCGCTGGGTG TTTACGATGC GGTGCGCCGT
TTGTTAGGGC GGTTACGTCC GCGCCAGCTG GATGATCTCA CCCTGGAGCA GGCCATCCGC
TCACTGATGC GGGAAATGGA GCTGGAAGGG CGCGGCATTG TCAGTCATCT CGAATGGCGA
ATCGATGAAT CAGCGTTAAG CGAAAACCAG CGCGTGACGC TATTTCGCGT CTGCCAGGAA
GGGTTGAACA ACATTGTGAA ACATGCCGAT GCCAGCGCGG TCACGCTGCA AGGCTGGCAG
CAGGATGAGC GGTTAATGCT GGTGATTGAA GACGATGGCA GCGGTTTACC GCCAGATTCC
GGGCAACACG GTTTTGGCCT TACCGGAATG CGCGAGCGCG TAACGGCGCT GGGCGGCACA
TTGACCATTT CCTGTCTGCA CGGCACGCGT GTCAGCGTTT CTCTACCTCA ACGTTATGTC
TAA
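
The GC content field is presumably the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the record does not say how the database rounds the value.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# The first 10-base block of the gene sequence above.
print(gc_content("ATGAAGACGT"))  # 0.4
```

Passing the whole gene sequence, with or without the 10-base spacing, would give the value to compare against the reported 57%.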
 
Protein sequence
MKTFFSRLIT VIACFFIFSA AWFCLWSISL HLVERPDMAV LLFPFGLRLG LMLQCPRGYW 
PVLLGAEWLL IYWLTQAVGL THFSLLMIGS LLTLLPVALI SRYRHQRDWR TLLLQGAALT
AAALLQSLPW LWHGKESWNA LLLTLTGGLT LAPICLVFWH YLANNIWLPL GPSLVSQPIN
WRGRHLVWYL LLFVISLWLQ LGLPDELSRF TPFCLALPII ALAWHYGWQG ALIATLMNAI
ALIASQTWRD HSVDLLLSLL VQSLTGLLLG AGIQRLRELN QSLQKELARN QHLAERLLET
EESVRRDVAR ELHDDIGQTI TAIRTQAGIV QRLAADNASV KQSGQLIEHL SLGVYDAVRR
LLGRLRPRQL DDLTLEQAIR SLMREMELEG RGIVSHLEWR IDESALSENQ RVTLFRVCQE
GLNNIVKHAD ASAVTLQGWQ QDERLMLVIE DDGSGLPPDS GQHGFGLTGM RERVTALGGT
LTISCLHGTR VSVSLPQRYV
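
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the codon assignments of translation table 11, which match the standard genetic code for all 64 codons. The names `translate` and `CODON_TABLE` are illustrative and not part of the source database.

```python
# Compact codon table in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    # Drop the spaces and line breaks used in the 10-base block layout.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAGACGT TTTTCTCTCG CTTAATTACC GTTATTGCCT GCTTTTTTAT CTTCTCTGCC"
print(translate(first_line))  # MKTFFSRLITVIACFFIFSA
```

The output matches the first 20 residues of the protein sequence. The final codons of the gene, `CGT TAT GTC TAA`, translate to `RYV` followed by a stop, which agrees with the end of the protein.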