Gene EcHS_A3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3881 
SymboluhpB 
ID5592919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3878216 
End bp3879709 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content57% 
IMG OID640922991 
Productsensory histidine kinase UhpB 
Protein accessionYP_001460468 
Protein GI157163150 
COG category[T] Signal transduction mechanisms 
COG ID[COG3851] Signal transduction histidine kinase, glucose-6-phosphate specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones63 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTCTCCC GCTTAATTAC CGTTATTGCC TGCTTTTTTA TCTTCTCTGC CGCATGGTTT 
TGCCTGTGGA GTATCAGCCT GCATCTGGTT GAGCGCCCTG ATATGGCGGT GCTGTTATTT
CCGTTTGGTC TGCGTCTGGG GCTAATGCTG CAATGCCCGC GCGGATACTG GCCGGTGCTG
CTGGGCGCGG AGTGGCTGCT GATTTACTGG CTAACGCAGG CGGTCGGTTT AACCCATTTT
CCGTTATTGA TGATCGGTAG TTTACTGACG TTACTGCCCG TAGCGCTGAT CTCGCGCTAT
CGCCATCAGC GTGACTGGCG CACCTTGCTG TTACAGGGGG CGGCGTTAAC GGCGGCGGCG
TTGTTGCAGT CGCTGCCCTG GCTTTGGCAC GGCAAAGAGT CGTGGAATGC GCTGTTGCTG
ACTTTAACTG GCGGCCTGAC GCTGGCCCCG ATATGTCTGG TGTTCTGGCA CTATCTCGCC
AATAACACCT GGCTGCCGCT CGGCCCGTCA CTGGTTTCTC AGCCAATCAA CTGGCGCGGG
CGACATCTGG TCTGGTACTT GCTGCTGTTT GTTATCAGTC TCTGGCTCCA GTTGGGATTG
CCGGACGAAC TGTCGCGCTT TACGCCATTC TGTCTGGCGC TGCCGATTAT CGCGCTGGCC
TGGCACTATG GTTGGCAAGG GGCGCTGATT GCGACGTTGA TGAACGCCAT CGCGCTGATC
GCCAGTCAAA CCTGGCGCGA TCATCCGGTG GATTTATTGC TCTCGCTGCT GGTGCAAAGT
CTGACAGGGT TGTTGCTTGG CGCTGGCATC CAGCGGTTGC GTGAACTTAA CCAGTCGCTG
CAAAAGGAAC TGGCGCGCAA TCAGCATCTG GCTGAACGGT TGCTGGAAAC CGAAGAGAGC
GTGCGCCGTG ATGTGGCGCG TGAGCTGCAT GATGATATCG GTCAGACCAT CACTGCTATT
CGTACTCAGG CGGGCATTGT TCAGCGGCTG GCGGCAGATA ACGCCAGCGT GAAGCAGAGC
GGGCAGCTCA TCGAACAACT ATCGCTGGGC GTTTACGACG CGGTGCGCCG TTTGTTGGGT
CGGTTACGTC CGCGCCAGTT GGATGATCTC ACCCTGGAGC AGGCCATCCG CTCACTGATG
CGGGAAATGG AGCTGGAAGG GCGCGGTATT GTCAGCCATC TCGAATGGCG AATCGATGAA
TCAGCGTTAA GCGAAAACCA GCGCGTGACG CTGTTTCGTG TCTGCCAGGA AGGGCTGAAC
AACATTGTGA AACATGCTGA TGCCAGCGCG GTCACCCTGC AAGGCTGGCA GCAGGATGAA
CGGTTGATGC TGGTTATTGA AGACGATGGC AGCGGTTTGC CGCCGGGTTC CGGGCAACAA
GGTTTTGGCC TCACCGGAAT GCGCGAGCGC GTAACGGCGC TGGGTGGCAC ATTACACATT
TCCTGTCTGC ACGGCACGCG TGTCAGCGTT TCTCTACCTC AACGCTATGT CTAA
 
Protein sequence
MFSRLITVIA CFFIFSAAWF CLWSISLHLV ERPDMAVLLF PFGLRLGLML QCPRGYWPVL 
LGAEWLLIYW LTQAVGLTHF PLLMIGSLLT LLPVALISRY RHQRDWRTLL LQGAALTAAA
LLQSLPWLWH GKESWNALLL TLTGGLTLAP ICLVFWHYLA NNTWLPLGPS LVSQPINWRG
RHLVWYLLLF VISLWLQLGL PDELSRFTPF CLALPIIALA WHYGWQGALI ATLMNAIALI
ASQTWRDHPV DLLLSLLVQS LTGLLLGAGI QRLRELNQSL QKELARNQHL AERLLETEES
VRRDVARELH DDIGQTITAI RTQAGIVQRL AADNASVKQS GQLIEQLSLG VYDAVRRLLG
RLRPRQLDDL TLEQAIRSLM REMELEGRGI VSHLEWRIDE SALSENQRVT LFRVCQEGLN
NIVKHADASA VTLQGWQQDE RLMLVIEDDG SGLPPGSGQQ GFGLTGMRER VTALGGTLHI
SCLHGTRVSV SLPQRYV