Gene EcolC_0031 details


Gene Information

Locus tag: EcolC_0031
Symbol:
ID: 6068480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 31230
End bp: 32732
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 57%
IMG OID: 641599435
Product: sensory histidine kinase UhpB
Protein accession: YP_001723045
Protein GI: 170018091
COG category: [T] Signal transduction mechanisms
COG ID: [COG3851] Signal transduction histidine kinase, glucose-6-phosphate specific
TIGRFAM ID:
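
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (which matches the stated gene length) and that the unspliced CDS ends with one stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(31230, 32732)
print(length, protein_length(length))  # 1503 500
```

Both values agree with the Gene Length and Protein Length fields of this record.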


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACGT TGTTCTCCCG CTTAATTACC GTTATTGCCT GCTTTTTTAT CTTCTCTGCC 
GCATGGTTTT GCCTGTGGAG TATCAGCCTG CATCTGGTTG AGCGCCCTGA TATGGCGGTG
CTGTTATTTC CGTTTGGTCT GCGTCTGGGG CTAATGCTGC AATGCCCGCG CGGATACTGG
CCGGTGCTGC TGGGCGCGGA GTGGCTGCTG ATTTACTGGC TAACGCAGGC GGTCGGTTTA
ACCCATTTTC CGTTATTGAT GATCGGTAGT TTACTGACGT TACTGCCCGT AGCGCTGATC
TCGCGCTATC GCCATCAGCG TGACTGGCGC ACCTTGCTGT TACAGGGGGC GGCGTTAACG
GCGGCGGCGT TGTTGCAGTC GCTGCCCTGG CTTTGGCACG GCAAAGAGTC GTGGAATGCG
CTGTTGCTGA CTTTAACTGG CGGCCTGACG CTGGCCCCGA TATGTCTGGT GTTCTGGCAC
TATCTCGCCA ATAACACCTG GCTGCCGCTC GGCCCGTCAC TGGTTTCTCA GCCAATCAAC
TGGCGCGGGC GACATCTGGT CTGGTACTTG CTGCTGTTTG TTATCAGTCT CTGGCTCCAG
TTGGGATTGC CGGACGAACT GTCGCGCTTT ACGCCATTCT GTCTGGCGCT GCCGATTATC
GCGCTGGCCT GGCACTATGG TTGGCAAGGG GCGCTGATTG CGACGTTGAT GAACGCCATC
GCGCTGATCG CCAGTCAAAC CTGGCGCGAT CATCCGGTGG ATTTATTGCT CTCGCTGCTG
GTGCAAAGTC TGACAGGGTT GTTGCTTGGC GCTGGCATCC AGCGGTTGCG TGAACTTAAC
CAGTCGCTGC AAAAGGAACT GGCGCGCAAT CAGCATCTGG CTGAACGGTT GCTGGAAACC
GAAGAGAGCG TGCGCCGTGA TGTGGCGCGT GAGCTGCATG ATGATATCGG TCAGACCATC
ACTGCTATTC GTACTCAGGC GGGCATTGTT CAGCGGCTGG CGGCAGATAA CGCCAGCGTG
AAGCAGAGCG GGCAGCTCAT CGAACAACTA TCGCTGGGCG TTTACGACGC GGTGCGCCGT
TTGTTGGGTC GGTTACGTCC GCGCCAGTTG GATGATCTCA CCCTGGAGCA GGCCATCCGC
TCACTGATGC GGGAAATGGA GCTGGAAGGG CGCGGTATTG TCAGCCATCT CGAATGGCGA
ATCGATGAAT CAGCGTTAAG CGAAAACCAG CGCGTGACGC TGTTTCGTGT CTGCCAGGAA
GGGCTGAACA ACATTGTGAA ACATGCTGAT GCCAGCGCGG TCACCCTGCA AGGCTGGCAG
CAGGATGAAC GGTTGATGCT GGTTATTGAA GACGATGGCA GCGGTTTGCC GCCGGGTTCC
GGGCAACAAG GTTTTGGCCT CACCGGAATG CGCGAGCGCG TAACGGCGCT GGGTGGCACA
TTACACATTT CCTGTCTGCA CGGCACGCGT GTCAGCGTTT CTCTACCTCA ACGTTATGTC
TAA
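
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be cleaned before its length or GC content can be compared with the fields of this record. The sketch below shows one way to do that, using hypothetical helper names. Only the first row of the sequence is used here as a demonstration, so the printed GC fraction applies to that row alone, not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence; paste all rows to check the full gene.
first_row = "ATGAAGACGT TGTTCTCCCG CTTAATTACC GTTATTGCCT GCTTTTTTAT CTTCTCTGCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the complete sequence, `len(seq)` should equal the stated gene length of 1503 bp.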
 
Protein sequence
MKTLFSRLIT VIACFFIFSA AWFCLWSISL HLVERPDMAV LLFPFGLRLG LMLQCPRGYW 
PVLLGAEWLL IYWLTQAVGL THFPLLMIGS LLTLLPVALI SRYRHQRDWR TLLLQGAALT
AAALLQSLPW LWHGKESWNA LLLTLTGGLT LAPICLVFWH YLANNTWLPL GPSLVSQPIN
WRGRHLVWYL LLFVISLWLQ LGLPDELSRF TPFCLALPII ALAWHYGWQG ALIATLMNAI
ALIASQTWRD HPVDLLLSLL VQSLTGLLLG AGIQRLRELN QSLQKELARN QHLAERLLET
EESVRRDVAR ELHDDIGQTI TAIRTQAGIV QRLAADNASV KQSGQLIEQL SLGVYDAVRR
LLGRLRPRQL DDLTLEQAIR SLMREMELEG RGIVSHLEWR IDESALSENQ RVTLFRVCQE
GLNNIVKHAD ASAVTLQGWQ QDERLMLVIE DDGSGLPPGS GQQGFGLTGM RERVTALGGT
LHISCLHGTR VSVSLPQRYV
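
The protein sequence can be reproduced from the gene sequence by codon translation. This is a minimal sketch, assuming the standard amino acid assignments, which translation table 11 shares with the standard code. Table 11 differs mainly in the alternative start codons it allows, and those are not handled here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAGACGT TGTTCTCCCG CTTAATTACC"))  # MKTLFSRLIT
```

The output matches the first block of the protein sequence above.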