Gene EcHS_A4366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4366 
SymboldcuS 
ID5592118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4374053 
End bp4375684 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content50% 
IMG OID640923464 
Productsensory histidine kinase DcuS 
Protein accessionYP_001460909 
Protein GI157163591 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.206477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACATT CATTGCCCTA CCGCATGTTA CGCAAACGTC CGATGAAATT GAGTACCACA 
GTGATCTTAA TGGTCAGCGC GGTACTGTTC TCGGTGCTAT TGGTGGTGCA TCTGATTTAC
TTCTCGCAAA TCAGTGATAT GACGCGAGAC GGGCTGGCCA ACAAGGCACT GGCAGTGGCG
CGTACCCTCG CCGACTCGCC GGAAATCCGT CAGGGCTTGC AGAAAAAACC GCAGGAGAGC
GGAATCCAGG CCATCGCGGA AGCCGTGCGC AAACGCAACG ATCTGCTGTT TATTGTCGTT
ACCGATATGC AAAGTCTTCG CTACTCGCAT CCTGAAGCCC AGCGTATTGG TCAGCCATTT
AAAGGTGATG ACATCCTTAA AGCGCTGAAT GGCAAAGAAA ATGTCGCTAT CAATCGCGGT
TTTCTGGCGC AGGCTTTACG CGTATTTACC CCCATCTACG ATGAAAATCA TAAACAAATT
GGCGTGGTGG CGATCGGCCT TGAGTTAAGC CGCGTGACCC AACAAATCAA TGACAGTCGC
TGGAGCATTA TCTGGTCGGT ATTATTTGGC ATGCTGGTCG GACTGATTGG CACCTGCATT
CTGGTTAAGG TACTGAAAAA AATCCTTTTC GGCCTGGAAC CCTACGAAAT CTCCACGCTG
TTTGAGCAGC GCCAGGCCAT GTTGCAGTCT ATCAAAGAAG GCGTCGTTGC CGTGGACGAT
CGCGGCGAGG TCACGCTGAT CAACGATGCC GCACAAGAAT TGCTGAATTA CCGTAAGTCG
CAGGACGATG AGAAACTGTC GACGCTAAGC CACTCATGGT CACAGGTGGT AGATGTCTCG
GAAGTGTTAC GCGACGGTAC GCCGCGCCGC GACGAAGAGA TTACGATTAA AGACCGGCTA
TTACTGATCA ACACCGTTCC GGTGCGCAGT AATGGCGTTA TCATCGGTGC CATTTCAACC
TTCAGGGACA AAACTGAAGT TCGTAAACTG ATGCAGCGAC TCGACGGTCT GGTCAACTAT
GCTGATGCAC TTCGTGAACG ATCCCACGAA TTTATGAATA AATTGCATGT GATTCTCGGA
TTATTGCATC TGAAGAGTTA TAAGCAGTTG GAAGATTACA TTCTCAAAAC AGCCAATAAC
TATCAGGAAG AGATTGGCTC TCTGCTGGGT AAGATCAAAT CTCCGGTTAT CGCTGGTTTT
TTAATCAGCA AGATTAACCG CGCGACCGAT TTAGGCCATA CGCTGATTTT AAACAGTGAA
AGCCTGCTGC CAGACAGCGG CAGTGAGGAC CAGGTCGCGA CGCTGATTAC CACATTGGGA
AATCTGATAG AAAACGCGCT GGAAGCATTA GGGCCGGAAC CCGGTGGCGA AATTAGCGTA
ACATTGCACT ACCGTCACGG CTGGCTGCAC TGCGAAGTTA ACGATGATGG ACCGGGGATC
GCACCCGACA AAATCGATCA CATTTTTGAC AAAGGTGTCT CGACAAAAGG AAGCGAGCGA
GGCGTCGGTT TAGCACTTGT CAAACAACAG GTAGAAAATC TCGGCGGCAG CATCGCCGTG
GAATCGGAAC CCGGGATTTT CACACAATTT TTTGTCCAGA TACCCTGGGA CGGGGAGAGG
TCGAACAGAT GA
 
Protein sequence
MRHSLPYRML RKRPMKLSTT VILMVSAVLF SVLLVVHLIY FSQISDMTRD GLANKALAVA 
RTLADSPEIR QGLQKKPQES GIQAIAEAVR KRNDLLFIVV TDMQSLRYSH PEAQRIGQPF
KGDDILKALN GKENVAINRG FLAQALRVFT PIYDENHKQI GVVAIGLELS RVTQQINDSR
WSIIWSVLFG MLVGLIGTCI LVKVLKKILF GLEPYEISTL FEQRQAMLQS IKEGVVAVDD
RGEVTLINDA AQELLNYRKS QDDEKLSTLS HSWSQVVDVS EVLRDGTPRR DEEITIKDRL
LLINTVPVRS NGVIIGAIST FRDKTEVRKL MQRLDGLVNY ADALRERSHE FMNKLHVILG
LLHLKSYKQL EDYILKTANN YQEEIGSLLG KIKSPVIAGF LISKINRATD LGHTLILNSE
SLLPDSGSED QVATLITTLG NLIENALEAL GPEPGGEISV TLHYRHGWLH CEVNDDGPGI
APDKIDHIFD KGVSTKGSER GVGLALVKQQ VENLGGSIAV ESEPGIFTQF FVQIPWDGER
SNR