Gene ECH74115_5639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5639 
SymboldcuS 
ID6969710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5279681 
End bp5281312 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content50% 
IMG OID643389273 
Productsensory histidine kinase DcuS 
Protein accessionYP_002273670 
Protein GI209397333 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATT CATTGCCCTA CCGCATGTTA CGCAAACGTC CGATGAAATT GAGTACCACA 
GTGATCTTAA TGGTCAGCGC GGTACTGTTC TCGGTGCTAT TGGTGGTGCA TCTGATTTAC
TTCTCGCAAA TCAGTGATAT GACGCGAGAC GGGCTAGCCA ACAAGGCACT GGCAGTGGCG
CGTACTCTCG CCGACTCGCC GGAAATCCGT CAGGGCTTGC AGAAAAAACC GCAGGAGAGC
GGCATCCAGG CCATCGCGGA AGCCGTGCGC AAACGCAACG ATCTGCTGTT TATTGTCGTT
ACCGATATGC AAAGTCTTCG CTACTCGCAT CCTGAAGCCC AGCGTATTGG TCAGCCATTT
AAAGGTGATG ACATCCTTAA AGCGCTGAAT GGCGAAGAAA ATGTCGCTAT CAATCGCGGT
TTTCTGGCGC AGGCTTTACG CGTATTTACC CCCATCTACG ATGAAAATCA TAAACAAATT
GGCGTGGTGG CGATCGGCCT TGAGTTAAGC CGCGTGACCC AACAGATCAA TGACAGTCGC
TGGAGCATTA TCTGGTCGGT ATTATTTGGC ATGCTGGTCG GACTGATTGG CACCTGCATT
CTGGTTAAGG TACTGAAAAA AATCCTTTTC GGCCTGGAAC CCTACGAAAT CTCCACGCTG
TTTGAGCAAC GCCAGGCCAT GTTGCAGTCT ATCAAAGAAG GCGTCGTTGC CGTGGACGAT
CGCGGCGAGG TCACGCTGAT CAACGATGCC GCACAAGAAT TGCTGAATTA CCGTAAGTCG
CAGGACGATG AGAAACTGTC GACGCTCAGC CACTCATGGT CACAGGTGGT AGATGTCTCG
GAAGTGTTAC GCGACGGTAC GCCGCGCCGC GACGAAGAGA TTACGATTAA AGACCGGCTA
TTACTGATCA ACACCGTTCC GGTGCGCAGT AATGGCGTTA TCATCGGTGC CATTTCAACC
TTCAGGGACA AAACTGAAGT TCGTAAACTG ATGCAGCGAC TCGACGGTCT GGTCAACTAT
GCTGATGCAC TTCGTGAACG ATCCCACGAA TTTATGAATA AATTGCATGT GATTCTCGGA
TTATTGCATC TGAAGAGTTA TAAGCAGTTG GAAGATTACA TTCTCAAAAC AGCCAATAAC
TATCAGGAAG AGATTGGCTC TCTGCTGGGT AAGATCAAAT CTCCGGTTAT CGCTGGTTTT
TTAATCAGCA AGATTAACCG CGCGACCGAT TTAGGCCATA CGCTGATTTT AAACAGTGAA
AGCCAGCTGC CAGACAGCGG CAGTGAGGAC CAGGTCGCGA CGCTGATTAC CACATTGGGA
AATCTGATAG AAAACGCGCT GGAAGCATTA GGGCCGGAAC CCGGTGGCGA AATTAGCGTA
ACATTGCACT ACCGTCACGG CTGGCTGCAC TGCGAAGTTA ACGATGATGG ACCGGGGATC
GCACCCGACA AAATCGATCA CATTTTTGAC AAAGGTGTCT CGACAAAAGG AAGCGAGCGA
GGCGTCGGTT TAGCACTTGT CAAACAACAG GTAGAAAATC TCGGCGGCAG CATCGCCGTG
GAATCGGAAC CCGGGATTTT CACACAATTT TTTGTCCAGA TACCCTGGGA CGGGGAGAGG
TCGAACAGAT GA
 
Protein sequence
MRHSLPYRML RKRPMKLSTT VILMVSAVLF SVLLVVHLIY FSQISDMTRD GLANKALAVA 
RTLADSPEIR QGLQKKPQES GIQAIAEAVR KRNDLLFIVV TDMQSLRYSH PEAQRIGQPF
KGDDILKALN GEENVAINRG FLAQALRVFT PIYDENHKQI GVVAIGLELS RVTQQINDSR
WSIIWSVLFG MLVGLIGTCI LVKVLKKILF GLEPYEISTL FEQRQAMLQS IKEGVVAVDD
RGEVTLINDA AQELLNYRKS QDDEKLSTLS HSWSQVVDVS EVLRDGTPRR DEEITIKDRL
LLINTVPVRS NGVIIGAIST FRDKTEVRKL MQRLDGLVNY ADALRERSHE FMNKLHVILG
LLHLKSYKQL EDYILKTANN YQEEIGSLLG KIKSPVIAGF LISKINRATD LGHTLILNSE
SQLPDSGSED QVATLITTLG NLIENALEAL GPEPGGEISV TLHYRHGWLH CEVNDDGPGI
APDKIDHIFD KGVSTKGSER GVGLALVKQQ VENLGGSIAV ESEPGIFTQF FVQIPWDGER
SNR