Gene EcSMS35_4592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4592 
SymboldcuS 
ID6146536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4694995 
End bp4696626 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content50% 
IMG OID641619408 
Productsensory histidine kinase DcuS 
Protein accessionYP_001746520 
Protein GI170682025 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATT CATTGCCCTA CCGTATGTTA CGCAAACGTC CGATGAAATT GAGTACCACT 
GTGATCTTAA TGGTCAGCGC GGTACTGTTC TCGGTGCTAT TGGTGGTGCA TCTGATTTAC
TTCTCGCAAA TCAGTGATAT GACGCGAGAC GGGCTAGCTA ACAAGGCACT GGCAGTAGCG
CGTACCCTCG CCGACTCGCC GGAAATCCGG CAGGGCTTGC AGAAAAAACC GCAGGAGAGC
GGCATCCAGG CCATCGCGGA AGCCGTGCGC AAACGCAACG ATCTGCTATT TATTGTCGTT
ACCGATATGC AAAGTCTTCG CTACTCGCAT CCTGAAGCCC AGCGTATTGG TCAGCCATTT
AAAGGTGATG ACATCCTTAA AGCGCTGAAT GGCGAAGAAA ATGTCGCTAT CAATCGCGGT
TTTCTGGCGC AGGCTTTACG CGTATTTACC CCCATCTACG ATGAAAATCA TAAACAAATT
GGCGTGGTGG CGATCGGCCT TGAGTTAAGT CGCGTGACCC AACAGATCAA TGACAGTCGC
TGGAGCATCA TCTGGTCGGT ATTATTTGGC ATGCTGGTCG GGCTGATTGG CACCTGCATT
CTGGTTAAGG TACTGAAAAA AATCCTTTTC GGCCTGGAAC CCTACGAAAT CTCCACGCTG
TTTGAGCAAC GCCAGGCCAT GTTGCAGTCT ATCAAAGAAG GCGTCGTTGC CGTGGACGAT
CGCGGCGAAG TCACGCTGAT CAACGATGCC GCACAAGAAT TGCTGAATTA CCGTAAGTCG
CAGGACGATG AGAAGCTGTC GACGCTAAGC CACGCGTGGT CACAGGTAGT AGATGTCTCG
GAAGTGTTAC GCGACGGTAC CCCGCGCCGC GACGAAGAGA TTACGATTAA AGACCGGCTA
TTACTGATCA ACACCGTTCC GGTGCGCAGT AATGGCGTTA TCATCGGTGC CATTTCAACC
TTCAGGGACA AAACTGAAGT ACGTAAACTG ATGCAGCGAC TCGATGGTCT GGTCAACTAT
GCTGACGCAC TTCGTGAACG ATCCCACGAA TTTATGAATA AATTGCACGT GATTCTCGGA
TTATTGCATC TGAAGAGTTA TAAGCAGTTG GAAGATTACA TTCTCAAAAC AGCCAATAAC
TATCAGGAAG AGATTGGCTC TCTGCTGGGT AAGATCAAAT CTCCGGTTAT CGCTGGTTTT
TTAATCAGCA AGATTAACCG CGCGACCGAT TTAGGCCATA CGCTGATTTT AAACAGTGAA
AGCCAGCTGC CAGACAGCGG TAGTGAGGAT CAGGTCGCGA CGCTGATTAC CACGTTGGGA
AATCTGATAG AAAACGCGCT GGAGGCATTA GGGCCGGAAC CCGGAGGCGA AATTAGCGTA
ACATTGCACT ACCGTCACGG CTGGCTGCAC TGTGAAGTTA ACGATGATGG ACCGGGGATC
GCACCCGACA AAATCGATCA CATTTTTGAC AAAGGTGTCT CGACAAAAGG AAGCGAGCGA
GGCGTCGGTT TAGCACTTGT CAAACAACAG GTAGAAAATC TCGGCGGCAG CATCGCCGTG
GAATCGGAAC CCGGGATTTT CACACAATTT TTTGTCCAGA TACCCTGGGA CGGGGAGAGG
TCGAACAGAT GA
 
Protein sequence
MRHSLPYRML RKRPMKLSTT VILMVSAVLF SVLLVVHLIY FSQISDMTRD GLANKALAVA 
RTLADSPEIR QGLQKKPQES GIQAIAEAVR KRNDLLFIVV TDMQSLRYSH PEAQRIGQPF
KGDDILKALN GEENVAINRG FLAQALRVFT PIYDENHKQI GVVAIGLELS RVTQQINDSR
WSIIWSVLFG MLVGLIGTCI LVKVLKKILF GLEPYEISTL FEQRQAMLQS IKEGVVAVDD
RGEVTLINDA AQELLNYRKS QDDEKLSTLS HAWSQVVDVS EVLRDGTPRR DEEITIKDRL
LLINTVPVRS NGVIIGAIST FRDKTEVRKL MQRLDGLVNY ADALRERSHE FMNKLHVILG
LLHLKSYKQL EDYILKTANN YQEEIGSLLG KIKSPVIAGF LISKINRATD LGHTLILNSE
SQLPDSGSED QVATLITTLG NLIENALEAL GPEPGGEISV TLHYRHGWLH CEVNDDGPGI
APDKIDHIFD KGVSTKGSER GVGLALVKQQ VENLGGSIAV ESEPGIFTQF FVQIPWDGER
SNR