Gene EcE24377A_4679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4679 
SymboldcuS 
ID5590515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4681432 
End bp4683063 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content50% 
IMG OID640928291 
Productsensory histidine kinase DcuS 
Protein accessionYP_001465623 
Protein GI157154941 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACATT CATTGCCCTA CCGCATGTTA CGCAAACGTC CGATGAAATT GAGTACCACA 
GTGATCTTAA TGGTCAGCGC GGTACTGTTC TCGGTGCTAT TGGTGGTGCA TCTGATTTAC
TTCTCGCAAA TCAGTGATAT GACGCGAGAC GGGCTGGCCA ACAAGGCACT GGCAGTGGCG
CGTACCCTCG CCGACTCGCC GGAAATCCGT CAGGGCTTGC AGAAAAAACC GCAGGAGAGC
GGAATCCAGG CCATCGCGGA AGCCGTGCGC AAACGCAACG ATCTGCTGTT TATTGTCGTT
ACCGATATGC AAAGTCTTCG CTACTCGCAT CCTGAAGCCC AGCGTATTGG TCAGCCATTT
AAAGGTGATG ACATCCTTAA AGCGCTGAAT GGCAAAGAAA ATGTCGCTAT CAATCGCGGT
TTTCTGGCGC AGGCTTTACG CGTATTTACC CCCATCTACG ATGAAAATCA TAAACAAATT
GGCGTGGTGG CGATCGGCCT TGAGTTAAGC CGCGTGACCC AACAAATCAA TGACAGTCGC
TGGAGCATTA TCTGGTCGGT ATTATTTGGC ATGCTGGTCG GACTGATTGG CACCTGCATT
CTGGTTAAGG TACTGAAAAA AATCCTTTTC GGCCTGGAAC CCTACGAAAT CTCCACGCTG
TTTGAGCAGC GCCAGGCCAT GTTGCAGTCT ATCAAAGAAG GCGTCGTTGC CGTGGACGAT
CGCGGCGAGG TCACGCTGAT CAACGATGCC GCACAAGAAT TGCTGAATTA CCGTAAGTCG
CAGGACGATG AGAAACTGTC GACGCTAAGC CACTCATGGT CACAGGTGGT AGATGTCTCG
GAAGTGTTAC GCGACGGTAC GCCGCGCCGC GACGAAGAGA TTACGATTAA AGACCGGCTA
TTACTGATCA ACACCGTTCC GGTGCGCAGT AATGGCGTTA TCATCGGTGC CATTTCAACC
TTCAGGGACA AAACTGAAGT TCGTAAACTG ATGCAGCGAC TCGACGGTCT GGTCAACTAT
GCTGATGCAC TTCGTGAACG ATCCCACGAA TTTATGAATA AATTGCATGT GATTCTCGGA
TTATTGCATC TGAAGAGTTA TAAGCAGTTG GAAGATTACA TTCTCAAAAC AGCCAATAAC
TATCAGGAAG AGATTGGCTC TCTGCTGGGT AAGATCAAAT CTCCGGTTAT CGCTGGTTTT
TTAATCAGCA AGATTAACCG CGCGACCGAT TTAGGCCATA CGCTGATTTT AAACAGTGAA
AGCCAGCTGC CAGACAGCGG CAGTGAGGAC CAGGTCGCGA CGCTGATTAC CACATTGGGA
AATCTGATAG AAAACGCGCT GGAAGCATTA GGGCCGGAAC CCGGTGGCGA AATTAGCGTA
ACATTGCACT ACCGTCACGG CTGGCTGCAC TGCGAAGTTA ACGATGATGG ACCGGGAATC
GCACCCGACA AAATCGATCA CATTTTTGAC AAAGGTGTCT CGACAAAAGG AAGCGAGCGA
GGCGTCGGTT TAGCACTTGT CAAACAACAG GTAGAAAATC TCGGCGGCAG CATCGCCGTG
GAATCGGAAC CCGGGATTTT CACACAATTT TTTGTCCAGA TACCCTGGGA CGGGGAGAGG
TCGAACAGAT GA
 
Protein sequence
MRHSLPYRML RKRPMKLSTT VILMVSAVLF SVLLVVHLIY FSQISDMTRD GLANKALAVA 
RTLADSPEIR QGLQKKPQES GIQAIAEAVR KRNDLLFIVV TDMQSLRYSH PEAQRIGQPF
KGDDILKALN GKENVAINRG FLAQALRVFT PIYDENHKQI GVVAIGLELS RVTQQINDSR
WSIIWSVLFG MLVGLIGTCI LVKVLKKILF GLEPYEISTL FEQRQAMLQS IKEGVVAVDD
RGEVTLINDA AQELLNYRKS QDDEKLSTLS HSWSQVVDVS EVLRDGTPRR DEEITIKDRL
LLINTVPVRS NGVIIGAIST FRDKTEVRKL MQRLDGLVNY ADALRERSHE FMNKLHVILG
LLHLKSYKQL EDYILKTANN YQEEIGSLLG KIKSPVIAGF LISKINRATD LGHTLILNSE
SQLPDSGSED QVATLITTLG NLIENALEAL GPEPGGEISV TLHYRHGWLH CEVNDDGPGI
APDKIDHIFD KGVSTKGSER GVGLALVKQQ VENLGGSIAV ESEPGIFTQF FVQIPWDGER
SNR