Gene EcolC_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3902 
Symbol 
ID6064377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4282805 
End bp4284436 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content50% 
IMG OID641603316 
Productsensory histidine kinase DcuS 
Protein accessionYP_001726831 
Protein GI170021877 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATT CATTGCCCTA CCGCATGTTA CGCAAACGTC CGATGAAATT GAGTACCACA 
GTGATCTTAA TGGTCAGCGC GGTACTGTTC TCGGTGCTAT TGGTGGTGCA TCTGATTTAC
TTCTCGCAAA TCAGTGATAT GACGCGAGAC GGGCTGGCCA ACAAGGCACT GGCAGTGGCG
CGTACTCTCG CCGACTCGCC GGAAATCCGT CAGGGCTTGC AGAAAAAACC GCAGGAGAGT
GGAATCCAGG CCATCGCGGA AGCCGTGCGC AAACGCAACG ATCTGCTGTT TATTGTCGTT
ACCGATATGC AAAGTCTTCG CTACTCACAT CCTGAAGCCC AGCGTATTGG TCAGCCATTT
AAAGGTGATG ACATCCTTAA AGCGCTGAAT GGCGAAGAAA ATGTCGCTAT CAATCGCGGT
TTTCTGGCGC AGGCTTTACG CGTATTTACC CCCATCTACG ATGAAAATCA TAAACAAATT
GGCGTGGTGG CGATCGGCCT TGAGTTAAGC CGCGTGACCC AACAAATCAA TGACAGTCGC
TGGAGCATTA TCTGGTCGGT ACTATTTGGC ATGCTGGTCG GACTGATTGG CACCTGCATT
CTGGTTAAAG TACTGAAAAA AATCCTTTTC GGCCTGGAAC CCTACGAAAT CTCCACGCTG
TTTGAGCAAC GCCAGGCCAT GTTGCAGTCT ATCAAAGAAG GCGTCGTTGC CGTGGACGAT
CGCGGCGAGG TCACGCTGAT CAACGATGCC GCACAAGAAT TGCTGAATTA CCGTAAGTCG
CAGGACGATG AGAAACTGTC GACGCTAAGC CACTCGTGGT CACAGGTAGT AGATGTCTCG
GAAGTGTTAC GCGACGGTAC GCCGCGCCGC GACGAAGAGA TTACGATTAA AGACCGGCTA
TTACTGATCA ACACCGTTCC GGTGCGCAGT AATGGCGTTA TCATCGGTGC CATTTCAACC
TTCAGGGACA AAACTGAAGT TCGTAAACTG ATGCAGCGAC TCGACGGTCT GGTCAACTAT
GCTGATGCAC TTCGTGAACG ATCCCACGAA TTTATGAATA AATTGCATGT GATTCTCGGA
TTATTGCATC TGAAGAGTTA TAAGCAGTTG GAAGATTACA TTCTCAAAAC AGCCAATAAC
TATCAGGAAG AGATTGGCTC TCTGCTGGGT AAGATCAAAT CTCCGGTTAT CGCTGGTTTT
TTAATCAGCA AGATTAACCG CGCGACCGAT TTAGGCCATA CGCTGATTTT AAACAGTGAA
AGCCAGCTGC CAGACAGCGG CAGTGAGGAC CAGGTCGCGA CGCTGATTAC CACATTGGGA
AATCTGATAG AAAACGCGCT GGAAGCATTA GGGCCGGAAC CCGGTGGCGA AATTAGCGTA
ACATTGCACT ACCGTCACGG CTGGCTGCAC TGCGAAGTTA ACGATGATGG ACCGGGGATC
GCACCCGACA AAATCGATCA CATTTTTGAC AAAGGTGTCT CGACAAAAGG AAGCGAGCGA
GGCGTCGGTT TAGCACTTGT CAAACAACAG GTAGAAAATC TCGGCGGCAG CATCGCCGTG
GAATCGGAAC CCGGGATTTT CACACAATTT TTTGTCCAGA TACCCTGGGA CGGGGAGAGG
TCGAACAGAT GA
 
Protein sequence
MRHSLPYRML RKRPMKLSTT VILMVSAVLF SVLLVVHLIY FSQISDMTRD GLANKALAVA 
RTLADSPEIR QGLQKKPQES GIQAIAEAVR KRNDLLFIVV TDMQSLRYSH PEAQRIGQPF
KGDDILKALN GEENVAINRG FLAQALRVFT PIYDENHKQI GVVAIGLELS RVTQQINDSR
WSIIWSVLFG MLVGLIGTCI LVKVLKKILF GLEPYEISTL FEQRQAMLQS IKEGVVAVDD
RGEVTLINDA AQELLNYRKS QDDEKLSTLS HSWSQVVDVS EVLRDGTPRR DEEITIKDRL
LLINTVPVRS NGVIIGAIST FRDKTEVRKL MQRLDGLVNY ADALRERSHE FMNKLHVILG
LLHLKSYKQL EDYILKTANN YQEEIGSLLG KIKSPVIAGF LISKINRATD LGHTLILNSE
SQLPDSGSED QVATLITTLG NLIENALEAL GPEPGGEISV TLHYRHGWLH CEVNDDGPGI
APDKIDHIFD KGVSTKGSER GVGLALVKQQ VENLGGSIAV ESEPGIFTQF FVQIPWDGER
SNR