Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4366 |
Symbol | dcuS |
ID | 5592118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4374053 |
End bp | 4375684 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923464 |
Product | sensory histidine kinase DcuS |
Protein accession | YP_001460909 |
Protein GI | 157163591 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.206477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACATT CATTGCCCTA CCGCATGTTA CGCAAACGTC CGATGAAATT GAGTACCACA GTGATCTTAA TGGTCAGCGC GGTACTGTTC TCGGTGCTAT TGGTGGTGCA TCTGATTTAC TTCTCGCAAA TCAGTGATAT GACGCGAGAC GGGCTGGCCA ACAAGGCACT GGCAGTGGCG CGTACCCTCG CCGACTCGCC GGAAATCCGT CAGGGCTTGC AGAAAAAACC GCAGGAGAGC GGAATCCAGG CCATCGCGGA AGCCGTGCGC AAACGCAACG ATCTGCTGTT TATTGTCGTT ACCGATATGC AAAGTCTTCG CTACTCGCAT CCTGAAGCCC AGCGTATTGG TCAGCCATTT AAAGGTGATG ACATCCTTAA AGCGCTGAAT GGCAAAGAAA ATGTCGCTAT CAATCGCGGT TTTCTGGCGC AGGCTTTACG CGTATTTACC CCCATCTACG ATGAAAATCA TAAACAAATT GGCGTGGTGG CGATCGGCCT TGAGTTAAGC CGCGTGACCC AACAAATCAA TGACAGTCGC TGGAGCATTA TCTGGTCGGT ATTATTTGGC ATGCTGGTCG GACTGATTGG CACCTGCATT CTGGTTAAGG TACTGAAAAA AATCCTTTTC GGCCTGGAAC CCTACGAAAT CTCCACGCTG TTTGAGCAGC GCCAGGCCAT GTTGCAGTCT ATCAAAGAAG GCGTCGTTGC CGTGGACGAT CGCGGCGAGG TCACGCTGAT CAACGATGCC GCACAAGAAT TGCTGAATTA CCGTAAGTCG CAGGACGATG AGAAACTGTC GACGCTAAGC CACTCATGGT CACAGGTGGT AGATGTCTCG GAAGTGTTAC GCGACGGTAC GCCGCGCCGC GACGAAGAGA TTACGATTAA AGACCGGCTA TTACTGATCA ACACCGTTCC GGTGCGCAGT AATGGCGTTA TCATCGGTGC CATTTCAACC TTCAGGGACA AAACTGAAGT TCGTAAACTG ATGCAGCGAC TCGACGGTCT GGTCAACTAT GCTGATGCAC TTCGTGAACG ATCCCACGAA TTTATGAATA AATTGCATGT GATTCTCGGA TTATTGCATC TGAAGAGTTA TAAGCAGTTG GAAGATTACA TTCTCAAAAC AGCCAATAAC TATCAGGAAG AGATTGGCTC TCTGCTGGGT AAGATCAAAT CTCCGGTTAT CGCTGGTTTT TTAATCAGCA AGATTAACCG CGCGACCGAT TTAGGCCATA CGCTGATTTT AAACAGTGAA AGCCTGCTGC CAGACAGCGG CAGTGAGGAC CAGGTCGCGA CGCTGATTAC CACATTGGGA AATCTGATAG AAAACGCGCT GGAAGCATTA GGGCCGGAAC CCGGTGGCGA AATTAGCGTA ACATTGCACT ACCGTCACGG CTGGCTGCAC TGCGAAGTTA ACGATGATGG ACCGGGGATC GCACCCGACA AAATCGATCA CATTTTTGAC AAAGGTGTCT CGACAAAAGG AAGCGAGCGA GGCGTCGGTT TAGCACTTGT CAAACAACAG GTAGAAAATC TCGGCGGCAG CATCGCCGTG GAATCGGAAC CCGGGATTTT CACACAATTT TTTGTCCAGA TACCCTGGGA CGGGGAGAGG TCGAACAGAT GA
|
Protein sequence | MRHSLPYRML RKRPMKLSTT VILMVSAVLF SVLLVVHLIY FSQISDMTRD GLANKALAVA RTLADSPEIR QGLQKKPQES GIQAIAEAVR KRNDLLFIVV TDMQSLRYSH PEAQRIGQPF KGDDILKALN GKENVAINRG FLAQALRVFT PIYDENHKQI GVVAIGLELS RVTQQINDSR WSIIWSVLFG MLVGLIGTCI LVKVLKKILF GLEPYEISTL FEQRQAMLQS IKEGVVAVDD RGEVTLINDA AQELLNYRKS QDDEKLSTLS HSWSQVVDVS EVLRDGTPRR DEEITIKDRL LLINTVPVRS NGVIIGAIST FRDKTEVRKL MQRLDGLVNY ADALRERSHE FMNKLHVILG LLHLKSYKQL EDYILKTANN YQEEIGSLLG KIKSPVIAGF LISKINRATD LGHTLILNSE SLLPDSGSED QVATLITTLG NLIENALEAL GPEPGGEISV TLHYRHGWLH CEVNDDGPGI APDKIDHIFD KGVSTKGSER GVGLALVKQQ VENLGGSIAV ESEPGIFTQF FVQIPWDGER SNR
|
| |