Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4592 |
Symbol | dcuS |
ID | 6146536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4694995 |
End bp | 4696626 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619408 |
Product | sensory histidine kinase DcuS |
Protein accession | YP_001746520 |
Protein GI | 170682025 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACATT CATTGCCCTA CCGTATGTTA CGCAAACGTC CGATGAAATT GAGTACCACT GTGATCTTAA TGGTCAGCGC GGTACTGTTC TCGGTGCTAT TGGTGGTGCA TCTGATTTAC TTCTCGCAAA TCAGTGATAT GACGCGAGAC GGGCTAGCTA ACAAGGCACT GGCAGTAGCG CGTACCCTCG CCGACTCGCC GGAAATCCGG CAGGGCTTGC AGAAAAAACC GCAGGAGAGC GGCATCCAGG CCATCGCGGA AGCCGTGCGC AAACGCAACG ATCTGCTATT TATTGTCGTT ACCGATATGC AAAGTCTTCG CTACTCGCAT CCTGAAGCCC AGCGTATTGG TCAGCCATTT AAAGGTGATG ACATCCTTAA AGCGCTGAAT GGCGAAGAAA ATGTCGCTAT CAATCGCGGT TTTCTGGCGC AGGCTTTACG CGTATTTACC CCCATCTACG ATGAAAATCA TAAACAAATT GGCGTGGTGG CGATCGGCCT TGAGTTAAGT CGCGTGACCC AACAGATCAA TGACAGTCGC TGGAGCATCA TCTGGTCGGT ATTATTTGGC ATGCTGGTCG GGCTGATTGG CACCTGCATT CTGGTTAAGG TACTGAAAAA AATCCTTTTC GGCCTGGAAC CCTACGAAAT CTCCACGCTG TTTGAGCAAC GCCAGGCCAT GTTGCAGTCT ATCAAAGAAG GCGTCGTTGC CGTGGACGAT CGCGGCGAAG TCACGCTGAT CAACGATGCC GCACAAGAAT TGCTGAATTA CCGTAAGTCG CAGGACGATG AGAAGCTGTC GACGCTAAGC CACGCGTGGT CACAGGTAGT AGATGTCTCG GAAGTGTTAC GCGACGGTAC CCCGCGCCGC GACGAAGAGA TTACGATTAA AGACCGGCTA TTACTGATCA ACACCGTTCC GGTGCGCAGT AATGGCGTTA TCATCGGTGC CATTTCAACC TTCAGGGACA AAACTGAAGT ACGTAAACTG ATGCAGCGAC TCGATGGTCT GGTCAACTAT GCTGACGCAC TTCGTGAACG ATCCCACGAA TTTATGAATA AATTGCACGT GATTCTCGGA TTATTGCATC TGAAGAGTTA TAAGCAGTTG GAAGATTACA TTCTCAAAAC AGCCAATAAC TATCAGGAAG AGATTGGCTC TCTGCTGGGT AAGATCAAAT CTCCGGTTAT CGCTGGTTTT TTAATCAGCA AGATTAACCG CGCGACCGAT TTAGGCCATA CGCTGATTTT AAACAGTGAA AGCCAGCTGC CAGACAGCGG TAGTGAGGAT CAGGTCGCGA CGCTGATTAC CACGTTGGGA AATCTGATAG AAAACGCGCT GGAGGCATTA GGGCCGGAAC CCGGAGGCGA AATTAGCGTA ACATTGCACT ACCGTCACGG CTGGCTGCAC TGTGAAGTTA ACGATGATGG ACCGGGGATC GCACCCGACA AAATCGATCA CATTTTTGAC AAAGGTGTCT CGACAAAAGG AAGCGAGCGA GGCGTCGGTT TAGCACTTGT CAAACAACAG GTAGAAAATC TCGGCGGCAG CATCGCCGTG GAATCGGAAC CCGGGATTTT CACACAATTT TTTGTCCAGA TACCCTGGGA CGGGGAGAGG TCGAACAGAT GA
|
Protein sequence | MRHSLPYRML RKRPMKLSTT VILMVSAVLF SVLLVVHLIY FSQISDMTRD GLANKALAVA RTLADSPEIR QGLQKKPQES GIQAIAEAVR KRNDLLFIVV TDMQSLRYSH PEAQRIGQPF KGDDILKALN GEENVAINRG FLAQALRVFT PIYDENHKQI GVVAIGLELS RVTQQINDSR WSIIWSVLFG MLVGLIGTCI LVKVLKKILF GLEPYEISTL FEQRQAMLQS IKEGVVAVDD RGEVTLINDA AQELLNYRKS QDDEKLSTLS HAWSQVVDVS EVLRDGTPRR DEEITIKDRL LLINTVPVRS NGVIIGAIST FRDKTEVRKL MQRLDGLVNY ADALRERSHE FMNKLHVILG LLHLKSYKQL EDYILKTANN YQEEIGSLLG KIKSPVIAGF LISKINRATD LGHTLILNSE SQLPDSGSED QVATLITTLG NLIENALEAL GPEPGGEISV TLHYRHGWLH CEVNDDGPGI APDKIDHIFD KGVSTKGSER GVGLALVKQQ VENLGGSIAV ESEPGIFTQF FVQIPWDGER SNR
|
| |