Gene Information |
Locus tag | EcSMS35_2709 |
Symbol | |
ID | 6144786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2784871 |
End bp | 2786298 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617580 |
Product | sensor histidine kinase |
Protein accession | YP_001744745 |
Protein GI | 170680109 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
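The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are chosen here for illustration only.

```python
# Values copied from the Gene Information table.
start_bp = 2784871
end_bp = 2786298
gene_length_bp = 1428
protein_length_aa = 475

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and does not encode a residue.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)
```

Under these assumptions the computed span equals the listed Gene Length, and the codon count minus the stop codon equals the listed Protein Length.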
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAACGCT GGCCCGTTTT TCCCCGCTCA TTACGACAAC TGGTAATGCT GGCATTTTTG CTGATTCTGC TACCCCTGTT GGTGCTGGCA TGGCAAGCCT GGCAAAGCCT GAATGCGCTT AGCGATCAGG CGGCGCTGGT TAACCGCACT ACGCTTATTG ATGCCCGGCG CAGTGAAGCG ATGACCAACG CGGCGCTGGA GATGGAGCGT AGCTACCGTC AGTATTGCGT GTTGGACGAC CCAACCCTGG CGAAGGTTTA TCAAAGCCAG CGCAAGCGTT ACAGCGAAAT GCTCGATGCC CACGCAGGCG TGCTGCCGGA CGATAAACTC TACCAGGCAT TACGTCAGGA CTTGAACAAT CTGGCTCAAC TTCAGTGTAA CAACAGCGGT CCCGATGCCG CCGCCGCCGC GCGTCTGGAA GCCTTTGCCA GTGCCAATAC CGAAATGGTG CAGGCCACGC GCACAGTGGT GTTCTCTCGT GGGCAGCAAC TTCAGCGTGA AATCGCCGAA CGTGGGCAAT ATTTTGGTTG GCAATCGCTG GTGCTATTTC TGGTGAGTCT GGTGATGGTG CTACTTTTCA CGCGGATGAT TATCGGGCCG GTGAAAAATA TCGAGCGCAT GATCAACCGG CTGGGGGAAG GGCGTTCTCT GGGCAATAGC GTCTCGTTCA GTGGACCGAG CGAGTTACGC TCGGTTGGGC AACGTATTCT TTGGTTAAGT GAGCGCCTGT CATGGCTGGA ATCCCAACGC CATCAATTTT TAAGACATTT ATCTCATGAA TTAAAAACCC CGCTGGCGAG TATGCGCGAG GGCACTGAAT TACTGGCAGA CCAGGTTGTC GGGCCGCTTA CGCCCGAGCA AAAAGAGGTG GTGAGCATCC TTGATAGCAG CAGCCGCAAT TTGCAAAAAC TGATCGAACA ACTGCTTGAT TACAACCGTA AACAAGCGGA CAGTGCGGTG GAACTGGAGA ATGTTGAGTT AGCACCGCTG GTGGAGACAG TAGTTTCTGC TCATAGCCTG CCCGCACGGG CTAAAATGAT GCATACCGAC GTTGATCTCA AAGCAACAGC TTGCCTGGCG GAGCCAATGC TGCTGATGAG CGTACTGGAT AATCTTTACT CCAATGCGGT GCACTACGGG GCTGAATCCG GTAACATTTG CCTTCGCAGC AGTTTACATG GTGCGCGGGT TTATATTGAT GTCATCAATA CAGGCACGCC CATTCCGCAA GAGGAACGCG CCATGATCTT CGAACCCTTT TTTCAGGGAA GCCACCAGCG AAAAGGGGCG GTGAAGGGCA GCGGTCTGGG ATTAAGCATT GCCAGGGATT GTATTCGCCG TATGCAAGGG GAACTGTATC TGGTCGACGA GAGCGGGCAA GATGTTTGTT TCCGCATTGA ATTACCGTCG TCGAAAAACA CGAAATAA
Protein sequence | MKRWPVFPRS LRQLVMLAFL LILLPLLVLA WQAWQSLNAL SDQAALVNRT TLIDARRSEA MTNAALEMER SYRQYCVLDD PTLAKVYQSQ RKRYSEMLDA HAGVLPDDKL YQALRQDLNN LAQLQCNNSG PDAAAAARLE AFASANTEMV QATRTVVFSR GQQLQREIAE RGQYFGWQSL VLFLVSLVMV LLFTRMIIGP VKNIERMINR LGEGRSLGNS VSFSGPSELR SVGQRILWLS ERLSWLESQR HQFLRHLSHE LKTPLASMRE GTELLADQVV GPLTPEQKEV VSILDSSSRN LQKLIEQLLD YNRKQADSAV ELENVELAPL VETVVSAHSL PARAKMMHTD VDLKATACLA EPMLLMSVLD NLYSNAVHYG AESGNICLRS SLHGARVYID VINTGTPIPQ EERAMIFEPF FQGSHQRKGA VKGSGLGLSI ARDCIRRMQG ELYLVDESGQ DVCFRIELPS SKNTK
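The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), where TTG can act as an initiation codon. The sketch below is a minimal, dependency-free illustration and not the tool that produced this record. The function name `translate_cds` is hypothetical, and the start-codon set is written out from the NCBI definition of table 11 as understood here; it should be checked against the official table before being relied on.

```python
# Codon table laid out in the conventional T, C, A, G order.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as initiator methionine."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid start codon is translated as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "TTGAAACGCT GGCCCGTTTT TCCCCGCTCA"
print(translate_cds(first_blocks))
```

Applied to the first 30 bases, the sketch yields the same ten residues that open the listed protein sequence; passing the full gene sequence would translate the whole CDS up to the terminal TAA.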