Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1408 |
Symbol | |
ID | 6143930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1391468 |
End bp | 1393402 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616286 |
Product | serine kinase family protein |
Protein accession | YP_001743466 |
Protein GI | 170684310 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2766] Putative Ser protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0526547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATAT TCGATCACTA TCGCCAGCGA TATGAAGCTG CCAAGGACGA AGAGTTCACA CTGCAGGAGT TTCTTACCAC TTGTCGGCAA GATCGCAGTG CTTATGCCAA CGCGGCTGAG CGGCTATTGA TGGCTATTGG TGAGCCTGTC ATGGTCGATA CAGCCCAGGA ACCCAGACTT TCTCGACTCT TTTCTAACCG GGTCATTGCA CGTTATCCGG CGTTTGAAGA GTTTTACGGC ATGGAAGACG CGATTGAACA GATTGTCTCT TATCTGAAAC ACGCGGCTCA GGGGCTGGAA GAGAAGAAAC AAATCCTGTA TCTGCTGGGG CCTGTGGGTG GGGGTAAATC ATCGCTTGCT GAGCGACTGA AATCATTAAT GCAGCTCGTG CCGATCTATG TATTAAGCGC TAACGGTGAG CGGAGCCCGG TCAACGATCA TCCGTTCTGT CTTTTCAATC CGCAGGAAGA TGCGCAGATT CTGGAAAAAG AGTATGGCAT TCCTCGCCGT TATCTCGGCA CCATCATGTC GCCGTGGGCG GCAAAACGCC TGCATGAATT TGGTGGCGAT ATCACTAAGT TCCGGGTAGT GAAGGTCTGG CCGTCAATTC TGCAACAAAT TGCTATCGCC AAAACGGAAC CCGGTGATGA GAACAACCAG GACATCTCCG CCCTGGTAGG GAAAGTCGAT ATTCGTAAAC TCGAACACTA CGCGCAGAAT GACCCGGACG CCTACGGCTA TTCCGGTGCG CTGTGCCGCG CCAATCAGGG GATCATGGAA TTTGTCGAGA TGTTTAAAGC ACCGATAAAA GTGCTGCATC CCTTGTTAAC CGCCACCCAG GAAGGTAACT ACAACGGGAC GGAAGGTATC TCCGCCCTGC CGTTCAACGG GATTATTCTC GCGCACTCGA ACGAATCCGA ATGGGTCACT TTCCGTAATA ACAAAAACAA CGAAGCCTTC CTCGACCGTG TTTACATCGT GAAGGTGCCG TATTGCTTGC GCATTTCCGA AGAGATCAAA ATCTACGAGA AATTGCTTAA TCACAGTGAA TTGACTCACG CCCCCTGCGC CCCTGGCACG CTGGAAACGC TGTCACGTTT TTCTATTCTT TCGCGTCTGA AAGAGCCAGA AAACTCCAGC ATTTATTCAA AGATGCGGGT TTATGATGGT GAAAGTTTGA AAGACACTGA TCCCAAAGCC AAGTCGTATC AGGAATATCG TGACTACGCC GGTGTCGATG AAGGGATGAA CGGTCTGTCG ACGCGTTTTG CGTTTAAGAT CCTCTCCCGC GTGTTCAACT TCGATCATGT AGAAGTGGCG GCAAACCCGG TCCATCTGTT CTACGTCCTG GAACAGCAGA TCGAGCGCGA GCAGTTCCCA CAAGAGCAGG CAGAACGCTA TCTGGAGTTC CTGAAAGGTT ATCTGATCCC GAAATATGCC GAGTTTATCG GCAAAGAGAT CCAGACGGCC TACCTTGAAT CCTATTCCGA ATATGGGCAA AACATTTTCG ACCGTTATGT TACCTACGCG GATTTCTGGA TTCAGGATCA GGAGTATCGC GATCCGGATA CCGGGCAGTT GTTTGACCGC GAGTCTCTTA ACGCCGAGCT GGAGAAAATC GAGAAACCGG CGGGGATCAG TAATCCAAAA GATTTCCGCA ACGAGATTGT TAACTTCGTA CTGCGCGCCA GAGCGAATAA CAGCGGACGC AATCCGAACT GGACCAGCTA TGAAAAACTG CGCACGGTTA TTGAGAAGAA AATGTTCTCC AATACCGAGG AGCTGTTGCC GGTTATTTCG TTTAACGCCA AAACGTCAAC CGACGAGCAG AAGAAACATG ACGACTTTGT CGACCGTATG ATGGAAAAAG GCTACACCCG TAAACAGGTG CGTTTACTGT GCGAATGGTA TTTGCGCGTA CGTAAATCGT CTTAA
|
Protein sequence | MNIFDHYRQR YEAAKDEEFT LQEFLTTCRQ DRSAYANAAE RLLMAIGEPV MVDTAQEPRL SRLFSNRVIA RYPAFEEFYG MEDAIEQIVS YLKHAAQGLE EKKQILYLLG PVGGGKSSLA ERLKSLMQLV PIYVLSANGE RSPVNDHPFC LFNPQEDAQI LEKEYGIPRR YLGTIMSPWA AKRLHEFGGD ITKFRVVKVW PSILQQIAIA KTEPGDENNQ DISALVGKVD IRKLEHYAQN DPDAYGYSGA LCRANQGIME FVEMFKAPIK VLHPLLTATQ EGNYNGTEGI SALPFNGIIL AHSNESEWVT FRNNKNNEAF LDRVYIVKVP YCLRISEEIK IYEKLLNHSE LTHAPCAPGT LETLSRFSIL SRLKEPENSS IYSKMRVYDG ESLKDTDPKA KSYQEYRDYA GVDEGMNGLS TRFAFKILSR VFNFDHVEVA ANPVHLFYVL EQQIEREQFP QEQAERYLEF LKGYLIPKYA EFIGKEIQTA YLESYSEYGQ NIFDRYVTYA DFWIQDQEYR DPDTGQLFDR ESLNAELEKI EKPAGISNPK DFRNEIVNFV LRARANNSGR NPNWTSYEKL RTVIEKKMFS NTEELLPVIS FNAKTSTDEQ KKHDDFVDRM MEKGYTRKQV RLLCEWYLRV RKSS
|
| |