Gene EcSMS35_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1408 
Symbol 
ID6143930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1391468 
End bp1393402 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content50% 
IMG OID641616286 
Productserine kinase family protein 
Protein accessionYP_001743466 
Protein GI170684310 
COG category[T] Signal transduction mechanisms 
COG ID[COG2766] Putative Ser protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0526547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATAT TCGATCACTA TCGCCAGCGA TATGAAGCTG CCAAGGACGA AGAGTTCACA 
CTGCAGGAGT TTCTTACCAC TTGTCGGCAA GATCGCAGTG CTTATGCCAA CGCGGCTGAG
CGGCTATTGA TGGCTATTGG TGAGCCTGTC ATGGTCGATA CAGCCCAGGA ACCCAGACTT
TCTCGACTCT TTTCTAACCG GGTCATTGCA CGTTATCCGG CGTTTGAAGA GTTTTACGGC
ATGGAAGACG CGATTGAACA GATTGTCTCT TATCTGAAAC ACGCGGCTCA GGGGCTGGAA
GAGAAGAAAC AAATCCTGTA TCTGCTGGGG CCTGTGGGTG GGGGTAAATC ATCGCTTGCT
GAGCGACTGA AATCATTAAT GCAGCTCGTG CCGATCTATG TATTAAGCGC TAACGGTGAG
CGGAGCCCGG TCAACGATCA TCCGTTCTGT CTTTTCAATC CGCAGGAAGA TGCGCAGATT
CTGGAAAAAG AGTATGGCAT TCCTCGCCGT TATCTCGGCA CCATCATGTC GCCGTGGGCG
GCAAAACGCC TGCATGAATT TGGTGGCGAT ATCACTAAGT TCCGGGTAGT GAAGGTCTGG
CCGTCAATTC TGCAACAAAT TGCTATCGCC AAAACGGAAC CCGGTGATGA GAACAACCAG
GACATCTCCG CCCTGGTAGG GAAAGTCGAT ATTCGTAAAC TCGAACACTA CGCGCAGAAT
GACCCGGACG CCTACGGCTA TTCCGGTGCG CTGTGCCGCG CCAATCAGGG GATCATGGAA
TTTGTCGAGA TGTTTAAAGC ACCGATAAAA GTGCTGCATC CCTTGTTAAC CGCCACCCAG
GAAGGTAACT ACAACGGGAC GGAAGGTATC TCCGCCCTGC CGTTCAACGG GATTATTCTC
GCGCACTCGA ACGAATCCGA ATGGGTCACT TTCCGTAATA ACAAAAACAA CGAAGCCTTC
CTCGACCGTG TTTACATCGT GAAGGTGCCG TATTGCTTGC GCATTTCCGA AGAGATCAAA
ATCTACGAGA AATTGCTTAA TCACAGTGAA TTGACTCACG CCCCCTGCGC CCCTGGCACG
CTGGAAACGC TGTCACGTTT TTCTATTCTT TCGCGTCTGA AAGAGCCAGA AAACTCCAGC
ATTTATTCAA AGATGCGGGT TTATGATGGT GAAAGTTTGA AAGACACTGA TCCCAAAGCC
AAGTCGTATC AGGAATATCG TGACTACGCC GGTGTCGATG AAGGGATGAA CGGTCTGTCG
ACGCGTTTTG CGTTTAAGAT CCTCTCCCGC GTGTTCAACT TCGATCATGT AGAAGTGGCG
GCAAACCCGG TCCATCTGTT CTACGTCCTG GAACAGCAGA TCGAGCGCGA GCAGTTCCCA
CAAGAGCAGG CAGAACGCTA TCTGGAGTTC CTGAAAGGTT ATCTGATCCC GAAATATGCC
GAGTTTATCG GCAAAGAGAT CCAGACGGCC TACCTTGAAT CCTATTCCGA ATATGGGCAA
AACATTTTCG ACCGTTATGT TACCTACGCG GATTTCTGGA TTCAGGATCA GGAGTATCGC
GATCCGGATA CCGGGCAGTT GTTTGACCGC GAGTCTCTTA ACGCCGAGCT GGAGAAAATC
GAGAAACCGG CGGGGATCAG TAATCCAAAA GATTTCCGCA ACGAGATTGT TAACTTCGTA
CTGCGCGCCA GAGCGAATAA CAGCGGACGC AATCCGAACT GGACCAGCTA TGAAAAACTG
CGCACGGTTA TTGAGAAGAA AATGTTCTCC AATACCGAGG AGCTGTTGCC GGTTATTTCG
TTTAACGCCA AAACGTCAAC CGACGAGCAG AAGAAACATG ACGACTTTGT CGACCGTATG
ATGGAAAAAG GCTACACCCG TAAACAGGTG CGTTTACTGT GCGAATGGTA TTTGCGCGTA
CGTAAATCGT CTTAA
 
Protein sequence
MNIFDHYRQR YEAAKDEEFT LQEFLTTCRQ DRSAYANAAE RLLMAIGEPV MVDTAQEPRL 
SRLFSNRVIA RYPAFEEFYG MEDAIEQIVS YLKHAAQGLE EKKQILYLLG PVGGGKSSLA
ERLKSLMQLV PIYVLSANGE RSPVNDHPFC LFNPQEDAQI LEKEYGIPRR YLGTIMSPWA
AKRLHEFGGD ITKFRVVKVW PSILQQIAIA KTEPGDENNQ DISALVGKVD IRKLEHYAQN
DPDAYGYSGA LCRANQGIME FVEMFKAPIK VLHPLLTATQ EGNYNGTEGI SALPFNGIIL
AHSNESEWVT FRNNKNNEAF LDRVYIVKVP YCLRISEEIK IYEKLLNHSE LTHAPCAPGT
LETLSRFSIL SRLKEPENSS IYSKMRVYDG ESLKDTDPKA KSYQEYRDYA GVDEGMNGLS
TRFAFKILSR VFNFDHVEVA ANPVHLFYVL EQQIEREQFP QEQAERYLEF LKGYLIPKYA
EFIGKEIQTA YLESYSEYGQ NIFDRYVTYA DFWIQDQEYR DPDTGQLFDR ESLNAELEKI
EKPAGISNPK DFRNEIVNFV LRARANNSGR NPNWTSYEKL RTVIEKKMFS NTEELLPVIS
FNAKTSTDEQ KKHDDFVDRM MEKGYTRKQV RLLCEWYLRV RKSS