Gene EcSMS35_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2649 
Symbolppk 
ID6143260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2707339 
End bp2709405 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content49% 
IMG OID641617520 
Productpolyphosphate kinase 
Protein accessionYP_001744685 
Protein GI170679676 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0855] Polyphosphate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCAGG AAAAGCTATA CATCGAAAAA GAGCTCAGTT GGTTATCGTT CAATGAACGC 
GTGCTTCAGG AAGCGGCGGA CAAATCTAAC CCGCTGATTG AAAGGATGCG TTTCCTGGGG
ATCTATTCCA ATAACCTTGA TGAGTTCTAT AAAGTCCGCT TCGCTGAACT GAAGCGACGC
ATCATTATTA GCGAAGAACA AGGCTCCAAC TCTCATTCCC GCCATTTACT GGGCAAAATT
CAGTCCCGGG TGCTGAAAGC CGATCAGGAA TTCGACGGCC TCTACAACGA GTTGTTGCTG
GAGATGGCGC GCAACCAGAT CTTCCTGATT AATGAACGCC AGCTCTCCGT CAATCAACAA
AACTGGCTGC GTCATTATTT TAAGCAGTAT CTGCGTCAGC ACATTACGCC GATTTTAATC
AATCCTGACA CTGATTTAGT GCAGTTCCTG AAAGATGATT ACACCTATCT GGCGGTGGAA
ATTATCCGTG GCGATACCAT CCGTTACGCG CTGCTGGAGA TCCCATCAGA TAAAGTGCCG
CGCTTTGTGA ATTTACCGCC AGAAGCGCCG CGTCGACGCA AGCCGATGAT TCTTCTGGAT
AACATTCTGC GTTACTGCCT TGATGATATT TTCAAAGGCT TCTTTGATTA TGACGCGCTG
AATGCCTATT CAATGAAGAT GACCCGCGAT GCCGAATACG ATTTAGTGCA TGAGATGGAA
GCCAGCCTGA TGGAGTTGAT GTCTTCCAGT CTCAAGCAGC GTTTAACTGC TGAGCCGGTG
CGTTTTGTTT ATCAGCGCGA TATGCCCAAT GCGCTGGTTG AAGTGTTACG CGAAAAACTG
ACTATTTCCC GCTACGACTC CATCGTCCCT GGTGGTCGTT ATCATAATTT TAAAGACTTT
ATTAATTTCC CCAATGTCGG CAAAGCCAAT CTGGTGAACA AACCACTGCC GCGTTTACGC
CACATTTGGT TTGATAAAGC CCAGTTCCGC AATGGTTTTG ATGCTATTCG CGAACGCGAT
GTGTTGCTCT ATTATCCTTA TCATACCTTT GAGCATGTAC TGGAACTGCT GCGTCAGGCT
TCGTTCGACC CGAGCGTACT GGCGATTAAA ATTAACATTT ACCGCGTGGC GAAAGATTCA
CGCATCATCG ACTCGATGAT CCACGCCGCA CACAACGGTA AGAAAGTCAC CGTGGTGGTT
GAGTTACAGG CGCGTTTCGA CGAAGAAGCC AACATTCACT GGGCGAAGCG CCTGACTGAA
GCAGGCGTGC ACGTTATCTT CTCTGCGCCG GGGCTGAAAA TTCACGCCAA ACTGTTCCTG
ATTTCACGTA AAGAAAACGG TGAAGTGGTG CGTTACGCAC ACATCGGGAC CGGGAACTTT
AACGAAAAAA CCGCGCGTCT TTATACTGAC TATTCGTTGC TGACCGCCGA TGCGCGCATC
ACTAACGAAG TACGGCGGGT GTTCAACTTT ATTGAAAACC CATACCGTCC GGTGACATTT
GATTATTTAA TGGTGTCGCC GCAAAACTCT CGCCGTCTGT TGTATGAAAT GGTGGACCGC
GAGATCGCCA ACGCGCAGCA AGGGCTGCCC AGTGGTATCA CCCTGAAGCT AAATAACCTT
GTCGATAAAG GCCTGGTTGA TCGTCTGTAT GCGGCCTCCA GCTCCGGCGT ACCGGTTAAT
CTGCTGGTGC GCGGAATGTG TTCTCTGATC CCCAATCTGG AAGGCATTAG CGACAACATT
CGTGCCATCA GTATTGTTGA CCGTTACCTT GAACATGACC GGGTTTATAT TTTTGAAAAT
GGCGGCGATA AAAAGGTCTA CCTTTCTTCC GCCGACTGGA TGACGCGCAA TATTGATTAT
CGTATTGAAG TGGCGACGCC GCTGCTCGAT CCGCGCCTGA AGCAGCGGGT GCTGGACATC
ATCGACATAT TATTCAGCGA TACAGTCAAA GCACGTTATA TCGATAAAGA ACTCAGTAAT
CGCTACGTTC CCCGCGGCAA TCGCCGCAAA GTACGGGCGC AGTTGGCGAT TTACGACTAC
ATCAAATCAC TCGAACAACC TGAATAA
 
Protein sequence
MGQEKLYIEK ELSWLSFNER VLQEAADKSN PLIERMRFLG IYSNNLDEFY KVRFAELKRR 
IIISEEQGSN SHSRHLLGKI QSRVLKADQE FDGLYNELLL EMARNQIFLI NERQLSVNQQ
NWLRHYFKQY LRQHITPILI NPDTDLVQFL KDDYTYLAVE IIRGDTIRYA LLEIPSDKVP
RFVNLPPEAP RRRKPMILLD NILRYCLDDI FKGFFDYDAL NAYSMKMTRD AEYDLVHEME
ASLMELMSSS LKQRLTAEPV RFVYQRDMPN ALVEVLREKL TISRYDSIVP GGRYHNFKDF
INFPNVGKAN LVNKPLPRLR HIWFDKAQFR NGFDAIRERD VLLYYPYHTF EHVLELLRQA
SFDPSVLAIK INIYRVAKDS RIIDSMIHAA HNGKKVTVVV ELQARFDEEA NIHWAKRLTE
AGVHVIFSAP GLKIHAKLFL ISRKENGEVV RYAHIGTGNF NEKTARLYTD YSLLTADARI
TNEVRRVFNF IENPYRPVTF DYLMVSPQNS RRLLYEMVDR EIANAQQGLP SGITLKLNNL
VDKGLVDRLY AASSSGVPVN LLVRGMCSLI PNLEGISDNI RAISIVDRYL EHDRVYIFEN
GGDKKVYLSS ADWMTRNIDY RIEVATPLLD PRLKQRVLDI IDILFSDTVK ARYIDKELSN
RYVPRGNRRK VRAQLAIYDY IKSLEQPE