Gene ECH74115_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3724 
Symbolppk 
ID6966644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3443303 
End bp3445369 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content49% 
IMG OID643387517 
Productpolyphosphate kinase 
Protein accessionYP_002271970 
Protein GI209399123 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0855] Polyphosphate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.132116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCAGG AAAAGCTATA CATCGAAAAA GAGCTCAGTT GGTTATCGTT CAATGAACGC 
GTGCTTCAGG AAGCGGCGGA CAAATCTAAC CCGCTGATTG AAAGGATGCG TTTCCTGGGG
ATCTATTCCA ATAACCTTGA TGAGTTCTAT AAAGTCCGCT TCGCTGAACT GAAGCGACGC
ATCATTATTA GCGAAGAACA AGGCTCCAAC TCTCATTCCC GCCATTTACT GGGCAAAATT
CAGTCCCGGG TGCTGAAAGC CGATCAGGAA TTCGACGGCC TCTACAACGA GCTATTGCTG
GAGATGGCGC GCAACCAGAT CTTCCTGATT AATGAACGCC AGCTCTCCGT CAATCAACAA
AACTGGCTGC GTCATTATTT TAAGCAGTAT CTGCGTCAGC ACATTACGCC GATTTTAATC
AATCCTGACA CTGACTTAGT GCAGTTCCTG AAAGATGATT ACACCTATCT GGCGGTGGAA
ATTATCCGTG GCGATACCAT CCGTTACGCG CTTCTGGAGA TCCCATCAGA TAAAGTGCCG
CGCTTTGTGA ATTTACCGCC AGAAGCGCCG CGTCGACGCA AGCCAATGAT TCTTCTGGAT
AACATTCTGC GTTACTGCCT TGATGATATT TTCAAAGGCT TCTTTGATTA TGACGCGCTG
AATGCCTATT CAATGAAGAT GACCCGCGAT GCCGAATACG ATTTAGTGCA TGAGATGGAA
GCCAGCCTGA TGGAGTTGAT GTCTTCCAGT CTCAAGCAGC GTTTAACTGC TGAGCCGGTG
CGTTTTGTTT ATCAGCGCGA TATGCCCAAT GCGCTGGTTG AAGTGTTACG CGAAAAACTG
ACTATTTCCC GCTACGACTC CATCGTCCCC GGCGGTCGTT ATCATAATTT TAAAGACTTT
ATTAATTTCC CCAATGTCGG CAAAGCCAAT CTGGTGAACA AACCACTGCC GCGTTTACGC
CATATTTGGT TTGATAAAGC CCAGTTCCGC AATGGTTTTG ATGCCATTCG CGAACGCGAT
GTGTTGCTCT ATTATCCTTA TCACACCTTT GAGCATGTGC TGGAACTGCT GCGTCAGGCC
TCGTTCGATC CGAGCGTGCT GGCGATTAAA ATCAACATTT ACCGTGTGGC AAAAGATTCA
CGCATCATCG ACTCGATGAT CCACGCCGCA CACAACGGTA AGAAAGTCAC CGTGGTGGTT
GAGCTACAGG CACGTTTTGA CGAAGAAGCC AACATTCACT GGGCGAAGCG CCTGACCGAA
GCAGGCGTGC ACGTTATCTT CTCTGCGCCG GGGCTGAAAA TTCACGCCAA ACTGTTTCTG
ATTTCACGTA AAGAAAACGG TGAAGTGGTG CGTTATGCAC ACATCGGGAC CGGGAACTTT
AACGAAAAAA CCGCGCGTCT TTATACTGAC TATTCGTTGC TGACCGCCGA TGCGCGCATC
ACCAACGAAG TACGGCGGGT ATTTAACTTT ATTGAAAACC CATACCGTCC GGTGACATTT
GATTATTTAA TGGTGTCGCC GCAAAACTCC CGCCGCCTGT TGTATGAAAT GGTGGACCGC
GAGATCGCCA ACGCGCAGCA AGGGCTGCCC AGTGGTATCA CCCTGAAGCT AAATAACCTT
GTCGATAAAG GCCTGGTTGA TCGTCTGTAT GCGGCCTCCA GCTCCGGCGT ACCGGTTAAT
CTGCTGGTTC GCGGAATGTG TTCGCTGATC CCCAATCTGG AAGGCATTAG CGACAACATT
CGTGCCATCA GTATTGTTGA CCGTTACCTT GAACATGACC GGGTTTATAT TTTTGAAAAT
GGCGGCGATA AAAAGGTCTA CCTTTCTTCC GCCGACTGGA TGACGCGCAA TATTGATTAT
CGTATTGAAG TGGCGACGCC GCTGCTCGAT CCGCGCCTGA AGCAGCGGGT GCTGGACATC
ATCGACATAT TGTTCAGCGA TACGGTCAAA GCACGTTATA TCGATAAAGA ACTCAGTAAT
CGCTACGTTC CCCGCGGCAA TCGCCGCAAA GTACGGGCGC AGTTGGCGAT TTATGACTAC
ATCAAATCAC TCGAACAACC TGAATAA
 
Protein sequence
MGQEKLYIEK ELSWLSFNER VLQEAADKSN PLIERMRFLG IYSNNLDEFY KVRFAELKRR 
IIISEEQGSN SHSRHLLGKI QSRVLKADQE FDGLYNELLL EMARNQIFLI NERQLSVNQQ
NWLRHYFKQY LRQHITPILI NPDTDLVQFL KDDYTYLAVE IIRGDTIRYA LLEIPSDKVP
RFVNLPPEAP RRRKPMILLD NILRYCLDDI FKGFFDYDAL NAYSMKMTRD AEYDLVHEME
ASLMELMSSS LKQRLTAEPV RFVYQRDMPN ALVEVLREKL TISRYDSIVP GGRYHNFKDF
INFPNVGKAN LVNKPLPRLR HIWFDKAQFR NGFDAIRERD VLLYYPYHTF EHVLELLRQA
SFDPSVLAIK INIYRVAKDS RIIDSMIHAA HNGKKVTVVV ELQARFDEEA NIHWAKRLTE
AGVHVIFSAP GLKIHAKLFL ISRKENGEVV RYAHIGTGNF NEKTARLYTD YSLLTADARI
TNEVRRVFNF IENPYRPVTF DYLMVSPQNS RRLLYEMVDR EIANAQQGLP SGITLKLNNL
VDKGLVDRLY AASSSGVPVN LLVRGMCSLI PNLEGISDNI RAISIVDRYL EHDRVYIFEN
GGDKKVYLSS ADWMTRNIDY RIEVATPLLD PRLKQRVLDI IDILFSDTVK ARYIDKELSN
RYVPRGNRRK VRAQLAIYDY IKSLEQPE