Gene EcHS_A2636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2636 
Symbolppk 
ID5591029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2644979 
End bp2647045 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content49% 
IMG OID640921753 
Productpolyphosphate kinase 
Protein accessionYP_001459280 
Protein GI157161962 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0855] Polyphosphate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.513516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCAGG AAAAGCTATA CATCGAAAAA GAGCTCAGTT GGTTATCGTT CAATGAACGC 
GTGCTTCAGG AAGCGGCGGA CAAATCTAAC CCGCTGATTG AAAGGATGCG TTTCCTGGGG
ATCTATTCCA ATAACCTTGA TGAGTTCTAT AAAGTCCGCT TCGCTGAACT GAAGCGACGC
ATCATTATTA GCGAAGAACA AGGCTCCAAC TCTCATTCCC GCCATTTACT GGGCAAAATT
CAGTCCCGGG TGCTGAAAGC CGATCAGGAA TTCGACGGCC TCTACAACGA GCTGCTGCTG
GAGATGGCGC GTAACCAGAT CTTCCTGATA AATGAACGCC AGCTCTCCGT CAATCAACAA
AACTGGCTTC GTCATTATTT TAAGCAGTAT CTGCGTCAGC ACATTACGCC GATTTTAATC
AATCCTGACA CTGACTTAGT GCAGTTCCTG AAGGATGATT ACACCTATCT GGCGGTGGAA
ATTATCCGTG GTGATACCAT CCGTTACGCG CTGCTGGAGA TCCCATCAGA TAAAGTGCCG
CGCTTTGTGA ATTTACCGCC AGAAGCGCCG CGTCGACGCA AACCGATGAT TCTTCTGGAT
AACATTCTGC GTTACTGCCT TGATGATATT TTCAAAGGCT TCTTTGATTA TGACGCGCTG
AATGCCTATT CAATGAAGAT GACCCGCGAT GCCGAATACG ATTTAGTGCA TGAGATGGAA
GCCAGCCTGA TGGAGTTGAT GTCTTCCAGT CTCAAGCAGC GTTTAACTGC TGAGCCGGTG
CGTTTTGTTT ATCAGCGCGA TATGCCCAAT GCGCTGGTTG AAGTGTTACG CGAAAAACTG
ACTATTTCCC GCTACGACTC CATCGTCCCC GGCGGTCGTT ATCATAATTT TAAAGACTTT
ATTAATTTCC CCAATGTCGG CAAAGCCAAT CTGGTGAACA AACCACTGCC GCGTTTACGC
CACATTTGGT TTGATAAAGC CCAGTTCCGC AATGGTTTTG ATGCCATTCG CGAACGCGAT
GTGTTGCTCT ATTATCCTTA TCACACCTTT GAGCATGTGC TGGAACTGCT GCGTCAGGCT
TCGTTCGACC CGAGCGTACT GGCGATTAAA ATTAACATTT ACCGCGTGGC GAAAGATTCA
CGCATCATCG ACTCGATGAT CCACGCCGCA CATAACGGTA AGAAAGTCAC TGTGGTGGTT
GAGTTACAGG CGCGTTTCGA CGAAGAAGCC AACATTCACT GGGCGAAGCG CCTGACCGAA
GCAGGCGTGC ACGTTATCTT CTCTGCGCCG GGGCTGAAAA TTCACGCCAA ACTGTTCCTG
ATTTCACGTA AAGAAAACGG TGAAGTGGTC CGTTACGCAC ACATCGGGAC CGGGAACTTT
AACGAAAAAA CCGCGCGTCT TTATACTGAC TATTCGTTGC TGACCGCAGA TGCGCGTATC
ACCAACGAAG TACGGCGGGT ATTTAACTTT ATTGAAAACC CATACCGCCC AGTGACATTT
GATTATTTAA TGGTGTCACC GCAAAACTCT CGCCGTCTGT TATATGAAAT GGTAGACCGC
GAAATCGCCA ACGCGCAGCA AGGGCTGCCC AGTGGTATCA CCCTGAAGCT AAATAACCTT
GTCGATAAAG GCCTGGTTGA TCGTCTGTAT GCGGCCTCCA GCTCCGGCGT ACCGGTTAAT
CTGCTGGTTC GCGGAATGTG TTCGCTGATC CCCAATCTGG AAGGCATTAG CGACAACATT
CGTGCCATCA GTATTGTTGA CCGTTACCTT GAACATGACC GGGTTTATAT TTTTGAAAAT
GGCGGCGATA AAAAGGTCTA CCTTTCTTCC GCCGACTGGA TGACGCGCAA TATTGATTAT
CGTATTGAAG TGGCGACACC GCTGCTCGAT CCGCGCCTGA AGCAGCGGGT GCTGGACATC
ATCGACATAT TGTTCAGCGA TACGGTCAAA GCACGTTATA TCGATAAAGA ACTCAGTAAT
CGCTACGTTC CCCGCGGCAA TCGCCGCAAA GTACGGGCGC AGTTGGCGAT TTACGACTAC
ATCAAATCAC TCGAACAACC TGAATAA
 
Protein sequence
MGQEKLYIEK ELSWLSFNER VLQEAADKSN PLIERMRFLG IYSNNLDEFY KVRFAELKRR 
IIISEEQGSN SHSRHLLGKI QSRVLKADQE FDGLYNELLL EMARNQIFLI NERQLSVNQQ
NWLRHYFKQY LRQHITPILI NPDTDLVQFL KDDYTYLAVE IIRGDTIRYA LLEIPSDKVP
RFVNLPPEAP RRRKPMILLD NILRYCLDDI FKGFFDYDAL NAYSMKMTRD AEYDLVHEME
ASLMELMSSS LKQRLTAEPV RFVYQRDMPN ALVEVLREKL TISRYDSIVP GGRYHNFKDF
INFPNVGKAN LVNKPLPRLR HIWFDKAQFR NGFDAIRERD VLLYYPYHTF EHVLELLRQA
SFDPSVLAIK INIYRVAKDS RIIDSMIHAA HNGKKVTVVV ELQARFDEEA NIHWAKRLTE
AGVHVIFSAP GLKIHAKLFL ISRKENGEVV RYAHIGTGNF NEKTARLYTD YSLLTADARI
TNEVRRVFNF IENPYRPVTF DYLMVSPQNS RRLLYEMVDR EIANAQQGLP SGITLKLNNL
VDKGLVDRLY AASSSGVPVN LLVRGMCSLI PNLEGISDNI RAISIVDRYL EHDRVYIFEN
GGDKKVYLSS ADWMTRNIDY RIEVATPLLD PRLKQRVLDI IDILFSDTVK ARYIDKELSN
RYVPRGNRRK VRAQLAIYDY IKSLEQPE