Gene EcolC_1175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1175 
Symbol 
ID6067151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1287639 
End bp1289705 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content49% 
IMG OID641600591 
Productpolyphosphate kinase 
Protein accessionYP_001724169 
Protein GI170019215 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0855] Polyphosphate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.739142 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0387468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCAGG AAAAGCTATA CATCGAAAAA GAGCTCAGTT GGTTATCGTT CAATGAACGC 
GTGCTTCAGG AAGCGGCGGA CAAATCTAAC CCGCTGATTG AAAGGATGCG TTTCCTGGGG
ATCTATTCCA ATAACCTTGA TGAGTTCTAT AAAGTCCGCT TCGCTGAACT GAAGCGACGC
ATCATTATTA GCGAAGAACA AGGCTCCAAC TCTCATTCCC GCCATTTACT GGGCAAAATT
CAGTCCCGGG TGCTGAAAGC CGATCAGGAA TTCGACGGCC TCTACAACGA GCTGCTGCTG
GAGATGGCGC GTAACCAGAT CTTCCTGATA AATGAACGCC AGCTCTCCGT CAATCAACAA
AACTGGCTTC GTCATTATTT TAAGCAGTAT CTGCGTCAGC ACATTACGCC GATTTTAATC
AATCCTGACA CTGACTTAGT GCAGTTCCTG AAGGATGATT ACACCTATCT GGCGGTGGAA
ATTATCCGTG GTGATACCAT CCGTTACGCG CTGCTGGAGA TCCCATCAGA TAAAGTGCCG
CGCTTTGTGA ATTTACCGCC AGAAGCGCCG CGTCGACGCA AACCGATGAT TCTTCTGGAT
AACATTCTGC GTTACTGCCT TGATGATATT TTCAAAGGCT TCTTTGATTA TGACGCGCTG
AATGCCTATT CAATGAAGAT GACCCGCGAT GCCGAATACG ATTTAGTGCA TGAGATGGAA
GCCAGCCTGA TGGAGTTGAT GTCTTCCAGT CTCAAGCAGC GTTTAACTGC TGAGCCGGTG
CGTTTTGTTT ATCAGCGCGA TATGCCCAAT GCGCTGGTTG AAGTGTTACG CGAAAAACTG
ACTATTTCCC GCTACGACTC CATCGTCCCC GGCGGTCGTT ATCATAATTT TAAAGACTTT
ATTAATTTCC CCAATGTCGG CAAAGCCAAT CTGGTGAACA AACCACTGCC GCGTTTACGC
CATATTTGGT TTGATAAAGC CCAGTTCCGC AATGGTTTTG ATGCCATTCG CGAACGCGAT
GTGTTGCTCT ATTATCCTTA TCACACCTTT GAGCATGTGC TGGAACTGCT GCGTCAGGCA
TCGTTCGACC CGAGCGTACT GGCGATTAAA ATTAACATTT ACCGCGTGGC GAAAGATTCA
CGCATCATCG ACTCGATGAT CCACGCCGCA CACAACGGTA AGAAAGTCAC CGTGGTGGTT
GAGTTACAGG CGCGTTTCGA CGAAGAAGCC AACATTCACT GGGCGAAGCG CCTGACCGAA
GCAGGCGTGC ACGTTATCTT CTCTGCGCCG GGGCTGAAAA TTCACGCCAA ACTGTTCCTG
ATTTCACGTA AAGAAAACGG TGAAGTGGTC CGTTACGCAC ACATCGGGAC CGGGAACTTT
AACGAAAAAA CCGCGCGTCT TTATACTGAC TATTCGTTGC TGACCGCAGA TGCGCGCATC
ACTAACGAAG TACGGCGGGT ATTCAACTTT ATTGAAAACC CATACCGCCC GGTGACATTT
GATTATTTAA TGGTGTCACC GCAAAACTCT CGTCGTCTGT TATATGAAAT GGTGGACCGC
GAAATCGCCA ACGCGCAGCA AGGGCTGCCC AGTGGTATCA CCCTGAAGCT AAATAACCTT
GTCGATAAAG GCCTGGTTGA TCGTCTGTAT GCGGCCTCCA GCTCCGGCGT ACCGGTTAAT
CTGCTGGTTC GCGGAATGTG TTCGCTGATC CCCAATCTGG AAGGTATTAG CGACAACATT
CGTGCCATCA GTATTGTTGA CCGTTACCTT GAACATGACC GGGTTTATAT TTTTGAAAAT
GGCGGCGATA AAAAGGTCTA CCTTTCTTCC GCCGACTGGA TGACGCGCAA TATTGATTAT
CGTATTGAAG TGGCGACGCC GCTGCTCGAT CCGCGCCTGA AGCAGCGGGT GCTGGACATC
ATCGACATAT TGTTCAGCGA TACGGTCAAA GCTCGTTATA TCGATAAAGA ACTCAGTAAT
CGCTACGTTC CCCGCGGCAA TCGCCGCAAA GTACGGGCGC AGTTGGCGAT TTATGACTAC
ATCAAATCAC TCGAACAACC TGAATAA
 
Protein sequence
MGQEKLYIEK ELSWLSFNER VLQEAADKSN PLIERMRFLG IYSNNLDEFY KVRFAELKRR 
IIISEEQGSN SHSRHLLGKI QSRVLKADQE FDGLYNELLL EMARNQIFLI NERQLSVNQQ
NWLRHYFKQY LRQHITPILI NPDTDLVQFL KDDYTYLAVE IIRGDTIRYA LLEIPSDKVP
RFVNLPPEAP RRRKPMILLD NILRYCLDDI FKGFFDYDAL NAYSMKMTRD AEYDLVHEME
ASLMELMSSS LKQRLTAEPV RFVYQRDMPN ALVEVLREKL TISRYDSIVP GGRYHNFKDF
INFPNVGKAN LVNKPLPRLR HIWFDKAQFR NGFDAIRERD VLLYYPYHTF EHVLELLRQA
SFDPSVLAIK INIYRVAKDS RIIDSMIHAA HNGKKVTVVV ELQARFDEEA NIHWAKRLTE
AGVHVIFSAP GLKIHAKLFL ISRKENGEVV RYAHIGTGNF NEKTARLYTD YSLLTADARI
TNEVRRVFNF IENPYRPVTF DYLMVSPQNS RRLLYEMVDR EIANAQQGLP SGITLKLNNL
VDKGLVDRLY AASSSGVPVN LLVRGMCSLI PNLEGISDNI RAISIVDRYL EHDRVYIFEN
GGDKKVYLSS ADWMTRNIDY RIEVATPLLD PRLKQRVLDI IDILFSDTVK ARYIDKELSN
RYVPRGNRRK VRAQLAIYDY IKSLEQPE