Gene EcSMS35_0710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0710 
Symbolpgm 
ID6143436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp712803 
End bp714443 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content55% 
IMG OID641615600 
Productphosphoglucomutase 
Protein accessionYP_001742799 
Protein GI170682528 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0033] Phosphoglucomutase 
TIGRFAM ID[TIGR01132] phosphoglucomutase, alpha-D-glucose phosphate-specific 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000485556 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.414307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCC ACAATCGTGC AGGCCAACCT GCACAACAGA GTGATTTGAT TAACGTCGCC 
CAACTGACGG CGCAATATTA TGTACTGAAA CCAGAAGCAG GGAATGCGGA GCACGCGGTG
AAATTCGGTA CTTCCGGTCA CCGTGGCAGT GCAGCGCGCC ACAGCTTTAA CGAGCCGCAC
ATTCTGGCGA TCGCTCAGGC AATTGCTGAA GAACGTGCGA AAAACGGCAT CACTGGCCCT
TGCTATGTGG GTAAAGATAC TCACGCCCTG TCCGAGCCTG CGTTTATTTC CGTTCTGGAA
GTGCTGGCAG CGAACGGCGT TGATGTTATT GTGCAGGAAA ACAATGGCTT CACTCCAACG
CCTGCCGTTT CCAATGCCAT CCTGGTTCAC AATAAAAAAG GTGGCCCGCT GGCAGACGGT
ATCGTGATTA CACCGTCCCA TAACCCGCCG GAAGATGGTG GTATCAAGTA CAACCCGCCA
AATGGTGGCC CGGCTGATAC CAACGTCACC AAAGTGGTGG AAGACAGGGC CAACGCACTG
CTGGCCGATG GCCTGAAAGG CGTGAAGCGT ATCTCCCTCG ACGAAGCGAT GGCATCCGGT
CATGTGAAAG AGCAGGATCT GGTGCAGCCG TTTGTGGAAG GTCTGGCCGA TATCGTTGAT
ATGGCGGCGA TTCAGAAAGC GGGCCTGACG CTGGGCGTTG ATCCGTTGGG CGGTTCCGGT
ATCGAATACT GGAAGCGCAT TGGCGAGTAT TACAACCTCA ACCTGACCAT CGTTAACGAT
CAGGTCGATC AAACCTTCCG CTTTATGCAC CTCGATAAAG ACGGTGCGAT CCGTATGGAC
TGCTCCTCCG AGTGTGCGAT GGCGGGTCTG CTGGCACTGC GTGATAAGTT CGATCTGGCG
TTTGCTAACG ACCCGGATTA TGACCGTCAC GGTATCGTCA CTCCGGCAGG TTTGATGAAT
CCGAACCACT ACCTGGCGGT GGCGATCAAC TACCTGTTCC AGCACCGTCC GCAGTGGGGC
AAAGATGTTG CTGTCGGTAA AACGCTGGTT TCATCTGCGA TGATCGACCG TGTGGTCAAT
GACTTGGGCC GTAAGCTGGT AGAAGTCCCG GTAGGTTTCA AATGGTTTGT TGATGGTCTG
TTCGATGGCA GCTTCGGCTT TGGCGGCGAA GAGAGCGCAG GGGCTTCCTT CCTGCGTTTC
GACGGCACGC CGTGGTCGAC TGATAAAGAC GGCATCATCA TGTGTCTGCT GGCGGCGGAA
ATCACCGCTG TCACTGGTAA GAACCCGCAG GAACACTACA ACGAACTGGC AGAACGCTTT
GGTGCGCCGA GCTATAACCG TTTGCAGGCA GCTGCGACTT CCGCACAAAA AGCGGCGCTG
TCTAAGCTGT CTCCGGAAAT GGTGAGCTCC AGCACCCTGG CAGGTGACCC GATCACCGCA
CGCCTGACGG CGGCTCCGGG TAATGGTGCT TCTATTGGCG GTCTGAAAGT GATGACTGAC
AACGGCTGGT TCGCCGCGCG TCCGTCAGGC ACAGAAGACG CATACAAAAT CTACTGCGAA
AGCTTCCTCG GGGAAGAACA TCGCAAGCAG ATTGAGAAAG AAGCGGTTGA GATTGTTAGC
GAAGTTCTGA AAAACGCGTA A
 
Protein sequence
MAIHNRAGQP AQQSDLINVA QLTAQYYVLK PEAGNAEHAV KFGTSGHRGS AARHSFNEPH 
ILAIAQAIAE ERAKNGITGP CYVGKDTHAL SEPAFISVLE VLAANGVDVI VQENNGFTPT
PAVSNAILVH NKKGGPLADG IVITPSHNPP EDGGIKYNPP NGGPADTNVT KVVEDRANAL
LADGLKGVKR ISLDEAMASG HVKEQDLVQP FVEGLADIVD MAAIQKAGLT LGVDPLGGSG
IEYWKRIGEY YNLNLTIVND QVDQTFRFMH LDKDGAIRMD CSSECAMAGL LALRDKFDLA
FANDPDYDRH GIVTPAGLMN PNHYLAVAIN YLFQHRPQWG KDVAVGKTLV SSAMIDRVVN
DLGRKLVEVP VGFKWFVDGL FDGSFGFGGE ESAGASFLRF DGTPWSTDKD GIIMCLLAAE
ITAVTGKNPQ EHYNELAERF GAPSYNRLQA AATSAQKAAL SKLSPEMVSS STLAGDPITA
RLTAAPGNGA SIGGLKVMTD NGWFAARPSG TEDAYKIYCE SFLGEEHRKQ IEKEAVEIVS
EVLKNA