Gene EcHS_A0735 details


Gene Information

Locus tag: EcHS_A0735
Symbol: pgm
ID: 5595392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 743912
End bp: 745552
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 55%
IMG OID: 640919912
Product: phosphoglucomutase
Protein accession: YP_001457486
Protein GI: 157160168
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0033] Phosphoglucomutase
TIGRFAM ID: [TIGR01132] phosphoglucomutase, alpha-D-glucose phosphate-specific
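
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the reported gene length includes the stop codon. The variable names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information record.
start_bp = 743912
end_bp = 745552
reported_gene_length = 1641     # bp
reported_protein_length = 546   # aa

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length matches record:", gene_length == reported_gene_length)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches record:", protein_length == reported_protein_length)
```

Under these assumptions the three checks agree with the reported values: the span is 1641 bp, which is 547 codons, or 546 residues plus one stop codon.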


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000226072
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATCC ACAATCGTGC AGGCCAACCT GCACAACAGA GTGATTTGAT TAACGTCGCC 
CAACTGACGG CGCAATATTA TGTACTGAAA CCAGAAGCAG GGAATGCGGA GCACGCGGTG
AAATTCGGTA CTTCCGGTCA CCGTGGCAGT GCAGCGCGCC ACAGCTTTAA CGAGCCGCAC
ATTCTGGCGA TCGCTCAGGC AATTGCTGAA GAACGTGCGA AAAACGGCAT CACTGGCCCT
TGCTATGTGG GTAAAGATAC TCACGCCCTG TCCGAGCCTG CGTTCATTTC CGTACTGGAA
GTGCTGGCAG CGAACGGCGT TGATGTCATT GTGCAGGAAA ACAATGGCTT CACTCCAACG
CCTGCCGTTT CCAATGCCAT CCTGGTTCAC AATAAAAAAG GTGGCCCGCT GGCAGACGGT
ATCGTGATTA CACCGTCCCA TAACCCGCCG GAAGATGGTG GTATCAAGTA CAATCCGCCA
AATGGTGGCC CGGCTGATAC CAACGTCACC AAAGTGGTGG AAGACAGGGC CAACGCACTG
CTGGCCGATG GCCTGAAAGG CGTGAAGCGT ATCTCCCTCG ACGAAGCGAT GGCATCCGGT
CATGTGAAAG AGCAGGATCT GGTGCAGCCG TTCGTGGAAG GGCTGGCCGA TATCGTTGAT
ATGGCGGCGA TTCAGAAAGC GGGCCTGACG CTTGGCGTTG ATCCGCTGGG CGGTTCCGGT
ATCGAATACT GGAAGCGTAT TGGCGAGTAT TACAACCTCA ACCTGACTAT CGTTAACGAT
CAGGTCGATC AAACCTTCCG CTTTATGCAC CTTGATAAAG ACGGCGCGAT CCGTATGGAC
TGCTCCTCCG AGTGTGCGAT GGCGGGCCTG CTGGCACTGC GTGATAAGTT CGATCTGGCG
TTTGCTAACG ACCCGGATTA TGACCGTCAC GGTATCGTCA CTCCGGCAGG TTTGATGAAT
CCGAACCACT ACCTGGCGGT GGCGATCAAT TACCTGTTCC AGCATCGTCC GCAGTGGGGC
AAAGATGTTG CCGTTGGTAA AACGCTGGTT TCTTCTGCGA TGATCGACCG TGTGGTCAAT
GACTTGGGTC GTAAGCTGGT AGAAGTCCCG GTAGGTTTCA AATGGTTTGT TGATGGTCTG
TTCGACGGCA GCTTCGGCTT TGGCGGCGAA GAGAGCGCAG GGGCTTCCTT CCTGCGTTTC
GACGGCACGC CGTGGTCCAC CGACAAAGAC GGCATCATCA TGTGTCTGCT GGCGGCGGAA
ATCACCGCTG TCACCGGTAA GAACCCGCAG GAACACTACA ACGAACTGGC AAAACGCTTT
GGTGCGCCGA GCTACAACCG TTTGCAGGCA GCTGCGACTT CCGCACAAAA AGCGGCGCTG
TCTAAGCTGT CTCCGGAAAT GGTGAGCGCC AGCACCCTGG CAGGTGACCC GATCACCGCG
CGCCTGACTG CTGCTCCGGG CAACGGTGCT TCTATTGGCG GTCTGAAAGT GATGACTGAC
AACGGCTGGT TCGCCGCGCG TCCGTCAGGC ACGGAAGACG CATATAAGAT CTACTGCGAA
AGCTTCCTCG GTGAAGAACA TCGCAAGCAG ATCGAGAAAG AAGCGGTTGA GATTGTTAGC
GAAGTTCTGA AAAACGCGTA A
 
Protein sequence
MAIHNRAGQP AQQSDLINVA QLTAQYYVLK PEAGNAEHAV KFGTSGHRGS AARHSFNEPH 
ILAIAQAIAE ERAKNGITGP CYVGKDTHAL SEPAFISVLE VLAANGVDVI VQENNGFTPT
PAVSNAILVH NKKGGPLADG IVITPSHNPP EDGGIKYNPP NGGPADTNVT KVVEDRANAL
LADGLKGVKR ISLDEAMASG HVKEQDLVQP FVEGLADIVD MAAIQKAGLT LGVDPLGGSG
IEYWKRIGEY YNLNLTIVND QVDQTFRFMH LDKDGAIRMD CSSECAMAGL LALRDKFDLA
FANDPDYDRH GIVTPAGLMN PNHYLAVAIN YLFQHRPQWG KDVAVGKTLV SSAMIDRVVN
DLGRKLVEVP VGFKWFVDGL FDGSFGFGGE ESAGASFLRF DGTPWSTDKD GIIMCLLAAE
ITAVTGKNPQ EHYNELAKRF GAPSYNRLQA AATSAQKAAL SKLSPEMVSA STLAGDPITA
RLTAAPGNGA SIGGLKVMTD NGWFAARPSG TEDAYKIYCE SFLGEEHRKQ IEKEAVEIVS
EVLKNA
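
The sequences above are printed in blocks of 10 characters, so they need whitespace removed before use. The following is a minimal sketch of how the gene sequence could be cleaned and translated to compare against the protein sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons and differs mainly in which start codons it permits; that difference does not matter here because the gene starts with ATG.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned, non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# The last line stays in frame because 1620 bases (a multiple of 3) precede it.
first_line = clean_sequence(
    "ATGGCAATCC ACAATCGTGC AGGCCAACCT GCACAACAGA GTGATTTGAT TAACGTCGCC"
)
last_line = clean_sequence("GAAGTTCTGA AAAACGCGTA A")

print(translate(first_line))  # MAIHNRAGQPAQQSDLINVA
print(translate(last_line))   # EVLKNA
```

The two translated fragments match the start and end of the protein sequence above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare with the reported GC content.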