Gene EcolC_2968 details


Gene Information

Locus tag: EcolC_2968
Symbol:
ID: 6065727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3241107
End bp: 3242747
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 55%
IMG OID: 641602378
Product: phosphoglucomutase
Protein accession: YP_001725920
Protein GI: 170020966
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0033] Phosphoglucomutase
TIGRFAM ID: [TIGR01132] phosphoglucomutase, alpha-D-glucose phosphate-specific
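
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3241107
end_bp = 3242747
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which adds one codon
# beyond the translated protein.
codons, remainder = divmod(gene_length_bp, 3)

print(span_bp == gene_length_bp)                            # True
print(remainder == 0 and codons - 1 == protein_length_aa)   # True
```

Under these assumptions the record is internally consistent: 1641 bp corresponds to 547 codons, of which 546 encode amino acids and one is the stop codon.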


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00229066
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATCC ACAATCGTGC AGGCCAACCT GCACAACAGA GTGATTTGAT TAACGTCGCC 
CAACTGACGG CGCAATATTA TGTACTGAAA CCAGAAGCAG GGAATGCGGA GCACGCGGTG
AAATTCGGTA CTTCCGGTCA CCGTGGCAGT GCAGCGCGCC ACAGCTTTAA CGAGCCGCAC
ATTCTGGCGA TCGCTCAGGC AATTGCTGAA GAACGTGCGA AAAACGGCAT CACTGGCCCT
TGCTATGTGG GTAAAGATAC TCACGCCCTG TCCGAGCCTG CGTTCATTTC CGTACTGGAA
GTGCTGGCAG CGAACGGCGT TGATGTCATT GTGCAGGAAA ACAATGGCTT CACTCCAACG
CCTGCCGTTT CCAATGCCAT CCTGGTTCAC AATAAAAAAG GTGGCCCGCT GGCAGACGGT
ATCGTGATTA CACCGTCCCA TAACCCGCCG GAAGATGGTG GTATCAAGTA CAATCCGCCA
AATGGTGGCC CGGCTGATAC CAACGTCACC AAAGTGGTGG AAGACAGGGC CAACGCACTG
CTGGCCGATG GCCTGAAAGG CGTGAAGCGT ATCTCCCTCG ACGAAGCGAT GGCATCCGGT
CATGTGAAAG AGCAGGATCT GGTGCAGCCG TTCGTGGAAG GGCTGGCCGA TATCGTTGAT
ATGGCGGCGA TTCAGAAAGC GGGCCTGACG CTTGGCGTTG ATCCGCTGGG CGGTTCCGGT
ATCGAATACT GGAAGCGTAT TGGCGAGTAT TACAACCTCA ACCTGACTAT CGTTAACGAT
CAGGTCGATC AAACCTTCCG CTTTATGCAC CTTGATAAAG ACGGCGCGAT CCGTATGGAC
TGCTCCTCCG AGTGTGCGAT GGCGGGCCTG CTGGCACTGC GTGATAAGTT CGATCTGGCG
TTTGCTAACG ACCCGGATTA TGACCGTCAC GGTATCGTCA CTCCGGCAGG TTTGATGAAT
CCGAACCACT ACCTGGCGGT GGCGATCAAT TACCTGTTCC AGCATCGTCC GCAGTGGGGC
AAAGATGTTG CCGTTGGTAA AACGCTGGTT TCTTCTGCGA TGATCGACCG TGTGGTCAAT
GACTTGGGTC GTAAGCTGGT AGAAGTCCCG GTAGGTTTCA AATGGTTTGT TGATGGTCTG
TTCGACGGCA GCTTCGGCTT TGGCGGCGAA GAGAGCGCAG GGGCTTCCTT CCTGCGTTTC
GACGGCACGC CGTGGTCCAC CGACAAAGAC GGCATCATCA TGTGTCTGCT GGCGGCGGAA
ATCACCGCTG TCACCGGTAA GAACCCGCAG GAACACTACA ACGAACTGGC AAAACGCTTT
GGTGCGCCGA GCTACAACCG TTTGCAGGCA GCTGCGACTT CCGCACAAAA AGCGGCGCTG
TCTAAGCTGT CTCCGGAAAT GGTGAGCGCC AGCACCCTGG CAGGTGACCC GATCACCGCG
CGCCTGACTG CTGCTCCGGG CAACGGTGCT TCTATTGGCG GTCTGAAAGT GATGACTGAC
AACGGCTGGT TCGCCGCGCG TCCGTCAGGC ACGGAAGACG CATATAAGAT CTACTGCGAA
AGCTTCCTCG GTGAAGAACA TCGCAAGCAG ATCGAGAAAG AAGCGGTTGA GATTGTTAGC
GAAGTTCTGA AAAACGCGTA A
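
The gene sequence above is laid out in space-separated blocks of ten bases. As a minimal sketch of how a GC content figure like the one in the record could be computed from that layout, the helpers below strip the whitespace and count G and C bases. The function names `clean_sequence` and `gc_percent` are hypothetical, and only the first block of the sequence is used in the example.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First ten-base block of the gene sequence above.
first_block = "ATGGCAATCC"
print(gc_percent(first_block))  # 50.0
```

Passing the full sequence text, blocks and line breaks included, to `gc_percent` would give the whole-gene value.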
 
Protein sequence
MAIHNRAGQP AQQSDLINVA QLTAQYYVLK PEAGNAEHAV KFGTSGHRGS AARHSFNEPH 
ILAIAQAIAE ERAKNGITGP CYVGKDTHAL SEPAFISVLE VLAANGVDVI VQENNGFTPT
PAVSNAILVH NKKGGPLADG IVITPSHNPP EDGGIKYNPP NGGPADTNVT KVVEDRANAL
LADGLKGVKR ISLDEAMASG HVKEQDLVQP FVEGLADIVD MAAIQKAGLT LGVDPLGGSG
IEYWKRIGEY YNLNLTIVND QVDQTFRFMH LDKDGAIRMD CSSECAMAGL LALRDKFDLA
FANDPDYDRH GIVTPAGLMN PNHYLAVAIN YLFQHRPQWG KDVAVGKTLV SSAMIDRVVN
DLGRKLVEVP VGFKWFVDGL FDGSFGFGGE ESAGASFLRF DGTPWSTDKD GIIMCLLAAE
ITAVTGKNPQ EHYNELAKRF GAPSYNRLQA AATSAQKAAL SKLSPEMVSA STLAGDPITA
RLTAAPGNGA SIGGLKVMTD NGWFAARPSG TEDAYKIYCE SFLGEEHRKQ IEKEAVEIVS
EVLKNA
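
The protein sequence follows from the gene sequence by codon translation. The sketch below builds a codon table from the usual compact 64-letter string; translation table 11 assigns the same amino acids to codons as the standard code, so the same string applies. The names `CODON_TABLE` and `translate` are hypothetical, and the example translates only the first three blocks and the final codons of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last 21 bases of the gene sequence above.
print(translate("ATGGCAATCC ACAATCGTGC AGGCCAACCT"))  # MAIHNRAGQP
print(translate("GAAGTTCTGAAAAACGCGTAA"))             # EVLKNA
```

Both outputs match the start and end of the protein sequence listed above. The sketch does not handle ambiguous bases or alternative start codons, which table 11 permits.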