Gene RPB_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4039 
Symbol 
ID3911846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4609039 
End bp4610691 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content67% 
IMG OID637885943 
Productphosphoglucomutase 
Protein accessionYP_487643 
Protein GI86751147 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0033] Phosphoglucomutase 
TIGRFAM ID[TIGR01132] phosphoglucomutase, alpha-D-glucose phosphate-specific 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.425514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCGA AGGTAAGCCC GCTGGCGGGC AAGACCGTCG ACCCCAACAA CCTCGTCAAC 
GTGCCGCGCC TGGTGACGGC GTATTTCGCC GGCAAGCCTG ATCCGAAAGT CGCATCCGAG
CGCGTCGCGT TCGGCACCTC GGGGCATCGC GGCTCGTCGC TCAACAACGC CTTCAACGAG
GAGCACATTC TGGCTGTGAG CCAGGCGGTT TGCGACCATC GTGCCGGCGC CGGCATCACC
GGGCCCTTGT TCATCGGCAT CGACACCCAT GCGCTGGCCG AGCCGGCGCT GGTCAGTGCG
CTGGAAGTGT TCGCCGCCAA CGGCGTGGAC GTGGTGATCG ACCAGCACGG CGGCTACACG
CCGACGCCGG TGATCTCCCA CGCGATCCTG ACGCATAATC GCGGCCGCGA CAGCGGCCTC
GCCGACGGCG TGGTGGTGAC GCCGTCCCAC AATCCGCCGG AAGACGGCGG CTTCAAGTAC
AATCCGCCGA ACGGCGGCCC GGCCGATACC GACGTGACGT CCGTGATCGA GAAGGCCGCC
AATGCGCTGC TCGAAGGCGG CCTGAAGGGC GTCAAGCGCA TCCCGTACGA CCGCGCCCGC
AAGGCCGACA ACGTGCACCG GCGCGACTAC GTCACGCCCT ATGTCGAGGA TCTCGCCAAC
GTCGTCGACA TGGAGGCGAT CCGCAGCTCC GGCGTCAAGC TCGGCATCGA TCCGCTCGGC
GGTGCGGCGG TGCATTACTG GCATCCGATC ATCGAGCGCT ACAAGATCGA CGCGAAAGTC
GTCAGCGACG CGGTCGATCC GACTTTCCGT TTCATGACGC TGGATTGGGA CGGCAAGGTG
CGGATGGACT GCTCGTCGCC TTATGCGATG GCGCGGCTGA TCGGGATGCG CGACGATTTC
GACGTCGCCT TCGCCAACGA CACCGACGCC GACCGCCACG GCATCGTCAC CCGCTCCAGC
GGACTGATGA ACCCCAATCA CTATCTCGCG GTGGCGATCT CCTATCTGTT CGCCAACCGG
CCAGAATGGG GCGCGGGCGC CGCGATAGGC AAGACCGCGG TGTCGAGCGC GATGATCGAT
CGCGTCGCCG CCAAGATCGG CCGCAAGGTT GTGGAGACCC CTGTCGGCTT CAAATGGTTC
GTCGACGGGC TGATCGGCGG CGGCTTCGGC TTCGCCGGCG AGGAAAGCGC CGGCGCCTCG
TTCCTGCGCC GCGACGGCAG CGTCTGGACC ACCGACAAGG ACGGCGTCAT TCTCGGCCTG
CTCGCGGCGG AGATCACCGC CAGAAGCAAG GCCGATCCCG GCGAGATCTA TCAGCGCTTG
ACATCCGAAC TCGGCGCGCC GTTCTACGCG CGCATCGACG CGCCGGCCTC CGCCGCGCAG
AAGGCGCTGT TCAAGACGCT GACCGCCGAC AAGCTCGGCA TCCGGGAACT CGCCGGCGAG
CCGGTCACCG CGACGCTGAC CAACGCGCCG GGCAACAACC AGCCGATCGG CGGCGTCAAG
GTGACGACCG CCAACGGCTG GTTCGCGGCG CGGCCATCGG GCACCGAGGA CGTCTACAAG
ATCTACGCCG AGAGCTTCGT CAGCGCCGAG CATCTGACGC GCATCCAGCA CGAGGCGCAG
GCGGCGCTGA GCGCGATGTT CGCGGCAGGT TGA
 
Protein sequence
MAAKVSPLAG KTVDPNNLVN VPRLVTAYFA GKPDPKVASE RVAFGTSGHR GSSLNNAFNE 
EHILAVSQAV CDHRAGAGIT GPLFIGIDTH ALAEPALVSA LEVFAANGVD VVIDQHGGYT
PTPVISHAIL THNRGRDSGL ADGVVVTPSH NPPEDGGFKY NPPNGGPADT DVTSVIEKAA
NALLEGGLKG VKRIPYDRAR KADNVHRRDY VTPYVEDLAN VVDMEAIRSS GVKLGIDPLG
GAAVHYWHPI IERYKIDAKV VSDAVDPTFR FMTLDWDGKV RMDCSSPYAM ARLIGMRDDF
DVAFANDTDA DRHGIVTRSS GLMNPNHYLA VAISYLFANR PEWGAGAAIG KTAVSSAMID
RVAAKIGRKV VETPVGFKWF VDGLIGGGFG FAGEESAGAS FLRRDGSVWT TDKDGVILGL
LAAEITARSK ADPGEIYQRL TSELGAPFYA RIDAPASAAQ KALFKTLTAD KLGIRELAGE
PVTATLTNAP GNNQPIGGVK VTTANGWFAA RPSGTEDVYK IYAESFVSAE HLTRIQHEAQ
AALSAMFAAG