Gene RPB_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0844 
Symbol 
ID3909102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp959970 
End bp962480 
Gene Length2511 bp 
Protein Length836 aa 
Translation table11 
GC content66% 
IMG OID637882737 
Productglycogen/starch/alpha-glucan phosphorylase 
Protein accessionYP_484466 
Protein GI86747970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02093] glycogen/starch/alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCGCGC AGCCCGTGCC AGAGAAATTT CAGCAGCCGC CGATCGACGA ACTGGCGCTG 
TCCGAGATCA AGAGCGCCAT TCTGGCCAAG CTGACGCTTG CGATCGGCAA AGAGGCGACG
CAGGCGACCA AGCACGATTG GTACAAGGCC GCGGCGCTGG CGCTGCGCGA CCGCATCGTG
CACCGCTGGC TGGTGTCCGA GAAGGAGAGC TACGACGCCG GCCGCAAGCG GGTGTATTAC
CTCTCGCTGG AATTCCTGAT CGGCCGGCTG TTCACCGACG CGCTGAACAA TATGGGCCTG
CTGGCGCAAT ACGGCGCCGC GCTGGGCGAC CTCGGCGTCG GCCTCAACGA TTTGCGCAAA
TGCGAACCGG ACGCGGCGCT CGGCAATGGC GGCCTCGGCC GGCTCGCCGC CTGCTTCATG
GAAAGCATGG CGACGCTGGA GATCCCGGCG ATCGGCTACG GCATCCGCTA CGATTACGGC
CTGTTCCGGC AGATCATCAA TCACGGCTGG CAGCAGGAAT TCCCGGACGA GTGGCTGTCG
TTCGGCAACC CGTGGGAGCT GCAGCGGCCC GAGGTGGTGT ATCAGGTGAA GTTCGGCGGT
AGCGTCGAGC AGGTCACCGA CCCCAAGGGC GTGACGCGCG CGGTCTGGAC GCCGATCGAA
ACCGTGCAGG CGATGGCCTA CGACACGCCG ATCGTCGGCT GGCGCGGCGA GCACGTCAAC
GCGCTGCGGC TGTGGTCGGC GCGGGCGCCC GATCCGATGC TGATCGACGT CTTCAACACC
GGCGACTATC TCGGCGCCAC CGCCCACGAG GCGCGCGCCG AGGCGATCTG CAAATTCCTC
TATCCCAACG ACGAGAGCCC CGCCGGGCGC GAATTGCGGC TGCGGCAGGA ATATTTCTTC
GTCTCCGCCT CGCTGCAGGA CCTGATCAAG CGGCATCTGG ATTCGGACGG CCAGATCCGC
AACCTCGCCA AGAAGGCGGC GATCCAGCTC AACGACACCC ATCCCAGCCT CGCCGTCACC
GAGCTGATGC GGCTGTTGAT CGACGTCCAT CACCTGCGCT GGGACGACGC CTGGCAGATC
ACCACCGCGA CGCTGAGCTA CACCAATCAC ACGCTGCTGC CCGAGGCGCT GGAGACCTGG
CCGCTTGATC TGTTCGAGCG CACGCTGCCG CGGCATCTGC AGATCATCTA CCGCATCAAC
GAGGCGCATC TGGCGCTCGC CGAGCAGCGC TGCCCCGGCG ATATCGAGTT CCGCGCGTCG
GTGTCGCTGA TCGACGAGCG GGCCGGCCGC CGGGTCCGGA TGGGGCACCT CGCCTTCATC
GGCTCGCACC GCATCAACGG CGTCTCGGCG ATGCATTCCG ACCTGATGAA GGAGACCGTG
TTCCACGATC TCAATCATCT CTATCCGGAC CGCATCACCA ACAAGACCAA CGGCATCACC
TTCCGGCGCT GGCTGACGCT GGCCAATCCG GGGCTGACCG ATCTGGTGCG CTCGGCCTGC
GGCGACGAGG TGCTGGACGA TCCGACGAGG CTCGACCGCC TCGAAGCCTT CGCCGGCGAC
AGCGCGTTCC AGCAGCAGTT CCGAACCGTC AAGCACCGCA ACAAGATCGC GCTGGCGCGG
CTGATCAGCG AGCGCAACGG CATCCGGGTC GATCCGGGCG CGCTGTTCGA CGTCCAGATC
AAGCGCATCC ACGAATACAA GCGGCAGCTT CTCAACGTGC TGGAGACCGT CGCGCTGTAT
CACGCGATCA AGGACGAGCC GAACCGCGAC TGGGTGCCGC GCGTCAAGAT CTTCGCCGGC
AAGGCCGCGG CGAGCTATCG CTACGCCAAG CTGATCATCA AGCTGATCAA CGACGTCGCC
GAGGTCGTCA ACAACGACGC CTCGATCGGC GGCAAGCTGA AGGTGGTGTT CCTCGCCGAC
TACAATGTCA GCCTGGCCGA AGTGATCATT CCCGCCGCCG ATCTGTCCGA GCAGATTTCG
ACCGCCGGCA TGGAAGCGTC CGGCACCGGC AACATGAAAC TGGCGCTCAA CGGCGCGCTG
ACGATCGGCA CGCTCGACGG CGCCAATATC GAGATCCGCG ATCACGTCGG CGCCGAGAAT
ATCGCGATCT TCGGCATGGA AGCGCTCGAA GTGGTGGCGC GGCGCGCCCA GGGCCTCGAC
GCCAACGACG TCATCACCCG CTCGCAGCCG CTCGCCCGCG CCATCCGGGC GATCGACGCA
GGCGCGTTCT CGCCCGACGA TCCGGCGCGG TTCGCCTCGG TGGCGCACGC GCTGCGGCAC
CTCGACCACT ACATGGTCAG CGCCGATTTC GACAGCTACT ACGAGGCCCA GCGCGGCATC
GACGCACGCT GGGCCGCCGG CCCCGCCTGG ACCCGGGCCG GCATCCTCAA CGTCGCGCGG
ATGGCCTGGT TCTCGTCGGA CCGCACCATC CGCGAATACG CCGAGGACAT CTGGGACGTT
CCGACACGGC ACGCGACGCA ACCCCCGCCG GCACGGCTGG CGAAGGGGTA G
 
Protein sequence
MPAQPVPEKF QQPPIDELAL SEIKSAILAK LTLAIGKEAT QATKHDWYKA AALALRDRIV 
HRWLVSEKES YDAGRKRVYY LSLEFLIGRL FTDALNNMGL LAQYGAALGD LGVGLNDLRK
CEPDAALGNG GLGRLAACFM ESMATLEIPA IGYGIRYDYG LFRQIINHGW QQEFPDEWLS
FGNPWELQRP EVVYQVKFGG SVEQVTDPKG VTRAVWTPIE TVQAMAYDTP IVGWRGEHVN
ALRLWSARAP DPMLIDVFNT GDYLGATAHE ARAEAICKFL YPNDESPAGR ELRLRQEYFF
VSASLQDLIK RHLDSDGQIR NLAKKAAIQL NDTHPSLAVT ELMRLLIDVH HLRWDDAWQI
TTATLSYTNH TLLPEALETW PLDLFERTLP RHLQIIYRIN EAHLALAEQR CPGDIEFRAS
VSLIDERAGR RVRMGHLAFI GSHRINGVSA MHSDLMKETV FHDLNHLYPD RITNKTNGIT
FRRWLTLANP GLTDLVRSAC GDEVLDDPTR LDRLEAFAGD SAFQQQFRTV KHRNKIALAR
LISERNGIRV DPGALFDVQI KRIHEYKRQL LNVLETVALY HAIKDEPNRD WVPRVKIFAG
KAAASYRYAK LIIKLINDVA EVVNNDASIG GKLKVVFLAD YNVSLAEVII PAADLSEQIS
TAGMEASGTG NMKLALNGAL TIGTLDGANI EIRDHVGAEN IAIFGMEALE VVARRAQGLD
ANDVITRSQP LARAIRAIDA GAFSPDDPAR FASVAHALRH LDHYMVSADF DSYYEAQRGI
DARWAAGPAW TRAGILNVAR MAWFSSDRTI REYAEDIWDV PTRHATQPPP ARLAKG