Gene RPB_2659 details

Gene Information

Locus tag: RPB_2659
Symbol:
ID: 3910452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3040048
End bp: 3041577
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 64%
IMG OID: 637884559
Product: sugar transferase
Protein accession: YP_486272
Protein GI: 86749776
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
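
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3040048
END_BP = 3041577
GENE_LENGTH_BP = 1530
PROTEIN_LENGTH_AA = 509


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Both checks pass for the values listed here: 3041577 - 3040048 + 1 = 1530 bp, and 1530 / 3 - 1 = 509 aa.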


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.155014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACG CCGCTGCGCA GGCCGCAGCG GGCAATGACA CGATCCGGCC GACGGTGGAG 
CGGCGCAAGC GGCTGTCGCC GGCGGCGCTC GCTGTGACCA ACGAAAAAGT CCCCGAAGCC
TATTCCCCGA TCGTGATCGC CGGACTGGTG CGGCTCGCCG ATTTCGTGCT GATCGCCGGC
GTGGGCATCG CGCTGTATCT CGGCTACGTC GCGCGCCGCG ACGGCGTGCA TTGGGAGTAC
ATCGCCGCGA TCATCGGAAT GACCGTGACG GCCGTGATCA GCTTCCAGGC TGCCGACATC
TACCAGGTCC AGGTGTTCCG CGGCACGCTG AAGCAGATGA CGCGGATGAT CTCCGCGTGG
TCGTTCGTGT TCTTGCTGTT CATCGGCGCA TCCTTCTTCG CCAAGCTCGG CGGCGACGTC
TCGCGGCTGT GGCTGGCGTC GTTCTACGTC ATCGGGCTCG CGCTATTGAT CGTCGGTCGT
CTGGTTCTGC GCAATCTGGT CCGGCACTGG GCGCGCCAGG GCCGGCTCGA CCGCCGCACC
ATCATCGTCG GCTCCGACGA GAACGGCGAG CGATTGATCA ACGCGCTGAA GGCGCAAGAG
GACGACGATT CCGACATCCG CCTCCTCGGC GTGTTCGACG ACCGCAACGA TTCCCGCGCC
CTCGACACCT GCGCGGGCAG CCCGAAGCTC GGCAAGATCG ACGATATTCT CGAATTTGCC
CGACGCACCC GGGTCGATCT GGTGCTGTTC GCGCTTCCGA TTTCGGCCGA GACCCGCATC
CTCGACATGC TGAAGAAGCT CTGGGTGCTA CCGGTCGACA TCCGGCTGTC GGCGCACACC
AACAAGCTGC GCTTTCGCCC CCGCGCCTAT TCGTATCTCG GCAACGTGCC GACGCTCGAA
GTGTTCGAGG CGCCGATCAC CGATTGGGAT CAGGTGACGA AGCGATTGTT CGATCACGTC
GTCGGCGGGC TGATCCTGCT CGCGGCCGCG CCGGTGATGG CGCTGGTTGC GCTGGCGATC
AAGCTCGACA GCCCGGGCCC GGTGCTGTTT CGTCAGAAAC GGTTCGGTTT CAACAACGAG
CGCATCGACG TCTTCAAGTT CCGCTCGATG TATCATCACC TTGCCGACCC GACCGCGTCG
AAGGTGGTGA CCAGGAACGA TCCGCGCGTC ACCCGCGTCG GGAAATTCAT CCGCCGCACC
AGCCTCGACG AATTGCCGCA GCTGTTCAAC GTGGTGTTCA AGAGCAATCT GTCGCTGGTC
GGCCCGCGGC CGCACGCCGT TCAGGGCAAG CTGCAGCATC GGCTGTTCGA CGAGACCGTC
GACGGTTACT TCGCCCGCCA CCGCGTCAAG CCGGGCATCA CCGGCTGGGC CCAGATCAAC
GGTTGGCGCG GCGAGATCGA CAACGAAGAG AAGATCCAGA AGCGCGTCGA GTTCGACCTG
TACTACATCG AAAACTGGTC GGTGCTGTTC GACCTGTTCA TCCTCCTGAA GACGCCGTGG
GCGCTGCTGA AGGGTGAGAA CGCGTACTGA
 
Protein sequence
MIDAAAQAAA GNDTIRPTVE RRKRLSPAAL AVTNEKVPEA YSPIVIAGLV RLADFVLIAG 
VGIALYLGYV ARRDGVHWEY IAAIIGMTVT AVISFQAADI YQVQVFRGTL KQMTRMISAW
SFVFLLFIGA SFFAKLGGDV SRLWLASFYV IGLALLIVGR LVLRNLVRHW ARQGRLDRRT
IIVGSDENGE RLINALKAQE DDDSDIRLLG VFDDRNDSRA LDTCAGSPKL GKIDDILEFA
RRTRVDLVLF ALPISAETRI LDMLKKLWVL PVDIRLSAHT NKLRFRPRAY SYLGNVPTLE
VFEAPITDWD QVTKRLFDHV VGGLILLAAA PVMALVALAI KLDSPGPVLF RQKRFGFNNE
RIDVFKFRSM YHHLADPTAS KVVTRNDPRV TRVGKFIRRT SLDELPQLFN VVFKSNLSLV
GPRPHAVQGK LQHRLFDETV DGYFARHRVK PGITGWAQIN GWRGEIDNEE KIQKRVEFDL
YYIENWSVLF DLFILLKTPW ALLKGENAY
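
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence and the GC content field, the code below strips the block spacing, computes a GC fraction, and translates codons until the first stop. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The function names `clean`, `gc_fraction` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Compact codon table in the usual T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove block spacing and line breaks; return an upper-case sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence shown above.
first_block = clean("ATGATCGACG CCGCTGCGCA GGCCGCAGCG")
last_block = clean("GCGCTGCTGA AGGGTGAGAA CGCGTACTGA")

print(translate(first_block))  # MIDAAAQAAA
print(translate(last_block))   # ALLKGENAY
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives the value that the GC content field reports as a rounded percentage.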