Gene RPD_2694 details


Gene Information

Locus tag: RPD_2694
Symbol:
ID: 4023192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3009166
End bp: 3010695
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 64%
IMG OID: 637962893
Product: sugar transferase
Protein accession: YP_569824
Protein GI: 91977165
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
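
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive, 1-based coordinates and that the protein length excludes a single terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3009166, 3010695)
print(length)                  # 1530
print(protein_length(length))  # 509
```

Under these assumptions both results match the Gene Length and Protein Length fields listed in the record.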


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.335678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.300269
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAAG CCGCCGCAGG TGCCGCGGCA ACGATCGAAC CCAACCGCCC GATGGTCGAG 
CGGCGCAAAC GGCTGTCGCC GGCCGCGCTC GCTGTGGCCA ATGAGAAAGT GCAGCCCGCC
TACTCGCCGA TCGTGATCGC CGGCCTGGTG CGGCTGACGG ATTTCGTGCT GATCGCCGCT
GTCGGGATCG CGCTGTACCT CGGCTATGTC GCCCGTCGCG ACGGGCTTCA ATGGGAATAC
ATCGCTGCGA TTCTCGGCAT GACGATCACC GCCGTGATCA GCTTCCAGGC CGCCGACCTC
TACGAGGTCC AGGTGTTTCG CGGCACGCTG AAGCAGGTGA CCCGAATGAT CTCGGCGTGG
ACGTTGGTGT TTCTGCTGTT CATCGGCGCG TCGTTCTTGG CCAAGCTCGG CGGCGAAGTC
TCGCGACTGT GGTTGTCGTC GTTCTATCTG CTCGGCCTCG CGGTGCTGAT CGCCGAGCGC
CTTGTCCTGC GCAACATGGT CCGGCACTGG GCGCGCCAGG GCCGGCTCGA TCGACGTACC
ATCATCGTCG GCTCCGATGC TAACGGCGAG AAGTTGATCA ACGCGCTGCG AGCGCAGCAG
GACGACGACA CCGACATCCG GGTTCTCGGC GTGTTCGACG ACCGCAACGA TTCCAGGGCG
CTCGCGACCT GCGCCGGCGC GCCGAAGCTC GGCAAGATCG ATGACATTCT CGAATTCGCT
CGGCGCACCC GGGTCGATCT GGTACTGTTC GCGTTGCCGA TTTCGGCCGA AACCCGCATC
CTCGACATGT TGAAAAAGCT GTGGGTGCTG CCGGTCGATA TCCGGCTGTC GGCCCACACC
AACAAGCTGC GCTTTCGCCC CCGCGCCTAT TCCTATCTCG GCAACGTGCC GACGCTCGAC
GTCTTCGAGG CGCCGATCAC CGATTGGGAT CAGGTCACCA AGCGGTTGTT CGATCACGTC
GTCGGCGGGC TGATCCTGCT CGCGGCCGCG CCGGTGATGG CGCTGGTCGC GTTGGCGATC
AAGCTCGACA GCCCAGGCCC TGTGCTGTTT CGCCAGAAGC GGTTCGGCTT CAACAACGAG
CGCATCGACG TCTTCAAGTT TCGTTCGCTG TATCACCACC AGGCCGATCC GACCGCGTCG
AAGGTGGTCA CCAAGAACGA CCCGCGCGTC ACCCGCGTCG GCCGATTCAT CCGCCGCACC
AGCCTCGACG AACTGCCGCA GTTCTTCAAC GTGGTGTTCA AGGGCAATCT GTCGCTGGTC
GGCCCGCGCC CCCATGCGGT GCAGGGAAAG CTGCAGAGCC GGCTGTTCGA CGAAGCGGTC
GACGGTTACT TCGCCCGGCA CCGGGTCAAG CCGGGCATCA CCGGCTGGGC CCAGATCAAC
GGCTGGCGCG GCGAGGTCGA CAGCGAAGAG AAGATTCAGA AGCGCGTCGA GTTCGATCTG
TATTACATCG AGAACTGGTC GGTGCTGTTC GATCTCTTCA TTTTGCTGAA GACGCCGTGG
GCGCTGCTCA AGGGCGAGAA CGCGTATTAG
 
Protein sequence
MIEAAAGAAA TIEPNRPMVE RRKRLSPAAL AVANEKVQPA YSPIVIAGLV RLTDFVLIAA 
VGIALYLGYV ARRDGLQWEY IAAILGMTIT AVISFQAADL YEVQVFRGTL KQVTRMISAW
TLVFLLFIGA SFLAKLGGEV SRLWLSSFYL LGLAVLIAER LVLRNMVRHW ARQGRLDRRT
IIVGSDANGE KLINALRAQQ DDDTDIRVLG VFDDRNDSRA LATCAGAPKL GKIDDILEFA
RRTRVDLVLF ALPISAETRI LDMLKKLWVL PVDIRLSAHT NKLRFRPRAY SYLGNVPTLD
VFEAPITDWD QVTKRLFDHV VGGLILLAAA PVMALVALAI KLDSPGPVLF RQKRFGFNNE
RIDVFKFRSL YHHQADPTAS KVVTKNDPRV TRVGRFIRRT SLDELPQFFN VVFKGNLSLV
GPRPHAVQGK LQSRLFDEAV DGYFARHRVK PGITGWAQIN GWRGEVDSEE KIQKRVEFDL
YYIENWSVLF DLFILLKTPW ALLKGENAY
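
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC percentage calculation and a basic check of the coding sequence. The helper names are hypothetical, and the check assumes an ATG start codon, which is a simplification because translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if seq has whole codons, starts with ATG, and ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First and last rows of the gene sequence, joined only as a short demo input.
# This is not the full gene; paste all rows to analyse the complete sequence.
first_row = "ATGATCGAAG CCGCCGCAGG TGCCGCGGCA ACGATCGAAC CCAACCGCCC GATGGTCGAG"
last_row = "GCGCTGCTCA AGGGCGAGAA CGCGTATTAG"
demo = clean_sequence(first_row + "\n" + last_row)
print(len(demo), looks_like_cds(demo))
print(round(gc_percent(demo), 1))
```

Running `gc_percent` on the complete cleaned gene sequence gives a value that can be compared with the GC content field in the record.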