Gene Rpal_3089 details


Gene Information

Locus tag: Rpal_3089
Symbol:
ID: 6410760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3331756
End bp: 3333312
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 65%
IMG OID: 642712969
Product: Undecaprenyl-phosphate glucose phosphotransferase
Protein accession: YP_001992070
Protein GI: 192291465
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
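
The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes that the start and end positions are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3331756, 3333312)
print(length)                  # 1557
print(protein_length(length))  # 518
```

Both results agree with the gene length and protein length listed for this record.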


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.411544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCCGA TCAGCGCACG TTCGATGATC AGTGCTGCTG CGACGGAAGC CGTGGTCGCC 
AGCGGCGACG GCGCGCCGCG GGTGGAACGC CGCAAGCGGT TGTCGCCGGC CGCCCTCGCC
GTCGCCAATC AGAAGGTGCC GCCGGCGTTT TCGCCGATCG TGATCGCCGG CTCCGTCCGC
CTCGCCGATT TTCTCGTGAT CGCCGCCGTC GGCATCGCGC TGTACTTCGC GCTCGTGGTC
CGCCGCGACG GCTTTGCCTG GGAGTACATC GCGGCGATCA TCGGCACCAC GGCGACTGCG
GTCGTCGCGT TCCAGGCTGC CGATCTTTAC AAGGTGCAAC TGTTCCGCGG CACCTTGAAA
CAGATGACCC GGATCATATC GACGTGGTCG ATCGTGTTCC TGCTGTTCAT CGGCGCATCG
TTCTTCGCCA AGCTCGGCGG CGAGGTGTCG CGGCTGTGGC TGGGTTCGTT CTTTTTCGCC
GGCCTCGCCT TGCTGATCAT CGAGCGATTG TCGGTGCGCG CGCTGGTGCG GCGCTGGGCG
TCGCAAGGCC GGCTCGACCG CCGCACCGTG ATCGTCGGTG CCGACGCTAA TGGCGCCAAG
CTGATCGAAG CGCTGAAGGC CGAGCACGCC GACGCCTCCG ACATCCGTAT CCTCGGCGTG
TTCGACGACC GCAACGACGC CCGCTCGCAG TCCACTTGCG CGGGCGTTCC GAAGCTCGGC
AAGGTCGATG ACATTCCCGA ATTCGCCCGC CGCACCCGTG TCGATCTCGT GCTGTTCGCG
CTGCCGATCT CGGCCGAGAC CCGCATCCTC GACATGCTGA AGAAGCTGTG GGTGCTGCCG
GTTGACATCC GGCTGTCGGC GCACACCAAC AAGCTGCGGT TCCGGCCGCG CGCCTATTCC
TATGTCGGCA AGGTGCCGAC GCTCGACGTG TTCGAAGCGC CGATCACCGA TTGGGATCAG
GTGATCAAGC AGGTATTCGA CCGCGTCGTC GGCGGTTTCA TCCTGCTGCT CGCCGCCCCG
GTAATGGCTT TGGTAGCGCT GGCGATCAAG CTCGACAGCC CGGGTCCTGT GCTGTTCCGG
CAGAAGCGGT TCGGCTTCAA CAACGAGCGC ATCGACGTGC TCAAGTTTCG GTCGATGTAT
CACGACCAGG CCGATCCCAC TGCGTCAAAG GTCGTCACCC GCAACGACCC GCGCGTCACC
CGGGTCGGCC GCTTCATCCG CCGCACCAGC CTCGACGAGC TGCCGCAACT GTTCAACGTG
GTGTTCAAGG GCAATCTGTC GCTGGTCGGC CCGCGCCCGC ATGCGGTGCA GGGCAAGCTG
CAGAGCCAGC TGTTCGACGA AGCCGTCGAC GGCTACTTCG CCCGCCACCG CGTCAAGCCG
GGTATCACCG GCTGGGCCCA GATCAACGGC TGGCGCGGCG AGATCGACAA CGAAGAGAAG
ATCCAGAAGC GCGTCGAGTT CGACCTGTAC TACATCGAGA ACTGGTCGGT CCTGTTTGAC
CTCTACATTC TGCTGAGAAC TCCGTGGGCG CTGCTCAAGG GCGAGAACGC GTACTGA
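
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. The helper names are hypothetical, and the exact method the database used to arrive at its reported GC content is not stated on this page.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence above; paste the full
# block here to compute the value for the whole gene.
first_line = "ATGGAGCCGA TCAGCGCACG TTCGATGATC AGTGCTGCTG CGACGGAAGC CGTGGTCGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq) * 100), "% GC")
```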
 
Protein sequence
MEPISARSMI SAAATEAVVA SGDGAPRVER RKRLSPAALA VANQKVPPAF SPIVIAGSVR 
LADFLVIAAV GIALYFALVV RRDGFAWEYI AAIIGTTATA VVAFQAADLY KVQLFRGTLK
QMTRIISTWS IVFLLFIGAS FFAKLGGEVS RLWLGSFFFA GLALLIIERL SVRALVRRWA
SQGRLDRRTV IVGADANGAK LIEALKAEHA DASDIRILGV FDDRNDARSQ STCAGVPKLG
KVDDIPEFAR RTRVDLVLFA LPISAETRIL DMLKKLWVLP VDIRLSAHTN KLRFRPRAYS
YVGKVPTLDV FEAPITDWDQ VIKQVFDRVV GGFILLLAAP VMALVALAIK LDSPGPVLFR
QKRFGFNNER IDVLKFRSMY HDQADPTASK VVTRNDPRVT RVGRFIRRTS LDELPQLFNV
VFKGNLSLVG PRPHAVQGKL QSQLFDEAVD GYFARHRVKP GITGWAQING WRGEIDNEEK
IQKRVEFDLY YIENWSVLFD LYILLRTPWA LLKGENAY
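
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library. It assumes the sense-codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is illustrative, not the database's own procedure.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()  # the stop codon is not part of the protein
    return "".join(protein)


# First seven codons and last six codons of the gene sequence above.
print(translate("ATGGAGCCGA TCAGCGCACG T"))  # MEPISAR
print(translate("GGCGAGAACG CGTACTGA"))      # GENAY
```

The two fragments match the beginning and the end of the protein sequence shown above.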