Gene Rpal_4579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4579 
Symbol 
ID6412263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4934873 
End bp4936003 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content67% 
IMG OID642714459 
Productglycosyl transferase group 1 
Protein accessionYP_001993548 
Protein GI192292943 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0140485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACGTCA TTCTGGTCAA TGTCTGGCTC GATGCGGAGC GCGGCGGCGG CACGGCTGAG 
CGAACCCGGC GACTCGCGGT GCATCTGTCG CGACTCGGCT GCCGATGTAC AATCGTCACC
ATGGGGCCGA CCCCATGGGG CGACGAGTTC GGCCGGGCTG GCGTCACCGT CATCAGCGTG
CCCTTCATCG GCCATCGCTT CCCCGCGCCG TTGGTCAATC CGTTCGCCCT CTACCGTCTG
TTTCGCGACG CCGACATCGT CCATGTCATG GGCTTCTGGT TTCTCCTGGC GTCGTTCAGC
TCTGCGATCG CCTGCGCCGC CGGCACGCCG CTGCTGCTGT GCCCGGCCGG CTCCCTCACC
CAGTACGGTA GAAGCGCCGC GATCAAGCGC GTCTTTACCG CGCTCGCCGG GCGACCGATG
CTGCGCAGCG CCGCTGCGAT CATTGCGACG ACGCGTCAGG AAGAAGCGCT GCTGGTCTCG
GATTTCGCGA TTCCGGCAGA CTCGATCCTG ATCGCGCCGA ACGGCATCGA GCTTCCCGGA
GAAGGACGAC CGGGAGGTAT GGTGATCCCG GACAAACGAT TCGTCCTGTT CGTTGGCCGG
CTGACCGCGA TCAAGGCGCC GGACCTGCTG CTCGAAGCGT TCGCGCGGAT CGCCCCGGAA
ATAGCGGATG TGAGCCTTGT GATCGCCGGC CCCGATCTCG GGATGCGGCC TCAGCTCGAA
CGCCGGACCG CAGAACTGGG GCTTCAGGCG CGGGTGCATT TTGCGGGCTT CGCCGATGAG
GCGCAGCGGA CGGCGCTGCT GGCCCGGGCG TCGCTGCTCG CGGTTCCGTC GCATTCCGAA
GTGATGTCGA TGGTGGCGCT CGAGGCCGGC GCGATGGGCG TCCCGGTCCT GCTCACCGAC
CGCTGCGGCT TCGACGAGGT CGAACAGATC GGCGGCGGCC GCGTGGTGCC GGTCGACGTT
GGAGCCATTG CGGAAGGTTT GCGTCAAATG TTGTCGGACG ACGACGCACT GCGGCAATCG
GGGCAAGCGC TGCGTGGTTT CGTGCTCGAG CACTACGAAT GGTCGCGGGT GGCGGCGGCG
TTGCTCCGCG ATTTCCGCCG CTTGGCAGCG CAACGTCACG GACCCCGCTG A
 
Protein sequence
MNVILVNVWL DAERGGGTAE RTRRLAVHLS RLGCRCTIVT MGPTPWGDEF GRAGVTVISV 
PFIGHRFPAP LVNPFALYRL FRDADIVHVM GFWFLLASFS SAIACAAGTP LLLCPAGSLT
QYGRSAAIKR VFTALAGRPM LRSAAAIIAT TRQEEALLVS DFAIPADSIL IAPNGIELPG
EGRPGGMVIP DKRFVLFVGR LTAIKAPDLL LEAFARIAPE IADVSLVIAG PDLGMRPQLE
RRTAELGLQA RVHFAGFADE AQRTALLARA SLLAVPSHSE VMSMVALEAG AMGVPVLLTD
RCGFDEVEQI GGGRVVPVDV GAIAEGLRQM LSDDDALRQS GQALRGFVLE HYEWSRVAAA
LLRDFRRLAA QRHGPR