Gene Rru_A3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3094 
Symbol 
ID3836540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3559919 
End bp3561253 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content65% 
IMG OID637827209 
ProductO-antigen polymerase 
Protein accessionYP_428176 
Protein GI83594424 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID[TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGATC TGCTGCTCGC GCTATTGCTG GTTTCCAGCC TGCCGTTGAT CATCGTCCGG 
CCCTATGTGG GCGTGTTCGT CTGGTGCGTC GTGTCGCTGA TGAACCCGCA TCAGATGGCC
TATGGCTTCA TTTCATCGGC GCCGGTCGCC TTGCTGGTCG CCGGATCGAC GCTGATCGGG
CTGCTCGCCT CGCGCGAGCC CAAGCGCCTG CCCGGCGATC CCATCGTCAT CTGTATCCTG
CTTTTGGCCT TCTGGGTGTC GGTGACCACG GTCTTCGCCC TGCGCCCGGA CGTGGCCTTT
CCGTTGTGGG ACCGCACGAT CAAGATGTTC CTGATGGTGC TGGTCGGTCT GGCGCTGATC
CGCAGCCGGC AGCGCCTGCA CGCCCTGGTG TGGATCCTGG TCGTCAGCAT CGGCTTTTAT
GGGGTGCGTG GCGGCTTGTT CACCCTGGTC ACCGGCGGTG GCGGGCGGGT GGTCGGTCCG
CCAACGACGA TGATCGGCGA CAACAACTCC CTGGCCGCCG CCCTGATCAT GACCCTGCCC
CTGATGCGCT ATCTGCATCT GCAATCGGCC AACCGCTGGA TCCGCCTTGG CTTGGCCGGG
GCGATGGGGC TGACGGCGCT GTCGGTCCTT GGTTCGTTTT CGCGCGGCGC CCTGCTGGCG
TCGGTGGCGA TGTTTTTCTG GATGCTGGTG CGCTCGCGCA AGCGGCTGAT GATCCTGGCT
TTGGGGGCGA CCTTCGCCAT TGGCGGGCTG TTCCTGATGC CCGAGAGCTG GCACGACCGG
ATGAACACCA TCAGTGATTA TCAGGAGGAC GAATCGGCCA TGGGCCGCCT CGACGCCTGG
ACCTTCGCCT TCAATCTGGC GGTGGAGCGG CCCTTGGTCG GTGGCGGGTT CCGCATCAAT
GTCGACCGGG ATGTGTTTTT GCGGTTTTCC CCCGAGGCCG GGATCAACCG GGCTTTCCAC
AGCATTTATT TCCAGGTTCT GGGCGAGCAT GGCTTCGTTG GCCTGGGGTT GTTTCTGGCG
ACCCTGGCGC TGGGGTTTTT CAAGGCCGGC GCGCTGTCGC GTCAATGCGC CGGTGATCCG
AAACTGGCCT GGGCCCGCGA TCTTGGCGCG ATGTCCCAGG TCAGTCTGGT TGGCTATGCC
GCCGGCGGTG CCTTTCTCGA TCTGGCCTTC TTCGATCTTT TCTATTTCGT CGCCCTGTTG
CCGGTGATGG CCAGTTGGGT TCTGGCCAAC CCGCCGCCGG TCGTCTGGAA ACCGAAACCG
GTGAAAAGCC AAGTCCCGGG GCGCGGCCGG CCGCGCCGCG CCGCCTTGCC GCCGCCGAGG
GATGCCCTGC CATGA
 
Protein sequence
MRDLLLALLL VSSLPLIIVR PYVGVFVWCV VSLMNPHQMA YGFISSAPVA LLVAGSTLIG 
LLASREPKRL PGDPIVICIL LLAFWVSVTT VFALRPDVAF PLWDRTIKMF LMVLVGLALI
RSRQRLHALV WILVVSIGFY GVRGGLFTLV TGGGGRVVGP PTTMIGDNNS LAAALIMTLP
LMRYLHLQSA NRWIRLGLAG AMGLTALSVL GSFSRGALLA SVAMFFWMLV RSRKRLMILA
LGATFAIGGL FLMPESWHDR MNTISDYQED ESAMGRLDAW TFAFNLAVER PLVGGGFRIN
VDRDVFLRFS PEAGINRAFH SIYFQVLGEH GFVGLGLFLA TLALGFFKAG ALSRQCAGDP
KLAWARDLGA MSQVSLVGYA AGGAFLDLAF FDLFYFVALL PVMASWVLAN PPPVVWKPKP
VKSQVPGRGR PRRAALPPPR DALP